SurfSense/.cursor/skills/geo-content-optimizer/references/ai-citation-patterns.md
DESKTOP-RTLN3BA\$punk 7ea840dbb2
Some checks failed
Build and Push Docker Images / tag_release (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Has been cancelled
Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Has been cancelled
Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Has been cancelled
feat: enhance SurfSense with new skills, blog section, and improve SEO metadata
- Added multiple new skills to skills-lock.json from the repository `aaron-he-zhu/seo-geo-claude-skills`.
- Introduced `fuzzy-search` dependency in package.json for improved search functionality.
- Updated pnpm-lock.yaml to include the new `fuzzy-search` package.
- Enhanced SEO metadata across various pages, including canonical links and descriptions for better search visibility.
- Improved layout and structure of several components, including the homepage and changelog, to enhance user experience.
2026-04-11 23:38:12 -07:00

504 lines
13 KiB
Markdown

# AI Citation Patterns
How different AI systems select and cite content. Understanding these patterns helps optimize content for AI visibility.
## Google AI Overviews
### Citation Behavior
**Format preferences**:
- Prefers structured, factual content
- Cites multiple sources per overview
- Shows source links as footnotes
- Displays "Sources" section at bottom
**What gets cited**:
- Clear, direct answers to queries
- Statistics with recent dates
- Step-by-step instructions
- Comparison tables
- Definition blocks
- List-formatted content
**Content structure preferences**:
- Short paragraphs (2-3 sentences)
- Bullet points and numbered lists
- Clear headings matching query intent
- Tables for comparison data
- FAQ formats
**Authority signals**:
- Domain authority (trusted sites favored)
- E-E-A-T signals (expertise, authoritativeness, trustworthiness)
- Recent publication/update dates
- Author credentials visible
- Citations to other authoritative sources
**Citation frequency**: Typically cites 3-8 sources per AI Overview
---
## ChatGPT (with Browsing)
### Citation Behavior
**Format preferences**:
- Inline citations with numbers [1], [2]
- "Sources" list at end of response
- Clickable source links
- Sometimes quotes directly with quotation marks
**What gets cited**:
- Specific facts and statistics
- Expert quotes
- Technical explanations
- Recent information (prioritizes freshness)
- Authoritative domain content
- Well-structured, scannable content
**Source selection patterns**:
- Favors .edu, .gov, .org domains
- Prioritizes recognized brands/publishers
- Values comprehensive content over thin pages
- Prefers content with clear attribution
- Looks for consensus across multiple sources
**Quoting behavior**:
- Pulls exact quotes when information is distinctive
- Paraphrases general information
- Combines information from multiple sources
- Attributes specific claims to sources
**Citation frequency**: 1-6 sources per response depending on complexity
---
## Perplexity AI
### Citation Behavior
**Format preferences**:
- Superscript numbers [1] inline
- Numbered source list with snippets
- Shows brief excerpt from each source
- Displays domain name and publish date
**What gets cited**:
- Recent content (strong freshness bias)
- Authoritative sources
- Content with clear, quotable statements
- Statistical data with sources
- Primary sources over secondary
- Content matching query intent precisely
**Content structure preferences**:
- Extremely well-structured content
- Clear topic sentences
- Quotable, standalone statements
- Factual density (stats, data, specifics)
- Headings that match question formats
**Authority signals**:
- Domain credibility
- Author expertise
- Publication reputation
- Recency of content
- Depth of coverage
**Citation frequency**: Typically 5-10 sources per response (more than others)
**Unique behavior**: Often shows "Follow-up Questions" that can reveal additional citation opportunities
---
## Claude (Knowledge-Based Responses)
### Citation Behavior
**Note**: Claude typically relies on training data rather than live web access, but understanding preferences helps create citeable content.
**Format preferences**:
- When citing, uses clear attribution phrases
- "According to [source]..."
- "Research from [source] shows..."
- May reference general knowledge without specific citations
**What gets remembered/prioritized**:
- Clear, authoritative definitions
- Widely-accepted facts and statistics
- Well-established methodologies
- Consensus information
- Content from recognized authorities
**Content characteristics valued**:
- Factual accuracy and precision
- Logical structure and clarity
- Comprehensive explanations
- Technical accuracy
- Unambiguous language
---
## Common Traits Across All AI Systems
### Universal Citation Factors
**Content quality**:
- Factual accuracy (incorrect info won't be cited)
- Clear, unambiguous language
- Proper grammar and spelling
- Comprehensive coverage
- Up-to-date information
**Structure**:
- Scannable format (headings, lists, tables)
- Logical organization
- Clear topic segmentation
- Short paragraphs
- Visual hierarchy
**Authority**:
- Domain credibility
- Author credentials
- Source citations in content
- Expertise signals
- Editorial quality
**Relevance**:
- Precise match to query intent
- Topic focus (not meandering)
- Keyword-topic alignment
- Depth of coverage on specific topic
---
## Optimal Content Structures for Citation
### 1. Definition Blocks
AI systems love clear, quotable definitions.
**Structure**:
```markdown
**[Term]** is [clear category] that [primary function], [key characteristic].
```
**Example**:
> **Search Engine Optimization (SEO)** is a digital marketing practice that improves website visibility in organic search results by optimizing content, technical elements, and authority signals.
**Why it works**: Standalone, complete, unambiguous, proper scope.
---
### 2. Statistic Blocks
Facts with sources are highly citeable.
**Structure**:
```markdown
According to [Source], [specific statistic] as of [timeframe].
```
**Example**:
> According to HubSpot's 2024 State of Marketing Report, 82% of marketers actively invest in content marketing, making it the most widely adopted digital marketing strategy.
**Why it works**: Specific, attributed, recent, verifiable.
---
### 3. Q&A Pairs
Question-answer formats match AI query patterns.
**Structure**:
```markdown
### [Question matching common query]?
[Direct answer in 40-60 words]
[Optional supporting detail]
```
**Example**:
> ### How long does SEO take to show results?
>
> SEO typically takes 3-6 months to show significant results for new websites, though this varies based on competition, domain authority, and strategy. Established sites may see improvements in 1-3 months for less competitive keywords.
**Why it works**: Matches query format, provides concise answer, includes qualifiers.
---
### 4. Comparison Tables
Structured comparisons are easy for AI to parse and cite.
**Structure**:
```markdown
| Feature | Option A | Option B |
|---------|----------|----------|
| [Factor 1] | [Specific value] | [Specific value] |
| [Factor 2] | [Specific value] | [Specific value] |
| **Best for** | [Use case] | [Use case] |
```
**Example**:
| Factor | Technical SEO | On-Page SEO |
|--------|---------------|-------------|
| Focus | Site infrastructure | Content optimization |
| Timeframe | 1-3 months | Ongoing |
| Complexity | High | Medium |
| **Best for** | Site-wide issues | Individual page improvements |
**Why it works**: Clear comparison, specific values, scannable format.
---
### 5. Step-by-Step Processes
Numbered lists for "how to" queries.
**Structure**:
```markdown
1. **[Action]** - [Brief explanation]
2. **[Action]** - [Brief explanation]
3. **[Action]** - [Brief explanation]
```
**Example**:
> To conduct keyword research:
> 1. **Identify seed keywords** - List 5-10 topics your audience searches for
> 2. **Use keyword research tools** - Expand seed keywords into hundreds of variations
> 3. **Analyze search intent** - Determine what content format each keyword requires
> 4. **Evaluate competition** - Assess ranking difficulty for each keyword
> 5. **Prioritize keywords** - Choose based on volume, difficulty, and relevance
**Why it works**: Clear process, actionable steps, logical sequence.
---
### 6. List-Based Content
Curated lists with brief explanations.
**Structure**:
```markdown
**[Item name]**: [Clear description with key benefit]
```
**Example**:
> Top on-page SEO factors:
> - **Title tags**: Most important on-page element; include primary keyword within first 60 characters
> - **Header tags**: Structure content hierarchically; use one H1, multiple H2s for main sections
> - **Meta descriptions**: Don't directly impact rankings but affect CTR; keep under 160 characters
> - **URL structure**: Use descriptive, keyword-rich URLs without unnecessary parameters
**Why it works**: Scannable, specific, actionable.
---
### 7. Before/After Examples
Concrete examples showing transformation.
**Structure**:
```markdown
**Before**: [Weak example]
**After**: [Strong example]
**Why it's better**: [Explanation]
```
**Example**:
> **Before**: "Email marketing is pretty effective."
> **After**: "Email marketing delivers an average ROI of $42 for every $1 spent, according to the Data & Marketing Association."
> **Why it's better**: Specific statistic, attributed source, quantifiable claim.
**Why it works**: Shows concrete improvement, demonstrates principle.
---
### 8. Key Insight Callouts
Highlighted important points.
**Structure**:
```markdown
> **Key insight**: [Memorable, quotable statement]
```
**Example**:
> **Key insight**: According to Google's John Mueller, internal linking is one of the most underutilized SEO tactics, with properly structured internal links often delivering faster ranking improvements than external link building.
**Why it works**: Visually distinct, authoritative, quotable.
---
## Content Optimization by Query Type
### Informational Queries ("What is...", "How does...", "Why...")
**AI citation priorities**:
1. Clear definitions
2. Comprehensive explanations
3. Expert perspectives
4. Supporting statistics
5. Real-world examples
**Optimal structure**:
- Definition in first paragraph
- "Why it matters" section
- How it works explanation
- Common use cases
- Expert quotes or citations
---
### Comparison Queries ("[A] vs [B]", "Best [category]")
**AI citation priorities**:
1. Comparison tables
2. Clear pros/cons lists
3. Use case recommendations
4. Specific differentiators
5. Verdict or recommendation
**Optimal structure**:
- Quick comparison table upfront
- Individual descriptions
- Feature-by-feature comparison
- "Choose X if..." recommendations
- Summary verdict
---
### How-To Queries ("How to...", "Steps to...")
**AI citation priorities**:
1. Numbered step-by-step processes
2. Required tools/prerequisites
3. Time estimates
4. Success indicators
5. Troubleshooting tips
**Optimal structure**:
- Prerequisites listed first
- Clear numbered steps
- Sub-steps where needed
- Visual indicators of progress
- Common problems and solutions
---
### Statistical Queries ("How much...", "How many...", "Statistics about...")
**AI citation priorities**:
1. Specific numbers with sources
2. Recent data (within 1-2 years)
3. Multiple data points
4. Context for statistics
5. Trend information
**Optimal structure**:
- Lead with key statistic
- Source attribution immediately after
- Context and interpretation
- Related statistics
- Takeaways from data
---
## Citation Likelihood Factors
### High Citation Likelihood
- [ ] Content from recognized authority domains
- [ ] Published or updated within 12 months
- [ ] Clear, standalone statements
- [ ] Proper source attribution
- [ ] Specific statistics with dates
- [ ] Structured with headings/lists/tables
- [ ] Comprehensive topic coverage
- [ ] Author credentials visible
- [ ] Technical accuracy verified
- [ ] Consensus with other sources
### Medium Citation Likelihood
- [ ] Content from less-known but quality domains
- [ ] Published 1-2 years ago
- [ ] Clear but requires slight context
- [ ] General industry claims
- [ ] Good structure but less scannable
- [ ] Moderate depth of coverage
- [ ] No author listed but quality content
- [ ] Some supporting evidence
### Low Citation Likelihood
- [ ] Content from unknown/low-authority domains
- [ ] Published 3+ years ago without updates
- [ ] Vague or ambiguous statements
- [ ] No sources cited
- [ ] Poor content structure (walls of text)
- [ ] Thin or superficial coverage
- [ ] Promotional or biased tone
- [ ] Factual inconsistencies
- [ ] No expertise signals
---
## AI System Comparison Summary
| Factor | Google AI Overviews | ChatGPT | Perplexity | Claude |
|--------|---------------------|---------|------------|--------|
| **Freshness bias** | High | Medium | Very high | N/A (training data) |
| **Authority weight** | Very high | High | High | High |
| **Structure importance** | High | Medium | Very high | Medium |
| **Citation count** | 3-8 | 1-6 | 5-10 | N/A |
| **Quotable focus** | High | Medium | Very high | High |
| **Domain trust** | Very high | High | Medium | High |
| **Factual density** | High | High | Very high | Very high |
---
## Tracking AI Citations
### Manual Monitoring
**Check if your content appears in**:
- Google AI Overviews for target keywords
- ChatGPT responses (search your domain in ChatGPT)
- Perplexity results for relevant queries
- Other AI search engines
**Test queries**:
- Exact-match questions from your FAQ
- Definitions of terms you've defined
- Statistics you've cited with attribution
- Processes you've documented
### Indicators of AI Visibility
- Increased direct traffic (AI users clicking sources)
- Traffic spikes from unusual referrers
- Engagement metrics: low bounce rate, high time-on-page
- Return visitors (AI users coming back for more depth)
---
## Optimization Checklist for AI Citations
Content ready for AI citation should have:
- [ ] At least 3 clear, quotable definitions
- [ ] 5+ specific statistics with sources and dates
- [ ] Q&A format sections covering top queries
- [ ] Comparison tables where relevant
- [ ] Numbered lists for processes
- [ ] Content published or updated within 12 months
- [ ] Author credentials visible
- [ ] External citations to authoritative sources
- [ ] Structured with clear H2/H3 headings
- [ ] Short paragraphs (2-4 sentences)
- [ ] No promotional language
- [ ] Technical accuracy verified
- [ ] Mobile-friendly formatting