SurfSense/.cursor/skills/geo-content-optimizer/references/ai-citation-patterns.md
DESKTOP-RTLN3BA\$punk 7ea840dbb2
Some checks failed
Build and Push Docker Images / tag_release (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Has been cancelled
Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Has been cancelled
Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Has been cancelled
feat: enhance SurfSense with new skills, blog section, and improve SEO metadata
- Added multiple new skills to skills-lock.json from the repository `aaron-he-zhu/seo-geo-claude-skills`.
- Introduced `fuzzy-search` dependency in package.json for improved search functionality.
- Updated pnpm-lock.yaml to include the new `fuzzy-search` package.
- Enhanced SEO metadata across various pages, including canonical links and descriptions for better search visibility.
- Improved layout and structure of several components, including the homepage and changelog, to enhance user experience.
2026-04-11 23:38:12 -07:00

13 KiB

AI Citation Patterns

How different AI systems select and cite content. Understanding these patterns helps optimize content for AI visibility.

Google AI Overviews

Citation Behavior

Format preferences:

  • Prefers structured, factual content
  • Cites multiple sources per overview
  • Shows source links as footnotes
  • Displays "Sources" section at bottom

What gets cited:

  • Clear, direct answers to queries
  • Statistics with recent dates
  • Step-by-step instructions
  • Comparison tables
  • Definition blocks
  • List-formatted content

Content structure preferences:

  • Short paragraphs (2-3 sentences)
  • Bullet points and numbered lists
  • Clear headings matching query intent
  • Tables for comparison data
  • FAQ formats

Authority signals:

  • Domain authority (trusted sites favored)
  • E-E-A-T signals (expertise, authoritativeness, trustworthiness)
  • Recent publication/update dates
  • Author credentials visible
  • Citations to other authoritative sources

Citation frequency: Typically cites 3-8 sources per AI Overview


ChatGPT (with Browsing)

Citation Behavior

Format preferences:

  • Inline citations with numbers [1], [2]
  • "Sources" list at end of response
  • Clickable source links
  • Sometimes quotes directly with quotation marks

What gets cited:

  • Specific facts and statistics
  • Expert quotes
  • Technical explanations
  • Recent information (prioritizes freshness)
  • Authoritative domain content
  • Well-structured, scannable content

Source selection patterns:

  • Favors .edu, .gov, .org domains
  • Prioritizes recognized brands/publishers
  • Values comprehensive content over thin pages
  • Prefers content with clear attribution
  • Looks for consensus across multiple sources

Quoting behavior:

  • Pulls exact quotes when information is distinctive
  • Paraphrases general information
  • Combines information from multiple sources
  • Attributes specific claims to sources

Citation frequency: 1-6 sources per response depending on complexity


Perplexity AI

Citation Behavior

Format preferences:

  • Superscript numbers [1] inline
  • Numbered source list with snippets
  • Shows brief excerpt from each source
  • Displays domain name and publish date

What gets cited:

  • Recent content (strong freshness bias)
  • Authoritative sources
  • Content with clear, quotable statements
  • Statistical data with sources
  • Primary sources over secondary
  • Content matching query intent precisely

Content structure preferences:

  • Extremely well-structured content
  • Clear topic sentences
  • Quotable, standalone statements
  • Factual density (stats, data, specifics)
  • Headings that match question formats

Authority signals:

  • Domain credibility
  • Author expertise
  • Publication reputation
  • Recency of content
  • Depth of coverage

Citation frequency: Typically 5-10 sources per response (more than others)

Unique behavior: Often shows "Follow-up Questions" that can reveal additional citation opportunities


Claude (Knowledge-Based Responses)

Citation Behavior

Note: Claude typically relies on training data rather than live web access, but understanding preferences helps create citeable content.

Format preferences:

  • When citing, uses clear attribution phrases
  • "According to [source]..."
  • "Research from [source] shows..."
  • May reference general knowledge without specific citations

What gets remembered/prioritized:

  • Clear, authoritative definitions
  • Widely-accepted facts and statistics
  • Well-established methodologies
  • Consensus information
  • Content from recognized authorities

Content characteristics valued:

  • Factual accuracy and precision
  • Logical structure and clarity
  • Comprehensive explanations
  • Technical accuracy
  • Unambiguous language

Common Traits Across All AI Systems

Universal Citation Factors

Content quality:

  • Factual accuracy (incorrect info won't be cited)
  • Clear, unambiguous language
  • Proper grammar and spelling
  • Comprehensive coverage
  • Up-to-date information

Structure:

  • Scannable format (headings, lists, tables)
  • Logical organization
  • Clear topic segmentation
  • Short paragraphs
  • Visual hierarchy

Authority:

  • Domain credibility
  • Author credentials
  • Source citations in content
  • Expertise signals
  • Editorial quality

Relevance:

  • Precise match to query intent
  • Topic focus (not meandering)
  • Keyword-topic alignment
  • Depth of coverage on specific topic

Optimal Content Structures for Citation

1. Definition Blocks

AI systems love clear, quotable definitions.

Structure:

**[Term]** is [clear category] that [primary function], [key characteristic].

Example:

Search Engine Optimization (SEO) is a digital marketing practice that improves website visibility in organic search results by optimizing content, technical elements, and authority signals.

Why it works: Standalone, complete, unambiguous, proper scope.


2. Statistic Blocks

Facts with sources are highly citeable.

Structure:

According to [Source], [specific statistic] as of [timeframe].

Example:

According to HubSpot's 2024 State of Marketing Report, 82% of marketers actively invest in content marketing, making it the most widely adopted digital marketing strategy.

Why it works: Specific, attributed, recent, verifiable.


3. Q&A Pairs

Question-answer formats match AI query patterns.

Structure:

### [Question matching common query]?

[Direct answer in 40-60 words]

[Optional supporting detail]

Example:

How long does SEO take to show results?

SEO typically takes 3-6 months to show significant results for new websites, though this varies based on competition, domain authority, and strategy. Established sites may see improvements in 1-3 months for less competitive keywords.

Why it works: Matches query format, provides concise answer, includes qualifiers.


4. Comparison Tables

Structured comparisons are easy for AI to parse and cite.

Structure:

| Feature | Option A | Option B |
|---------|----------|----------|
| [Factor 1] | [Specific value] | [Specific value] |
| [Factor 2] | [Specific value] | [Specific value] |
| **Best for** | [Use case] | [Use case] |

Example:

Factor Technical SEO On-Page SEO
Focus Site infrastructure Content optimization
Timeframe 1-3 months Ongoing
Complexity High Medium
Best for Site-wide issues Individual page improvements

Why it works: Clear comparison, specific values, scannable format.


5. Step-by-Step Processes

Numbered lists for "how to" queries.

Structure:

1. **[Action]** - [Brief explanation]
2. **[Action]** - [Brief explanation]
3. **[Action]** - [Brief explanation]

Example:

To conduct keyword research:

  1. Identify seed keywords - List 5-10 topics your audience searches for
  2. Use keyword research tools - Expand seed keywords into hundreds of variations
  3. Analyze search intent - Determine what content format each keyword requires
  4. Evaluate competition - Assess ranking difficulty for each keyword
  5. Prioritize keywords - Choose based on volume, difficulty, and relevance

Why it works: Clear process, actionable steps, logical sequence.


6. List-Based Content

Curated lists with brief explanations.

Structure:

**[Item name]**: [Clear description with key benefit]

Example:

Top on-page SEO factors:

  • Title tags: Most important on-page element; include primary keyword within first 60 characters
  • Header tags: Structure content hierarchically; use one H1, multiple H2s for main sections
  • Meta descriptions: Don't directly impact rankings but affect CTR; keep under 160 characters
  • URL structure: Use descriptive, keyword-rich URLs without unnecessary parameters

Why it works: Scannable, specific, actionable.


7. Before/After Examples

Concrete examples showing transformation.

Structure:

**Before**: [Weak example]
**After**: [Strong example]
**Why it's better**: [Explanation]

Example:

Before: "Email marketing is pretty effective." After: "Email marketing delivers an average ROI of $42 for every $1 spent, according to the Data & Marketing Association." Why it's better: Specific statistic, attributed source, quantifiable claim.

Why it works: Shows concrete improvement, demonstrates principle.


8. Key Insight Callouts

Highlighted important points.

Structure:

> **Key insight**: [Memorable, quotable statement]

Example:

Key insight: According to Google's John Mueller, internal linking is one of the most underutilized SEO tactics, with properly structured internal links often delivering faster ranking improvements than external link building.

Why it works: Visually distinct, authoritative, quotable.


Content Optimization by Query Type

Informational Queries ("What is...", "How does...", "Why...")

AI citation priorities:

  1. Clear definitions
  2. Comprehensive explanations
  3. Expert perspectives
  4. Supporting statistics
  5. Real-world examples

Optimal structure:

  • Definition in first paragraph
  • "Why it matters" section
  • How it works explanation
  • Common use cases
  • Expert quotes or citations

Comparison Queries ("[A] vs [B]", "Best [category]")

AI citation priorities:

  1. Comparison tables
  2. Clear pros/cons lists
  3. Use case recommendations
  4. Specific differentiators
  5. Verdict or recommendation

Optimal structure:

  • Quick comparison table upfront
  • Individual descriptions
  • Feature-by-feature comparison
  • "Choose X if..." recommendations
  • Summary verdict

How-To Queries ("How to...", "Steps to...")

AI citation priorities:

  1. Numbered step-by-step processes
  2. Required tools/prerequisites
  3. Time estimates
  4. Success indicators
  5. Troubleshooting tips

Optimal structure:

  • Prerequisites listed first
  • Clear numbered steps
  • Sub-steps where needed
  • Visual indicators of progress
  • Common problems and solutions

Statistical Queries ("How much...", "How many...", "Statistics about...")

AI citation priorities:

  1. Specific numbers with sources
  2. Recent data (within 1-2 years)
  3. Multiple data points
  4. Context for statistics
  5. Trend information

Optimal structure:

  • Lead with key statistic
  • Source attribution immediately after
  • Context and interpretation
  • Related statistics
  • Takeaways from data

Citation Likelihood Factors

High Citation Likelihood

  • Content from recognized authority domains
  • Published or updated within 12 months
  • Clear, standalone statements
  • Proper source attribution
  • Specific statistics with dates
  • Structured with headings/lists/tables
  • Comprehensive topic coverage
  • Author credentials visible
  • Technical accuracy verified
  • Consensus with other sources

Medium Citation Likelihood

  • Content from less-known but quality domains
  • Published 1-2 years ago
  • Clear but requires slight context
  • General industry claims
  • Good structure but less scannable
  • Moderate depth of coverage
  • No author listed but quality content
  • Some supporting evidence

Low Citation Likelihood

  • Content from unknown/low-authority domains
  • Published 3+ years ago without updates
  • Vague or ambiguous statements
  • No sources cited
  • Poor content structure (walls of text)
  • Thin or superficial coverage
  • Promotional or biased tone
  • Factual inconsistencies
  • No expertise signals

AI System Comparison Summary

Factor Google AI Overviews ChatGPT Perplexity Claude
Freshness bias High Medium Very high N/A (training data)
Authority weight Very high High High High
Structure importance High Medium Very high Medium
Citation count 3-8 1-6 5-10 N/A
Quotable focus High Medium Very high High
Domain trust Very high High Medium High
Factual density High High Very high Very high

Tracking AI Citations

Manual Monitoring

Check if your content appears in:

  • Google AI Overviews for target keywords
  • ChatGPT responses (search your domain in ChatGPT)
  • Perplexity results for relevant queries
  • Other AI search engines

Test queries:

  • Exact-match questions from your FAQ
  • Definitions of terms you've defined
  • Statistics you've cited with attribution
  • Processes you've documented

Indicators of AI Visibility

  • Increased direct traffic (AI users clicking sources)
  • Traffic spikes from unusual referrers
  • Engagement metrics: low bounce rate, high time-on-page
  • Return visitors (AI users coming back for more depth)

Optimization Checklist for AI Citations

Content ready for AI citation should have:

  • At least 3 clear, quotable definitions
  • 5+ specific statistics with sources and dates
  • Q&A format sections covering top queries
  • Comparison tables where relevant
  • Numbered lists for processes
  • Content published or updated within 12 months
  • Author credentials visible
  • External citations to authoritative sources
  • Structured with clear H2/H3 headings
  • Short paragraphs (2-4 sentences)
  • No promotional language
  • Technical accuracy verified
  • Mobile-friendly formatting