mirror of
https://github.com/MODSetter/SurfSense.git
synced 2026-05-08 15:22:39 +02:00
Some checks failed
Build and Push Docker Images / tag_release (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-24.04-arm, linux/arm64, arm64) (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_backend, ./surfsense_backend/Dockerfile, backend, surfsense-backend, ubuntu-latest, linux/amd64, amd64) (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-24.04-arm, linux/arm64, arm64) (push) Has been cancelled
Build and Push Docker Images / build (./surfsense_web, ./surfsense_web/Dockerfile, web, surfsense-web, ubuntu-latest, linux/amd64, amd64) (push) Has been cancelled
Build and Push Docker Images / create_manifest (backend, surfsense-backend) (push) Has been cancelled
Build and Push Docker Images / create_manifest (web, surfsense-web) (push) Has been cancelled
- Added multiple new skills to skills-lock.json from the repository `aaron-he-zhu/seo-geo-claude-skills`. - Introduced `fuzzy-search` dependency in package.json for improved search functionality. - Updated pnpm-lock.yaml to include the new `fuzzy-search` package. - Enhanced SEO metadata across various pages, including canonical links and descriptions for better search visibility. - Improved layout and structure of several components, including the homepage and changelog, to enhance user experience.
504 lines
13 KiB
Markdown
504 lines
13 KiB
Markdown
# AI Citation Patterns
|
|
|
|
How different AI systems select and cite content. Understanding these patterns helps optimize content for AI visibility.
|
|
|
|
## Google AI Overviews
|
|
|
|
### Citation Behavior
|
|
|
|
**Format preferences**:
|
|
- Prefers structured, factual content
|
|
- Cites multiple sources per overview
|
|
- Shows source links as footnotes
|
|
- Displays "Sources" section at bottom
|
|
|
|
**What gets cited**:
|
|
- Clear, direct answers to queries
|
|
- Statistics with recent dates
|
|
- Step-by-step instructions
|
|
- Comparison tables
|
|
- Definition blocks
|
|
- List-formatted content
|
|
|
|
**Content structure preferences**:
|
|
- Short paragraphs (2-3 sentences)
|
|
- Bullet points and numbered lists
|
|
- Clear headings matching query intent
|
|
- Tables for comparison data
|
|
- FAQ formats
|
|
|
|
**Authority signals**:
|
|
- Domain authority (trusted sites favored)
|
|
- E-E-A-T signals (expertise, authoritativeness, trustworthiness)
|
|
- Recent publication/update dates
|
|
- Author credentials visible
|
|
- Citations to other authoritative sources
|
|
|
|
**Citation frequency**: Typically cites 3-8 sources per AI Overview
|
|
|
|
---
|
|
|
|
## ChatGPT (with Browsing)
|
|
|
|
### Citation Behavior
|
|
|
|
**Format preferences**:
|
|
- Inline citations with numbers [1], [2]
|
|
- "Sources" list at end of response
|
|
- Clickable source links
|
|
- Sometimes quotes directly with quotation marks
|
|
|
|
**What gets cited**:
|
|
- Specific facts and statistics
|
|
- Expert quotes
|
|
- Technical explanations
|
|
- Recent information (prioritizes freshness)
|
|
- Authoritative domain content
|
|
- Well-structured, scannable content
|
|
|
|
**Source selection patterns**:
|
|
- Favors .edu, .gov, .org domains
|
|
- Prioritizes recognized brands/publishers
|
|
- Values comprehensive content over thin pages
|
|
- Prefers content with clear attribution
|
|
- Looks for consensus across multiple sources
|
|
|
|
**Quoting behavior**:
|
|
- Pulls exact quotes when information is distinctive
|
|
- Paraphrases general information
|
|
- Combines information from multiple sources
|
|
- Attributes specific claims to sources
|
|
|
|
**Citation frequency**: 1-6 sources per response depending on complexity
|
|
|
|
---
|
|
|
|
## Perplexity AI
|
|
|
|
### Citation Behavior
|
|
|
|
**Format preferences**:
|
|
- Superscript numbers [1] inline
|
|
- Numbered source list with snippets
|
|
- Shows brief excerpt from each source
|
|
- Displays domain name and publish date
|
|
|
|
**What gets cited**:
|
|
- Recent content (strong freshness bias)
|
|
- Authoritative sources
|
|
- Content with clear, quotable statements
|
|
- Statistical data with sources
|
|
- Primary sources over secondary
|
|
- Content matching query intent precisely
|
|
|
|
**Content structure preferences**:
|
|
- Extremely well-structured content
|
|
- Clear topic sentences
|
|
- Quotable, standalone statements
|
|
- Factual density (stats, data, specifics)
|
|
- Headings that match question formats
|
|
|
|
**Authority signals**:
|
|
- Domain credibility
|
|
- Author expertise
|
|
- Publication reputation
|
|
- Recency of content
|
|
- Depth of coverage
|
|
|
|
**Citation frequency**: Typically 5-10 sources per response (more than others)
|
|
|
|
**Unique behavior**: Often shows "Follow-up Questions" that can reveal additional citation opportunities
|
|
|
|
---
|
|
|
|
## Claude (Knowledge-Based Responses)
|
|
|
|
### Citation Behavior
|
|
|
|
**Note**: Claude typically relies on training data rather than live web access, but understanding preferences helps create citeable content.
|
|
|
|
**Format preferences**:
|
|
- When citing, uses clear attribution phrases
|
|
- "According to [source]..."
|
|
- "Research from [source] shows..."
|
|
- May reference general knowledge without specific citations
|
|
|
|
**What gets remembered/prioritized**:
|
|
- Clear, authoritative definitions
|
|
- Widely-accepted facts and statistics
|
|
- Well-established methodologies
|
|
- Consensus information
|
|
- Content from recognized authorities
|
|
|
|
**Content characteristics valued**:
|
|
- Factual accuracy and precision
|
|
- Logical structure and clarity
|
|
- Comprehensive explanations
|
|
- Technical accuracy
|
|
- Unambiguous language
|
|
|
|
---
|
|
|
|
## Common Traits Across All AI Systems
|
|
|
|
### Universal Citation Factors
|
|
|
|
**Content quality**:
|
|
- Factual accuracy (incorrect info won't be cited)
|
|
- Clear, unambiguous language
|
|
- Proper grammar and spelling
|
|
- Comprehensive coverage
|
|
- Up-to-date information
|
|
|
|
**Structure**:
|
|
- Scannable format (headings, lists, tables)
|
|
- Logical organization
|
|
- Clear topic segmentation
|
|
- Short paragraphs
|
|
- Visual hierarchy
|
|
|
|
**Authority**:
|
|
- Domain credibility
|
|
- Author credentials
|
|
- Source citations in content
|
|
- Expertise signals
|
|
- Editorial quality
|
|
|
|
**Relevance**:
|
|
- Precise match to query intent
|
|
- Topic focus (not meandering)
|
|
- Keyword-topic alignment
|
|
- Depth of coverage on specific topic
|
|
|
|
---
|
|
|
|
## Optimal Content Structures for Citation
|
|
|
|
### 1. Definition Blocks
|
|
|
|
AI systems love clear, quotable definitions.
|
|
|
|
**Structure**:
|
|
```markdown
|
|
**[Term]** is [clear category] that [primary function], [key characteristic].
|
|
```
|
|
|
|
**Example**:
|
|
> **Search Engine Optimization (SEO)** is a digital marketing practice that improves website visibility in organic search results by optimizing content, technical elements, and authority signals.
|
|
|
|
**Why it works**: Standalone, complete, unambiguous, proper scope.
|
|
|
|
---
|
|
|
|
### 2. Statistic Blocks
|
|
|
|
Facts with sources are highly citeable.
|
|
|
|
**Structure**:
|
|
```markdown
|
|
According to [Source], [specific statistic] as of [timeframe].
|
|
```
|
|
|
|
**Example**:
|
|
> According to HubSpot's 2024 State of Marketing Report, 82% of marketers actively invest in content marketing, making it the most widely adopted digital marketing strategy.
|
|
|
|
**Why it works**: Specific, attributed, recent, verifiable.
|
|
|
|
---
|
|
|
|
### 3. Q&A Pairs
|
|
|
|
Question-answer formats match AI query patterns.
|
|
|
|
**Structure**:
|
|
```markdown
|
|
### [Question matching common query]?
|
|
|
|
[Direct answer in 40-60 words]
|
|
|
|
[Optional supporting detail]
|
|
```
|
|
|
|
**Example**:
|
|
> ### How long does SEO take to show results?
|
|
>
|
|
> SEO typically takes 3-6 months to show significant results for new websites, though this varies based on competition, domain authority, and strategy. Established sites may see improvements in 1-3 months for less competitive keywords.
|
|
|
|
**Why it works**: Matches query format, provides concise answer, includes qualifiers.
|
|
|
|
---
|
|
|
|
### 4. Comparison Tables
|
|
|
|
Structured comparisons are easy for AI to parse and cite.
|
|
|
|
**Structure**:
|
|
```markdown
|
|
| Feature | Option A | Option B |
|
|
|---------|----------|----------|
|
|
| [Factor 1] | [Specific value] | [Specific value] |
|
|
| [Factor 2] | [Specific value] | [Specific value] |
|
|
| **Best for** | [Use case] | [Use case] |
|
|
```
|
|
|
|
**Example**:
|
|
| Factor | Technical SEO | On-Page SEO |
|
|
|--------|---------------|-------------|
|
|
| Focus | Site infrastructure | Content optimization |
|
|
| Timeframe | 1-3 months | Ongoing |
|
|
| Complexity | High | Medium |
|
|
| **Best for** | Site-wide issues | Individual page improvements |
|
|
|
|
**Why it works**: Clear comparison, specific values, scannable format.
|
|
|
|
---
|
|
|
|
### 5. Step-by-Step Processes
|
|
|
|
Numbered lists for "how to" queries.
|
|
|
|
**Structure**:
|
|
```markdown
|
|
1. **[Action]** - [Brief explanation]
|
|
2. **[Action]** - [Brief explanation]
|
|
3. **[Action]** - [Brief explanation]
|
|
```
|
|
|
|
**Example**:
|
|
> To conduct keyword research:
|
|
> 1. **Identify seed keywords** - List 5-10 topics your audience searches for
|
|
> 2. **Use keyword research tools** - Expand seed keywords into hundreds of variations
|
|
> 3. **Analyze search intent** - Determine what content format each keyword requires
|
|
> 4. **Evaluate competition** - Assess ranking difficulty for each keyword
|
|
> 5. **Prioritize keywords** - Choose based on volume, difficulty, and relevance
|
|
|
|
**Why it works**: Clear process, actionable steps, logical sequence.
|
|
|
|
---
|
|
|
|
### 6. List-Based Content
|
|
|
|
Curated lists with brief explanations.
|
|
|
|
**Structure**:
|
|
```markdown
|
|
**[Item name]**: [Clear description with key benefit]
|
|
```
|
|
|
|
**Example**:
|
|
> Top on-page SEO factors:
|
|
> - **Title tags**: Most important on-page element; include primary keyword within first 60 characters
|
|
> - **Header tags**: Structure content hierarchically; use one H1, multiple H2s for main sections
|
|
> - **Meta descriptions**: Don't directly impact rankings but affect CTR; keep under 160 characters
|
|
> - **URL structure**: Use descriptive, keyword-rich URLs without unnecessary parameters
|
|
|
|
**Why it works**: Scannable, specific, actionable.
|
|
|
|
---
|
|
|
|
### 7. Before/After Examples
|
|
|
|
Concrete examples showing transformation.
|
|
|
|
**Structure**:
|
|
```markdown
|
|
**Before**: [Weak example]
|
|
**After**: [Strong example]
|
|
**Why it's better**: [Explanation]
|
|
```
|
|
|
|
**Example**:
|
|
> **Before**: "Email marketing is pretty effective."
|
|
> **After**: "Email marketing delivers an average ROI of $42 for every $1 spent, according to the Data & Marketing Association."
|
|
> **Why it's better**: Specific statistic, attributed source, quantifiable claim.
|
|
|
|
**Why it works**: Shows concrete improvement, demonstrates principle.
|
|
|
|
---
|
|
|
|
### 8. Key Insight Callouts
|
|
|
|
Highlighted important points.
|
|
|
|
**Structure**:
|
|
```markdown
|
|
> **Key insight**: [Memorable, quotable statement]
|
|
```
|
|
|
|
**Example**:
|
|
> **Key insight**: According to Google's John Mueller, internal linking is one of the most underutilized SEO tactics, with properly structured internal links often delivering faster ranking improvements than external link building.
|
|
|
|
**Why it works**: Visually distinct, authoritative, quotable.
|
|
|
|
---
|
|
|
|
## Content Optimization by Query Type
|
|
|
|
### Informational Queries ("What is...", "How does...", "Why...")
|
|
|
|
**AI citation priorities**:
|
|
1. Clear definitions
|
|
2. Comprehensive explanations
|
|
3. Expert perspectives
|
|
4. Supporting statistics
|
|
5. Real-world examples
|
|
|
|
**Optimal structure**:
|
|
- Definition in first paragraph
|
|
- "Why it matters" section
|
|
- How it works explanation
|
|
- Common use cases
|
|
- Expert quotes or citations
|
|
|
|
---
|
|
|
|
### Comparison Queries ("[A] vs [B]", "Best [category]")
|
|
|
|
**AI citation priorities**:
|
|
1. Comparison tables
|
|
2. Clear pros/cons lists
|
|
3. Use case recommendations
|
|
4. Specific differentiators
|
|
5. Verdict or recommendation
|
|
|
|
**Optimal structure**:
|
|
- Quick comparison table upfront
|
|
- Individual descriptions
|
|
- Feature-by-feature comparison
|
|
- "Choose X if..." recommendations
|
|
- Summary verdict
|
|
|
|
---
|
|
|
|
### How-To Queries ("How to...", "Steps to...")
|
|
|
|
**AI citation priorities**:
|
|
1. Numbered step-by-step processes
|
|
2. Required tools/prerequisites
|
|
3. Time estimates
|
|
4. Success indicators
|
|
5. Troubleshooting tips
|
|
|
|
**Optimal structure**:
|
|
- Prerequisites listed first
|
|
- Clear numbered steps
|
|
- Sub-steps where needed
|
|
- Visual indicators of progress
|
|
- Common problems and solutions
|
|
|
|
---
|
|
|
|
### Statistical Queries ("How much...", "How many...", "Statistics about...")
|
|
|
|
**AI citation priorities**:
|
|
1. Specific numbers with sources
|
|
2. Recent data (within 1-2 years)
|
|
3. Multiple data points
|
|
4. Context for statistics
|
|
5. Trend information
|
|
|
|
**Optimal structure**:
|
|
- Lead with key statistic
|
|
- Source attribution immediately after
|
|
- Context and interpretation
|
|
- Related statistics
|
|
- Takeaways from data
|
|
|
|
---
|
|
|
|
## Citation Likelihood Factors
|
|
|
|
### High Citation Likelihood
|
|
|
|
- [ ] Content from recognized authority domains
|
|
- [ ] Published or updated within 12 months
|
|
- [ ] Clear, standalone statements
|
|
- [ ] Proper source attribution
|
|
- [ ] Specific statistics with dates
|
|
- [ ] Structured with headings/lists/tables
|
|
- [ ] Comprehensive topic coverage
|
|
- [ ] Author credentials visible
|
|
- [ ] Technical accuracy verified
|
|
- [ ] Consensus with other sources
|
|
|
|
### Medium Citation Likelihood
|
|
|
|
- [ ] Content from less-known but quality domains
|
|
- [ ] Published 1-2 years ago
|
|
- [ ] Clear but requires slight context
|
|
- [ ] General industry claims
|
|
- [ ] Good structure but less scannable
|
|
- [ ] Moderate depth of coverage
|
|
- [ ] No author listed but quality content
|
|
- [ ] Some supporting evidence
|
|
|
|
### Low Citation Likelihood
|
|
|
|
- [ ] Content from unknown/low-authority domains
|
|
- [ ] Published 3+ years ago without updates
|
|
- [ ] Vague or ambiguous statements
|
|
- [ ] No sources cited
|
|
- [ ] Poor content structure (walls of text)
|
|
- [ ] Thin or superficial coverage
|
|
- [ ] Promotional or biased tone
|
|
- [ ] Factual inconsistencies
|
|
- [ ] No expertise signals
|
|
|
|
---
|
|
|
|
## AI System Comparison Summary
|
|
|
|
| Factor | Google AI Overviews | ChatGPT | Perplexity | Claude |
|
|
|--------|---------------------|---------|------------|--------|
|
|
| **Freshness bias** | High | Medium | Very high | N/A (training data) |
|
|
| **Authority weight** | Very high | High | High | High |
|
|
| **Structure importance** | High | Medium | Very high | Medium |
|
|
| **Citation count** | 3-8 | 1-6 | 5-10 | N/A |
|
|
| **Quotable focus** | High | Medium | Very high | High |
|
|
| **Domain trust** | Very high | High | Medium | High |
|
|
| **Factual density** | High | High | Very high | Very high |
|
|
|
|
---
|
|
|
|
## Tracking AI Citations
|
|
|
|
### Manual Monitoring
|
|
|
|
**Check if your content appears in**:
|
|
- Google AI Overviews for target keywords
|
|
- ChatGPT responses (search your domain in ChatGPT)
|
|
- Perplexity results for relevant queries
|
|
- Other AI search engines
|
|
|
|
**Test queries**:
|
|
- Exact-match questions from your FAQ
|
|
- Definitions of terms you've defined
|
|
- Statistics you've cited with attribution
|
|
- Processes you've documented
|
|
|
|
### Indicators of AI Visibility
|
|
|
|
- Increased direct traffic (AI users clicking sources)
|
|
- Traffic spikes from unusual referrers
|
|
- Engagement metrics: low bounce rate, high time-on-page
|
|
- Return visitors (AI users coming back for more depth)
|
|
|
|
---
|
|
|
|
## Optimization Checklist for AI Citations
|
|
|
|
Content ready for AI citation should have:
|
|
|
|
- [ ] At least 3 clear, quotable definitions
|
|
- [ ] 5+ specific statistics with sources and dates
|
|
- [ ] Q&A format sections covering top queries
|
|
- [ ] Comparison tables where relevant
|
|
- [ ] Numbered lists for processes
|
|
- [ ] Content published or updated within 12 months
|
|
- [ ] Author credentials visible
|
|
- [ ] External citations to authoritative sources
|
|
- [ ] Structured with clear H2/H3 headings
|
|
- [ ] Short paragraphs (2-4 sentences)
|
|
- [ ] No promotional language
|
|
- [ ] Technical accuracy verified
|
|
- [ ] Mobile-friendly formatting
|