GEO Glossary | Plain-Language Definitions for AI Search Terms

A

Above-the-Fold Alignment: Whether the most important content appears in the visible area before a user scrolls. Part of Answer-First Format. #
Accessibility: Website usability for people with disabilities: ARIA labels, form labels, color contrast, semantic HTML. Checked in the Site Health category. #
AI Brand Check: Queries AI models (ChatGPT, Google Gemini, Claude, Perplexity, and Llama) about your company to verify what they say. Available in the full scan only. Part of the Brand Authority category. #
AI Crawler: Programs used by AI companies to read websites. Examples: GPTBot (OpenAI), ClaudeBot (Anthropic), Google-Extended (Google), PerplexityBot (Perplexity). #
AI Overviews: Google's feature placing AI-generated answers above traditional search results. Launched 2024. Reaches 1B+ users across 100+ countries. #
Alt Text Coverage: Percentage of images that have alt text attributes. Part of Image Optimization. #
Alt Text Quality: Whether alt text is descriptive and contextually relevant rather than generic. Part of Image Optimization. #
Anchor Text Quality: Whether link text is descriptive rather than generic ("click here"). Part of Link Structure. #
Answer-First Content: Writing approach placing key takeaways at the beginning of each section, before the explanation. AI models prefer this structure for answer extraction. #
ARIA Landmark Regions: Accessible Rich Internet Application landmarks (main, nav, aside) that help screen readers navigate page sections. Part of Accessibility. #
Author Attribution: Whether the content is attributed to a named author rather than being anonymous or brand-only. Part of Author Expertise. #
Author Bio Section: Presence of a dedicated author biography with credentials and background. Part of Author Expertise. #
Author Signals: Presence of author names, bios, and links to author profiles. Part of E-E-A-T Signals. #
Authority Signals: External validation: citations, awards, industry recognition, institutional affiliations. Part of E-E-A-T Signals. #
Average Sentence Length: Mean number of words per sentence. Shorter sentences are easier for AI models to extract and cite. Part of Readability. #

B

Body Content: The main informational content within an llms.txt file. Part of llms.txt. #
Both Dates Present: Whether a page includes both published and modified date metadata, a signal of maintained content. Part of Content Freshness. #
Brand Entity Consistency: Alignment between how your brand is described across your website, schema markup, social profiles, and business listings. Inconsistency confuses AI models. #
Brand Name Consistency: Whether the brand name is spelled and capitalized identically across all page elements. Part of Brand Entity Consistency. #
Brand Prominence in Content: How prominently the brand entity appears relative to other content on the page. Part of Brand Entity Consistency. #
Brand Recognition: Whether AI models recognize the brand when asked directly. Part of AI Brand Check. #
Breadcrumb Schema: Navigation hierarchy markup showing where a page sits within a site. Part of Schema Markup. #
Broken Anchor Links: Links to page sections (#anchors) that don't exist on the target page. Part of Link Structure. #
Broken Link Check: Detection of links pointing to pages that return errors. Part of Link Structure. #
Browser Cache Headers: Whether the server sends caching directives so browsers don't re-download unchanged files. Part of Performance Signals. #

C

Canonical URL: Tag designating the official version of a page when content exists at multiple URLs. Prevents citation confusion. #
Category: One of six GeoScored scoring categories: AI Discovery, Content Quality, Brand Authority, Citation Readiness, Site Health, and Emerging Signals. The first five are scored; Emerging Signals is informational. #
Character Encoding: The declared character set (usually UTF-8) ensuring text displays correctly. Part of Meta Tags. #
Check: A single test examining one aspect of a page. Each check produces a score, severity rating, and recommendation with an effort estimate. #
Citation: When an AI model references your content in its response. The primary goal of GEO. #
Citation Source Authority: The credibility level of sources cited in the content. Part of Fact Density. #
Citation-Worthy Content: How much of the content contains specific, verifiable claims that AI models could cite. Part of Fact Density. #
Claim Variety: The diversity of claim types in the content: statistics, dates, definitions, comparisons, and named entities. Part of Fact Density. #
Clickjack Protection: X-Frame-Options header preventing the page from being embedded in frames on malicious sites. Part of Security Headers. #
Complex Word Usage: Percentage of words with three or more syllables. Lower percentages indicate clearer writing. Part of Readability. #
Concrete Opening Lines: Whether sections begin with specific, factual statements rather than broad generalizations. Part of Answer-First Format. #
Content Completeness: How thoroughly the llms.txt file covers the site's key information for AI consumption. Part of llms.txt. #
Content Depth: Thoroughness of topic coverage measured by word count, subheadings, and supporting details. #
Content Extraction Surface: How much of a page is primary, extractable content versus navigation, sidebars, and interface chrome. Part of the Citation Readiness category. #
Content Freshness: How recently a page was updated. Detected through HTTP headers and meta tags. AI engines weight recent content higher. #
Content Integrity: Overall preservation of content meaning and structure through format conversion. Part of Markdown Fidelity. #
Content Recency: How recently the page content was published or updated, based on metadata dates. Part of Content Freshness. #
Content Type Schema: Schema.org type declared for the page (Article, Product, FAQ, etc.). Part of Schema Markup. #
Content Under Headings: Whether each heading is followed by substantive content rather than being empty or minimal. Part of Heading Hierarchy. #
Content Visibility Threshold: Whether critical page content is hidden behind JavaScript interactions, tabs, or accordions that AI crawlers cannot trigger. #
Content-to-Code Ratio: Proportion of visible text to underlying HTML code. Low ratios suggest content-light pages. Part of Content Depth. #
Content-Type Protection: X-Content-Type-Options header preventing browsers from guessing file types, which blocks certain attacks. Part of Security Headers. #
CSP (Content Security): Content Security Policy header that restricts which scripts and resources can load on a page. Part of Security Headers. #
CSS & JS Minification: Whether CSS and JavaScript files have been compressed by removing whitespace and comments. Part of Performance Signals. #

D

Date Markup: Schema.org date fields (datePublished, dateModified) embedded in structured data. Part of Schema Markup. #
Deprecated HTML Elements: Usage of outdated HTML tags that reduce accessibility and standards compliance. Part of Accessibility. #
Description Consistency: Whether the brand description matches across meta tags, schema, and body content. Part of Brand Entity Consistency. #
Description Section: The summary field in an llms.txt file explaining what the site offers. Part of llms.txt. #
Descriptive Headings: Whether headings use specific, topic-relevant language rather than generic labels like "Introduction." Part of Heading Hierarchy. #
Descriptive Link Text: Whether links use meaningful text rather than generic text ("click here"). Part of Accessibility. #
Direct Lead Paragraph: Whether the first paragraph delivers the key takeaway before supporting details. Part of Answer-First Format. #
DOM Element Count: Number of HTML elements on the page. Excessive elements (over 1,500) degrade performance. Part of Performance Signals. #
DOM Nesting Depth: How deeply HTML elements are nested inside each other. Deep nesting slows browser rendering. Part of Performance Signals. #
Domain Link Confirmed: Whether the knowledge graph entity is linked to the scanned domain. Part of Knowledge Graph. #
Duplicate Content: Identical or near-identical content at multiple URLs. Canonical URLs resolve this. #

E

E-E-A-T: Experience, Expertise, Authoritativeness, Trustworthiness. Google's content quality framework. GeoScored checks for signals of each. #
Effort Tag: Estimate attached to each recommendation: low (under 30 minutes), medium (1-3 hours), or high (half day or more). #
Embedded Content (iframes): Content loaded through iframes, which most AI crawlers cannot access. Part of Content Depth. #
Emerging Signals: Newer AI discovery mechanisms tracked but not yet scored: llms.txt, llms-full.txt, technology stack detection. Reported in the Emerging Signals category. #
Entity in Knowledge Graph: Whether the brand or subject is recognized as an entity in Wikidata or similar knowledge bases. Part of Knowledge Graph. #
Entity Mention Position: Whether key entity names appear early in the page where crawlers are most likely to process them. Part of Content Visibility Threshold. #
Entity Strength: How well AI systems recognize and describe your brand. Measured through knowledge graph presence, E-E-A-T signals, brand consistency, and AI model responses. #
Expertise Signals: Evidence of subject-matter expertise: credentials, experience, specialized knowledge. Part of E-E-A-T Signals. #
External Link Quality: Quality assessment of links pointing to other websites. Part of Link Structure. #

F

Fact Density: Number of verifiable facts (statistics, dates, named sources) per 100 words. Also a sub-test measuring facts per 100 words within the Fact Density check. Research shows +40% AI citation rate for high fact density content. #
FAQ Schema: Structured FAQ markup that AI models can parse into question-answer pairs. Part of Schema Markup. #
Favicon: The small icon displayed in browser tabs and bookmarks. Part of Meta Tags. #
Font Loading Strategy: How web fonts are loaded: whether they use preloading, font-display swap, or other optimization techniques. Part of Performance Signals. #
Form Field Labels: Whether form inputs have associated label elements for screen reader accessibility. Part of Accessibility. #
Formatting Quality: Whether the llms.txt file follows proper formatting conventions and spec compliance. Part of llms.txt. #
Free Scan: GeoScored's no-cost GEO scan. 3 per day, no credit card required. Results available immediately. #
Freshness Score Over Time: How a page's freshness score decays as time passes since its last update. Newer content scores higher. Part of Content Freshness. #
Full Scan: GeoScored's complete scan tier running all scored checks including the AI Brand Check. Includes PDF export and shareable links. #

G

GEO (Generative Engine Optimization): Optimizing website content for visibility in AI-powered search engines like ChatGPT, Perplexity, and Gemini. The AI-era evolution of SEO. #
GEO Score: A number from 0 to 100 measuring website readiness for AI discovery and citation. Weighted combination of five scoring categories. #

H

H1 Heading Position: Whether the main heading appears within the first portion of HTML that crawlers process. Part of Content Visibility Threshold. #
H2 Subheadings: The presence and quantity of H2-level subheadings that break content into scannable sections. Part of Heading Hierarchy. #
Heading Hierarchy: Structure of page headings (H1, H2, H3) helping AI understand section topics and locate specific answers. #
Heading Text Quality: Overall quality of heading text including length, keyword relevance, and clarity. Part of Heading Hierarchy. #
Heading-to-Answer Match: Whether the content immediately following a heading answers the question the heading implies. Part of Answer-First Format. #
Heading-to-Content Ratio: Balance between headings and the body content beneath them. Part of Content Extraction Surface. #
Headings Preserved: Whether headings convert cleanly when the page is processed into Markdown format. Part of Markdown Fidelity. #
Hreflang Code Validation: Whether hreflang language and region codes follow ISO standards. Part of Indexability. #
Hreflang Tags: HTML tags indicating alternate language or regional versions of a page. Part of Indexability. #
HSTS (Strict Transport): HTTP Strict Transport Security header that forces browsers to use HTTPS. Part of Security Headers. #
HTML Doctype: The document type declaration that tells browsers which HTML version to use. Part of Indexability. #
HTML Document Size: Total size of the raw HTML file. Pages over 200KB take longer to download and parse. Part of Performance Signals. #
HTTP Status Code: The server response code (200, 301, 404, etc.) indicating the page's availability. Part of Indexability. #
HTTP/2 Protocol: Whether the server uses HTTP/2, which loads multiple resources simultaneously over a single connection. Part of Performance Signals. #
HTTPS Encryption: Whether the page is served over a secure, encrypted connection. Part of Security Headers. #

I

Image Alt Text Quality: Whether image alt text provides useful descriptions rather than filenames or empty attributes. Part of Accessibility. #
Image Dimensions Set: Whether images declare width and height attributes, preventing layout shifts during loading. Part of Image Optimization. #
Image Optimization: Configuring images with alt text, compression, and modern formats for performance and AI understanding. #
In-Content Expertise: Whether expert credentials and experience are woven into the content body, not just in a bio. Part of Author Expertise. #
Indexability: Whether search engines and AI crawlers can include your page in their index. Non-indexed pages are invisible. #
Industry Category Match: Whether the stated industry or category is consistent across all structured data. Part of Brand Entity Consistency. #
Inline Style Overuse: Excessive use of inline CSS styles instead of external stylesheets, which increases HTML size and reduces cacheability. Part of Performance Signals. #
INP Interaction Delay Risk: Risk of slow Interaction to Next Paint (INP), a Core Web Vital measuring how quickly a page responds to user input. Part of Performance Signals. #
Internal Link Count: Number of links pointing to other pages on the same website. Part of Link Structure. #
Internal Link Density: Concentration of internal links relative to content length, indicating how well-connected the page is within the site. Part of Topical Authority. #
Internal Nofollow Usage: Whether internal links incorrectly use rel="nofollow", which wastes link equity. Part of Link Structure. #

J

JavaScript Rendering Gap: Difference between raw HTML content and what appears after JavaScript executes. Many AI crawlers read only raw HTML. #
Script Footprint: The number of script elements present after JavaScript execution, including scripts dynamically injected at runtime by tag managers or analytics loaders. More scripts means more processing overhead for crawlers. Part of JS Rendering Gap. #
JS vs. HTML Content Gap: The difference between what appears in the raw HTML source and what appears after JavaScript runs. Large gaps mean AI crawlers may miss content. Part of JS Rendering Gap. #
JSON-LD: JavaScript Object Notation for Linked Data. Method for adding machine-readable structured information to web pages. #

K

Key Content Position: Whether the page's most important passages appear within the crawler's processing window. Part of Content Visibility Threshold. #
Knowledge Graph: Database of facts about entities (people, companies, places) maintained by Google, Wikidata, and AI models. Greater presence improves AI accuracy. #
Knowledge Graph Detail: How much structured information exists about the entity in the knowledge graph. Part of Knowledge Graph. #

L

Language Attribute: The HTML lang attribute declaring the page's language, essential for screen readers and AI processing. Part of Accessibility. #
Language Declaration: The HTML lang attribute specifying the page's primary language. Part of Meta Tags. #
Last Modified Date: Whether the page includes a dateModified timestamp in its metadata. Part of Content Freshness. #
Lazy Loading Setup: Whether off-screen images use lazy loading to improve initial page load speed. Part of Image Optimization. #
Link Health: Condition of internal and external links. Broken links and redirect chains signal neglect to crawlers. #
Link-to-Content Ratio: Balance between navigational links and substantive text content. Part of Content Extraction Surface. #
Linked Resources: URLs referenced in an llms.txt file pointing to key pages or documentation. Part of llms.txt. #
Lists Preserved: Whether bulleted and numbered lists survive Markdown conversion intact. Part of Markdown Fidelity. #
LLMs-full.txt: Extended version of llms.txt providing complete content for AI model consumption rather than just URLs. #
LLMs.txt: Text file at /llms.txt directing AI models to your most important pages. Specification at llmstxt.org. #
Low Filler Content: How free the content is from vague opening phrases and filler words that add no information. Part of Answer-First Format. #

M

Main Content Ratio: Percentage of the page that is primary content versus surrounding elements. Part of Content Extraction Surface. #
Markdown Fidelity: Percentage of page content that survives conversion to plain text format. Affects what AI models actually process. #
Meta Description: The HTML meta description summarizing the page for search engines and AI. Part of Meta Tags. #
Meta Tags: HTML title tag and meta description that appear in search results and AI summaries. #
Meta Tags in Source HTML: Whether important meta tags are present in the initial HTML before JavaScript executes. Part of JS Rendering Gap. #
Mixed Content Check: Whether a secure (HTTPS) page loads any insecure (HTTP) resources. Part of Security Headers. #

N

Named Entities & Nouns: The presence of proper nouns, brand names, and specific entity references that AI models use for verification. Part of Fact Density. #
Navigation & UI Ratio: Proportion of the page occupied by navigation menus, sidebars, and interface elements. Part of Content Extraction Surface. #
No Heading Level Skips: Whether headings follow a logical sequence (H1 to H2 to H3) without jumping levels. Part of Heading Hierarchy. #
No-JavaScript Fallback: Whether the page provides meaningful content when JavaScript is disabled or fails to load. Part of JS Rendering Gap. #

O

Open Graph: Protocol controlling page appearance when shared on social platforms. Specifies title, description, and image. #
Open Graph Essentials: Required Open Graph tags (title, description, image, URL) for social media previews. Part of Social Tags. #
Open Graph Quality: Whether Open Graph content is well-written and images meet platform size requirements. Part of Social Tags. #
Organization Schema: Structured data markup identifying the organization behind a website. Part of Schema Markup. #
Outbound Link Text Quality: Whether links to external sites use descriptive anchor text relevant to the linked content. Part of Topical Authority. #

P

Page Speed: Loading performance measured through resource sizes, render-blocking scripts, and compression. #
Page Summary Present: Whether the page includes a summary or abstract that captures its main points. Part of Answer-First Format. #
Passage Length: Average length of self-contained text passages. AI citation-friendly passages are typically 20-80 words. Part of Passage Self-Containment. #
Passage Self-Containment: Whether individual paragraphs make sense without surrounding context. Critical because AI extracts single passages. #
Passage Specificity: Whether individual passages include enough specific detail to be meaningful when extracted in isolation. Part of Passage Self-Containment. #
Preload & Prefetch Hints: Resource hint tags that tell browsers to fetch critical assets early. Part of Performance Signals. #
Published Date: Whether the page includes a datePublished timestamp in its metadata. Part of Content Freshness. #

R

Readability: Content clarity measured by reading level and sentence complexity. Clearer writing improves AI parsing accuracy. #
Reading Grade Level: The Flesch-Kincaid grade level of the content. Lower grades mean broader accessibility. Part of Readability. #
Redirect Chain Length: Number of redirects between the requested URL and the final destination. Long chains slow crawlers and waste crawl budget. Part of Redirect Chains. #
Redirect Status Types: Whether redirects use appropriate HTTP status codes (301 for permanent, 302 for temporary). Part of Redirect Chains. #
Render-Blocking Resources: Scripts and stylesheets in the head that prevent the page from displaying until they load. Part of Performance Signals. #
Repeated Content Blocks: Detection of identical text blocks appearing multiple times on the same page. Part of Duplicate Content. #
Repeated Template Ratio: Percentage of page content that appears on other pages (headers, footers, sidebars). Part of Duplicate Content. #
Response Compression: Whether the server uses gzip or Brotli compression to reduce file transfer size. Part of Performance Signals. #
Response Consistency: Whether different AI models give consistent information about the brand. Part of AI Brand Check. #
Response Detail: How much detail AI models provide when describing the brand. Part of AI Brand Check. #
Robots Meta Directives: Meta robots tags and X-Robots-Tag headers that control how search engines index a page. Part of Indexability. #
Robots.txt: File at yoursite.com/robots.txt instructing crawlers which pages they can access. #
Robots.txt Validation: Whether the site's robots.txt file follows proper syntax and doesn't accidentally block important content. Part of Indexability. #

S

Schema Completeness: How thoroughly required and recommended properties are filled in across all schema types on a page. Part of Schema Markup. #
Schema Content Quality: Whether schema markup content is meaningful and non-generic rather than placeholder text. Part of Schema Markup. #
Schema Data Position: Whether structured data appears early enough in the HTML for crawlers to find it. Part of Content Visibility Threshold. #
Schema Markup: Structured data (typically JSON-LD) explicitly telling search engines and AI about page content. Common types: Organization, Article, FAQPage, Product. #
Search Crawler Access: Whether the page is accessible to search engine crawlers based on robots.txt rules. Part of Indexability. #
Section Structure: How well-organized the sections are within an llms.txt file. Part of llms.txt. #
Security Headers: HTTP response headers protecting site visitors: HTTPS, Content-Security-Policy, Strict-Transport-Security. #
Self-Contained References: Whether passages avoid pronouns like "this," "that," or "it" that require reading prior context to understand. Part of Passage Self-Containment. #
SEO (Search Engine Optimization): Traditional practice of improving visibility in Google and Bing results. GEO is the AI search equivalent. #
Share Image Quality: Resolution, dimensions, and visual quality of the image shown in social media previews. Part of Social Tags. #
Simple HTML Nesting: Whether the page avoids deeply nested HTML that can break during AI content extraction. Part of Markdown Fidelity. #
Single H1 Tag: Whether the page has exactly one H1 heading, establishing a clear primary topic. Part of Heading Hierarchy. #
Sitemap Quality: Validity and completeness of the site's XML sitemap. Part of Indexability. #
Skip Navigation Link: A hidden link allowing keyboard users to skip past navigation menus to the main content. Part of Accessibility. #
Social Tags: Open Graph and Twitter Card meta tags controlling page appearance when shared. #
Source Attribution: Whether claims and statistics are attributed to named sources. Part of Fact Density. #
Speakable Markup: Schema.org property identifying content suitable for text-to-speech and voice assistants. Part of Schema Markup. #
Sponsored & UGC Labels: Whether paid links and user-generated content links use proper rel attributes. Part of Link Structure. #
Structure After Rendering: Whether headings, lists, and other structural elements survive JavaScript rendering intact. Part of JS Rendering Gap. #

T

Table of Contents Links: Whether the page includes navigation links to its own sections, aiding both readers and AI extraction. Part of Heading Hierarchy. #
Tables Preserved: Whether data tables maintain their structure during Markdown conversion. Part of Markdown Fidelity. #
Technology Stack: The CMS, framework, and plugins a website uses. Detected as an Emerging Signal. Some stacks have known AI accessibility patterns. #
Text Uniqueness: How much of the visible text is unique to this page versus duplicated elsewhere on the site. Part of Duplicate Content. #
Third-Party Script Impact: The number and size of scripts loaded from external domains (analytics, ads, chat widgets). Part of Performance Signals. #
Title & H1 Alignment: Whether the page title and main heading convey consistent messaging. Part of Meta Tags. #
Title Section: The title field in an llms.txt file that identifies the site to AI models. Part of llms.txt. #
Title Tag: The HTML title element that appears in browser tabs and search results. Part of Meta Tags. #
Topic Section Coverage: How many distinct subtopics or sections the page addresses. Part of Content Depth. #
Topical Authority: Depth and breadth of coverage on a topic. AI models preferentially cite sources that demonstrate consistent expertise. #
Topical Relevance Ratio: Proportion of internal links pointing to topically related content versus unrelated pages. Part of Topical Authority. #
Tracking Parameters: Presence of analytics tracking parameters (utm_source, fbclid) that create duplicate URLs. Part of URL Structure. #
Trailing Slash Consistency: Whether the site consistently uses or omits trailing slashes, avoiding duplicate URLs. Part of URL Structure. #
Trust Signals: Transparency indicators: contact information, privacy policy, editorial standards. Part of E-E-A-T Signals. #
Twitter/X Card: Twitter Card meta tags that control how the page appears when shared on X (formerly Twitter). Part of Social Tags. #

U

Unique Information Value: Whether the page contributes new information not widely available elsewhere. Part of Fact Density. #
URL Consistency: Whether URLs referenced in structured data match the actual page URLs. Part of Brand Entity Consistency. #
URL Length: Total character count of the page URL. Shorter, cleaner URLs are easier for AI to process and cite. Part of URL Structure. #
URL Normalization: Whether variations of the same URL (www vs. non-www, http vs. https) resolve consistently. Part of Redirect Chains. #
URL Path Depth: Number of directory levels in the URL. Deeply nested URLs suggest poor site architecture. Part of URL Structure. #
URL Query Parameters: Presence of query strings (?key=value) which can create duplicate content issues. Part of URL Structure. #
URL Slug Quality: Readability and descriptiveness of the URL path. Part of URL Structure. #
URL Structure: Organization of page URLs: length, readability, and hierarchy. Clean, descriptive URLs aid AI topic understanding. #

V

Viewport Tag: Mobile responsiveness declaration that tells browsers how to scale the page. Part of Meta Tags. #

W

Word Count: Total words on the page. Thin content under 300 words rarely provides enough substance for AI citation. Part of Content Depth. #

A

B

C

D

E

F

G

H

I

J

K

L

M

N

O

P

R

S

T

U

V

W

See where your content stands