GEO Glossary

Plain-language definitions for the terms you will encounter in your GEO report and throughout AI search optimization.

A

Above-the-Fold Alignment
Whether the most important content appears in the visible area before a user scrolls. Part of Answer-First Format. #
Accessibility
Website usability for people with disabilities: ARIA labels, form labels, color contrast, semantic HTML. Checked in the Site Health category. #
AI Brand Check
Queries AI models (ChatGPT, Google Gemini, Claude, Perplexity, and Llama) about your company to verify what they say. Available in the full scan only. Part of the Brand Authority category. #
AI Crawler
Programs used by AI companies to read websites. Examples: GPTBot (OpenAI), ClaudeBot (Anthropic), Google-Extended (Google), PerplexityBot (Perplexity). #
AI Overviews
Google's feature placing AI-generated answers above traditional search results. Launched 2024. Reaches 1B+ users across 100+ countries. #
Alt Text Coverage
Percentage of images that have alt text attributes. Part of Image Optimization. #
Alt Text Quality
Whether alt text is descriptive and contextually relevant rather than generic. Part of Image Optimization. #
Anchor Text Quality
Whether link text is descriptive rather than generic ("click here"). Part of Link Structure. #
Answer-First Content
Writing approach placing key takeaways at the beginning of each section, before the explanation. AI models prefer this structure for answer extraction. #
ARIA Landmark Regions
Accessible Rich Internet Application landmarks (main, nav, aside) that help screen readers navigate page sections. Part of Accessibility. #
Author Attribution
Whether the content is attributed to a named author rather than being anonymous or brand-only. Part of Author Expertise. #
Author Bio Section
Presence of a dedicated author biography with credentials and background. Part of Author Expertise. #
Author Signals
Presence of author names, bios, and links to author profiles. Part of E-E-A-T Signals. #
Authority Signals
External validation: citations, awards, industry recognition, institutional affiliations. Part of E-E-A-T Signals. #
Average Sentence Length
Mean number of words per sentence. Shorter sentences are easier for AI models to extract and cite. Part of Readability. #

B

Body Content
The main informational content within an llms.txt file. Part of llms.txt. #
Both Dates Present
Whether a page includes both published and modified date metadata, a signal of maintained content. Part of Content Freshness. #
Brand Entity Consistency
Alignment between how your brand is described across your website, schema markup, social profiles, and business listings. Inconsistency confuses AI models. #
Brand Name Consistency
Whether the brand name is spelled and capitalized identically across all page elements. Part of Brand Entity Consistency. #
Brand Prominence in Content
How prominently the brand entity appears relative to other content on the page. Part of Brand Entity Consistency. #
Brand Recognition
Whether AI models recognize the brand when asked directly. Part of AI Brand Check. #
Browser Cache Headers
Whether the server sends caching directives so browsers don't re-download unchanged files. Part of Performance Signals. #

C

Canonical URL
Tag designating the official version of a page when content exists at multiple URLs. Prevents citation confusion. #
Category
One of six GeoScored scoring categories: AI Discovery, Content Quality, Brand Authority, Citation Readiness, Site Health, and Emerging Signals. The first five are scored; Emerging Signals is informational. #
Character Encoding
The declared character set (usually UTF-8) ensuring text displays correctly. Part of Meta Tags. #
Check
A single test examining one aspect of a page. Each check produces a score, severity rating, and recommendation with an effort estimate. #
Citation
When an AI model references your content in its response. The primary goal of GEO. #
Citation Source Authority
The credibility level of sources cited in the content. Part of Fact Density. #
Citation-Worthy Content
How much of the content contains specific, verifiable claims that AI models could cite. Part of Fact Density. #
Claim Variety
The diversity of claim types in the content: statistics, dates, definitions, comparisons, and named entities. Part of Fact Density. #
Clickjack Protection
X-Frame-Options header preventing the page from being embedded in frames on malicious sites. Part of Security Headers. #
Complex Word Usage
Percentage of words with three or more syllables. Lower percentages indicate clearer writing. Part of Readability. #
Concrete Opening Lines
Whether sections begin with specific, factual statements rather than broad generalizations. Part of Answer-First Format. #
Content Completeness
How thoroughly the llms.txt file covers the site's key information for AI consumption. Part of llms.txt. #
Content Depth
Thoroughness of topic coverage measured by word count, subheadings, and supporting details. #
Content Extraction Surface
How much of a page is primary, extractable content versus navigation, sidebars, and interface chrome. Part of the Citation Readiness category. #
Content Freshness
How recently a page was updated. Detected through HTTP headers and meta tags. AI engines weight recent content higher. #
Content Integrity
Overall preservation of content meaning and structure through format conversion. Part of Markdown Fidelity. #
Content Recency
How recently the page content was published or updated, based on metadata dates. Part of Content Freshness. #
Content Type Schema
Schema.org type declared for the page (Article, Product, FAQ, etc.). Part of Schema Markup. #
Content Under Headings
Whether each heading is followed by substantive content rather than being empty or minimal. Part of Heading Hierarchy. #
Content Visibility Threshold
Whether critical page content is hidden behind JavaScript interactions, tabs, or accordions that AI crawlers cannot trigger. #
Content-to-Code Ratio
Proportion of visible text to underlying HTML code. Low ratios suggest content-light pages. Part of Content Depth. #
Content-Type Protection
X-Content-Type-Options header preventing browsers from guessing file types, which blocks certain attacks. Part of Security Headers. #
CSP (Content Security)
Content Security Policy header that restricts which scripts and resources can load on a page. Part of Security Headers. #
CSS & JS Minification
Whether CSS and JavaScript files have been compressed by removing whitespace and comments. Part of Performance Signals. #

D

Date Markup
Schema.org date fields (datePublished, dateModified) embedded in structured data. Part of Schema Markup. #
Deprecated HTML Elements
Usage of outdated HTML tags that reduce accessibility and standards compliance. Part of Accessibility. #
Description Consistency
Whether the brand description matches across meta tags, schema, and body content. Part of Brand Entity Consistency. #
Description Section
The summary field in an llms.txt file explaining what the site offers. Part of llms.txt. #
Descriptive Headings
Whether headings use specific, topic-relevant language rather than generic labels like "Introduction." Part of Heading Hierarchy. #
Direct Lead Paragraph
Whether the first paragraph delivers the key takeaway before supporting details. Part of Answer-First Format. #
DOM Element Count
Number of HTML elements on the page. Excessive elements (over 1,500) degrade performance. Part of Performance Signals. #
DOM Nesting Depth
How deeply HTML elements are nested inside each other. Deep nesting slows browser rendering. Part of Performance Signals. #
Duplicate Content
Identical or near-identical content at multiple URLs. Canonical URLs resolve this. #

E

E-E-A-T
Experience, Expertise, Authoritativeness, Trustworthiness. Google's content quality framework. GeoScored checks for signals of each. #
Effort Tag
Estimate attached to each recommendation: low (under 30 minutes), medium (1-3 hours), or high (half day or more). #
Embedded Content (iframes)
Content loaded through iframes, which most AI crawlers cannot access. Part of Content Depth. #
Emerging Signals
Newer AI discovery mechanisms tracked but not yet scored: llms.txt, llms-full.txt, technology stack detection. Reported in the Emerging Signals category. #
Entity in Knowledge Graph
Whether the brand or subject is recognized as an entity in Wikidata or similar knowledge bases. Part of Knowledge Graph. #
Entity Mention Position
Whether key entity names appear early in the page where crawlers are most likely to process them. Part of Content Visibility Threshold. #
Entity Strength
How well AI systems recognize and describe your brand. Measured through knowledge graph presence, E-E-A-T signals, brand consistency, and AI model responses. #
Expertise Signals
Evidence of subject-matter expertise: credentials, experience, specialized knowledge. Part of E-E-A-T Signals. #

F

Fact Density
Number of verifiable facts (statistics, dates, named sources) per 100 words. Also a sub-test measuring facts per 100 words within the Fact Density check. Research shows +40% AI citation rate for high fact density content. #
FAQ Schema
Structured FAQ markup that AI models can parse into question-answer pairs. Part of Schema Markup. #
Favicon
The small icon displayed in browser tabs and bookmarks. Part of Meta Tags. #
Font Loading Strategy
How web fonts are loaded: whether they use preloading, font-display swap, or other optimization techniques. Part of Performance Signals. #
Form Field Labels
Whether form inputs have associated label elements for screen reader accessibility. Part of Accessibility. #
Formatting Quality
Whether the llms.txt file follows proper formatting conventions and spec compliance. Part of llms.txt. #
Free Scan
GeoScored's no-cost GEO scan. 3 per day, no credit card required. Results available immediately. #
Freshness Score Over Time
How a page's freshness score decays as time passes since its last update. Newer content scores higher. Part of Content Freshness. #
Full Scan
GeoScored's complete scan tier running all scored checks including the AI Brand Check. Includes PDF export and shareable links. #

G

GEO (Generative Engine Optimization)
Optimizing website content for visibility in AI-powered search engines like ChatGPT, Perplexity, and Gemini. The AI-era evolution of SEO. #
GEO Score
A number from 0 to 100 measuring website readiness for AI discovery and citation. Weighted combination of five scoring categories. #

H

H1 Heading Position
Whether the main heading appears within the first portion of HTML that crawlers process. Part of Content Visibility Threshold. #
H2 Subheadings
The presence and quantity of H2-level subheadings that break content into scannable sections. Part of Heading Hierarchy. #
Heading Hierarchy
Structure of page headings (H1, H2, H3) helping AI understand section topics and locate specific answers. #
Heading Text Quality
Overall quality of heading text including length, keyword relevance, and clarity. Part of Heading Hierarchy. #
Heading-to-Answer Match
Whether the content immediately following a heading answers the question the heading implies. Part of Answer-First Format. #
Heading-to-Content Ratio
Balance between headings and the body content beneath them. Part of Content Extraction Surface. #
Headings Preserved
Whether headings convert cleanly when the page is processed into Markdown format. Part of Markdown Fidelity. #
Hreflang Code Validation
Whether hreflang language and region codes follow ISO standards. Part of Indexability. #
Hreflang Tags
HTML tags indicating alternate language or regional versions of a page. Part of Indexability. #
HSTS (Strict Transport)
HTTP Strict Transport Security header that forces browsers to use HTTPS. Part of Security Headers. #
HTML Doctype
The document type declaration that tells browsers which HTML version to use. Part of Indexability. #
HTML Document Size
Total size of the raw HTML file. Pages over 200KB take longer to download and parse. Part of Performance Signals. #
HTTP Status Code
The server response code (200, 301, 404, etc.) indicating the page's availability. Part of Indexability. #
HTTP/2 Protocol
Whether the server uses HTTP/2, which loads multiple resources simultaneously over a single connection. Part of Performance Signals. #
HTTPS Encryption
Whether the page is served over a secure, encrypted connection. Part of Security Headers. #

I

Image Alt Text Quality
Whether image alt text provides useful descriptions rather than filenames or empty attributes. Part of Accessibility. #
Image Dimensions Set
Whether images declare width and height attributes, preventing layout shifts during loading. Part of Image Optimization. #
Image Optimization
Configuring images with alt text, compression, and modern formats for performance and AI understanding. #
In-Content Expertise
Whether expert credentials and experience are woven into the content body, not just in a bio. Part of Author Expertise. #
Indexability
Whether search engines and AI crawlers can include your page in their index. Non-indexed pages are invisible. #
Industry Category Match
Whether the stated industry or category is consistent across all structured data. Part of Brand Entity Consistency. #
Inline Style Overuse
Excessive use of inline CSS styles instead of external stylesheets, which increases HTML size and reduces cacheability. Part of Performance Signals. #
INP Interaction Delay Risk
Risk of slow Interaction to Next Paint (INP), a Core Web Vital measuring how quickly a page responds to user input. Part of Performance Signals. #
Internal Nofollow Usage
Whether internal links incorrectly use rel="nofollow", which wastes link equity. Part of Link Structure. #

J

JavaScript Rendering Gap
Difference between raw HTML content and what appears after JavaScript executes. Many AI crawlers read only raw HTML. #
Script Footprint
The number of script elements present after JavaScript execution, including scripts dynamically injected at runtime by tag managers or analytics loaders. More scripts means more processing overhead for crawlers. Part of JS Rendering Gap. #
JS vs. HTML Content Gap
The difference between what appears in the raw HTML source and what appears after JavaScript runs. Large gaps mean AI crawlers may miss content. Part of JS Rendering Gap. #
JSON-LD
JavaScript Object Notation for Linked Data. Method for adding machine-readable structured information to web pages. #

K

Key Content Position
Whether the page's most important passages appear within the crawler's processing window. Part of Content Visibility Threshold. #
Knowledge Graph
Database of facts about entities (people, companies, places) maintained by Google, Wikidata, and AI models. Greater presence improves AI accuracy. #
Knowledge Graph Detail
How much structured information exists about the entity in the knowledge graph. Part of Knowledge Graph. #

L

Language Attribute
The HTML lang attribute declaring the page's language, essential for screen readers and AI processing. Part of Accessibility. #
Language Declaration
The HTML lang attribute specifying the page's primary language. Part of Meta Tags. #
Last Modified Date
Whether the page includes a dateModified timestamp in its metadata. Part of Content Freshness. #
Lazy Loading Setup
Whether off-screen images use lazy loading to improve initial page load speed. Part of Image Optimization. #
Linked Resources
URLs referenced in an llms.txt file pointing to key pages or documentation. Part of llms.txt. #
Lists Preserved
Whether bulleted and numbered lists survive Markdown conversion intact. Part of Markdown Fidelity. #
LLMs-full.txt
Extended version of llms.txt providing complete content for AI model consumption rather than just URLs. #
LLMs.txt
Text file at /llms.txt directing AI models to your most important pages. Specification at llmstxt.org. #
Low Filler Content
How free the content is from vague opening phrases and filler words that add no information. Part of Answer-First Format. #

M

Main Content Ratio
Percentage of the page that is primary content versus surrounding elements. Part of Content Extraction Surface. #
Markdown Fidelity
Percentage of page content that survives conversion to plain text format. Affects what AI models actually process. #
Meta Description
The HTML meta description summarizing the page for search engines and AI. Part of Meta Tags. #
Meta Tags
HTML title tag and meta description that appear in search results and AI summaries. #
Meta Tags in Source HTML
Whether important meta tags are present in the initial HTML before JavaScript executes. Part of JS Rendering Gap. #
Mixed Content Check
Whether a secure (HTTPS) page loads any insecure (HTTP) resources. Part of Security Headers. #

N

Named Entities & Nouns
The presence of proper nouns, brand names, and specific entity references that AI models use for verification. Part of Fact Density. #
No Heading Level Skips
Whether headings follow a logical sequence (H1 to H2 to H3) without jumping levels. Part of Heading Hierarchy. #
No-JavaScript Fallback
Whether the page provides meaningful content when JavaScript is disabled or fails to load. Part of JS Rendering Gap. #

O

Open Graph
Protocol controlling page appearance when shared on social platforms. Specifies title, description, and image. #
Open Graph Essentials
Required Open Graph tags (title, description, image, URL) for social media previews. Part of Social Tags. #
Open Graph Quality
Whether Open Graph content is well-written and images meet platform size requirements. Part of Social Tags. #
Organization Schema
Structured data markup identifying the organization behind a website. Part of Schema Markup. #

P

Page Speed
Loading performance measured through resource sizes, render-blocking scripts, and compression. #
Page Summary Present
Whether the page includes a summary or abstract that captures its main points. Part of Answer-First Format. #
Passage Length
Average length of self-contained text passages. AI citation-friendly passages are typically 20-80 words. Part of Passage Self-Containment. #
Passage Self-Containment
Whether individual paragraphs make sense without surrounding context. Critical because AI extracts single passages. #
Passage Specificity
Whether individual passages include enough specific detail to be meaningful when extracted in isolation. Part of Passage Self-Containment. #
Preload & Prefetch Hints
Resource hint tags that tell browsers to fetch critical assets early. Part of Performance Signals. #
Published Date
Whether the page includes a datePublished timestamp in its metadata. Part of Content Freshness. #

R

Readability
Content clarity measured by reading level and sentence complexity. Clearer writing improves AI parsing accuracy. #
Reading Grade Level
The Flesch-Kincaid grade level of the content. Lower grades mean broader accessibility. Part of Readability. #
Redirect Chain Length
Number of redirects between the requested URL and the final destination. Long chains slow crawlers and waste crawl budget. Part of Redirect Chains. #
Redirect Status Types
Whether redirects use appropriate HTTP status codes (301 for permanent, 302 for temporary). Part of Redirect Chains. #
Render-Blocking Resources
Scripts and stylesheets in the head that prevent the page from displaying until they load. Part of Performance Signals. #
Repeated Content Blocks
Detection of identical text blocks appearing multiple times on the same page. Part of Duplicate Content. #
Repeated Template Ratio
Percentage of page content that appears on other pages (headers, footers, sidebars). Part of Duplicate Content. #
Response Compression
Whether the server uses gzip or Brotli compression to reduce file transfer size. Part of Performance Signals. #
Response Consistency
Whether different AI models give consistent information about the brand. Part of AI Brand Check. #
Response Detail
How much detail AI models provide when describing the brand. Part of AI Brand Check. #
Robots Meta Directives
Meta robots tags and X-Robots-Tag headers that control how search engines index a page. Part of Indexability. #
Robots.txt
File at yoursite.com/robots.txt instructing crawlers which pages they can access. #
Robots.txt Validation
Whether the site's robots.txt file follows proper syntax and doesn't accidentally block important content. Part of Indexability. #

S

Schema Completeness
How thoroughly required and recommended properties are filled in across all schema types on a page. Part of Schema Markup. #
Schema Content Quality
Whether schema markup content is meaningful and non-generic rather than placeholder text. Part of Schema Markup. #
Schema Data Position
Whether structured data appears early enough in the HTML for crawlers to find it. Part of Content Visibility Threshold. #
Schema Markup
Structured data (typically JSON-LD) explicitly telling search engines and AI about page content. Common types: Organization, Article, FAQPage, Product. #
Search Crawler Access
Whether the page is accessible to search engine crawlers based on robots.txt rules. Part of Indexability. #
Section Structure
How well-organized the sections are within an llms.txt file. Part of llms.txt. #
Security Headers
HTTP response headers protecting site visitors: HTTPS, Content-Security-Policy, Strict-Transport-Security. #
Self-Contained References
Whether passages avoid pronouns like "this," "that," or "it" that require reading prior context to understand. Part of Passage Self-Containment. #
SEO (Search Engine Optimization)
Traditional practice of improving visibility in Google and Bing results. GEO is the AI search equivalent. #
Share Image Quality
Resolution, dimensions, and visual quality of the image shown in social media previews. Part of Social Tags. #
Simple HTML Nesting
Whether the page avoids deeply nested HTML that can break during AI content extraction. Part of Markdown Fidelity. #
Single H1 Tag
Whether the page has exactly one H1 heading, establishing a clear primary topic. Part of Heading Hierarchy. #
Sitemap Quality
Validity and completeness of the site's XML sitemap. Part of Indexability. #
Social Tags
Open Graph and Twitter Card meta tags controlling page appearance when shared. #
Source Attribution
Whether claims and statistics are attributed to named sources. Part of Fact Density. #
Speakable Markup
Schema.org property identifying content suitable for text-to-speech and voice assistants. Part of Schema Markup. #
Structure After Rendering
Whether headings, lists, and other structural elements survive JavaScript rendering intact. Part of JS Rendering Gap. #

T

Tables Preserved
Whether data tables maintain their structure during Markdown conversion. Part of Markdown Fidelity. #
Technology Stack
The CMS, framework, and plugins a website uses. Detected as an Emerging Signal. Some stacks have known AI accessibility patterns. #
Text Uniqueness
How much of the visible text is unique to this page versus duplicated elsewhere on the site. Part of Duplicate Content. #
Third-Party Script Impact
The number and size of scripts loaded from external domains (analytics, ads, chat widgets). Part of Performance Signals. #
Title & H1 Alignment
Whether the page title and main heading convey consistent messaging. Part of Meta Tags. #
Title Section
The title field in an llms.txt file that identifies the site to AI models. Part of llms.txt. #
Title Tag
The HTML title element that appears in browser tabs and search results. Part of Meta Tags. #
Topic Section Coverage
How many distinct subtopics or sections the page addresses. Part of Content Depth. #
Topical Authority
Depth and breadth of coverage on a topic. AI models preferentially cite sources that demonstrate consistent expertise. #
Topical Relevance Ratio
Proportion of internal links pointing to topically related content versus unrelated pages. Part of Topical Authority. #
Tracking Parameters
Presence of analytics tracking parameters (utm_source, fbclid) that create duplicate URLs. Part of URL Structure. #
Trailing Slash Consistency
Whether the site consistently uses or omits trailing slashes, avoiding duplicate URLs. Part of URL Structure. #
Trust Signals
Transparency indicators: contact information, privacy policy, editorial standards. Part of E-E-A-T Signals. #
Twitter/X Card
Twitter Card meta tags that control how the page appears when shared on X (formerly Twitter). Part of Social Tags. #

U

Unique Information Value
Whether the page contributes new information not widely available elsewhere. Part of Fact Density. #
URL Consistency
Whether URLs referenced in structured data match the actual page URLs. Part of Brand Entity Consistency. #
URL Length
Total character count of the page URL. Shorter, cleaner URLs are easier for AI to process and cite. Part of URL Structure. #
URL Normalization
Whether variations of the same URL (www vs. non-www, http vs. https) resolve consistently. Part of Redirect Chains. #
URL Path Depth
Number of directory levels in the URL. Deeply nested URLs suggest poor site architecture. Part of URL Structure. #
URL Query Parameters
Presence of query strings (?key=value) which can create duplicate content issues. Part of URL Structure. #
URL Slug Quality
Readability and descriptiveness of the URL path. Part of URL Structure. #
URL Structure
Organization of page URLs: length, readability, and hierarchy. Clean, descriptive URLs aid AI topic understanding. #

V

Viewport Tag
Mobile responsiveness declaration that tells browsers how to scale the page. Part of Meta Tags. #

W

Word Count
Total words on the page. Thin content under 300 words rarely provides enough substance for AI citation. Part of Content Depth. #

See where your content stands

Enter any URL. Get your score in 60 seconds. Free.

Run a free GEO scan