Definition
What this term means
The designated 'preferred' URL for a piece of content when that content is accessible at multiple URLs. Canonical tags (rel='canonical') tell search engines and AI systems which version of a page is the authoritative one, consolidating ranking signals and preventing duplicate content issues. This is essential when the same content appears at different URLs due to parameters, pagination, or syndication.
Why it matters
The business impact
Without canonical URLs, AI systems may encounter the same content at multiple addresses and split their authority assessment between them, weakening each version. By declaring a canonical URL, you consolidate all authority signals onto one definitive source, increasing the likelihood that AI systems will cite and recommend that specific page with full confidence.
Used in context
How you might use this term
“A brand's product page was accessible at five different URLs due to tracking parameters and category navigation. AI systems were citing different versions inconsistently. After implementing canonical tags pointing to a single clean URL, citation consistency improved and the canonical page accumulated significantly stronger authority signals.”
Related terms
Explore connected concepts
Crawl Budget
The total number of pages that search engine and AI crawlers will fetch from your website within a given time period. Crawl budget is determined by a combination of your site's perceived authority, server performance, URL structure, and content freshness signals. Crawlers allocate their budget based on these factors, spending more time on sites they consider valuable and efficient to crawl.
Structured Data
Machine-readable code embedded in web pages that explicitly defines entities, attributes, and relationships using a standardised vocabulary. JSON-LD (JavaScript Object Notation for Linked Data) is the preferred format. It sits in a script tag on the page and tells AI systems exactly what the page is about: the organisation behind it, the author's credentials, the product details, the article's topic, and more.
Duplicate Content
Substantially identical or very similar content appearing at multiple URLs, either within the same website or across different websites. Duplicate content confuses search engines and AI systems about which version is the original, authoritative source, leading to diluted authority signals and inconsistent citation behaviour. Common causes include URL parameters, printer-friendly pages, syndicated content, and product variations.