01 The Rule
Every indexable URL must use lowercase characters, hyphens as word separators, and reflect the site's content hierarchy in its path segments. URLs must be deterministic — the same content always resolves to exactly one URL.
Last updated: 2025-12-01
Every indexable URL must use lowercase characters, hyphens as word separators, and reflect the site's content hierarchy in its path segments. URLs must be deterministic — the same content always resolves to exactly one URL.
Search engines treat URLs as unique identifiers. Case variations, underscores vs hyphens, and inconsistent trailing slashes create duplicate content. At scale, even small URL inconsistencies multiply into thousands of duplicate signals that dilute crawl budget and link equity.
Mixed case URLs serving identical content
Duplicate content signals, split link equity between case variants
Underscores in URLs instead of hyphens
Google treats underscores as joiners, not separators — words aren't tokenized correctly
Parameters for primary content (?id=123)
Poor crawl priority, fragile canonicalization, analytics attribution problems
Audit all URL patterns with a crawl tool. Implement server-side normalization that redirects non-canonical variants (uppercase, underscore, parameter-based) to the canonical form with a 301.