JSON-LD (JavaScript Object Notation for Linked Data) embeds structured, machine-readable data in a `<script type='application/ld+json'>` block within the page's HTML. The data describes the page's content using a shared vocabulary, most commonly the one defined by schema.org: product prices, review ratings, event dates, organisation details, and more. Search engines parse JSON-LD blocks to power rich results in search listings.
For scrapers, JSON-LD is a high-quality extraction target because the data is published in structured form by the site operator, so no fragile CSS selector or XPath is needed to pull values such as product prices out of rendered text. The schema.org vocabulary gives a largely predictable set of keys across publishers that use the same content type, although publishers differ in which optional properties they fill in.
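As a rough illustration of that predictable key set, the sketch below reads price information from an already-parsed `Product` object. It assumes the common schema.org layout in which `offers` holds one `Offer` object, or a list of them, each with `price` and `priceCurrency`. The helper name `product_prices` and the sample data are hypothetical, and real pages may use other shapes (for example `AggregateOffer` with `lowPrice`) that this sketch ignores.

```python
def product_prices(item):
    """Return (price, currency) pairs from a schema.org Product-like dict.

    Assumes `offers` is a single Offer dict or a list of Offer dicts.
    """
    offers = item.get("offers") or []
    if isinstance(offers, dict):
        # A single offer is often published as a bare object, not a list.
        offers = [offers]

    prices = []
    for offer in offers:
        if isinstance(offer, dict) and "price" in offer:
            # Publishers emit prices as strings or numbers; normalise to str.
            prices.append((str(offer["price"]), offer.get("priceCurrency")))
    return prices


# Hypothetical parsed JSON-LD object, for illustration only.
product = {
    "@context": "https://schema.org",
    "@type": "Product",
    "name": "Example Widget",
    "offers": {"@type": "Offer", "price": "19.99", "priceCurrency": "USD"},
}

prices = product_prices(product)
```

The same function can be pointed at `Product` objects from different publishers without per-site selectors, provided they follow this layout.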
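JSON-LD blocks can be extracted by targeting the script tag with an HTML parser (or a simple regex, which is more brittle on unusual markup) and then parsing the contents as standard JSON. Multiple JSON-LD blocks may appear on a single page for different schema types (e.g., a `Product` block alongside a `BreadcrumbList` block), and a single block may contain either one object or an array of objects, so extraction code should handle both.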
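A minimal sketch of this extraction step, using only Python's standard library (`html.parser` and `json`), might look like the following. The class name `JsonLdExtractor`, the function `extract_jsonld`, and the sample HTML are illustrative assumptions rather than part of any library. The sketch skips blocks that fail to parse, and flattens top-level arrays and `@graph` containers into a single list of objects.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json">."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._chunks = []
        self.raw_blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            type_attr = dict(attrs).get("type") or ""
            if type_attr.strip().lower() == "application/ld+json":
                self._in_jsonld = True
                self._chunks = []

    def handle_data(self, data):
        # Script content can arrive in several chunks; join them later.
        if self._in_jsonld:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.raw_blocks.append("".join(self._chunks))
            self._in_jsonld = False


def _flatten(data):
    """Yield individual objects from a parsed block (object, array, or @graph)."""
    if isinstance(data, list):
        for entry in data:
            yield from _flatten(entry)
    elif isinstance(data, dict):
        graph = data.get("@graph")
        if isinstance(graph, list):
            yield from _flatten(graph)
        else:
            yield data


def extract_jsonld(html):
    """Return a list of dicts, one per JSON-LD object found in the HTML."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()

    items = []
    for raw in parser.raw_blocks:
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            continue  # Skip malformed blocks rather than failing the page.
        items.extend(_flatten(data))
    return items


# Hypothetical page fragment with two JSON-LD blocks and one ordinary script.
SAMPLE_HTML = """
<html><head>
<script type='application/ld+json'>
{"@context": "https://schema.org", "@type": "Product", "name": "Example Widget"}
</script>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "BreadcrumbList", "itemListElement": []}
</script>
<script>var notJsonLd = true;</script>
</head><body></body></html>
"""

items = extract_jsonld(SAMPLE_HTML)
types = [item.get("@type") for item in items]
```

Callers can then filter `items` by `@type` to pick out the schema type they care about. A parser is used here instead of a regex so that attribute quoting and whitespace variations in the script tag do not need special handling.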