> ## Documentation Index > Fetch the complete documentation index at: https://developers.scrapeunblocker.com/llms.txt > Use this file to discover all available pages before exploring further. # Parsed data extraction > Get structured JSON instead of raw HTML. Extraction uses Schema.org, __NEXT_DATA__, or AI-generated rules. For most scraping workloads you don't want raw HTML - you want clean JSON with the fields that matter. Pass `parsed_data=true` to `/getPageSource` and ScrapeUnblocker extracts structured data using the best available method for that page. ## Request ```bash cURL theme={null} curl -X POST "https://api.scrapeunblocker.com/getPageSource?url=https://www.amazon.com/dp/B08N5WRWNW&parsed_data=true" \ -H "x-scrapeunblocker-key: YOUR_API_KEY" ``` ```python Python theme={null} import requests r = requests.post( "https://api.scrapeunblocker.com/getPageSource", params={ "url": "https://www.amazon.com/dp/B08N5WRWNW", "parsed_data": True, }, headers={"x-scrapeunblocker-key": "YOUR_API_KEY"}, timeout=120, ) payload = r.json() ``` ```javascript Node.js theme={null} const res = await fetch( "https://api.scrapeunblocker.com/getPageSource?url=https://www.amazon.com/dp/B08N5WRWNW&parsed_data=true", { method: "POST", headers: { "x-scrapeunblocker-key": "YOUR_API_KEY" }, } ); const payload = await res.json(); ``` ## Response shape ```json theme={null} { "data": { "page_type": "product", "source": "schema_org", "data": { "title": "Echo Dot (4th Gen)", "price": "49.99", "currency": "USD", "brand": "Amazon", "availability": "InStock", "rating": 4.7, "review_count": 123456 } } } ``` ### `page_type` Detected category of the page. Common values: * `product` - e-commerce product detail page * `listing` - search results or category page * `article` - news, blog, or editorial content * `job` - job posting * `real_estate` - property listing * `unknown` - extractor could not classify the page ### `source` Which extraction strategy produced the data: | Source | What it means | | ------------ | ------------------------------------------------------------------------------------------------------- | | `schema_org` | The page exposed JSON-LD or microdata using [schema.org](https://schema.org) vocabulary. Most reliable. | | `next_data` | Extracted from a Next.js `__NEXT_DATA__` `