News Websites Dataset API

Articles, headlines, and media content from major news publications for media monitoring and content analysis.

Start in 5 minutes · Structured JSON output · Pay as you go

Available Data

Here's what you can extract from News Websites using AlterLab:

Article headlines & text
Publication dates
Author information
Categories & tags
Related articles

Business Use Cases

How businesses use News Websites data to make better decisions:

  • Media monitoring & PR tracking
  • Industry news aggregation
  • Sentiment analysis on topics
  • Competitive PR intelligence

How It Works

1. Identify target news sources and topics

Provide section page URLs or search result URLs from your target news publications. Most major news sites (Reuters, AP News, BBC, WSJ) have consistent section URLs that update with new articles — these are ideal monitoring targets.

2. AlterLab retrieves article content with rendering support

News sites range from simple server-rendered HTML (AP News) to complex JavaScript applications (Bloomberg, WSJ). AlterLab handles the full spectrum — rendering as needed for JavaScript-heavy sites and retrieving static content directly for simpler publications.
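One way to mirror this split on the client side is to decide per URL whether to request JavaScript rendering. The sketch below is illustrative only: the `advanced.render_js` field comes from the Quick Start request on this page, while the `JS_HEAVY_DOMAINS` set and the `build_payload` helper are hypothetical names introduced for the example.

```python
from urllib.parse import urlparse

# Hypothetical list: domains the caller has found to need JavaScript rendering.
JS_HEAVY_DOMAINS = {"bloomberg.com", "wsj.com"}


def build_payload(url: str) -> dict:
    """Build a scrape request body, enabling render_js only for known JS-heavy domains."""
    host = (urlparse(url).hostname or "").lower()
    # Match the domain itself or any of its subdomains (e.g. www.wsj.com).
    needs_js = any(host == d or host.endswith("." + d) for d in JS_HEAVY_DOMAINS)
    return {"url": url, "advanced": {"render_js": needs_js}}
```

Since static pages are priced lower than JS-rendered pages, keeping the list short and only adding domains that return incomplete content without rendering is a reasonable default.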

3. Get article content structured for analysis pipelines

Receive article headlines, full body text, publication timestamps, author bylines, topic tags, and related article links — all structured for your media monitoring dashboard, sentiment analysis pipeline, or content aggregation system.

Key Insight

News websites vary dramatically in access requirements. Wire services (AP News, Reuters) are freely accessible with no paywall. Business publications (Bloomberg, FT, WSJ) put most articles behind a paywall, typically after a limited number of free articles. Tech news sites (TechCrunch, Ars Technica) are largely free. AlterLab extracts all publicly visible content: the same content a visitor without a subscription would see.

Data Freshness

Breaking news requires near-real-time monitoring — major news sites publish dozens of articles per day. For media monitoring, polling target publication section pages every 15-30 minutes ensures articles are captured close to their publication time. For trend analysis, hourly collection is sufficient.
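The polling cadence described above can be sketched as a small scheduling helper that reports which section pages are due for another fetch. This is a minimal illustration, not part of AlterLab's API: `due_sections` and the `last_polled` mapping are hypothetical, and the 15-minute default is simply the lower bound mentioned in the text.

```python
from datetime import datetime, timedelta
from typing import Dict, List, Optional


def due_sections(
    last_polled: Dict[str, Optional[datetime]],
    now: datetime,
    interval: timedelta = timedelta(minutes=15),
) -> List[str]:
    """Return the section URLs whose last poll is at least `interval` old.

    A value of None means the section has never been polled, so it is always due.
    """
    return [
        url
        for url, polled_at in last_polled.items()
        if polled_at is None or now - polled_at >= interval
    ]
```

For trend analysis, the same helper can be called with `interval=timedelta(hours=1)`.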

Sample Output

Here's exactly what you receive when you collect News Websites data with AlterLab — clean, structured JSON ready for your application or spreadsheet.

JSON Response
{
  "url": "https://techcrunch.com/2026/06/25/ai-agents-reshape-enterprise-workflows",
  "title": "AI Agents Are Reshaping Enterprise Workflows",
  "author": "Jane Smith",
  "published_at": "2026-06-25T08:00:00Z",
  "updated_at": "2026-06-25T10:30:00Z",
  "section": "Artificial Intelligence",
  "tags": [
    "AI",
    "enterprise",
    "automation",
    "agents"
  ],
  "summary": "A new wave of AI agent platforms is transforming how enterprise teams automate complex multi-step workflows...",
  "word_count": 1842,
  "reading_time_minutes": 7,
  "images": [
    {
      "url": "https://techcrunch.com/wp-content/uploads/2026/06/ai-agents.jpg",
      "caption": "AI agents managing enterprise tasks"
    }
  ],
  "related_articles": [
    {
      "title": "How enterprises are adopting AI automation",
      "url": "https://techcrunch.com/..."
    }
  ],
  "scraped_at": "2026-06-25T10:42:11Z"
}
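As a rough sketch of how a response like this might be loaded into typed Python objects, the example below parses a subset of the fields shown in the sample. The `Article` class and the helper names are hypothetical, and it assumes timestamps arrive as ISO 8601 strings with a trailing `Z`, as in the sample.

```python
import json
from dataclasses import dataclass
from datetime import datetime
from typing import List


@dataclass
class Article:
    url: str
    title: str
    published_at: datetime
    tags: List[str]
    word_count: int


def parse_timestamp(value: str) -> datetime:
    # Replace the trailing "Z" so fromisoformat also works on Python < 3.11.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def parse_article(raw: str) -> Article:
    """Parse a JSON response string into an Article, tolerating missing optional fields."""
    data = json.loads(raw)
    return Article(
        url=data["url"],
        title=data["title"],
        published_at=parse_timestamp(data["published_at"]),
        tags=list(data.get("tags", [])),
        word_count=int(data.get("word_count", 0)),
    )
```

Fields such as `tags` and `word_count` are treated as optional here because not every publication exposes them; that is an assumption of the sketch, not a documented guarantee.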

Quick Start

HTTP
POST https://alterlab.io/api/v1/scrape
X-API-Key: YOUR_API_KEY
Content-Type: application/json

{
  "url": "https://various/...",
  "advanced": { "render_js": true }
}
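The same request could be issued from Python using only the standard library. This is a minimal sketch: the endpoint, the `X-API-Key` header, and the `advanced.render_js` field are copied from the HTTP example above, while `build_request` and `scrape` are hypothetical helper names, and the assumption that the response body is JSON is based on the sample output on this page.

```python
import json
import urllib.request

# Endpoint as shown in the Quick Start HTTP example above.
API_URL = "https://alterlab.io/api/v1/scrape"


def build_request(target_url: str, api_key: str, render_js: bool = True) -> urllib.request.Request:
    """Build (but do not send) the POST request for one target URL."""
    body = json.dumps({"url": target_url, "advanced": {"render_js": render_js}})
    return urllib.request.Request(
        API_URL,
        data=body.encode("utf-8"),
        headers={"X-API-Key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def scrape(target_url: str, api_key: str, timeout: float = 30.0) -> dict:
    """Send the request and decode the response, assuming a JSON body."""
    request = build_request(target_url, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Separating `build_request` from `scrape` keeps the request construction testable without network access.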

Need an API key? No credit card required. 5,000 requests included.

Pricing at Scale

AlterLab uses pay-as-you-go pricing — no monthly minimums, no contracts. Balance never expires.

Volume               Static pages   JS-rendered pages
1,000 requests       $0.20          $2.00
10,000 requests      $2.00          $20.00
100,000 requests     $20.00         $200.00
1,000,000 requests   $200.00        $2,000.00

Static pages: $0.0002/request · JavaScript-rendered pages (like News Websites): $0.002/request · First 5,000 requests free.
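To budget a monitoring job, the per-request rates above can be combined with a polling schedule. The sketch below is a back-of-the-envelope estimate only: the two rates are taken from this page, the helper names are hypothetical, and the free allowance is deliberately left out of the calculation.

```python
from decimal import Decimal

# Per-request rates as listed on this page (USD).
STATIC_RATE = Decimal("0.0002")
JS_RATE = Decimal("0.002")


def estimate_cost(static_requests: int, js_requests: int) -> Decimal:
    """Return the estimated cost in USD, ignoring any free allowance."""
    return static_requests * STATIC_RATE + js_requests * JS_RATE


def monthly_requests(sections: int, interval_minutes: int, days: int = 30) -> int:
    """Count requests for polling `sections` pages every `interval_minutes` for `days` days."""
    polls_per_day = (24 * 60) // interval_minutes
    return sections * polls_per_day * days
```

For example, polling 20 section pages every 15 minutes for 30 days comes to 57,600 requests, which at the JS-rendered rate is $115.20 and at the static rate is $11.52.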

Frequently Asked Questions

Which news sites can I extract data from?

AlterLab supports data extraction from any publicly accessible news website. Our full browser rendering handles paywalled preview content, dynamic loading, and modern JavaScript-heavy news sites.

Can I set up automated news monitoring?

Yes. Use AlterLab's API to regularly collect articles from specific news sources or search results, building an automated media monitoring pipeline for your brand or industry.
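A monitoring pipeline that polls the same section pages repeatedly will see most article links more than once, so it usually needs a deduplication step before fetching full articles. The following is a minimal sketch of that step; `select_new` and the in-memory `seen` set are hypothetical, and a real pipeline would likely persist the set in a database.

```python
from typing import Iterable, List, Set


def select_new(article_urls: Iterable[str], seen: Set[str]) -> List[str]:
    """Return URLs not seen before, recording them in `seen` as a side effect.

    URLs are compared after dropping any #fragment and trailing slash,
    so trivially different links to the same article are treated as one.
    """
    fresh = []
    for url in article_urls:
        key = url.split("#", 1)[0].rstrip("/")
        if key not in seen:
            seen.add(key)
            fresh.append(url)
    return fresh
```

Each polling cycle would pass the links extracted from a section page through `select_new` and fetch only the URLs it returns.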

How is this different from news APIs?

Unlike limited news APIs, AlterLab gives you access to the full article content, images, and metadata from any news source without per-source API restrictions or rate limits.

Responsible Use

AlterLab is designed for extracting publicly available data. Always review the terms of service for any website you access, respect robots.txt directives, and ensure your use case complies with applicable laws in your jurisdiction.

Your first scrape.
Sixty seconds.

$1 free credit (up to 5,000 scrapes). No credit card. Just a POST request.

terminal
curl -X POST https://api.alterlab.io/v1/scrape \
-H "X-API-Key: YOUR_KEY" \
-H "Content-Type: application/json" \
-d '{"url": "https://example.com", "formats": ["markdown"]}'

