返回 Skill 列表
extension
分类: 数据与分析需要 API Key

StripFeed

抓取任意URL并转换为纯净、适配AI的Markdown,支持Token计数与缓存。去除网页中的广告、导航、脚本及噪音。

person作者: atlasopenclawhubclawhub

StripFeed

Convert any URL to clean Markdown optimized for LLM consumption. Strips ads, navigation, scripts, and noise. Returns token counts so you know exactly how much context you're using.

When to use this skill

Use StripFeed whenever you need to read a web page, article, documentation, or any URL content. It produces much cleaner output than raw HTML fetching and tells you the token cost.

Authentication

All requests require the STRIPFEED_API_KEY environment variable. Pass it as a Bearer token:

Authorization: Bearer $STRIPFEED_API_KEY

Get a free API key at https://www.stripfeed.dev (200 requests/month, no credit card).

Fetch a single URL

curl -s "https://www.stripfeed.dev/api/v1/fetch?url=URL_HERE" \
  -H "Authorization: Bearer $STRIPFEED_API_KEY"

This returns clean Markdown directly as the response body (Content-Type: text/markdown).

Parameters

| Parameter | Required | Description | | ------------ | -------- | ----------------------------------------------------------------------------------------- | | url | Yes | The URL to fetch (must be http or https) | | format | No | Output format: markdown (default), json, text, html. Pro only except markdown. | | selector | No | CSS selector to extract specific content (e.g. article, .content, #main). Pro only. | | cache | No | Set to false to bypass cache and force a fresh fetch | | ttl | No | Cache TTL in seconds (default: 3600, max: 86400 for Pro) | | max_tokens | No | Truncate output to fit within this token budget | | model | No | AI model ID for cost tracking (e.g. claude-sonnet-4-6, gpt-4o) |

JSON format (recommended for structured responses)

When you need metadata alongside the content, use format=json:

curl -s "https://www.stripfeed.dev/api/v1/fetch?url=URL_HERE&format=json" \
  -H "Authorization: Bearer $STRIPFEED_API_KEY"

JSON response includes:

{
  "markdown": "# Page Title\n\nClean content...",
  "url": "https://example.com",
  "title": "Page Title",
  "tokens": 1250,
  "originalTokens": 15000,
  "savingsPercent": 91.7,
  "cached": false,
  "fetchMs": 430,
  "format": "json",
  "truncated": false,
  "selector": null,
  "model": null
}

Response headers

Every response includes these headers:

  • X-StripFeed-Tokens - Token count of the stripped content
  • X-StripFeed-Original-Tokens - Token count of the raw HTML (before stripping)
  • X-StripFeed-Savings-Percent - Percentage of tokens saved
  • X-StripFeed-Cache - HIT or MISS
  • X-StripFeed-Fetch-Ms - Time to fetch the URL (0 if cached)
  • X-StripFeed-Format - Output format used

Batch fetch (Pro plan)

Fetch up to 10 URLs in a single request:

curl -s -X POST "https://www.stripfeed.dev/api/v1/batch" \
  -H "Authorization: Bearer $STRIPFEED_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"urls": ["https://example.com", "https://example.org"]}'

URLs can also include selectors:

{
  "urls": [
    { "url": "https://example.com", "selector": "article" },
    { "url": "https://example.org" }
  ],
  "model": "claude-sonnet-4-6"
}

Batch response:

{
  "results": [
    { "url": "...", "markdown": "...", "tokens": 1250, "status": 200 },
    { "url": "...", "error": "Failed to fetch", "status": 502 }
  ],
  "total": 2,
  "success": 1,
  "failed": 1
}

Individual URL errors don't break the batch. Check each result's status field.

Error handling

| Status | Meaning | | ------ | ---------------------------------------- | | 401 | Missing or invalid API key | | 403 | Feature requires Pro plan | | 422 | Invalid URL, format, or parameter | | 429 | Rate limit or monthly quota exceeded | | 502 | Target URL unreachable or returned error | | 504 | Target URL timed out (9s limit) |

Tips

  • Default format is markdown which returns raw text. Use format=json when you need token counts and metadata in the response body.
  • Responses are cached for 1 hour by default. Use cache=false for real-time content.
  • The max_tokens parameter is useful to fit content within your context window budget.
  • Use selector to grab only the main content (e.g. selector=article or selector=.post-content) and skip sidebars/footers.
  • Free plan: 200 requests/month, markdown format only. Pro plan ($19/mo): 100K requests, all formats, selectors, batch endpoint.