返回 Skill 列表
extension
分类: 内容与媒体无需 API Key

claim-extraction

从AI研究内容中提取结构化的声明、预测、提示和观点。在处理推文、博客文章、Substacks或其他来自AI研究人员的内容时使用,以识别关于AI能力、局限性和进展的实质性断言。

person作者: jakexiaohubgithub

Claim Extraction Skill

Extract all substantive claims from AI research content. A claim is any assertion that:

  • States something as true about AI capabilities, limitations, or progress
  • Predicts future developments
  • Hints at unreleased work
  • Expresses a positioned opinion on the field's direction
  • Critiques others' claims or work

Extraction Schema

For each claim, extract:

1. claimText

The claim in clear, standalone form. Paraphrase if needed for clarity.

2. claimType

  • fact: Assertion about current state ("GPT-4 can do X")
  • prediction: Forward-looking ("By 2026, we'll have...")
  • hint: Implies unreleased work ("We've been seeing interesting results with...")
  • opinion: Positioned take ("I think scaling is/isn't sufficient")
  • critique: Challenges others ("Marcus is wrong because...")
  • question: Genuine uncertainty expressed ("I'm not sure if...")

3. topic

Primary topic category:

  • scaling: Scaling laws, compute, training efficiency
  • reasoning: LLM reasoning, chain-of-thought, planning
  • agents: AI agents, tool use, autonomy
  • safety: AI safety, alignment, control
  • interpretability: Mechanistic interpretability
  • multimodal: Vision, audio, video models
  • rlhf: RLHF, preference learning, Constitutional AI
  • benchmarks: Evals, benchmarks, capability measurement
  • infrastructure: Training infra, chips, hardware
  • policy: AI policy, regulation, governance
  • general: General AI commentary

4. stance

  • bullish: Optimistic about AI progress/capabilities
  • bearish: Skeptical/pessimistic about AI progress
  • neutral: Balanced or factual without clear stance

5. bullishness

Float from 0.0 (maximally bearish) to 1.0 (maximally bullish)

6. confidence

How confident does the author seem? (0.0-1.0)

  • Hedging language: "might", "could", "I think", "possibly" → lower
  • Certainty language: "will", "definitely", "it's clear that" → higher

7. timeframe (for predictions)

  • near-term: < 1 year
  • medium-term: 1-3 years
  • long-term: 3-10 years
  • unspecified: No clear timeframe
  • null: Not a prediction

8. evidenceProvided

  • strong: Cites data, papers, or detailed reasoning
  • moderate: Some reasoning but not rigorous
  • weak: Assertion without support
  • appeal-to-authority: "Trust me, I work on this"

9. quoteworthiness

Is this claim notable enough to quote in a digest? (0.0-1.0)

Output Format

Return JSON:

{
  "claims": [
    {
      "claimText": "The claim in clear form",
      "claimType": "prediction",
      "topic": "reasoning",
      "stance": "bullish",
      "bullishness": 0.8,
      "confidence": 0.7,
      "timeframe": "medium-term",
      "evidenceProvided": "moderate",
      "quoteworthiness": 0.6,
      "relatedTo": ["o1", "chain-of-thought"],
      "originalQuote": "Brief relevant quote if notable"
    }
  ]
}

Guidelines

  • Extract MULTIPLE claims from a single piece of content if present
  • Don't over-extract - only substantive, meaningful claims
  • A tweet saying "Interesting paper" is NOT a claim
  • Look for IMPLICIT claims ("We've made a lot of progress" implies capability gains)
  • Pay attention to WHO is speaking - lab researchers hinting at their own work is high signal
  • Critics often make claims by contradiction ("X is wrong, therefore Y")

Author Context Matters

Consider the author's affiliation when assessing:

  • Lab researchers (Anthropic, OpenAI, DeepMind): May hint at unreleased work
  • Critics (Marcus, Chollet, Mitchell): Often make claims through critique
  • Independent (Simon Willison, Jim Fan): Provide practitioner perspectives