返回 Skill 列表
extension
分类: 开发与工程需要 API Key

Deepgram Transcribe

通过Deepgram Nova-3 API转录音频(WER 5.26%,比Whisper快40倍,内置说话人分离)。用于用户请求转录音频、播客等。

person作者: jessewunderlichhubclawhub

Deepgram Nova-3 Transcription

Transcribe audio files using Deepgram's Nova-3 model — more accurate and faster than OpenAI Whisper.

Why Deepgram over Whisper

  • 5.26% word error rate (vs ~8-10% for Whisper)
  • 40x faster for batch processing
  • Built-in speaker diarization (who said what)
  • Smart formatting (numbers, dates, punctuation)
  • $200 free credits on signup at deepgram.com

Setup

Store your API key:

echo "YOUR_DEEPGRAM_API_KEY" > ~/.openclaw/secrets/deepgram-api-key.txt

Or set the environment variable:

export DEEPGRAM_API_KEY="your-key-here"

Usage

bash scripts/transcribe.sh /path/to/audio.mp3
bash scripts/transcribe.sh recording.mp3 --out transcript.txt
bash scripts/transcribe.sh recording.mp3 --json --out full.json
bash scripts/transcribe.sh recording.mp3 --language es
bash scripts/transcribe.sh recording.mp3 --model nova-2

Models

| Model | WER | Cost/min | Best for | |-------|-----|----------|----------| | nova-3 (default) | 5.26% | $0.0077 | Best accuracy | | nova-2 | ~8% | $0.0043 | Budget-friendly | | whisper-large | ~8-10% | $0.0048 | Whisper parity |

Supported Formats

mp3, wav, m4a, ogg, flac, webm, aiff

Fallback

If DEEPGRAM_API_KEY is not set, use the OpenAI Whisper skill instead (if installed).