返回 Skill 列表
extension
分类: 开发与工程需要 API Key

Transcribe audio via Groq API (~10x cheaper than OpenAI API)

通过 Groq 自动语音识别 (ASR) 模型 (Whisper) 转录音频。

person作者: maxceemhubclawhub

Groq Whisper API (curl)

Transcribe an audio file via Groq’s OpenAI-compatible /openai/v1/audio/transcriptions endpoint.

Quick start

{baseDir}/scripts/transcribe.sh /path/to/audio.m4a

Defaults:

  • Model: whisper-large-v3-turbo
  • Output: <input>.txt

Useful flags

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-large-v3 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json

API key

Set GROQ_API_KEY, or configure it in ~/.openclaw/openclaw.json:

{
  skills: {
    "groq-whisper-api": {
      apiKey: "GROQ_KEY_HERE",
    },
  },
}