OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s /v1/audio/transcriptions endpoint.

Quick start

bash

{baseDir}/scripts/transcribe.sh /path/to/audio.m4a

Defaults:

Model: whisper-1
Output: <input>.txt

Useful flags

bash

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json

API key

Set OPENAI_API_KEY, or configure it in ~/.openclaw/openclaw.json:

json5

{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}

Actions

install --global skills.sh

npx skills add openclaw/openclaw/skills/openai-whisper-api

Usage Guide

1. Run Install Command
Copy the installation command above and run it in your terminal to install globally.
2. Configure Environment
Add the required environment variables to your MCP client according to the skill description.
3. Use in Client
Configure and enable this skill in any MCP-compatible app (e.g. Claude or Cursor).