openai-whisper-api

Author: openclaw
Category: Development
name: openai-whisper-api
description: Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
homepage: https://platform.openai.com/docs/guides/speech-to-text

OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s /v1/audio/transcriptions endpoint.

Quick start

bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a

Defaults:

  • Model: whisper-1
  • Output: <input>.txt

Useful flags

bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
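
How these flags might translate into API form fields can be sketched as follows. This is an assumption-laden illustration, not the bundled script's code: `transcribe_with_flags` is a hypothetical name, and the mapping of `--language`, `--prompt`, and `--json` onto the API's `language`, `prompt`, and `response_format=json` fields is a guess based on the flag names. When `--out` is omitted, the sketch writes to stdout instead of deriving a default path.

```bash
# Sketch: map the documented flags onto form fields of the transcription request.
# transcribe_with_flags is a hypothetical name, not the bundled script's code.
transcribe_with_flags() {
  local audio="$1"; shift
  local model="whisper-1" format="text" language="" prompt="" out="-"

  while [ "$#" -gt 0 ]; do
    case "$1" in
      --json) format="json"; shift ;;
      --model|--language|--prompt|--out)
        # Each of these flags needs a value after it.
        [ "$#" -ge 2 ] || { echo "missing value for $1" >&2; return 2; }
        case "$1" in
          --model)    model="$2" ;;
          --language) language="$2" ;;
          --prompt)   prompt="$2" ;;
          --out)      out="$2" ;;
        esac
        shift 2 ;;
      *) echo "unknown flag: $1" >&2; return 2 ;;
    esac
  done

  # Rebuild the argument list as curl form fields; optional ones only when given.
  set -- -F "file=@${audio}" -F "model=${model}" -F "response_format=${format}"
  if [ -n "$language" ]; then set -- "$@" -F "language=${language}"; fi
  # --form-string sends the prompt literally, with no special handling of @ or <.
  if [ -n "$prompt" ]; then set -- "$@" --form-string "prompt=${prompt}"; fi

  # out="-" makes curl write to stdout when --out is omitted.
  curl -sS --fail https://api.openai.com/v1/audio/transcriptions \
    -H "Authorization: Bearer ${OPENAI_API_KEY:?OPENAI_API_KEY is not set}" \
    "$@" -o "$out"
}

# Example (needs network access and a valid key):
# transcribe_with_flags /path/to/audio.m4a --language en --json --out /tmp/transcript.json
```

Sending the optional fields only when a flag is present leaves language detection and prompting at the API's defaults otherwise.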

API key

Set OPENAI_API_KEY, or configure it in ~/.openclaw/openclaw.json:

json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}
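
For the environment-variable route, a wrapper can check for the key before calling the script. The sketch below is illustrative only: `require_api_key` is a hypothetical helper, and it does not read the config file.

```bash
# Sketch: fail early when the key is missing from the environment.
# require_api_key is a hypothetical helper name.
require_api_key() {
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set; export it or add apiKey to ~/.openclaw/openclaw.json" >&2
    return 1
  fi
}

# Typical shell setup (replace the placeholder with a real key):
# export OPENAI_API_KEY="OPENAI_KEY_HERE"
# require_api_key && {baseDir}/scripts/transcribe.sh /path/to/audio.m4a
```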

Actions

Install globally via skills.sh:
npx skills add openclaw/openclaw/skills/openai-whisper-api

Usage Guide

  1. Run Install Command

    Copy the installation command above and run it in your terminal to install globally.

  2. Configure Environment

    Add the required environment variable (OPENAI_API_KEY) to your MCP client, as described in the API key section above.

  3. Use in Client

    Configure and enable this skill in any MCP-compatible app (e.g. Claude or Cursor).