Container Requirements

Minimum Size small

Functions

`build-transcribe-args`

fn (input-path: Str, output-dir: Str, opts: TranscribeOpts): Vec

Build whisper CLI arguments after the whisper executable name.

`check-box-result`

fn (result: Map, tool-name: Str): Map

`file-ext`

fn (path: Str): Str

`join-args`

fn (args: Vec): Str

`model-to-min-size`

fn (model: Str): Str

`ns` alias

Alias of ::whisper/

OpenAI Whisper speech-to-text via Hot Box containers.

Runs the whisper CLI inside onerahmet/openai-whisper-asr-webservice (or a custom image) to transcribe or translate audio referenced by hot:// URLs. Outputs are written back to Hot storage.

Quick Start

out ::whisper/transcribe("hot://uploads/recording.mp3")
tap(out.text)
tap(out.url)

// With explicit output path
out ::whisper/transcribe("hot://uploads/recording.mp3", {
    model: "small",
    output: "hot://transcripts/recording.json",
})

`parse-transcribe-output`

fn (stdout: Str?): Map

Parse whisper JSON output (full transcript, segments, detected language).

`resolve-output`

fn (user-output: Str?, default: Str): Str

`run-whisper-box`

fn (shell-script: Str, box-size: Str, image: Str?, timeout: Int): Map

`start-transcribe`

fn (input: Str): Map
fn (input: Str, opts: TranscribeOpts): Map

Start a Whisper transcription in a container without waiting for completion.

Returns the box task info from ::box/start. Use ::task/await on the returned id to wait for the result.

Example

info ::whisper/start-transcribe("hot://uploads/voice.mp3", {model: "small", language: "en"})
result ::task/await(info.id)

Container: small

`start-whisper-box`

fn (shell-script: Str, box-size: Str, image: Str?, timeout: Int): Map

`transcribe`

fn (input: Str): Map
fn (input: Str, opts: TranscribeOpts): Map

Transcribe (or translate) audio to text using OpenAI Whisper in a container.

Returns {text: Str, language: Str?, segments: Vec?, url: Str}. With format: "json" (default), text is the full transcript, segments holds timed segments, and language is the detected or forced language code. url is the hot:// URL of the output file (same format as requested).

Box size defaults from the model (tiny/base → small, small → medium, etc.).

Example

out ::whisper/transcribe("hot://uploads/voice.mp3")
out ::whisper/transcribe("hot://uploads/voice.mp3", {
    model: "small",
    language: "en",
    format: "json",
})

Container: small

Types

`TranscribeOpts`

TranscribeOpts type {
    model: Str?,
    language: Str?,
    format: Str?,
    task: Str?,
    temperature: Str?,
    word-timestamps: Bool?,
    initial-prompt: Str?,
    output: Str?,
    image: Str?,
    size: Str?,
    timeout: Int?
}

Options for Whisper transcription.

Fields

model — Model size: "tiny", "base", "small", "medium", "large" (default: "base")
language — Language code, e.g. "en", "fr" (omit for auto-detect)
format — Output file format: "txt", "json", "srt", "vtt" (default: "json")
task — "transcribe" or "translate" (default: whisper default / transcribe)
temperature — Sampling temperature as string, e.g. "0"
output — Output hot:// URL (default: auto-generated)
image — Custom Docker image with whisper CLI
size — Box size override (default: derived from model)

hot.dev/whisper

Container Requirements

Functions

build-transcribe-args

check-box-result

file-ext

join-args

model-to-min-size

ns alias

parse-transcribe-output

resolve-output

run-whisper-box

start-transcribe

start-whisper-box

transcribe