Skip to content

ML.PrepareAudioTokens

Name Mandatory Description Default Type
⬅️ Input The input of the shard, if any String
Output ➡️ The resulting output of the shard [Int]Tensor
Tokenizer No The tokenizer to use. none Var(Tokenizer)
Language No The language code (e.g. 'en', 'fr', etc). none String
Task No The task type ('transcribe' or 'translate'). none String
Timestamps No Whether to include timestamp tokens. true Bool
AsTensor No Output as tensor instead of sequence. false Bool
GPU No If true, the output tensor will be on GPU. false Bool

Prepares the initial tokens for Whisper audio processing with language and task settings.