Speech to Text - Whisper

post https://api.monsterapi.ai/v1/generate/whisper

Speech to Text API allows you to transcribe any audio file using OpenAI-Whisper Large-v2 model.

OpenAI Whisper is an open-source automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web.
Large-v2 is a biggest version of whisper model and offers superior transcription quality.

Make sure to use Fetch Results API after process_id is received

This API supports following use-cases:

Want to upload a file < 8MB: Checkout Whisper - Sample Code Recipe
Want to upload a file > 8MB: Checkout File Upload > 8MB Recipe
Want to send file url instead of upload: Checkout Whisper File URL Recipe

Recipes

📁

File Upload > 8MB

Open Recipe

🔊

Whisper - Sample Code

Open Recipe

🔊

Whisper File URL

Open Recipe

Language

Credentials

Bearer

JWT

Click Try It! to start a request and see the response here!