rafaelgalle/whisper-diarization-advanced

Ultra-fast, customizable speech-to-text and speaker diarization for noisy, multi-speaker audio. Includes advanced noise reduction, stereo channel support, and flexible audio preprocessing—ideal for call centers, meetings, and podcasts.

Input
Configure the inputs for the AI model.

Names/acronyms, separated by punctuation

Direct URL

Language code like 'en', 'pt'

Audio file

Translate to English

Audio preprocessing level (0 to 4): 0=None, 1=Sanitize, 2=+Filter, 3=+ReduceNoise, 4=+Normalize
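
The "+" prefixes in the level legend suggest that each level keeps the steps of the level below it and adds one more. A minimal sketch of that reading, assuming the levels are cumulative; the step names and the `steps_for_level` helper are hypothetical labels taken from the legend, not the model's actual code:

```python
# Hypothetical step names copied from the level legend; the real pipeline
# may name, order, or implement them differently.
PREPROCESS_STEPS = ["sanitize", "filter", "reduce_noise", "normalize"]


def steps_for_level(level: int) -> list[str]:
    """Return the preprocessing steps enabled at a given level (0 to 4)."""
    max_level = len(PREPROCESS_STEPS)
    if not 0 <= level <= max_level:
        raise ValueError(f"level must be between 0 and {max_level}, got {level}")
    # Assumption: each level adds one step on top of the previous level,
    # so level N enables the first N steps.
    return PREPROCESS_STEPS[:level]


if __name__ == "__main__":
    for level in range(len(PREPROCESS_STEPS) + 1):
        print(level, steps_for_level(level))
```

Under this reading, level 0 leaves the audio untouched and level 4 runs all four steps in order.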

Base64 audio

Leave empty to autodetect
