- How to Diarize Audio - Complete diarization workflow
- Diarization and Speech-to-Text Merge - Combining diarization with transcription
- How to Upload Files - Working with local files
- Confidence Scores - Understanding and using confidence scores
- Speaker Configuration - Configuring speaker counts and overlapping speech
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
URL of the audio file to be processed
"https://example.com/audio.wav"
Webhook URL to receive results when job is completed (optional)
"https://example.com/webhook"
precision-1, precision-2, community-1 "precision-2"
Number of speakers. Only use if the number of speakers is known in advance. Number of speakers is detected automatically if not provided. Setting this value results in better overall diarization performance. In rare cases where we cannot honor this request (e.g. short files and large number of speakers), a warning will be added to the output. Equivalent to sending minSpeakers==maxSpeakers
x >= 12
Minimum number of speakers (must be <= maxSpeakers if both are set)
x >= 11
Maximum number of speakers (must be >= minSpeakers if both are set)
x >= 14
Includes turn-level confidence values in the output.
true
Includes exclusive diarization values in the output in exclusiveDiarization key (equivalent to diarization but without overlapping speech).
true
Include confidence values in the output. Output is considerably larger when this option is enabled. Output includes a list of confidence scores with a resolution.
true