Add Configurable Number of Speakers Support Description
Description
Enhance the speaker diarization workflow and playback experience by introducing configurable speaker count
Add support for an optional num_speakers parameter across the transcription and diarization workflow, allowing users to either:
- use automatic speaker detection
- manually specify speaker count for deterministic diarization output
Tasks
-
Extend
TranscribeOptionswith:numSpeakers?: number | null
-
Pass
num_speakersthrough:createTranscriptionJobcreateTranscriptionJobFromFilediarizeAndTranscribe
-
Append value to
FormDataonly when valid -
Add persistent React state +
numSpeakersRef -
Support:
- chunked uploads
- full file uploads
- benchmark analysis workflows
-
Add dynamic numeric input beside diarization toggle
-
Keep empty state as backend auto-detection mode
-
Restrict values to max
10
Acceptance Criteria
- Empty input uses auto-detection
- Valid speaker count reaches backend correctly
- Existing transcription flows remain unaffected
- State persists correctly during audio workflows
Edited by ashritha kunjeti