Speaker diarization for more than 2 speakers
Speaker diarization for more than 2 speakers.
I dont feel this should be marked as resolved. Would expect support for at least 10 speakers. Additionally its currently really poor and switches between speaker 1 and 2 almost randomly. Please make this more intelligent. Its a deal breaker for us and I'm sure many others. Especially considering the google alternative can handle unlimited speakers and is far more accurate at identifying them.
https://cloud.google.com/speech-to-text/docs/multiple-voices
And no... expecting a sample to train it for each voice is not an option. We literally just need it to assign a number to the speaker and not identify who the speaker is.
