Recognize multiple speakers in an audio file and when they speak
For example, take a 2-minute audio file: Speaker A talks for the first 30 seconds, then Speaker B from 0:30 to 1:30, and then Speaker A again from 1:30 to 2:00.
You can now do this type of recognition using this public sample application: https://github.com/Microsoft/Cognitive-SpeakerRecognition-Windows/tree/master/Streaming.
In the future, we plan to fully support this scenario from the service side to avoid sending too many requests from the client side, and to include speech recognition results as well. This means that you’ll get a response stating who the speaker is, and what is being said.
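To make the client-side approach concrete, here is a minimal Python sketch of the general idea: cut the audio into short windows, ask for the speaker of each window, and merge consecutive windows that have the same speaker. It is not the sample application's code. `diarize_by_windows`, `identify`, and `fake_identify` are hypothetical names used only for illustration; in a real app `identify` would wrap one request to the identification service per window, which is exactly the "too many requests" cost mentioned above.

```python
from typing import Callable, List, Tuple

# (speaker label, start in seconds, end in seconds)
Segment = Tuple[str, float, float]


def diarize_by_windows(
    total_sec: float,
    window_sec: float,
    identify: Callable[[float, float], str],
) -> List[Segment]:
    """Label fixed-size windows and merge neighbours with the same speaker.

    `identify(start, end)` is a hypothetical callback that returns a speaker
    label for the audio between `start` and `end`. In practice it would send
    that slice of audio to a speaker identification service.
    """
    if window_sec <= 0:
        raise ValueError("window_sec must be positive")

    segments: List[Segment] = []
    start = 0.0
    while start < total_sec:
        end = min(start + window_sec, total_sec)
        speaker = identify(start, end)
        if segments and segments[-1][0] == speaker:
            # Same speaker as the previous window: extend that segment.
            segments[-1] = (speaker, segments[-1][1], end)
        else:
            segments.append((speaker, start, end))
        start = end
    return segments


def fake_identify(start: float, end: float) -> str:
    """Stand-in for the service call, hard-coded to the 2-minute example."""
    midpoint = (start + end) / 2
    return "B" if 30 <= midpoint < 90 else "A"


segments = diarize_by_windows(120.0, 5.0, fake_identify)
for speaker, seg_start, seg_end in segments:
    print(f"Speaker {speaker}: {seg_start:.0f}s to {seg_end:.0f}s")
```

The window length is a trade-off: shorter windows locate speaker changes more precisely but mean more requests and less audio per decision, so the 5-second value here is only a placeholder.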
I am trying to create an app that writes down who says what while people are talking in a meeting. Is that possible? I have not been able to do it. Any help?
Please increase the number of people the speaker recognition API can identify. Provide some information on whether the maximum number of people identified can be changed according to the pricing. Also, your sales team never seems to pick up the phone when I call. I always end up on voicemail.
+1 - I want to be able to give you guys a conversation clip rather than breaking the audio apart myself.
How can I use Speaker Recognition in a Xamarin PCL? Please help.
David Rodríguez commented
It would be great to have the ability to identify several people in the same audio.
Lorenzo Galloni commented
Is this being developed now or in the near future? Thank you for the amazing work!
Any chance to add this soon?
Thank you very much for the idea. It almost sounds like a tracking method. I like the sound of it!