Speaker Recognition
Welcome to the Speaker Recognition Forum
Categories
API – Any ideas or feedback pertaining to features or enhancements to Speaker Recognition API.
Documentation – Any ideas or suggestions for the API Reference or Documentation.
Language Support – Submit a request to have a particular language supported.
Samples & SDK Request – Let us know if you would like to see a Code sample or SDK provided.
Attention!
We have moved our Customer Feedback & Ideas for Azure Cognitive Services portal to the Azure Feedback Forum.
-
Please Provide Sample Android app to use Speaker Recognition API
It would be helpful if you provide Sample to use Speaker Recognition API like how you provided for Face Verification /detection samples for android?
11 votesHello,
We have a public Android SDK located here: https://github.com/Microsoft/Cognitive-SpeakerRecognition-Android.
Thanks,
Luke -
API in C# for Web. Near-Real time audio samples. Examples?
I would like to build a service to authenticate thru the web. Hopefully in C#, without trying to translate from Python. And are there any more examples. Anyone?
2 votesHello,
We have a sample that does this here: https://github.com/Microsoft/Cognitive-SpeakerRecognition-Windows/tree/master/Verification.
Thanks,
Luke -
Chinese zh-CH Language support for Speaker recognition
Chinese zh-CH Language support for Speaker recognition
2 votesMarking as completed because we now support speaker identification in Chinese.
-
Audio captured using UWP API does not work
The Speaker Recognition services does not accept legit WAV captured using UWP API.
See https://mtaulty.com/2016/02/10/project-oxfordspeaker-verification-from-a-windows-10uwp-app/. The author had to fix the WAV stream for it to be accepted by the service.
1 vote -
Speaker Recognition with shorter limited recording time
Is it possible that we can reduce the recording time when we use speaker recognition to confirm the identity.
2 votesWe have release a new feature that allows you to waive the audio limit. Just add “ShortAudio” parameter to instruct the service to waive the recommended minimum audio limit needed for enrollment. Set value to “true” to force enrollment using any audio length.
More details can be found here,
- https://dev.projectoxford.ai/docs/services/563309b6778daf02acc0a508/operations/5645c3271984551c84ec6797
- https://dev.projectoxford.ai/docs/services/563309b6778daf02acc0a508/operations/5645c523778daf217c292592 -
Recognize multiple speakers in audio file and when they speak
For example 2 minutes audio file. First 30 seconds Speaker A, then Speaker B from 30 to 1.30 and then again speaker A from 1.30 to 2 mins.
36 votesHello,
You can now do this type of recognition using this public sample application: https://github.com/Microsoft/Cognitive-SpeakerRecognition-Windows/tree/master/Streaming.
In the future, we plan to fully support this scenario from the service side to avoid sending too many requests from the client side, and to include speech recognition results as well. This means that you’ll get a response stating who the speaker is, and what is being said.
Luke
-
Speaker Recognition with shorter phrase?
I would love to create a pug-in for my home automation. which already uses Kinects, that can utilize the speaker Identification from Oxford. Main issue is most statements are short - ie: Computer, turn on family room light. So I never generate a 20 Second clip - Recognition with at least a 5 second clip or so would be great, even if recognition is only say 80% accurate for this case....
7 votesWe have release a new feature that allows you to waive the audio limit. Just add “ShortAudio” parameter to instruct the service to waive the recommended minimum audio limit needed for enrollment. Set value to “true” to force enrollment using any audio length.
More details can be found here,
- https://dev.projectoxford.ai/docs/services/563309b6778daf02acc0a508/operations/5645c3271984551c84ec6797
- https://dev.projectoxford.ai/docs/services/563309b6778daf02acc0a508/operations/5645c523778daf217c292592
- Don't see your idea?