Microsoft

Speaker Recognition

Welcome to the Speaker Recognition Forum

Categories

API – Any ideas or feedback pertaining to features or enhancements to Speaker Recognition API.

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Request – Let us know if you would like to see a Code sample or SDK provided.


  • Hot ideas
  • Top ideas
  • New ideas
  • My feedback
  1. Please Provide Sample Android app to use Speaker Recognition API

    It would be helpful if you provide Sample to use Speaker Recognition API like how you provided for Face Verification /detection samples for android?

    11 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    8 comments  ·  Samples & SDK Requests  ·  Flag idea as inappropriate…  ·  Admin →
  2. API in C# for Web. Near-Real time audio samples. Examples?

    I would like to build a service to authenticate thru the web. Hopefully in C#, without trying to translate from Python. And are there any more examples. Anyone?

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Samples & SDK Requests  ·  Flag idea as inappropriate…  ·  Admin →
  3. Chinese zh-CH Language support for Speaker recognition

    Chinese zh-CH Language support for Speaker recognition

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
    Completed  ·  Luke Bayler responded

    Marking as completed because we now support speaker identification in Chinese.

  4. Audio captured using UWP API does not work

    The Speaker Recognition services does not accept legit WAV captured using UWP API.

    See https://mtaulty.com/2016/02/10/project-oxfordspeaker-verification-from-a-windows-10uwp-app/. The author had to fix the WAV stream for it to be accepted by the service.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Completed  ·  1 comment  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  5. Speaker Recognition with shorter limited recording time

    Is it possible that we can reduce the recording time when we use speaker recognition to confirm the identity.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Speaker Verification  ·  Flag idea as inappropriate…  ·  Admin →
    Completed  ·  Raymond responded

    We have release a new feature that allows you to waive the audio limit. Just add “ShortAudio” parameter to instruct the service to waive the recommended minimum audio limit needed for enrollment. Set value to “true” to force enrollment using any audio length.

    More details can be found here,
    - https://dev.projectoxford.ai/docs/services/563309b6778daf02acc0a508/operations/5645c3271984551c84ec6797
    - https://dev.projectoxford.ai/docs/services/563309b6778daf02acc0a508/operations/5645c523778daf217c292592

  6. Recognize multiple speakers in audio file and when they speak

    For example 2 minutes audio file. First 30 seconds Speaker A, then Speaker B from 30 to 1.30 and then again speaker A from 1.30 to 2 mins.

    35 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    8 comments  ·  Speaker Identification  ·  Flag idea as inappropriate…  ·  Admin →
  7. Speaker Recognition with shorter phrase?

    I would love to create a pug-in for my home automation. which already uses Kinects, that can utilize the speaker Identification from Oxford. Main issue is most statements are short - ie: Computer, turn on family room light. So I never generate a 20 Second clip - Recognition with at least a 5 second clip or so would be great, even if recognition is only say 80% accurate for this case....

    7 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Speaker Identification  ·  Flag idea as inappropriate…  ·  Admin →
    Completed  ·  Raymond responded

    We have release a new feature that allows you to waive the audio limit. Just add “ShortAudio” parameter to instruct the service to waive the recommended minimum audio limit needed for enrollment. Set value to “true” to force enrollment using any audio length.

    More details can be found here,
    - https://dev.projectoxford.ai/docs/services/563309b6778daf02acc0a508/operations/5645c3271984551c84ec6797
    - https://dev.projectoxford.ai/docs/services/563309b6778daf02acc0a508/operations/5645c523778daf217c292592

  • Don't see your idea?

Feedback and Knowledge Base