Microsoft

Speech Service

  1. confidence number value per word or per speech fragment

    I am doing a POC with speech recognition for long speeches.
    https://docs.microsoft.com/de-de/azure/cognitive-services/speech/concepts#recognition-modes

    The recognition mode "conversation" with format "detailed" delivers message responses of type "SpeechPhrase" including confidence value.

    The recognition mode "dictation" with format "detailed" delivers message responses of type "SpeechFragment" and "SpeechPhrase" (including confidence value). But the fragments do not contain any information about confidence value.
    With the C# service library and the recognition mode "dictation" you'll get partial results with a confidence value (enum). But this is not our desired solution, because the confidence value seems to belong to the whole phrase (Confidence: Indicates the level of confidence…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  2. 3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Sample Requests  ·  Flag idea as inappropriate…  ·  Admin →
    Completed  ·  Allison Light responded

    Thanks for letting us know about the broken code! We’ve updated our documentation to link to the built-in Windows 10 Speech API which is the suggested way to call Speech API through UWP applications. You can read more about it using the links below.

    Documentation: https://msdn.microsoft.com/en-us/library/windows/apps/windows.media.speechrecognition.aspx.
    Sample: https://github.com/Microsoft/Windows-universal-samples/tree/master/Samples/SpeechRecognitionAndSynthesis

1 2 4 Next →
  • Don't see your idea?

Feedback and Knowledge Base