Microsoft

Speech service

How can we improve Speech service?

(thinking…)

Enter your idea and we'll search to see if someone has already suggested it.

If a similar idea already exists, you can support and comment on it.

If it doesn't exist, you can post your idea so others can support it.

Enter your idea and we'll search to see if someone has already suggested it.

  1. need amr to be supported in Speech to Text services

    I saw that currently only wav audio format is supported.
    https://docs.microsoft.com/en-us/azure/cognitive-services/speech/getstarted/getstartedrest?tabs=Powershell

    However, my customer is using Cordova App and the default format in Android Cordova is AMR audio.
    We hope Azure Speech to Text can accept AMR as well.

    This cloud services accept various format of audio including AMR, so I think technically it is feasible for Azure.
    http://www.folio3.com/speech-to-text-services/

    1 vote
    Sign in
    Check!
    (thinking…)
    Reset
    or sign in with
    • facebook
    • google
      Password icon
      Signed in as (Sign out)

      We’ll send you updates on this idea

      0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
    • Samples

      I haven't been able to find samples of the TTS anywhere.

      1 vote
      Sign in
      Check!
      (thinking…)
      Reset
      or sign in with
      • facebook
      • google
        Password icon
        Signed in as (Sign out)

        We’ll send you updates on this idea

        0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
      • Connector for PowerApps

        Please provide a connector which can be used in PowerApps.

        I have tried to use the service from the demo in an azure function, but this fails currently by an unknown error :(

        2 votes
        Sign in
        Check!
        (thinking…)
        Reset
        or sign in with
        • facebook
        • google
          Password icon
          Signed in as (Sign out)

          We’ll send you updates on this idea

          0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
        • improve the recognition of numbers

          The recognition of words like "tenthousend" is "10 1.000" but should be "10000" this is bad if you try to allow an Input of numbers bigger then 1000

          1 vote
          Sign in
          Check!
          (thinking…)
          Reset
          or sign in with
          • facebook
          • google
            Password icon
            Signed in as (Sign out)

            We’ll send you updates on this idea

            0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
          • captialization

            This service is not doing a good job of capitalizing the first word in sentences. I say "I ate lunch period it was good" and I get

            I ate lunch period it was good. This later becomes

            I ate lunch. it was good

            For some reason the recognition of a period in punctuation doesn't produce the capitalization of the next sentence.

            1 vote
            Sign in
            Check!
            (thinking…)
            Reset
            or sign in with
            • facebook
            • google
              Password icon
              Signed in as (Sign out)

              We’ll send you updates on this idea

              0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
            • confidence number value per word or per speech fragment

              I am doing a POC with speech recognition for long speeches.
              https://docs.microsoft.com/de-de/azure/cognitive-services/speech/concepts#recognition-modes

              The recognition mode "conversation" with format "detailed" delivers message responses of type "SpeechPhrase" including confidence value.

              The recognition mode "dictation" with format "detailed" delivers message responses of type "SpeechFragment" and "SpeechPhrase" (including confidence value). But the fragments do not contain any information about confidence value.
              With the C# service library and the recognition mode "dictation" you'll get partial results with a confidence value (enum). But this is not our desired solution, because the confidence value seems to belong to the whole phrase (Confidence: Indicates the level of confidence…

              1 vote
              Sign in
              Check!
              (thinking…)
              Reset
              or sign in with
              • facebook
              • google
                Password icon
                Signed in as (Sign out)

                We’ll send you updates on this idea

                0 comments  ·  Flag idea as inappropriate…  ·  Admin →
              • Don't see your idea?

              Feedback and Knowledge Base