Bing Speech

Welcome to the Bing Speech API Forum

Categories

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Requests – Let us know if you would like to see a tutorial or sample provided.

Speech to Text – API & SDK – Ideas and feature requests to Speech Recognition and Speech to Text (STT).

Text to Speech – Ideas and feature requests for Text to Speech (TTS) – API only

How can we improve Bing Speech API?

(thinking…)

Enter your idea and we'll search to see if someone has already suggested it.

If a similar idea already exists, you can support and comment on it.

If it doesn't exist, you can post your idea so others can support it.

Enter your idea and we'll search to see if someone has already suggested it.

  1. Speech to Text API - Korean language

    https://azure.microsoft.com/ko-kr/services/cognitive-services/speech/

    We know that Text to Speech is supported in Korean.
    (Text to Speech: Korean - KR / HeamiRUS)

    However, Speech to Text does not support Korean. Only English, Chinese, French, German, Italian, and Spanish are in the list. There is no Korean in the list. Please add Korean.

    6 votes
    Sign in
    Check!
    (thinking…)
    Reset
    or sign in with
    • facebook
    • google
      Password icon
      I agree to the terms of service
      Signed in as (Sign out)

      We’ll send you updates on this idea

      0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
    • Strong name

      Please strong name the SpeechClient.dll so we can included in signed projects.

      1 vote
      Sign in
      Check!
      (thinking…)
      Reset
      or sign in with
      • facebook
      • google
        Password icon
        I agree to the terms of service
        Signed in as (Sign out)

        We’ll send you updates on this idea

        0 comments  ·  Samples & SDK Request  ·  Flag idea as inappropriate…  ·  Admin →
      • ARM64 version of libandroid_platform.so is needed

        Your SDK cannot be used with the vast majority of current Android hardware.

        Your team has commented elsewhere (Github?) that you intend to open source this portion of the product rather than provide prebuilt binaries. If that's true could you give a date for when it will happen?

        1 vote
        Sign in
        Check!
        (thinking…)
        Reset
        or sign in with
        • facebook
        • google
          Password icon
          I agree to the terms of service
          Signed in as (Sign out)

          We’ll send you updates on this idea

          0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
        • Speech to text - Dutch

          Dutch support for the Bing Speech to Text!

          2 votes
          Sign in
          Check!
          (thinking…)
          Reset
          or sign in with
          • facebook
          • google
            Password icon
            I agree to the terms of service
            Signed in as (Sign out)

            We’ll send you updates on this idea

            0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
          • Need timestamp information for speech to text

            Hello,

            Please include timestamps in your speech to text api output.

            Thank you.

            williamj

            2 votes
            Sign in
            Check!
            (thinking…)
            Reset
            or sign in with
            • facebook
            • google
              Password icon
              I agree to the terms of service
              Signed in as (Sign out)

              We’ll send you updates on this idea

              0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
            • Speech to Text Android

              PAI used AudioRecord to record, but did not return an instance to us and did not return the volume of the recording. We need to get the volume to show to the user.

              How do we deal with it?

              Thanks

              1 vote
              Sign in
              Check!
              (thinking…)
              Reset
              or sign in with
              • facebook
              • google
                Password icon
                I agree to the terms of service
                Signed in as (Sign out)

                We’ll send you updates on this idea

                0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
              • Dutch language support in Bing text-to-speech API

                The Bing text-to-speech API supports 10 languages, but of course, there are many more. Dutch is not yet supported.

                https://www.microsoft.com/cognitive-services/en-us/speech-api

                I have a Cognitive Services account for Bing Speech and I have software working to provide TTS in the supported languages. But the main language that is of interest to me is Dutch.

                I would very much appreciate if Dutch can be added.

                3 votes
                Sign in
                Check!
                (thinking…)
                Reset
                or sign in with
                • facebook
                • google
                  Password icon
                  I agree to the terms of service
                  Signed in as (Sign out)

                  We’ll send you updates on this idea

                  4 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
                • How can we implement this speech API in a J2ee web app. I am using servlet, jsp, hibernate, eclipse ee for development.

                  How can we implement this speech API in a J2ee web app. I am using servlet, jsp, hibernate, eclipse ee for development.
                  On the front end i am using html, css, js, jquery.

                  I want to fill the form by using text input . as well as the actions should take place like navigating to a particular page.

                  1 vote
                  Sign in
                  Check!
                  (thinking…)
                  Reset
                  or sign in with
                  • facebook
                  • google
                    Password icon
                    I agree to the terms of service
                    Signed in as (Sign out)

                    We’ll send you updates on this idea

                    0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
                  • 1 vote
                    Sign in
                    Check!
                    (thinking…)
                    Reset
                    or sign in with
                    • facebook
                    • google
                      Password icon
                      I agree to the terms of service
                      Signed in as (Sign out)

                      We’ll send you updates on this idea

                      0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                    • Improved English GB Male voice

                      At https://www.microsoft.com/cognitive-services/en-us/Speech-api/documentation/API-Reference-REST/BingVoiceOutput#SupLocales there are some new voices suffixed by RUS.
                      I cannot find what RUS stands for, but they are significantly better quality. There are now two British Female voices, one acceptable, one excellent, while the British Male voice remains very low quality.

                      1 vote
                      Sign in
                      Check!
                      (thinking…)
                      Reset
                      or sign in with
                      • facebook
                      • google
                        Password icon
                        I agree to the terms of service
                        Signed in as (Sign out)

                        We’ll send you updates on this idea

                        0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
                      • Maximum request length

                        There's no clear documentation on the maximum request length that the Text-to-Speech API can support. My plan was to chunk my text according to this maximum request length. Since my application is aggressive/greedy (try to max out every call), I often get a 413 Error "RequestEntityTooLarge" most of the time.

                        I found this on Microsoft's web site: "The maximum amount of audio returned for a given request must not exceed 15 seconds." Which is quite useless because the length of the audio cannot be known from the client side at the time the request is generated. And I found that…

                        4 votes
                        Sign in
                        Check!
                        (thinking…)
                        Reset
                        or sign in with
                        • facebook
                        • google
                          Password icon
                          I agree to the terms of service
                          Signed in as (Sign out)

                          We’ll send you updates on this idea

                          0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
                        • Support for Windows 10 UWP in Bing.Speech

                          The NuGet packages for

                          Microsoft.Bing.Speech (2.0.2)
                          Microsoft.ProjectOxford.SpeechRecognition-x64 (1.0.0.3)
                          Microsoft.ProjectOxford.SpeechRecognition-x86 (1.0.0.3)

                          Do not support Windows 10 UWP apps. Trying to install results in "Package Microsoft.Bing.Speech 2.0.2 is not compatible with uap10.0 (UAP,Version=v10.0)"

                          4 votes
                          Sign in
                          Check!
                          (thinking…)
                          Reset
                          or sign in with
                          • facebook
                          • google
                            Password icon
                            I agree to the terms of service
                            Signed in as (Sign out)

                            We’ll send you updates on this idea

                            0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                          • Please could you add 8-bit audio to this and speaker reco

                            Hi folks, I've worked across large and mid range contact center and speech services in the industry. Your SDK's all appear to lack 8-bit 8-KHZ support. I don't understand as your mcdonalds luis demo handles well with very poor sound quality. Every other vendor supports the 8-bit format except you. It basically means over the PSTN phone channel your products are totally irrelevant unless you are on fast wifi supporting 16-bit, mobile data channel. Given the above, means these platforms are no good for PSTN telephony connected systems shutting you out of this multi-billion dollar market. This surprises me but…

                            1 vote
                            Sign in
                            Check!
                            (thinking…)
                            Reset
                            or sign in with
                            • facebook
                            • google
                              Password icon
                              I agree to the terms of service
                              Signed in as (Sign out)

                              We’ll send you updates on this idea

                              0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                            • Provide per word timecodes on final result

                              When returning results on other ASR services you get usually an array of words with a per word timecode and confidence.

                              2 votes
                              Sign in
                              Check!
                              (thinking…)
                              Reset
                              or sign in with
                              • facebook
                              • google
                                Password icon
                                I agree to the terms of service
                                Signed in as (Sign out)

                                We’ll send you updates on this idea

                                2 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                              • Microsoft Bing Speech Recognition

                                Hi,

                                I am using Bing Speech API microsoft.com/cognitive-services/en-us/Spee.. for ASR.

                                I want to do continuous speech recognition from the microphone in Java. But the data we get from the microphone is raw data. I know we have to set wav header to the raw audio data before calling the REST API.

                                I am using the below code to set the header

                                byte[] header = new byte44;
                                ByteArrayOutputStream baos = null;
                                DataOutputStream dos = null;
                                try { // create byte array output stream
                                baos = new ByteArrayOutputStream();
                                short nChannels = 1;
                                short mBitsPersample = 16; // create data output stream
                                dos…

                                3 votes
                                Sign in
                                Check!
                                (thinking…)
                                Reset
                                or sign in with
                                • facebook
                                • google
                                  Password icon
                                  I agree to the terms of service
                                  Signed in as (Sign out)

                                  We’ll send you updates on this idea

                                  0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                                • Korean language support in text to speech api

                                  It would be nice if we could have Korean language support.

                                  7 votes
                                  Sign in
                                  Check!
                                  (thinking…)
                                  Reset
                                  or sign in with
                                  • facebook
                                  • google
                                    Password icon
                                    I agree to the terms of service
                                    Signed in as (Sign out)

                                    We’ll send you updates on this idea

                                    6 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
                                  • iOS STT Needs to be faster

                                    Based on my test, MS iOS STT is slower than Nuance's SpeechToText iOS service.

                                    I installed the MS iOS STT sample project and the Nuance iOS sample project on the same iPhone. Then I spoke to both apps the same sentence - "What's the date today". It took 3+ seconds for Nuance's app to return the right answer. However it took 5+ seconds for MS iOS to return the answer.

                                    I hope that MS can improve the performance.

                                    2 votes
                                    Sign in
                                    Check!
                                    (thinking…)
                                    Reset
                                    or sign in with
                                    • facebook
                                    • google
                                      Password icon
                                      I agree to the terms of service
                                      Signed in as (Sign out)

                                      We’ll send you updates on this idea

                                      0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                                    • feedback for mistakes?

                                      The speech to text api seems to have a lot of trouble with names, especially foreign names. I was wondering if there was a way to give feedback (or 'label it correctly), so that it won't keep repeating the same mistake.

                                      5 votes
                                      Sign in
                                      Check!
                                      (thinking…)
                                      Reset
                                      or sign in with
                                      • facebook
                                      • google
                                        Password icon
                                        I agree to the terms of service
                                        Signed in as (Sign out)

                                        We’ll send you updates on this idea

                                        1 comment  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                                      • Start and Duration for RecognizedPhrase or RecognitionResult

                                        The built in Windows Speech Recognition APIs allow us to tie recorded text to the corresponding portion of the audio. Could such ability be introduced to Bing Speech?

                                        3 votes
                                        Sign in
                                        Check!
                                        (thinking…)
                                        Reset
                                        or sign in with
                                        • facebook
                                        • google
                                          Password icon
                                          I agree to the terms of service
                                          Signed in as (Sign out)

                                          We’ll send you updates on this idea

                                          0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
                                        • Get to know time offset

                                          I suppose that tme offiset information could be needed, Sometimes, for example, To compose subtitle of video clip using speech to text service because of being sync up video and text or clipping the silence frame and so on.

                                          2 votes
                                          Sign in
                                          Check!
                                          (thinking…)
                                          Reset
                                          or sign in with
                                          • facebook
                                          • google
                                            Password icon
                                            I agree to the terms of service
                                            Signed in as (Sign out)

                                            We’ll send you updates on this idea

                                            0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                                          ← Previous 1 3
                                          • Don't see your idea?

                                          Feedback and Knowledge Base