Bing Speech

Welcome to the Bing Speech API Forum

Categories

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Requests – Let us know if you would like to see a tutorial or sample provided.

Speech to Text – API & SDK – Ideas and feature requests to Speech Recognition and Speech to Text (STT).

Text to Speech – Ideas and feature requests for Text to Speech (TTS) – API only

How can we improve Bing Speech API?

(thinking…)

Enter your idea and we'll search to see if someone has already suggested it.

If a similar idea already exists, you can support and comment on it.

If it doesn't exist, you can post your idea so others can support it.

Enter your idea and we'll search to see if someone has already suggested it.

  1. Maximum request length

    There's no clear documentation on the maximum request length that the Text-to-Speech API can support. My plan was to chunk my text according to this maximum request length. Since my application is aggressive/greedy (try to max out every call), I often get a 413 Error "RequestEntityTooLarge" most of the time.

    I found this on Microsoft's web site: "The maximum amount of audio returned for a given request must not exceed 15 seconds." Which is quite useless because the length of the audio cannot be known from the client side at the time the request is generated. And I found that…

    3 votes
    Sign in
    Check!
    (thinking…)
    Reset
    or sign in with
    • facebook
    • google
      Password icon
      I agree to the terms of service
      Signed in as (Sign out)

      We’ll send you updates on this idea

      0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
    • Dutch language support in Bing text-to-speech API

      The Bing text-to-speech API supports 10 languages, but of course, there are many more. Dutch is not yet supported.

      https://www.microsoft.com/cognitive-services/en-us/speech-api

      I have a Cognitive Services account for Bing Speech and I have software working to provide TTS in the supported languages. But the main language that is of interest to me is Dutch.

      I would very much appreciate if Dutch can be added.

      1 vote
      Sign in
      Check!
      (thinking…)
      Reset
      or sign in with
      • facebook
      • google
        Password icon
        I agree to the terms of service
        Signed in as (Sign out)

        We’ll send you updates on this idea

        0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
      • Please could you add 8-bit audio to this and speaker reco

        Hi folks, I've worked across large and mid range contact center and speech services in the industry. Your SDK's all appear to lack 8-bit 8-KHZ support. I don't understand as your mcdonalds luis demo handles well with very poor sound quality. Every other vendor supports the 8-bit format except you. It basically means over the PSTN phone channel your products are totally irrelevant unless you are on fast wifi supporting 16-bit, mobile data channel. Given the above, means these platforms are no good for PSTN telephony connected systems shutting you out of this multi-billion dollar market. This surprises me but…

        1 vote
        Sign in
        Check!
        (thinking…)
        Reset
        or sign in with
        • facebook
        • google
          Password icon
          I agree to the terms of service
          Signed in as (Sign out)

          We’ll send you updates on this idea

          0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
        • Support for Windows 10 UWP in Bing.Speech

          The NuGet packages for

          Microsoft.Bing.Speech (2.0.2)
          Microsoft.ProjectOxford.SpeechRecognition-x64 (1.0.0.3)
          Microsoft.ProjectOxford.SpeechRecognition-x86 (1.0.0.3)

          Do not support Windows 10 UWP apps. Trying to install results in "Package Microsoft.Bing.Speech 2.0.2 is not compatible with uap10.0 (UAP,Version=v10.0)"

          1 vote
          Sign in
          Check!
          (thinking…)
          Reset
          or sign in with
          • facebook
          • google
            Password icon
            I agree to the terms of service
            Signed in as (Sign out)

            We’ll send you updates on this idea

            0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
          • Provide per word timecodes on final result

            When returning results on other ASR services you get usually an array of words with a per word timecode and confidence.

            1 vote
            Sign in
            Check!
            (thinking…)
            Reset
            or sign in with
            • facebook
            • google
              Password icon
              I agree to the terms of service
              Signed in as (Sign out)

              We’ll send you updates on this idea

              1 comment  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
            • Microsoft Bing Speech Recognition

              Hi,

              I am using Bing Speech API microsoft.com/cognitive-services/en-us/Spee.. for ASR.

              I want to do continuous speech recognition from the microphone in Java. But the data we get from the microphone is raw data. I know we have to set wav header to the raw audio data before calling the REST API.

              I am using the below code to set the header

              byte[] header = new byte44;
              ByteArrayOutputStream baos = null;
              DataOutputStream dos = null;
              try { // create byte array output stream
              baos = new ByteArrayOutputStream();
              short nChannels = 1;
              short mBitsPersample = 16; // create data output stream
              dos…

              3 votes
              Sign in
              Check!
              (thinking…)
              Reset
              or sign in with
              • facebook
              • google
                Password icon
                I agree to the terms of service
                Signed in as (Sign out)

                We’ll send you updates on this idea

                0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
              • Korean language support in text to speech api

                It would be nice if we could have Korean language support.

                5 votes
                Sign in
                Check!
                (thinking…)
                Reset
                or sign in with
                • facebook
                • google
                  Password icon
                  I agree to the terms of service
                  Signed in as (Sign out)

                  We’ll send you updates on this idea

                  3 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
                • iOS STT Needs to be faster

                  Based on my test, MS iOS STT is slower than Nuance's SpeechToText iOS service.

                  I installed the MS iOS STT sample project and the Nuance iOS sample project on the same iPhone. Then I spoke to both apps the same sentence - "What's the date today". It took 3+ seconds for Nuance's app to return the right answer. However it took 5+ seconds for MS iOS to return the answer.

                  I hope that MS can improve the performance.

                  2 votes
                  Sign in
                  Check!
                  (thinking…)
                  Reset
                  or sign in with
                  • facebook
                  • google
                    Password icon
                    I agree to the terms of service
                    Signed in as (Sign out)

                    We’ll send you updates on this idea

                    0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                  • feedback for mistakes?

                    The speech to text api seems to have a lot of trouble with names, especially foreign names. I was wondering if there was a way to give feedback (or 'label it correctly), so that it won't keep repeating the same mistake.

                    3 votes
                    Sign in
                    Check!
                    (thinking…)
                    Reset
                    or sign in with
                    • facebook
                    • google
                      Password icon
                      I agree to the terms of service
                      Signed in as (Sign out)

                      We’ll send you updates on this idea

                      1 comment  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                    • Start and Duration for RecognizedPhrase or RecognitionResult

                      The built in Windows Speech Recognition APIs allow us to tie recorded text to the corresponding portion of the audio. Could such ability be introduced to Bing Speech?

                      2 votes
                      Sign in
                      Check!
                      (thinking…)
                      Reset
                      or sign in with
                      • facebook
                      • google
                        Password icon
                        I agree to the terms of service
                        Signed in as (Sign out)

                        We’ll send you updates on this idea

                        0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
                      • Please support Windows 7 and .NET Framework 4.0

                        Please support Windows 7 and .NET Framework 4.0, most of the users are still using Windows 7 and .NET Framework 4.0.

                        1 vote
                        Sign in
                        Check!
                        (thinking…)
                        Reset
                        or sign in with
                        • facebook
                        • google
                          Password icon
                          I agree to the terms of service
                          Signed in as (Sign out)

                          We’ll send you updates on this idea

                          0 comments  ·  Samples & SDK Request  ·  Flag idea as inappropriate…  ·  Admin →
                        • please add hebrew tts

                          please add hebrew tts

                          2 votes
                          Sign in
                          Check!
                          (thinking…)
                          Reset
                          or sign in with
                          • facebook
                          • google
                            Password icon
                            I agree to the terms of service
                            Signed in as (Sign out)

                            We’ll send you updates on this idea

                            0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
                          • Get to know time offset

                            I suppose that tme offiset information could be needed, Sometimes, for example, To compose subtitle of video clip using speech to text service because of being sync up video and text or clipping the silence frame and so on.

                            1 vote
                            Sign in
                            Check!
                            (thinking…)
                            Reset
                            or sign in with
                            • facebook
                            • google
                              Password icon
                              I agree to the terms of service
                              Signed in as (Sign out)

                              We’ll send you updates on this idea

                              0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                            • Text to Phoneme

                              Provide text to phoneme capability to the API, on top of only the speech output.

                              2 votes
                              Sign in
                              Check!
                              (thinking…)
                              Reset
                              or sign in with
                              • facebook
                              • google
                                Password icon
                                I agree to the terms of service
                                Signed in as (Sign out)

                                We’ll send you updates on this idea

                                0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
                              • Punctuation in REST API

                                It appears the iOS and Android versions of the speech to text tools can add punctuation. I'd like the see the same functionality in the REST API.

                                When is that functionality coming?

                                1 vote
                                Sign in
                                Check!
                                (thinking…)
                                Reset
                                or sign in with
                                • facebook
                                • google
                                  Password icon
                                  I agree to the terms of service
                                  Signed in as (Sign out)

                                  We’ll send you updates on this idea

                                  1 comment  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                                • Oxford Speech2Text in NodeJS

                                  a sample speech2text to use Oxford in NodeJS, Or at least a sample to detect silences or key word (hey cortana) to throw REST request.

                                  1 vote
                                  Sign in
                                  Check!
                                  (thinking…)
                                  Reset
                                  or sign in with
                                  • facebook
                                  • google
                                    Password icon
                                    I agree to the terms of service
                                    Signed in as (Sign out)

                                    We’ll send you updates on this idea

                                    0 comments  ·  Samples & SDK Request  ·  Flag idea as inappropriate…  ·  Admin →
                                  • Support speech to text for long interview / meeting recordings

                                    I'm working on a podcast with my friends and we do lots of interviews. I'd like to use the speech-to-text API to convert the recordings to transcripts to make post-editing easier. However, there is a limit of the input audio file size, less than 14 seconds, according to the documentation.

                                    This feature would also be useful to generate transcripts of meeting recordings for searching.

                                    3 votes
                                    Sign in
                                    Check!
                                    (thinking…)
                                    Reset
                                    or sign in with
                                    • facebook
                                    • google
                                      Password icon
                                      I agree to the terms of service
                                      Signed in as (Sign out)

                                      We’ll send you updates on this idea

                                      2 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                                    • Please give more detail in curl command

                                      in the steps, it says Replace your_instance_id, your_request_id, your_locale, your_device_os in accordance to your own application.
                                      But there is no explanation of those variables, what are those? how can i get them? what does it mean "your own application"?

                                      2 votes
                                      Sign in
                                      Check!
                                      (thinking…)
                                      Reset
                                      or sign in with
                                      • facebook
                                      • google
                                        Password icon
                                        I agree to the terms of service
                                        Signed in as (Sign out)

                                        We’ll send you updates on this idea

                                        0 comments  ·  Documentation  ·  Flag idea as inappropriate…  ·  Admin →
                                      • SEA languages

                                        It would be great to have SEA languages support. At least those:
                                        - Vietnamese
                                        - Filipino
                                        - Indonesian, Javanese

                                        1 vote
                                        Sign in
                                        Check!
                                        (thinking…)
                                        Reset
                                        or sign in with
                                        • facebook
                                        • google
                                          Password icon
                                          I agree to the terms of service
                                          Signed in as (Sign out)

                                          We’ll send you updates on this idea

                                          0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
                                        • Improve noise reduction so speech to text translation is accurate

                                          Per your helpdesk (REG116092614718982): At the moment, our models are not able to handle noise and hence the transcription results are inaccurate.

                                          This results in the following scenario:

                                          Actual recording:
                                          Hi Agostino, this is Chris with Oracle. I sent you a couple of emails and just wanted to check to see if they were at all relevant to you. If you could please give me a call back or respond to one of those emails, my number is 512XXXXXXX. Thanks a lot Agostino.

                                          Response from Bing Speech API:
                                          I could get a couple emails do you please give me a…

                                          3 votes
                                          Sign in
                                          Check!
                                          (thinking…)
                                          Reset
                                          or sign in with
                                          • facebook
                                          • google
                                            Password icon
                                            I agree to the terms of service
                                            Signed in as (Sign out)

                                            We’ll send you updates on this idea

                                            0 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
                                          ← Previous 1
                                          • Don't see your idea?

                                          Feedback and Knowledge Base