Microsoft

Bing Speech

Welcome to the Bing Speech Forum

The Cognitive Service's Speech Service is replacing Bing Speech. Please refer to their forum for speech product feedback


Categories

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Requests – Let us know if you would like to see a tutorial or sample provided.

Speech to Text – API & SDK – Ideas and feature requests to Speech Recognition and Speech to Text (STT).

Text to Speech – Ideas and feature requests for Text to Speech (TTS) – API only


  1. Need timestamp information for speech to text

    Hello,

    Please include timestamps in your speech to text api output.

    Thank you.

    williamj

    7 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  2 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
  2. How can we implement this speech API in a J2ee web app. I am using servlet, jsp, hibernate, eclipse ee for development.

    How can we implement this speech API in a J2ee web app. I am using servlet, jsp, hibernate, eclipse ee for development.
    On the front end i am using html, css, js, jquery.

    I want to fill the form by using text input . as well as the actions should take place like navigating to a particular page.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
  3. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
  4. Improved English GB Male voice

    At https://www.microsoft.com/cognitive-services/en-us/Speech-api/documentation/API-Reference-REST/BingVoiceOutput#SupLocales there are some new voices suffixed by RUS.
    I cannot find what RUS stands for, but they are significantly better quality. There are now two British Female voices, one acceptable, one excellent, while the British Male voice remains very low quality.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
  5. Dutch language support in Bing text-to-speech API

    The Bing text-to-speech API supports 10 languages, but of course, there are many more. Dutch is not yet supported.

    https://www.microsoft.com/cognitive-services/en-us/speech-api

    I have a Cognitive Services account for Bing Speech and I have software working to provide TTS in the supported languages. But the main language that is of interest to me is Dutch.

    I would very much appreciate if Dutch can be added.

    5 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  4 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
  6. Please could you add 8-bit audio to this and speaker reco

    Hi folks, I've worked across large and mid range contact center and speech services in the industry. Your SDK's all appear to lack 8-bit 8-KHZ support. I don't understand as your mcdonalds luis demo handles well with very poor sound quality. Every other vendor supports the 8-bit format except you. It basically means over the PSTN phone channel your products are totally irrelevant unless you are on fast wifi supporting 16-bit, mobile data channel. Given the above, means these platforms are no good for PSTN telephony connected systems shutting you out of this multi-billion dollar market. This surprises me but…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
  7. Maximum request length

    There's no clear documentation on the maximum request length that the Text-to-Speech API can support. My plan was to chunk my text according to this maximum request length. Since my application is aggressive/greedy (try to max out every call), I often get a 413 Error "RequestEntityTooLarge" most of the time.

    I found this on Microsoft's web site: "The maximum amount of audio returned for a given request must not exceed 15 seconds." Which is quite useless because the length of the audio cannot be known from the client side at the time the request is generated. And I found that…

    7 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
  8. Support for Windows 10 UWP in Bing.Speech

    The NuGet packages for

    Microsoft.Bing.Speech (2.0.2)
    Microsoft.ProjectOxford.SpeechRecognition-x64 (1.0.0.3)
    Microsoft.ProjectOxford.SpeechRecognition-x86 (1.0.0.3)

    Do not support Windows 10 UWP apps. Trying to install results in "Package Microsoft.Bing.Speech 2.0.2 is not compatible with uap10.0 (UAP,Version=v10.0)"

    5 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  2 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
  9. Provide per word timecodes on final result

    When returning results on other ASR services you get usually an array of words with a per word timecode and confidence.

    7 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  5 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
  10. Microsoft Bing Speech Recognition

    Hi,

    I am using Bing Speech API microsoft.com/cognitive-services/en-us/Spee.. for ASR.

    I want to do continuous speech recognition from the microphone in Java. But the data we get from the microphone is raw data. I know we have to set wav header to the raw audio data before calling the REST API.

    I am using the below code to set the header

    byte[] header = new byte44;
    ByteArrayOutputStream baos = null;
    DataOutputStream dos = null;
    try { // create byte array output stream
    baos = new ByteArrayOutputStream();
    short nChannels = 1;
    short mBitsPersample = 16; // create data output stream
    dos…

    4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
  11. Korean language support in text to speech api

    It would be nice if we could have Korean language support.

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Completed  ·  6 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  12. feedback for mistakes?

    The speech to text api seems to have a lot of trouble with names, especially foreign names. I was wondering if there was a way to give feedback (or 'label it correctly), so that it won't keep repeating the same mistake.

    5 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  2 comments  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
  13. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Documentation  ·  Flag idea as inappropriate…  ·  Admin →
  14. Please support Windows 7 and .NET Framework 4.0

    Please support Windows 7 and .NET Framework 4.0, most of the users are still using Windows 7 and .NET Framework 4.0.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Samples & SDK Request  ·  Flag idea as inappropriate…  ·  Admin →
  15. Get to know time offset

    I suppose that tme offiset information could be needed, Sometimes, for example, To compose subtitle of video clip using speech to text service because of being sync up video and text or clipping the silence frame and so on.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
  16. Start and Duration for RecognizedPhrase or RecognitionResult

    The built in Windows Speech Recognition APIs allow us to tie recorded text to the corresponding portion of the audio. Could such ability be introduced to Bing Speech?

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text to Speech - API Only  ·  Flag idea as inappropriate…  ·  Admin →
  17. please add hebrew tts

    please add hebrew tts

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  18. Punctuation in REST API

    It appears the iOS and Android versions of the speech to text tools can add punctuation. I'd like the see the same functionality in the REST API.

    When is that functionality coming?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Completed  ·  1 comment  ·  Speech to Text - API & SDK  ·  Flag idea as inappropriate…  ·  Admin →
  19. Oxford Speech2Text in NodeJS

    a sample speech2text to use Oxford in NodeJS, Or at least a sample to detect silences or key word (hey cortana) to throw REST request.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Samples & SDK Request  ·  Flag idea as inappropriate…  ·  Admin →
  20. SEA languages

    It would be great to have SEA languages support. At least those:
    - Vietnamese
    - Filipino
    - Indonesian, Javanese

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →

Feedback and Knowledge Base