Speech Service

                               Attention!





We have moved our Customer Feedback & Ideas for Azure Cognitive Services portal to the Azure Feedback Forum.





Please go to the link below to access our new Feedback and Ideas Page.



  1. Extremely Bulky

    I signed on hesitantly, to test out the text to speech function, am required to put in all sorts of personal information including credit card. To then go into a maze to try to find the **** program, which is like finding the lost ark. this is ridiculous to initial users and am very turned off by it. Really considering not using your services at all. Shame on you Microsoft. no training not tutorial except for other overly complex things. This should have been a simple click and go procedure. Fix your maze!

    I need something simple, place text, hear…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  2. speech studio

    I have been trying to test out the Audio Content Creation. But this error message pops up: "The server is currently unable to handle your request", every time I try. Please advise

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  3. German: einbauen

    Pronounciation of german "einbauen" is terrible. The rest is almost impossible to tell computers and humans apart.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  4. Welsh....

    Is Welsh anywhere on the road map? In the UK there are some legal requirements for services to be available in Welsh alongside English.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  5. Don't pronounce initials as abbreviations in Dutch

    Various initials are mispronounced as abbreviations in Dutch by text-to-speech. Since Teams uses this service for the voicemail messages, this is very annoying. These are a few examples:

    b. - pronounced as bruto
    c. - pronounced as cent
    e. - pronounced as edidiet
    f. - pronounced as forte
    H. - pronounced as heilige
    i. - pronounced as in
    l. - pronounced as loco
    p. - pronounced as piano
    t. - pronounced as temperatuur
    H.M. - pronounced as Hare Majesteit (Her Majesty)
    V.J. - pronounced as vorig jaar (last year).
    G.B. - Gedaan en bieden
    B.G. - Bovengenoemde

    We would like…

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  6. Neural voices in Northern Europe

    Make neural voices available in the Northern Europe region.
    We want to use the Norwegian Neural voice in our product (based in Norway, naturally) but are for some reason locked out of this feature.

    6 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  7. Améliorer la lecture des nombres pour la Suisse

    Bonjour,
    La voix fr-CH, French (Switzerland), Male, "fr-CH-Guillaume" ne lit pas les nombres comme nous le faisons en Suisse, elle le fait comme en France.
    En effet, 70 doit se dire "septante" et non "soixante-" et 90 doit se dire "nonante" et non "quatre-vingt-". En outre selon les régions 80 se dit "huitante". Ce qui en rapport à la majorité des autres langues devrait être la norme.
    Avec mes meilleures salutations,

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  8. does TTS support Speech SDK when using containers?

    Hi, can we use Speech SDK to access TTS service in container?
    Why if using container, STT only supports SDK, and TTS only supports REST API?

    By tests, it seems REST API is slower than SDK, why? Thanks.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  9. Mbps is read as Megabytes per second instead of Megabits per second

    MBps and Mbps are very different things. MBps is 8 times larger than Mbps, so Aria needs to know the difference between the two.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  10. This chapter has many issues with Aria Neural TTS

    This textbook (Computer Security Handbook by Seymour Bosworth et al., Chapter 33) seems to cause countless errors using the Aria Neural voice. (attached txt and pdf were trimmed to respect the copyright of the author)

    It messes up the chapter markers, saying "January First, Thirty Three" when it says "33.1.1" (as well as all the other section markers)

    802.11 is pronounced "eight hundred and two point one one"

    SSIDs is pronounced "sids"

    BSSIDs is pronounced "bsids"

    2Mb/s (as well as other Mb/s numbers) is pronounced "two em bee slash ess" which should be pronounced 2 Megabits per second.

    LAN is…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  11. 802.11 is mispronounced

    802.11 as a wireless standard should be pronounced "eight-o-two-eleven" instead of "eight hundred and two point one one"

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  12. Japanese sentence is mispronounced

    このレストランではタバコを吸ってはいけません。is mispronounced. The はいけません part should be pronounced waikemasen and not haikemasen. This is with the Nanami Neural voice.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  13. No SSML restrictions on creating TTS audio tuning files

    No SSML restrictions to one SSML elements in a SSML file, if you want to realise multiple tunings in the Audio Content Creation, like breaks, pronunciation, intonation etc.
    (source:
    Improve synthesis with the Audio Content Creation tool
    > Create an audio tuning file:
    "SSML restrictions Each SSML file can only contain a single piece of SSML."
    (https://docs.microsoft.com/en-US/azure/cognitive-services/speech-service/how-to-audio-content-creation)

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  14. BUG - The TTS engine (in Engish) doesn't pronounce well numbers after words. COVID-19, NASDAQ 100, S&P 500, NIKKEI 225

    This is a bug not a feature. covid-19 sounds like covid 19. NASDAQ 100
    S&P500
    etc.
    For the neural voices

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  15. Move Irish text to phonetic ipa conversion upstream.

    Irish (Gaeilge) tts works if correct IPA syntax is used when sending the request for tts with the IPA option included.

                                Example... 
    

    Ba chuid mhór den togra seo teacht ar ábhar a bheadh chomh maith nó níos fearr ná an fhuinseog agus an t-ábhar a bheith níos inmharthana.

                         translated to IPA..
    

    "bˠɑː xɪdʲ woːˈr dʲəɴʲ tʲɔɡˈrə ʃoː tʃæxt eˈr ɑːwəˈr ɑː vʲəh xɔv mˠɑːh ɴˠoː ɴʲiːsˠ fʲæˈrr ɴˠɑː en ɪɴʲʃoːɡ əɡʊsˠ en tʲɑːwəˈr ɑː vʲeɪh ɴʲiːsˠ ɪɴʲwəˈrhɑːɴˠɑː"

    The IPA (irish) text can be read and spoken accurately by Neural EN-GB or EN-US voice.

    I have attached a file that converts…

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  16. en-GB Neural voice Mia pronounces number 4 too quickly

    En-GB mia neural voice pronounces the number 4 too quickly in sentences. Whether it is the digit or the word, same problem. I’m able to workaround this by adjusting the prosody speed of just that number :)

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  17. School

    Teacher

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  18. Possibility to talk English, but in the different foreign accents.

    Like a German/French/Spanish/Italian person speaking English, all have their own accent. Perfect for applications like Air Traffic Control etc.

    6 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  19. Pronounced Unites Incorrectly. (E.g mili Watt as mW)

    Some of the unites are not pronounces correctly.
    In my case i am using "9 Mega Watt" as (9MW) and its speaking correctly, but for 9 mili Watt (mW), its speaking wrong and saying Mega Watt. Can you update it, or provide separate access for customization acronyms.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
  20. ARM32 Support for Microsoft.CognitiveServices.Speech API (on Raspbian)

    I'd like to see the speech API coming to Raspberry PI, meaning having FULL ARM32 support for the linux implementation of this SDK. This would enable the raspi maker community to use Azure Speech seamlessly in their devices.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text to Speech  ·  Flag idea as inappropriate…  ·  Admin →
← Previous 1
  • Don't see your idea?

Feedback and Knowledge Base