Microsoft

Custom Speech Service

Welcome to the Custom Speech Service API Forum

Categories

API – Any ideas or feedback pertaining to features or enhancements to Custom Speech Service API.

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Requests – Let us know if you would like to see a code sample or SDK provided.

How can we improve Custom Speech Service for Cognitive Services?

Enter your idea and we'll search to see if someone has already suggested it.

If a similar idea already exists, you can support and comment on it.

If it doesn't exist, you can post your idea so others can support it.

  1. Language model training stuck on running

    I have been trying to train a new language model with custom adaptation data and the status is stuck perpetually on "Running".

    2 votes
    1 comment  ·  Language Support
  2. Problems with dataset

    I'm trying to create a custom model. But when I upload the acoustic dataset, I always get a failed status.

    1 vote
    1 comment  ·  Language Support
  3. Problems with dataset

    I'm trying to create a custom model. But when I upload the acoustic dataset, I always get a failed status.

    1 vote
    0 comments  ·  Language Support
  4. Using Custom voice font based on a different locale

    My goal is to convert translated text to speech using a voice as close as possible to the original speaker's. Since the speaker doesn't speak the target language, I cannot create a custom voice font for the target locale. I saw a Microsoft demo a few years ago that seemed to give an English speaker his own voice speaking Chinese. I wonder how that is done.

    1 vote
    0 comments  ·  Language Support
  5. Xamarin supported sdk

    I want to use the Cognitive speech service (real-time continuous speech to text and interim results) in a **Xamarin** app. Are there any SDKs or plugins available? Since the REST API has some limitations (no interim results), I am unable to go with it.

    3 votes
    Under Review  ·  3 comments  ·  Samples & SDK Requests
  6. Acoustic model support for languages "de-DE" and "fr-FR"

    The standard speech service isn't specific enough. Hence, we want to train the Microsoft Custom Speech Service (CRIS) for dynamic conditions in a non-English environment, first for locale "de-DE" and later for "fr-FR". But there is currently no acoustic model support for these two languages.
    For creating a language model, on the other hand, the locales "de-DE" and "fr-FR" are available.

    2 votes
    3 comments  ·  API
  7. Confidence / probability value per recognized word or fragment

    A confidence value per word or per speech fragment would be very useful. With this value, it would be possible to get the Microsoft speech service's own assessment of whether it recognized the word or fragment correctly. Unfortunately, I didn't find any such option.

    4 votes
    0 comments  ·  API
  8. Add the ability to version the same endpoint

    Add the ability to deploy new versions of the same endpoint and to have updateable stage/prod endpoints (like LUIS), so that we can easily swap models under an endpoint without needing to adapt the application that uses it.

    1 vote
    0 comments  ·  API
  9. Add the ability to collaborate on the same projects (datasets, models and endpoints)

    We are multiple users with a company account subscription for the cognitive services. We would like to be able to collaborate on the same datasets, models and endpoints. LUIS currently supports this, but CRIS doesn't.

    1 vote
    1 comment  ·  API
  10. Doesn't it support mixing Chinese and English?

    I need to create a custom language model, which is based on Chinese. But there may be some English words in the sentence.
    E.g. "如何使用Veeva?"

    2 votes
    0 comments  ·  Language Support
  11. How do we choose a suitable API protocol for an endpoint?

    There are 5 protocols to choose from when I deploy. If I am concerned about response time, is there any difference between them?

    1 vote
    1 comment  ·  API
  12. Add the ability to provide a custom dictionary

    Right now, when we simply want to provide a custom set of words that are not commonly used in the English language (e.g. medical terms), our only option is to upload the dictionary as if it were a set of transcriptions, which creates a bias because it does not contain full sentences. Creating full sentences from these terms can be a long and painful process (there are around 100k words in the dictionary).

    1 vote
    0 comments  ·  Language Support
  13. Create a C# SDK for the WebSocket with the Speech Protocol

    With the C# SDKs and endpoints now deprecated, there is no supported SDK for C#. It would be great to have support for the Speech Protocol in C#.

    2 votes
    Under Review  ·  0 comments  ·  Samples & SDK Requests
  14. Provide an API to fetch the logs with audio (endpoint data export)

    It would be nice to be able to programmatically pull the audio logs from the system so that we may re-train the system with all utterances if something goes wrong.

    1 vote
    0 comments  ·  API
  15. Deployment Data Exports download wav files are corrupt

    I tried the new Custom Speech Service -> Deployment Endpoint -> Deployment Data Exports functionality. I created the instance and downloaded the results, but the wav files in the zip file are corrupt.

    1 vote
    1 comment  ·  API
  16. I would like to build a custom language model based on a grammar rather than a set of examples. Is there any support for this?

    For example: all speech will start with a page direction request (e.g. 'go to page', 'show me page', 'show me') followed by a page specification, which will either be a number or a letter followed by a number.

    2 votes
    0 comments  ·  API
  17. India English Language Support - en-in Base models

    Are there any plans or tentative dates for a Microsoft Conversational Base Model for the en-in locale, mainly for Indian English?

    With en-us (US English) we are not getting the best results, and the accuracy is too low to make use of it!

    9 votes
    Planned  ·  0 comments  ·  Language Support
  18. Pay as you go pricing

    Pay as you go pricing. The Scale Out option is $6.452/day. However, what if I only need to scale out for 2-4 hours a day (and not at all on weekends)?

    Please consider a pay-per-usage model. That would be amazing!!!

    *Also the calculator is broken (it does not multiply the $6 x 31 days)

    Thanks!

    2 votes
    2 comments  ·  API
  19. Language Japanese - ja-jp

    We have some customers in Japan who want to use Japanese voice data.

    8 votes
    Started  ·  2 comments  ·  Language Support
  20. Usage of custom language and acoustic models

    Get more precision with my own language & acoustic models

    3 votes
    Resolved  ·  1 comment  ·  Language Support
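
A note on idea 16 (a language model built from a grammar instead of examples): nothing in this forum confirms that the service accepts grammars directly. One possible workaround, sketched below under that assumption, is to expand a small grammar into example sentences and supply those as ordinary language adaptation data. The names `REQUESTS`, `page_specs`, and `expand_grammar` are hypothetical, the page ranges are arbitrary, and the upload format expected by the service is not covered here.

```python
import itertools

# Hypothetical grammar based on the example in idea 16:
#   <request> <page>, where <page> is a number or a letter followed by a number.
REQUESTS = ["go to page", "show me page", "show me"]


def page_specs(max_number, letters):
    """Yield page specifications: plain numbers first, then letter + number pairs."""
    numbers = [str(n) for n in range(1, max_number + 1)]
    yield from numbers
    for letter, number in itertools.product(letters, numbers):
        yield f"{letter} {number}"


def expand_grammar(max_number=3, letters="ab"):
    """Return every sentence the toy grammar can produce, one string per sentence."""
    return [
        f"{request} {spec}"
        for request in REQUESTS
        for spec in page_specs(max_number, letters)
    ]


if __name__ == "__main__":
    # Print one sentence per line, e.g. to redirect into a text file.
    for sentence in expand_grammar():
        print(sentence)
```

With the defaults this produces 27 sentences (3 requests times 9 page specifications). Widening `max_number` and `letters` would cover a real page range, at the cost of a larger file.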