Microsoft

Custom Speech Service

Welcome to the Custom Speech Service API Forum

Categories

API – Any ideas or feedback pertaining to features or enhancements to Custom Speech Service API.

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Request – Let us know if you would like to see a Code sample or SDK provided.

  1. Train existing model with additional data

    After having created and trained a model with a dataset, whenever i get additional data for training, i would like to -


    1. create a new dataset with just the additional data - possible currently.

    2. re-train an existing model with this new additional dataset.

    So all models created by me need to added as baseline models so I can specify my own model as the baseline model in the API and provide the id of the new dataset.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  2. Create Model API doesn't seem to be working

    I tried to create a new model using an existing dataset through the REST api. I get a 202 response but the UI doesn't reflect the new model. Am i doing something wrong?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  3. Delete project does not work

    I can't delete the project, looks like a bug

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  4. Custom model training with multiple datasets

    I'm trying to train a custom speech model using about 5 GB of data that I have. I realize the upload limit is 2GB archives, so I've split my data into multiple bundles to upload them separately. However, in the UI I don't see how it is possible to train a model using multiple archives. In particular, the "select training data" flow uses radio buttons to select data so I am only able to select a single model. Is it possible to train a model with multiple archives of data?

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  5. Language model training stuck on running

    I have been trying to train a new language model with custom adaptation data and the status is stuck perpetually on "Running".

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  6. Problems with dataset

    I'm trying to create a custom model. But when I upload the acoustic dataset, I always get a failed status.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  7. Problems with dataset

    I'm trying to create a custom model. But when I upload the acoustic dataset, I always get a failed status.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  8. Using Custom voice font based on a different locale

    My goal is to translate translated text to speech using a voice as close as the original speeker. Since the speaker doesn't speak the target language, I can not create a custom voice font for the targeting locale. I saw a Microsoft demo a few years ago, seeming give a english speaker his own voice to speak Chinese. I wonder how that is done.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  9. Xamarin supported sdk

    I want to use the Cognitive speech service (real-time continuous speech to text and interim results) in the Xamarin app. Is there any SDKs or plugin available? Since REST API has some limitations (no interim results), i am unable to go with it.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  4 comments  ·  Samples & SDK Requests  ·  Flag idea as inappropriate…  ·  Admin →
  10. Acoustic model support for languages "de-DE" and "fr-FR"

    The standard speech service isn't enough specific. Hence, we want to train the Microsoft Custom Speech Service (CRIS) for dynamic conditions in a non-english environmnent - at first for locale "de-DE" later for "fr-FR". But there is no support for the two languages currently.
    Otherwise for creating language model the locales "de-DE" and "fr-FR" are available.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    3 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  11. Confidence / probability value per recognized word or fragment

    A confidence number value per word or per speech fragment would be very interesting. Because with this confidence value it would be possible to get self-assessments of the Microsoft speech service, if it recognizes the word or the fragment correctly or not. Unfortunately I didn't found a such possibility.

    4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  12. Add the ability to version the same endpoint

    Add the ability to deploy new versions of the same endpoint have updateable stage/prod endpoints (like LUIS) so that we can easily swap models under an endpoint without needing to adapt the application that uses it.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  13. Add the ability to collaborate on the same projects (datasets, models and enpoints)

    We are multiple users with a company account subscription for the cognitive services. We would like to be able to collaborate on the same datasets, models and endpoints. LUIS currently supports this, but CRIS doesn't.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  14. Does‘not it support Chinese and English mixing?

    I need to create a custom language model, which is based on Chinese. But there may be some English words in the sentence.
    E.g. "如何使用Veeva?"

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  15. How do we choose a suitable API protocol in endpoint?

    Here are 5 Protocols when I deploy, if I concern the response time, is there any different between them?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  16. Add the ability to provide a custom dictionary

    Right now, our only option when we simply want to provide a custom set of words that are not commonly used in the English language (E.G. medical terms), our only option is to upload the dictionary as if it was a set of transcriptions, which creates a bias as it does not contain full sentences. Creating full sentences from these terms can be a long and painful process (around 100k words in the dictionary)

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  17. Create a C# SDK for the WebSocket with the Speech Protocol

    With the C# SDKs and endpoints now deprecated, there is not supported SDK for C#. It would be great to have the support for the Speech Protocol in C#.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Samples & SDK Requests  ·  Flag idea as inappropriate…  ·  Admin →
  18. Provide an API to fetch the logs with audio (endpoint data export)

    It would be nice to be able to programmatically pull the audio logs from the system so that we may re-train the system with all utterances if something goes wrong.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  19. Deployment Data Exports download wav files are corrupt

    Tried the new Customer Speech Service -> Deployment Endpoint -> Deployment Data Exports functionality. Created the instance and downloaded the results, but the wav files in the zip file are corrupt.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  20. I would like to build a customm language model based on a grammar rather than a set of examples. Is there any support for this?

    I would like to build a customm language model based on a grammar rather than a set of examples. Is there any support for this?

    For example: all speech will start with a page direction request (e.g. ' go to page', 'show me page', 'show me') followed by a page specification, which will either be a number or a letter followed by a number.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
← Previous 1
  • Don't see your idea?

Custom Speech Service

Categories

Feedback and Knowledge Base