Speech Service

                               Attention!





We have moved our Customer Feedback & Ideas for Azure Cognitive Services portal to the Azure Feedback Forum.





Please go to the link below to access our new Feedback and Ideas Page.



  1. Support for austrian planned?

    Hi, is there already a date when you will support austrian?

    Kind regards
    Florian Over

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  2. Support for austrian planned?

    Hi, is there already a date when you will support austrian?

    Kind regards
    Florian Over

    0 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  3. Masking Sensitive Information

    Dealing with Numbers
    After transcribing audio into text, we have a Python script were we are trying to deal with privacy. Therefore, we are masking all the numbers (e.g. 0,1,2,3,4,5,6... and one, two, three etc.).

    Unfortunately, this is not fulfilling, since we want to remove on the one hand all the personal info. However, on the other hand keeping product information (e.g. serial number, product number etc.).

    Therefore my question: are there any best practices for this problem (since I cannot imagine that we are the first one dealing with this :-)).

    Many thanks in advance!

    Example:
    Original: Hi, my…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  4. set time out for batch transcription requests

    From what I see in the documentation, there's no option to set a timeout for a batch transcription job. I would like the transcription that is in "NotStarted" status for more than a specific amount of time to be discarded. And I would like to set the timeout value while I am creating the request

    I am doing batch transcription in a loop for a set of audio files one after another. For one of the transcription jobs which was created on 1:26 pm EST 10/22/2020 and until now (2:46 pm) is still in "NotStarted" state. I am not even…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  5. Any CPU build

    Currently the .NET project has to be built in either x64 or x86. Any CPU is not supported. Please change this

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  6. Rest API to accept authorization token for RecordingsUrl

    Hi, my company has been using Azure's speech to text service and are happy with the results.

    However, we've hit a snag. For the RecordingsUrl parameter, I understand we will be passing in a blob uri that's public facing (no auth required), but our audio files are stored in a 3rd party service whereby an auth bearer token is required to be passed into the header to be able to access the .wav file (request header: { Authorization: Bearer <token> }).

    For the speech to text Rest API, is it possible if we can have an additional parameter (eg. RecordingsUrlToken)…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  7. Improve workflow for Intent recognition training

    I used the following workflow for training my intent recognition:
    1) I've a series of entities, features and patterns edited
    2) I've a series of example inputs for training
    3) all the samples have the entities marked
    4) now I train the examples
    5) execute a series of batch test cases

    The issues recommended for improvement
    - The test cases for the batch testing require character positions, startPos and endPos. I had no other option than counting these manually, which is error prone.
    - When loading the batch test cases, the feedback / error log is hard to find and…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  8. ccextractor

    Hello,

    May I recommend you use the closed-captioned text extractor tool called, "ccextractor" in order to compare the results of your Speech-to-Text service.

    The url is:

    https://www.ccextractor.org/

    Thank you.

    Regards,
    William Johnson

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  9. Support for specifying an External ID when creating a batch transcription request, which will be part of the response of the Web Hook

    It would be nice if you could specify an “extern id” when you create a batch transcription request and that the “external id” is also returned in the response of the web hook callback.

    Why? To be able to link a request to an id of a running process/workflow. For example in a durable function. The durable function (using a orchestration) looks like:
    1. Durable function send a message to the speech to text service (STTS) to create a transcription.
    2. Durable function makes a call to context.WaitForExternalEvent<string>("TranscriptionCompleted");
    3. At some point in time the STTS is finished and calls…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  10. How to set VS Code access Mic in MAC OS

    MAC OS like Catalina ask permission to access Mic from Privacy settings. I have resolved issue and add Terminal to the list and run the SDK sample code in Terminal and and Jupyter Notebook successfully. But how can I allow VS Code to run these code normally access Mic?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  11. de vilm wil ik graag in hert deuts wat nu in het pools is

    film in het deuts wat nu in het pools is

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  12. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  13. Properties to define : Max Audio Recognition Time from Microphone OR Stop Recognition on silence

    Hello,

    I am doing speech recognition and am using Android SDK. I plan to move to containers in future. Stopping on silence is default as per documentation. How do i define the following the max audio time recognition time as below:

    If the user is speaking and has spoken more than 15 seconds. The sdk should automatically stop the recognizer on Android end. If the user has spoken less than 15 seconds and was silent in between then it should be based on silence detection. The speech sdk should stop the microphone on android either on silence detection (when spoken…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  14. segmentation length config for recognized result

    from https://github.com/Azure-Samples/cognitive-services-speech-sdk/issues/610#event-3282436941

    The reason and scenario of this asking is that, some time the output recognized text is too long to render friendly, e.g. mobile app with recognized text limit of 2 lines each max 20-char (or less).

    E.g.

    Utterance: I will go to bookstore this afternoon to check if any new arrivals. After that Jack will pick up me there to gym for practice. We need to prepare a match in two weeks. Dinner will be taken in gym to save commute overhead. I will arrive home around 8:30 in the evening.

    Current result from speech sdk:
    RECOGNIZED: Text=I…

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  15. Fluency format of recognized result

    from https://github.com/Azure-Samples/cognitive-services-speech-sdk/issues/598#event-3275944556

    Suggest add fluency format for scenario like formal meeting transcription, translation, etc. , which does not expect spoken text forms.

    E.g. :

    Utterance: "i want to ah, to book a flight to Denver, i mean, to Boston, the day, the day after, after Monday. "

    RECOGNIZED: "I want to are to book a flight to Denver. I mean to Boston the day, the day after after Monday."

    Expected: " I want to book a flight to Boston the day after Monday."

    Thank you.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  16. Adding custom headers to speech to text Websocket requests

    Adding the ability of adding custom headers to the speech to text sdk so that the intermediate servers can verify the headers to authenticate and authroize . This is required for container versions

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  17. Azure AD Authentication

    Support for authenticating to the Service using Azure AD to allow an alternative to keys.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  18. Language Support for Greek

    Is Greek on the roadmap? Please let me know when it is planned. If not, please add it.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  19. About the display of the character acquired by SpeechToText (SpeechSDK)

    The results obtained by the SpeechRecognizer's Recognized event are not broken by punctuation marks, and sentences are connected even if the speaker changes.
    Therefore, it is not possible to know the timing of the change of the speaker.
    I want you to improve it so that an event occurs for each punctuation mark.

    38 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
  20. Show multilanguage translations on a single screen during a presentation

    Hello,
    I have the following use case (international wedding): The presenter speaks in French. I would like to show the German and Catalan live translations on a single screen for the audience. Is this possible? I know that there is the conversation feature readily available, but not everybody in the audience has a smartphone.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Speech to Text  ·  Flag idea as inappropriate…  ·  Admin →
← Previous 1
  • Don't see your idea?

Feedback and Knowledge Base