Microsoft

Form Recognizer

  1. Allow different formats to be trained in the same model

    Allow different formats to be trained in the same model. Currently, for each format, a new model is created.

    4 votes
    0 comments
  2. Analyzing forms with Fiddler

    This may be obvious to those with more experience, but it took me half a day to figure out. I'm using Fiddler instead of cURL, and when I called the analyze API with a file in the Request Body I got a 415 Unsupported Media Type. The API does not support the multipart/form-data content type, which is what Fiddler defaults to for binary files in the Request Body. The workaround, after selecting "Upload file..." in the Request Body, is to remove the multipart tags from the request body and change the content-type…

    1 vote
    0 comments
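
A minimal Python sketch of the same idea outside Fiddler: send the file's raw bytes as the request body with a single content type, rather than a multipart/form-data envelope. The URL, key, and helper name are hypothetical placeholders. `Ocp-Apim-Subscription-Key` is the usual Cognitive Services key header, and `application/pdf` is an assumption, since the post is cut off before naming the content type. The request is only built here, not sent.

```python
import urllib.request


def build_analyze_request(analyze_url, api_key, file_bytes, content_type="application/pdf"):
    """Build (but do not send) a POST whose body is the raw file bytes."""
    return urllib.request.Request(
        analyze_url,
        data=file_bytes,  # raw bytes only: no multipart boundaries or part headers
        headers={
            "Content-Type": content_type,  # describes the file itself
            "Ocp-Apim-Subscription-Key": api_key,
        },
        method="POST",
    )


# Hypothetical values for illustration; substitute your own analyze URL and key.
request = build_analyze_request(
    "https://example.invalid/analyze",
    "<subscription-key>",
    b"%PDF-1.4 placeholder bytes",
)
```

Passing the request to `urllib.request.urlopen` would send it. The point is that the body starts directly with the file's bytes, which is what the manual edit in Fiddler achieves.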
  3. Model Id is not maintained.

    Train the data using the sample label tool.
    List all custom models using the .NET SDK method GetCustomModelsAsync().
    The trained model ID is not available in the result.

    3 votes
    0 comments
  4. 2 votes
    0 comments
  5. Structure of table data

    First off, I love this tool; the key-value pair extraction is very powerful and really solves a business need.

    Having the ability to label table elements would take this offering to the next level. If you were able to tag columns, e.g. column 1, column 2, the tool could then return each row of the table as an object:
    {"fields": [
        {"Address": {...}, "Name": {...}, "Size": {...}},
        {"Address": {...}, "Name": {...}, "Size": {...}},
        {"Address": {...}, "Name": {...}, "Size": {...}}
    ]}

    2 votes
    0 comments
  6. Release date for V2 including local container support?

    For planning purposes, we need to know when V2 is planned to be released. We also need to use the local container version, so we need to know whether its release date will differ from V2 GA. Ideally, we would like to test the local V2 container prior to GA.

    5 votes
    0 comments
  7. Label tool and form processing

    We did some testing with the examples. In particular, we want to start with the recognition of invoices, so we need only the payment information, amount, … We started with the .NET SDK example and the custom Form Recognizer model, but we are receiving too much unnecessary information. Instead, we are using the sample labeling tool, which helps us tag the necessary fields. Is there a way to upload the training data from the labeling tool in the same way as the custom Form Recognizer model does?

    1 vote
    0 comments
  8. Cognitive Service Container with V2.0 Support

    The containers provided use the V1.0 API, with all its limitations (no Layout API, 4 MB dataset for training, ...).
    When will the containers be updated for the V2.0 API?

    Thx

    5 votes
    3 comments
  9. Roadmap

    Are you able to disclose what the roadmap is for the Form Recognizer product? I'm particularly interested in using the Custom Model and Labelling Tool.

    6 votes
    2 comments
  10. Fix .Net SDK error for Analyze method

    I'm getting the following error when using the analyze method of the SDK:

    AnalyzeResult result = await _formRecognizerClient.AnalyzeWithCustomModelAsync(new Guid(modelId), stream, contentType: "application/pdf");

    In case of HTML form data, the multipart request must contain a document with a media type of - 'application/pdf', 'image/jpeg' or 'image/png'.

    Error code: UnsupportedMediaType
    Status code: 415

    The data in the stream is correct, it contains a small pdf file.

    More code leading up to the error:

    var file = HttpContext.Current.Request.Files.Count > 0 ? HttpContext.Current.Request.Files[0] : null;
    if (file != null && file.ContentLength > 0)
    {
        using (var stream = file.InputStream)
        {
            AnalyzeResult result = await _formRecognizerClient.AnalyzeWithCustomModelAsync(new…

    2 votes
    0 comments
  11. Make it possible to use a download of the pageResult property of the GetAnalyzeFormResult API as an input for the TrainCustomModel API

    Make it possible to use the structure of the pageResult property of the GetAnalyzeFormResult API as the label file data structure for the TrainCustomModel API. This would make it easier to improve the model's key-value associations based on the results of the model.
    I would like to update the content of the pageResult structure and reuse it in a subsequent TrainCustomModel API call.
    In the current version, these are 2 different structures with similar data.

    1 vote
    0 comments
  12. How to label a table in a form?

    On the official Azure Form Recognizer page, "https://azure.microsoft.com/en-in/services/cognitive-services/form-recognizer/", a few of the examples have tables in the sample file, and the sample JSON output also has an attribute called table. Please guide us on labeling a table in a custom layout form.

    20 votes
    3 comments

    Form Recognizer discovers and extracts tables automatically. Table results are part of the pageResults section in the JSON output. If a table in the form was not discovered, you can label tables as values by labeling each table cell and training with the maximum number of rows in the tables. Form Recognizer does not yet support labeling tables as tables.
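
A minimal Python sketch of how the table cells in that output could be regrouped into one object per row on the client side. It assumes each table carries `rows`, `columns`, and a `cells` list whose entries have `rowIndex`, `columnIndex`, and `text`. Treat these field names as assumptions and check them against your actual JSON. It also assumes the first row holds the column headers. The helper names and sample data are hypothetical.

```python
def table_to_grid(table):
    """Place each cell's text at its (row, column) position; missing cells stay empty."""
    grid = [[""] * table["columns"] for _ in range(table["rows"])]
    for cell in table["cells"]:
        grid[cell["rowIndex"]][cell["columnIndex"]] = cell["text"]
    return grid


def rows_as_objects(table):
    """Treat the first row as headers and return one dict per remaining row."""
    header, *body = table_to_grid(table)
    return [dict(zip(header, row)) for row in body]


# Hypothetical table; the last row has no Size cell, so it comes back as "".
sample_table = {
    "rows": 3,
    "columns": 3,
    "cells": [
        {"rowIndex": 0, "columnIndex": 0, "text": "Address"},
        {"rowIndex": 0, "columnIndex": 1, "text": "Name"},
        {"rowIndex": 0, "columnIndex": 2, "text": "Size"},
        {"rowIndex": 1, "columnIndex": 0, "text": "1 Main St"},
        {"rowIndex": 1, "columnIndex": 1, "text": "Ann"},
        {"rowIndex": 1, "columnIndex": 2, "text": "L"},
        {"rowIndex": 2, "columnIndex": 0, "text": "2 Oak Ave"},
        {"rowIndex": 2, "columnIndex": 1, "text": "Bob"},
    ],
}

row_objects = rows_as_objects(sample_table)
```

Cells that span several rows or columns are not handled here; a real table may need extra care for those.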

  13. How to label an electronic generated form for custom training?

    After following the steps in the quickstart at "https://docs.microsoft.com/en-us/azure/cognitive-services/form-recognizer/quickstarts/label-tool", we didn't get all the products tagged as expected (refer to Output.jpg).

    In the output file, the highlighted fields were not detected. Can anyone please suggest a way to tag the data and improve the model accuracy?

    We have trained the model with 10 samples of the same template in different variations.

    For example, one form has 2 products in the Description table, another has 6 products, and so on.

    1 vote
    0 comments

    Can you please try labeling 5 forms with the maximum number of rows you would like to extract? For example, have all labeled data contain 6 rows; then, when analyzing, extract from forms with any number of rows.

  14. Prebuilt receipts doesn't recognize logo'd merchant names

    If a receipt has only the merchant logo and not the merchant name, the prebuilt receipt model returns nothing for the MerchantName.
    For example, Home Depot has their logo and "The Home Depot" in reverse font on their logo but the OCR doesn't pick it up.

    2 votes
    0 comments
  15. Prebuilt receipts doesn't recognize invoice style receipts

    Many businesses have 8.5x11 invoice style receipts (parts stores, rental businesses, service businesses, etc.). The prebuilt receipts model returns no data on these even though they are accepted with no errors.

    2 votes
    0 comments
  16. Support well known tax forms as pre-built form

    Start supporting well-known tax forms, e.g. W-2 and 1099 forms, as pre-built forms so users don't have to create and train custom models for them.

    4 votes
    Planned  ·  0 comments
  17. Add the ability to train a model with a blob larger than 4mb

    I am trying to use Microsoft Form Recognizer to get the key-value pairs from medical forms. However, there are a number of different types of form, and I cannot currently train an accurate enough model within the current size limit.

    10 votes
    1 comment
  18. Define a key/value when it's wrong.

    I ran several tests and it works very well, but in some cases the key-value pair is wrong even though the key is to the left of the value. It would be nice to be able to specify the key when it's wrong, in order to train a better model.

    1 vote
    1 comment
  19. Handwritten forms

    We need the ability to extract key-value pairs from a form that is filled in by hand or handwritten, for example an insurance claim form filled in by a person or an agent. Is any work being done on this?

    3 votes
    1 comment
  20. Form with tables, checkboxes, and whole lot of square boxes

    I have trained my Form Recognizer model following all the requirements and tips provided in the documentation. However, the output is not good at all.
    The form that I am trying to train on is attached (empty version). I have also used 2 filled-in forms, as per the requirements.
    The form has two small tables at the top, and even these haven't been read correctly. Then there are a whole lot of square boxes (each representing one character) for filling in the users' details. The boundaries of these square boxes are being read either as 1 or as a dash…

    9 votes
    2 comments