Microsoft

Form Recognizer

  1. Process multiple documents contained in a single file

    I’m working on a project to create custom Machine Learning models for the processing of invoices using Azure Form Recognizer and OCR Form Labeling Tool.

    In general it’s working really well, however, I have a situation where I have more than one invoice in the same PDF file. In is current form, the OCR Form Labeling Tool & Azure Form Recognizer don’t handle this situation very well. I‘m wondering if there are any tips or guidelines for this situation or will this use case be covered in the future?

    Any guidance would be greatly appreciated!!

    6 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Flag idea as inappropriate…  ·  Admin →
  2. Auto recognise wich custom model to use.

    Hi
    Woult it be posible to auto detect wich custom model to use? In a normal invoicing flow you would have one big pile of different invoices and send Them to be recognizer.
    Then it would sort Them in 2 piles one wher a model was found and one without.
    Do you understand what i Mean by that?

    11 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    6 comments  ·  Flag idea as inappropriate…  ·  Admin →
  3. How to label a table in a form?

    In the Azure form recognizer official website "https://azure.microsoft.com/en-in/services/cognitive-services/form-recognizer/" few examples have tables in the sample file. In output also we have an attribute called table in sample json. Please guide us in labeling a table in a custom layout form.

    20 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    3 comments  ·  Flag idea as inappropriate…  ·  Admin →

    Form Recognizer discovers and extracts tables automatically. Table results are part of the pageResults section in the JSON output. If the table in the form was not discovered you can label tables a values by labeling each table cell and training with the maximum number of rows in the tables. Form Recognizer does not yet support labeling tables as tables.

  4. What's the best way to train a table without line? (See Pic)

    I know Form Recognizer does not yet support labeling tables as tables. Suggestions please.
    Please see pic1 for reference. Number of Line Items may vary at large.
    And, What can we do if some text is not recognized by the OCR labeling tool see pic1 ?

    6 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  5. Limit processing to first X pages

    We should be able to use a setting to only analyze the first X pages of forms.
    I.e. we have forms related to phone bills, which the first 2 pages have relevant info then there is 20+ pages of just call records which we do not want to analyze.
    We should be able to tell the analyze forms function to only process the first X number of pages.

    Especially in the power automate function.

    5 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  6. Recognize Checkbox and Signature sections

    We have a number of paper forms that are completed by staff. The forms include checkboxes. We need to confirm that the checkboxes have been checked.

    Additionally, we require the form to be signed by the person completing it. We'd like to confirm that a mark has been made in the signature area.

    It would be great if there was a way that Form Recognizer could just register that a pen mark has been made in a certain area, even if it can't read it (e.g. a signature or a tick in a box)

    36 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    7 comments  ·  Flag idea as inappropriate…  ·  Admin →
  7. Publish sample C# client code that compiles and works

    When I copied and pasted the sample C# client code snippets from the Quickstart documentation, the code doesn't even compile. Please share a simple working C# client program via github so we don't have to copy/paste bits and pieces and we don't have to struggle with compiler errors. The sample should work as long as the FORMRECOGNIZERENDPOINT and FORMRECOGNIZERKEY environment variables are set to valid values. [Oh, please make this textarea box resizable]

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  8. Allow to add a name to model when training

    Hello,

    it would be nice to allow to name the model when training.. will allow to not waist time saving id ect

    4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  9. Can't create a project

    Hello, I followed the tutorial here: https://docs.microsoft.com/en-us/azure/cognitive-services/form-recognizer/quickstarts/label-tool

    I setup a Azure web app (ACI) to run the docker image according to this tutorial: https://docs.microsoft.com/en-us/azure/cognitive-services/form-recognizer/deploy-label-tool#deploy-with-azure-container-instances-aci

    I setup the connection. Then when I went to create the project it kept giving me this error and I have no idea what is wrong? I tried removing all spaces and non alpha characters from project name but nothing seems to work :(

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  10. Add lasso selection functionality to Form Labeling tool

    It would be really nice to have a "Lasso" selection functionality in the form labeling tool. I know I can hold the left mouse button down to highlight multiple words but there are some areas of forms (e.g. remarks) where we have several hundred words that have to be selected. They usually are in a rectangle shape so hence a lasso selection would work perfectly.

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Planned  ·  0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  11. Old labeling tool vs the new one

    The original forms labeling tool (VoTT) creates vott files and the new one (OCR Form Labeling Tool) uses fott files. I labeled and trained a bunch of forms using the original labeling tool. Is there a way for me to migrate those vott files to the fott file format?

    It seems kind of strange you would switch labeling tools without providing any guidance on how to migrate their trained forms.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Flag idea as inappropriate…  ·  Admin →
  12. Reinforcement learning

    whenever something is wrong I want the model to learn

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  13. Use My Own OCR to capture other languages? Or when will support Simplified Chinese.

    Can i use my own ocr service to capture text ? Cause many languages are not supported and there is no schedule.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  14. Recognizing form regions

    At the moment we don't have features like recognizing checkboxes, radio buttons, signatures, etc.

    Can form recognizer recognize a region of the form, so we can cut that part out and put for further processing?

    For example, I would like to know where on the document are the radio button questions, so that I know in which area I need to do custom processing.

    6 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Flag idea as inappropriate…  ·  Admin →
  15. OCR validate text detection

    Often, OCR detects handwritten text incorrectly.

    For example:

    "Bridget Sims, MD" was detected at "Bridge+ Sims, MD"

    There should be a way to correct this and enter in the correct value of the text detected as "Bridget Sims MD" after the OCR has done its work.

    Is there a way to do this already?

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  16. Add the ability to train a model with a blob larger than 4mb

    I am trying to use Microsoft Form Recogniser to get the key value pairs from medical forms. However there are a number of different types of form and cannot currently train an accurate enough model to use with the current size limit.

    10 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  17. 2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  18. Cognitive Service Container with V2.0 Support

    The containers provided are with V1.0 API and all limitations ( no Layout API, 4Mb Dataset for training, ...).
    When containers will be updated for V2.0 API ?

    Thx

    5 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    3 comments  ·  Flag idea as inappropriate…  ·  Admin →
  19. Roadmap

    Are you able to disclose what the roadmap is for the Form Recognizer product? I'm particularly interested in using the Custom Model and Labelling Tool.

    6 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Flag idea as inappropriate…  ·  Admin →
  20. Separate hand written text from printed text

    Occasionally, OCR puts together printed and handwritten text as a single tag. I would like them to be separate.

    For example:

    Compay Handwritten company name

    Printed text is usually the label while the handwritten text is what we are looking for.

    6 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Planned  ·  0 comments  ·  Flag idea as inappropriate…  ·  Admin →
← Previous 1 3 4 5
  • Don't see your idea?

Form Recognizer

Categories

Feedback and Knowledge Base