Microsoft

Form Recognizer

  1. What is the best way to train 1k different vendor invoices for custom model train? as there is a limit of 500 pages per model id

    What is the best way to train a model having 1k vendor invoices, as there is a limit of 500 pages.
    As per the limit mentioned we can max 100 vendor invoices per model id.

    Can you please put some light on this from the implementation perspective ?

    5 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  2. Process multiple documents contained in a single file

    I’m working on a project to create custom Machine Learning models for the processing of invoices using Azure Form Recognizer and OCR Form Labeling Tool.

    In general it’s working really well, however, I have a situation where I have more than one invoice in the same PDF file. In is current form, the OCR Form Labeling Tool & Azure Form Recognizer don’t handle this situation very well. I‘m wondering if there are any tips or guidelines for this situation or will this use case be covered in the future?

    Any guidance would be greatly appreciated!!

    7 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    3 comments  ·  Flag idea as inappropriate…  ·  Admin →
  3. How to label a table in a form?

    In the Azure form recognizer official website "https://azure.microsoft.com/en-in/services/cognitive-services/form-recognizer/" few examples have tables in the sample file. In output also we have an attribute called table in sample json. Please guide us in labeling a table in a custom layout form.

    26 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    5 comments  ·  Flag idea as inappropriate…  ·  Admin →

    Form Recognizer discovers and extracts tables automatically. Table results are part of the pageResults section in the JSON output. If the table in the form was not discovered you can label tables a values by labeling each table cell and training with the maximum number of rows in the tables. Form Recognizer does not yet support labeling tables as tables.

  4. Auto recognise wich custom model to use.

    Hi
    Woult it be posible to auto detect wich custom model to use? In a normal invoicing flow you would have one big pile of different invoices and send Them to be recognizer.
    Then it would sort Them in 2 piles one wher a model was found and one without.
    Do you understand what i Mean by that?

    11 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    6 comments  ·  Flag idea as inappropriate…  ·  Admin →
  5. What's the best way to train a table without line? (See Pic)

    I know Form Recognizer does not yet support labeling tables as tables. Suggestions please.
    Please see pic1 for reference. Number of Line Items may vary at large.
    And, What can we do if some text is not recognized by the OCR labeling tool see pic1 ?

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  6. labeling tool improvements


    • Add regex option to form tag format. Let users specify what is expected result and help FR filter out text that does not belong to expected value. validate result with regex and adjust confidence score accordingly.


    • modify recognized field in training form. Sometimes recognized value is incorrect and contains additional text. I would like to be able to flag and correct this value so FR would better recognize this type of mistake


    • Add versioning to trained models. Models are immutable, but they belong to a same project. Show a list of previously trained models, so users could switch between them…

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  7. Limit processing to first X pages

    We should be able to use a setting to only analyze the first X pages of forms.
    I.e. we have forms related to phone bills, which the first 2 pages have relevant info then there is 20+ pages of just call records which we do not want to analyze.
    We should be able to tell the analyze forms function to only process the first X number of pages.

    Especially in the power automate function.

    6 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Planned  ·  0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  8. OCR validate text detection

    Often, OCR detects handwritten text incorrectly.

    For example:

    "Bridget Sims, MD" was detected at "Bridge+ Sims, MD"

    There should be a way to correct this and enter in the correct value of the text detected as "Bridget Sims MD" after the OCR has done its work.

    Is there a way to do this already?

    4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  9. selectionMark recognition not working properly

    I'm using forms with lots of check box fields (ACORD 28). With the 2.1 beta I'm finding that the labeling tool does not always recognize a check box (e.g. selectionMark) as a check box. Many times it will simply think it's a text field and will not allow me to identify it as a selectionMark type. The reverse is also true. Sometimes if I try to label a field as String it will tell me the type is not compatible because it assumes it's a selectionMark. I think you need to take a different design approach for selectionMark labeling and…

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  10. Recognize Checkbox and Signature sections

    We have a number of paper forms that are completed by staff. The forms include checkboxes. We need to confirm that the checkboxes have been checked.

    Additionally, we require the form to be signed by the person completing it. We'd like to confirm that a mark has been made in the signature area.

    It would be great if there was a way that Form Recognizer could just register that a pen mark has been made in a certain area, even if it can't read it (e.g. a signature or a tick in a box)

    37 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    8 comments  ·  Flag idea as inappropriate…  ·  Admin →
    Planned  ·  Winona Azure responded

    Thank you for the request. This checkbox feature is being planned for the next release.

  11. Publish sample C# client code that compiles and works

    When I copied and pasted the sample C# client code snippets from the Quickstart documentation, the code doesn't even compile. Please share a simple working C# client program via github so we don't have to copy/paste bits and pieces and we don't have to struggle with compiler errors. The sample should work as long as the FORMRECOGNIZERENDPOINT and FORMRECOGNIZERKEY environment variables are set to valid values. [Oh, please make this textarea box resizable]

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  12. Allow to add a name to model when training

    Hello,

    it would be nice to allow to name the model when training.. will allow to not waist time saving id ect

    4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  13. Can't create a project

    Hello, I followed the tutorial here: https://docs.microsoft.com/en-us/azure/cognitive-services/form-recognizer/quickstarts/label-tool

    I setup a Azure web app (ACI) to run the docker image according to this tutorial: https://docs.microsoft.com/en-us/azure/cognitive-services/form-recognizer/deploy-label-tool#deploy-with-azure-container-instances-aci

    I setup the connection. Then when I went to create the project it kept giving me this error and I have no idea what is wrong? I tried removing all spaces and non alpha characters from project name but nothing seems to work :(

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  14. Cognitive Service Container with V2.0 Support

    The containers provided are with V1.0 API and all limitations ( no Layout API, 4Mb Dataset for training, ...).
    When containers will be updated for V2.0 API ?

    Thx

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Planned  ·  4 comments  ·  Flag idea as inappropriate…  ·  Admin →
  15. Add lasso selection functionality to Form Labeling tool

    It would be really nice to have a "Lasso" selection functionality in the form labeling tool. I know I can hold the left mouse button down to highlight multiple words but there are some areas of forms (e.g. remarks) where we have several hundred words that have to be selected. They usually are in a rectangle shape so hence a lasso selection would work perfectly.

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Planned  ·  0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  16. Old labeling tool vs the new one

    The original forms labeling tool (VoTT) creates vott files and the new one (OCR Form Labeling Tool) uses fott files. I labeled and trained a bunch of forms using the original labeling tool. Is there a way for me to migrate those vott files to the fott file format?

    It seems kind of strange you would switch labeling tools without providing any guidance on how to migrate their trained forms.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Flag idea as inappropriate…  ·  Admin →
  17. Reinforcement learning

    whenever something is wrong I want the model to learn

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  18. CSV as output

    Please add feature to obtain table data as CSV which makes it easier to use ETL tools like ADF for loading data into relational database

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  19. Use My Own OCR to capture other languages? Or when will support Simplified Chinese.

    Can i use my own ocr service to capture text ? Cause many languages are not supported and there is no schedule.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  20. Form with tables, checkboxes, and whole lot of square boxes

    I have trained my form recognizer model using all the requirements and tips provided in documentation. However, the output is not good at all.
    The form that I am trying to train on is attached (empty version). I have used 2 filled in forms as well as per requirements.
    The form has two small tables at the top - even these haven't been read correctly. And then there are whole lot of square boxes (representing one character) to fill in details of the users. The boundaries of these square boxes are being read either as 1 or as a dash…

    12 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    3 comments  ·  Flag idea as inappropriate…  ·  Admin →
← Previous 1 3 4 5
  • Don't see your idea?

Form Recognizer

Categories

Feedback and Knowledge Base