Microsoft

Form Recognizer

  1. Add the ability to train a model with a blob larger than 4mb

    I am trying to use Microsoft Form Recogniser to get the key value pairs from medical forms. However there are a number of different types of form and cannot currently train an accurate enough model to use with the current size limit.

    10 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  2. Define a key/value when its wrong.

    I make several test and works very fine but in some cases the key value is wrong even the key is on de left of the value, it would be nice to specify the key when its wrong to train a better model.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  3. Handwritten forms

    We need an ability to extract key-value pairs from a form which is hand filled or hand written. For example - an insurance claim form which is filled in by a person or an agent. Is some work being done on this ?

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  4. Form with tables, checkboxes, and whole lot of square boxes

    I have trained my form recognizer model using all the requirements and tips provided in documentation. However, the output is not good at all.
    The form that I am trying to train on is attached (empty version). I have used 2 filled in forms as well as per requirements.
    The form has two small tables at the top - even these haven't been read correctly. And then there are whole lot of square boxes (representing one character) to fill in details of the users. The boundaries of these square boxes are being read either as 1 or as a dash…

    12 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    3 comments  ·  Flag idea as inappropriate…  ·  Admin →
  5. Form Recognizer doesn't work with Japanese characters even with very basic files.

    The attached example is used for training, but the trained model is not able to segregate keys and values. All the keys are set-up to "Tokens" value ( eg: "0": [ "Tokens" ])

    Analyze Forms component will only extract some of the data, and will anyway allocate all of them on only one key.

    Is there a version of Form Recognizer that works better with Japanese characters ? I am currently using the one on West Europe.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  6. Does Form Recognizer work with files in Blob Storage

    Having an issue with the QuickStart for Form Recognizer .NET SDK.

    The example works great when I feed it a PDF file that is local to my PC. I want to pass it a PDF file that exists in Azure Blob Storage so I modified the example to use a MemoryStream variable instead of a FileStream. This give me a "415 - UnsupportedMediaType" error message even though the same file works correctly if I analyze it from my local PC.

    What am I missing? Does Form Recognizer support analyzing files from blob storage?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  7. Better detection of field values wihtin character boxes

    Our forms (or our client forms, which we have no control over) use the style of value field where there are a series of boxes, and every character must be entered into a box. I believe these are designed for template-based form recognition, but this seems to be a big problem for Form Recognizer.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  8. Recognize Checkbox and Signature sections

    We have a number of paper forms that are completed by staff. The forms include checkboxes. We need to confirm that the checkboxes have been checked.

    Additionally, we require the form to be signed by the person completing it. We'd like to confirm that a mark has been made in the signature area.

    It would be great if there was a way that Form Recognizer could just register that a pen mark has been made in a certain area, even if it can't read it (e.g. a signature or a tick in a box)

    37 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    8 comments  ·  Flag idea as inappropriate…  ·  Admin →
    Planned  ·  Winona Azure responded

    Thank you for the request. This checkbox feature is being planned for the next release.

  9. Increase Timeout on Training using PDF and Logic Apps

    When I use logic apps to train a model, it seems to timeout when I use more than 5 PDFs (I used 10). 5 PDFs seemed to work at 52 seconds.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →

    Form Recognizer Logic Connector currently uses the Form Recognizer v1.0 API which is a sync API and has a HTTP timeout limit. Form Recognizer v2.0 (preview) is an Async API and does not have this timeout limit. You can create a custom Logic App connector to Form Recognizer v2.0 (preview) API.

  10. Tips for handling key value pairs from Form Recognizer Json output

    Can you offer any practical pointers, thoughts or guidance on ways to deserialize the Json output please? The structure of the key/value pairs is understandable and has helpful data like BoundingBoxes but it is complex to deserialize?

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  11. Fixed: Microsoft.Azure.CognitiveServices.FormRecognizer.Models.ErrorResponseException: ‘Operation returned an invalid status

    TrainCustomModelAsync: Microsoft.Azure.CognitiveServices.FormRecognizer.Models.ErrorResponseException: ‘Operation returned an invalid status code ‘RequestEntityTooLarge’

    We have followed code examples but we are getting the following error (FR1.png) when calling TrainCustomModelAsync: Microsoft.Azure.CognitiveServices.FormRecognizer.Models.ErrorResponseException: ‘Operation returned an invalid status code ‘RequestEntityTooLarge’

    We are trying to pass a reference to the Training Data that has been populated in Azure (FR2.png). It is a collection of about 5 pdfs. Each with approx 4 pages at 300dpi or so.

    We adapted from the following sample: https://docs.microsoft.com/en-us/azure/cognitive-services/form-recognizer/quickstarts/dotnet-sdk

    Any help is gratefully received.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Flag idea as inappropriate…  ·  Admin →
  12. Add support for Cyrillic

    I would like to see support for Cyrillic-based languages, such as Bulgarian, Russian, etc.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  13. Increase limit of number of samples for custom training

    We understand that the number of samples for custom training is currently limited to 50. We hope that by providing more samples, the model would be more accurate in the end on large volumes, so would it be possible to raise that limit?

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  14. Issue on fields with multiple line headers

    First, congrats to the team for the great new product!
    It works already quite well on the forms we tried, despite small variation of templates for same type of document, which is a great help!
    But at the time being, we can't consider to use it, as it seems to have an issue with fields having headers split on multiple lines. We can provide samples but not on a public forum.

    5 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Flag idea as inappropriate…  ·  Admin →
  15. Manual Training

    Make it trainable (maybe with a UI) where users can manually point or map fields they need. We have tons of different templates of Invoices but we basically need only the main fields on the Invoice recognized, and currently we have no way of pointing out what we need and with the 50 pages\4 Mb per model and we feel very limited.

    Maybe a way of saying Invoice #, or Invoice No. or Inv# should be mapped to Invoice Number for all of the results for this model; this would be of much help when extracting data from different Invoice…

    29 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  16. Return JSON properly formatted

    Currently the JSON response we are getting back is being formated with single quotes instead of double quotes. Can this be changed to double quotes so that programs like VScode see this as properly formated JSON?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Resolved  ·  3 comments  ·  Flag idea as inappropriate…  ·  Admin →
  17. When will form-recognizer support the complex forms?

    Forms usually have complex structures as well and that is where automated services are needed. I have forms that contain merged cells, and nested tables. On documentation of form-recognizer, it is stated that such forms are not supported. Does Microsoft looking forward towards making form-recognizer capable for such forms as well? If yes, any expected time frame?

    17 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  18. How can we improve the accuracy of the form-recognizer model?

    I am using Microsoft form-recognizer service. My forms are bit complex and I tried training a model for them. The performance I achieved is not really good. Is there anyway I can improve this accuracy? Is there anyway to tune this model? I have trained the model using 5 different populated forms of the same type.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  19. Can we train a single model for multiple type of forms?

    I am using Microsoft form-recognizer cognitive service to train a model for forms. My question is that can we train a single model for multiple type of forms? Is it recommended? I have many multiple types of forms and I choose to train a single model for four different types, it get trained but is it a recommended approach?

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  20. Response status code: 408

    Iam using Microsoft form-recognizer service and was able to train a model with 5 forms of the same type and it was successful. I was able to analyze that model on test forms as well, But when I tried training the same forms again, I faced the following error:

    Response status code: 408 Response body: {'error': {'code': '1011', 'innerError': {'requestId': 'fb75e36b-b704-49fe-a9bf-2fecaa71ca44'}, 'message': 'The operation exceeded allowed time limit and was canceled. The common reasons are that the data source is too large or contains unsupported content. Please check that your request conforms to service limits and retry with redacted data…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Resolved  ·  2 comments  ·  Flag idea as inappropriate…  ·  Admin →
  • Don't see your idea?

Form Recognizer

Categories

Feedback and Knowledge Base