Microsoft

Form Recognizer

  1. 1 vote
    0 comments
  2. Method of confirming prediction as correct or incorrect

    I uploaded 5 documents: 2 batches of similar files. However, the model's confidence is very low. When I upload files, though, the model works correctly. I'd love to be able to confirm that the model's prediction was correct and have the document automatically added to the training set, so that I don't have to manually tag each of the fields for that document.

    3 votes
    0 comments
  3. The folder path for the container may not be working.

    I could not get tagging to work with JPGs stored in a folder in the container, using the same folder name in the project in the Docker image.
    Is there a GitHub project for this?

    2 votes
    0 comments
  4. Old labeling tool vs the new one

    The original forms labeling tool (VoTT) creates vott files and the new one (OCR Form Labeling Tool) uses fott files. I labeled and trained a bunch of forms using the original labeling tool. Is there a way for me to migrate those vott files to the fott file format?

    It seems kind of strange that you would switch labeling tools without providing any guidance on how users can migrate their trained forms.

    3 votes
    2 comments
  5. Label screen has spinning cursor, never completes

    I pulled the latest Docker image on April 27th, on two separate computers and two separate subscriptions/tenants. I can create the connection and the project, but upon project save, it moves to the label screen and seems hung.

    I have tried the hosted Docker image and the local one. Both act the same, as do my corporate and personal Azure accounts. Any idea what I am doing wrong?

    1 vote
    2 comments
  6. Differentiate string entities

    Hi, at the moment, labels can be string / date / number. It would be better to differentiate string types like person / address / ... This is possible in the TextAnalyze service.

    To improve our training, does Form Recognizer use these features to extract labels:
    - NLP? (word types: adjectives, verbs, ...)
    - words before / after the label?
    - word case? (first letter capitalized / all upper-case / ...)
    - text position in the document? (top right / center / ...)

    Thanks for the info

    1 vote
    Under Review  ·  0 comments
  7. Increase the total size of the training data set

    The current total size of the training data set can be up to 500 pages. I'm planning to use this service for thousands of my customers that have multiple types of forms. Is the support of large training data sets something that is in your scope for the near future?

    2 votes
    0 comments
  8. I can't make a project, but I don't know what it means.

    I can't make a project, but I don't know what it means.

    Both API keys and endpoints are correct.
    The same steps worked fine last week.

    Let me know if anyone knows how to handle it.

    1 vote
    1 comment
  9. Add splitbar to Form Labeler

    Long tag names exceed the visible area. It would be nice to be able to expand the tag name area via a splitbar.

    1 vote
    0 comments
  10. Use higher contrast highlighting in the Form Labeling tool

    Using different shades of green for words that are selected vs just highlighted from OCR does not provide enough visual indication. For those of us that are color blind it's difficult to see the subtle difference. You might want to check out Microsoft's Accessibility Insights tool.

    1 vote
    0 comments
  11. Add lasso selection functionality to Form Labeling tool

    It would be really nice to have a "Lasso" selection functionality in the form labeling tool. I know I can hold the left mouse button down to highlight multiple words, but there are some areas of forms (e.g. remarks) where several hundred words have to be selected. These areas are usually rectangular, so a lasso selection would work perfectly.

    8 votes
    Planned  ·  0 comments
  12. Return the document results in form order

    It would be nice if the Get Custom Model and Get Analyze Form Result APIs returned the fields/documentResults arrays in the order in which the fields appear on the form (e.g. top/down, left/right). Currently Get Custom Model returns the fields in alphabetical order and Get Analyze Form Result returns them in a random order. My forms have a few hundred fields on them, so it takes my consumers a while to figure out the form mappings.
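Until the service returns fields in form order, a consumer could sort them client-side. The following is only a rough sketch: it assumes each field is a dictionary with a `page` number and a `boundingBox` of eight numbers (`x1, y1, ..., x4, y4`), which may not match every API version. The function name `fields_in_form_order` is hypothetical.

```python
def fields_in_form_order(fields):
    """Return field names sorted by page, then top-to-bottom, then left-to-right.

    `fields` is assumed to map a field name to a dict with "page" and
    "boundingBox" ([x1, y1, x2, y2, x3, y3, x4, y4]). Fields that are
    missing or have no bounding box are skipped.
    """
    def position(item):
        _, field = item
        box = field["boundingBox"]
        top = min(box[1::2])    # smallest y coordinate of the four corners
        left = min(box[0::2])   # smallest x coordinate of the four corners
        return (field.get("page", 1), top, left)

    located = [
        (name, field)
        for name, field in fields.items()
        if field and field.get("boundingBox")
    ]
    return [name for name, _ in sorted(located, key=position)]
```

Sorting strictly by the top coordinate can misorder fields that sit on the same visual row but differ by a fraction of a unit, so a real implementation would probably round or bucket the `top` value first.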

    1 vote
    1 comment
  13. In addition to bounding box, include area, height, and width with Analyze Layout result

    The Analyze Layout API call currently provides the bounding box as a result for each line of text. While this information is useful, it always requires the consumer to perform additional processing to make it useful.

    Consider directly providing the key calculations as part of the service result to reduce redundancy on the consuming side.

    Provide the following values:


    • Area of the bounding box

    • Height of the bounding box

    • Width of the bounding box

    At a minimum, consider providing the height and the width as the area can be easily calculated from that.

    The height of the bounding box can…
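For reference, the additional processing described above might look roughly like this on the consumer side. This is a sketch under an assumption: the bounding box is taken to be eight numbers (`x1, y1, x2, y2, x3, y3, x4, y4`) listing the four corners clockwise from the top-left. The helper name `box_metrics` is hypothetical.

```python
import math


def box_metrics(bounding_box):
    """Compute width, height, and area of a four-corner bounding box.

    Assumes corners are ordered clockwise starting at the top-left:
    [x1, y1, x2, y2, x3, y3, x4, y4]. Works for rotated boxes too.
    """
    points = list(zip(bounding_box[0::2], bounding_box[1::2]))
    top_left, top_right, _, bottom_left = points

    width = math.dist(top_left, top_right)      # length of the top edge
    height = math.dist(top_left, bottom_left)   # length of the left edge

    # Shoelace formula: valid for any simple quadrilateral, not only rectangles.
    twice_area = sum(
        points[i][0] * points[(i + 1) % 4][1] - points[(i + 1) % 4][0] * points[i][1]
        for i in range(4)
    )
    return {"width": width, "height": height, "area": abs(twice_area) / 2}
```

The area is computed with the shoelace formula rather than `width * height` because a skewed scan can produce a quadrilateral that is not a perfect rectangle.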

    1 vote
    2 comments
  14. Add support for placing the layout analysis result directly into Azure Storage or Azure Queues

    The current model for using the Analyze Layout service endpoint requires polling to retrieve the completed analysis result.

    This model is inefficient and resource intensive on the application side. Consider a case when using Azure Functions: this would require multiple Function invocations to check if the analysis is completed.

    This can be partially alleviated by incorporating Shared Access Signature-style functionality so that clients can directly query the Get Analyze Layout Result endpoint, OR by providing a parameter that allows the caller to specify an endpoint where the resultant JSON can be stored as a file.

    For example, add two…
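To illustrate the polling model being criticized, here is a minimal sketch of the loop a client ends up writing. It is not the service's SDK: `fetch_result` is a hypothetical callable that performs the Get Analyze Layout Result request and returns the parsed JSON, and the terminal status values `"succeeded"` and `"failed"` are assumed.

```python
import time


def poll_until_done(fetch_result, initial_delay=1.0, max_delay=30.0,
                    max_attempts=10, sleep=time.sleep):
    """Call fetch_result() until it reports a terminal status.

    Returns (result, attempts). Waits between attempts with exponential
    backoff capped at max_delay. Raises TimeoutError if no terminal
    status is seen within max_attempts calls.
    """
    delay = initial_delay
    for attempt in range(1, max_attempts + 1):
        result = fetch_result()
        if result.get("status") in ("succeeded", "failed"):  # assumed values
            return result, attempt
        sleep(delay)
        delay = min(delay * 2, max_delay)
    raise TimeoutError(f"analysis not finished after {max_attempts} attempts")
```

Every pass through the loop is one more request (or, in the Azure Functions case mentioned above, potentially one more invocation), which is the overhead a push-to-storage or push-to-queue option would avoid.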

    1 vote
    0 comments
  15. Allow one-time use keys or shared access signature for direct submission

    Currently, there is no built-in mechanism to support one-time use access keys for clients to submit files directly from the client endpoint.

    Form Recognizer should incorporate functionality similar to Azure Storage Shared Access Signature keys, or provide a mechanism for issuing one-time use keys, so that clients can submit files directly to the server and bypass additional, superfluous server-side processing.

    Currently, the model requires some additional processing layer to inject the secret APIM key. This is very much a waste of resources and application throughput as it is merely a mechanism of securing the…

    2 votes
    0 comments
  16. Auto-recognise which custom model to use.

    Hi
    Would it be possible to auto-detect which custom model to use? In a normal invoicing flow you would have one big pile of different invoices and send them to be recognized.
    Then it would sort them into 2 piles: one where a model was found and one without.
    Do you understand what I mean by that?
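The two-pile flow described here could be approximated on the client side, at the cost of one analysis call per model per document. The sketch below is purely illustrative: `analyze` is a hypothetical callable that runs a document against one custom model and returns a confidence score between 0 and 1, and the `min_confidence` threshold is an arbitrary assumption.

```python
def sort_into_piles(documents, model_ids, analyze, min_confidence=0.5):
    """Split documents into (matched, unmatched) piles.

    For each document, every model is tried and the highest-scoring one
    is kept. A document is "matched" only if that best score reaches
    min_confidence; matched entries are (document, model_id) pairs.
    """
    matched, unmatched = [], []
    for doc in documents:
        best_model, best_score = None, 0.0
        for model_id in model_ids:
            score = analyze(model_id, doc)  # hypothetical scoring call
            if score > best_score:
                best_model, best_score = model_id, score
        if best_model is not None and best_score >= min_confidence:
            matched.append((doc, best_model))
        else:
            unmatched.append(doc)
    return matched, unmatched
```

Trying every model for every document scales poorly, which is part of why built-in model detection in the service would be preferable.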

    11 votes
    6 comments
  17. Analyze results: missing labels in the result (label/value pairs)

    I have trained the model using labels.
    In the result, I don't get something like

    label 1, value
    label 2, value

    Is there any way to get these label and value pairs?

    1 vote
    1 comment
  18. Data Type for labels

    Ability to specify a data type for a label, and have the recognizer look for that data type's formatting when extracting text for that field.

    1 vote
    1 comment
  19. Recognizing form regions

    At the moment we don't have features like recognizing checkboxes, radio buttons, signatures, etc.

    Can Form Recognizer recognize a region of the form, so we can cut that part out and pass it on for further processing?

    For example, I would like to know where on the document the radio button questions are, so that I know in which area I need to do custom processing.

    6 votes
    2 comments
  20. Separate handwritten text from printed text

    Occasionally, OCR puts together printed and handwritten text as a single tag. I would like them to be separate.

    For example:

    Company Handwritten company name

    Printed text is usually the label while the handwritten text is what we are looking for.

    6 votes
    Planned  ·  0 comments