Microsoft

Computer Vision API

Welcome to the Computer Vision API Forum

Categories

API – Any ideas or feedback pertaining to features or enhancements to Computer Vision API.

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Request – Let us know if you would like to see a Code sample or SDK provided.

Custom/Sample Images – Have an image you’ve tested and not getting the results you are seeking? Upload the image and describe the information or tags you would like to be included.

How can we improve Computer Vision API?

(thinking…)

Enter your idea and we'll search to see if someone has already suggested it.

If a similar idea already exists, you can support and comment on it.

If it doesn't exist, you can post your idea so others can support it.

Enter your idea and we'll search to see if someone has already suggested it.

  • Hot ideas
  • Top ideas
  • New ideas
  • My feedback
  1. The PDF coordinate system needs some work

    The Read api (https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/2afb498089f74080d7ef85eb) does not handle PDF word coordinates correctly.

    Did not see a way to report the bug on the API page, so I am reporting it here.

    I am sending it a 8.5x11 page and getting coordinates in inches that are outside the box.

    The width is marked as 10.9833 and height as 8.4567, however it is getting words outside that box:

    12.3419
    0.4349
    13.2572
    0.4294
    13.2715
    0.501
    12.3562
    0.5065

    When I print the coordinates to the page some documents are spot on the the x coordinate, but roughly half as far down the page…

    4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  2. Improve accuracy for combination of numbers and special characters

    When i try to detect license number from image of pattern 12/234232/3454, the slash symbol is recognized as 1.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  3. Information on Computer Vision Image and Data Retention

    People need to know what you are doing with their images during and after processing. Please have a clearly stated policy on this.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Documentation  ·  Flag idea as inappropriate…  ·  Admin →
  4. i need to find some kind of objects like cars or phones

    i want to filter the search about a object or something in the image

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Object Detection  ·  Flag idea as inappropriate…  ·  Admin →
  5. Recognize Vancouver Olympic Torch in photo

    Photo of landmark in Vancouver, Canada is incorrectly identified as "a plane parked on the side of the road" - it should at least be identified as a monument or sculpture.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  6. Please provide the taxonomy for tagging and taxonomy used for object detection in the documentation.

    Please provide the taxonomy for tagging and taxonomy used for object detection in the documentation.

    It would fine if it is not an exhaustive list. This taxonomy would be more helpful to users to set predefined conditions for objects or tags.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Documentation  ·  Flag idea as inappropriate…  ·  Admin →
  7. Generate Human friendly operationId in swagger

    Almost all tools that can generate API Client out of Swagger API defenition, will use OperationId to generate method name in a client. In your swagger API, OperationIds looks like "56f91f2e778daf14a499e1fa", and in C# it will produce method name public System.Threading.Tasks.Task 56f91f2e778daf14a499e1faAsync();
    It's extremely hard to recognize what method doing, based on this operationID

    Please, modify operationIds in a swagger documentation https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/export?DocumentFormat=Swagger&ApiName=Computer%20Vision%20API%20-%20v2.0 to make it human and code generator tools friendly.

    P.S. I got this swagger file from https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/587f2c6a154055056008f200

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Documentation  ·  Flag idea as inappropriate…  ·  Admin →
  8. Not able detect all the character accuratly with Printed API

    I am trying to upload the identity proof and trying to verify with the user input. Some characters are replaced with other characters. like 1 is replaced with 7 ... so the date of birth will change..

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  9. Extract table data from image with table structure

    The main purpose of this idea is to take the table structure out of the image when the user selects a part of the image.

    14 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  10. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Web UI  ·  Flag idea as inappropriate…  ·  Admin →
  11. Recognize halogram in picture

    When analyzing regular pics vs. Pics with Halograms embedded inside

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  12. Computer Vision API doesn't recognize clear text surrounded by @

    Computer Vision API doesn't recognize very clear and obvious text surrounded by @ character

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  13. API do find duplicates in Images

    Function to search in whole Blob with Images to recognize duplicated images.

    4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  14. How far has the API gone with trying to detect people from the back view?

    Can the API detect humans even from a back view. It might not be very accurate just like humans but after you have lived with someone else for a long time there is a high chance that you can guess who they are even from a back view. Therefore was wondering if the computer detected multiple images of people even from back view whether it would be able to guess.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Face Detection  ·  Flag idea as inappropriate…  ·  Admin →
  15. Make detecting hollow text better

    Trying to find text on an image with hollow text doesn't return the right text results from the image and sometimes returns none at all.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  16. Add A Quickstart Guide.

    The guide for recognizeText is not available for php. So I would like to add one. How should I add?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Samples & SDK Request  ·  Flag idea as inappropriate…  ·  Admin →
  17. Image Compare

    Feature like to compare the image with master image (Good Image) and tell a difference between two images.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  18. Increase file size limit from 4MB

    Could we have the API File size limit increased? currently we work with a lot of imagery with high resolution images that average ~8MB.

    Can the Size limit be increased

    12 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  4 comments  ·  Service Limits  ·  Flag idea as inappropriate…  ·  Admin →
  19. Add ability detect checkboxes or radio buttons to OCR and Handwritten text

    I've successfully been able to use vision to extract handwritten text out of fixed forms by knowing the coordinates of each form field. However many of my forms have checkboxes and/or radio buttons that the users will be filling in with pen. It doesn't seem that vision has a way to detect this type of content.

    6 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  3 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  20. Compare two images and return confidence and heatmap of difference areas

    You have two images like tax forms. Compare the two images. Return a confidence they are the same. Return a set of cords for the image where the primary differences occur. PDF as well as jpg and bmp support would be nice.

    7 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  2 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
← Previous 1 3 4 5
  • Don't see your idea?

Feedback and Knowledge Base