Microsoft

Computer Vision API

Welcome to the Computer Vision API Forum

Categories

API – Any ideas or feedback pertaining to features or enhancements to Computer Vision API.

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Request – Let us know if you would like to see a Code sample or SDK provided.

Custom/Sample Images – Have an image you’ve tested and not getting the results you are seeking? Upload the image and describe the information or tags you would like to be included.

How can we improve Computer Vision API?

(thinking…)

Enter your idea and we'll search to see if someone has already suggested it.

If a similar idea already exists, you can support and comment on it.

If it doesn't exist, you can post your idea so others can support it.

Enter your idea and we'll search to see if someone has already suggested it.

  • Hot ideas
  • Top ideas
  • New ideas
  • My feedback
  1. Posts a locally stored JPEG image to Microsoft Computer Vision in java

    How to send a local image instead of URL to Microsoft Computer Vision API using JAVA

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Samples & SDK Request  ·  Flag idea as inappropriate…  ·  Admin →
  2. Symbols or individual characters not recognized.

    MS Handwriting recognition is best-in-class in reading poor handwriting, but seems algorithm chooses not to recognize individual symbols or characters by themselves. All text is not written in full words.

    The following examples never process the same symbol correctly on successive lines. Symbols, or even "0" or "o" are ignored when left by themselves.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  3 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  3. Provide a way to migrate an existing custom vision application (at customvision.ai/projects) into the Azure portal

    I have trained a project at customvision.ai/projects. How can I migrate / import this to my production Azure subscription?

    4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Documentation  ·  Flag idea as inappropriate…  ·  Admin →
  4. Include DBpedia id in Celebrity and Landmark Models

    To disambiguate similar names, include a DBpedia graph id in the response. This will help more with categorizing results and using the data in more meaningful ways.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  5. Add a unique id to recognized celebrities

    To avoid homonymy problems, it would be very useful to add a unique identifier to recognized celebrities in the api response

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Face Detection  ·  Flag idea as inappropriate…  ·  Admin →
  6. 5 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Service Limits  ·  Flag idea as inappropriate…  ·  Admin →
  7. Add support for tires/wheels/rims

    I went online and got a bunch of random pictures of cars and piles of tires, etc.

    Each time I did a search, I get 0 tags relating to tires, wheels, rims, etc.

    Google Cloud Vision does detect these items though, with the following tags:
    tire
    alloy wheel
    automotive tire
    rim

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  8. How could I use http post enquery of C languege to get the Computer Vision API ?

    postRequest =(String)("POST ") +"/vision/v1.0/analyze?https://upload.wikimedia.org/wikipedia/commons/1/12/Broadway_and_Times_Square_by_night.jpg HTTP/1.1\n"
    + "visualFeatures: Categories,Description,Color"
    + "language: en\n"
    + "Content-Type: application/json"
    + "Ocp-Apim-Subscription-Key: {mysub key}\n"
    + "Host: westus.api.cognitive.microsoft.com\n"
    + "Content-Length: 152\r\n\r\n";

    As shown above, I'll get an error number of 401,which means Access denied due to missing subscription key , how do I organize the content of my post request ?

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Resolved  ·  1 comment  ·  Samples & SDK Request  ·  Flag idea as inappropriate…  ·  Admin →
  9. Pattern definition

    We have some standard forms that are in handwritten format but the pattern of the text is always the same (I think this can apply to a large number of scenarios). Allowing the handwritten recognizer to be configured with a expected pattern via a simple regex could improve a lot of scenarios.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  10. Offer relative position in 3D-space between two or more images

    If I take two photos of a subject, I would love to be able to know where the second photo was taken in real space relative to the first. In the example of handheld smartphone photography, if the first image is a baseline then I would love to know the xyz offset of the second image. Even if I believe that I'm holding a smartphone in the same position without moving it, it's likely moving at least an inch in multiple dimensions just due to hand shake. However, I might also take a side profile and it would be just…

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  11. Offer subject pose dection

    While head tilt angle is useful for face detection, it would be incredible to see you support standard 25 (body, no fingers) and 65 (body w/fingers) bone placement for full-body images.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  12. Offer subject foreground/background segmentation

    Being able to reliably pull out complex bounding areas would be a game changer, ideally if the classifier had an understanding of human proportion and dimension. eg. if someone is wearing a white shirt against a white background, or even if they have white buttons, you shouldn't carve a hole through the person.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  13. Increase face landmark density

    Currently using Dlib facial detection which returns 68 facial landmarks vs Azure's 27. More is better; this is proving to be a political issue with the system architect.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Face Detection  ·  Flag idea as inappropriate…  ·  Admin →
  14. OCR Failures

    1. I'm getting no regions back from about half of the of images I submit for recognition in the attached example file.

    2. On full document scans, I often see the returned json resembling this:
    dwoo ,4.Ed co c Ex—co cc äö30a-oF— Z U) r-x.l (DC—a- 0 00 o o O O

    Can anyone suggest possible causes / solutions to either of these issues?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  15. Identify report format

    There is a need to use Computer Vision/OCR with report images. For that to work well it would be great if the API could make a guess/determination on the layout of the report. Today OCR returns boundingBox information which is helpful but if the API would identify field names from data values as well as header vs. detail fields that would be amazing!

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  16. recognize a barcode or shipping label tracking code for any shipping vendor, and provide a link to the most likely vendor

    Every shipping company and manufacturer chooses a different barcode that supports their needs, but they are not interchangeable, which is by design for automation, but we need to read all the barcodes and categorize them because we cant train our users to figure out which one to scan.

    6 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  17. Regional availability - Please add support for Taiwan region

    Please add support for Taiwan region. Thanks

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  18. Font Size & Weight

    For each word found, note the size of the font as a percentage relative to the largest font found within the image. Or some other similar means. This would allow users to find headings, sub-headings, etc...

    The font weight (boldness and contrast) of each word would also help to identify headings and important text relative less important text within the image.

    4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  19. School Students Handwriting not able to Recoginze .

    Hi, I am from Bangalore. I tried few students answer sheet using Computer Vision API. Most of the Answer sheet handwriting not able to recognise by API. Can you improve the algorithm or let us know how to train the data. Thank you.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  20. Vision fails to return tag for specific image

    I encountered a strange issue using vision when trying to retrieve tags. This post works as expected:
    POST https://westcentralus.api.cognitive.microsoft.com/vision/v1.0/analyze?visualFeatures=Categories,Tags&details=Celebrities&language=en HTTP/1.1
    Content-Type: application/json
    Host: westcentralus.api.cognitive.microsoft.com
    Ocp-Apim-Subscription-Key: ••••••••••••••••••••••••••••••••

    {"url":"https://image.shutterstock.com/z/stock-photo-portrait-of-a-man-holding-a-cute-mixed-breed-dog-isolated-over-white-99972185.jpg"}

    but this image does not return any tags: https://thumbs.dreamstime.com/z/man-holding-unique-dog-2045365.jpg

    There is no error returned. Any ideas what is going on?

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  2 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  • Don't see your idea?

Feedback and Knowledge Base