Microsoft

Computer Vision API

Welcome to the Computer Vision API Forum

Categories

API – Any ideas or feedback pertaining to features or enhancements to Computer Vision API.

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Request – Let us know if you would like to see a Code sample or SDK provided.

Custom/Sample Images – Have an image you’ve tested and not getting the results you are seeking? Upload the image and describe the information or tags you would like to be included.

  • Hot ideas
  • Top ideas
  • New ideas
  • My feedback
  1. Improve OCR to detect 7-segment display

    The OCR does not detect 7-segment LCD displays that many devices have. It would be a game changer if it worked with this as there are many practical use cases.

    17 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  6 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  2. Improve Computer Vision image analyzer on portraits

    The Image analyzer does not provide enough information to distinguish real photo portraits from black and white drawn portraits or even painted portraits.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  3. Recognize celebrities needs better training

    The 'Recognize celebrities' feature of the computer vision API does not recognise photos of very famous people such as Barrack Obama, David Cameron, Wayne Rooney, Sting. It seems like it need a wider pool of training data or at the very least a list of the celebrities that it should be able to recognise.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Face Detection  ·  Flag idea as inappropriate…  ·  Admin →
  4. OCR not detecting Portuguese very well

    The response I receive from OCR demo page

    Numero d
    Data e Hora de Em ISSO
    Código de Verificação
    MOYZC008

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  5. Processing Video file through Computer Vision API

    I would like to get example of processing Video (MP4) and get the tagging data back.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Samples & SDK Request  ·  Flag idea as inappropriate…  ·  Admin →
  6. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  7. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  8. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  9. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  10. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  11. Not reading amount prepended with currency symbol(ex: $1.25). It is treating that as whole Image.

    In the scanned bills(Ex: taxi), If the amount is prepended with currency symbol, the following numbers are not read. I assume API is considering the whole number as image, because of that currency symbol.
    For example, from the scanned bills, $1.50 is skipped(not read as text), but 1.50 is read by API.

    Images tried from these Sources

    http://www.receipt-template.net/wp-content/uploads/2012/02/taxi-receipt.jpg
    https://expensify.files.wordpress.com/2010/03/ereceipt-sample.png
    http://farm9.staticflickr.com/8012/7587607410_2fde2c1bd8.jpg

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  12. 4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  13. Computer vision algorithms applied to a video or stream

    Could it be possible to use this algorithm with a video stream instead of single images

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  14. Include OfficeLens technology

    On the Computer Vision APIs – could you think about adding the technology from OfficeLens as API. So an uploaded photo of a whiteboard or receipt, printed page could be processed and then returned cleaned up, rotated correctly, etc.

    https://blogs.office.com/2015/04/02/office-lens-comes-to-iphone-and-android/

    10 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  15. Allow Computer Vision API to be used locally without a internet connection

    I'd like to be able to use Computer Vision in the event my app doesn't have a internet connection. A on-premise or on-client solution.

    27 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  3 comments  ·  On-Premises Solution  ·  Flag idea as inappropriate…  ·  Admin →
  16. Single Character recognition

    All I'm trying to recognize is single letters. Such as S, M, L, XL and so forth. Why is Microsoft's OCR unable to recognize these characters?

    10 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  17. Set image compression as a parameter for "Get thumbnail"

    The thumbnail API is excellent, but ships back a highly compressed image.

    It would be great if compression was a variable I could set. As it stands, it compresses the images too much to be usable for what I'm doing with it.

    Imagine you have a 1024x1024 image and you want a 1024x400 crop.

    Imagine the image is mostly blue sky with a plane flying somewhere in the image.

    The API is pretty good at cropping out all the blue sky and finding the plane. So despite the primary target being thumbnails, acting as an intelligent crop tool works pretty…

    10 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  2 comments  ·  Thumbnail Generation  ·  Flag idea as inappropriate…  ·  Admin →
  18. Logo detection

    Is there something on the roadmap for the vision API to include Logo Detection, similar to what Google is offering? https://cloud.google.com/vision/

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Object Detection  ·  Flag idea as inappropriate…  ·  Admin →
  19. Handwriting recognition through vision api

    The ability to recognize the handwritings is needed. For example, the user will write something in their Tablets and my app will convert those into characters automatically while writing.
    It will be helpful for old fashioned guys who don't wan to change their habit but want to be more productive.

    9 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
    Under Review  ·  Raymond responded

    Thank you for your feedback. How do you feel about including cursive handwriting?

  20. Returns location of objects on the image (people, tree, car's, etc.)

    Computer Vision API returns recognized objects locations on the image, what is the pixel location/rectangle.
    Would be of huge use!

    22 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  5 comments  ·  Object Detection  ·  Flag idea as inappropriate…  ·  Admin →
  • Don't see your idea?

Feedback and Knowledge Base