Microsoft

Computer Vision

Welcome to the Computer Vision API Forum

Categories

API – Any ideas or feedback pertaining to features or enhancements to Computer Vision API.

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Request – Let us know if you would like to see a Code sample or SDK provided.

Custom/Sample Images – Have an image you’ve tested and not getting the results you are seeking? Upload the image and describe the information or tags you would like to be included.

  • Hot ideas
  • Top ideas
  • New ideas
  • My feedback
  1. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  2. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  3. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  4. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  5. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  6. Not reading amount prepended with currency symbol(ex: $1.25). It is treating that as whole Image.

    In the scanned bills(Ex: taxi), If the amount is prepended with currency symbol, the following numbers are not read. I assume API is considering the whole number as image, because of that currency symbol.
    For example, from the scanned bills, $1.50 is skipped(not read as text), but 1.50 is read by API.

    Images tried from these Sources

    http://www.receipt-template.net/wp-content/uploads/2012/02/taxi-receipt.jpg
    https://expensify.files.wordpress.com/2010/03/ereceipt-sample.png
    http://farm9.staticflickr.com/8012/7587607410_2fde2c1bd8.jpg

    9 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  7. 6 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  8. Computer vision algorithms applied to a video or stream

    Could it be possible to use this algorithm with a video stream instead of single images

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  9. Include OfficeLens technology

    On the Computer Vision APIs – could you think about adding the technology from OfficeLens as API. So an uploaded photo of a whiteboard or receipt, printed page could be processed and then returned cleaned up, rotated correctly, etc.

    https://blogs.office.com/2015/04/02/office-lens-comes-to-iphone-and-android/

    10 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  10. Allow Computer Vision API to be used locally without a internet connection

    I'd like to be able to use Computer Vision in the event my app doesn't have a internet connection. A on-premise or on-client solution.

    27 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    3 comments  ·  On-Premises Solution  ·  Flag idea as inappropriate…  ·  Admin →
  11. Single Character recognition

    All I'm trying to recognize is single letters. Such as S, M, L, XL and so forth. Why is Microsoft's OCR unable to recognize these characters?

    14 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Started  ·  5 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  12. Set image compression as a parameter for "Get thumbnail"

    The thumbnail API is excellent, but ships back a highly compressed image.

    It would be great if compression was a variable I could set. As it stands, it compresses the images too much to be usable for what I'm doing with it.

    Imagine you have a 1024x1024 image and you want a 1024x400 crop.

    Imagine the image is mostly blue sky with a plane flying somewhere in the image.

    The API is pretty good at cropping out all the blue sky and finding the plane. So despite the primary target being thumbnails, acting as an intelligent crop tool works pretty…

    10 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  2 comments  ·  Thumbnail Generation  ·  Flag idea as inappropriate…  ·  Admin →
  13. Logo detection

    Is there something on the roadmap for the vision API to include Logo Detection, similar to what Google is offering? https://cloud.google.com/vision/

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Object Detection  ·  Flag idea as inappropriate…  ·  Admin →
  14. Handwriting recognition through vision api

    The ability to recognize the handwritings is needed. For example, the user will write something in their Tablets and my app will convert those into characters automatically while writing.
    It will be helpful for old fashioned guys who don't wan to change their habit but want to be more productive.

    9 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  15. Returns location of objects on the image (people, tree, car's, etc.)

    Computer Vision API returns recognized objects locations on the image, what is the pixel location/rectangle.
    Would be of huge use!

    22 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    5 comments  ·  Object Detection  ·  Flag idea as inappropriate…  ·  Admin →
  16. Support data URIs in image APIs (vision/emotion/face)

    For JavaScript clients, there are many circumstances when you want the image as represented in a Canvas to be uploaded to one of the MCS APIs that accept images. The problem right now is that Canvas returns its content in the form of a data URI (RFC 2397), and callers need to convert this to a blob before uploading. It would be convenient if the APIs simply accepted the data URI directly, although it would mean the network payload is larger than need be (because of the base64 encoding.)

    8 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  1 comment  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  17. Vision API: Provide an SDK that does image binarization step at the client

    For most computer vision applications, the first step is to remove color information and binarize the input image. We are proposing, to have Microsoft release an SDK (python or javascript or iOS or Android or others) that performs the binarization part at the client. This significantly saves the amount of data transferred to the server. It can reduce the amount of data pushed to server as much as 12x and sill have no impact on recognition quality. In addition, its savings for microsoft in incoming bandwidth and CPU time spent. Its also a win for the clients where they have…

    5 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Samples & SDK Request  ·  Flag idea as inappropriate…  ·  Admin →
  18. Vision or sound api's for monitoring and publishing ocean wave engeries and spectrums at popular city beaches or world pro surfing venues.

    Vision and/or sound api's for monitoring and publishing ocean wave energies and spectrums at popular city beaches or world pro surfing venues. Could be used in conjunction with NOAA oceanographic services and raspery pi weather stations to provide realtime 'current' conditions at major city beaches or ocean facing ports. Vision api's might be able to provide local government services with usage statistics of ocean resources such as the number of people entering or leaving the water, the number of people on a beach or attending a surf carnival.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  19. Add the ability to get localized language support for image description with Vision API

    In order to allow a broader use of the service, it would be great to have the ability to specify the language in which we want to get the image description via Vision API.

    49 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    4 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
    Under Review  ·  Raymond responded

    We are researching options to bring this capability to Computer Vision.

  20. Allow control over minimal spacing between words in recognized text

    Sometimes it now recognizes spaces that arren't there, like "Company" is recongnized as "Co" "pa" "ny". It would be nice to indicate the minimal space between word regions, it the regions are very close together it can the merge then and return a single word.

    4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  • Don't see your idea?

Feedback and Knowledge Base