Microsoft

Computer Vision API

Welcome to the Computer Vision API Forum

Categories

API – Any ideas or feedback pertaining to features or enhancements to Computer Vision API.

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Request – Let us know if you would like to see a Code sample or SDK provided.

Custom/Sample Images – Have an image you’ve tested and not getting the results you are seeking? Upload the image and describe the information or tags you would like to be included.

  • Hot ideas
  • Top ideas
  • New ideas
  • My feedback
  1. Improve single-character recognition

    Single characters are having a very tough time being recognized, even in the most optimal conditions for OCR (shown in the attached PDF). Also attached is a text view of how the content gets OCR-ed.

    I saw that this is a recognized problem in a few ideas from a couple of years ago, but I was wondering if any progress was made in fixing this known problem.

    Functionally, this provides issues with OCR-ing tables that have a "Quantity" or any other amount less than 10.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  2. It is found that the subscriptionKey is placed in the request header during API use, which is unsafe

    After looking at the documentation, you'll notice that a lot of API requests put subscriptionKey in the request header. That way, you can grab the request and get the user's key. This allows illegal use of the service.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  3. api使用中发现subscriptionKey放到请求头中,不安全

    After looking at the documentation, you'll notice that a lot of API requests put subscriptionKey in the request header. That way, you can grab the request and get the user's key. This allows illegal use of the service.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  4. api使用中发现subscriptionKey放到请求头中,不安全

    After looking at the documentation, you'll notice that a lot of API requests put subscriptionKey in the request header. That way, you can grab the request and get the user's key. This allows illegal use of the service.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  5. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Service Limits  ·  Flag idea as inappropriate…  ·  Admin →
  6. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  7. Can handwriting(Japanese) be supported?

    Can handwriting(Japanese) be supported?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    1 comment  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  8. Please, make a support for bulgarian language

    It's not so hard to add Bulgarian language once you have Russian (cyrillic).

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  9. Better support for artistic fonts.

    Artistic fonts like the ones found in comics are really hard to recognize, especially for North East Asian languages.

    Fix that, Google equivalent ocr option handles it a ton better.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  10. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  11. Fractions not recognized --

    Fractions are not being recognized..
    many different combinations and characters attempted. Probably similar to symbols issue. Alternative-- How can I train a model to deal with this? I am previewing Form Recognizer but suspect similar results.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  12. Get bounding box and confidence level for every individual character of a word

    Greetings,

    partially adding to https://cognitive.uservoice.com/forums/430309-computer-vision-api/suggestions/38584192-add-ocr-confidence-to-v1-0-ocr-api and https://cognitive.uservoice.com/forums/430309-computer-vision-api/suggestions/15634182-single-character-recognition :

    I am using the Cognitive Services Computer Vision API v2.0 (through the Python SDK) (with the Read Batch File / Get Read Operation Result combination) and it would be valuable to me if I could extract the bounding box (and optionally maybe even the confidence level) for every single character of a word. I already tried cropping out the words out of the images and submitting the cropped words to the API individually, but that still returns only the whole word.

    Thank you for your time.

    Best regards
    Christian Römer

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  13. 1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Language Support  ·  Flag idea as inappropriate…  ·  Admin →
  14. The Analyze Image result is not as expected

    I called the computer vision api with this image url:
    https://lh3.googleusercontent.com/p/AF1QipMPhnsunVu5SWvwWTSUfelS8zRvMnznCiNlUXd8=s1600-w1600.

    There is no "people" in categories and there is no "faces" in result.

    Result detail:
    {
    "categories": [{

    "name": "outdoor_",
    
    "score": 0.046875,
    "detail": {
    "landmarks": []
    }

    }],
    "adult": {

    "isAdultContent": false,
    
    "isRacyContent": false,
    "adultScore": 0.12861922383308411,
    "racyScore": 0.14124451577663422

    },
    "color": {

    "dominantColorForeground": "Grey",
    
    "dominantColorBackground": "Grey",
    "dominantColors": ["Grey"],
    "accentColor": "844D47",
    "isBwImg": false,
    "isBWImg": false

    },
    "imageType": {

    "clipArtType": 0,
    
    "lineDrawingType": 0

    },
    "tags": [{

    "name": "outdoor",
    
    "confidence": 0.99072730541229248

    }, {

    "name": "seafood",
    
    "confidence": 0.98059272766113281

    }, {

    "name": "fishing",
    
    "confidence": 0.9629974365234375

    }, {

    "name": "person",
    
    "confidence": 0.94020164012908936

    }, {

    "name":
    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  15. spanish

    Spanish support for batch processing will be appreciated

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  16. Add OCR confidence to v1.0/ocr API

    Other OCR platforms provide OCR confidence sometimes per character and sometimes per word.

    The confidence meaning how likely the result is to match the input image, for very poorly scanned documents where noise is a problem this can cause the current API to return incorrect text frequently with no programmatic way to detect if the result should be trusted or sent to a user for verification.

    Having this as an optional query parameter on the API would be helpful, perhaps confidencePerWord=true and confidencePerCharacter=true

    13 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  17. How to add COGNITIVE_SERVICE_KEY and ENDPOINTS on window command prompt?

    I am writing a Python code to Analyze an image from its URL. In this code, I required KEY and ENDPOINT.
    I already have those key and endpoint. But I did not know how to add those KEY and ENDPOINT on the WINDOWS command prompt. Without adding those I can't access the python code from the command prompt.
    Tell me the syntax to add KEY and ENDPOINT on the WINDOWS command prompt. So that I can add those KEY and ENDPOINT to my PYTHON code.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  18. Unable to handle ₹ ( Indian rupee symbol ) converted to 3 while detection

    Currently vision API is not able to handle ₹ (INR) symbol.
    It converts it to 3. So if you want to detect amount ₹ 49, you will get 349 as output.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  19. Add Read API docker container

    Only Recognize Text is currently containerised. As this API is deprecated, could we please get a container version of the newer Read API?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  20. Add German Umlaute and ß to the Computer Vision API

    Hello,

    i'm using the handwriting recognition of the Computer Vision API to describe certain german documents. therefore i'm in need of the umlaute aswell as the ß sign. are there any plans on supporting this? sadly the localizable ocr results are not accurate enough.

    Regards

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
← Previous 1 3 4 5 6 7
  • Don't see your idea?

Feedback and Knowledge Base