Microsoft

Computer Vision API

Welcome to the Computer Vision API Forum

Categories

API – Any ideas or feedback pertaining to features or enhancements to Computer Vision API.

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Request – Let us know if you would like to see a code sample or SDK provided.

Custom/Sample Images – Have an image you’ve tested that is not returning the results you are seeking? Upload the image and describe the information or tags you would like to be included.

  1. Add OCR confidence to v1.0/ocr API

    Other OCR platforms provide OCR confidence, sometimes per character and sometimes per word.

    Here, confidence means how likely the result is to match the input image. For very poorly scanned documents where noise is a problem, the current API can frequently return incorrect text, with no programmatic way to detect whether the result should be trusted or sent to a user for verification.

    Having this as an optional query parameter on the API would be helpful, perhaps confidencePerWord=true and confidencePerCharacter=true.

    13 votes
    0 comments  ·  API
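
A minimal sketch of how a client might use the confidence values requested above, if they existed. The query parameters confidencePerWord and confidencePerCharacter are only the names proposed in this idea, not part of the current API, and the per-word "confidence" field, the 0.8 threshold, and the example.invalid endpoint are assumptions made for illustration.

```python
from urllib.parse import urlencode


def build_ocr_url(endpoint, per_word=False, per_character=False):
    """Build a v1.0/ocr URL carrying the *proposed* (hypothetical) flags."""
    params = {}
    if per_word:
        params["confidencePerWord"] = "true"        # proposed in the idea, not a real flag
    if per_character:
        params["confidencePerCharacter"] = "true"   # proposed in the idea, not a real flag
    url = endpoint.rstrip("/") + "/vision/v1.0/ocr"
    return url + ("?" + urlencode(params) if params else "")


def words_needing_review(words, threshold=0.8):
    """Return the text of words whose (hypothetical) confidence is below threshold.

    Words without a confidence value are treated as untrusted.
    """
    return [w["text"] for w in words if w.get("confidence", 0.0) < threshold]


# Hypothetical response fragment: a list of words with a confidence score.
sample_words = [
    {"text": "Invoice", "confidence": 0.97},
    {"text": "t0tal", "confidence": 0.41},
]
review = words_needing_review(sample_words)
```

With a field like this, low-confidence words could be routed to a person for verification instead of being trusted automatically.
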
  2. RecognizeText(Printed) is not recognizing the pound symbol (£)

    I have many pictures of text that contain a pound sign (£), but as far as I have tested, the sign is NEVER correctly recognized by the Azure Cognitive Services RecognizeText API. Other symbols, like the dollar sign ($), are identified without problems.

    I also tested with screenshots of text containing £, since these should be easy for the OCR tool to convert, and again the pound sign is not correctly identified (it becomes an f, a 2, a 1, a $, etc.).

    I am suspecting that the pound sign is not included in the…

    15 votes
    2 comments  ·  Text Recognition
  3. Can handwriting (Japanese) be supported?

    Can handwriting (Japanese) be supported?

    1 vote
    1 comment  ·  API
  4. 1 vote
    0 comments  ·  Service Limits
  5. 1 vote
    0 comments  ·  Image Analysis
  6. Please add support for the Bulgarian language

    It should not be hard to add Bulgarian once you have Russian (Cyrillic).

    1 vote
    0 comments  ·  Text Recognition
  7. 7 votes
    0 comments  ·  Text Recognition
  8. Better support for artistic fonts.

    Artistic fonts like the ones found in comics are really hard to recognize, especially for North East Asian languages.

    Please fix that; Google's equivalent OCR option handles it a ton better.

    1 vote
    0 comments  ·  Text Recognition
  9. 1 vote
    0 comments  ·  Image Analysis
  10. Fractions not recognized

    Fractions are not being recognized. I have attempted many different combinations and characters. This is probably similar to the symbols issue. Alternatively, how can I train a model to deal with this? I am previewing Form Recognizer but suspect similar results.

    1 vote
    0 comments  ·  Text Recognition
  11. Get bounding box and confidence level for every individual character of a word

    Greetings,

    partially adding to https://cognitive.uservoice.com/forums/430309-computer-vision-api/suggestions/38584192-add-ocr-confidence-to-v1-0-ocr-api and https://cognitive.uservoice.com/forums/430309-computer-vision-api/suggestions/15634182-single-character-recognition :

    I am using the Cognitive Services Computer Vision API v2.0 through the Python SDK (with the Read Batch File / Get Read Operation Result combination), and it would be valuable to me if I could extract the bounding box (and optionally the confidence level) for every single character of a word. I already tried cropping the words out of the images and submitting the cropped words to the API individually, but that still returns only the whole word.

    Thank you for your time.

    Best regards
    Christian Römer

    1 vote
    0 comments  ·  API
  12. 1 vote
    0 comments  ·  Language Support
  13. The Analyze Image result is not as expected

    I called the Computer Vision API with this image URL:
    https://lh3.googleusercontent.com/p/AF1QipMPhnsunVu5SWvwWTSUfelS8zRvMnznCiNlUXd8=s1600-w1600.

    There is no "people" entry in categories and no "faces" in the result.

    Result detail:
    {
      "categories": [{
        "name": "outdoor_",
        "score": 0.046875,
        "detail": {
          "landmarks": []
        }
      }],
      "adult": {
        "isAdultContent": false,
        "isRacyContent": false,
        "adultScore": 0.12861922383308411,
        "racyScore": 0.14124451577663422
      },
      "color": {
        "dominantColorForeground": "Grey",
        "dominantColorBackground": "Grey",
        "dominantColors": ["Grey"],
        "accentColor": "844D47",
        "isBwImg": false,
        "isBWImg": false
      },
      "imageType": {
        "clipArtType": 0,
        "lineDrawingType": 0
      },
      "tags": [{
        "name": "outdoor",
        "confidence": 0.99072730541229248
      }, {
        "name": "seafood",
        "confidence": 0.98059272766113281
      }, {
        "name": "fishing",
        "confidence": 0.9629974365234375
      }, {
        "name": "person",
        "confidence": 0.94020164012908936
      }, {
        "name":…

    1 vote
    0 comments  ·  API
  14. Spanish

    Spanish support for batch processing would be appreciated.

    1 vote
    0 comments  ·  Text Recognition
  15. The PDF coordinate system needs some work

    The Read API (https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/2afb498089f74080d7ef85eb) does not handle PDF word coordinates correctly.

    I did not see a way to report the bug on the API page, so I am reporting it here.

    I am sending it an 8.5x11 page and getting coordinates in inches that are outside the box.

    The width is marked as 10.9833 and the height as 8.4567; however, it returns words outside that box:

    12.3419
    0.4349
    13.2572
    0.4294
    13.2715
    0.501
    12.3562
    0.5065

    When I print the coordinates to the page, some documents are spot on for the x coordinate, but roughly half as far down the page…

    5 votes
    Under Review  ·  0 comments  ·  Text Recognition
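
The mismatch described above can be checked mechanically. This is a minimal sketch, assuming the eight reported numbers are four (x, y) corner pairs in inches and that the page origin is the top-left corner; the function name is hypothetical and is not part of any SDK.

```python
def corners_outside_page(bounding_box, page_width, page_height):
    """Return the (x, y) corners that fall outside the page rectangle.

    Assumes bounding_box is a flat list [x1, y1, x2, y2, x3, y3, x4, y4]
    in the same unit as page_width and page_height.
    """
    xs = bounding_box[0::2]  # every even index is an x coordinate
    ys = bounding_box[1::2]  # every odd index is a y coordinate
    return [
        (x, y)
        for x, y in zip(xs, ys)
        if not (0 <= x <= page_width and 0 <= y <= page_height)
    ]


# Values quoted in the report above.
box = [12.3419, 0.4349, 13.2572, 0.4294, 13.2715, 0.501, 12.3562, 0.5065]
outside = corners_outside_page(box, page_width=10.9833, page_height=8.4567)
```

For the reported values, every corner has an x coordinate larger than the stated page width, so all four corners are flagged.
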
  16. How to add COGNITIVE_SERVICE_KEY and ENDPOINTS on the Windows command prompt?

    I am writing Python code to analyze an image from its URL. The code requires a KEY and an ENDPOINT.
    I already have the key and endpoint, but I do not know how to set them on the Windows command prompt. Without setting them, I can't run the Python code from the command prompt.
    Please tell me the syntax for setting the KEY and ENDPOINT on the Windows command prompt so that my Python code can use them.

    1 vote
    0 comments  ·  Image Analysis
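
One common approach is to set the values as environment variables in the command prompt and read them with os.environ in Python. The sketch below assumes the variable names COGNITIVE_SERVICE_KEY (from the title) and COGNITIVE_SERVICE_ENDPOINT (a hypothetical name); use whatever names your own script expects.

```python
import os

# In the Windows command prompt, before running the script:
#   set COGNITIVE_SERVICE_KEY=your-key-here
#   set COGNITIVE_SERVICE_ENDPOINT=https://your-resource-endpoint/
# "set" applies to the current window only. To persist a value for
# command prompts opened later, use:
#   setx COGNITIVE_SERVICE_KEY "your-key-here"


def load_credentials(environ=os.environ):
    """Read the key and endpoint from environment variables.

    The variable names are assumptions made for this sketch.
    """
    names = ("COGNITIVE_SERVICE_KEY", "COGNITIVE_SERVICE_ENDPOINT")
    missing = [name for name in names if not environ.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return environ[names[0]], environ[names[1]]
```

Calling load_credentials() at the top of the script fails early with a clear message if either variable has not been set in the current command prompt.
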
  17. Unable to handle ₹ (Indian rupee symbol): converted to 3 during detection

    Currently the Vision API is not able to handle the ₹ (INR) symbol.
    It converts it to 3, so if you want to detect the amount ₹ 49, you will get 349 as output.

    1 vote
    0 comments  ·  Text Recognition
  18. Local container only supports English, no Traditional Chinese support.

    Please consider adding Traditional Chinese to the list of languages supported by the local container.

    2 votes
    0 comments  ·  On-Premises Solution
  19. Bug: color JSON from REST API contains both "isBwImg" and "isBWImg" (Typo)

    There is a little bug in the JSON response from the REST API:

    In the JSON response, the color object contains two most-likely identical features with different names - "isBWImg" and "isBwImg" (lower-case "w" vs. capital "W").

    Ex:

    "color": {
      "dominantColorForeground": "White",
      "isBwImg": false,
      "isBWImg": false,
      "accentColor": "228AAA",
      "dominantColorBackground": "White",
      "dominantColors": ["White"]
    }

    API version: 2.0
    endpoint: https://westeurope.api.cognitive.microsoft.com/vision/v2.0/analyze
    Region: West Europe

    1 vote
    2 comments  ·  API
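
Until the duplicate is removed, a client can collapse the two keys itself. This is a minimal sketch that keeps "isBWImg" and drops "isBwImg"; which spelling is the intended one is an assumption here, and the helper name is hypothetical.

```python
import json


def normalize_color(color):
    """Return a copy of the color object with a single black-and-white flag.

    Keeps "isBWImg" (an arbitrary choice for this sketch) and falls back to
    "isBwImg" only when the capitalized key is absent.
    """
    cleaned = dict(color)
    lower = cleaned.pop("isBwImg", None)
    if "isBWImg" not in cleaned and lower is not None:
        cleaned["isBWImg"] = lower
    return cleaned


# The color object quoted in the report above.
raw = json.loads("""
{
  "dominantColorForeground": "White",
  "isBwImg": false,
  "isBWImg": false,
  "accentColor": "228AAA",
  "dominantColorBackground": "White",
  "dominantColors": ["White"]
}
""")
color = normalize_color(raw)
```

Because Python dictionary keys are case-sensitive, json.loads keeps both flags as separate entries, so the duplicate has to be handled explicitly.
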
  20. Add Read API Docker container

    Only Recognize Text is currently containerised. As this API is deprecated, could we please get a container version of the newer Read API?

    1 vote
    0 comments  ·  API