Computer Vision API
Welcome to the Computer Vision API Forum
Categories
API – Any ideas or feedback pertaining to features or enhancements to Computer Vision API.
Documentation – Any ideas or suggestions for the API Reference or Documentation.
Language Support – Submit a request to have a particular language supported.
Samples & SDK Request – Let us know if you would like to see a Code sample or SDK provided.
Custom/Sample Images – Have you tested an image and are not getting the results you are seeking? Upload the image and describe the information or tags you would like to be included.
-
Better support for artistic fonts.
Artistic fonts like the ones found in comics are really hard to recognize, especially for North East Asian languages.
Please fix this; Google's equivalent OCR option handles it far better.
1 vote
-
Fractions not recognized
Fractions are not being recognized.
I have attempted many different combinations and characters. This is probably similar to the symbols issue. Alternatively, how can I train a model to deal with this? I am previewing Form Recognizer but suspect similar results.
1 vote -
Get bounding box and confidence level for every individual character of a word
Greetings,
Partially adding to https://cognitive.uservoice.com/forums/430309-computer-vision-api/suggestions/38584192-add-ocr-confidence-to-v1-0-ocr-api and https://cognitive.uservoice.com/forums/430309-computer-vision-api/suggestions/15634182-single-character-recognition :
I am using the Cognitive Services Computer Vision API v2.0 through the Python SDK (with the Read Batch File / Get Read Operation Result combination), and it would be valuable to me if I could extract the bounding box (and optionally the confidence level) for every single character of a word. I already tried cropping the words out of the images and submitting the cropped words to the API individually, but that still returns only the whole word.
Thank you for your time.
Best regards
Christian Römer
1 vote
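Until per-character output is available, one crude workaround for the request above is to estimate character boxes by slicing the word's box into equal-width pieces. The sketch below is only an approximation and is not something the API returns: it assumes the word box is an 8-number list ordered top-left, top-right, bottom-right, bottom-left, and the helper name `approximate_char_boxes` is hypothetical. Proportional fonts and handwriting will make the slices inaccurate, and no confidence value can be derived this way.

```python
from typing import List


def approximate_char_boxes(text: str, box: List[float]) -> List[List[float]]:
    """Split a word quadrilateral into len(text) equal slices.

    `box` is assumed to be [x1, y1, x2, y2, x3, y3, x4, y4] with the
    corners ordered top-left, top-right, bottom-right, bottom-left.
    """
    if not text:
        return []
    x1, y1, x2, y2, x3, y3, x4, y4 = box
    n = len(text)

    def lerp(a: float, b: float, t: float) -> float:
        # Linear interpolation between a and b.
        return a + (b - a) * t

    boxes = []
    for i in range(n):
        t0, t1 = i / n, (i + 1) / n
        boxes.append([
            lerp(x1, x2, t0), lerp(y1, y2, t0),  # top-left of the slice
            lerp(x1, x2, t1), lerp(y1, y2, t1),  # top-right
            lerp(x4, x3, t1), lerp(y4, y3, t1),  # bottom-right
            lerp(x4, x3, t0), lerp(y4, y3, t0),  # bottom-left
        ])
    return boxes
```

Each returned box lines up with the character at the same index in `text`, so `zip(text, approximate_char_boxes(text, box))` pairs them.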
-
The Analyze Image result is not as expected
I called the Computer Vision API with this image URL:
https://lh3.googleusercontent.com/p/AF1QipMPhnsunVu5SWvwWTSUfelS8zRvMnznCiNlUXd8=s1600-w1600
There is no "people" entry in categories, and there is no "faces" field in the result.
Result detail:
{
"categories": [{
"name": "outdoor_",
"score": 0.046875,
"detail": {
"landmarks": []
}
}],
"adult": {
"isAdultContent": false,
"isRacyContent": false,
"adultScore": 0.12861922383308411,
"racyScore": 0.14124451577663422
},
"color": {
"dominantColorForeground": "Grey",
"dominantColorBackground": "Grey",
"dominantColors": ["Grey"],
"accentColor": "844D47",
"isBwImg": false,
"isBWImg": false
},
"imageType": {
"clipArtType": 0,
"lineDrawingType": 0
},
"tags": [{
"name": "outdoor",
"confidence": 0.99072730541229248
}, {
"name": "seafood",
"confidence": 0.98059272766113281
}, {
"name": "fishing",
"confidence": 0.9629974365234375
}, {
"name": "person",
"confidence": 0.94020164012908936
}, {
"name":…
1 vote -
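In the Analyze Image response quoted above, "person" does appear under `tags` even though `categories` has no people entry, so a client can fall back on the tags. A minimal sketch, assuming the response has already been parsed into a Python dict; the helper name `has_tag` and the 0.9 threshold are illustrative choices, and the sample data copies only the values quoted in the post.

```python
def has_tag(analysis: dict, name: str, min_confidence: float = 0.9) -> bool:
    """Return True if `name` is among the tags with enough confidence."""
    return any(
        tag.get("name") == name and tag.get("confidence", 0.0) >= min_confidence
        for tag in analysis.get("tags", [])
    )


# Values taken from the (truncated) response in the post above.
sample = {
    "categories": [{"name": "outdoor_", "score": 0.046875}],
    "tags": [
        {"name": "outdoor", "confidence": 0.99072730541229248},
        {"name": "seafood", "confidence": 0.98059272766113281},
        {"name": "fishing", "confidence": 0.9629974365234375},
        {"name": "person", "confidence": 0.94020164012908936},
    ],
}

print(has_tag(sample, "person"))
```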
Spanish
Spanish support for batch processing would be appreciated.
1 vote -
Add OCR confidence to v1.0/ocr API
Other OCR platforms provide OCR confidence, sometimes per character and sometimes per word.
Here, confidence means how likely the result is to match the input image. For very poorly scanned documents where noise is a problem, the current API frequently returns incorrect text, with no programmatic way to detect whether the result should be trusted or sent to a user for verification.
Having this as an optional query parameter on the API would be helpful, perhaps confidencePerWord=true and confidencePerCharacter=true.
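To illustrate how a client might use the proposed option, the sketch below assumes a hypothetical response in which every word carries a `confidence` value between 0 and 1. That field, like the `confidencePerWord` parameter, is only the proposal from this post and not an existing part of v1.0/ocr; the function name and the 0.8 threshold are likewise made up for the example.

```python
from typing import Dict, List


def words_needing_review(words: List[Dict], threshold: float = 0.8) -> List[str]:
    """Return the text of every word whose confidence is below `threshold`.

    Words without a confidence value are treated as untrusted.
    """
    return [w["text"] for w in words if w.get("confidence", 0.0) < threshold]


# Hypothetical per-word output for a noisy scan.
words = [
    {"text": "Invoice", "confidence": 0.97},
    {"text": "T0tal", "confidence": 0.41},
    {"text": "49.00", "confidence": 0.88},
]

for text in words_needing_review(words):
    print("send to a user for verification:", text)
```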
13 votes -
How to add COGNITIVE_SERVICE_KEY and ENDPOINT in the Windows command prompt?
I am writing Python code to analyze an image from its URL. This code requires a KEY and an ENDPOINT.
I already have the key and endpoint, but I do not know how to add them in the Windows command prompt. Without adding them, I can't run the Python code from the command prompt.
Please tell me the syntax for adding the KEY and ENDPOINT in the Windows command prompt so that my Python code can use them.
1 vote
Resolving this feedback item since we have addressed the issue directly with the user who submitted.
For help getting started with the Computer Vision API please look at our QuickStarts in the documentation: https://docs.microsoft.com/azure/cognitive-services/computer-vision/quickstarts-sdk/python-sdk
Thanks
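For readers with the same question, a minimal sketch of one common approach: in the Windows Command Prompt, `set NAME=value` defines an environment variable for the current window, and `setx NAME "value"` stores it for windows opened afterwards. The Python side can then read the values from `os.environ`. The variable names below follow the wording of the post and are an assumption, not names required by any SDK; `load_settings` is a hypothetical helper.

```python
import os

# In the Windows Command Prompt, before running the script:
#   set COGNITIVE_SERVICE_KEY=<your key>
#   set COGNITIVE_SERVICE_ENDPOINT=<your endpoint>
# `set` only lasts for the current window; `setx NAME "value"` persists the
# variable for Command Prompt windows opened afterwards.


def load_settings(env=os.environ):
    """Read the key and endpoint from environment variables."""
    key = env.get("COGNITIVE_SERVICE_KEY")
    endpoint = env.get("COGNITIVE_SERVICE_ENDPOINT")
    if not key or not endpoint:
        raise RuntimeError(
            "Set COGNITIVE_SERVICE_KEY and COGNITIVE_SERVICE_ENDPOINT first"
        )
    return key, endpoint
```

Calling `key, endpoint = load_settings()` at the top of the script then fails early with a clear message if either variable is missing.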
-
Unable to handle ₹ (Indian rupee symbol): converted to 3 during detection
Currently the Vision API is not able to handle the ₹ (INR) symbol.
It converts it to 3, so if you want to detect the amount ₹ 49, you will get 349 as output.
1 vote -
Add Read API docker container
Only Recognize Text is currently containerised. As this API is deprecated, could we please get a container version of the newer Read API?
1 vote -
Add German Umlaute and ß to the Computer Vision API
Hello,
I'm using the handwriting recognition of the Computer Vision API to describe certain German documents. Therefore I need the umlauts as well as the ß sign. Are there any plans to support this? Sadly, the localizable OCR results are not accurate enough.
Regards
1 vote -
Using OpenCV with Kinect for Azure
k4a_image_t depth_image = k4a_capture_get_depth_image(capture);
color_frame=cv::Mat(k4a_image_get_height_pixels(color_image), k4a_image_get_width_pixels(color_image), CV_8UC3, k4a_image_get_buffer(color_image));
cv::imshow("color", color_frame);
The imshow call here causes a segmentation fault even though there are proper values in the matrix. I checked that this is not an OpenCV fault. Could someone please help me visualize color and depth data from a custom application?
1 vote -
I want to extract only text.
With the OCR API, I want to get only the recognized text as the result. What should I do?
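One way to do this on the client side is to walk the JSON and keep only the `text` values. A minimal sketch, assuming the response has the regions → lines → words layout of the v1.0/ocr result and has been parsed into a dict; `extract_text` is a hypothetical helper, the sample is made up, and joining words with spaces is a simplification that will not suit languages written without spaces.

```python
def extract_text(ocr_result: dict) -> str:
    """Return the recognized text, one output line per OCR line."""
    lines = []
    for region in ocr_result.get("regions", []):
        for line in region.get("lines", []):
            words = [word["text"] for word in line.get("words", [])]
            lines.append(" ".join(words))
    return "\n".join(lines)


# Made-up response in the assumed layout (bounding boxes omitted).
sample_ocr = {
    "language": "en",
    "regions": [
        {
            "lines": [
                {"words": [{"text": "Hello"}, {"text": "world"}]},
                {"words": [{"text": "PO"}, {"text": "12345"}]},
            ]
        }
    ],
}

print(extract_text(sample_ocr))
```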
1 vote -
Local container only supports English, no Traditional Chinese support.
Please consider adding Traditional Chinese to the list of languages supported by the local container.
2 votes -
TIF files
Will Azure Machine Learning Service support the TIF image format in the future?
1 vote -
Bug: color JSON from REST API contains both "isBwImg" and "isBWImg" (Typo)
There is a little bug in the JSON response from the REST API:
In the JSON response, the color object contains two most-likely identical features with different names - "isBWImg" and "isBwImg" (lower-case "w" vs. capital "W").
Ex:
"color": {
"dominantColorForeground": "White",
"isBwImg": false,
"isBWImg": false,
"accentColor": "228AAA",
"dominantColorBackground": "White",
"dominantColors": ["White"]
}
API version: 2.0
endpoint: https://westeurope.api.cognitive.microsoft.com/vision/v2.0/analyze
Region: West Europe
1 vote -
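A duplicate like the one reported above can be spotted programmatically by grouping an object's keys case-insensitively. A minimal sketch using only the standard library; `case_colliding_keys` is a hypothetical helper, and the JSON is the example quoted in the report.

```python
import json
from collections import defaultdict


def case_colliding_keys(obj: dict) -> list:
    """Return groups of keys that differ only by letter case."""
    groups = defaultdict(list)
    for key in obj:
        groups[key.lower()].append(key)
    return [sorted(keys) for keys in groups.values() if len(keys) > 1]


# The "color" object from the bug report.
color = json.loads("""
{
  "dominantColorForeground": "White",
  "isBwImg": false,
  "isBWImg": false,
  "accentColor": "228AAA",
  "dominantColorBackground": "White",
  "dominantColors": ["White"]
}
""")

print(case_colliding_keys(color))
```

Clients that deserialize into case-insensitive structures are the ones most likely to trip over such a pair, so a check like this can be useful in an integration test.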
Capturing PO number Issue
Hello Azure Team,
Using the Azure online tool, the attached image generates the correct JSON response (all of the image text), but when we call the Computer Vision (Cognitive Services) API from the web or from Android, it returns a shorter, incorrect JSON response from which we cannot pick out the number.
1 vote -
Getting worse results in the API than in the home page example with the same image
Why am I getting worse results in the API than in the test form on the home page (https://azure.microsoft.com/pt-br/services/cognitive-services/computer-vision/)?
7 votes -
RecognizeText(Printed) is not recognizing the pound symbol (£)
I have many cases of pictures of texts where one can find a pound sign (£) but the sign is NEVER correctly recognized by Azure Cognitive Services RecognizeText API, as far as I tested. Other symbols, like the dollar sign ($) for example, are identified without problems.
I made tests with print screens of texts containing £, since these should be easy for the OCR tool to convert, and again the pound sign is not correctly identified (it becomes an f, a 2, a 1, a $ etc).
I am suspecting that the pound sign is not included in the…
15 votes