Computer Vision API
Welcome to the Computer Vision API Forum
Categories
API – Any ideas or feedback pertaining to features or enhancements to Computer Vision API.
Documentation – Any ideas or suggestions for the API Reference or Documentation.
Language Support – Submit a request to have a particular language supported.
Samples & SDK Request – Let us know if you would like to see a Code sample or SDK provided.
Custom/Sample Images – Have an image you’ve tested and not getting the results you are seeking? Upload the image and describe the information or tags you would like to be included.
-
Improve single-character recognition
Single characters are having a very tough time being recognized, even in the most optimal conditions for OCR (shown in the attached PDF). Also attached is a text view of how the content gets OCR-ed.
I saw that this is a recognized problem in a few ideas from a couple of years ago, but I was wondering if any progress was made in fixing this known problem.
Functionally, this provides issues with OCR-ing tables that have a "Quantity" or any other amount less than 10.
1 vote -
It is found that the subscriptionKey is placed in the request header during API use, which is unsafe
After looking at the documentation, you'll notice that a lot of API requests put subscriptionKey in the request header. That way, you can grab the request and get the user's key. This allows illegal use of the service.
1 vote -
api使用中发现subscriptionKey放到请求头中,不安全
After looking at the documentation, you'll notice that a lot of API requests put subscriptionKey in the request header. That way, you can grab the request and get the user's key. This allows illegal use of the service.
1 vote -
api使用中发现subscriptionKey放到请求头中,不安全
After looking at the documentation, you'll notice that a lot of API requests put subscriptionKey in the request header. That way, you can grab the request and get the user's key. This allows illegal use of the service.
1 vote -
1 vote
-
1 vote
-
Can handwriting(Japanese) be supported?
Can handwriting(Japanese) be supported?
1 vote -
Please, make a support for bulgarian language
It's not so hard to add Bulgarian language once you have Russian (cyrillic).
1 vote -
Better support for artistic fonts.
Artistic fonts like the ones found in comics are really hard to recognize, especially for North East Asian languages.
Fix that, Google equivalent ocr option handles it a ton better.
1 vote -
1 vote
-
Fractions not recognized --
Fractions are not being recognized..
many different combinations and characters attempted. Probably similar to symbols issue. Alternative-- How can I train a model to deal with this? I am previewing Form Recognizer but suspect similar results.1 vote -
Get bounding box and confidence level for every individual character of a word
Greetings,
partially adding to https://cognitive.uservoice.com/forums/430309-computer-vision-api/suggestions/38584192-add-ocr-confidence-to-v1-0-ocr-api and https://cognitive.uservoice.com/forums/430309-computer-vision-api/suggestions/15634182-single-character-recognition :
I am using the Cognitive Services Computer Vision API v2.0 (through the Python SDK) (with the Read Batch File / Get Read Operation Result combination) and it would be valuable to me if I could extract the bounding box (and optionally maybe even the confidence level) for every single character of a word. I already tried cropping out the words out of the images and submitting the cropped words to the API individually, but that still returns only the whole word.
Thank you for your time.
Best regards
Christian Römer1 vote -
1 vote
-
The Analyze Image result is not as expected
I called the computer vision api with this image url:
https://lh3.googleusercontent.com/p/AF1QipMPhnsunVu5SWvwWTSUfelS8zRvMnznCiNlUXd8=s1600-w1600.There is no "people" in categories and there is no "faces" in result.
Result detail:
{
"categories": [{"name": "outdoor_",
"score": 0.046875,
"detail": {
"landmarks": []
}}],
"adult": {"isAdultContent": false,
"isRacyContent": false,
"adultScore": 0.12861922383308411,
"racyScore": 0.14124451577663422},
"color": {"dominantColorForeground": "Grey",
"dominantColorBackground": "Grey",
"dominantColors": ["Grey"],
"accentColor": "844D47",
"isBwImg": false,
"isBWImg": false},
"imageType": {"clipArtType": 0,
"lineDrawingType": 0},
"tags": [{"name": "outdoor",
"confidence": 0.99072730541229248}, {
"name": "seafood",
"confidence": 0.98059272766113281}, {
"name": "fishing",
"confidence": 0.9629974365234375}, {
"name": "person",
"confidence": 0.94020164012908936}, {
…"name":
1 vote -
spanish
Spanish support for batch processing will be appreciated
1 vote -
Add OCR confidence to v1.0/ocr API
Other OCR platforms provide OCR confidence sometimes per character and sometimes per word.
The confidence meaning how likely the result is to match the input image, for very poorly scanned documents where noise is a problem this can cause the current API to return incorrect text frequently with no programmatic way to detect if the result should be trusted or sent to a user for verification.
Having this as an optional query parameter on the API would be helpful, perhaps confidencePerWord=true and confidencePerCharacter=true
13 votes -
How to add COGNITIVE_SERVICE_KEY and ENDPOINTS on window command prompt?
I am writing a Python code to Analyze an image from its URL. In this code, I required KEY and ENDPOINT.
I already have those key and endpoint. But I did not know how to add those KEY and ENDPOINT on the WINDOWS command prompt. Without adding those I can't access the python code from the command prompt.
Tell me the syntax to add KEY and ENDPOINT on the WINDOWS command prompt. So that I can add those KEY and ENDPOINT to my PYTHON code.1 voteResolving this feedback item since we have addressed the issue directly with the user who submitted.
For help getting started with the Computer Vision API please look at our QuickStarts in the documentation: https://docs.microsoft.com/azure/cognitive-services/computer-vision/quickstarts-sdk/python-sdk
Thanks
-
Unable to handle ₹ ( Indian rupee symbol ) converted to 3 while detection
Currently vision API is not able to handle ₹ (INR) symbol.
It converts it to 3. So if you want to detect amount ₹ 49, you will get 349 as output.1 vote -
Add Read API docker container
Only Recognize Text is currently containerised. As this API is deprecated, could we please get a container version of the newer Read API?
1 vote -
Add German Umlaute and ß to the Computer Vision API
Hello,
i'm using the handwriting recognition of the Computer Vision API to describe certain german documents. therefore i'm in need of the umlaute aswell as the ß sign. are there any plans on supporting this? sadly the localizable ocr results are not accurate enough.
Regards
1 vote
- Don't see your idea?