Microsoft

Computer Vision API

Welcome to the Computer Vision API Forum

Categories

API – Any ideas or feedback pertaining to features or enhancements to Computer Vision API.

Documentation – Any ideas or suggestions for the API Reference or Documentation.

Language Support – Submit a request to have a particular language supported.

Samples & SDK Request – Let us know if you would like to see a Code sample or SDK provided.

Custom/Sample Images – Have an image you’ve tested and not getting the results you are seeking? Upload the image and describe the information or tags you would like to be included.

  • Hot ideas
  • Top ideas
  • New ideas
  • My feedback
  1. Add Read API docker container

    Only Recognize Text is currently containerised. As this API is deprecated, could we please get a container version of the newer Read API?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  2. Add German Umlaute and ß to the Computer Vision API

    Hello,

    i'm using the handwriting recognition of the Computer Vision API to describe certain german documents. therefore i'm in need of the umlaute aswell as the ß sign. are there any plans on supporting this? sadly the localizable ocr results are not accurate enough.

    Regards

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  3. Using opencv with Kinect for azure

    k4a_image_t depth_image = k4a_capture_get_depth_image(capture);

    color_frame=cv::Mat(k4a_image_get_height_pixels(color_image), k4a_image_get_width_pixels(color_image), CV_8UC3, k4a_image_get_buffer(color_image));

    cv::imshow("color", color_frame);

    The imshow here creates a 'segmentation fault' even though there are proper values in the matrix. I check this is not an opencv fault. Could someone please help to visualize color and depth data from custom application.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  4. I want to extract only text.

    ocr api I want to get only the text recognized as a result. What should I do?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  5. Local container only supports English, no Traditional Chinese support.

    Please consider add Traditional Chinese into support list of local container.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  On-Premises Solution  ·  Flag idea as inappropriate…  ·  Admin →
  6. Tif's

    Will Azure Machine Learning Service support Tif image formats in the future?

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  7. Bug: color JSON from REST API contains both "isBwImg" and "isBWImg" (Typo)

    There is a little bug in the JSON response from the REST API:

    In the JSON response, the color object contains two most-likely identical features with different names - "isBWImg" and "isBwImg" (lower-case "w" vs. capital "W").

    Ex:

    "color": {
    "dominantColorForeground": "White",
    "isBwImg": false,
    "isBWImg": false,
    "accentColor": "228AAA",
    "dominantColorBackground": "White",
    "dominantColors": ["White"]
    }

    API version: 2.0
    endpoint: https://westeurope.api.cognitive.microsoft.com/vision/v2.0/analyze
    Region: West Europe

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  8. Capturing PO number Issue

    Hello Azure Team,

    Using azure online tool , the attached image generates the correct json response(all image text), but when we use it via computervision, cognitiveserive API in web or mobile Android it returns less/incorrect json response which is invalid and we cannot pick number from it.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  API  ·  Flag idea as inappropriate…  ·  Admin →
  9. 5 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  10. RecognizeText(Printed) is not recognizing the pound symbol (£)

    I have many cases of pictures of texts where one can find a pound sign (£) but the sign is NEVER correctly recognized by Azure Cognitive Services RecognizeText API, as far as I tested. Other symbols, like the dollar sign ($) for example, are identified without problems.

    I made tests with print screens of texts containing £, since these should be easy for the OCR tool to convert, and again the pound sign is not correctly identified (it becomes an f, a 2, a 1, a $ etc).

    I am suspecting that the pound sign is not included in the…

    15 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    2 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  11. Response should contain column and row indices

    Why should I have to calculate row and column indices? You're processing this data and know those numbers exactly. This data is far more valuable (at least to me) than your IList<int> of 1 -> n integers (bounding boxes) - that itself could use some reorganization. I'm testing out the big 3 OCR engines - Amazon, Google and Azure - Azure so far has the best scan results, but the poorest return object IMO.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  12. The PDF coordinate system needs some work

    The Read api (https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/2afb498089f74080d7ef85eb) does not handle PDF word coordinates correctly.

    Did not see a way to report the bug on the API page, so I am reporting it here.

    I am sending it a 8.5x11 page and getting coordinates in inches that are outside the box.

    The width is marked as 10.9833 and height as 8.4567, however it is getting words outside that box:

    12.3419
    0.4349
    13.2572
    0.4294
    13.2715
    0.501
    12.3562
    0.5065

    When I print the coordinates to the page some documents are spot on the the x coordinate, but roughly half as far down the page…

    4 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  13. Improve accuracy for combination of numbers and special characters

    When i try to detect license number from image of pattern 12/234232/3454, the slash symbol is recognized as 1.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
  14. Information on Computer Vision Image and Data Retention

    People need to know what you are doing with their images during and after processing. Please have a clearly stated policy on this.

    3 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Documentation  ·  Flag idea as inappropriate…  ·  Admin →
  15. i need to find some kind of objects like cars or phones

    i want to filter the search about a object or something in the image

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Object Detection  ·  Flag idea as inappropriate…  ·  Admin →
  16. Recognize Vancouver Olympic Torch in photo

    Photo of landmark in Vancouver, Canada is incorrectly identified as "a plane parked on the side of the road" - it should at least be identified as a monument or sculpture.

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Custom/Sample Images  ·  Flag idea as inappropriate…  ·  Admin →
  17. Please provide the taxonomy for tagging and taxonomy used for object detection in the documentation.

    Please provide the taxonomy for tagging and taxonomy used for object detection in the documentation.

    It would fine if it is not an exhaustive list. This taxonomy would be more helpful to users to set predefined conditions for objects or tags.

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Documentation  ·  Flag idea as inappropriate…  ·  Admin →
  18. 2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Resolved  ·  1 comment  ·  Image Analysis  ·  Flag idea as inappropriate…  ·  Admin →
  19. Generate Human friendly operationId in swagger

    Almost all tools that can generate API Client out of Swagger API defenition, will use OperationId to generate method name in a client. In your swagger API, OperationIds looks like "56f91f2e778daf14a499e1fa", and in C# it will produce method name public System.Threading.Tasks.Task 56f91f2e778daf14a499e1faAsync();
    It's extremely hard to recognize what method doing, based on this operationID

    Please, modify operationIds in a swagger documentation https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/export?DocumentFormat=Swagger&ApiName=Computer%20Vision%20API%20-%20v2.0 to make it human and code generator tools friendly.

    P.S. I got this swagger file from https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/587f2c6a154055056008f200

    2 votes
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Documentation  ·  Flag idea as inappropriate…  ·  Admin →
  20. Not able detect all the character accuratly with Printed API

    I am trying to upload the identity proof and trying to verify with the user input. Some characters are replaced with other characters. like 1 is replaced with 7 ... so the date of birth will change..

    1 vote
    Sign in
    (thinking…)
    Sign in with: Facebook Google
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Under Review  ·  0 comments  ·  Text Recognition  ·  Flag idea as inappropriate…  ·  Admin →
← Previous 1 3 4 5 6
  • Don't see your idea?

Feedback and Knowledge Base