computervision-recipes/scenarios
PatrickBue dda2e4a983
Added tracking setup instructions (#581)
2020-07-06 10:32:42 -04:00
..
action_recognition amendments to action rec (#559) 2020-06-22 15:37:40 -04:00
classification Image similarity widget (#530) 2020-04-14 10:10:01 -04:00
detection Image similarity widget (#530) 2020-04-14 10:10:01 -04:00
keypoints updated readmes 2020-01-23 16:01:32 -05:00
media Adding image segmentation 01 and 11 notebooks 2020-06-09 20:36:49 -04:00
segmentation Image segmentation update (#549) 2020-06-15 16:04:27 -04:00
similarity Adding image segmentation 01 and 11 notebooks 2020-06-09 20:36:49 -04:00
tracking Added tracking setup instructions (#581) 2020-07-06 10:32:42 -04:00
README.md Adding image segmentation 01 and 11 notebooks 2020-06-09 20:36:49 -04:00

README.md

Overview

Scenario Description
Classification Image Classification is a supervised machine learning technique to learn and predict the category of a given image.
Similarity Image Similarity is a way to compute a similarity score given a pair of images. Given an image, it allows you to identify the most similar image in a given dataset.
Detection Object Detection is a technique that allows you to detect the bounding box of an object within an image.
Keypoints Keypoint Detection can be used to detect specific points on an object. A pre-trained model is provided to detect body joints for human pose estimation.
Segmentation Image Segmentation assigns a category to each pixel in an image.
Action Recognition Action Recognition (also known as activity recognition) consists of classifying various actions from a sequence of frames, such as "reading" or "drinking".

Scenarios

While the field of Computer Vision is growing rapidly, the majority of vision applications fall into one of these 4 categories:

  • Image classification: Given an input image, predict what object is present in the image. This is typically the easiest CV problem to solve, however classification requires objects to be reasonably large in the image.

       Image classification visualization

  • Image Similarity Given an input image, find all similar objects in images from a reference dataset. Here, rather than predicting a label and/or rectangle, the task is to sort through a reference dataset to find objects similar to that found in the query image.

       Image similarity visualization

  • Object Detection: Given an input image, identify and locate which objects are present (using rectangular coordinates). Object detection can find small objects in an image. Compared to image classification, both model training and manually annotating images is more time-consuming in object detection, since both the label and location are required.

       Object detect visualization

  • Keypoint Detection: Given an input image, identify and locate keypoints. Conceptually this runs an object detector first, followed by detecting keypoints on the objects. In practice, a single model runs both steps (almost) at once.

       Keypoint detect visualization

  • Image Segmentation Given an input image, assign a label to every pixel (e.g., background, bottle, hand, sky, etc.). In practice, this problem is less common in industry, in large part due to time required to label the ground truth segmentation required in order to train a solution.

       Image segmentation visualization

Data/Telemetry

The following notebooks collect browser usage data and sends it to Microsoft to help improve our product and services:

Read Microsoft's privacy statement to learn more.

To opt out of tracking, please go to the raw .ipynb files and remove the following line of code (the URL will be slightly different depending on the file):

    "![Impressions](https://PixelServer20190423114238.azurewebsites.net/api/impressions/ComputerVision/classification/notebooks/21_deployment_on_azure_container_instances.png)"