Computer Vision
Extend deep learning workflows with computer vision applications
Apply deep learning to computer vision applications by using Deep Learning Toolbox™ together with the Computer Vision Toolbox™.
Apps
Image Labeler | Label images for computer vision applications |
Video Labeler | Label video for computer vision applications |
Functions
Topics
Image Classification
- Train Vision Transformer Network for Image Classification
This example shows how to fine-tune a pretrained vision transformer (ViT) neural network to perform classification on a new collection of images.
Object Detection and Instance Segmentation
- Get Started with Object Detection Using Deep Learning (Computer Vision Toolbox)
Perform object detection using deep learning neural networks such as YOLOX, YOLO v4, and SSD. - Get Started with Instance Segmentation Using Deep Learning (Computer Vision Toolbox)
Segment objects using an instance segmentation model such as SOLOv2 or Mask R-CNN. - Choose an Object Detector (Computer Vision Toolbox)
Compare object detection deep learning models, such as YOLOX, YOLO v4, RTMDet, and SSD. - Augment Bounding Boxes for Object Detection
This example shows how to perform common kinds of image and bounding box augmentation as part of object detection workflows. - Import Pretrained ONNX YOLO v2 Object Detector
This example shows how to import a pretrained ONNX™ (Open Neural Network Exchange) you only look once (YOLO) v2 [1] object detection network and use the network to detect objects. - Export YOLO v2 Object Detector to ONNX
This example shows how to export a YOLO v2 object detection network to ONNX™ (Open Neural Network Exchange) model format. - Deploy Object Detection Model as Microservice (MATLAB Compiler SDK)
Use a microservice to detect objects in images.
Automated Visual Inspection
- Getting Started with Anomaly Detection Using Deep Learning (Computer Vision Toolbox)
Anomaly detection using deep learning is an increasingly popular approach to automating visual inspection tasks. - Detect Image Anomalies Using Explainable FCDD Network (Computer Vision Toolbox)
Use an anomaly detector to distinguish between normal pills and pills with anomalous chips or contamination. - Classify Defects on Wafer Maps Using Deep Learning (Computer Vision Toolbox)
Classify manufacturing defects on wafer maps using a simple convolutional neural network (CNN). - Detect Image Anomalies Using Pretrained ResNet-18 Feature Embeddings (Computer Vision Toolbox)
Train a similarity-based anomaly detector using one-class learning of feature embeddings extracted from a pretrained ResNet-18 convolutional neural network. - Localize Industrial Defects Using PatchCore Anomaly Detector (Computer Vision Toolbox)
Perform localization of anomalous defects in printed circuit boards (PCBs) using anomaly heat maps generated with the PatchCore anomaly detector.
Semantic Segmentation
- Get Started with Semantic Segmentation Using Deep Learning (Computer Vision Toolbox)
Segment objects by class using deep learning networks such as U-Net and DeepLab v3+. - Augment Pixel Labels for Semantic Segmentation
This example shows how to perform common kinds of image and pixel label augmentation as part of semantic segmentation workflows. - Semantic Segmentation Using Dilated Convolutions
This example shows how to train a semantic segmentation network using dilated convolutions. - Semantic Segmentation of Multispectral Images Using Deep Learning (Computer Vision Toolbox)
This example shows how to perform semantic segmentation of a multispectral image with seven channels using U-Net. - Explore Semantic Segmentation Network Using Grad-CAM
This example shows how to explore the predictions of a pretrained semantic segmentation network using Grad-CAM. - Generate Adversarial Examples for Semantic Segmentation (Computer Vision Toolbox)
Generate adversarial examples for a semantic segmentation network using the basic iterative method (BIM). - Prune and Quantize Semantic Segmentation Network
Reduce the memory footprint of a semantic segmentation network and speed-up inference by compressing the network using pruning and quantization.
Video Classification
- Activity Recognition from Video and Optical Flow Data Using Deep Learning
This example first shows how to perform activity recognition using a pretrained Inflated 3-D (I3D) two-stream convolutional neural network based video classifier and then shows how to use transfer learning to train such a video classifier using RGB and optical flow data from videos [1]. - Gesture Recognition using Videos and Deep Learning
Perform gesture recognition using a pretrained SlowFast video classifier.