Text Detection and Recognition

Detect and recognize text using image feature detection and description, deep learning, and OCR

Detecting and recognizing text in images is a common task performed in computer vision applications. For example, you can capture video of a road scene from a moving vehicle, recognize signposts in the captured scene, and alert the driver about the signs.

You can combine detection and recognition combined into a two-step process, where the first step finds regions that contain text, and then the second step recognizes the text within the regions.

Input image showing an accessible parking sign, connected to a detector, which outputs an image with predicted bounding boxes overlaid on the sign text, connected to a recognizer that outputs a list of the words recognized on the sign.

Text detection algorithms use local image features, machine learning or deep learning, to locate or segment text within an image. The examples in the Computer Vision Toolbox™ demonstrate how to use blob analysis, the maximally stable extremal regions (MSER) feature detector, and the character region awareness for text detection (CRAFT) deep learning model for text detection.

Once you have detected the text, text recognition models, based on machine learning or deep learning, process the text regions to return the predicted text. The ocr function uses pretrained language models to recognize text in multiple languages. You can also train a custom language model using the trainOCR function. For more information, see Getting Started with OCR.

App

Image Labeler

Label images for computer vision applications

Funzioni

espandi tutto

Text Recognition

`ocr`	Recognize text using optical character recognition
`ocrText`	Store OCR results
`visionSupportPackages`	Start Installer to download, install, or uninstall Computer Vision Toolbox data

Training and Evaluation

`trainOCR`	Train OCR model to recognize text in image (Da R2023a)
`evaluateOCR`	Evaluate OCR results against ground truth (Da R2023a)
`ocrMetrics`	Store OCR quality metrics (Da R2023a)
`ocrTrainingOptions`	Options for training OCR model (Da R2023a)
`ocrTrainingData`	Create training data for OCR from ground truth (Da R2023a)

Quantization

quantizeOCR Quantize OCR model (Da R2023a)

Text Detection

`detectTextCRAFT`	Detect texts in images by using CRAFT deep learning model (Da R2022a)
`detectMSERFeatures`	Detect MSER features
`vision.BlobAnalysis`	Properties of connected regions
`extractHOGFeatures`	Extract histogram of oriented gradients (HOG) features

Argomenti

Get Started

Getting Started with OCR
Detect and recognize text in multiple languages, train OCR models to recognize custom text.
Train Custom OCR Model
Train an optical character recognition (OCR) model to recognize custom text.
Install OCR Language Data Files
Support files for optical character recognition (OCR) languages.
Local Feature Detection and Extraction
Learn the benefits and applications of local feature detection and extraction.
Point Feature Types
Choose functions that return and accept points objects for several types of features.

Esempi in primo piano

Recognize Text Using Optical Character Recognition (OCR)

Recognize text in images using optical character recognition.

Apri live script

Recognize Seven-Segment Digits Using OCR

Use OCR to recognize seven-segmented digits in text detected by CRAFT and region properties.

Apri live script

Automatically Detect and Recognize Text Using MSER and OCR

Automatically detect and recognize text in images using MSER and OCR.

Apri live script

Automatically Detect and Recognize Text Using Pretrained CRAFT Network and OCR

Perform text recognition by using a deep learning based text detector and OCR. In the example, you use a pretrained CRAFT (character region awareness for text) deep learning network to detect the text regions in the input image. You can modify the region threshold and the affinity threshold values of the CRAFT model to localise an entire paragraph, a sentence, or a word. Then, you use OCR to recognize the characters in the detected text regions.

Apri live script

Automate Ground Truth Labeling for OCR

Automate the labeling of text for OCR training and evaluation.

Apri live script

Train an OCR Model to Recognize Seven-Segment Digits

Train an OCR model that can recognize seven-segment numerals.

Apri live script

Digit Classification Using HOG Features

Classify digits using HOG features and a multiclass SVM classifier.

Apri live script