This repository offers a variety of pretrained YOLO v8[1] networks for object detection and instance segmentation in MATLAB®. These networks are trained on the COCO 2017[2] dataset and are capable of detecting 80 different object categories, including person, car, traffic light, etc. Additionally, this repository supports training custom object detectors to fine-tune models for specific applications.
Creator: MathWorks Development
Support | Object Detection | Instance Segmentation |
---|---|---|
Inference | ✓ | ✓ |
Codegen | ✓ | ✓ |
Simulink | ✓ | ✗ |
Transfer learning | ✓ | ✗ |
The software and model weights are released under the GNU Affero General Public License v3.0. For alternative licensing, contact Ultralytics Licensing.
- MATLAB® R2024a or later
- Computer Vision Toolbox™
- Deep Learning Toolbox™
- Deep Learning Toolbox Converter for ONNX Model Format
- (optional) Visual Studio C++ compiler for training object detector on Windows
- (optional) MATLAB® Coder for code generation
- (optional) GPU Coder for code generation
Download or clone this repository to your machine and open it in MATLAB®.
This section shows how to use YOLO v8 for object detection. To use YOLO v8 for instance segmentation, navigate to the Instance Segmentation section.
Use the code below to download the pretrained network.
% Load YOLO v8 model
det = yolov8ObjectDetector('yolov8s');
% Analyze loaded model
analyzeNetwork(det.Network);
modelName of the pretrained YOLO v8 deep learning model, specified as one of these:
- yolov8n
- yolov8s
- yolov8m
- yolov8l
- yolov8x
Following is the description of various YOLO v8 models available in this repo:
Model | Description |
---|---|
yolov8n | Nano pretrained YOLO v8 model optimized for speed and efficiency. |
yolov8s | Small pretrained YOLO v8 model balances speed and accuracy, suitable for applications requiring real-time performance with good detection quality. |
yolov8m | Medium pretrained YOLO v8 model offers higher accuracy with moderate computational demands. |
yolov8l | Large pretrained YOLO v8 model prioritizes maximum detection accuracy for high-end systems, at the cost of computational intensity. |
yolov8x | Extra Large YOLOv8 model is the most accurate but requires significant computational resources, ideal for high-end systems prioritizing detection performance. |
To perform object detection on an example image using the pretrained model, utilize the provided code below.
% Read test image.
I = imread(fullfile('data','inputTeam.jpg'));
% Load YOLO v8 small network.
det = yolov8ObjectDetector('yolov8s');
% Perform detection using pretrained model.
[bboxes, scores, labels] = detect(det, I);
% Visualize detection results.
annotations = string(labels) + ': ' + string(scores);
Iout = insertObjectAnnotation(I, 'rectangle', bboxes, annotations);
figure, imshow(Iout);
Model | Input image resolution | Size (MB) | mAP |
---|---|---|---|
yolov8n | 640 x 640 | 10.7 | 37.3 |
yolov8s | 640 x 640 | 37.2 | 44.9 |
yolov8m | 640 x 640 | 85.4 | 50.2 |
yolov8l | 640 x 640 | 143.3 | 52.9 |
yolov8x | 640 x 640 | 222.7 | 53.9 |
mAP for models trained on the COCO dataset is computed as average over IoU of .5:.95.
Training of a YOLO v8 object detector requires GPU drivers. Additionally, when training on Windows, the Visual Studio C++ compiler is required to facilitate the process.
Run installUltralytics.m
to install the required Python files and set up the Python environment for training YOLOv8. This MATLAB script automates downloading and setting up a standalone Python environment tailored for YOLOv8 training. It determines the system architecture, downloads the appropriate Python build, extracts it, and configures MATLAB settings to use this Python interpreter. Finally, it installs the Ultralytics package and its dependencies using pip
.
Use the code below to download the multiclass object detection dataset, or the subsequent steps can be followed to create a custom dataset. The dataset downloaded using the following command will already be in the required format, allowing the Train YOLO v8 object detector section to be proceeded with directly.
helper.downloadMultiClassData()
For more information about multiclass object detection dataset, see Multiclass Object Detection Using YOLO v2 Deep Learning example.
Refer to getYOLOFormat.m
to configure custom data to the YOLO format for training the YOLO v8 object detector. The following sections provides more details on the dataset format.
When using custom dataset for YOLO v8 training, organize training and validation images and labels as shown in the datasets example directory below. It is mandatory to have both training and validation data to train YOLO v8 network.
Labels for training YOLO v8 must be in YOLO format, with each image having its own *.txt file. If an image contains no objects, a *.txt file is not needed. Each *.txt file should have one row per object in the format: class xCenter yCenter width height, where class numbers start from 0, following a zero-indexed system. Box coordinates should be in normalized xywh format, ranging from 0 to 1. If the coordinates are in pixels, divide xCenter and width by the image width, and yCenter and height by the image height as shown below.
The label file corresponding to the above image contains 3 chairs (class 2).
Refer to the code below to convert bounding box [xCord, yCord, wdt, ht] to [xCenter, yCenter, wdt, ht] format.
boxes(:,1) = boxes(:,1) + (boxes(:,3)./2);
boxes(:,2) = boxes(:,2) + (boxes(:,4)./2);
Refer to the code below to normalize ground truth bounding box [xCenter, yCenter, wdt, ht] with respect to input image.
% Read input image.
[Irow, Icol, ~] = size(inpImage);
% Normalize xCenter and width of ground truth bounding box.
xCenter = xCenter./Icol;
wdt = wdt./Icol;
% Normalize yCenter and height of ground truth bounding box.
yCenter = yCenter./Irow;
ht = ht./Irow;
Run below code to train YOLO v8 object detector on multiclass object detection dataset. For more information about evaluation, see Multiclass Object Detection Using YOLO v2 Deep Learning example.
yolov8Det = trainYOLOv8ObjectDetector('data.yaml','yolov8n.pt', ImageSize=[720 720 3], MaxEpochs=10);
% Detect Multiple Indoor Objects
I = imread(fullfile('data','indoorTest.jpg'));
[bbox,score,label] = detect(yolov8Det,I);
annotatedImage = insertObjectAnnotation(I,"rectangle",bbox,label,LineWidth=4,FontSize=24);
figure
imshow(annotatedImage)
Use the code below to download the pretrained network.
% Load YOLO v8 model
det = yolov8('yolov8s');
% Analyze loaded model
analyzeNetwork(det.Network);
modelName of the pretrained YOLO v8 deep learning model, specified as one of these:
- yolov8n
- yolov8s
- yolov8m
- yolov8l
- yolov8x
Following is the description of various YOLO v8 models available in this repo:
Model | Description |
---|---|
yolov8n | Nano pretrained YOLO v8 model optimized for speed and efficiency. |
yolov8s | Small pretrained YOLO v8 model balances speed and accuracy, suitable for applications requiring real-time performance with good detection quality. |
yolov8m | Medium pretrained YOLO v8 model offers higher accuracy with moderate computational demands. |
yolov8l | Large pretrained YOLO v8 model prioritizes maximum segmentation accuracy for high-end systems, at the cost of computational intensity. |
yolov8x | Extra Large YOLOv8 model is the most accurate but requires significant computational resources, ideal for high-end systems prioritizing detection performance. |
To perform instance segmentation on an example image using the pretrained model, utilize the provided code below.
% Read test image.
I = imread(fullfile('data','inputTeam.jpg'));
% Load YOLO v8 medium network.
det = yolov8('yolov8m');
% Perform detection using pretrained model.
[masks,labels,scores,bboxes] = segmentObjects(det, I, Threshold=0.25);
% Visualize detection results.
Idisp = insertObjectAnnotation(I,"rectangle",bboxes,labels);
numMasks = size(masks,3);
overlayedImage = insertObjectMask(Idisp,masks,MaskColor=lines(numMasks));
figure;imshow(overlayedImage);
Import the ONNX model into the following code. Use model.export
in Python to export the model in ONNX format. For example, use 'yolov8nSeg.onnx' from Releases folder to verify the workflow.
% Import YOLO v8 model.
net = importYOLOv8SegmentationModel(<model to be imported>);
To perform instance segmentation on an example image using imported model, utilize the provided code below.
% Read test image.
I = imread(fullfile('data','inputTeam.jpg'));
% Specify the input size that the model network is trained on.
inputSize = [640 640 3];
% Apply Preprocessing on the input image.
origSize = size(im);
Ibgr = im(:,:,[3,2,1]); % convert image to bgr
img = helper.preprocess(Ibgr, inputSize);
newSize = size(img);
Iout = img(:,:,[3,2,1]); % convert image to rgb
% Convert input image to 'SSCB' format.
dlInput = dlarray(Iout, 'SSCB');
% Obtain network predictions.
outFeatureMaps = cell(7,1);
[outFeatureMaps{:}] = predict(det, dlInput);
% Define number of classes, the model is trained on.
numClasses = 80;
% Apply postprocessing on the output feature maps.
[masks,labelIds, scores, bboxes] = helper.postprocessYOLOv8Seg(outFeatureMaps, ...
origSize, newSize, numClasses);
% Visualize detection results.
Idisp = insertObjectAnnotation(I,"rectangle",bboxes,labelsIds);
numMasks = size(masks,3);
overlayedImage = insertObjectMask(Idisp,masks,MaskColor=lines(numMasks));
figure;imshow(overlayedImage);
Model | Input image resolution | Size (MB) | mAP |
---|---|---|---|
yolov8n | 640 x 640 | ToDo | ToDo |
yolov8s | 640 x 640 | ToDo | ToDo |
yolov8m | 640 x 640 | ToDo | ToDo |
yolov8l | 640 x 640 | ToDo | ToDo |
yolov8x | 640 x 640 | ToDo | ToDo |
mAP for models trained on the COCO dataset is computed as average over IoU of .5:.95.
Code generation enables you to generate code and deploy YOLO v8 on multiple embedded platforms. The list of supported platforms is shown below:
Target | Support | Notes |
---|---|---|
GPU Coder | ✔ | run gpuCodegenYOLOv8.m |
MATLAB Coder | ✔ | run codegenYOLOv8.m |
To deploy YOLO v8 to GPU Coder, run gpuCodegenYOLOv8.m
. This script calls the yolov8Predict.m
entry point function and generate CUDA code for it. It will run the generated MEX and give an output.
For more information about codegen, see Deep Learning with GPU Coder.
Simulink is a block diagram environment used to design systems with multidomain models, simulate before moving to hardware, and deploy without writing code. For more information about simulink, see Get Started with Simulink
% Read test image.
I = imread(fullfile('data','inputTeam.jpg'));
% Open Simulink model.
open('yolov8SimulinkSupport.slx')
To run the simulation, click Run
from the Simulation
tab.
The output will be logged to the workspace variable out
from the Simulink model.
YOLO v8 is one of the best performing detectors and is considered as an improvement to the existing YOLO variants such as YOLO v5, and YOLOX.
Following are the key features of the YOLO v8 detector compared to its predecessors:
- Improved Accuracy: YOLO v8 is expected to offer enhanced accuracy in object detection compared to its previous versions. This improvement can lead to more precise and reliable detection results.
- Better Speed and Efficiency: YOLO v8 may have optimizations that allow it to achieve faster processing speeds while maintaining high accuracy. This can be crucial for real-time applications or scenarios with limited computational resources.
- Advanced Backbone Network: YOLO v8 might incorporate a more advanced backbone network architecture, such as Darknet-53 or a similar architecture, which can enable better feature extraction and representation.
- Enhanced Object Classification: YOLO v8 may introduce improvements in object classification capabilities, allowing for more accurate and detailed classification of detected objects.
[1] https://github.com/ultralytics/ultralytics
[2] Lin, T., et al. "Microsoft COCO: Common objects in context. arXiv 2014." arXiv preprint arXiv:1405.0312 (2014).
Copyright 2024 The MathWorks, Inc.