yolov2 object detector doesn't work

Question

Mario il 1 Set 2024

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/2149404-yolov2-object-detector-doesn-t-work

Commentato: Vivek Akkala il 1 Ott 2024 alle 5:10

Hi all.

I'm trying to build and train a yolov2 network, following the tutorial in this website. My network has to detect bycicles.

I have a problem: bboxes and labels after training are always empties. Moreover, when I preprocess my data and I show the result, the rectangles move and doesn't point out the correct object, such as in this image:

I don't know what can I do. I'm training my network with 300 images.

This is my code:

function B = augmentData(A)
% Apply random horizontal flipping, and random X/Y scaling. Boxes that get
% scaled outside the bounds are clipped if the overlap is above 0.25. Also,
% jitter image color.
B = cell(size(A));
I = A{1};
sz = size(I);
if numel(sz)==3 && sz(3) == 3
    I = jitterColorHSV(I,...
        "Contrast",0.2,...
        "Hue",0,...
        "Saturation",0.1,...
        "Brightness",0.2);
end
% Randomly flip and scale image.
tform = randomAffine2d("XReflection",true,"Scale",[1 1.1]);
rout = affineOutputView(sz,tform,"BoundsStyle","CenterOutput");
B{1} = imwarp(I,tform,"OutputView",rout);
% Sanitize boxes, if needed. This helper function is attached as a
% supporting file. Open the example in MATLAB to access this function.
A{2} = helperSanitizeBoxes(A{2});
% Apply same transform to boxes.
[B{2},indices] = bboxwarp(A{2},tform,rout,"OverlapThreshold",0.25);
B{3} = A{3}(indices);
% Return original data only when all boxes are removed by warping.
if isempty(indices)
    B = A;
end
end
function data = preprocessData(data, inputSize)
data{1} = imresize(data{1}, inputSize(1:2));
% Convert grayscale images to RGB by replicating the single channel
if size(data{1}, 3) == 1
    data{1} = cat(3, data{1}, data{1}, data{1});
end
% Resize the bounding boxes
scale = inputSize(1:2) ./ size(data{1}, 1:2);
data{2} = bboxresize(data{2}, scale);
end
function boxes = helperSanitizeBoxes(boxes, ~)
persistent hasInvalidBoxes
valid = all(boxes > 0, 2);
if any(valid)
    if ~all(valid) && isempty(hasInvalidBoxes)
        % Issue one-time warning about removing invalid boxes.
        hasInvalidBoxes = true;
        warning('Removing ground truth bouding box data with values <= 0.')
    end
    boxes = boxes(valid,:); 
end
end
arch = imageDatastore(".\immaginiBici");
%imageLabeler(arch)
rng(0);
shuffledIndices = randperm(height(gTruth300));
idx = floor(0.6 * length(shuffledIndices) );
trainingIdx = 1:idx;
trainingDataTbl = gTruth300(shuffledIndices(trainingIdx),:);
validationIdx = idx+1 : idx + 1 + floor(0.1 * length(shuffledIndices) );
validationDataTbl = gTruth300(shuffledIndices(validationIdx),:);
testIdx = validationIdx(end)+1 : length(shuffledIndices);
testDataTbl = gTruth300(shuffledIndices(testIdx),:);
imdsTrain = imageDatastore(trainingDataTbl{:,"imageFilename"});
bldsTrain = boxLabelDatastore(trainingDataTbl(:,"Bicicletta"));
imdsValidation = imageDatastore(validationDataTbl{:,"imageFilename"});
bldsValidation = boxLabelDatastore(validationDataTbl(:,"Bicicletta"));
imdsTest = imageDatastore(testDataTbl{:,"imageFilename"});
bldsTest = boxLabelDatastore(testDataTbl(:,"Bicicletta"));
trainingData = combine(imdsTrain,bldsTrain);
validationData = combine(imdsValidation,bldsValidation);
testData = combine(imdsTest,bldsTest);
data = read(trainingData);
I = data{1};
bbox = data{2};
annotatedImage = insertShape(I,"rectangle",bbox);
annotatedImage = imresize(annotatedImage,2);
figure
imshow(annotatedImage)
augmentedTrainingData = transform(trainingData,@augmentData);
augmentedData = cell(4,1);
for k = 1:4
    data = read(augmentedTrainingData);
    augmentedData{k} = insertShape(data{1},"rectangle",data{2});
    reset(augmentedTrainingData);
end
figure
montage(augmentedData,"BorderSize",10)
imageInputSize = [300 300 3];
preprocessedTrainingData = transform(augmentedTrainingData,@(data)preprocessData(data,imageInputSize));
preprocessedValidationData = transform(validationData,@(data)preprocessData(data,imageInputSize));
data = read(preprocessedTrainingData);
data = read(preprocessedTrainingData);
I = data{1};
bbox = data{2};
annotatedImage = insertShape(I,"rectangle",bbox);
annotatedImage = resize(annotatedImage,imageInputSize);
figure
imshow(annotatedImage)
net = mobilenetv2();
lgraph = layerGraph(net);
imgLayer = imageInputLayer(imageInputSize,"Name","input_1")
lgraph = replaceLayer(lgraph,"input_1",imgLayer);
featureExtractionLayer = "block_12_add";
index = find(strcmp({lgraph.Layers(1:end).Name},featureExtractionLayer));
lgraph = removeLayers(lgraph,{lgraph.Layers(index+1:end).Name});
filterSize = [3 3];
numFilters = 96;
detectionLayers = [
    convolution2dLayer(filterSize,numFilters,"Name","yolov2Conv1","Padding", "same", "WeightsInitializer",@(sz)randn(sz)*0.01)
    batchNormalizationLayer("Name","yolov2Batch1")
    reluLayer("Name","yolov2Relu1")
    convolution2dLayer(filterSize,numFilters,"Name","yolov2Conv2","Padding", "same", "WeightsInitializer",@(sz)randn(sz)*0.01)
    batchNormalizationLayer("Name","yolov2Batch2")
    reluLayer("Name","yolov2Relu2")
    ]
numClasses = 1;
numAnchors = 5
[anchorBoxes, meanIoU] = estimateAnchorBoxes(preprocessedTrainingData, numAnchors)
numPredictionsPerAnchor = 5;
numFiltersInLastConvLayer = numAnchors*(numClasses+numPredictionsPerAnchor);
detectionLayers = [
    detectionLayers
    convolution2dLayer(1,numFiltersInLastConvLayer,"Name","yolov2ClassConv",...
    "WeightsInitializer", @(sz)randn(sz)*0.01)
    yolov2TransformLayer(numAnchors,"Name","yolov2Transform")
    yolov2OutputLayer(anchorBoxes,"Name","yolov2OutputLayer")
    ]
lgraph = addLayers(lgraph,detectionLayers);
lgraph = connectLayers(lgraph,featureExtractionLayer,"yolov2Conv1");
options = trainingOptions("sgdm", ...
    "MiniBatchSize",16, ....
    "InitialLearnRate",1e-3, ...
    "MaxEpochs",20, ... 
    "CheckpointPath",tempdir, ...
    "ValidationData",preprocessedValidationData);
[detector,info] = trainYOLOv2ObjectDetector(preprocessedTrainingData,lgraph,options);
I = imread("fotoProva.jpg");
I = imresize(I,imageInputSize(1:2));
[bboxes,labels] = detect(detector,I, "Threshold", 0.1);
I = insertObjectAnnotation(I,"rectangle",bboxes,labels);
figure
imshow(I)
preprocessedTestData = transform(testData,@(data)preprocessData(data,imageInputSize));
detectionResults = detect(detector, preprocessedTestData, "Threshold", 0);
metrics = evaluateObjectDetection(detectionResults,preprocessedTestData);
classID = 1;
precision = metrics.ClassMetrics.Precision{classID};
recall = metrics.ClassMetrics.Recall{classID};
figure
plot(recall,precision)
xlabel("Recall")
ylabel("Precision")
grid on
title(sprintf("Average Precision = %.2f",metrics.ClassMetrics.mAP(classID)))

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Shivansh il 2 Set 2024

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/2149404-yolov2-object-detector-doesn-t-work#answer_1508849

Hello Mario,

The problem appears to be with the preprocessing step in the modeling. The "preprocessData" function should correctly scale both the image and the bounding boxes. You can also make sure that the bounding boxes are correctly resized and aligned with the image dimensions after the data augmentation step using the "augmentData" function.

If the issue persists, share the code for "preprocessData" along with a few sample images.

You can refer to the following documentation for more information:

yolov2ObjectDetector: https://www.mathworks.com/help/vision/ref/yolov2objectdetector.html.
Augment images for deep learning workflows: https://www.mathworks.com/help/deeplearning/ug/image-augmentation-using-image-processing-toolbox.html.
imageDataAugmentor: https://www.mathworks.com/help/deeplearning/ref/imagedataaugmenter.html.

I hope the above information helps in resolving the issue.

2 Commenti
Mostra NessunoNascondi Nessuno

Mario il 2 Set 2024 alle 19:28

Modificato: Mario il 2 Set 2024 alle 19:58

Hi, I modified the "preprocess data" function and now it works. The network finds bycicles in images, but average precision is about 0.6 and the bycicle isn't totally in the rectangle. What can I do?

This is an example:

Vivek Akkala il 1 Ott 2024 alle 5:10

Hi Mario,

You might consider adding more training images and retraining the newtork or updating the detector from YOLO v2 to YOLO v3 or YOLO v4.

Accedi per commentare.

yolov2 object detector doesn't work

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposte (1)

2 Commenti
Mostra NessunoNascondi Nessuno

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

yolov2 object detector doesn't work

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposte (1)

2 Commenti Mostra NessunoNascondi Nessuno

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

2 Commenti
Mostra NessunoNascondi Nessuno