According to the paper (section 2), the S x S x (B * 5 + C) shaped output represents the S x S grid cells that YOLOv1 splits the image into. The last layer can be implemented as a fully connected layer with output length S * S * (B * 5 + C); you can then simply reshape that flat vector into the 3D shape.
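As a minimal sketch of that reshape step (using NumPy and the paper's default values S=7, B=2, C=20; the random vector just stands in for the real network output):

```python
import numpy as np

# YOLOv1 hyperparameters from the paper: S=7 grid, B=2 boxes per cell, C=20 classes
S, B, C = 7, 2, 20
depth = B * 5 + C  # 5 values per box (x, y, w, h, confidence) plus C class scores

# Stand-in for the output of the final fully connected layer (batch of 1)
flat = np.random.rand(1, S * S * depth)

# Reshape the flat vector into the S x S x (B*5 + C) tensor described in the paper
grid = flat.reshape(-1, S, S, depth)
print(grid.shape)  # (1, 7, 7, 30)
```

The network itself never needs a 3D output; the grid structure only comes from how you interpret (reshape) the flat prediction vector.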
The paper states that:
"Our system divides the input image into an S × S grid.
If the center of an object falls into a grid cell, that grid cell
is responsible for detecting that object."
This means you have to assign each ground-truth label to its corresponding grid cell before computing the loss and backpropagating. For reference, a Keras/TensorFlow implementation of the loss calculation can be found here (by the GitHub user FMsunyh).
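The cell assignment quoted above can be sketched with a small hypothetical helper (`responsible_cell` is not from the paper; it assumes box centers are normalized to [0, 1) relative to the image):

```python
# Hypothetical helper: map a normalized box center (cx, cy in [0, 1))
# to the grid cell that is "responsible" for that object in YOLOv1.
def responsible_cell(cx, cy, S=7):
    col = int(cx * S)  # grid column the center falls into
    row = int(cy * S)  # grid row the center falls into
    return row, col

# A box centered in the middle of the image lands in the center cell of a 7x7 grid
print(responsible_cell(0.5, 0.5))  # (3, 3)
```

During training, the target tensor for each image is filled cell by cell this way, so the loss only penalizes box predictions in the cell containing each object's center.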