Aleksey Bochkovskiy
7 min readDec 7, 2020

--

Scaled YOLOv4
Chart of Accuracy (vertical axis) and Latency (horizontal axis) on a Tesla V100 GPU (Volta) with batch = 1 without using TensorRT.
Accuracy rating of published neural networks: https://paperswithcode.com/sota/object-detection-on-coco
YOLOv4-tiny can run in real time with 39 FPS / 25ms latency on JetsonNano (416x416, fp16, batch = 1) tkDNN / TensorRT
Loss for YOLOv3, YOLOv4 and Scaled-YOLOv4
Scaled YOLOv4 comparison table
Scaled-YOLOv4 architecture
The simplest example of a CSP connection (on the left is a regular network, on the right is a CSP network)
An example of a CSP connection in YOLOv4-CSP / P5 / P6 / P7 (on the left is a regular network, on the right is a CSP network)
YOLOv4-tiny uses 2 CSP connections

--

--