This project implements real-time object detection using a YOLO v8 model, a Rust gRPC service (Tonic + Ort) for inference, and an Axum-based middleware that streams video from a webcam, collects ...
Abstract Enhancing the network architecture of the YOLO framework has been crucial for a long time but has focused on CNN-based improvements despite the proven superiority of attention mechanisms in ...