Abstract: In complex visual scenarios, people can understand occluded objects through contextual reasoning and association. However, existing video object detection (VOD) methods could not work well.
A video on TikTok in October appeared to show a woman being interviewed by a television reporter about food stamps. The women weren’t real. The conversation never happened. Generated by A.I. The video ...
Visualization examples of YOLOv10-N/S, YOLO11-N/S, YOLOv12-N/S, and YOLOv13-N/S. Representative visualization examples of adaptive hyperedges. The hyperedges in the first and second columns mainly ...
DEIMv2 is an evolution of the DEIM framework while leveraging the rich features from DINOv3. Our method is designed with various model sizes, from an ultra-light version up to S, M, L, and X, to be ...
Abstract: Video recognition requires rapid and accurate object recognition across continuous frames, presenting significant challenges for computing-constrained ...
Traffic monitoring plays a vital role in smart city infrastructure, road safety, and urban planning. Traditional detection systems, including earlier deep learning models, often struggle with ...
Small object detection is a critical task in applications like autonomous driving and ship black smoke detection. While Deformable DETR has advanced small object detection, it faces limitations due to ...
In recent years, underwater object detection (UOD) has become a prominent research area in the computer vision community. However, existing UOD approaches are still vulnerable to underwater ...