Abstract: Detecting small objects (e.g., those smaller than 20 × 20 pixels) in large-scale images remains a significant and challenging problem. Modern CNN-based detectors often struggle due to the ...
Abstract: Human-object interaction (HOI) detection often faces high levels of ambiguity and indeterminacy, as the same interaction can look vastly different from one human-object pair to another.
DEIMv2 is an evolution of the DEIM framework that leverages the rich features from DINOv3. Our method is designed with various model sizes, from an ultra-light version up to S, M, L, and X, to be ...