You Only Look Intensity Once: Event-Driven Long-Term High-Speed Object Detection
Wen Dong1,2, Haiyang Mei1,3, Yinglian Ji1, Yutong Jiang4, Ziqi Wei5, Shengfeng He2, Xin Yang1,*
1 Key Laboratory of Social Computing and Cognitive Intelligence (Ministry of Education), Dalian University of Technology
2 School of Computing and Information Systems, Singapore Management University
3 Show Lab, National University of Singapore
4 Chinese Scholartree Ridge State Key Laboratory, Vehicle Research Institute
5 Institute of Automation, Chinese Academy of Sciences
Contact us: xinyang@dlut.edu.cn
International Journal of Computer Vision (IJCV 2026)

Abstract
Combining intensity/RGB frames with event signals offers a practical and effective solution for high-speed object detection, but existing methods rely on frequent RGB frame updates, typically every 0.2 seconds, to preserve scene context. This leads to high computational cost, redundancy across frames, and added latency, especially on edge devices. In this work, we propose the Delta Perception Network (DPNet), a novel learning-based detector that requires only a single RGB frame at the beginning of a sequence to detect high-speed objects over a 5-second duration, 25 times longer than the 0.2-second refresh interval of prior methods. The key insight is that the initial frame can serve as a durable scene prior, while the continuous event stream supplies sufficient motion information to update this prior without further intensity frames. This design allows DPNet to track dynamic scene changes efficiently and accurately over extended periods. At the core of DPNet are a Delta Perception module that emphasizes temporal changes and a Grid Sampling GRU that incrementally maintains and refines the scene representation. Extensive experiments show that DPNet outperforms state-of-the-art frame-based, event-based, and fusion-based detectors. Our results demonstrate that long-duration, high-speed object detection with a single intensity/RGB frame and continuous event input is not only feasible but also efficient, enabling real-time performance with reduced computational demands.
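The data flow described in the abstract (one intensity frame as a scene prior, then event-only updates) can be illustrated with a toy NumPy sketch. This is not DPNet's architecture: the function names `events_to_frame`, `gated_update`, and `run_sequence`, the `(x, y, polarity)` event layout, the exponential gate, and the `gain` parameter are all assumptions made for illustration, and the hand-written gate merely stands in for the learned Delta Perception and Grid Sampling GRU modules.

```python
import numpy as np


def events_to_frame(events, height, width):
    """Accumulate (x, y, polarity) events into a signed count image.

    The (x, y, polarity) tuple layout is an assumption for this sketch.
    """
    frame = np.zeros((height, width), dtype=np.float32)
    for x, y, p in events:
        frame[y, x] += 1.0 if p > 0 else -1.0
    return frame


def gated_update(state, event_frame, gain=0.5):
    """Toy recurrent update (hypothetical, not the paper's GRU).

    Pixels with more event activity are refreshed more strongly;
    pixels with no events keep their previous value exactly.
    """
    gate = 1.0 - np.exp(-np.abs(event_frame))  # 0 where no events, < 1 elsewhere
    candidate = state + gain * event_frame     # proposed new value
    return (1.0 - gate) * state + gate * candidate


def run_sequence(initial_frame, event_windows):
    """Use one intensity frame as the prior, then update from events only."""
    state = initial_frame.astype(np.float32)
    height, width = state.shape
    states = []
    for events in event_windows:
        state = gated_update(state, events_to_frame(events, height, width))
        states.append(state.copy())
    return states


if __name__ == "__main__":
    prior = np.zeros((4, 4), dtype=np.float32)  # stand-in for the single RGB frame
    windows = [[(1, 2, 1)], []]                 # one event, then a silent window
    for t, s in enumerate(run_sequence(prior, windows)):
        print(t, float(s[2, 1]))
```

In this sketch a window with no events leaves the state untouched, which mirrors the idea that static scene content is carried by the initial frame while only changed regions need updating.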
Downloads
[Paper|Pre-trained model|Source code]
Dataset
Both the training set and the testing set can be obtained via a form request here.
Recommended citation
@Article{Wen_2026IJCV_DPNet,
title = {You Only Look Intensity Once: Event-Driven Long-Term High-Speed Object Detection},
author = {Wen Dong and Haiyang Mei and Yinglian Ji and Yutong Jiang and Ziqi Wei and Shengfeng He and Xin Yang},
journal = {International Journal of Computer Vision},
year = {2026}
}
