Kang Chen

I study neuromorphic vision, 3D vision, and reinforcement learning for vision-language-action models, with a focus on turning event and spike streams into useful perception systems.

In 2023, I cooperated closely with Prof. Lei Yu at Wuhan University on event-based motion deblurring. I am now pursuing a Ph.D. degree in Artificial Intelligence at Peking University under the guidance of Prof. Tiejun Huang and Prof. Zhaofei Yu. I have also joined Beijing Zhongguancun Academy, where I study post-training RL for VLA under the guidance of Prof. Chao Yu. If you would like to discuss research or collaboration, feel free to contact me via email.

Research Interests

Neuromorphic Vision

Leveraging spike and event cameras for high-speed imaging, motion deblurring, and temporal sequence reconstruction.

Primary Focus

3D Vision

Developing novel 3D reconstruction approaches using Gaussian Splatting with neuromorphic sensors.

Active Research

RL for VLA

Building efficient reinforcement learning frameworks for Vision-Language-Action models.

Core Expertise

News

2025.10 πRL has been submitted to arXiv!
2025.04 Cooperated papers accepted by ICML 2025 and IJCAI 2025!
2025.03 USP-Gaussian is accepted by CVPR 2025 (Highlight)!
2025.02 Released the open-source project Spike-Zoo for spike-to-image reconstruction.
2024.12 SpikeCLIP is accepted by AAAI 2025!
2024.09 SpikeReveal is accepted by NeurIPS 2024 (Spotlight)!
2024.07 One cooperated paper SpikeGS is accepted by ACM MM 2024!
2024.06 Proud recipient of the Telecom Pride (1%), Excellent Bachelor's Thesis, Outstanding Undergraduate Graduate.
2024.01 TRMD is accepted by IEEE TMM 2024!

Publications

arXiv
pi_rl

πRL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Kang Chen, Zhihao Liu, Tonghe Zhang, Zhen Guo, Si Xu, Hao Lin, Hongzhi Zang, Quanlu Zhang, Zhaofei Yu, Guoliang Fan, Tiejun Huang, Yu Wang, Chao Yu

[Paper] [Code] [Report] GitHub stars

  • We introduce πRL, the first open-source framework for efficient RL fine-tuning with flow-based VLAs.
CVPR 2025 - Highlight
sym

USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting

Kang Chen, Jiyuan Zhang, Zecheng Hao, Yajing Zheng, Tiejun Huang and Zhaofei Yu

[Paper] [Code] [Report] GitHub stars

  • We demonstrate that Spike-to-Image and 3D reconstruction tasks can mutually facilitate and enhance the optimization of each other.
NeurIPS 2024 - Spotlight
sym

SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams

Kang Chen, Shiyan Chen, Jiyuan Zhang, Baoyue Zhang, Yajing Zheng, Tiejun Huang and Zhaofei Yu

[Paper] [Code] GitHub stars

  • We develop a self-supervised spike-guided image deblurring framework, addressing the performance degradation due to the synthetic-real domain gap in supervised methods.
AAAI 2025
sym

Rethinking High-speed Image Reconstruction Framework with Spike Camera

Kang Chen, Yajing Zheng, Tiejun Huang and Zhaofei Yu

[Paper] [Code] [Report] GitHub stars

  • We introduce a novel spike-based image reconstruction framework, which leverages the CLIP model to supervise the network training by the class label of the captured object and the features of high-quality images.
TMM 2024
sym

Motion Deblur by Learning Residual from Events

Kang Chen and Lei Yu

[Paper] [Code] GitHub stars

  • We propose a Two-Stage Residual-based Motion Deblurring (TRMD) framework for event cameras, which utilizes the residual sequence as the intermediate variable, providing a stronger supervision signal for network training.

💻 Services

Conference Reviewer

  • Computer Vision and Pattern Recognition
  • Conference on Neural Information Processing Systems
  • International Conference on Learning Representations
  • AAAI Conference on Artificial Intelligence
  • ACM Multimedia

🍹 Misc