I am a Ph.D. candidate in the Department of Computer Science at the National University of Singapore, advised by Prof. Wei Tsang Ooi, Prof. Benoit Cottereau, and Dr. Lai Xing Ng. I also collaborate closely with Prof. Ziwei Liu from Nanyang Technological University, Singapore.
My research focuses on spatial intelligence, multimodal large language models, and 3D/4D world modeling and evaluation.
I am the recipient of the Research Achievement Award (NUS Computing, 2023), Dean's Graduate Research Excellence Award (NUS Computing, 2024), DAAD AInet Fellowship (DAAD, 2025), and Apple Scholars in AI/ML Ph.D. Fellowship (Apple, 2025).
I have been fortunate to collaborate with Apple Machine Learning Research, NVIDIA Research, OpenMMLab, MMLab@NTU, and Motional.
Apple AI/ML | CNRS@CREATE | NVIDIA Research | TikTok | Motional
* equal contribution ‡ project lead § corresponding author
Learning to Remove Lens Flare in Event Camera
Preprint, 2026

WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
Preprint, 2026

AD-R1: Closed-Loop Reinforcement Learning for End-to-End Autonomous Driving with Impartial World Models
Preprint, 2026

3D and 4D World Modeling: A Survey
Preprint, 2026
LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences

La La LiDAR: Large-Scale Layout Generation from LiDAR Data
Open-o3 Video: Grounded Video Reasoning with Spatio-Temporal Evidence
Preprint, 2025

RewardMap: Tackling Sparse Rewards in Fine-Grained Visual Reasoning via Multi-Stage Reinforcement Learning
Preprint, 2025

EditMGT: Unleashing Potentials of Masked Generative Transformers in Image Editing
Preprint, 2025

Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps
Preprint, 2025

Stairway to Success: Zero-Shot Floor-Aware Object-Goal Navigation via LLM-Driven Coarse-to-Fine Exploration
Preprint, 2025

PixelThink: Towards Efficient Chain-of-Pixel Reasoning
Preprint, 2025

See4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting
Preprint, 2025
Talk2Event: Grounded Understanding of Dynamic Scenes from Event Cameras

VideoLucy: Deep Memory Backtracking for Long Video Understanding

3EED: Ground Everything Everywhere in 3D

MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query

SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation and Understanding

FlexEvent: Towards Flexible Event-Frame Object Detection at Varying Operational Frequencies

Perspective-Invariant 3D Object Detection

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations

MonoMRN: Monocular Semantic Scene Completion via Masked Recurrent Networks

SafeMap: Robust HD Map Construction from Incomplete Observations

EventFly: Event Camera Perception from Ground to the Sky

LiMoE: Mixture of LiDAR Data Representation Learners from Automotive Scenes

GEAL: Generalizable 3D Object Affordance Learning with Cross-Modal Consistency

SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding

PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning

DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes

Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding

LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving

Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving

FRNet: Frustum-Range Networks for Scalable LiDAR-Based Semantic Segmentation

NUC-Net: Non-Uniform Cylindrical Partition Networks for Efficient LiDAR Semantic Segmentation

Visual Foundation Models Boost Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation

Is Your LiDAR Placement Optimized for 3D Scene Understanding?

Is Your HD Map Constructor Reliable under Sensor Corruptions?

4D Contrastive Superflows are Dense 3D Representation Learners

Learning to Adapt SAM for Segmenting Cross-Domain Point Clouds

OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies

Multi-Space Alignments Towards Universal LiDAR Segmentation

Unified 3D and 4D Panoptic Segmentation via Dynamic Shifting Networks

Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving

RoboDepth: Robust Out-of-Distribution Depth Estimation under Corruptions

Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective

Towards Label-Free Scene Understanding by Vision Foundation Models

Robo3D: Towards Robust and Reliable 3D Perception against Corruptions

Rethinking Range View Representation for LiDAR Segmentation

UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase

LaserMix for Semi-Supervised LiDAR Semantic Segmentation

CLIP2Scene: Towards Label-Efficient 3D Scene Understanding by CLIP

ConDA: Unsupervised Domain Adaptation for LiDAR Segmentation via Regularized Domain Concatenation

Benchmarking 3D Robustness to Common Corruptions and Sensor Failure
The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms
Technical Report, 2025

The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition
Technical Report, 2024

The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation
Technical Report, 2023