He Wang, Researcher / Ph.D. Advisor

+86 (0)10 6276-1084

hewang

Room 106-1, Courtyard No. 5, Jingyuan

Embodied AI, multimodal large models https://hughw19.github.io/

Biography

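Dr. He Wang is an assistant professor and Ph.D. advisor at the Center on Frontiers of Computing Studies (CFCS), School of Computer Science, Peking University. He is a Boya Young Fellow of Peking University and was selected for a national-level program for overseas high-level talent. He founded and leads the Embodied Perception and InteraCtion Lab (EPIC Lab) at Peking University, which aims to advance general-purpose robots by developing generalizable embodied skills and embodied multimodal large models. He is also the founder and Chief Technology Officer of Galbot (银河通用机器人) and serves as a research advisor at Zhongguancun Academy.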

Admissions

Master's and Ph.D. Admissions


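•    Ph.D. positions at the Center on Frontiers of Computing Studies (CFCS), School of Computer Science, Peking University: 2;
•    Jointly supervised Ph.D. positions with the Institute of Automation, Chinese Academy of Sciences: multiple;
•    Jointly supervised Ph.D. positions with Zhongguancun Academy: multiple (partner universities include the University of Science and Technology of China, Shanghai Jiao Tong University, and Zhejiang University);
•    Peking University Ph.D. positions for students from Hong Kong, Macao, Taiwan, and abroad: 1;
•    Peking University master's positions for students from Hong Kong, Macao, Taiwan, and abroad: 1.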

Research Internships


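   Outstanding undergraduate and graduate students from top universities worldwide are welcome to apply for on-site research internships of six months or longer. Interns work at the Peking University–Galbot Joint Laboratory of Embodied AI (Dinghao Building, Zhongguancun), which offers a first-class research environment and a competitive living and housing stipend.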

Awards and Honors

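•   2026 Beijing Youth May Fourth Medal
•   2025 World Internet Conference Award for Pioneering Science and Technology
•   2025 MIT Technology Review "Innovators Under 35" (TR35 China)
•   2024 Ant Technology Award
•   2024 Honorary Scholar, Intel China Academic Talent Program
•   2023 International Conference on Computer Vision (ICCV) Best Paper Finalist
•   2023 International Conference on Robotics and Automation (ICRA) Best Manipulation Paper Finalist
•   2022 World Artificial Intelligence Conference Youth Outstanding Paper Award
•   2019 Eurographics Best Paper Nomination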

Academic Service

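•   Deputy Head of the Embodied Intelligence Group, Artificial Intelligence Standardization Technical Committee, Ministry of Industry and Information Technology
•   Member of the Third Science and Technology Innovation Advisory Committee, Shanghai Stock Exchange
•   Executive Member of the Embodied Intelligence Technical Committee, Chinese Association for Artificial Intelligence
•   Area Chair for international conferences, including CVPR, ICCV, and CoRL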

Publications

For the latest publications, see the personal website: https://hughw19.github.io/
*Equal contribution  †Corresponding author


HDFlow: Hierarchical Diffusion-Flow Planning for Long-horizon Tasks

Nandiraju Gireesh*, Yuanliang Ju*, Chaoyi Xu, Weiheng Liu, Yuxuan Wan, He Wang†

International Conference on Machine Learning (ICML 2026, spotlight)


LIMMT: Less is More for Motion Tracking

Yu Guan*, Zekun Qi*, Chenghuai Lin, Xuchuan Chen, Dairu Liu, Wenyao Zhang, Jilong Wang, Xinqiang Yu, He Wang†, Li Yi†

International Conference on Machine Learning (ICML 2026)


LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

Jiangran Lyu*, Kai Liu*, Xuheng Zhang*, Haoran Liao, Yusen Feng, Wenxuan Zhu, Tingrui Shen, Jiayi Chen, Jiazhao Zhang, Yifei Dong, Wenbo Cui, Senmao Qi, Shuo Wang, Yixin Zheng, Mi Yan, Xuesong Shi, Haoran Li, Dongbin Zhao, Ming-Yu Liu, Zhizheng Zhang†, Li Yi†, Yizhou Wang†, He Wang†

Robotics: Science and Systems (RSS 2026)


Emerging Extrinsic Dexterity in Cluttered Scenes via Dynamics-aware Policy Learning

Yixin Zheng*, Jiangran Lyu*, Yifan Zhang, Jiayi Chen, Mi Yan, Yuntian Deng, Xuesong Shi, Xiaoguang Zhao, Yizhou Wang, Zhizheng Zhang†, He Wang†

Robotics: Science and Systems (RSS 2026)


StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision

Shengliang Deng*, Mi Yan*, Yixin Zheng*, Jiayi Su, Wenhao Zhang, Xiaoguang Zhao, Heming Cui, Zhizheng Zhang†, He Wang†

Robotics: Science and Systems (RSS 2026)


Humanoid Generative Pre-Training for Zero-Shot Motion Tracking

Zekun Qi*, Xuchuan Chen*, Jilong Wang*, Chenghuai Lin*, Yunrui Lian, Zhikai Zhang, Yu Guan, Wenyao Zhang, Xinqiang Yu, He Wang, Li Yi†

Conference on Computer Vision and Pattern Recognition 2026


Layered 4D-Rotor Gaussian Splatting: A Compressed Representation for Long Dynamic Scenes

Hanjie Xu*, Yuanxing Duan*, Qiyu Dai*, Ge Li†, Baoquan Chen†, He Wang†

Conference on Computer Vision and Pattern Recognition 2026


CLAR: Learning 3D Representations for Robotic Manipulation by Fusing Masked Reconstruction with Multi-Level Contrastive Alignment

Wenbo Cui*, Chengyang Zhao*, Yuhui Chen, Haoran Li, Zhizheng Zhang, Dongbin Zhao, He Wang†

IEEE International Conference on Robotics and Automation 2026


NavGSim: High-Fidelity Gaussian Splatting Simulator for Large-Scale Navigation

Jiahang Liu*, Yuanxing Duan*, Jiazhao Zhang*, Minghan Li, Shaoan Wang, Zhizheng Zhang†, He Wang†

IEEE International Conference on Robotics and Automation 2026


Robust Differentiable Collision Detection for General Objects

Jiayi Chen, Wei Zhao, Liangwang Ruan, Baoquan Chen, He Wang†

IEEE International Conference on Robotics and Automation 2026


UrbanVLA: A Vision-Language-Action Model for Urban Micromobility

Anqi Li*, Zhiyong Wang*, Jiazhao Zhang*, Minghan Li, Zhibo Chen, Zhizheng Zhang†, He Wang†

IEEE International Conference on Robotics and Automation 2026


TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking

Jiahang Liu*, Yunpeng Qi*, Jiazhao Zhang*, Minghan Li, Shaoan Wang, Kui Wu, Hanjing Ye, Hong Zhang, Zhibo Chen, Fangwei Zhong, Zhizheng Zhang†, He Wang†

IEEE International Conference on Robotics and Automation 2026


OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

Mengdi Jia*, Zekun Qi*, Shaochen Zhang, Wenyao Zhang, Xinqiang Yu, Jiawei He, He Wang†, Li Yi†

The Fourteenth International Conference on Learning Representations


Embodied Navigation Foundation Model

Jiazhao Zhang*, Anqi Li*, Yunpeng Qi*, Minghan Li*, Jiahang Liu, Shaoan Wang, Haoran Liu, Gengze Zhou, Yuze Wu, Xingxing Li, Yuxin Fan, Wenjun Li, Zhibo Chen, Fei Gao, Qi Wu, Zhizheng Zhang†, He Wang†

The Fourteenth International Conference on Learning Representations


FoldNet: Learning Generalizable Closed-Loop Policy for Garment Folding via Keypoint-Driven Asset and Demonstration Synthesis

Yuxing Chen*, Bowen Xiao*, He Wang†

IEEE Robotics and Automation Letters


SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Zekun Qi*, Wenyao Zhang*, Yufei Ding*, Runpei Dong, XinQiang Yu, Jingwen Li, Lingyun Xu, Baoyu Li, Xialin He, Guofan Fan, Jiazhao Zhang, Jiawei He, Jiayuan Gu, Xin Jin, Kaisheng Ma, Zhizheng Zhang†, He Wang†, Li Yi†

Neural Information Processing Systems 2025 (spotlight)


TrackVLA: Embodied Visual Tracking in the Wild

Shaoan Wang*, Jiazhao Zhang*, Minghan Li, Jiahang Liu, Anqi Li, Kui Wu, Fangwei Zhong, Junzhi Yu, Zhizheng Zhang†, He Wang†

Conference on Robot Learning 2025


FetchBot: Learning Generalizable Object Fetching in Cluttered Scenes via Zero-Shot Sim2Real

Weiheng Liu*, Yuxuan Wan*, Jilong Wang, Yuxuan Kuang, Wenbo Cui, Xuesong Shi, Haoran Li, Dongbin Zhao, Zhizheng Zhang†, He Wang†

Conference on Robot Learning 2025 (Oral)


GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data

Shengliang Deng*, Mi Yan*, Songlin Wei, Haixin Ma, Yuxin Yang, Jiayi Chen, Zhiqi Zhang, Taoyu Yang, Xuheng Zhang, Wenhao Zhang, Heming Cui, Zhizheng Zhang†, He Wang†

Conference on Robot Learning 2025


DexVLG: Dexterous Vision-Language-Grasp Model at Scale

Jiawei He*, Danshi Li*, Xinqiang Yu*, Zekun Qi, Wenyao Zhang, Jiayi Chen, Zhaoxiang Zhang†, Zhizheng Zhang†, Li Yi†, He Wang†

International Conference on Computer Vision 2025 (highlight)


DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation

Jiangran Lyu, Ziming Li, Xuesong Shi, Chaoyi Xu, Yizhou Wang†, He Wang†

International Conference on Computer Vision 2025


RoboHanger: Learning Generalizable Robotic Hanger Insertion for Diverse Garments

Yuxing Chen, Songlin Wei, Bowen Xiao, Jiangran Lyu, Jiayi Chen, Feng Zhu, He Wang†

IEEE Robotics and Automation Letters


Dexonomy: Synthesizing All Dexterous Grasp Types in a Grasp Taxonomy

Jiayi Chen*, Yubin Ke*, Lin Peng, He Wang†

Robotics: Science and Systems 2025


Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks

Jiazhao Zhang, Kunyu Wang, Shaoan Wang, Minghan Li, Haoran Liu, Songlin Wei, Zhongyuan Wang, Zhizheng Zhang†, He Wang†

Robotics: Science and Systems 2025


Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

Enshen Zhou*, Qi Su*, Cheng Chi*†, Zhizheng Zhang, Zhongyuan Wang, Tiejun Huang, Lu Sheng†, He Wang†

Conference on Computer Vision and Pattern Recognition 2025


GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation

Wenbo Cui*, Chengyang Zhao*, Songlin Wei*, Jiazhao Zhang, Haoran Geng, Yaran Chen, He Wang†

IEEE International Conference on Robotics and Automation 2025


BODex: Scalable and Efficient Robotic Dexterous Grasp Synthesis Using Bilevel Optimization

Jiayi Chen*, Yubin Ke*, He Wang†

IEEE International Conference on Robotics and Automation 2025


NaVid-4D: Unleashing Spatial Intelligence in Egocentric RGB-D Videos for Vision-and-Language Navigation

Haoran Liu*, Weikang Wan*, Xiqian Yu*, Minghan Li*, Jiazhao Zhang, Bo Zhao, Zhibo Chen, Zhongyuan Wang, Zhizheng Zhang†, He Wang†

IEEE International Conference on Robotics and Automation 2025


QuadWBG: Generalizable Quadrupedal Whole-Body Grasping

Jilong Wang*, Javokhirbek Rajabov*, Chaoyi Xu, Yiming Zheng, He Wang†

IEEE International Conference on Robotics and Automation 2025


Watch Less, Feel More: Direct Sim-to-real RL for Articulated Object Manipulation with Motion Adaptation and Impedance Control

Tan-Dzung Do, Gireesh Nandiraju, Jilong Wang, He Wang†

IEEE International Conference on Robotics and Automation 2025