I'm Yanwei Li (李彦玮), currently working as a Research Scientist on Foundation Models for Vision & Language at ByteDance, Seattle, USA.
Before that, I obtained my Ph.D. degree from The Chinese University of Hong Kong (CUHK), supervised by Prof. Jiaya Jia.
For more about my experience, please refer to Publication and my Google Scholar.
The Chinese University of Hong Kong
Fields: Computer Vision
Ph.D., Computer Science and Engineering, 2020 - 2024.
Institute of Automation, Chinese Academy of Sciences
Fields: Computer Vision
M.Phil, Pattern Recognition and Intelligent System, 2017 - 2020.
Central South University
B.E., Automation & Mechanical Engineering, 2013 - 2017.
NVIDIA Research
Fields: Perception for Autonomous Vehicles.
Research Intern, 2022.06 - 2023.04
Megvii (Face++)
Fields: 2D & 3D Detection and Segmentation.
Research Intern, 2019.01 - 2022.05
Horizon Robotics
Fields: Panoptic Segmentation.
Research Intern, 2018.04 - 2018.12
IBM Research
Fields: Fine-grained Recognition.
Research Intern, 2017.08 - 2018.01
Area Chair:
Neural Information Processing Systems (NeurIPS), 2025.
Academic Talk:
"LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models", MIT/Huawei/Tencent, 2023. [slides]
"Representation for Multi-modality 3D Detection with Transformer", ZhiDongXi, 2022. [slides]
"Towards Fully Convolutional Panoptic Segmentation", ByteDance AI & BAAI, 2021. [slides]
"Dynamic Network and Semantic Segmentation", Paper Weekly, 2020. [slides]
"FPN-based Network for Panoptic Segmentation", ECCV COCO Workshop, 2018. [slides]
Microsoft Fellowship Nomination, 2022
Postgraduate Scholarship, 2020-2024
National Scholarship, 2019
National Scholarship, 2016