My Homepage

Research for a better world.

I'm Yanwei Li (李彦玮), currently working as a Research Scientist on Foundation Model for Vision & Language at ByteDance, Seattle, USA. Before that, I obtained Ph.D degree in The Chinese University of Hong Kong (CUHK), supervised by Prof. Jiaya Jia.

My research interest mainly focus on Multi-modality Foundation Model and Generative AI.
More experiences about me please refer to Publication and my Google Scholar.


Education

The Chinese University of Hong Kong
Fields: Computer Vision
Ph.D, Computer Science and Engineering, 2020 - 2024.

Institute of Automation, Chinese Academy of Sciences
Fields: Computer Vision
M.Phil, Pattern Recognition and Intelligent System, 2017 - 2020.

Central South University
B.E., Automation & Mechanical Engineering, 2013 - 2017.


Experience

NVIDIA Research
Fields: Perception for Autonomous Vehicle.
Research Intern, 2022.06 - 2023.04

Megvii (Face++)
Fields: 2D & 3D Detection and Segmentation.
Research Intern, 2019.01 - 2022.05

Horizon Robotics
Fields: Panoptic Segmentation.
Research Intern, 2018.04 - 2018.12

IBM Research
Fields:Fine-grained Recognition.
Research Intern, 2017.08 - 2018.01


Activity

Conference Reviewer:
International Conference on Learning Representations (ICLR).
International Conference on Machine Learning (ICML).
Neural Information Processing Systems (NeurIPS).
IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
IEEE International Conference on Computer Vision (ICCV).
European Conference on Computer Vision (ECCV).
AAAI Conference on Artificial Intelligence (AAAI).

Journal Reviewer:
IEEE Transactions on Pattern Analysis and Machine Intelligence.
International Journal of Computer Vision.
IEEE Transactions on Image Processing.
Pattern Recognition.

Academic Talk:
"LLaMA-VID:An Image is Worth 2 Tokens in Large Language Models", MIT/Huawei/Tencent, 2023. [slides]
"Representation for Multi-modality 3D Detection with Transformer", ZhiDongXi, 2022. [slides]
"Towards Fully Convolutional Panoptic Segmentation", ByteDance AI & BAAI, 2021. [slides]
"Dynamic Network and Semantic Segmentation", Paper Weekly, 2020. [slides]
"FPN-based Network for Panoptic Segmentation", ECCV COCO Workshop, 2018. [slides]

Teaching Assistant:
CSCI1580: Visual Programming, Fall, 2022.
ENGG5104: Image Processing and Computer Vision, Spring, 2022.
CSCI1580: Visual Programming, Fall, 2021.
CSCI2100B: Data Structures, Spring, 2021.


Award

Microsoft Fellowship Nomination, 2022
Postgraduate Scholarship, 2020-2024
National Scholarship, 2019
National Scholarship, 2016