|
EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model
Feipeng Ma, Yizhou Zhou, Hebei Li, Zilong He, Siying Wu, Fengyun Rao, Yueyi Zhang, Xiaoyan Sun
USTC / WeChat
arXiv, 2024
[arXiv]
|
|
Multi-Modal Generative Embedding Model
Feipeng Ma, Hongwei Xue, Guangting Wang, Yizhou Zhou, Fengyun Rao, Shilin Yan, Yueyi Zhang, Siying Wu, Mike Zheng Shou, Xiaoyan Sun
USTC / WeChat / NUS / FDU
arXiv, 2024
[arXiv]
|
|
Visual Perception by Large Language Model's Weights
Feipeng Ma, Hongwei Xue, Yizhou Zhou, Guangting Wang, Fengyun Rao, Shilin Yan, Yueyi Zhang, Siying Wu, Mike Zheng Shou, Xiaoyan Sun
USTC / WeChat / NUS / FDU
NeurIPS, 2024
[arXiv] / [code] / [project]
|
|
Task Navigator: Decomposing Complex Tasks for Multimodal Large Language Models
Feipeng Ma, Yizhou Zhou, Yueyi Zhang, Siying Wu, Zheyu Zhang, Zilong He, Fengyun Rao, Xiaoyan Sun
USTC / WeChat
CVPR Workshop, 2024
[paper]
|
|
Image Captioning with Multi-Context Synthetic Data
Feipeng Ma, Yizhou Zhou, Fengyun Rao, Yueyi Zhang, Xiaoyan Sun
USTC / WeChat
AAAI, 2024
[arXiv]
|
|
Multimodal Sentiment Analysis with Preferential Fusion and Distance-aware Contrastive Learning
Feipeng Ma, Yueyi Zhang, Xiaoyan Sun
USTC
ICME, 2023 (Oral)
[paper] / [code]
|
|
Meta AI Video Similarity Challenge: Descriptor Track
CVPR, 2023 (Rank 1)
[Technical Report] / [code]
|
|
Meta AI Video Similarity Challenge: Matching Track
CVPR, 2023 (Rank 1)
[Technical Report] / [code]
|
WeChat, Tencent Inc., Beijing, China
Research Intern, Jan. 2023 - Present
|
University of Science and Technology of China, Hefei, China
PhD Student, Sept. 2021 - Present
|
Sun Yat-sen University, Guangzhou, China
Undergraduate Student, Sept. 2017 - Jun. 2021
|
|