Zhang Erli ZhangErliCarl

👋 Hello, I'm ZHANG Erli!

👤 About me

I am a first-year PhD student at the National University of Singapore 🇸🇬, majoring in Biomedical Engineering. Prior to this, I obtained a Bachelor of Engineering in Computer Science from Nanyang Technological University. My current research interests include AI in Healthcare, Surgical Video Analysis, and Large Multimodal Models.

Resume: Resume
Homepage: Homepage
Google Scholar: Profile
LinkedIn: LinkedIn

📖 Publications

Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning

Conference: NeurIPS 2024 Workshop
Description: Efficient Segment Anything 2 (SAM2) with frame pruning mechanism for real-time surgical video segmentation
📖 Paper

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

Conference: CVPR 2024
Description: Low-level visual instruction tuning for multi-modality LLMs
📖 Paper

Q-Bench: Multi-Modality Benchmarking

Conference: ICLR 2024 (spotlight)
Description: A benchmark for multi-modality LLMs on low-level vision and visual quality assessment.
📖 Paper

MaxVQA/MaxWell: Towards Explainable VQA

Conference: ACMMM 2023 (oral)
Description: Introduced a 16-dimensional VQA Dataset and Method for a more explainable VQA.
📖 Paper

DOVER: NR-VQA Method

Conference: ICCV 2023
Description: A state-of-the-art NR-VQA method that predicts disentangled aesthetic and technical quality.
📖 Paper

📬 Contact Me

Email: [email protected] or [email protected]
Twitter: @zhang_erli

Provide feedback

Saved searches

Use saved searches to filter your results more quickly