Skip to content
View cmhungsteve's full-sized avatar

Highlights

  • Pro

Organizations

@MediaTek-NeuroPilot

Block or report cmhungsteve

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
cmhungsteve/README.md

Hi there 👋

My name is Min-Hung (Steve) Chen (陳敏弘 in Chinese). I am a Senior Research Scientist at NVIDIA Research Taiwan, working on Vision X Multi-Modal AI. I received my Ph.D. degree from Georgia Tech, advised by Prof. Ghassan AlRegib and in collaboration with Prof. Zsolt Kira. Before joining NVIDIA, I was working on Biometric Research for Cognitive Services as a Research Engineer II at Microsoft Azure AI, and was working on Edge-AI Research as a Senior AI Engineer at MediaTek, respectively.

My research interest is mainly Multi-Modal AI, including Vision-Language, Video Understanding, Cross-Modal Learning, Efficient Tuning, and Transformer. I am also interested in Learning without Fully Supervision, including domain adaptation, transfer learning, continual learning, X-supervised learning, etc.

[Update] I released a comprehensive paper list for Vision Transformer & Attention to facilitate related research. Feel free to check it (I would be appreciative if you can ★STAR it)!

[Personal Website][LinkedIn][Twitter][Google Scholar][Resume]

Min-Hung (Steve)'s GitHub stats

Pinned Loading

  1. NVlabs/DoRA NVlabs/DoRA Public

    [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

    Python 665 44

  2. Awesome-Transformer-Attention Awesome-Transformer-Attention Public

    An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

    4.7k 490

  3. SSTDA SSTDA Public

    [CVPR 2020] Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation (PyTorch)

    Python 154 23

  4. TA3N TA3N Public

    [ICCV 2019 (Oral)] Temporal Attentive Alignment for Large-Scale Video Domain Adaptation (PyTorch)

    Python 261 41

  5. chihyaoma/Activity-Recognition-with-CNN-and-RNN chihyaoma/Activity-Recognition-with-CNN-and-RNN Public

    Temporal Segments LSTM and Temporal-Inception for Activity Recognition

    Lua 440 147

  6. MediaTek-NeuroPilot/mai21-learned-smartphone-isp MediaTek-NeuroPilot/mai21-learned-smartphone-isp Public

    The official codebase for the Learned Smartphone ISP Challenge in MAI @ CVPR 2021

    Jupyter Notebook 108 24