Skip to content
View thaoshibe's full-sized avatar
🐾
Why are you looking at me?
🐾
Why are you looking at me?

Highlights

  • Pro

Block or report thaoshibe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 599 28 Updated Aug 31, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 623 35 Updated Aug 5, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,195 773 Updated Aug 21, 2024

Utilities intended for use with Llama models.

Python 3,632 619 Updated Aug 30, 2024

Pytorch Implementation of "Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model"

Python 102 8 Updated Jul 14, 2024

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,331 208 Updated Apr 15, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,727 107 Updated Jul 29, 2024

🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant

42 1 Updated Jul 8, 2024

Your image is almost there!

Python 7,151 415 Updated Jul 26, 2024

A collection of Vietnamese women who are currently working in the field of Computer Science.

SCSS 11 Updated Jul 23, 2024
Jupyter Notebook 1 Updated Dec 31, 2023

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Python 270 20 Updated Jul 17, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 10,398 10,870 Updated Aug 28, 2024

A curated list of Awesome Makeup Transfer resources

207 31 Updated Apr 23, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,463 2,056 Updated Aug 9, 2024

[WACV 2024] An implementation of MEGANet for polyp segmentation with multi-scale edge-guided attention

Python 38 3 Updated Feb 5, 2024

✨✨Latest Advances on Multimodal Large Language Models

11,449 744 Updated Aug 30, 2024

[ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers

Python 119 2 Updated Jun 14, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 9,901 780 Updated Aug 20, 2024

✏️ Edit One for All: Interactive Batch Image Editing (CVPR 2024)

Python 45 3 Updated Aug 8, 2024

💄 Lipstick ain't enough: Beyond Color-Matching for In-the-Wild Makeup Transfer (CVPR 2021)

Python 367 58 Updated Jul 22, 2024

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Python 1,489 99 Updated Jul 22, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,004 2,080 Updated Aug 12, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,637 1,504 Updated Aug 29, 2024

State-of-the-art 2D and 3D Face Analysis Project

Python 22,674 5,331 Updated Aug 30, 2024

👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023)

Python 80 2 Updated Dec 19, 2023

Official PyTorch implementation of the paper "Neural Congealing: Aligning Images to a Joint Semantic Atlas" (CVPR 2023)

Python 47 4 Updated Aug 14, 2023

Awesome-DragGAN: A curated list of papers, tutorials, repositories related to DragGAN

82 2 Updated Nov 8, 2023

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,197 653 Updated Aug 14, 2024
Next