Skip to content
View kobeshegu's full-sized avatar
🍉
Focusing
🍉
Focusing
Block or Report

Block or report kobeshegu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 858 47 Updated Aug 19, 2024

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 392 15 Updated Aug 16, 2024

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 5,697 519 Updated Aug 19, 2024

Official implementation of Add-SD: Rational Generation without Manual Reference.

Jupyter Notebook 25 1 Updated Aug 19, 2024

[ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback

Python 8 Updated Aug 18, 2024

Official inference repo for FLUX.1 models

Python 8,447 526 Updated Aug 16, 2024

[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation

Python 181 10 Updated Jun 29, 2024

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models

Jupyter Notebook 230 9 Updated Aug 2, 2024

PyTorch implementation of MAR DiffLoss https://arxiv.org/abs/2406.11838

Python 563 28 Updated Aug 13, 2024

Utilities intended for use with Llama models.

Python 3,445 568 Updated Aug 15, 2024

A curated list of foundation models for vision and language tasks

725 33 Updated Aug 17, 2024

Tracking and collecting papers/projects/others related to Segment Anything.

1,490 128 Updated Aug 16, 2024

A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)

Python 239 15 Updated Aug 12, 2024

[ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image

16 1 Updated Jul 19, 2024
Python 11 Updated Jul 13, 2024
Python 7,045 546 Updated Aug 12, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 654 50 Updated Jul 29, 2024

Bring portraits to life!

Python 10,194 990 Updated Aug 19, 2024

Understand Human Behavior to Align True Needs

Python 3,182 279 Updated Jul 20, 2024

Vico: Compositional Video Generation as Flow Equalization

Python 43 2 Updated Jul 9, 2024

[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation

420 13 Updated Jul 1, 2024
Python 151 2 Updated Jul 15, 2024

The program used to occupy GPUs.

Python 6 1 Updated Mar 24, 2023

Kolors Team

Python 3,070 180 Updated Aug 6, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,658 104 Updated Aug 3, 2024

[BSQ-ViT] Image and Video Tokenization with Binary Spherical Quantization

Python 71 Updated Jun 12, 2024

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

Python 503 24 Updated Jul 5, 2024

[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…

Jupyter Notebook 468 20 Updated Jul 13, 2024

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝

Python 357 26 Updated Jul 26, 2024

Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"

Python 119 2 Updated Jul 3, 2024
Next