🎯
Never, ever give up, right to the very end


Starred repositories


High-resolution models for human tasks.

Python 4,046 211 Updated Sep 25, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,877 213 Updated Sep 27, 2024

VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your Python code execution.

Python 4,924 368 Updated Aug 29, 2024

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 81,896 7,564 Updated Sep 28, 2024

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.

JavaScript 23,517 2,375 Updated Sep 28, 2024

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,225 176 Updated Sep 27, 2024

SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.

4,386 409 Updated Sep 18, 2024

Visualizing the attention of vision-language models

Jupyter Notebook 40 3 Updated Aug 7, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 866 39 Updated Sep 26, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,637 1,029 Updated Sep 10, 2024

Official inference repo for FLUX.1 models

Python 14,236 1,022 Updated Sep 13, 2024

[ECCV 2022] Code for paper "DaViT: Dual Attention Vision Transformer"

Python 322 33 Updated Feb 13, 2024

Real-time face swap and one-click video deepfake with only a single image

Python 37,209 5,260 Updated Sep 28, 2024

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 2,657 231 Updated Sep 5, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,467 153 Updated Aug 30, 2024

【NeurIPS 2024】Dense Connector for MLLMs

Python 105 4 Updated Sep 26, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,334 384 Updated Sep 28, 2024

Python 17 1 Updated Aug 2, 2024

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Jupyter Notebook 652 38 Updated Jul 30, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 27,853 3,156 Updated Sep 26, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,076 940 Updated Aug 21, 2024

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 418 43 Updated Sep 20, 2024

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 263 9 Updated Sep 23, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

736 20 Updated Jul 31, 2024

Official PyTorch implementation of Revisiting Image Pyramid Structure for High Resolution Salient Object Detection (ACCV 2022)

Python 448 69 Updated Jan 29, 2024

LLM101n: Let's build a Storyteller

28,968 1,585 Updated Aug 1, 2024

VisionLLM Series

Python 857 21 Updated Sep 13, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 650 36 Updated Aug 5, 2024

Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).

Python 277 14 Updated Sep 26, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

1,061 22 Updated Jul 31, 2024