Block or Report
Block or report zcgeqian
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (1)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)
Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…
An open source implementation of CLIP.
This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024
EVA Series: Visual Representation Fantasies from BAAI
(CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C 等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
Lumina-T2X is a unified framework for Text to Any Modality Generation
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
A Change Detection Repo Standing on the Shoulders of Giants
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Collection of NSFW images URLs for the purposes of training an NSFW Image Classifier
The official implementation of Self-Play Preference Optimization (SPPO)
DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution
Enjoy the magic of Diffusion models!
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Explore the Limits of Omni-modal Pretraining at Scale
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Open-Sora: Democratizing Efficient Video Production for All
A generative speech model for daily dialogue.