Easy-to-use image segmentation library with an awesome pre-trained model zoo, supporting a wide range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …

Python 8,602 1,681 Updated Oct 1, 2024

RyanWangZf / MedCLIP

EMNLP'22 | MedCLIP: Contrastive Learning from Unpaired Medical Images and Texts

Python 436 48 Updated Apr 12, 2024

linzixuan45 / Robust_video_matting

An optimized portrait matting algorithm that achieves real-time matting locally

Python 1 Updated Aug 30, 2023

zjp-shadow / CharacterGen

[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization

JavaScript 517 44 Updated Sep 10, 2024

TMElyralab / MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Python 2,149 154 Updated Aug 7, 2024

MoyGcc / vid2avatar

Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition (CVPR2023)

Python 1,230 101 Updated May 21, 2024

tijiang13 / InstantAvatar

InstantAvatar: Learning Avatars from Monocular Video in 60 Seconds (CVPR 2023)

Python 365 31 Updated Aug 2, 2024

infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 18,557 1,882 Updated Oct 5, 2024

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 17,817 1,706 Updated Oct 4, 2024

songquanpeng / one-api

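OpenAI API management and distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot, iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to redistribute and manage keys, ships as a single executable with a prebuilt Docker image, and supports one-click deployment out of the box. OpenAI key management & redistributi…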

JavaScript 18,249 4,125 Updated Sep 22, 2024

hiroi-sora / Umi-OCR

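OCR software, free, open-source, and offline. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Multi-language libraries are built in.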

Python 26,089 2,639 Updated Sep 29, 2024

NanGePlus / GraphragTest

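Provides a drop-in alternative to GPT models so that GraphRAG can be used with non-GPT LLMs. Supports many model types, such as local models (Ollama), Alibaba Cloud Tongyi Qianwen, Baidu Wenxin Qianfan, Zhipu ChatGLM, iFlytek Spark, Moonshot AI, and Google Gemini. The sample code uses Alibaba's Tongyi Qianwen model; other models are used the same way.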

Python 106 24 Updated Sep 6, 2024

x007xyz / fly-cut

A web-based video editing tool implemented with WebCodecs, similar to CapCut Web.

Vue 380 55 Updated Sep 20, 2024

magic-research / bubogpt

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

Python 500 33 Updated Jul 21, 2023

modelscope / FunClip

Open-source, accurate, and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.

Python 3,371 360 Updated Aug 22, 2024

princepride / scratch-pytorch-step-by-step

Teaches you to implement a deep learning framework step by step using only the most basic Python syntax and NumPy

Jupyter Notebook 118 14 Updated Aug 2, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,184 659 Updated Sep 30, 2024

bdashore3 / flash-attention

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 233 20 Updated Jul 26, 2024

qxt-expor

Lists (1)

video understanding

Starred repositories

evgkanias / sky-gui

PetrVevoda / pragueskymodel

skhu101 / GauHuman

PeterL1n / RobustVideoMatting

PeterL1n / BackgroundMattingV2

PaddlePaddle / PaddleSeg

RyanWangZf / MedCLIP

linzixuan45 / Robust_video_matting

zjp-shadow / CharacterGen

TMElyralab / MusePose

MoyGcc / vid2avatar

tijiang13 / InstantAvatar

infiniflow / ragflow

microsoft / graphrag

songquanpeng / one-api

hiroi-sora / Umi-OCR

NanGePlus / GraphragTest

x007xyz / fly-cut

magic-research / bubogpt

modelscope / FunClip

princepride / scratch-pytorch-step-by-step

modelscope / FunASR

bdashore3 / flash-attention

mbzuai-oryx / Video-ChatGPT

opendatalab / labelU

THUDM / GLM-4

andimarafioti / florence2-finetuning

mgjinnn / TurtleSoupBaseline

langgenius / dify

locaal-ai / obs-cleanstream

Starred topics

3d-human-reconstruction