Skip to content
View xingtianqz's full-sized avatar

Block or report xingtianqz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 56,874 6,026 Updated Nov 17, 2024

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

MATLAB 6,958 1,854 Updated Jun 1, 2024

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 4,144 473 Updated Nov 15, 2024

Brand new TTS solution

Python 14,497 1,100 Updated Nov 14, 2024

坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.

32,511 3,563 Updated May 29, 2024

Multilingual Voice Understanding Model

Python 3,440 311 Updated Oct 18, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 6,312 673 Updated Nov 15, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 8,993 1,422 Updated Aug 9, 2024

DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.

Python 3,782 476 Updated May 25, 2024

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Python 369 30 Updated Jan 25, 2024

🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。

Python 19 1 Updated Jun 5, 2024

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM , etc. It is not excluded that more models will be supported in the future. At the …

Python 804 126 Updated Nov 14, 2024

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Python 330 54 Updated Oct 1, 2024

手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube

Jupyter Notebook 2,075 301 Updated Jul 15, 2024

《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程

Jupyter Notebook 9,421 1,087 Updated Nov 16, 2024

The official Meta Llama 3 GitHub site

Python 27,140 3,070 Updated Aug 12, 2024

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 13,747 1,854 Updated Nov 18, 2024

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 18,341 4,844 Updated Nov 17, 2024

智能微秘书,全能的微信机器人管理平台,最简单的方式接入ChatGPT,FastGPT,Dify,Coze,扣子.支持绘图,语音识别,语音发送,定时任务,支持企微、公众号、5G 消息、WhatsApp

JavaScript 1,863 294 Updated Nov 17, 2024

🚀 MaxKB 是一款基于大语言模型和 RAG 的开源知识库问答系统,广泛应用于智能客服、企业内部知识库、学术研究与教育等场景。

Python 11,490 1,504 Updated Nov 15, 2024

Inference code for Llama models

Python 56,441 9,571 Updated Aug 18, 2024

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,508 237 Updated May 1, 2024

The open source platform for AI-native application development.

Python 6,214 317 Updated Nov 1, 2024

Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)

Jupyter Notebook 508 38 Updated Oct 29, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 98,220 7,820 Updated Nov 18, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

16,006 1,478 Updated Sep 19, 2024

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Python 233 23 Updated Mar 20, 2024

SplitML (Signal Processing Library for Interference rejecTion by Machine Learning) is a code repository for a set of tools for interference rejection in complex time-domain signals.

Python 6 2 Updated Nov 13, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80 languages recognition, provide data annotation and synthesis tools, support training and…

Python 44,296 7,826 Updated Nov 16, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 35,731 4,073 Updated Nov 7, 2024
Next