Skip to content
View mystijk's full-sized avatar

Block or report mystijk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open Platform for Embodied Agents

Python 268 15 Updated Oct 13, 2024

A Fundamental End-to-End Speech Recognition Toolkit

Python 1 Updated Oct 11, 2024

MMeRAG is an open-source RAG (Retrieval-Augmented Generation), Provides a parser for audio and video data to implement RAG for audio and video. MMeRAG是一个开源的RAG项目,提供了一种用于音频和视频数据的解析器,用来实现音视频的RAG。

Python 5 Updated Sep 24, 2024

A Simple and Efficient Implementation Of Fast Fourier Transform For Audio Denoise

C 100 57 Updated Aug 11, 2020

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Python 776 61 Updated Aug 27, 2024

心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1

Python 839 121 Updated Oct 21, 2024

An Open-Sourced LLM-empowered Foundation TTS System

Python 429 29 Updated Oct 17, 2024
HTML 67 7 Updated May 10, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 867 60 Updated Nov 4, 2024

A diffusers pipeline for zero shot stylised couples portrait creation

Python 90 9 Updated Sep 25, 2024

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 725 29 Updated Nov 6, 2024

✨✨Latest Advances on Multimodal Large Language Models

12,607 806 Updated Nov 10, 2024

OmniControl: Control Any Joint at Any Time for Human Motion Generation, ICLR 2024

Python 243 17 Updated Jun 14, 2024

A 3DGS framework for omni urban scene reconstruction and simulation.

Python 574 46 Updated Sep 6, 2024

An ASR model for transcribing laughter and speech-laugh in conversational speech

Python 1 Updated Nov 12, 2024

实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and …

Python 308 36 Updated Nov 8, 2024

SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark

63 2 Updated Nov 5, 2024

[SIGGRAPH Asia 2024] PuzzleAvatar: Assembling 3D Avatars from Personal Albums

Python 243 10 Updated Nov 12, 2024

Animatable Gaussian textured Avatar

Python 46 2 Updated Jun 24, 2024

[ICCV 2023]ToonTalker: Cross-Domain Face Reenactment

Python 104 8 Updated Oct 29, 2024

[ICML 2024] 🍅HumanTOMATO: Text-aligned Whole-body Motion Generation

Python 288 8 Updated Jun 19, 2024

A tool to tranform the flame texture space,shape and pose paramerter into SMPL or SMPLX model 's head(or face).

Python 35 2 Updated Mar 22, 2024

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。

Python 1,540 183 Updated Nov 6, 2024

20 high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,668 1,062 Updated Nov 11, 2024

Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars

Jupyter Notebook 303 17 Updated Oct 6, 2024
Python 1 Updated Sep 19, 2024

ViViD: Video Virtual Try-on using Diffusion Models

Python 463 31 Updated Jun 21, 2024

[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

Python 256 33 Updated Jul 30, 2024

Talk to your database as if you were chatting with a friend. Turn natural language into powerful SQL queries effortlessly, and get your answers back in a language you understand. No technical jargo…

TypeScript 4 Updated Nov 12, 2024

First Place Winner at Delta Hacks 5. Analyses speech, hand gestures, and facial expressions and gives both real-time feedback as well as a summary of results at the end.

Python 37 5 Updated Dec 10, 2022
Next