Starred repositories
📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.
Markdown parser, done right. 100% CommonMark support, extensions, syntax plugins & high speed
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Text-to-image generation: CogView3-Plus and CogView3 (ECCV 2024)
An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
A comment system powered by GitHub Discussions. 💬 💎
GPT4V-level open-source multi-modal model based on Llama3-8B
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", SIGGRAPH Asia 2024
VideoSys: An easy and efficient system for video generation
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
Processing engine and React components for constructing configuration-based data transformation and processing pipelines.
Collection of AI-related utilities. Issues and pull requests are welcome.
Browser extension and cross-platform desktop application for select-to-translate translation based on the ChatGPT API.
🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation
🚀CodiumAI PR-Agent: An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍
A command-line productivity tool powered by AI large language models like GPT-4 that helps you accomplish your tasks faster and more efficiently.
Chat with your database (SQL, CSV, pandas, polars, MongoDB, NoSQL, etc.). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
The official gpt4free repository | a varied collection of powerful language models
FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
This repo contains implementations of different architectures for emotion recognition in conversations.