-
Zhejiang University
- Hangzhou, China
-
07:23
(UTC 08:00) - https://www.zju.edu.cn
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C 等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
A complete and graceful API for Wechat. 微信个人号接口、微信机器人及命令行微信,三十行即可自定义个人号机器人。
Conversational RPA SDK for Chatbot Makers. Join our Discord: https://discord.gg/7q8NBZbQzt
VideoSys: An easy and efficient system for video generation
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT4.0/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT4.0/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
LongWriter: Unleashing 10,000 Word Generation from Long Context LLMs
Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Easily train a good VC model with voice data <= 10 mins!
This is an unofficial API based on Python and FastAPI. It currently supports generating songs, lyrics, etc. 👇
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
chinese speech pretrained models
GUI for a Vocal Remover that uses Deep Neural Networks.
SoftVC VITS Singing Voice Conversion
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
vits2 backbone with multilingual-bert
The official Python API for ElevenLabs Text to Speech.
A programming framework for agentic AI 🤖