Skip to content
View krmao's full-sized avatar

Organizations

@multiapk @codesdancing

Block or report krmao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • TEN, the Next-Gen AI-Agent Framework, the world's first truly real-time multimodal AI agent framework.

    C Other Updated Nov 12, 2024
  • websocat Public

    Forked from vi/websocat

    Command-line client for WebSockets, like netcat (or curl) for ws:// with advanced socat-like functions

    Rust MIT License Updated Nov 11, 2024
  • Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

    Jupyter Notebook MIT License Updated Nov 11, 2024
  • Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

    Updated Nov 11, 2024
  • The Amazon Transcribe Streaming SDK is an async Python SDK for converting audio into text via Amazon Transcribe.

    Python Apache License 2.0 Updated Nov 9, 2024
  • Python MIT License Updated Nov 9, 2024
  • Whisper command line client compatible with original OpenAI client based on CTranslate2.

    Python MIT License Updated Nov 8, 2024
  • FunASR Public

    Forked from modelscope/FunASR

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

    Python Other Updated Nov 8, 2024
  • aTrain Public

    Forked from JuergenFleiss/aTrain

    A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning models.

    Python Other Updated Nov 7, 2024
  • A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

    TypeScript MIT License Updated Nov 7, 2024
  • silero-vad Public

    Forked from snakers4/silero-vad

    Silero VAD: pre-trained enterprise-grade Voice Activity Detector

    Python MIT License Updated Nov 7, 2024
  • Port of OpenAI's Whisper model in C/C

    C MIT License Updated Nov 6, 2024
  • A nearly-live implementation of OpenAI's Whisper.

    Python MIT License Updated Nov 5, 2024
  • Faster Whisper transcription with CTranslate2

    Python MIT License Updated Nov 5, 2024
  • CTranslate2 Public

    Forked from OpenNMT/CTranslate2

    Fast inference engine for Transformer models

    C MIT License Updated Nov 5, 2024
  • puma Public

    Forked from puma/puma

    A Ruby/Rack web server built for parallelism

    Ruby BSD 3-Clause "New" or "Revised" License Updated Nov 4, 2024
  • LightGBM Public

    Forked from microsoft/LightGBM

    A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …

    C MIT License Updated Nov 4, 2024
  • asdf Public

    Forked from asdf-vm/asdf

    Extendable version manager with support for Ruby, Node.js, Elixir, Erlang & more

    Shell MIT License Updated Nov 1, 2024
  • numpy Public

    Forked from numpy/numpy

    The fundamental package for scientific computing with Python.

    Python Other Updated Oct 30, 2024
  • logseq Public

    Forked from logseq/logseq

    A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. roadmap: http://trello.com/b/8txSM12G/roadmap

    Clojure GNU Affero General Public License v3.0 Updated Oct 30, 2024
  • xgboost Public

    Forked from dmlc/xgboost

    Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

    C Apache License 2.0 Updated Oct 29, 2024
  • Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

    Jupyter Notebook BSD 2-Clause "Simplified" License Updated Oct 27, 2024
  • nvitop Public

    Forked from XuehaiPan/nvitop

    An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

    Python Apache License 2.0 Updated Oct 27, 2024
  • whisper Public

    Forked from openai/whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

    Python MIT License Updated Oct 26, 2024
  • ragflow Public

    Forked from infiniflow/ragflow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

    Python Apache License 2.0 Updated Oct 26, 2024
  • Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80 languages recognition, provide data annotation and synthesis tools, support training and…

    Python Apache License 2.0 Updated Oct 25, 2024
  • wukong-robot Public

    Forked from wzpan/wukong-robot

    🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。

    Python MIT License Updated Oct 25, 2024
  • functionary Public

    Forked from MeetKai/functionary

    Chat language model that can use tools and interpret the results

    Python MIT License Updated Oct 25, 2024
  • PaddleX Public

    Forked from PaddlePaddle/PaddleX

    All-in-One Development Tool based on PaddlePaddle(飞桨低代码开发工具)

    Python Apache License 2.0 Updated Oct 25, 2024
  • 中文大模型能力评测榜单:目前已囊括128个大模型,覆盖chatgpt、gpt-4o、谷歌gemini、百度文心一言、阿里通义千问、百川、讯飞星火、商汤senseChat、minimax等商用模型, 以及qwen2.5、llama3.1、glm4、书生internLM2.5、openbuddy、AquilaChat等开源大模型。不仅提供能力评分排行榜,也提供所有模型的原始输出结果!

    Updated Oct 24, 2024