willy808

Wei-Hsin Chen willy808

master:NYCU Photonic & AI Bachelor:NCU EE

1 follower · 14 following

in/wei-hsin-chen-8b0488196

Stars

apple / ml-sigmoid-attention

Python 155 6 Updated Sep 9, 2024

BlinkDL / RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,376 838 Updated Sep 4, 2024

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,288 232 Updated Sep 13, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,370 1,193 Updated Aug 21, 2024

microsoft / Industrial-Foundation-Models

Dedicated to building industrial foundation models for universal data intelligence across industries.

Python 22 1 Updated Aug 19, 2024

NVlabs / EAGLE

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Python 389 24 Updated Sep 10, 2024

showlab / Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 777 36 Updated Sep 12, 2024

NVIDIA / gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C 852 143 Updated Jul 8, 2024

idiap / multitask_asr_and_scd

Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers

2 Updated Jan 15, 2024

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 5,898 753 Updated Sep 11, 2024

DmitryRyumin / INTERSPEECH-2023-24-Papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…

629 42 Updated Aug 9, 2024

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,752 1,046 Updated Aug 15, 2024

Picovoice / picovoice

On-device voice assistant platform powered by deep learning

Python 564 109 Updated Sep 5, 2024

open-webui / open-webui

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Svelte 38,914 4,535 Updated Sep 13, 2024

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,162 376 Updated Sep 13, 2024

yeyupiaoling / Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 807 128 Updated Jul 18, 2024

microsoft / semantic-kernel

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 21,310 3,133 Updated Sep 13, 2024

fal-ai / fal

⚡ Fastest way to serve open source ML models to millions

Python 498 43 Updated Sep 11, 2024

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 13,468 949 Updated Sep 13, 2024

facebookresearch / segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,674 857 Updated Aug 21, 2024

rapidsai / kvikio

KvikIO - High Performance File IO

Python 148 54 Updated Sep 12, 2024

mlfoundations / dclm

DataComp for Language Models

HTML 1,103 96 Updated Sep 5, 2024

muzishen / IMAGDressing

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

Python 950 81 Updated Aug 28, 2024

CharlesGong12 / RECE

[ECCV 2024] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models

Jupyter Notebook 44 6 Updated Aug 25, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,475 425 Updated Sep 10, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 13,372 1,216 Updated Sep 13, 2024

cozodb / openai-multi-client

Making your requests to the OpenAI API go fast!

Python 104 10 Updated Jan 11, 2024

DioxusLabs / dioxus

Fullstack app framework for web, desktop, mobile, and more.

Rust 20,206 774 Updated Sep 13, 2024

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 17,101 1,607 Updated Sep 13, 2024

FasterDecoding / Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,204 150 Updated Jun 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly