View m-bain's full-sized avatar

Multimodal language model benchmark, featuring challenging examples

Python 139 6 Updated May 14, 2024
Python 254 7 Updated Jan 27, 2024

Structured Text Generation

Python 7,219 372 Updated Jul 18, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,360 432 Updated May 3, 2024

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

C 7,763 284 Updated Jun 28, 2024

LLM training code for Databricks foundation models

Python 3,863 507 Updated Jul 18, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,224 1,987 Updated Jul 14, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,027 126 Updated Jul 15, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 1,876 145 Updated May 23, 2024

MeetEval - A meeting transcription evaluation toolkit

Python 69 13 Updated Jul 15, 2024

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code …

616 41 Updated May 18, 2024
Python 11 2 Updated Jun 14, 2024

Tools for handling speech data in machine learning projects.

Python 905 205 Updated Jul 17, 2024

Easily create large video datasets from video URLs

Python 505 59 Updated Jul 18, 2024

Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets

Python 10 1 Updated May 25, 2023

String-to-String Algorithms for Natural Language Processing

Jupyter Notebook 510 24 Updated May 24, 2023

ImageBind: One Embedding Space to Bind Them All

Python 8,094 738 Updated Jul 10, 2024

the subtitle editor :)

C# 7,610 858 Updated Jul 17, 2024

Simple Diarization model

Python 37 3 Updated Nov 29, 2023
Python 14 1 Updated Sep 25, 2023

Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva

Python 78 23 Updated Jun 13, 2024

Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens

Python 393 14 Updated Nov 6, 2023

GPT4All: Chat with Local LLMs on Any Device

C 67,517 7,426 Updated Jul 18, 2024

[CVPR'23 Highlight] AutoAD: Movie Description in Context.

Python 85 Updated May 28, 2024

A database of movie scripts from several sources

Python 144 24 Updated May 3, 2024

Inference code for Llama models

Python 54,272 9,331 Updated Jul 13, 2024

GPU tester that detects broken and slow GPUs in a cluster

Python 63 6 Updated Feb 19, 2023

Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch

Python 66 15 Updated Sep 27, 2021

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,250 916 Updated Jul 17, 2024

AWS ParallelCluster is an AWS supported Open Source cluster management tool to deploy and manage HPC clusters in the AWS cloud.

Python 818 309 Updated Jul 18, 2024