Stars
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Pushing the Limits of Zero-shot End-to-End Speech Translation
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
Data and tools for generating and inspecting OLMo pre-training data.
A Framework of Small-scale Large Multimodal Models
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
This is the official repo for Towards Uncertainty-Aware Language Agent.
[ICML2024] Adaptive Text Watermark for Large Language Models
Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"
An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.
A curated list for Efficient Large Language Models
[Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
A modular graph-based Retrieval-Augmented Generation (RAG) system
GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app.
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"
This is a repository with the code for the EMNLP 2020 paper "Information-Theoretic Probing with Minimum Description Length"
(IJCV2024 & ICCV2023) LSKNet: A Foundation Lightweight Backbone for Remote Sensing
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"
PLLaMA: an Open-source Large Language Model for Plants