ildoonet (🎯 Focusing)

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Python 336 29 Updated Apr 23, 2024

📰 Must-read papers and blogs on LLM-based Long Context Modeling 🔥

861 32 Updated Sep 27, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,659 1,032 Updated Sep 10, 2024

PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)

Python 272 23 Updated May 4, 2024

Gemma 2B with 10M context length using Infini-attention.

Python 941 58 Updated May 12, 2024

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Python 492 56 Updated Mar 14, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,324 84 Updated Sep 23, 2024

Jupyter Notebook 2 1 Updated Dec 19, 2023

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 663 58 Updated Apr 7, 2023

A collection of awesome papers on RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,165 83 Updated Aug 20, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 18,267 1,853 Updated Sep 30, 2024

Finetune llama2-70b and codellama on MacBook Air without quantization

Python 445 33 Updated Mar 28, 2024

Python bindings for llama.cpp

Python 7,811 934 Updated Sep 29, 2024

A list of resources for hacking on the Rabbit r1

73 8 Updated Aug 22, 2024

Speeds up long-context LLM inference by computing attention with approximate, dynamic sparsity, which reduces pre-filling latency by up to 10x on an A100 while maintaining accuracy.

Python 712 27 Updated Sep 25, 2024

An AI search engine inspired by Perplexity

TypeScript 891 122 Updated Jun 24, 2024

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Python 1,156 86 Updated Sep 30, 2024

☁️ Build multimodal AI applications with cloud-native stack

Python 20,987 2,216 Updated Sep 26, 2024

Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.

TypeScript 13,667 1,299 Updated Sep 26, 2024

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,522 271 Updated Sep 30, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 4,048 370 Updated Sep 29, 2024

Run python and pygame code in your html

45 18 Updated Sep 28, 2024

🔍 AI search engine - self-host with local or cloud LLMs

TypeScript 2,631 233 Updated Sep 27, 2024

Easily train a good VC model with voice data <= 10 mins!

Python 23,394 3,489 Updated Sep 5, 2024

Random lyrics generation using IU's song lyrics

Python 9 Updated Nov 12, 2020

pip-installable binaries (wheels) for the extended version of the Hugo static site generator (note: unofficial, community-maintained)

Python 13 1 Updated Sep 30, 2024

LLM-based autonomous agent that does comprehensive online research on any given topic

Python 14,232 1,853 Updated Sep 29, 2024

Web-based SQLite database browser written in Python

Python 3,358 331 Updated Jul 31, 2024

HTTP reverse proxy designed to facilitate secure access to HTTP services located within an internal network

Python 3 Updated Jan 28, 2024