Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 10,527 1,555 Updated Oct 19, 2024

X-PLUG / mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,314 176 Updated Oct 15, 2024

findalexli / mllm-dpo

[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model

Jupyter Notebook 24 Updated Aug 17, 2024

mzbac / llama2-fine-tune

Scripts for fine-tuning Llama2 via SFT and DPO.

Python 179 37 Updated Aug 14, 2023

opendatalab / HA-DPO

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Python 65 5 Updated Jan 30, 2024

nyunAI / Faster-LLM-Survey

Python 40 7 Updated Apr 23, 2024

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,418 2,919 Updated Sep 2, 2024

mytechnotalent / Gemini

Google Gemini AI model w/speech recognition and voice.

Python 23 3 Updated Dec 17, 2023

tsb0601 / MMVP

Python 287 7 Updated Jan 27, 2024

GuyTevet / diversity-eval

Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"

Python 19 2 Updated Feb 23, 2021

HadiZayer / eyenerf

Python 25 6 Updated Jun 19, 2024

palchenli / VL-Instruction-Tuning

84 3 Updated Nov 25, 2023

geuk-hub / -Dacon-Multimodal-vqa

Jupyter Notebook 2 Updated Dec 3, 2023

LLaVA-VL / LLaVA-NeXT

Python 2,829 229 Updated Oct 16, 2024

chuangchuangtan / LLaVA-NeXT-Image-Llama3-Lora

LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft

Python 39 3 Updated Jul 17, 2024

teddysum / Korean_DCS_2024

Python 4 1 Updated Jul 2, 2024

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 9,169 818 Updated Aug 7, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 10,264 979 Updated Nov 6, 2024

RevenueCat / purchases-android

Android in-app purchases and subscriptions made easy.

Kotlin 258 52 Updated Nov 8, 2024

google / oboe

Oboe is a C library that makes it easy to build high-performance audio apps on Android.

C 3,714 569 Updated Nov 8, 2024

JEONG JIHYEOK jhCOR

Highlights

Lists (3)

[ALL] Remarkable research

[Audio model Implementation]

[Framework/Toolkit]

Stars