
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,024 380 Updated Aug 7, 2024

[ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving"

Python 40 6 Updated Jul 17, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model with performance approaching GPT-4o.

Python 5,958 462 Updated Oct 29, 2024

cuDF - GPU DataFrame Library

C++ 8,426 903 Updated Nov 10, 2024

This repository contains simple but quite fun deep learning projects:)

Jupyter Notebook 3 Updated Oct 18, 2024

Official PyTorch implementation of CutMix regularizer

Python 1,224 159 Updated Sep 16, 2020

ORION: Orientation-boosted Voxel Nets for 3D Object Recognition

MATLAB 113 32 Updated Nov 7, 2017

[NeurIPS 2024] Matryoshka Query Transformer for Large Vision-Language Models

Python 97 11 Updated Jul 1, 2024

An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities

Jupyter Notebook 154 13 Updated Jul 27, 2022

Learning low-shot object classification with explicit shape bias learned from point clouds

Python 45 Updated Dec 8, 2021

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 10,527 1,555 Updated Oct 19, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,314 176 Updated Oct 15, 2024

[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model

Jupyter Notebook 24 Updated Aug 17, 2024

Scripts for fine-tuning Llama2 via SFT and DPO.

Python 179 37 Updated Aug 14, 2023

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Python 65 5 Updated Jan 30, 2024
Python 40 7 Updated Apr 23, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,418 2,919 Updated Sep 2, 2024

Google Gemini AI model w/speech recognition and voice.

Python 23 3 Updated Dec 17, 2023
Python 287 7 Updated Jan 27, 2024

Official GitHub repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"

Python 19 2 Updated Feb 23, 2021
Python 25 6 Updated Jun 19, 2024
Jupyter Notebook 2 Updated Dec 3, 2023
Python 2,829 229 Updated Oct 16, 2024

LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft

Python 39 3 Updated Jul 17, 2024
Python 4 1 Updated Jul 2, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 9,169 818 Updated Aug 7, 2024

An open source implementation of CLIP.

Python 10,264 979 Updated Nov 6, 2024

Android in-app purchases and subscriptions made easy.

Kotlin 258 52 Updated Nov 8, 2024

Oboe is a C++ library that makes it easy to build high-performance audio apps on Android.

C++ 3,714 569 Updated Nov 8, 2024