Highlights
- Pro
Lists (5)
Sort Name ascending (A-Z)
Stars
Accelerator of Scientific Development and Research. A project template developed by XCCV group of cvpaper.challenge.
GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)
LlamaIndex is a data framework for your LLM applications
LAVIS - A One-stop Library for Language-Vision Intelligence
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
pdb , a drop-in replacement for pdb (the Python debugger)
A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!
NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Code for ALBEF: a new vision-language pre-training method
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiase…
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
Code and Experiments for ACL-IJCNLP 2021 Paper "Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering."
image scene graph generation benchmark
Introduction to mathematical writing for undergraduate students majoring in engineering
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A PyTorch reimplementation of bottom-up-attention models
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Rich is a Python library for rich text and beautiful formatting in the terminal.
An Open-Source Package for Knowledge Embedding (KE)
Code for 'Low-Shot Learning with Imprinted Weights'