Vision-Augmented Retrieval and Generation (VARAG) - a vision-first RAG engine.
Official code release for ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity (published at ICLR 2022)
This repository contains the dataset and source files to reproduce the results of Müller-Budack et al. (2021), "Multimodal news analytics using measures of cross-modal entity and context consistency", International Journal on Multimedia Information Retrieval (IJMIR), Vol. 10, Art. no. 2, 2021.
Explores early fusion and late fusion approaches for multimodal medical image retrieval.
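The distinction between the two strategies is easy to show in a few lines. The following is a minimal numpy sketch, not code from the repository: early fusion concatenates per-modality features into one vector before scoring, while late fusion scores each modality separately and mixes the scores; the feature sizes and the weight alpha are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)

# Toy features: one query and five candidates, each with an image and a text vector.
q_img, q_txt = rng.normal(size=128), rng.normal(size=128)
c_img, c_txt = rng.normal(size=(5, 128)), rng.normal(size=(5, 128))

def l2norm(x):
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

# Early fusion: merge modality features into one vector, then score once.
q_early = l2norm(np.concatenate([q_img, q_txt]))
c_early = l2norm(np.concatenate([c_img, c_txt], axis=1))
early_scores = c_early @ q_early

# Late fusion: score each modality separately, then combine the scores.
alpha = 0.5  # modality weight; a tunable hyperparameter
late_scores = (alpha * (l2norm(c_img) @ l2norm(q_img))
               + (1 - alpha) * (l2norm(c_txt) @ l2norm(q_txt)))

print("early-fusion ranking:", np.argsort(-early_scores))
print("late-fusion ranking:", np.argsort(-late_scores))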
Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original code and model can be accessed at FlagEmbedding.
Formalizing Multimedia Recommendation through Multimodal Deep Learning, accepted to ACM Transactions on Recommender Systems.
Multimodal retrieval in art with context embeddings.
A list of research papers on knowledge-enhanced multimodal learning
A generalized self-supervised training paradigm for unimodal and multimodal alignment and fusion.
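Alignment objectives of this kind are often instantiated as a symmetric InfoNCE (contrastive) loss over paired embeddings. The numpy sketch below shows that generic formulation for context only; it is an illustrative assumption, not the specific training paradigm proposed by the repository.

import numpy as np

def log_softmax(x):
    x = x - x.max(axis=1, keepdims=True)  # shift for numerical stability
    return x - np.log(np.exp(x).sum(axis=1, keepdims=True))

def info_nce(a, b, temperature=0.07):
    """Symmetric InfoNCE loss; a[i] and b[i] form a positive pair."""
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    logits = a @ b.T / temperature         # (batch, batch) similarity matrix
    diag = np.arange(len(a))               # positives sit on the diagonal
    loss_ab = -log_softmax(logits)[diag, diag].mean()    # a -> b direction
    loss_ba = -log_softmax(logits.T)[diag, diag].mean()  # b -> a direction
    return (loss_ab + loss_ba) / 2

rng = np.random.default_rng(0)
img = rng.normal(size=(8, 64))
txt = img + 0.1 * rng.normal(size=(8, 64))  # noisy paired view of the same items
print("alignment loss:", info_nce(img, txt))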
Mini-batch selective sampling for knowledge adaptation of VLMs for mammography.
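As a rough illustration of the selective-sampling idea, the sketch below scores each example in a mini-batch and keeps only the top fraction for the adaptation update; the highest-loss criterion and the keep_ratio value are placeholder assumptions, not the repository's actual selection rule.

import numpy as np

def select_in_batch(per_example_loss, keep_ratio=0.5):
    """Return indices of the highest-loss fraction of a mini-batch."""
    k = max(1, int(len(per_example_loss) * keep_ratio))
    return np.argsort(-per_example_loss)[:k]

rng = np.random.default_rng(0)
losses = rng.random(16)                    # stand-in for per-example VLM losses
idx = select_in_batch(losses)
print("selected examples:", np.sort(idx))
print("mean loss over selection:", losses[idx].mean())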