#

large-vision-models

Here are 4 public repositories matching this topic...

Paranioar / Awesome_Matching_Pretraining_Transfering

The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

Updated Dec 15, 2024

PKU-Alignment / safe-sora

SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enhance the helpfulness and harmlessness of Large Vision Models (LVMs).

alignment human-preferences text-to-video-generation large-vision-models

Updated Aug 20, 2024
Python

afondiel / Prompt-Engineering-for-Vision-Models-DeepLearningAI

These notes and resources are compiled from the crash course Prompt Engineering for Vision Models offered by DeepLearning.AI.

computer-vision image-processing cnn video-processing vit generative-models fine-tuning diffusion-models convnets vision-models visual-prompting prompt-engineering vision-language-model large-vision-language-models meta-sam large-vision-models vision-model-prompting

Updated Aug 20, 2024
Jupyter Notebook

Rnamrata / image_enhancement_for_social_robots

Image enhancement using CNN and LVM

tensorflow keras jupyter-notebook python3 convolutional-neural-networks large-vision-models

Updated Dec 6, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the large-vision-models topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the large-vision-models topic, visit your repo's landing page and select "manage topics."