Stars
Deep learning for image processing, including classification, object detection, etc.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Official repository for VIGOR: Cross-View Image Geo-localization beyond One-to-one Retrieval
ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image Recognition
Framework that loads lidar point clouds and converts them into a Bird's Eye View RGB image
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
"Let's Look Down from the Air: Matching of Colored Point Cloud and Satellite Image for Cross-View Pose Estimation"
Image-to-image translation with conditional adversarial nets
semantic point cloud, localization, map matching, semantic point cloud map, image segmentation, image detection
Attention-Guided Version of 2D UNet for Automatic Brain Tumor Segmentation
The codebase of "GLOTS: Rethinking Transformers for Semantic Segmentation of Remote Sensing Images"
The codes for the work "Hybrid Shunted Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation"
An efficient 2-D semantic transformer model for semantic segmentation on aerial images.
We propose a novel fusion strategy that can effectively fuse information from different modality combinations. We also propose a new model named Multi-Modal Segmentation TransFormer (MMSFormer) tha…
PyTorch implementation of "Segmenter: Transformer for Semantic Segmentation" Strudel et al. (2021)
[CoRL2022] CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers
[CVPR22] Official Implementation of DAFormer: Improving Network Architectures and Training Strategies for Domain-Adaptive Semantic Segmentation
Official PyTorch implementation of Superpoint Transformer introduced in [ICCV'23] "Efficient 3D Semantic Segmentation with Superpoint Transformer" and SuperCluster introduced in [3DV'24 Oral] "Scal…
[NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation
This repository provides inference code to compute canopy height maps from aerial images, as described in the paper "Very high resolution canopy height maps from RGB imagery using self-supervised v…
CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).
Repository for the paper "Multimodal Detection of Unknown Objects on Roads for Autonomous Driving", presented at IEEE SMC
CEAM-YOLOv7: Improved YOLOv7 Based on Channel Expansion and Attention Mechanism for Driver Distraction Behavior Detection
MinkLoc++: Lidar and Monocular Image Fusion for Place Recognition
[ACM Multimedia 2020] University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization 🚁 The dataset annotates 1652 buildings in 72 universities around the world.
Transforming multiple vehicle-mounted camera images into a bird’s-eye view (BEV), this project employs deep learning, including a U-Net backbone for semantic segmentation and a spatial transformer …