Stars
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…
BEVDet implemented with TensorRT and C++; achieving real-time performance on Orin
[PyTorch Impl.] SBCFormer: Lightweight Network Capable of Full-size ImageNet Classification at 1 FPS on Single Board Computers - WACV 2024 - Official Code
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
Industry leading face manipulation platform
[ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
Python script that automatically detects vehicles at night and switches the high beam accordingly.
A project demonstrating how adaptive LED high-beam technology would work.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
Open-source tools for speaker recognition
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
[CVPR 2024] PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
This repository is an official implementation of HVDetFusion
Dense Distinct Query for End-to-End Object Detection (CVPR2023)
A project demonstrating Lidar related AI solutions, including three GPU accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libs (cuPCL, 3D SparseConvolution…
[ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
SuperCLUE: A Comprehensive Benchmark for Chinese General-Purpose Foundation Models
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
CVPR2023-Occupancy-Prediction-Challenge