Stars
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Robust Speech Recognition via Large-Scale Weak Supervision
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features 🔥
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Obsei is a low-code, AI-powered automation tool. It can be used in various business flows such as social listening, AI-based alerting, brand image analysis, comparative study, and more.
Source code for Twitter's Recommendation Algorithm
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
StyleGAN - Official TensorFlow Implementation
The easiest way to serve AI apps and models - Build reliable Inference APIs, LLM apps, Multi-model chains, RAG service, and much more!
Literature references for “Designing Data-Intensive Applications”
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS media server and media proxy that lets you read, publish, proxy, record, and play back video and audio streams.
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Composable transformations of Python NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Code for Text2Human (SIGGRAPH 2022). Paper: Text2Human: Text-Driven Controllable Human Image Generation
Implementation of the paper "You Only Learn One Representation: Unified Network for Multiple Tasks" (https://arxiv.org/abs/2105.04206)
Official PyTorch implementation of "VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization" (CVPR 2021)
Describes the full end-to-end smart parking application that is available with DeepStream 5.0
Real-time face swap for PC streaming or video calls
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
🔥🔥🔥🔥 (Earlier named YOLOv7; not the official one) YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥
This patch removes the restriction on the maximum number of simultaneous NVENC video encoding sessions that Nvidia imposes on consumer-grade GPUs.
The Triton Inference Server provides an optimized cloud and edge inferencing solution.