Skip to content
View favkiet's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report favkiet

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.

Jupyter Notebook 195 32 Updated Sep 12, 2022

An open-source RAG-based tool for chatting with your documents.

Python 10,522 734 Updated Sep 6, 2024

The repository provides code for training/fine tune the Meta Segment Anything Model 2 (SAM 2)

Jupyter Notebook 69 8 Updated Sep 5, 2024

Hands-On Genetic Algorithms with Python, Published by Packt

Python 261 139 Updated Jan 30, 2023

A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!

Python 541 112 Updated Apr 16, 2022

đź’» A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline

Python 53,674 3,889 Updated Sep 6, 2024

MLOps for deploying a Credit Risk model

HTML 25 7 Updated Jun 21, 2023

Design/Implement stream/batch architecture on NYC taxi data | #DE

Scala 26 10 Updated Apr 29, 2021

Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO

Python 55 23 Updated Jul 21, 2023

GenAI Cookbook

Jupyter Notebook 273 63 Updated Sep 6, 2024

U-Net implementation in PyTorch for FLAIR abnormality segmentation in brain MRI

Python 706 186 Updated Mar 24, 2023

Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.

Jupyter Notebook 10,094 2,987 Updated Sep 6, 2024

This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated graph based algorithm to handle the tasks.

Jupyter Notebook 347 39 Updated Aug 29, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 5,514 492 Updated Sep 6, 2024

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All comp…

Python 174 70 Updated Oct 5, 2023

This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessary infrastructure components, including Apache Flink, Elastic…

Java 32 15 Updated Dec 4, 2023

The open-source tool for building high-quality datasets and computer vision models

Python 8,059 537 Updated Sep 6, 2024

real time face swap and one-click video deepfake with only a single image

Python 33,595 4,724 Updated Sep 5, 2024

High-quality datasets, tools, and concepts for LLM fine-tuning.

1,627 147 Updated Aug 18, 2024

Always know what to expect from your data.

Python 9,800 1,513 Updated Sep 6, 2024

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 2,450 204 Updated Sep 5, 2024

Let's build our own scalable chatbots for the Scalable Machine Learning and Deep Learning course at KTH!!

Jupyter Notebook 6 1 Updated Feb 23, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,408 1,663 Updated Sep 6, 2024

This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper Code Demo]

Python 622 58 Updated Jun 24, 2024

A curated list of awesome open-source libraries for production LLM

311 21 Updated Sep 2, 2024

An Awesome List of Open-Source Data Engineering Projects

1,940 318 Updated Jun 19, 2024

MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.

Go 46,552 5,394 Updated Sep 6, 2024
Next