Stars
Developer-friendly, minimalism Cron alternative, but with much more capabilities. It aims to solve greater problems.
Apache DataFusion Comet Spark Accelerator
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source big data platform. This allows you to reduce your focus on und…
YTsaurus is a scalable and fault-tolerant open-source big data platform.
TigerBot: A multi-language multi-task LLM
microsoft / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LMOngoing research training transformer language models at scale, including: BERT & GPT-2
Open standard for machine learning interoperability
A Bridge between SDN and Cloud Native (Project under CNCF)
A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, etc.
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Optimized primitives for collective multi-GPU communication
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
A C vectorized database acceleration library aimed to optimizing query engines and data processing systems.
A Cloud Native Batch System (Project under CNCF)
Kata Containers version 1.x runtime (for version 2.x see https://github.com/kata-containers/kata-containers).
The Moby Project - a collaborative project for the container ecosystem to assemble container-based systems