Stars
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
A topic-centric list of HQ open datasets.
An awesome list of high-quality open datasets in public domains (on-going).
PostgreSQL wire protocol implemented as a rust library.
Automatically format your Python docstrings to conform with PEP 8 and PEP 257
Bayesian optimisation & Reinforcement Learning library developed by Huawei Noah's Ark Lab
Dike is a new benchmark suite for benchmarking distributed transactional databases (DDBMSs), which is extended from the popular TPC-C benchmark.
LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as Delta Lake, Apache Hudi, and Apache Iceberg.
PTSSBench: An Arena for Automatic Configuration Tuning Research on Software Systems
Infrastructure and application test and analysis framework
GPTuner is a manual-reading database tuning system leveraging domain knowlege automatically and extensively to enhance knob tuning process.
cluster data collected from production clusters in Alibaba for cluster management research
RunCVM (Run Container VM) is an experimental open-source Docker container runtime, for launching standard container workloads - as well as Systemd, Docker, even OpenWrt - in VMs using 'docker run`
Fast Static Symbol Table (FSST): efficient random-access string compression
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
Set of macros that guard against buffer overflows. Based on C99 VLA feature.
🥑 Language focused docker images, minus the operating system.
MLOS is a project to enable autotuning for systems.
Supercharge your Vim editor with AI-powered code completion using OpenAI Codex. Boost productivity and save time with intelligent suggestions.
A Toolchain to make Build and Run eBPF programs easier