koolay

Stars

bigdata

33 repositories

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript · 5,558 stars · 1,050 forks · Updated Nov 17, 2024

Visually explore, understand, and present your data.

TypeScript · 6,403 stars · 526 forks · Updated Nov 13, 2024

A data visualization and analytics component, especially well-suited for large and/or streaming datasets.

C · 8,526 stars · 1,183 forks · Updated Nov 16, 2024

Use SQL to build ELT pipelines on a data lakehouse.

JavaScript · 285 stars · 28 forks · Updated May 25, 2022

⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

Python · 1,912 stars · 209 forks · Updated Nov 14, 2024

Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!

Scala · 213 stars · 19 forks · Updated Nov 15, 2024

The Universal Storage Engine

C · 1,866 stars · 185 forks · Updated Nov 17, 2024

re_data - fix data issues before your users & CEO would discover them 😊

HTML · 1,552 stars · 121 forks · Updated Apr 30, 2024

An analytics database that puts JSON and relational tables on equal footing

Go · 1,392 stars · 66 forks · Updated Nov 17, 2024

World's fastest log analysis: λ SQL JSON S3

Go · 1,018 stars · 42 forks · Updated Jan 7, 2024

A blazing fast tool for building data pipelines: read, process and output events. Our community: https://t.me/file_d_community

Go · 350 stars · 79 forks · Updated Nov 15, 2024

Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.

TypeScript · 1,955 stars · 238 forks · Updated Nov 15, 2024

Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang wit…

JavaScript · 215 stars · 31 forks · Updated Sep 10, 2024

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Go · 5,781 stars · 659 forks · Updated Nov 15, 2024

🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.

Python · 21,966 stars · 1,317 forks · Updated Nov 17, 2024

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such …

JavaScript · 788 stars · 52 forks · Updated Aug 10, 2022

An orchestration platform for the development, production, and observation of data assets.

Python · 11,712 stars · 1,479 forks · Updated Nov 17, 2024

Enso Analytics is a self-service data prep and analysis platform designed for data teams.

Scala · 7,379 stars · 323 forks · Updated Nov 17, 2024

🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications

Makefile · 2,146 stars · 107 forks · Updated Nov 14, 2024

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python · 7,955 stars · 771 forks · Updated Nov 17, 2024

YTsaurus is a scalable and fault-tolerant open-source big data platform.

C · 1,883 stars · 136 forks · Updated Nov 17, 2024

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

Python · 1,846 stars · 166 forks · Updated Nov 15, 2024

Modern, open-source event-processing

Rust · 351 stars · 15 forks · Updated Nov 10, 2023

Efficient data transformation and modeling framework that is backwards compatible with dbt.

Python · 1,819 stars · 160 forks · Updated Nov 17, 2024

Dozer is a real-time data movement tool that leverages CDC from various sources and moves data into various sinks.

Rust · 1,513 stars · 124 forks · Updated Jun 18, 2024

Build platforms that flexibly mix SQL, batch, and stream processing paradigms

Go · 719 stars · 52 forks · Updated Oct 17, 2024

MySQL replication topology management and HA

Go · 5,637 stars · 933 forks · Updated Jul 12, 2024

Self-serve BI to 10x your data team ⚡️

TypeScript · 3,989 stars · 426 forks · Updated Nov 15, 2024

A modern, scalable analytics system

HCL · 1,384 stars · 60 forks · Updated Nov 15, 2024

Data API Framework for AI Agents and Data Apps

TypeScript · 644 stars · 28 forks · Updated Jul 1, 2024