Performance Observability for Apache Spark
Updated Oct 28, 2024 - TypeScript
🔍 Data pipeline for crawling PDFs from the web and transforming their contents into structured data using AWS Textract. Built with the AWS CDK in TypeScript.
Jayvee is a domain-specific language and runtime for automated processing of data pipelines
⚡️ Next-generation data transformation framework for TypeScript that puts developer experience first
📺 Instill Console for 🔮 Instill Core: https://github.com/instill-ai/instill-core
Aqueduct Core is responsible for the core functionality of Aqueduct, an experiment management system.
Open-Source Generative AI for Data Engineering
Sync your team's data to your LLM applications in real-time
Watchmen Platform is a low-code data platform for data pipelines, metadata management, analysis, indicator objective analysis, and quality management
An extensible pipelining tool to build data pipelines from your bank account to any destination.
Create database-agnostic aggregations based on data pipelines