etl

EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Google Cloud Platform, AWS, Kubernetes, Databases, SFTP servers, On-Prem Systems and more.

redis bigquery aws scala spark etl s3 gcp gcs zio etl-framework dataproc etl-pipeline

Updated Aug 26, 2024
Scala

yamrcraft / etl-light

Star

A light Kafka to HDFS/S3 ETL library based on Apache Spark

docker scala kafka spark protobuf avro etl job s3 batch hdfs parquet

Updated Jun 29, 2017
Scala

mayur2810 / sope

Star

Apache Spark ETL Utilities

yaml framework scala spark etl dsl transformer spark-sql

Updated Oct 23, 2024
Scala

SharpData / SharpETL

Star

Write ETL using your favorite SQL dialects

scala sql spark hive etl bigdata data-warehouse flink datawarehouse spark-sql etl-framework flink-sql paimon

Updated Jan 7, 2024
Scala

bebee4java / ides

Star

智能数据探索服务(Intelligent Data Exploration Service)，一站式Data AI数据解决方案！

data-science data sql ai spark etl bigdata ml stream-processing olap daas data-analysis batch-processing ides datalink

Updated Jul 10, 2023
Scala

shouweikun / estuary

Star

基于Akka实现的数据实时流式同步的应用,支持高可用

scala kafka akka etl canal

Updated Apr 25, 2019
Scala

vesoft-inc / nebula-exchange

Star

NebulaGraph Exchange is an Apache Spark application to parse data from different sources to NebulaGraph in a distributed environment. It supports both batch and streaming data in various formats and sources including other Graph Databases, RDBMS, Data warehouses, NoSQL, Message Bus, File systems, etc.

spark etl data-import graph-database hacktoberfest data-pipeline nebulagraph

Updated Jul 17, 2024
Scala

sparsecode / DaFlow

Star

Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.

json scala csv apache-spark hive hadoop avro etl parquet transformation-rules etl-framework etl-pipeline join-data

Updated Jun 7, 2021
Scala

Guidewire / cda-client

Star

Cloud Data Access client

kafka spark etl parquet

Updated Dec 28, 2022
Scala

Improve this page

Add a description, image, and links to the etl topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the etl topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

etl

Here are 73 public repositories matching this topic...

YotpoLtd / metorikku

nightscape / spark-excel

SETL-Framework / setl

51zero / eel-sdk

TianLangStudio / DataXServer

AbsaOSS / cobrix

datainsider-co / rocket-bi

dimajix / flowman

galliaproject / gallia-core

zhaoyachao / zdh_server

starlake-ai / starlake

tharwaninitin / etlflow

yamrcraft / etl-light

mayur2810 / sope

SharpData / SharpETL

bebee4java / ides

shouweikun / estuary

vesoft-inc / nebula-exchange

sparsecode / DaFlow

Guidewire / cda-client

Improve this page

Add this topic to your repo