Skip to content
View datasherlock's full-sized avatar
The Light Behind Your Cloud
The Light Behind Your Cloud

Organizations

@GoogleCloudPlatform @googlers

Block or report datasherlock

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is a repo with links to everything you'd ever want to learn about data engineering

11,250 1,573 Updated Nov 6, 2024

A technical explainer by @kognise of how your computer runs programs, from start to finish.

MDX 4,978 156 Updated Jun 15, 2024

A collective list of free APIs

Python 316,835 33,775 Updated Oct 31, 2024

Spark: The Definitive Guide's Code Repository

Scala 2,869 2,766 Updated Aug 26, 2020

Implementing best practices for PySpark ETL jobs and applications.

Python 1,681 709 Updated Jan 1, 2023

This blog explains a solution architecture to handle fast changing reference data stored in DynamoDB through an AWS Glue Streaming job

Python 1 Updated Jan 24, 2022

pipeline for migrating lichess data into postgresql

Python 209 9 Updated Nov 12, 2021

🎓 A collection of interactive courses for the swirl R package.

R 4,321 7,241 Updated Jan 10, 2024

The Leek group guide to data sharing

6,535 243,479 Updated Aug 7, 2024