An interactive data visualization of where open source comes from

davidfischer/gdc2

Where does open source come from?

Sometimes a picture is worth a thousand words.

This interactive visualization of the locations of contributors to the top repositories on GitHub was my entry to the GitHub Data Challenge.

Developing

The provided Makefile has everything needed to fetch data from GitHub Archive, load it into a database, geocode locations, and output the resulting JSON. It can even push the results to GitHub Pages.

There are also some interesting queries for doing things like fetching the top 200 repositories, gathering all the relevant events, or getting the unique set of locations to geocode.

# This first step takes hours and downloads ~6.5GB
make githubarchive

# This step takes a few hours with decently fast disks
make loaddb

# This step geocodes any locations not already geocoded (`locations.json`)
make geocode

# Generate the `events.json` file used for the visualization
make jsonify
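
The geocode step only looks up locations that are not already geocoded, so it is cheap to re-run. The sketch below shows one way that kind of incremental caching could work. It is not the code behind `make geocode`: the function name `geocode_missing`, the cache layout (a JSON object mapping each location string to `[lat, lon]` or `null`), and the `geocoder` callable are all assumptions made for illustration.

```python
import json
from pathlib import Path
from typing import Callable, Dict, Iterable, List, Optional, Tuple

# Hypothetical geocoder signature: takes a location string and returns
# (latitude, longitude), or None when the location cannot be resolved.
Geocoder = Callable[[str], Optional[Tuple[float, float]]]


def geocode_missing(
    locations: Iterable[str],
    cache_path: str,
    geocoder: Geocoder,
) -> Dict[str, Optional[List[float]]]:
    """Geocode only the locations that are not in the cache file yet.

    The cache layout is an assumption for this sketch, not necessarily
    the layout of the project's own `locations.json`.
    """
    cache_file = Path(cache_path)
    if cache_file.exists():
        cache = json.loads(cache_file.read_text(encoding="utf-8"))
    else:
        cache = {}

    for name in locations:
        key = name.strip()
        # Skip blank entries and anything that was looked up before,
        # including earlier failures stored as null.
        if not key or key in cache:
            continue
        result = geocoder(key)
        # Store coordinates as a list so the in-memory value matches
        # what comes back after a JSON round trip.
        cache[key] = list(result) if result is not None else None

    cache_file.write_text(
        json.dumps(cache, indent=2, sort_keys=True), encoding="utf-8"
    )
    return cache
```

Recording failed lookups as `null` means an unresolvable location is not retried on every run. Whether that is desirable depends on the geocoding service being used.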

Credits
