Skip to content

scrapes, cleans, transforms, and searches United Nations treaty data

Notifications You must be signed in to change notification settings

zmjones/untreaties

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

91 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repository contains a set of scripts that allow the retrieval of the United Nations' Treaty Collection, and transformation of the retrieved data. Please open an issue if you find any errors or would like to suggest a feature. Pull requests are welcome!

Getting the data

You can clone (see below) or download the data contained in this repository or scrape it using un_data.py, which requires pandas, requests, and beautifulsoup4. These dependencies are listed in requirements.txt. The script will generate index.csv along with a folder containing the treaty data itself (treaties) and a folder containing declarations made by participants (contained in declarations).

git clone [email protected]:zmjones/untreaties.git
cd untreaties
pip install -r requirements.txt
python un_data.py

Transforming the data

The R script utilities.R contains a number of functions that make working with the raw data easier. You can load these functions by simply sourcing the file. It requires stringr, lubridate, and plyr. The loadData function allows the user to load a specific treaty given its chapter and treaty numbers, which are passed as strings. You can optionally expand the column names (if needed). If you choose to expand the column names you can also transform the data into a country-year format, given a start year and an end year (both passed as strings).

source("utilities.R")
loadData(chap = "10", treaty = "2")
loadData(chap = "10", treaty = "2", expand = TRUE)
loadData(chap = "10", treaty = "2", expand = TRUE, panel = TRUE, syear = "1945", eyear = "2013")

About

scrapes, cleans, transforms, and searches United Nations treaty data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published