GitHub repository with code for full data analysis