THIS README IS FOR THE DATA/RAW_DATA DIRECTORY, EACH SUB-DIRECTORY WILL ALSO CONTAIN IT'S OWN README FILE.

THIS DIRECTORY CONTAINS THE CODE AND OUTPUTS FOR THE DATA MANIPULATION OF THE SHAPEFILES, SOCIO-ECONOMIC VARIABLES AND POINT PATTERNS IN PREPARATION FOR THE CREATION OF THE COUNT DATA.
THIS DIRECTORY CONTAINS 3 SUB-DIRECTORIES: ONE FOR THE SHAPEFILES, THE SOCIO-ECONOMIC VARIABLES AND THE POINT PATTERN DATA.

THE RAW AND MANIPULATED DATA FILES ARE NOT CONTAINED WITHIN THESE ARCHIVED FOLDERS BUT THE NECESSARY RAW DATA FOR EACH SUB-DIRECTORY CAN BE ACCESSED THROUGH AS DETAILED IN DataAccessInformation.pdf AND DISCUSSED IN APPENDIX F OF MY THESIS. ALSO THE MANIPULATED DATA CAN BE CREATED THROUGH THE R FILES IN EACH RELEVANT SUB-DIRECTORY. WHILE WE DO NOT HAVE THE FILES ARCHIVED, WE  DISCUSS THE NAMING CONVENTIONS BELOW AS THEY ARE USED WITHIN THE R SCRIPTS FOR EACH SUB-DIRECTORY.

- CRIME: this sub-directory contains the raw point pattern crime data for the cities as well as the code to extract the crimes of interest only, homicide and motor vehicle thefts. These outputs are also saved in the DATA/PROCESSED_DATA/CRIME/POINT_PATTERN directory for transforming into count data, whether over census tracts or discretisation grids.

- COVARIATES: this contains the raw socio-economic data on the census tracts for the necessary counties that contain the cities of interest as well as the code to extract the necessary census tract values and interpolate missing values as required. Each data set for the variables are saved into one of the sub-directories for each city. These outputs are also saved in the DATA/PROCESSED_DATA/COVARIATES directory, for use in the creation of the crime aggregated count data sets, whether over the census tracts or over discretisation grids in the DATA/PROCESSED_DATA/CRIME directory, as well as for the Ripley's K estimation in the DATA/MODELS/GLMS directory.

- SHAPEFILES: this contains two sub-directories one contains the shapefiles for the city boundaries while the other contains the census tract boundaries for the states that the cities of interest lie in. The latter also contains the required city census tracts extracted as well as their neighbourhood matrices. These outputs are also saved in the DATA/PROCESSED_DATA/SHAPEFILES directory. These are used to create the aggregated count data, over the census tracts and grids, in the DATA/PROCESSED_DATA/CRIME directory. These shapefiles are also used for plotting in the DATA/EDA and DATA/MODELS directories and for the creation of the Los Angeles window in the GRID_MESH/IRREGULAR_POLYGON_LGCP directory.
