The zip file contains a set of top level data and source code files, a further files arranged into 5 directories. The 'analysis_datasets' directory contains the main dataset used for the paper. It also includes the machine learning algorithm code (random forest in this case), its associated data and a document showing a selected feature set. The 'gt', 'sd', and 'tf' directories contain raw data collected by one of participants. The 'set_viz' directory contains visualisation code as well as knapsack (cost–benefit) optimisation code for the selection of minimal sensor sets subject to the budget.