<?xml version='1.0' encoding='utf-8'?>
<eprints xmlns='http://eprints.org/ep2/data/2.0'>
  <eprint id='https://researchdata.bath.ac.uk/id/eprint/480'>
    <eprintid>480</eprintid>
    <rev_number>27</rev_number>
    <documents>
      <document id='https://researchdata.bath.ac.uk/id/document/8220'>
        <docid>8220</docid>
        <rev_number>2</rev_number>
        <files>
          <file id='https://researchdata.bath.ac.uk/id/file/23242'>
            <fileid>23242</fileid>
            <datasetid>document</datasetid>
            <objectid>8220</objectid>
            <filename>QRESY-PDSY.zip</filename>
            <mime_type>application/zip</mime_type>
            <hash>c3f52841c80a2f6e69847d934a3347cd</hash>
            <hash_type>MD5</hash_type>
            <filesize>163656</filesize>
            <mtime>2018-03-07 10:57:22</mtime>
            <url>https://researchdata.bath.ac.uk/480/1/QRESY-PDSY.zip</url>
          </file>
        </files>
        <eprintid>480</eprintid>
        <pos>1</pos>
        <placement>1</placement>
        <mime_type>application/zip</mime_type>
        <format>other</format>
        <language>en</language>
        <security>public</security>
        <license>cc_by</license>
        <main>QRESY-PDSY.zip</main>
        <content>data</content>
      </document>
    </documents>
    <eprint_status>archive</eprint_status>
    <userid>6484</userid>
    <dir>disk0/00/00/04/80</dir>
    <datestamp>2018-03-27 09:42:34</datestamp>
    <lastmod>2024-07-15 10:58:47</lastmod>
    <status_changed>2018-03-27 09:42:34</status_changed>
    <type>data_collection</type>
    <metadata_visibility>show</metadata_visibility>
    <creators>
      <item>
        <name>
          <family>Herrera Fernandez</family>
          <given>Manuel</given>
        </name>
        <id>amhf20@bath.ac.uk</id>
        <orcid>0000-0001-9662-0017</orcid>
        <affiliation>University of Bath</affiliation>
        <contact>TRUE</contact>
      </item>
      <item>
        <name>
          <family>Ramallo-González</family>
          <given>Alfonso</given>
        </name>
        <id>aprg20@bath.ac.uk</id>
        <orcid>0000-0001-7021-0634</orcid>
        <affiliation>University of Murcia</affiliation>
        <contact>FALSE</contact>
      </item>
      <item>
        <name>
          <family>Eames</family>
          <given>Matt</given>
        </name>
        <id>m.e.eames@exeter.ac.uk</id>
        <orcid>0000-0002-0515-6878</orcid>
        <affiliation>University of Exeter</affiliation>
        <contact>FALSE</contact>
      </item>
      <item>
        <name>
          <family>Ferreira</family>
          <given>Aida A.</given>
        </name>
        <orcid>0000-0002-0322-6801</orcid>
        <affiliation>Pernambuco Federal Institute of Education, Science, and Technology</affiliation>
        <contact>FALSE</contact>
      </item>
      <item>
        <name>
          <family>Coley</family>
          <given>David</given>
        </name>
        <id>D.A.Coley@bath.ac.uk</id>
        <orcid>0000-0001-5744-1809</orcid>
        <affiliation>University of Bath</affiliation>
        <contact>FALSE</contact>
      </item>
    </creators>
    <title>Quantile Regression Ensemble Summer Year (QRESY)</title>
    <divisions>
      <item>dept_civ_eng</item>
    </divisions>
    <abstract>The zip file contain 4 datasets in csv format. Each of them correspond to weather files of one hot summer year hourly data based on the weather observed over 40 (basis) years, 1974 - 2013. Two are the so-called probabilistic design summer years (PDSY) for the cities of London (UK) and Joao Pessoa (Brazil). The PDSY uses an overheating metric that is based on the number of hours in which the temperature is above a certain threshold when a building is occupied. Then, PDSY is created by selecting an entire year which contains the third hottest mean based on this overheating metric. PDSY is currently used in the UK as reference of warm summers. However it is the first time that a PDSY is created for Brazil. The other two weather files correspond to the new quantile ensemble regression summer year (QRESY) also aiming to represent hot summers both for London and Joao Pessoa. QRESY is created by combining observed summer extreme temperatures. This is done by endowing higher weights to quantiles away from the median for ensembles within upper quantiles. At the same time, it increases the importance of quantiles near to the median for combining lower quantiles.</abstract>
    <date>2018-03-21</date>
    <publisher>University of Bath</publisher>
    <full_text_status>public</full_text_status>
    <corp_contributors>
      <item>
        <type>RightsHolder</type>
        <corpname>University of Bath</corpname>
      </item>
    </corp_contributors>
    <funding>
      <item>
        <funder_name>Engineering and Physical Sciences Research Council</funder_name>
        <funder_id>https://doi.org/10.13039/501100000266</funder_id>
        <grant_id>EP/M021890/1</grant_id>
        <project_name>COLBE - The Creation of Localized Current and Future Weather for the Built Environment</project_name>
      </item>
    </funding>
    <collection_method>The Quantile Regression Ensemble Summer Year (QRESY) creation process starts by collecting hourly weather data over a long  period. Typically, weather files attempt to be representative of periods around 20-40 years, and here we do the same, however much longer periods could be used. The existence, variables and quality of hourly weather data varies depending on the location. The variables usually include temperature, atmospheric pressure, cloud cover, wind speed and wind direction, precipitation, etc.</collection_method>
    <provenance>For the QRESY process, preprocessing of this data is required to ensure it contains no long sequences of missing data. If large amount of data are missing in any of the variables, the whole year is removed from the analysis. At this point it is also necessary to decide the target level of extreme weather to work with. That is, to fix the quantile level for the subsequent construction of the Quantile Regression models depending on the distance to the median (quantile 50, Q50). Running a Quantile Regression model for every year under analysis is an ``embarrassingly parallel&apos;&apos; problem, as it is straightforward to separate the problem into a number of parallel tasks and the code run on a parallel machine. The set of regressions is combined in a unique year of hourly data. This is done by endowing higher weights to quantiles away from Q50 for ensembles within upper quantiles. At the same time, it increases the importance of quantiles near to Q50 for combining lower quantiles. The idea being to focus on explaining critical phases of summer temperatures. Each ensemble is thereby made over the predictors of a number of regression models corresponding to each of the years in the database. The ensemble parameters can be tuned by cross-validation over random partitions of the data into training and test summer periods.</provenance>
    <temporal_cover>
      <date_from>1974-01-01</date_from>
      <date_to>2013-12-31</date_to>
    </temporal_cover>
    <geographic_cover>London (UK) and Joao Pessoa (Brazil)</geographic_cover>
    <language>en</language>
    <version>1</version>
    <doi>10.15125/BATH-00480</doi>
    <related_resources>
      <item>
        <link>https://doi.org/10.1016/j.envsoft.2018.03.007</link>
        <type>pub</type>
      </item>
    </related_resources>
    <access_types>
      <item>open</item>
    </access_types>
    <access_arrangements>Data under Licence: Creative Commons Attribution 4.0 International</access_arrangements>
  </eprint>
</eprints>
