<?xml version='1.0' encoding='utf-8'?>
<eprints xmlns='http://eprints.org/ep2/data/2.0'>
  <eprint id='https://researchdata.bath.ac.uk/id/eprint/1148'>
    <eprintid>1148</eprintid>
    <rev_number>46</rev_number>
    <documents>
      <document id='https://researchdata.bath.ac.uk/id/document/16045'>
        <docid>16045</docid>
        <rev_number>3</rev_number>
        <files>
          <file id='https://researchdata.bath.ac.uk/id/file/53283'>
            <fileid>53283</fileid>
            <datasetid>document</datasetid>
            <objectid>16045</objectid>
            <filename>data.zip</filename>
            <mime_type>application/zip</mime_type>
            <hash>5abf03f97ce6340cb9496babfb5f903e</hash>
            <hash_type>MD5</hash_type>
            <filesize>1944707089</filesize>
            <mtime>2022-05-27 14:08:40</mtime>
            <url>https://researchdata.bath.ac.uk/1148/2/data.zip</url>
          </file>
        </files>
        <eprintid>1148</eprintid>
        <pos>2</pos>
        <placement>2</placement>
        <mime_type>application/zip</mime_type>
        <format>other</format>
        <formatdesc>Zip file containing Gaussian09 and Gaussian16 output files for five distinct projects included in the associated thesis, as well as a document with all machine learning model metrics, features, and hyperparameters. A README.txt file is also included that details the structure of the folders and files provided.</formatdesc>
        <language>en</language>
        <security>public</security>
        <license>cc_by</license>
        <main>data.zip</main>
        <content>data</content>
      </document>
      <document id='https://researchdata.bath.ac.uk/id/document/16046'>
        <docid>16046</docid>
        <rev_number>3</rev_number>
        <files>
          <file id='https://researchdata.bath.ac.uk/id/file/53291'>
            <fileid>53291</fileid>
            <datasetid>document</datasetid>
            <objectid>16046</objectid>
            <filename>README.txt</filename>
            <mime_type>text/plain</mime_type>
            <hash>2094b12e97c0ae1255393c67ae8de3bc</hash>
            <hash_type>MD5</hash_type>
            <filesize>1758</filesize>
            <mtime>2022-05-27 14:59:41</mtime>
            <url>https://researchdata.bath.ac.uk/1148/3/README.txt</url>
          </file>
        </files>
        <eprintid>1148</eprintid>
        <pos>3</pos>
        <placement>3</placement>
        <mime_type>text/plain</mime_type>
        <format>other</format>
        <formatdesc>Documentation of folder structure</formatdesc>
        <language>en</language>
        <security>public</security>
        <license>cc_by</license>
        <main>README.txt</main>
        <content>documentation</content>
      </document>
      <document id='https://researchdata.bath.ac.uk/id/document/16047'>
        <docid>16047</docid>
        <rev_number>1</rev_number>
        <files>
          <file id='https://researchdata.bath.ac.uk/id/file/53293'>
            <fileid>53293</fileid>
            <datasetid>document</datasetid>
            <objectid>16047</objectid>
            <filename>indexcodes.txt</filename>
            <mime_type>text/plain</mime_type>
            <hash>1e0a546dd56dddae7862fde2994fecf0</hash>
            <hash_type>MD5</hash_type>
            <filesize>955</filesize>
            <mtime>2022-05-27 15:00:42</mtime>
            <url>https://researchdata.bath.ac.uk/1148/4/indexcodes.txt</url>
          </file>
        </files>
        <eprintid>1148</eprintid>
        <pos>4</pos>
        <placement>4</placement>
        <mime_type>text/plain</mime_type>
        <format>other</format>
        <formatdesc>Generate index codes conversion from other to indexcodes</formatdesc>
        <language>en</language>
        <security>public</security>
        <main>indexcodes.txt</main>
        <relation>
          <item>
            <type>http://eprints.org/relation/isVersionOf</type>
            <uri>https://researchdata.bath.ac.uk/id/document/16046</uri>
          </item>
          <item>
            <type>http://eprints.org/relation/isVolatileVersionOf</type>
            <uri>https://researchdata.bath.ac.uk/id/document/16046</uri>
          </item>
          <item>
            <type>http://eprints.org/relation/isIndexCodesVersionOf</type>
            <uri>https://researchdata.bath.ac.uk/id/document/16046</uri>
          </item>
        </relation>
      </document>
    </documents>
    <eprint_status>archive</eprint_status>
    <userid>10223</userid>
    <dir>disk0/00/00/11/48</dir>
    <datestamp>2022-08-12 08:32:24</datestamp>
    <lastmod>2024-07-18 12:32:04</lastmod>
    <status_changed>2022-08-12 08:32:24</status_changed>
    <type>data_collection</type>
    <metadata_visibility>show</metadata_visibility>
    <creators>
      <item>
        <name>
          <family>Farrar</family>
          <given>Elliot</given>
        </name>
        <id>ehef20@bath.ac.uk</id>
        <orcid>0000-0003-3350-2907</orcid>
        <affiliation>University of Bath</affiliation>
        <contact>TRUE</contact>
      </item>
    </creators>
    <contributors>
      <item>
        <type>Supervisor</type>
        <name>
          <family>Grayson</family>
          <given>Matthew</given>
        </name>
        <id>M.N.Grayson@bath.ac.uk</id>
        <orcid>0000-0003-2116-7929</orcid>
        <affiliation>University of Bath</affiliation>
      </item>
    </contributors>
    <title>Dataset for &quot;Computational Modelling and Machine Learning Approaches Towards Understanding Asymmetric Catalytic Organic Reactions&quot;</title>
    <subjects>
      <item>CJ0020</item>
      <item>CM0010</item>
    </subjects>
    <divisions>
      <item>dept_chem</item>
    </divisions>
    <keywords>Computational chemistry, Asymmetric organocatalysis, Computational modelling, Density functional theory, Machine learning</keywords>
    <abstract>For several decades, chemical modelling methods such as density functional theory (DFT) have provided invaluable contributions to the understanding of asymmetric and catalytic reactions. However, machine learning (ML) models, once trained, could allow for much more rapid screening of chemical reactions. In the thesis associated with this dataset, research into two distinct approaches to understanding organic reactions - modelling and ML - are presented, including several examples of conventional modelling with DFT, as well as details of a new ML methodology that bridges the gap between semi-empirical quantum mechanical (SQM) methods and DFT. This repository contains Gaussian09 and Gaussian16 output files for all computed structures used in this work, as well as a document containing a complete list of all metrics, features, and hyperparameters for all computed ML models.</abstract>
    <date>2022-08-12</date>
    <publisher>University of Bath</publisher>
    <full_text_status>public</full_text_status>
    <lay_summary>All computed chemical structures used to model several chemical reactions and build machine learning models to predict high-level chemical reaction barriers using low-level inputs.</lay_summary>
    <corp_contributors>
      <item>
        <type>RightsHolder</type>
        <corpname>University of Bath</corpname>
      </item>
    </corp_contributors>
    <funding>
      <item>
        <funder_name>Engineering and Physical Sciences Research Council</funder_name>
        <funder_id>https://doi.org/10.13039/501100000266</funder_id>
        <grant_id>EP/R513155/1</grant_id>
        <project_name>DTP 2018-19 University of Bath</project_name>
      </item>
    </funding>
    <collection_method>Structures were generated by conformational searching with Schrödinger&apos;s MacroModel (Version 11.3, 11.6, 12.7) and enumeration with Schrödinger’s R-Group Enumeration. These structures were optimised with Gaussian09 (Revision D.01) (performed at the University of Cambridge) and Gaussian16 (Revision A.03) (performed at the University of Bath) using several different molecular modelling methods. Additional analyses were performed with NCIPLOT. Machine learning models were built using the Scikit-learn and mlxtend python packages.</collection_method>
    <collection_date>
      <date_from>2018-09-01</date_from>
      <date_to>2022-05-26</date_to>
    </collection_date>
    <language>en</language>
    <version>1</version>
    <doi>10.15125/BATH-01148</doi>
    <related_resources>
      <item>
        <link>https://doi.org/10.1002/anie.202005563</link>
        <type>pub</type>
      </item>
      <item>
        <link>https://doi.org/10.1002/anie.202011256</link>
        <type>pub</type>
      </item>
      <item>
        <link>https://doi.org/10.1021/acs.joc.0c02226</link>
        <type>pub</type>
      </item>
      <item>
        <link>https://researchportal.bath.ac.uk/en/studentTheses/93e667c0-15f7-40b8-aca8-69d55534a1a9</link>
        <type>thesis</type>
      </item>
    </related_resources>
    <equipment>
      <item>
        <name>Balena High Performance Computing (HPC) System (Bath)</name>
        <id>3e22ef13-31b9-4700-b57e-57bcd3b3b985</id>
      </item>
      <item>
        <name>Anatra High Performance Computing (HPC) System (Bath)</name>
      </item>
      <item>
        <name>Darwin High Performance Computing (HPC) System (Cambridge)</name>
      </item>
      <item>
        <name>Ziggy Compute Server (Cambridge)</name>
      </item>
    </equipment>
    <access_types>
      <item>open</item>
    </access_types>
    <resourcetype>
      <general>Dataset</general>
    </resourcetype>
  </eprint>
</eprints>
