<?xml version='1.0' encoding='utf-8'?>
<eprints xmlns='http://eprints.org/ep2/data/2.0'>
  <eprint id='https://researchdata.bath.ac.uk/id/eprint/1092'>
    <eprintid>1092</eprintid>
    <rev_number>76</rev_number>
    <documents>
      <document id='https://researchdata.bath.ac.uk/id/document/16038'>
        <docid>16038</docid>
        <rev_number>2</rev_number>
        <files>
          <file id='https://researchdata.bath.ac.uk/id/file/53199'>
            <fileid>53199</fileid>
            <datasetid>document</datasetid>
            <objectid>16038</objectid>
            <filename>structures.zip</filename>
            <mime_type>application/zip</mime_type>
            <hash>1c53d038f24c0665101ccc6bf66d7dad</hash>
            <hash_type>MD5</hash_type>
            <filesize>1136300268</filesize>
            <mtime>2022-05-20 09:16:37</mtime>
            <url>https://researchdata.bath.ac.uk/1092/1/structures.zip</url>
          </file>
        </files>
        <eprintid>1092</eprintid>
        <pos>1</pos>
        <placement>1</placement>
        <mime_type>application/zip</mime_type>
        <format>other</format>
        <formatdesc>Zip file containing optimised structures for each level of theory (AM1, AM1-IEFPCM, PM6, PM6-IEFPCM, UFF, wB97XD, wB97XD-IEFPCM). With the exception of IEFPCM folders, each level of theory includes the nucleophile (nuc), 1037 Michael acceptors (gs), and 1037 Michael addition transition states for the reaction of the nucleophile with each Michael acceptor (ts), complete with IEFPCM(toluene) single point energy calculations. IEFPCM folders include only the 37 literature reactions.</formatdesc>
        <language>en</language>
        <security>public</security>
        <license>cc_by</license>
        <main>structures.zip</main>
        <content>data</content>
      </document>
    </documents>
    <eprint_status>archive</eprint_status>
    <userid>10223</userid>
    <dir>disk0/00/00/10/92</dir>
    <datestamp>2022-06-15 11:20:39</datestamp>
    <lastmod>2025-06-16 10:54:17</lastmod>
    <status_changed>2022-06-15 11:20:39</status_changed>
    <type>data_collection</type>
    <metadata_visibility>show</metadata_visibility>
    <creators>
      <item>
        <name>
          <family>Farrar</family>
          <given>Elliot</given>
        </name>
        <id>ehef20@bath.ac.uk</id>
        <orcid>0000-0003-3350-2907</orcid>
        <affiliation>University of Bath</affiliation>
        <contact>TRUE</contact>
      </item>
      <item>
        <name>
          <family>Grayson</family>
          <given>Matt</given>
        </name>
        <id>M.N.Grayson@bath.ac.uk</id>
        <orcid>0000-0003-2116-7929</orcid>
        <affiliation>University of Bath</affiliation>
        <contact>FALSE</contact>
      </item>
    </creators>
    <title>Dataset for &quot;Machine learning and semi-empirical calculations: A synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction&quot;</title>
    <subjects>
      <item>CJ0020</item>
      <item>CM0010</item>
    </subjects>
    <divisions>
      <item>dept_chem</item>
    </divisions>
    <keywords>Gaussian, Computational Chemistry, Machine Learning, Michael Additions</keywords>
    <abstract>Modern quantum mechanical modelling methods, such as Density Functional Theory (DFT), have provided detailed mechanistic insights into countless reactions and have been used in the design of a handful of chemical transformations. However, their computational cost inhibits their ability to rapidly screen large numbers of substrates and catalysts in reaction discovery. For a C-C bond forming Nitro-Michael addition, we introduce a synergistic semi-empirical quantum mechanical (SQM) and machine learning (ML) approach that achieves the fast and accurate prediction of DFT-quality free energy activation barriers using purely SQM-derived data. This dataset includes all the structural data, in the form of Gaussian16 (Revision A.03) output files, for the Nitro-Michael reaction used for this machine learning analysis.</abstract>
    <date>2022-06-14</date>
    <publisher>University of Bath</publisher>
    <full_text_status>public</full_text_status>
    <lay_summary>All computed chemical structures used to build machine learning models to predict high-level chemical reaction barriers using low-level inputs.</lay_summary>
    <corp_contributors>
      <item>
        <type>RightsHolder</type>
        <corpname>University of Bath</corpname>
      </item>
    </corp_contributors>
    <funding>
      <item>
        <funder_name>Engineering and Physical Sciences Research Council</funder_name>
        <funder_id>https://doi.org/10.13039/501100000266</funder_id>
        <grant_id>EP/N509589/1</grant_id>
        <project_name>DTP 2016-2017 University of Bath</project_name>
      </item>
    </funding>
    <collection_method>Uncatalysed reactant and transition state geometries for 1000 Nitro-Michael addition reactions were built using Schrödinger’s R-Group Enumeration by varying at four positions of a generic Michael acceptor core with common organic fragments. In addition to the nucleophile, uncatalysed reactant and transition state geometries for a further 37 biologically important Nitro-Michael addition reactions from literature were built in Gaussian16 (Revision A.03). All reactant and transition state structures were conformationally searched using Schrödinger’s MacroModel (version 12.7). All structures were subsequently optimised using Gaussian16 (Revision A.03) using several different molecular modelling methods.</collection_method>
    <language>en</language>
    <version>1</version>
    <doi>10.15125/BATH-01092</doi>
    <related_resources>
      <item>
        <link>https://doi.org/10.1039/D2SC02925A</link>
        <type>pub</type>
      </item>
    </related_resources>
    <equipment>
      <item>
        <name>Balena High Performance Computing (HPC) System</name>
        <id>3e22ef13-31b9-4700-b57e-57bcd3b3b985</id>
      </item>
    </equipment>
    <access_types>
      <item>open</item>
    </access_types>
    <resourcetype>
      <general>Dataset</general>
    </resourcetype>
  </eprint>
</eprints>
