<?xml version='1.0' encoding='utf-8'?>
<eprints xmlns='http://eprints.org/ep2/data/2.0'>
  <eprint id='https://researchdata.bath.ac.uk/id/eprint/1398'>
    <eprintid>1398</eprintid>
    <rev_number>46</rev_number>
    <documents>
      <document id='https://researchdata.bath.ac.uk/id/document/17989'>
        <docid>17989</docid>
        <rev_number>6</rev_number>
        <files>
          <file id='https://researchdata.bath.ac.uk/id/file/67109'>
            <fileid>67109</fileid>
            <datasetid>document</datasetid>
            <objectid>17989</objectid>
            <filename>data_archive.zip</filename>
            <mime_type>application/zip</mime_type>
            <hash>7c1394e17bf9f9ca8f101c31309ae8fd</hash>
            <hash_type>MD5</hash_type>
            <filesize>4919231668</filesize>
            <mtime>2024-05-08 14:56:01</mtime>
            <url>https://researchdata.bath.ac.uk/1398/1/data_archive.zip</url>
          </file>
        </files>
        <eprintid>1398</eprintid>
        <pos>1</pos>
        <placement>1</placement>
        <mime_type>application/zip</mime_type>
        <format>other</format>
        <formatdesc>A zipped directory containing distortion/interaction calculations for four datasets: nitro-Michael addition (MA), Diels-Alder, [3+2] cycloaddition, and dimethyl malonate MA. These calculations have been performed at both AM1 and the DFT level of theory of the original dataset. For the dimethyl malonate MA dataset, the reactant and transition structure geometries are also provided. These calculations were performed at AM1 and wB97X-D/def2-TZVP (IEFPCM=Water)//wB97X-D/def2-TZVP.</formatdesc>
        <language>en</language>
        <security>staffonly</security>
        <license>all_rights_reserved</license>
        <main>data_archive.zip</main>
        <content>data</content>
      </document>
    </documents>
    <eprint_status>archive</eprint_status>
    <userid>12231</userid>
    <dir>disk0/00/00/13/98</dir>
    <datestamp>2024-10-22 09:49:13</datestamp>
    <lastmod>2025-01-31 11:48:29</lastmod>
    <status_changed>2024-10-22 09:49:13</status_changed>
    <type>data_collection</type>
    <metadata_visibility>no_search</metadata_visibility>
    <creators>
      <item>
        <name>
          <family>Espley</family>
          <given>Sam</given>
        </name>
        <id>sge28@bath.ac.uk</id>
        <orcid>0000-0002-1135-9890</orcid>
        <affiliation>University of Bath</affiliation>
        <contact>TRUE</contact>
      </item>
      <item>
        <name>
          <family>Allsop</family>
          <given>Sam</given>
        </name>
        <id>ssa48@bath.ac.uk</id>
        <orcid>0009-0002-8284-898X</orcid>
        <affiliation>University of Bath</affiliation>
        <contact>FALSE</contact>
      </item>
    </creators>
    <contributors>
      <item>
        <type>Supervisor</type>
        <name>
          <family>Buttar</family>
          <given>David</given>
        </name>
        <id>david.buttar@astrazeneca.com</id>
        <orcid>0000-0001-5466-023X</orcid>
        <affiliation>AstraZeneca</affiliation>
      </item>
      <item>
        <type>Supervisor</type>
        <name>
          <family>Tomasi</family>
          <given>Simone</given>
        </name>
        <id>simone.tomasi@astrazeneca.com</id>
        <orcid>0000-0002-9373-7639</orcid>
        <affiliation>AstraZeneca</affiliation>
      </item>
      <item>
        <type>Supervisor</type>
        <name>
          <family>Grayson</family>
          <given>Matthew</given>
        </name>
        <id>M.N.Grayson@bath.ac.uk</id>
        <orcid>0000-0003-2116-7929</orcid>
        <affiliation>University of Bath</affiliation>
      </item>
    </contributors>
    <title>Dataset for &quot;Distortion/Interaction Analysis via Machine Learning&quot;</title>
    <subjects>
      <item>CJ0020</item>
    </subjects>
    <divisions>
      <item>dept_chem</item>
    </divisions>
    <keywords>Machine Learning, Gaussian, Distortion, Interaction, Activation, Strain, Reaction Barrier, Computational Chemistry, Diels-Alder, Michael Addition, Cycloadditions</keywords>
    <abstract>Machine learning (ML) has previously been applied to predict reaction barriers for a variety of different chemical reactions. This is seen as the end point for this type of study however, post-reaction barrier analysis/energy decomposition approaches can provide insight into chemical reactivity. One such approach that has previously been used to provide information on chemical reactivity, for cycloaddition reactions in particular, is distortion/interaction-activation strain analysis (DIAS). We demonstrate that ML can be coupled with cheap and rapid semi-empirical quantum mechanical methods (SQM) to predict distortion and interaction energies at a fraction of the computational cost associated with running density functional theory (DFT) calculations.  This dataset includes all the structural data in the form of Gaussian16 (Revision A.03 and C.01) output files for the four datasets used in this work and, the literature dataset reactions.</abstract>
    <date>2024-10-21</date>
    <publisher>University of Bath</publisher>
    <full_text_status>restricted</full_text_status>
    <dataurl>
      <item>
        <link>https://github.com/the-grayson-group/distortion-interaction_ML</link>
        <description>GitHub Link</description>
      </item>
    </dataurl>
    <corp_contributors>
      <item>
        <type>RightsHolder</type>
        <corpname>University of Bath</corpname>
      </item>
      <item>
        <type>RightsHolder</type>
        <corpname>AstraZeneca</corpname>
      </item>
    </corp_contributors>
    <funding>
      <item>
        <funder_name>Engineering and Physical Sciences Research Council (EPSRC)</funder_name>
        <funder_id>https://doi.org/10.13039/501100000266</funder_id>
        <grant_id>EP/V519637/1</grant_id>
        <project_name>Industrial CASE Account - University of Bath 2020</project_name>
      </item>
      <item>
        <funder_name>Engineering and Physical Sciences Research Council (EPSRC)</funder_name>
        <funder_id>https://doi.org/10.13039/501100000266</funder_id>
        <grant_id>EP/W003724/1</grant_id>
        <project_name>Machine Learning and Molecular Modelling: A Synergistic Approach to Rapid Reactivity Prediction</project_name>
      </item>
    </funding>
    <collection_method>Ground state reactant and transition state geometries for dimethyl malonate Michael addition reactions were built using Schrödinger’s R-Group Enumeration. R-groups were placed on various different positions of the Michael acceptor; the position depended upon the molecules in question. All structures were built in Gaussian16 (Revisions A.03 and C.01) and were conformationally searched using Schrödinger’s MacroModel (version 12.7). All structures were subsequently optimised using Gaussian16 (Revisions A.03 and C.01) using AM1 and wB97X-D/def2-TZVP (IEFPCM=Water)//wB97X-D/def2-TZVP. 
For distortion/interaction-activation strain calculations, python code (available on the associated GitHub page: https://github.com/the-grayson-group/distortion-interaction_ML) was used to separate the distorted reactant structures before single point energies were calculated using Gaussian16 (Revision C.01) using AM1 and the DFT level of theory used in the original transition structure calculation.</collection_method>
    <language>en</language>
    <version>1</version>
    <doi>10.15125/BATH-01398</doi>
    <related_resources>
      <item>
        <link>https://doi.org/10.1039/D4DD00224E</link>
        <type>pub</type>
      </item>
      <item>
        <link>https://doi.org/10.15125/BATH-01229</link>
        <type>data</type>
      </item>
    </related_resources>
    <equipment>
      <item>
        <name>Anatra HTC cluster</name>
        <id>70796ba8-e816-43ad-a262-6f8cfa5870e9</id>
      </item>
    </equipment>
    <access_types>
      <item>open</item>
    </access_types>
    <access_arrangements>Due to the duplication of a file, a new version was created.</access_arrangements>
    <parent>1455</parent>
  </eprint>
</eprints>
