Dataset of the study: "Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard"
This dataset contains the 30 questions posed in May 2023 to three chatbots, (i) ChatGPT-3.5, (ii) ChatGPT-4, and (iii) Google Bard, for the study “Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard”. Each question describes a mathematics or logic problem with a unique correct answer and is stated fully in plain text, with no need for images or special formatting. The questions are divided into two sets of 15 (Set A and Set B). Set A contains 15 “Original” problems that cannot be found online, at least not in their exact wording, while Set B contains 15 “Published” problems that can be found online, usually together with their solutions. Each question was posed three times to each chatbot.
This dataset contains the following: (i) the full set of the 30 questions, A01-A15 and B01-B15; (ii) the correct answer to each question; (iii) an explanation of the solution, for the problems where such an explanation is needed; and (iv) the 30 (questions) × 3 (chatbots) × 3 (answers) = 270 detailed answers of the chatbots. For the published problems of Set B, we also provide a reference to the source from which each problem was taken.
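The count of 270 answers follows directly from the study design described above. As a minimal sketch, the snippet below enumerates every (question, chatbot, attempt) combination. The question labels and chatbot names come from the description. The attempt numbers 1 to 3 and the tuple layout are assumptions made for illustration only and do not reflect how the dataset files are actually organised.

```python
from itertools import product

# Question labels as given in the description: A01-A15 and B01-B15.
questions = [f"{s}{i:02d}" for s in ("A", "B") for i in range(1, 16)]

# The three chatbots named in the study.
chatbots = ["ChatGPT-3.5", "ChatGPT-4", "Google Bard"]

# Each question was posed three times; the numbering 1-3 is an assumption.
attempts = [1, 2, 3]

# One entry per detailed answer: (question, chatbot, attempt).
answers = list(product(questions, chatbots, attempts))

print(len(questions), len(answers))
```

Indexing the answers this way makes it easy to check that a local copy of the dataset is complete, for example by comparing the expected combinations against the answers actually present.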
Cite this dataset as:
Plevris, V., Papazafeiropoulos, G., and Jimenez Rios, A., 2023. Dataset of the study: "Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard". Zenodo. Available from: https://doi.org/10.5281/zenodo.7940781.
Creators
Vagelis Plevris
Qatar University
George Papazafeiropoulos
National Technical University of Athens
Alejandro Jimenez Rios
Oslo Metropolitan University
Funders
Oslo Metropolitan University
https://doi.org/10.13039/100020586
Publication details
Publication date: 20 May 2023
Published by: Zenodo
Version: 1
DOI: https://doi.org/10.5281/zenodo.7940781
URL for this record: https://researchdata.bath.ac.uk/1556
Related papers and books
Plevris, V., Papazafeiropoulos, G., and Jiménez Rios, A., 2023. Chatbots Put to the Test in Math and Logic Problems: A Comparison and Assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard. AI, 4(4), 949-969. Available from: https://doi.org/10.3390/ai4040048.
Contact information
Please contact the Research Data Service in the first instance for all matters concerning this item.
Contact person: Alejandro Jimenez Rios
Faculty of Engineering & Design
Architecture & Civil Engineering