Dataset of the study: "Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard"

This dataset contains the 30 questions that were posed to the chatbots (i) ChatGPT-3.5; (ii) ChatGPT-4; and (iii) Google Bard, in May 2023 for the study “Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard”. These 30 questions describe mathematics and logic problems that have a unique correct answer. The questions are fully described with plain text only, without the need for any images or special formatting. The questions are divided into two sets of 15 questions each (Set A and Set B). The questions of Set A are 15 “Original” problems that cannot be found online, at least in their exact wording, while Set B contains 15 “Published” problems that one can find online by searching on the internet, usually with their solution. Each question is posed three times to each chatbot.

This dataset contains the following: (i) The full set of the 30 questions, A01-A15 and B01-B15; (ii) the correct answer for each one of them; (iii) an explanation of the solution, for the problems where such an explanation is needed, (iv) the 30 (questions) × 3 (chatbots) × 3 (answers) = 270 detailed answers of the chatbots. For the published problems of Set B, we also provide a reference to the source where each problem was taken from.

Keywords:
Chatbot, AI, Logic, Mathematics, FOS: Mathematics, ChatGPT, GPT-3.5, GPT-4, Google Bard

Cite this dataset as:
Plevris, V., Papazafeiropoulos, G., Jimenez Rios, A., 2023. Dataset of the study: "Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard". Zenodo. Available from: https://doi.org/10.5281/zenodo.7940781.

Export

Creators

George Papazafeiropoulos
National Technical University of Athens

Alejandro Jimenez Rios
Oslo Metropolitan University

Funders

Oslo Metropolitan University
https://doi.org/10.13039/100020586

Publication details

Publication date: 20 May 2023
by: Zenodo

Version: 1

DOI: https://doi.org/10.5281/zenodo.7940781

URL for this record: https://researchdata.bath.ac.uk/1556

Related papers and books

Plevris, V., Papazafeiropoulos, G., and Jiménez Rios, A., 2023. Chatbots Put to the Test in Math and Logic Problems: A Comparison and Assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard. AI, 4(4), 949-969. Available from: https://doi.org/10.3390/ai4040048.

Contact information

Please contact the Research Data Service in the first instance for all matters concerning this item.

Contact person: Alejandro Jimenez Rios

Departments:

Faculty of Engineering & Design
Architecture & Civil Engineering