Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A Comparative Analysis of Mental Models in Floating, Suspending, and Sinking Phenomena: Evidence from Students and Large Language Models (ChatGPT and DeepSeek)
0
Zitationen
4
Autoren
2026
Jahr
Abstract
The increasingly widespread use of Large Language Models (LLMs) in education creates opportunities for further investigation into epistemic reasoning abilities, particularly the characteristics of mental models. Therefore, this study aimed to characterize the mental models of LLMs, specifically the AI models DeepSeek and ChatGPT. The technologies were also compared to students in relation to the phenomena of floating, suspending, and sinking using descriptive-quantitative and a partial descriptive analysis. This study was conducted in a descriptive-quantitative design employing a cross-sectional comparative design. The quantitative analysis compares test scores across all subject groups, while the descriptive analysis is used to characterize students’ and AIs’ mental models. The study subjects consisted of 709 students and 120 chats for two AI models as respondents. Each AI was treated as comprising 60 respondents, with the item–test pair regarded as a distinct, independent chat instance. Based on the data analysis, mental models of students were predominantly categorized as initial, with only a limited decline following an increase in educational level. This decline was accompanied by a similarly limited increase in the proportion of scientific mental models, showing a modest influence of curricular interventions experienced by the students. The AI models exhibited higher proportions of scientific categories, but the mental models showed inconsistency when contexts changed, and the underlying aspects remained the same. Therefore, the reasoning was not classified as global coherence. The students and the AI models exhibited misapplications of density-based reasoning and Archimedes’ principle in several cases or phenomena presented in the test. The results showed that students and AI models have limitations in their mental models.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.786 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.700 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.270 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.908 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.