Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Have we reached artificial general intelligence? Comparison of ChatGPT, Claude, and Gemini to human literacy and education benchmarks
1
Zitationen
1
Autoren
2025
Jahr
Abstract
Recent advancements in artificial intelligence (AI), particularly in large language models (LLMs) like ChatGPT, Claude, and Gemini, have prompted questions about their proximity to artificial general intelligence (AGI). This quantitative study compares LLMs’ performance on educational benchmarks. A quantitative research methodology and secondary exploratory analysis were used to test the proposed hypothesis, stating that current LLMs, including ChatGPT, Claude, and Gemini, possess AGI by comparing their educational metric scores to public education standards. This study used an ex-post research design, whereby secondary data from authoritative sources were collected to compare educational achievements and human literacy levels with the AI model’s performance on similar tasks. The results show that LLMs significantly outperform human benchmarks in tasks such as undergraduate knowledge and advanced reading comprehension (ARC), indicating substantial progress toward AGI.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.578 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.470 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.984 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.814 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.