Jan Niehues
218 Arbeiten3.339 Zitationen
Relevante Arbeiten
Meistzitierte Publikationen im Bereich Gesundheit & MedTech
SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading
2024 · 10 Zit.
SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading
2024 · 1 Zit. · arXiv (Cornell University)
Knockout LLM Assessment: Using Large Language Models for Evaluations through Iterative Pairwise Comparisons
2025 · 0 Zit. · ArXiv.org