This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Demystifying ChatGPT: How It Masters Genre Recognition
Citations: 0
Authors: 4
Year: 2026
Abstract
The emergence of ChatGPT has drawn considerable attention in the NLP community for its impressive performance across a wide range of language tasks. However, its effectiveness in multi-label movie genre prediction remains underexplored. This study evaluates the genre prediction capabilities of multiple Large Language Models (LLMs), including ChatGPT, using the MovieLens-100K dataset, which comprises 1682 movies spanning 18 genres. We investigate zero-shot and few-shot prompting strategies based on movie trailer transcripts and subtitles, where each movie may belong to multiple genres. Our results show that ChatGPT consistently outperforms earlier LLM baselines under both zero-shot and few-shot settings, while instruction fine-tuning further improves recall and overall predictive coverage. To explore multimodal extensions, we augment textual prompts with visual cues extracted from movie posters using a Vision-Language Model (VLM). The incorporation of visual information yields selective, genre-dependent benefits, particularly improved recall for visually distinctive genres, but the gains in aggregate performance metrics remain limited. Taken together, our findings highlight the robustness of prompt-based and fine-tuned LLMs for genre prediction, and suggest that multimodal information can provide complementary signals in specific cases, motivating future work on tighter task-aligned vision-language integration.
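The zero-shot and few-shot setup described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the prompt wording, the five-genre `GENRES` subset, and the helper names `build_prompt`, `parse_genres`, and `micro_precision_recall` are all assumptions made for illustration, and no model is actually called. The sketch only shows how a prompt might be assembled from a transcript, how a free-text reply could be mapped back to a label set, and how multi-label precision and recall could be micro-averaged.

```python
# Hypothetical subset of genre labels; the paper uses 18 MovieLens genres.
GENRES = ["Action", "Comedy", "Drama", "Horror", "Romance"]


def build_prompt(transcript, examples=()):
    """Build a zero-shot prompt (no examples) or a few-shot prompt.

    `examples` is a sequence of (transcript, genres) pairs; the wording
    of the instruction is an assumption, not the paper's prompt.
    """
    lines = [
        "Assign every applicable genre to the movie. "
        "Choose only from: " + ", ".join(GENRES) + ".",
        "Answer with a comma-separated list.",
    ]
    for ex_text, ex_genres in examples:
        lines.append(f"Transcript: {ex_text}")
        lines.append("Genres: " + ", ".join(ex_genres))
    lines.append(f"Transcript: {transcript}")
    lines.append("Genres:")
    return "\n".join(lines)


def parse_genres(reply):
    """Map a comma-separated model reply to the set of known labels."""
    lookup = {g.lower(): g for g in GENRES}
    found = set()
    for part in reply.split(","):
        key = part.strip().strip(".").lower()
        if key in lookup:  # silently drop labels outside the label set
            found.add(lookup[key])
    return found


def micro_precision_recall(predicted, gold):
    """Micro-averaged precision and recall over per-movie label sets."""
    tp = sum(len(p & g) for p, g in zip(predicted, gold))
    n_pred = sum(len(p) for p in predicted)
    n_gold = sum(len(g) for g in gold)
    precision = tp / n_pred if n_pred else 0.0
    recall = tp / n_gold if n_gold else 0.0
    return precision, recall
```

Passing an empty `examples` argument gives the zero-shot variant, while supplying a few labeled transcripts gives the few-shot variant. Micro-averaging is one common choice for multi-label evaluation; the page does not state which averaging the study reports.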