Full Template Intro

Article Page: Knowledge Calibration File: sections/30a_full_template_intro.html Theme: purple

Each query template tests a different aspect of model knowledge, from basic taxonomic information to specific physiological characteristics. The detailed breakdowns below show how each model performed on individual templates, revealing template-specific strengths and weaknesses. This granular view helps identify which types of queries are most likely to trigger hallucinations in different models.