Phenotype Analysis Updated regularly

Evaluating LLM performance on fundamental microbial phenotype prediction

Comprehensive analysis of how language models predict broad microbial characteristics, from Gram staining to pathogenicity, across thousands of species.

P. C. Münch, N. Safaei, R. Mreches, M. Binder, Y. Han, G. Robertson, E. A. Franzosa, C. Huttenhower, A. C. McHardy
DOI: COMING SOON
Knowledge Analysis Updated regularly

Assessing LLM knowledge calibration for microbial taxonomy

Evaluating how much LLMs claim to know about bacteria by comparing their responses against internet data, which reveals how often they make unfounded claims about unknown species.

P. C. Münch, N. Safaei, R. Mreches, M. Binder, Y. Han, G. Robertson, E. A. Franzosa, C. Huttenhower, A. C. McHardy
DOI: COMING SOON
Coming Soon In development

Predicting bacterial growth conditions and metabolic flexibility

Upcoming evaluation framework for testing LLM understanding of environmental factors, nutrient requirements, and metabolic pathways in bacteria.

P. C. Münch, N. Safaei, R. Mreches, M. Binder, Y. Han, G. Robertson, E. A. Franzosa, C. Huttenhower, A. C. McHardy
DOI: COMING SOON (preprint)