Web Alignment Flow
Knowledge-Web Alignment Process
A methodology for measuring how well a language model's self-reported knowledge correlates with the amount of information actually available on the web, tested on real bacterial species.
Bacillus subtilis
Real species
LLM Query
Ask model for knowledge level
Knowledge group: Limited
Web Search
Count Google search results: 1.5 M
Correlation Analysis
Measure alignment between model knowledge claims and web presence
r = 0.42
Moderate alignment
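The correlation step can be sketched in Python. This is a minimal illustration, not the method behind the reported figure. It assumes that r is a Pearson coefficient, that knowledge groups are encoded as ordinal scores, and that search result counts are log-scaled before comparison; the source states none of these. The names KNOWLEDGE_SCORE, pearson_r, and alignment_correlation are hypothetical.

```python
import math

# Assumed ordinal encoding. Only "Limited" and "Extensive" appear in the
# source; any further groups would need to be added to this mapping.
KNOWLEDGE_SCORE = {"Limited": 0, "Extensive": 1}


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def alignment_correlation(records):
    """Correlate knowledge scores with log-scaled search result counts.

    records: list of (knowledge_group, search_result_count) pairs,
    one pair per species.
    """
    scores = [KNOWLEDGE_SCORE[group] for group, _ in records]
    # Assumption: log10(count + 1) tames the heavy tail of web hit counts
    # and keeps a count of zero well defined.
    log_hits = [math.log10(count + 1) for _, count in records]
    return pearson_r(scores, log_hits)
```

Log-scaling is a design choice made here because raw hit counts span several orders of magnitude; a rank-based coefficient such as Spearman's would be a reasonable alternative.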
Limited + Low web presence = Good alignment
Extensive + High web presence = Good alignment
Extensive + Low web presence = Poor alignment
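The three rules above can be written as a small lookup, sketched here in Python. The function name classify_alignment and the high_threshold parameter are hypothetical. The source does not say where the boundary between low and high web presence lies, so the caller must supply it. Combinations the source does not list, such as Limited with high web presence, are reported as "Unspecified" rather than guessed.

```python
# Rules exactly as listed in the source: (knowledge group, web presence) -> alignment.
ALIGNMENT_RULES = {
    ("Limited", "Low"): "Good",
    ("Extensive", "High"): "Good",
    ("Extensive", "Low"): "Poor",
}


def classify_alignment(knowledge_group, result_count, high_threshold):
    """Label one species as Good, Poor, or Unspecified alignment.

    high_threshold is an assumed cutoff: counts at or above it are treated
    as high web presence. The source does not define this value.
    """
    presence = "High" if result_count >= high_threshold else "Low"
    # Fall back to "Unspecified" for any combination the source does not cover.
    return ALIGNMENT_RULES.get((knowledge_group, presence), "Unspecified")
```

For example, with a caller-chosen cutoff of 100 results, classify_alignment("Extensive", 10, high_threshold=100) falls under the third rule and is labeled as poor alignment.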