LLM Dominance Over Text-Based Research Tasks
#1Frontier LLMs as of 2025-2026 (GPT-4o, Claude 3.5/3.7 Sonnet, Gemini 1.5/2.0 Pro) process and synthesize text at speeds and volumes that dwarf human capacity: a model with a 1-million-token context window can 'read' and synthesize the equivalent of a 750,000-word dissertation in seconds. Historians' entire core workflow β reading, note-taking, synthesis, drafting β maps directly onto the tasks these models were optimized to perform. Benchmarks on historical comprehension tasks show frontier models scoring at or above PhD-level performance on standardized assessments of source interpretation and essay argumentation.