A team of researchers has developed a novel benchmark to evaluate the historical knowledge of leading large language models ...