Large Language Models Pass the Turing Test
We evaluated four systems (ELIZA, GPT-4o, LLaMa-3.1-405B, and GPT-4.5) in two randomised, controlled, and pre-registered Turing tests on independent populations. Participants had 5-minute conversations simultaneously with another human participant and one of these systems before judging which conversational partner they thought was human.
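As an illustration only, the sketch below shows one way the verdicts from such a three-party test could be scored. The passage above does not specify an analysis, so everything here is an assumption: the function names `win_rate` and `binomial_two_sided_p` are hypothetical, each trial is reduced to a boolean (True when the interrogator picked the AI system as the human), and the comparison against a 50% chance level uses a generic exact binomial test rather than any method confirmed by the authors.

```python
from math import comb


def win_rate(verdicts):
    """Fraction of trials in which the AI witness was judged to be the human.

    `verdicts` is a sequence of booleans, one per trial (hypothetical encoding).
    """
    if not verdicts:
        raise ValueError("need at least one verdict")
    return sum(verdicts) / len(verdicts)


def binomial_two_sided_p(successes, trials, p=0.5):
    """Exact two-sided binomial test against chance level `p`.

    Sums the probability of every outcome that is no more likely
    than the observed count under the null hypothesis.
    """
    if not 0 <= successes <= trials:
        raise ValueError("successes must lie between 0 and trials")
    # Probability of each possible count 0..trials under the null.
    pmf = [comb(trials, k) * p**k * (1 - p) ** (trials - k) for k in range(trials + 1)]
    observed = pmf[successes]
    # Small relative tolerance guards against floating-point ties.
    total = sum(q for q in pmf if q <= observed * (1 + 1e-9))
    return min(1.0, total)


if __name__ == "__main__":
    # Made-up verdicts for illustration; these are NOT results from the study.
    toy_verdicts = [True, False, True, True, False, False, True, False]
    rate = win_rate(toy_verdicts)
    p_value = binomial_two_sided_p(sum(toy_verdicts), len(toy_verdicts))
    print(f"win rate = {rate:.2f}, two-sided p = {p_value:.3f}")
```

Under this framing, a win rate near 50% would mean interrogators could not reliably tell the system from the human participant, while a rate well below 50% would mean they usually identified the human correctly.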