Ministers have shut down or dropped at least half a dozen artificial intelligence prototypes intended for the welfare system, the Guardian has learned, in a sign of the headwinds facing Keir Starmer’s effort to increase government efficiency.
Pilots of AI technology to enhance staff training, improve the service in jobcentres, speed up disability benefit payments and modernise communication systems are not being taken forward, freedom of information (FoI) requests reveal.
Officials have internally admitted that ensuring AI systems are “scalable, reliable [and] thoroughly tested” are key challenges and say there have been many “frustrations and false starts”.
Not all trials would be expected to make it into regular use, but two of those now scrapped had been highlighted by the Department for Work and Pensions (DWP) in its latest annual report as examples of how it had “successfully tested multiple generative AI proofs of concept”.
A-cubed was intended to help staff steer jobseekers into work. Aigent was supposed to accelerate personal independence payments relied on by millions of people with disabilities.
This month the prime minister declared “AI is the way … to transform our public services” and wrote to all cabinet ministers “tasking them with driving AI adoption and growth … and making that a top priority for their departments”.
“Unsuccessful pilots and trials aren’t necessarily a cause for concern, as they offer an opportunity to improve, but these failures raise important questions for the government’s approach to AI in the public sector,” said Imogen Parker, associate director at the Ada Lovelace Institute, an independent research body focused on data and AI. “Are the right lessons being learned and acted upon, and does the reality of AI match the rhetoric?”
No information about AI used by the DWP in the welfare system has yet been disclosed on the government algorithm transparency register, which has been a requirement across Whitehall for almost a year.
Officials say the time spent on the pilot software is not wasted, as the technology could later appear as part of a system that is rolled out, and thorough testing is essential prior to rollouts. But the move illustrates the complexities of Labour’s hope to deploy AI to revolutionise public services and increase economic productivity.
This week Peter Kyle, the secretary of state for science, innovation and technology, announced a “blueprint for a modern digital government” and said his department “will put AI to work, speeding up our ability to deliver our Plan for Change, improve lives and drive growth”.
Writing in December after a year of running i.AI, the Whitehall AI incubator, its director, Laura Gilbert, admitted “there have been abundant blockers, frustrations and false starts”, but said “if something fails, we try, try again and find another route to impact”.
She said that of 57 ideas tested, 11 made it to rollout in various stages of testing and scaling. She added it has been working with US AI firms including OpenAI, Anthropic, Google and Microsoft.
DWP officials told tech companies at a private meeting in August that making sure “products are scalable, reliable [and] thoroughly tested” are key challenges in moving AI systems from proofs of concept [POC] to full use, according to meeting notes released under FoI. They showed that “approximately 9 POCs have so far been completed” and “one POC has gone live, one is in the process of going live”.
“It’s encouraging that the public sector isn’t taking a rigid or dogmatic approach to AI, particularly in welfare, where the risks of amplifying inequalities and causing real injustice are significant,” said Parker. “Yet a lack of transparency remains a critical issue … [It] should not depend on journalistic investigation – openness, evaluation, and learning must be central to the government’s strategy.”
The DWP declined to comment on the specific reasons AI pilots were dropped, but said considerations can include technological maturity, business readiness, business value, and scalability. It said it rigorously tests how much value the technology provides to officials and the public and its value for money.
A government spokesperson said: “Proof of concept projects are deliberately short, enabling new and innovative technologies to be explored and prototyped – not all projects are expected to become long-term, and the learning from them can be used in the future.
“This aligns with our ‘scan, pilot, scale’ approach set out in the AI opportunities action plan – because we recognise the tremendous potential of AI to transform our public services and save taxpayers billions.”