back to article DWP finds Copilot saves civil servants a whopping 19 minutes a day

Microsoft Copilot saved civil servants 19 minutes daily on routine tasks, according to Department for Work and Pensions (DWP) research comparing users to a control group of non-users. This finding is lower than the 26-minute saving reported by participants in a Government Digital Service research study of 20,000 civil servants …

  1. elsergiovolador Silver badge

    Costly placebo

    This is a PR puff piece dressed up as “research”.

    First, 19 minutes a day is not productivity. It is noise. You could claw back more than that by letting people start at 10, avoiding peak commuting, or cancelling one standing meeting about another meeting. No AI required. Just basic adult management.

    Second, the numbers are already wobbling. One government study says 26 minutes, another says 19, another says zero. That is not convergence. That is what happens when you measure vibes instead of outcomes. The only consistent result is “people felt nicer”, which is not the same thing as doing more useful work.

    Third, the sample is cooked. Volunteers and nominees using shiny new tools always report gains. It is the Hawthorne effect with autocomplete. Statistical “adjustments” are not magic. They do not turn self-selection into causality.

    Fourth, the tasks where Copilot “saves time” are exactly the ones civil servants should not be optimising blindly. Searching internal docs faster just means propagating outdated nonsense quicker. First-draft emails were never the bottleneck. Judgement, accountability, and decision-making were. Even the article admits the tool falls down the moment human judgement is required. That is the actual job.

    Fifth, the cost-benefit is missing entirely. We are told minutes were saved, but not how much was paid per minute to a foreign, tax-optimised corporation. Billions leave the UK so staff can shave a few minutes off emails that probably should not exist in the first place. If the same department banned “reply all” and status updates, the gains would dwarf Copilot overnight.

    Sixth, the risk is waved away. Every minute saved now comes with future minutes lost proving that sensitive government data was not inspected, retained, or subpoenaed under the US Cloud Act. That paperwork alone will eat the alleged gains for breakfast.

    Finally, “comfort blanket” is the most honest phrase in the article, and not in the way it intends. Reducing stress by outsourcing thinking to a probabilistic text generator is not efficiency. It is sedation.

    If Department for Work and Pensions wanted real productivity, it would fix workflows, kill pointless meetings, and stop confusing busyness with output. Instead, it bought licences from Microsoft, got a modest placebo effect, and called it transformation.

    1. ParlezVousFranglais Silver badge

      Re: Costly placebo

      I'm sure it also "saved" West Midlands Police time in creating their evidence against Maccabi Tel Aviv ...right up to point it hallucinated about the West Ham match that had never happened.

      Nothing in this "research" actually checked the integrity of any work that was produced, the users were selected based on volunteers, not on a random sample of workers, and the "results" were self-assessed by the users.

      Rubbish in, rubbish out...

      1. elsergiovolador Silver badge

        Re: Costly placebo

        And in functioning country these things would have warranted an investigation, not a slide deck and pats on the back.

    2. Anonymous Coward
      Anonymous Coward

      Re: Costly placebo

      And generally speaking their Copillock output wastes immeasurably more of everyone else's time than 19 minutes.

    3. Gordon 10
      IT Angle

      Re: Costly placebo

      You were doing quite well there until you muddied your points in a sea of ranting and hyperbole.

      Your core point is valid. Copilot is massively overpriced for what it does. Its $30 per head per month BTW - less any bulk discounts.

      There's a few points you miss though. 20-30 mins per day is worth persuing irregardless of whether there are alternatives, particularly because not all of them are tolerable to the CEO/CFO mindset, whilst a tech one might be. You correctly identify a couple - another is a Tea lady for example - studies have shown its one of the best cost and morale bang for bucks you can get.

      Your US cloud act risk assessment is overblown. 99% of major orgs are already operating under that risk because they are already in one of the big 3's clouds - its a neglible addition. We may not like it but it comes fairly low down the Likelyhood and Impact matrix because everyone else is in exactly the same boat, and we know that MS at least are willing to go toe to toe with the DOJ over wedge issues.

      Your sedation comment is laughable. No-one complains about sedation when they use a calculator or a spell checker - its just an aid.

      I've been running the same studies the article mentions - with similar results - plus a few "big bang" use cases where I've seen 2-6x effeciencies.

      One downside you miss is that Copilot is actually getting worse. I've had a license for 18 months and its been on a very distinct slide for responsiveness and relevance.

      TL;DR. Copilot has value - somewhere in the region of one of the "free" O365 apps like Planner or a turbo search product. Maybe Access at a push.

      I'd recommend Orgs adopt it at a $1 a month premium over vanilla office - once the AI bubble collapses.

      Anyhow I know its against the El Reg cynical mindset but IMO its a more balanced perspective than yours - let the downvotes commence!

      1. myhandler

        Re: Costly placebo

        Yay, you got an "irregardless" in there

    4. Helcat Silver badge

      Re: Costly placebo

      "Second, the numbers are already wobbling."

      Not necessarily: One department might do more work where co-pilot can be used successfully than another. However, the department saying there was zero benefit did note that was due to the need to check and correct the slop from co-pilot, which raises the question: Did the others actually check/verify/correct the Co-pilot output?

      One thing that was mentioned was the summarization of documents/meetings: That's one area where LLM's are fairly decent (in my experience). As long as it only references the document inputted and a template to fit it into, the output is decent enough and generally does save a bit of time, even with me having to proof read the output.

      Does this make AI cost effective? Hell no. It just means I get a few more minutes before the next pointless meeting... time for a coffee!

    5. Arthur the cat

      Re: Costly placebo

      First, 19 minutes a day is not productivity. It is noise. You could claw back more than that by [various sane suggestions]

      A friend of mine used to work for DWP when he started in the civil service. The sign on process was so inefficient he used to turn on his computer and sign in the moment he got into the office, then take off his coat, go get a cup of coffee, chat with colleagues, and generally waste time, and finally after 30-35 minutes the sign on would finish and he'd be able to get on with work. Just fixing that would have got a greater improvement.

  2. abend0c4 Silver badge

    Copilot saved civil servants 19 minutes

    A lot of DWP processes, at least in my experience, seems to be about deterrence - making it so laborious to claim for various things that people who might otherwise be entitled give up the fight.

    Finally, a purpose for which AI seems ideally suited.

  3. Aladdin Sane Silver badge

    I get baffled whenever "AI" is touted as being used to draft documents. Surely your staff should just be able to type documents themselves? Or has the general standard of written English degraded that far?

    1. Scotthva5

      Yes. Yes it has.

    2. elsergiovolador Silver badge

      This is exactly what happens in practice.

      You write a few clear paragraphs.

      AI pads them out with fluff to make them look “substantial”.

      The recipient asks AI to strip the fluff back out again.

      Then they reply by repeating the same process in reverse.

      Net result: two humans doing the same thinking as before, plus a machine adding and removing noise in between.

      Nothing is gained except big foreign tax-shy corporation gets paid.

    3. Anonymous Coward
      Anonymous Coward

      Voicing a view from inside the Civil Service, I'm amazed how many emails I get that start off with that horrid, insincere AI giveaway "I hope this email finds you well....". Invariably from people who in person are intelligent, functioning adults well able to make requests or hold conversations without the broken crutch that is AI. These will be some of the people insisting that Copilot saved them n minutes per day.

      Maybe I should conjure up an AI reply with the prompt "reply, explaining that the email didn't find me well, my sensitive intellectual constitution has been mortally wounded by the insincerity of a machine generated salutation that isn't even logically correct, because an email is not capable of finding or doing anything (other than perhaps causing offence)....etc etc."

      1. Anonymous Coward
        Anonymous Coward

        Your email has actually found me while I'm taking a shit, because it wasn't important enough for me to consider opening while I was at my desk and I accidentally tapped on it because my phone is slightly too hard to use single-handedly and it's time to wipe...

        1. David 132 Silver badge
          Happy

          “…and now, seeing your email on my phone, I am genuinely conflicted as to which hand I should use to wipe.”

    4. JLV Silver badge

      Not to mention that there will probably be copilot at the other end…

      - summarizing the email.

      - reusing the email as basis for further emails

      Then government efficiency can be redefined as numbers of documents produced per staff. Presto, hyper efficient government.

    5. Gordon 10

      What do you mean "degraded". it was never that great. Its a worksplace skill just like everything else.

  4. msknight

    How many man hours...

    ...were spent working that out. Or did they employ AI to do it?

    1. PB90210 Silver badge

      Re: How many man hours...

      It's more likely to be that there was budget left over that needed to be used up before the end of the financial year... "we need a success story"

      1. Anonymous Coward
        Anonymous Coward

        Re: How many man hours...

        It's more likely to be that there was budget left over that needed to be used up before the end of the financial year... "we need a success story"

        Civil servant here: Nothing of the sort. We're under a certain amount of gentle yet persistent pressure to cram AI into everything, because Starmer and his cabinet have picked AI as a magic bean to transform the lives of the plebs*. As civil servants, we're under an obligation** not to judge or obstruct government policy, but to enable and administer it, so whilst most colleagues agree AI is a piece of crap that adds nothing, our professional duty is to make full use of it, and then report back to justify what a boon it is.

        * You might need to look up the exact words from his speeches or publications issued in his name; He probably only thought the word "plebs", and more likely said "citizens".

        ** The Civil Service Code. This is part of our contracts of employment.

        1. elsergiovolador Silver badge

          Re: How many man hours...

          because Starmer and his cabinet have picked AI as a magic bean to transform the lives of the plebs

          Starmer has picked or wealth managers have picked for him?

          https://www.cityam.com/reeves-and-starmer-meet-blackrocks-larry-fink-for-growth-talks/

          https://www.thinkadvisor.com/2026/01/22/larry-fink-theres-no-ai-bubble/

          All this AI and Digital ID (nobody voted for) nonsense is designed to line BlackRock pockets.

          Civil servants should consider blowing the whistle otherwise this country will never improve.

    2. Aladdin Sane Silver badge

      Re: How many man hours...

      I believe it took 19min/user/day to calculate.

    3. Dan 55 Silver badge
      FAIL

      Re: How many man hours...

      Not many. The link to the methodology from the article says "These surveys collected data on... self-reported outcomes related to task efficiency, job satisfaction, and perceived work quality".

      In other words it's a vibe survey.

      The only report which actually objectively measured productivity found that developers thought they were 20% faster but they were actually 19% slower.

  5. Anonymous Coward
    Anonymous Coward

    Think chamber

    I've spent more time in the Think Chamber than this saving - and I dare say it was a lot more productive in multiple ways

  6. Pete 2 Silver badge

    More work in less time?

    > Microsoft Copilot saved civil servants 19 minutes daily

    But what is that "saved" time used for?

    Is it needed to navigate / wrestle / wait for Copilot to produce something usable. Time spent checking what spurts forth. Chatting on social media. Attending more useless meetings or (just possibly) producing 19% of extra value - however that intangible metric can be measured?

    1. Roland6 Silver badge

      Re: More work in less time?

      > But what is that "saved" time used for?

      The tea trolley can be reinstated…

      1. Anonymous Coward
        Anonymous Coward

        Re: More work in less time?

        But walking across to the coffee machine is good exercise

  7. Steve Foster
    FAIL

    "Our Survey says «two-tone raspberry of failure»"

    19 minutes saving you say? Across nearly 2000 civil servants? About half a second each then?

    Or is the claim 19 minutes per CS? Which seems likely to be utter BS, given the shambolic state of "AI" currently.

  8. Doctor Syntax Silver badge

    What I see these statements I wonder* why it doesn't say +/- whatever.

    * Not really. I know damn well. Either (a) only real statisticians would do that and none were involved or (b) real statisticians were involved and the error bar came out larger than the estimated time saved so that it wasn't strictly possible to say whether less time had been saved or lost or (c) real statisticians were involved, the estimate was a bit negative but by quoting the top of the error bar or an outlier they could get a positive number.

    Probably (a) but wouldn't exclude (c).

    1. breakfast Silver badge

      I haven't read in detail but the other very important thing is whether they're recording time spent in detail with and without AI or just surveying users on what they reckon. Enough other surveys have shown people believe AI is saving them time, but when you evaluate how their time is spent it turns out to have no benefit or work out slower, that we clearly can't trust vibes-based metrics.

      Unlike everywhere else, of course, where vibes-based metrics are entirely scientific (as long as they get results I agree with.)

  9. IGotOut Silver badge

    I'll beat that

    Cancel at least one meeting a week.

    There you go, same time back, zero cost.

    Heck, you may be able to lose several thousand pointless management types in the process.

  10. DrSunshine0104

    It's... Fine

    We use CoPilot for our government work, and it is good at search all our stuff for particular mentions of a topic or information. And, like the article says it is good for creating a first draft of an email.

    I work in geospatial information systems, and it cannot really help with that, and even at time when I have thrown something at it to see what it says, it gives the veneer of knowing the answer. It will always suggest the lowest hanging fruit of how to solve a problem, and sometime outright incorrect suggestions. It is like interviewing a potential hire who is trying for a position they are a little out of their depth, they say a lot of the words (which LLMs literally do), but a good grasp of the subject is not there.

    Generative AI, LLMs, etc., they all seem fine, but don't feel like a game changer. I want more domain specific ML models, that is where I think the usefulness is.

    1. that one in the corner Silver badge

      Re: It's... Fine

      > I want more domain specific ML models, that is where I think the usefulness is

      Agreed.

      But they are difficult to do, they require expertise in both ML (definitely including the ability to know when they *aren't* doing what you *think* they are doing!) and in the domain. And in the usefulness of that domain application to the goals of the users. And in the cost analysis of creating and then running the models against the bottom line (when this is a business and not research).

      All of which is so much harder to do than flinging (somebody else's) money, energy and compute at an LLM and keeping fingers crossed.

  11. Anonymous Coward
    Anonymous Coward

    I kind of get it searching sharepoint is not always easy.

    I can only imagine what a team of government management types think is a sensible folder structure Im sure a working group could come up with something truly spectacular.

  12. sarusa Silver badge
    Devil

    Wrong Faster!

    Of course what they're getting faster is incorrect, but DWP wouldn't care about that.

  13. Anonymous Coward
    Anonymous Coward

    19 minutes

    19 whole minutes.

    Meh

    Get back to me when it’s 3 or 4 hours a day.

    And only if AI actually does something useful.

  14. Random as if ! Bronze badge

    Facebook ++

    It browses Facebook and finds comments by known associates a lot faster.

  15. Jedit Silver badge
    Flame

    Copilot is a waste of time

    Literally. The other day Copilot hijacked my controls to do a location scan - in fucking Word, the last application that ever needs to know where you are. I didn't notice and kept typing. A small but very important chunk of text got missed out, and I had to redo the document.

    1. elsergiovolador Silver badge

      Re: Copilot is a waste of time

      If I got a penny for each time a relative called me because their important Word document "disappeared"...

    2. David 132 Silver badge

      Re: Copilot is a waste of time

      Ouch, you’ve triggered me there. I was in Word the other day when it suddenly popped up in the top-left corner asking for my location. Why? The document’s author swears she didn’t embed any location tracking. What gives? As you said, why TF should Word need to know my location??

POST COMMENT House rules

Not a member of The Register? Create a new account here.

  • Enter your comment

  • Add an icon

Anonymous cowards cannot choose their icon