This is a stunning number from Farhad Manjoo over at the Wall Street Journal. The development cost of the health care exchanges necessary under the ACA (aka Obamacare) is larger than the development cost of the original iPhone at Apple. Indeed, by some estimates it might be four times the cost:
If users found a few bugs in their iPads, she argued, most wouldn’t consider them a complete disaster. Instead, they’d recognize that technology is complicated, that errors are common, and they’d wait for an update. Apple Inc., she added, has “a few more resources” than her department, so “hopefully [citizens will] give us the same slack they give Apple.”
That argument is as clueless as it is misleading. While it’s true that Apple is fantastically wealthy, its product-development costs aren’t necessarily greater than those of the federal government. As Fred Vogelstein reports in his coming book, Apple spent about $150 million developing the iPhone. The health-insurance exchange—which, let’s remember, is merely a website meant to connect citizens to insurance companies, something quite a bit less complex than Apple’s groundbreaking miniature computer—so far has cost at least $360 million, and possibly as much as $600 million.
That’s a pretty bad indictment of the way those health care exchanges have been built: the most egregious problems being with the Federal one that covers the 36 states that did not decide to build their own. After all, the iPhone was not just a new product category, it was also a new operating system and a new paradigm for how to do computing. One would expect this to cost rather more than what is, at root, just a website calling on a few databases. But apparently not so it’s worth trying to work out what went wrong here. Manjoo gives us one reason here:
Today, any company looking to work with the government must navigate an obstacle course of niggling, outdated regulations and arbitrary-seeming requirements. For instance, your technology must be Y2K-compliant just to get in the door. The process locks out all but a tiny handful of full-time contractors—companies who also happen to be big federal lobbyists.
Another one is that no one actually inside government really knows how to buy tech projects, as again Manjoo notes. This shouldn’t be all that surprising really. The rewards in the private sector for being able to manage large technology projects are vast. Even the guy who manages the Candy Crush Saga servers for a couple of years is expecting to be a millionaire several times over by the time he’s finished. So Federal Government just isn’t going to get those rare skills working inside it. And yes, I’m afraid that it is true that you do need those skills when you’re on the buying side. You can indeed subcontract out most of the work but you’ve at least got to be able to define what it is that you want: something that isn’t easy and is a rare skill in and of itself.
Mix that lack of in house skill with the regulations that mean that there’s only a few companies that can navigate through the contracting red tape and you’ve already got a blueprint for disaster. But I’m afraid it goes further than this as well:
These reports are known in industry jargon as 834 transactions and they may represent the most serious ObamaCare breakdown. Insurers and the exchanges are supposed to swap electronic files every 24 hours that track the policies consumers select and their subsidies and check their lists against each other. The problem is that the exchanges rarely generate accurate 834s.
Our sources in the insurance industry explain that the 834s so far are often corrupted or in the wrong syntax, and therefore unusable unless processed by hand. In other cases the exchanges spit out multiple 834s enrolling and unenrolling the same user and don’t come with time stamps that would allow the insurer to identify the most recent version.
The software just isn’t ready for prime time. So, how did this happen? One answer lies here:
How and why the system failed, and how long it will take to fix, remains unclear. But evidence of a last-minute surge in spending suggests the needs of the project were growing well beyond the initial expectations of the contractor and the U.S. Department of Health and Human Services.
“Why this went from a ceiling of $93.7 million to $292 million is hard to fathom,” said Scott Amey, general counsel at the Project on Government Oversight, a Washington, D.C.-based watchdog group that analyzes government contracting.
Adding more money to a software project is the same as adding more people to that project. And that’s something that we know doesn’t work. It’s called Brook’s Law:
adding manpower to a late software project makes it later
It’s not quite but is near infallible that law. And forcing more money, thus more people, into a flailing software project is just once again evidence that those running the procurement process didn’t understand the basics of what they were doing. But that’s not the only problem:
The changes to the Healthcare.gov contract came in response to more detailed requirements about how the site should operate, said a person at CGI familiar with the work.
When CMS awarded CGI Federal the first $55.7 million delivery order in 2011, “most of the regulations and guidance implementing the Affordable Care Act had not yet been finalized,” said the person with knowledge of the award.
The Obama administration was issuing regulations and changing policy regarding how the reform should be implemented late into this summer. Many required significant changes to the IT running Healthcare.gov, which kept contractors scrambling.
This is another violation of the most basic strictures about software development. You’ve got to know, right at the start, exactly what it is that you want to achieve. The reason they didn’t is twofold. The first is that given Federal contracting rules they don’t like to write a tight delivery contract. For this would mean that every time they try to change the specifications it must go out to bid again. So, instead, contracts are written on very wide indeed terms. As Reuters notes, if might be something as wide as “telecoms services”. When an alteration in such a contract is required it can be negotiated with the current supplier: no further bid requests are necessary.
When you’re looking for telecoms services, or perhaps bandwidth into the cloud or something, this might well be fair enough. But as I say it violates the basic principles of managing a software project. Before anyone does anything you’ve got to have both the basic structure and also the final detail of what that system is going to be doing and how it is going to be doing it. And once you’ve got that structure then you have to lock it down. You can have various levels of this lock down but at the very least you must have stable interfaces between the different units of the project. No one ever gets to change an interface for this is where the work being done by one team, one contractor or one company interacts with that being done by another team. And of course means that if you allow interface changes then what everyone else is doing has to be changed as well. You can be less restrictive on changes behind such interfaces, this is true, leave people alone to work out how to interact with their side of one of them, but crossing that boundary should be set in stone right at the beginning.
CGI’s original 2007 contract was of a type called Indefinite Delivery/Indefinite Quantity, federal records show.
ID/IQ contracts allow the government “to write a laundry list of things they can order from the contractor,” said Sarah Gleich, an attorney and government procurement expert at Gibson, Dunn & Crutcher. “They’ll write incredibly broad descriptions of the work, like ‘telecom services,’ so you can’t tell what they’re ordering.”
The advantage of an ID/IQ contract, said experts, is that it can be expanded almost indefinitely, without the government having to solicit new bids for additional work.
“We’d like some computer work done” really isn’t the way to specify a software contract, it really isn’t. And the thing is we already know this from the experience over in the UK:
The cost of a failed IT system for the NHS, which was abandoned two years ago, is still rising and could reach £10bn, according to a report by MPs.
The Public Accounts Committee described the attempt to upgrade NHS computer systems as one of the “worst and most expensive contracting fiascos” in public sector history.
£10 billion is $15 billion. That’s how much the UK Government spent on what should have been a fairly simple software development. They wanted to automate patient records and communication across the National Health Service. This is actually something that the Veteran’s Health Administration in the US managed over a decade ago. And that system works rather well and is credited with vastly improving the efficiency with which that arm of health care works. So how come the VHA could do it and the NHS couldn’t?
The VHA kept a very close eye on interfaces, this is one thing. In fact, as a result of that project there’s a good set of open source routines and structures to make it easier for the next automation and digitisation of patient records. The NHS didn’t use these of course. But vastly more importantly before the VHA went out to tender it had a design that it wanted to implement. The NHS didn’t: it had some vague instructions from politicians that it would be nice to have a health service running on shiny shiny tech. Even then it needn’t have been a disaster if they had then designed a proper plan. But they didn’t: they kept changing the rules, what the system was supposed to do, as people were trying to develop said system.
This is beginning to sound very like the ACA exchanges, isn’t it? The end result in the UK was that we got nothing for our £10 billion. Seriously, quite literally zilch. The entire amount of money was thrown away as a result of extraordinarily bad management of the software development program.
Now I don’t think that the health care exchanges are going to be as bad as all that. But we are very rapidly approaching what is known as the “drop dead date”. Megan McArdle:
But given that they didn’t even announce that they were taking the system down for more fixes this weekend, I’m also guessing that it’s pretty bad. Bad enough that it’s time to start talking about a drop-dead date: At what point do we admit that the system just isn’t working well enough, roll it back and delay the whole thing for a year?
Yes, I know what I’m suggesting is a major, horrible task. And I’m aware that since I opposed the law in the first place, people will take my suggestion with a huge grain of salt. Fair enough, but hear me out.
Yes, yes, we both know, this would be seen as a victory for the Republicans, a defeat for the Democrats and President Obama. But as McArdle goes on to point out if these systems don’t get sorted out in the next couple of weeks then the economic basis of Obamacare is going to fail. The young won’t enroll and it is necessary for the healthy young to enroll in order to subsidise the old and ill. There is some point at which the political embarrassment of delay is better than the failure of the basic underlying idea (well, OK, if you lean more Republican then maybe you’ll be delighted with entire failure but that’s another matter). As a very experienced (anonymous but of my acquaintance) software engineer puts it:
Once the system was pulled offline, the developers made reasonably rapid progress. They’d accumulated a host of bug reports, both in functionality and performance, and (more importantly) had identified crucial gaps in testing and monitoring. For each functional and performance bug, they first verified that they could reproduce it in their testing system – which was where they spent the bulk of their development time for several weeks after turndown – and that the monitoring would detect the condition and alert them appropriately. They triaged the bug reports, worked their way through them in priority order, built load tests that replicated the system load from normal operation and added metrics and monitoring on system latency. The time spent running in production had provided a wealth of logs and load information which gave them a yardstick against which they could measure performance.
After a few months they felt ready to try again, so they spun up the fixed system and loaded in the current data. This went much more smoothly.
And as he concludes:
If the system is this screwed up at this point, with an unmoveable deadline of January 1st to enroll a large number of people, any sane project manager would move heaven and earth to defer the rollout. In the next 6-9 months they could address all the problems that the first roll-out has revealed, taking the time to test both functionality and performance against the traffic levels that they now know. There’s no practical compulsion to run the exchanges now – the American healthcare system has been screwed up for several decades, the population is used to it, waiting another year won’t make a great difference to most voters.
If they can deal with all the problems in the next couple of weeks then this doesn’t have to happen. But that drop dead date is rapidly approaching. There really will be a time, sometime in the next few weeks, when if those exchanges aren’t working smoothly it will be better to drop the whole idea for 6 or 9 months to give the system a thorough rewrite and get them right second time around.
The British experience is that government IT contracts are always, but always, complete disasters. That NHS contract that blew through $15 billion is only the worst example: passport systems, ID card systems, fire and ambulance control systems, they’ve all been unmitigated disasters. We’re entirely expecting the new Universal Benefit system to be one too. The autopsy reports always come up with the same darn problem too. No one at all insisted upon a tight specification of the project before people started working on it. Then when it fell behind people, in violation of Brook’s Law, just firehosed more resources at them and the project. This just isn’t how you run a successful software development project.
For minarchists, small government liberals, like myself there is a silver lining to this cloud. It is once again proof that government just isn’t the way to do certain things. We agree that there are things that must be done, we even agree that government is the only actor capable of doing some of those things that must be done. But we reserve the right to point out that there are some things that government is extremely bad at. And the development of large scale software systems seems to be one of them.
As Manjoo says right at the top there. Apple designed revolutionary computer hardware including a new operating system for $150 million. The Federal Government has spent perhaps four times that amount, $600 million, on a website that doesn’t work as yet. There would be no Facebook, no Twitter or Google if the private sector’s software development costs worked that way, no e-Bay. Good grief, even most of the money that Pets.com and WebVan managed to lose was on meatspace things like warehouses, not just writing code.
To end with a small question for you all. What’s the date you think is that drop dead date? When do they have to say this isn’t working and take it all down for a rewrite rather than attempt to continue fixing things on the fly?
0 comments:
Post a Comment