By the Numbers: How Duplicate Images Are Quietly Draining LA's Digital Infrastructure Budget
A surge in redundant image files across city databases and agency websites is costing Los Angeles millions annually — and the problem is accelerating.
A surge in redundant image files across city databases and agency websites is costing Los Angeles millions annually — and the problem is accelerating.

Los Angeles city departments are sitting on an estimated 47 million duplicate image files spread across municipal servers, costing taxpayers roughly $3.2 million per year in unnecessary storage, bandwidth and maintenance — a figure that city IT auditors flagged in a June 2026 internal review obtained by The Daily Los Angeles.
The problem has sharpened focus now because of timing. With the 2028 Olympics infrastructure build demanding significant digital capacity — the LA28 organizing committee alone projects it will generate more than 800 terabytes of media assets between now and the opening ceremony — city agencies cannot afford to hemorrhage server space on files that already exist three, five, sometimes a dozen times over. Redundant data is not a new problem, but the scale arriving with Olympic-era digital expansion is.
The June audit, produced by the City Administrative Office's Bureau of Technology Services at its offices on Spring Street in downtown Los Angeles, found that the Department of Recreation and Parks carried the heaviest duplicate load among municipal agencies — approximately 9.1 million redundant image files, many of them event photography accumulated over the past eight years. The Los Angeles Housing Department, which has been generating high volumes of documentation images under Mayor Karen Bass's housing emergency declaration, added roughly 2.4 million duplicates in the 18 months between January 2025 and June 2026 alone.
Storage costs in enterprise cloud environments run between $0.023 and $0.08 per gigabyte per month, depending on access tier, according to publicly available pricing from the vendors the city uses under its existing managed services contracts. Even at the low end, 47 million image files — averaging roughly 3.5 megabytes each — represent about 164 terabytes of redundant data. That works out to more than $45,000 per month in avoidable cloud storage fees before factoring in the labor costs of database administrators who must navigate bloated file systems during retrieval and backup cycles.
The duplication problem is not confined to city government. The Los Angeles Unified School District, which manages its own separate digital infrastructure for 1,300 schools and offices stretching from San Pedro to Sylmar, reported in its own May 2026 technology audit that 31 percent of images stored on its content management system were exact or near-exact duplicates. LAUSD's digital storage budget runs approximately $11 million annually. The district is now piloting a deduplication software tool from a vendor based in Culver City, with a contract valued at $480,000 over 18 months.
Perceptual hashing — the technical process that detects near-identical images even when file names differ — is the core technology behind most commercial deduplication tools. The city's Bureau of Technology Services tested three platforms between March and May of this year at a server facility in El Segundo. Results showed deduplication rates ranging from 68 percent to 84 percent depending on the tool and the data set, meaning that in some departments, nearly five out of every six stored images could theoretically be collapsed to a single master file.
The Bureau of Technology Services is expected to bring a recommendation to the City Council's Information Technology and General Services Committee by September 2026. The proposal will likely include a phased rollout beginning with the three highest-volume departments — Recreation and Parks, the Housing Department, and the Office of the City Clerk, which digitizes tens of thousands of permitting and land-use documents each month at its offices on Temple Street.
For residents and businesses dealing with the city's online permitting portals — including the much-criticized development portal that serves contractors in neighborhoods like Boyle Heights and Koreatown — faster database performance is the practical payoff. When servers carry less redundant weight, query and upload times improve. City IT officials say response times on the planning portal could drop by as much as 40 percent if deduplication is implemented system-wide.
The Bureau's timeline sets a full citywide rollout no earlier than the first quarter of 2027 — which would still leave more than a year of runway before Olympic digital demands peak in summer 2028. Getting there on schedule, however, depends on budget approval in a fiscal year when the city faces a projected $335 million general fund shortfall.
How does this story make you feel?
Spread the word
About this article
Published by The Daily Los Angeles
Daily brief
Free, in your inbox before 7am. Weekdays.
More in News


