Los Angeles city departments are sitting on tens of thousands of duplicate image files spread across municipal servers, a data management problem that is costing the city in real dollars and, in at least one case, slowing access to records tied to the Bass administration's ongoing housing emergency response. That finding comes from a review of city IT procurement records and storage contract documents filed with the Office of the City Administrative Officer.
The issue is not abstract. With the 2028 Olympics infrastructure buildout accelerating across venues from SoFi Stadium in Inglewood to the Los Angeles Memorial Coliseum in Exposition Park, city agencies are under pressure to modernize their digital systems before a global audience arrives. Duplicate image data — redundant photo files created when records are uploaded multiple times, migrated between platforms, or scanned without deduplication protocols — clogs those systems and inflates storage costs at every level of municipal government.
What the Numbers Actually Show
Storage analysts who work with government clients estimate that unmanaged municipal databases routinely carry duplicate rates between 20 and 40 percent of total stored data. For Los Angeles, which the city's Information Technology Agency reported held more than 15 petabytes of active data across departments as of its most recent annual report, even a conservative 25 percent duplication rate would represent nearly 4 petabytes of redundant files — the equivalent of roughly 80 million high-resolution images stored twice or more.
The Los Angeles Housing Department, which is processing thousands of property inspection photos, site documentation images, and permit records tied to the Mayor's housing emergency declaration — first issued in January 2023 — has been expanding its digital intake faster than its deduplication tools can process. The department's Systematic Code Enforcement Program, known as SCEP, photographs multi-unit buildings across neighborhoods from Boyle Heights to Canoga Park. Multiple uploads from field inspectors using different devices routinely create duplicate records in the same case file, according to the structure of procurement contracts reviewed for this report.
City cloud storage contracts, which the ITA has increasingly routed through vendors operating data centers in the San Fernando Valley corridor near Chatsworth, run at rates that make duplication financially significant. Enterprise-grade municipal cloud storage in the Los Angeles market currently costs between $0.02 and $0.05 per gigabyte per month depending on tier and redundancy requirements. At those rates, even a single petabyte of genuinely unnecessary duplicate data runs the city between $20,000 and $50,000 every month — roughly $240,000 to $600,000 per year — before factoring in bandwidth and retrieval costs.
Fixing It Before the Clock Runs Out
The ITA has a deduplication initiative on paper. The Digital Infrastructure Modernization Program, referenced in the city's fiscal year 2025-26 budget materials, allocates funding toward storage optimization across at least seven major departments. But IT contractors familiar with similar projects in large American cities say implementation typically lags planning by 18 to 24 months when deduplication work competes with higher-profile system upgrades.
For agencies like the LAPD, which maintains evidence photograph archives dating to digitized records from the 1990s, the duplicate problem compounds over decades. The department's Evidence and Property Division, headquartered on Ramirez Street downtown, manages image files across criminal cases where chain-of-custody integrity makes aggressive automated deletion politically and legally complicated. Deduplication in that environment requires hash-matching technology that confirms two files are byte-for-byte identical before any consolidation — a slower, more expensive process than bulk cleanup.
The practical advice from data governance specialists is consistent: cities that have successfully reduced duplicate image loads — Chicago and Denver are frequently cited in federal digital infrastructure reports — did so by mandating single-point upload systems at the field level, not by running cleanup operations on existing archives. That means new mobile tools for inspectors, updated intake protocols for permit offices, and revised upload standards for any department feeding the city's central document management system.
With the Olympics credential and accreditation systems, venue security photo databases, and transportation monitoring networks all set to integrate with Los Angeles city infrastructure by late 2027, the window for cleaning up the existing mess is measured in months, not years.