Los Angeles city archivists have a problem hiding in plain sight. Duplicate digital images — the same photograph stored dozens of times across incompatible servers — have quietly consumed hundreds of terabytes of public storage capacity, and now officials and technology specialists are pushing for a coordinated replacement and deduplication program before the city's infrastructure demands spike ahead of the 2028 Summer Olympics.
The issue surfaced publicly during budget hearings at City Hall this spring, when the Department of General Services flagged that municipal digital storage costs had climbed sharply over the past three fiscal years. Multiple city departments, including the Bureau of Engineering and the Los Angeles Fire Department, had independently digitized overlapping records — aerial survey photographs, permitting images, and emergency-response documentation — without any shared protocol to prevent redundant files from accumulating across systems.
Why This Matters Now
The timing is not incidental. The city is in the middle of a capital infrastructure sprint tied to the 2028 Games, and Mayor Karen Bass's ongoing housing emergency declaration has pushed dozens of rapid-permitting workflows online for the first time. Both efforts generate enormous volumes of photographic documentation — site inspections, before-and-after construction records, encampment clearance imagery — that land in city servers without any automated check against what is already stored.
The Los Angeles City Archives, housed near the Piper Technical Center on Ramirez Street in the Arts District, has been flagging the deduplication problem internally for at least two years, according to city budget documents reviewed for this article. The Getty Conservation Institute, based in Brentwood, has consulted with municipal archivists on digital preservation standards, and specialists there have long advocated for hash-based deduplication tools — software that generates a unique fingerprint for each image file and automatically identifies copies — as the minimum baseline for any serious public archive.
Technology vendors have been circling. The city's Information Technology Agency issued a request for proposals in March 2026 for what it described as an enterprise digital asset management platform, with submissions due by May 30. The RFP specifically called out duplicate image replacement as a core functional requirement. No contract award has been publicly announced as of July 4.
What the Experts Are Saying
Digital preservation specialists contacted for this article — none of whom are named because they were not authorized to speak publicly about active city procurement — broadly agree that the problem is not unique to Los Angeles. New York City's Department of Records and Information Services undertook a comparable cleanup beginning in 2022, ultimately consolidating storage across borough offices. Los Angeles, with a municipal footprint that includes 35 city departments and dozens of proprietary database systems, faces a more complex consolidation challenge.
The practical cost argument is straightforward. Enterprise cloud storage runs roughly $20 to $25 per terabyte per month for the tiered archival products most city governments use. If redundant files account for even 15 percent of total municipal storage — a conservative figure by industry benchmarks — the annual waste runs into the low millions of dollars, money that budget analysts say could be redirected toward the homelessness data systems the Bass administration has prioritized under its Inside Safe program.
The Los Angeles County Metropolitan Transportation Authority is separately navigating a related challenge: its capital projects division has digitized tens of thousands of engineering photographs tied to the Purple Line Extension and the new Crenshaw/LAX corridor stations, and duplication rates in large construction documentation projects typically run between 20 and 40 percent by the time a project closes out.
For residents and city employees, the most immediate practical consequence is search degradation — when an archivist or engineer queries a system for a specific site photograph, duplicate files crowd results and slow retrieval. The Information Technology Agency has said it expects to announce a vendor selection for the digital asset management contract before the end of the 2025-2026 fiscal year, which closes September 30. Department heads have been told to audit their own image repositories in the interim and flag the largest duplicate clusters for priority consolidation once a platform is in place.