Los Angeles city departments collectively manage an estimated tens of millions of digital image files — and a growing share of that archive is redundant. Duplicate image replacement, the systematic process of identifying and consolidating identical or near-identical files across government and institutional databases, has quietly become one of the more expensive administrative headaches facing municipal IT divisions ahead of the 2028 Olympics infrastructure push.
The problem is not abstract. Every duplicated aerial survey photograph stored twice across the Los Angeles Department of Building and Safety and the Bureau of Engineering consumes real server space, real energy, and real budget. When those files number in the millions, the waste compounds fast.
The Scale of the Problem in Raw Numbers
Industry-standard audits of large municipal governments typically find duplication rates between 20 and 40 percent in unmanaged image repositories, according to data management research published by organizations including the Storage Networking Industry Association. For a city the size of Los Angeles, which operates dozens of discrete departmental databases from the Harbor Department at the Port of Los Angeles in San Pedro to planning offices in Van Nuys, that range translates into potentially hundreds of terabytes of redundant visual data.
Cloud storage at enterprise rates generally runs between $0.02 and $0.023 per gigabyte per month on major platforms as of mid-2026. A single department holding 50 terabytes of images, with 30 percent duplication, carries roughly 15 terabytes of files that exist somewhere else in the same system. At those rates, that duplication costs upward of $3,600 per year in storage alone — before factoring in bandwidth, retrieval latency, and staff time spent managing redundant records.
The Los Angeles County Department of Arts and Culture, which digitized thousands of images as part of its public collection programs based out of offices near Grand Park in downtown Los Angeles, faced precisely this kind of audit challenge when expanding its online public portal in 2024 and 2025. The City Archives, housed at the Los Angeles City Clerk's office on North Main Street, has similarly undertaken periodic deduplication reviews tied to its broader records modernization program.
Why 2026 Is a Pressure Point
The urgency is sharpening this year for two reasons. First, the 2028 Olympic and Paralympic Games require Los Angeles city agencies to integrate infrastructure data — construction progress photos, site surveys, permitting images — across multiple departments simultaneously. The LA28 organizing committee's coordination with city agencies means shared visual data pipelines that multiply duplication risk if not managed proactively now.
Second, the Los Angeles Fire Department's expanded wildfire preparedness effort, accelerated after the January 2025 fires that devastated areas including Altadena and Pacific Palisades, has generated a significant volume of aerial and drone imagery for risk mapping. LAFD's Real-Time Watch Commander system and related preparedness tools depend on clean, non-redundant image data for accurate situational awareness. Duplicate frames in drone survey archives can slow automated analysis tools and inflate processing times during fast-moving emergencies.
The Los Angeles Information Technology Agency, which oversees citywide digital infrastructure, has been rolling out a unified data governance framework that addresses deduplication as one component of broader storage rationalization. The agency has not published specific savings figures publicly as of this writing.
For departments looking to act before the Olympic infrastructure crunch peaks, IT procurement specialists recommend starting with a file hash audit — software that generates a unique fingerprint for each image and flags matches — before committing to a full replacement protocol. Open-source tools can handle collections below roughly five million files; above that threshold, enterprise platforms from vendors like Veritas or Commvault typically handle the load more reliably. The cost of running an initial audit on a mid-sized departmental archive generally runs between $15,000 and $40,000 depending on existing infrastructure. Against years of redundant storage spending, most agencies break even within the first 18 months.
City departments that want to get ahead of the problem before Olympic coordination demands peak in late 2027 have, realistically, a narrow window to clean house now.