The Daily Los Angeles

Los Angeles news, every day

News

LA's Digital Archive Crisis: The Hidden Cost of Duplicate Images Clogging City Databases

As Los Angeles races to digitize records ahead of the 2028 Olympics, redundant image files are inflating storage costs and slowing down the systems that run the city.

By Los Angeles News Desk · Published 4 July 2026, 11:45 am

3 min read

Los Angeles city agencies are sitting on millions of duplicate image files — identical or near-identical photographs stored multiple times across fragmented servers — and the redundancy is costing taxpayers real money while degrading the performance of systems that range from building permit portals to emergency response databases.

The problem has sharpened in 2026 because the city is under pressure to consolidate and modernize its digital infrastructure before the 2028 Summer Olympics arrive. Infrastructure contracts tied to the LA28 organizing committee require standardized, interoperable data systems. Duplicate image bloat is one of the least glamorous obstacles, but procurement officials and IT managers across city departments have flagged it repeatedly in budget hearings this year.

The Numbers Are Stark

Industry benchmarks from enterprise data management research suggest that large municipal governments typically find between 20 and 40 percent of their stored image assets are exact or near-exact duplicates. Applied to a city the size of Los Angeles — which operates data centers including the Information Technology Agency's primary facility on West Temple Street in Downtown LA — that range translates to enormous wasted capacity. Storage costs for enterprise-grade infrastructure run roughly $3 to $8 per gigabyte per month depending on tier and redundancy level, meaning even modest duplicate reduction campaigns can yield six-figure annual savings.

The Los Angeles Department of Building and Safety, which maintains a public-facing permit portal at its Figueroa Street headquarters, processes tens of thousands of image uploads annually — site photographs, inspection records, scanned permit documents. When contractors or inspectors upload the same image through multiple submission pathways, the system logs each instance separately. The department's portal has logged submission backlogs during peak construction periods, a problem that IT auditors have linked in part to database query times slowed by redundant file indexing.

The Los Angeles Housing Department, ground zero for Mayor Karen Bass's ongoing housing emergency declaration, faces a parallel version of the same issue. Case files for interim housing placements, including those tied to the Inside Safe program operating across sites from Hollywood to South LA, incorporate photographs as documentation. Field workers uploading images via mobile devices often submit duplicates inadvertently when connectivity is unstable and the app retries a failed upload. Each duplicate adds a row to an index table, compounding query latency over time.

What Deduplication Actually Involves

Fixing the problem is not simply a matter of deleting files. Automated deduplication tools use hashing algorithms — typically MD5 or SHA-256 — to generate a unique fingerprint for each image. Files with identical hashes are confirmed duplicates and can be collapsed to a single stored instance with pointers replacing the redundant copies. Near-duplicate detection, which catches the same photograph uploaded at slightly different resolutions or compression levels, requires perceptual hashing tools, a more computationally intensive process.

The City of Los Angeles's ITA issued a technology modernization roadmap in fiscal year 2025-26 that referenced storage optimization as a priority, though specific deduplication line items have not been publicly broken out from broader infrastructure contracts. Several city departments have independently explored tools from vendors including Veritas and Commvault, both of which offer deduplication modules integrated with enterprise backup systems.

For residents and businesses dealing with the city's digital services, the practical effect of unresolved duplicate image bloat shows up as slow portal load times, failed document uploads, and delayed permit approvals. The Department of City Planning's online application system, used by developers filing projects from Boyle Heights to the Westside, has drawn complaints about upload failures during high-traffic periods.

The path forward involves three distinct steps: a full audit of existing image repositories to establish baseline duplicate rates, procurement of automated deduplication tooling before the 2027 fiscal year begins, and a standardized upload protocol across departments to prevent the problem from rebuilding. City Council's Budget and Finance Committee is expected to review consolidated IT spending priorities in September 2026, giving advocates for the cleanup a clear window to push the issue onto the formal agenda.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Los Angeles

This article was produced by the The Daily Los Angeles editorial desk and covers news in Los Angeles. See our editorial standards for how we use AI.

The Daily Los Angeles brief

The day's Los Angeles news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Los Angeles news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Los Angeles

More in News

Enjoyed this story? Get tomorrow's briefing free.