The Daily Los Angeles

Los Angeles news, every day

News

L.A.'s Digital Archives Are Drowning in Duplicate Images — and the Numbers Tell a Costly Story

From city hall databases to the 2028 Olympics planning files, duplicated visual assets are eating storage budgets and slowing down the agencies that can least afford it.

By Los Angeles News Desk · Published 4 July 2026, 12:28 pm

3 min read

L.A.'s Digital Archives Are Drowning in Duplicate Images — and the Numbers Tell a Costly Story
Photo: Photo by Stephen Leonardi on Pexels

Los Angeles city departments collectively store an estimated tens of millions of digital image files across fragmented systems, and a growing share of that data is redundant — the same photograph, render, or scan saved two, three, or a dozen times under different filenames. The problem has a name in IT circles: duplicate image accumulation. And in a city already stretched thin across housing, wildfire response, and Olympic infrastructure timelines, the waste is measurable in real dollars.

The timing matters. With the 2028 Summer Olympics now less than two years away, LA28 and its constellation of partner city agencies are generating visual documentation at an accelerating rate — site surveys, venue renders, construction progress photos, media assets. Without systematic deduplication protocols, those files compound. Industry benchmarks from enterprise storage analysts suggest that duplicate and redundant data can account for anywhere from 25 to 40 percent of total storage volume in large public-sector environments, though the precise figure for any individual Los Angeles department would require an internal audit to confirm.

Where the Problem Concentrates

Two institutions illustrate the scope particularly well. The Los Angeles Department of City Planning maintains visual archives tied to environmental impact reports, zoning case files, and community plan updates across all 35 community plan areas — from Boyle Heights to the Westside. Each case file can include dozens of site photographs, many of which are uploaded multiple times by different staff or external consultants using different portals. The department's transition toward a unified permitting platform, known as ePlanLA, was designed partly to address this kind of fragmentation, but duplication persists in legacy folders that predate the system's rollout.

The Los Angeles Homeless Services Authority faces a parallel challenge. LAHSA coordinates with more than 100 contracted service providers across the county, and visual documentation — photos of encampment clearances, shelter facility conditions, outreach interactions — flows in from dozens of sources simultaneously. When Bass's housing emergency declaration, which she signed in January 2023, accelerated the pace of shelter deployments and Inside Safe program operations, the volume of field photography spiked. Storage costs follow that volume directly: commercial cloud storage through enterprise contracts typically runs between $0.02 and $0.05 per gigabyte per month, which sounds trivial until you're managing petabyte-scale archives.

Deduplication Tools and What They Actually Cost

The technical fix is not complicated. Perceptual hashing — an algorithmic method that identifies visually identical or near-identical images regardless of filename or minor encoding differences — has been commercially available for years. Platforms including Adobe Experience Manager, used by several large municipal communications offices nationally, and open-source tools like ImageHash can flag duplicates automatically before they are committed to long-term storage. The question is always implementation cost versus projected savings.

For a department like the LA Department of Public Works, which handles photo documentation across infrastructure projects from the Harbor Gateway to the San Fernando Valley, a one-time deduplication audit followed by automated intake rules could realistically reduce active image storage by a fraction that, across a multi-terabyte archive, translates to thousands of dollars annually in avoided cloud costs. Multiply that across two dozen major city departments and the aggregate savings become worth a line item in a budget conversation.

The practical path forward for Los Angeles agencies runs through two steps. First, any department preparing data systems for 2028 Olympic operations — and that includes the Port of Los Angeles, which is already handling increased logistics documentation tied to anticipated cargo disruptions — should commission a storage audit before the end of fiscal year 2027. Second, procurement officers writing new contracts for digital asset management platforms should require deduplication functionality as a baseline specification, not an optional add-on. The city's Information Technology Agency, based on Spring Street in downtown Los Angeles, has the authority to set those procurement standards citywide. The data already exists to make the case. Someone needs to run the numbers.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Los Angeles

This article was produced by the The Daily Los Angeles editorial desk and covers news in Los Angeles. See our editorial standards for how we use AI.

The Daily Los Angeles brief

The day's Los Angeles news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Los Angeles news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Los Angeles

More in News

Enjoyed this story? Get tomorrow's briefing free.