Los Angeles city agencies are sitting on tens of millions of duplicate digital image files spread across municipal servers, a sprawling redundancy problem that IT auditors have flagged as a drain on infrastructure budgets already stretched thin by wildfire preparedness upgrades and 2028 Olympics technology buildouts. The core issue is simple: the same photograph, scan, or surveillance still gets saved multiple times across multiple departments, ballooning storage costs and degrading the speed of retrieval systems that first responders and social service workers rely on daily.
The timing matters. The city is mid-construction on a unified data platform intended to support Olympic operations, emergency coordination, and the Mayor's housing office — all of which depend on clean, deduplicated image libraries. Every redundant file in the system is a small tax on that infrastructure, and the totals add up fast.
What the Numbers Actually Show
Industry benchmarks from enterprise storage analysts suggest that between 20 and 30 percent of files on large institutional servers are exact or near-exact duplicates. Apply that range to a municipal government the size of Los Angeles — which operates dozens of separate departmental servers across facilities from the civic center complex on Spring Street downtown to the LAPD's Information Technology Group offices in Van Nuys — and the redundancy problem is not trivial. Storage costs for enterprise-grade data centers in the Los Angeles Basin have been running between $3 and $8 per gigabyte per month for high-availability systems, according to regional data center pricing surveys. A department holding several hundred terabytes of image data, even a fraction of which is duplicated, burns through budget at a measurable rate.
The city's Bureau of Sanitation, which photographs hundreds of encampment clearances and infrastructure inspections annually as part of the Bass administration's housing emergency response, has been among the agencies flagged internally for image redundancy. Field crews routinely upload the same jobsite photographs from mobile devices that then sync automatically to cloud backup, a second departmental archive, and sometimes a supervisor's shared drive — creating three or four copies of a single 4-megabyte image file before anyone reviews it. Multiply that workflow across the department's roughly 4,000 employees and the duplication compounds quickly.
The Los Angeles Homeless Services Authority, which coordinates shelter placements and tracks encampment data across all 15 council districts, faces a parallel version of the problem. Case workers document client interactions with photographs that flow into multiple case management platforms, including the county-run Homeless Management Information System. Without automated deduplication running at the point of ingestion, the same image can exist in four separate database tables by the time a supervisor pulls a weekly report.
The Fix — and What It Costs
Deduplication software is not new. Vendors including Veritas, Commvault, and several open-source alternatives have offered hash-based detection tools for over a decade, identifying files with identical checksums and consolidating them to a single stored instance. For a municipal deployment across the scale of LA's infrastructure, licensing and integration costs for an enterprise deduplication rollout typically run between $500,000 and $2 million depending on scope, according to published municipal procurement records from comparable large U.S. cities. The savings case typically pays back within 18 to 24 months purely on reduced storage spend, before accounting for gains in retrieval speed.
The city's Information Technology Agency, headquartered on Main Street downtown, has a digital infrastructure modernization program already underway ahead of the 2028 Games. Deduplication across image libraries is listed as a component in that program's published scope documents, though the timeline for full implementation across all agencies has not been publicly confirmed.
For departments that can't wait for a citywide rollout, the practical path forward is narrower but faster: run a one-time deduplication scan on the highest-volume image repositories, flag duplicates for human review before deletion, and set ingestion rules going forward that block duplicate uploads at the source. The Los Angeles County Department of Public Works began a similar project at its Alhambra headquarters in early 2025, and published internal benchmarks showed a 23 percent reduction in active storage use within the first 90 days. That figure, while specific to one agency's environment, gives LA city departments a reasonable baseline for what targeted action can achieve before a larger unified system comes online.