The Daily Los Angeles

Los Angeles news, every day

News

LA's Digital Archives Are Riddled With Duplicate Images — and the Numbers Are Staggering

A quiet data crisis is costing Los Angeles agencies millions in storage and staff hours as redundant image files pile up across city systems.

By Los Angeles News Desk · Published 4 July 2026, 11:40 am

3 min read

Los Angeles city agencies collectively hold an estimated tens of millions of digital image files across departmental servers, and a significant share of them are exact or near-exact duplicates — a storage problem that has quietly inflated IT budgets and slowed public records responses for years. The issue spans departments from the Los Angeles Department of Building and Safety, which processes permit documentation for tens of thousands of properties annually, to the Los Angeles Police Department's evidence management systems along the 110 Freeway corridor facilities in South LA.

The timing matters. With the 2028 Olympics now less than two years out, city technology offices are under pressure to consolidate and modernize infrastructure. Mayor Karen Bass's housing emergency declaration has simultaneously pushed a surge of permit applications and inspection photo uploads through the Department of Building and Safety's digital pipeline. Every redundant file compounds the problem.

What the Numbers Actually Show

Digital asset management specialists who work with municipal governments estimate that duplicate image rates in unmanaged enterprise archives typically run between 20 and 40 percent of total stored files. Applied to a department like the Los Angeles Bureau of Engineering — which maintains visual documentation for thousands of active infrastructure projects, from the Sixth Street Viaduct reconstruction to ongoing Metro line extensions — even a conservative 25 percent duplication rate across a multi-terabyte archive translates to substantial unnecessary expenditure. Cloud and on-premise storage costs for government-grade systems in California have hovered around $0.023 per gigabyte per month for standard tiers, meaning a 50-terabyte archive carrying 25 percent dead weight runs up roughly $3,500 in avoidable annual storage costs per department — multiplied across dozens of city agencies, the figure climbs into the hundreds of thousands of dollars citywide.

The Los Angeles County Office of the Assessor, which maintains parcel photography records for more than 2.5 million properties across the county, publicly updated its digital records management policy in 2023 to begin phasing in deduplication protocols. The City of Los Angeles Information Technology Agency, headquartered near City Hall East on Main Street in Downtown, has piloted image-hash-based deduplication tools on a limited basis, but a full rollout across all city departments has not been publicly scheduled as of this writing.

The LAPD's body-worn camera program alone generates enormous quantities of image and video data daily. The department expanded its camera deployment program significantly after 2020, and while video files differ technically from static images, the same underlying duplication problem — multiple officers photographing the same scene, multiple uploads of the same file — applies to evidence photograph archives managed out of facilities including the Southeast Division station on 108th Street in Watts.

What Comes Next for City Systems

The pressure to act is building from multiple directions. The California Department of Finance has signaled continued scrutiny of municipal IT spending as part of its oversight of local government efficiency. Separately, the LA Controller's office has in recent years pushed for greater transparency in how city agencies account for technology expenditures, including storage contracts.

For departments managing Olympic-related documentation — the LA28 organizing committee, for example, is coordinating with city agencies on venue permitting and security planning at sites including SoFi Stadium in Inglewood and the Intuit Dome in Inglewood — the stakes of a cluttered, inefficient image archive are practical, not just financial. Slow retrieval times and misidentified files in permit and inspection records can delay construction timelines that have no slack built in given the July 2028 deadline.

Digital records managers recommend that city agencies implement automated perceptual hashing — a technique that identifies visually similar images even if file names differ — as a first pass before any major archive migration. The process typically reduces storage loads by 15 to 30 percent in real-world government deployments without deleting any file that hasn't been verified as a true duplicate. For Los Angeles, that kind of housekeeping before the Olympic infrastructure sprint gets underway would be the smart move. The data says it's already overdue.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Los Angeles

This article was produced by the The Daily Los Angeles editorial desk and covers news in Los Angeles. See our editorial standards for how we use AI.

The Daily Los Angeles brief

The day's Los Angeles news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Los Angeles news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Los Angeles

More in News

Enjoyed this story? Get tomorrow's briefing free.