The Daily Los Angeles

Los Angeles news, every day

News

L.A.'s Digital Archive Problem: The Hidden Scale of Duplicate Images Clogging City Records

From wildfire damage assessments to Olympic infrastructure permits, Los Angeles agencies are sitting on enormous volumes of redundant image files — and the cleanup tab is climbing.

By Los Angeles News Desk · Published 4 July 2026, 11:47 am

3 min read

Los Angeles city departments collectively store an estimated tens of millions of digital image files across servers managed by the Los Angeles Information Technology Agency, and a growing share of that data is redundant — the same photograph saved twice, three times, sometimes dozens of times under different filenames. The duplication problem, long treated as a low-priority IT housekeeping matter, has taken on new urgency as agencies race to digitize records ahead of the 2028 Summer Olympics and Mayor Karen Bass's ongoing housing emergency documentation effort.

The timing matters. The city is in the middle of a major push to digitize building permits, fire inspection records, and shelter intake forms — all programs that generate photographic evidence at high volume. The Los Angeles Department of Building and Safety alone processes thousands of permit applications per month, each of which can carry multiple site photographs. When field inspectors upload images from mobile devices and those same images are later re-uploaded by a supervisor for review, duplicates accumulate silently. Storage costs scale directly with volume, and with commercial cloud storage rates running at roughly $0.02 per gigabyte per month for enterprise tiers, what looks like a minor redundancy problem compounds fast across a city government the size of Los Angeles.

Where the Numbers Get Serious

Duplicate image data is not a uniquely governmental headache, but the Los Angeles context sharpens the stakes. The Mayor's Office of Housing launched its Bridge Home shelter documentation system in 2023, requiring photographic intake records at facilities including the Westside shelter on Centinela Avenue in Mar Vista and the San Pedro Navigation Center near the waterfront. Both facilities log images of bed availability, maintenance issues, and intake conditions. Separate uploads by case managers, facility staff, and city auditors create layered redundancy. Industry benchmarks in enterprise document management suggest duplicate image rates of between 20 percent and 40 percent are common in organizations that lack automated deduplication tools — a range that, applied to the city's housing records alone, would represent substantial wasted capacity.

The Los Angeles Fire Department faces a parallel problem in its post-wildfire damage documentation program. After the January 2025 fires in Pacific Palisades and Altadena — the most destructive in modern city history — field crews photographed destroyed structures for insurance verification, grant applications through FEMA, and internal records. Multiple agencies photographing the same addresses with overlapping jurisdictions meant that a single burned lot on Palisades Drive might exist in four or five separate departmental archives, often as near-identical image files. The city's Information Technology Agency identified deduplication of fire records as a line item in its fiscal year 2026 infrastructure modernization plan, though specific budget figures for that program have not been publicly released.

What Comes Next — and What It Costs

Automated deduplication software uses perceptual hashing — a technique that assigns each image a numeric fingerprint and flags files with matching or near-matching fingerprints — to identify redundant files without requiring manual review. Enterprise platforms licensed for government use typically run between $15,000 and $80,000 annually depending on storage volume, according to publicly available vendor pricing sheets from companies including NetApp and Komprise. For a city department managing petabytes of visual records, that cost can represent significant savings against ongoing storage bills, particularly as the city expands cloud infrastructure through its contract with Microsoft Azure, announced in 2024.

The practical upshot for Angelenos is less abstract than it might seem. Faster permit approvals, more reliable shelter records, and cleaner wildfire recovery documentation all depend on city staff being able to locate the correct, authoritative version of an image quickly. The Los Angeles Department of City Planning has flagged records management as a bottleneck in its Olympic venue permitting timeline, with construction at sites including the Inglewood sports corridor requiring rapid document retrieval. City IT staff have been advised to run deduplication audits quarterly starting in the third quarter of 2026. Whether agency heads prioritize the budget to act on those audits will determine how much of Los Angeles's digital attic gets cleaned out before the world shows up in two years.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Los Angeles

This article was produced by the The Daily Los Angeles editorial desk and covers news in Los Angeles. See our editorial standards for how we use AI.

The Daily Los Angeles brief

The day's Los Angeles news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Los Angeles news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Los Angeles

More in News

Enjoyed this story? Get tomorrow's briefing free.