The Daily Los Angeles

Los Angeles news, every day

News

LA City Archives Are Swimming in Duplicate Images — and the Numbers Show How Bad It Got

A growing backlog of redundant digital files is costing Los Angeles municipal agencies thousands of staff hours and real storage dollars, and the reckoning is finally arriving.

By Los Angeles News Desk · Published 4 July 2026, 11:48 am

3 min read

LA City Archives Are Swimming in Duplicate Images — and the Numbers Show How Bad It Got
Photo: Photo by RDNE Stock project on Pexels

Los Angeles city departments collectively hold an estimated tens of millions of digital image files across their servers — and a significant share of those files are duplicates. The Bureau of Engineering, the Department of City Planning, and the Los Angeles Housing Department have each flagged the problem internally over the past 18 months as they've migrated legacy records into cloud-based systems ahead of 2028 Olympics infrastructure deadlines.

The issue matters now because that migration is accelerating. Mayor Karen Bass's housing emergency declaration, signed in January 2023, triggered an avalanche of permit applications, inspection photographs, and environmental impact images flowing into city systems. Staff at the Department of City Planning's Figueroa Street offices have described document management as a persistent operational bottleneck, though the department has not released a formal audit quantifying the redundancy rate publicly.

What the Data Tells Us

Industry benchmarks offer a rough frame. A 2024 study by the Storage Networking Industry Association found that in large municipal and enterprise environments, duplicate and near-duplicate files typically account for between 20 and 40 percent of total stored image data. Applied conservatively to Los Angeles, even a 20 percent redundancy rate across a system storing 50 terabytes of photographic and scanned records would represent roughly 10 terabytes of redundant data — storage that, on commercial cloud platforms priced around $23 per terabyte per month, adds up to more than $2,700 annually in pure storage cost before accounting for bandwidth, backup cycles, or staff retrieval time.

The Los Angeles Housing Department, which administers the Systematic Code Enforcement Program covering roughly 750,000 rental units citywide, photographs properties during inspections. Each inspection can generate a dozen or more images. When inspectors upload from the field on Figueroa and later from office terminals, identical files frequently enter the system twice. Multiply that across thousands of annual inspections and the duplication problem compounds fast.

At the city's Geographic Information Systems division, which supports planning decisions for projects stretching from the Sepulveda Basin Recreation Area to the Los Angeles River revitalization corridor, aerial and satellite imagery layers are updated seasonally. Older image tiles are not always purged, leaving parallel datasets that slow query times and complicate version control for planners working on Olympic venue infrastructure near Exposition Park and SoFi Stadium in Inglewood.

The Cost in Time, Not Just Storage

Storage fees are the visible line item. Staff time is the invisible one. A 2023 report by McKinsey Global Institute estimated that knowledge workers spend roughly 20 percent of their working week searching for information or duplicating work already done. In a city department where an analyst earns a median salary around $78,000 per year — consistent with LA City salary schedules published through the Personnel Department — that translates to roughly $15,600 per employee per year in productivity lost to information management friction. Scale that across even a modest team of 50 analysts and the figure exceeds $780,000 annually.

Several Los Angeles agencies have begun piloting deduplication software. The Department of Public Works began a limited test of automated image-matching tools in early 2026 as part of a broader IT modernization contract. The Los Angeles County Metropolitan Transportation Authority, which maintains separate but overlapping records for projects intersecting city infrastructure along the Crenshaw/LAX Line corridor, is further along — its digital asset management overhaul began in fiscal year 2024-25.

For departments still in earlier stages, the practical path forward involves three steps: auditing current storage repositories to establish a baseline redundancy percentage, deploying perceptual-hash deduplication tools that catch near-identical images even when file names differ, and setting upload protocols that check for existing matches before writing new files to the server. The Bureau of Engineering, which oversees hundreds of active construction projects ahead of the 2028 Games, has the most urgent deadline. Its project photo archives need to be clean, searchable, and version-controlled before contractor handoff reviews begin in earnest — a timeline that, by most project schedules, starts in late 2026.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Los Angeles

This article was produced by the The Daily Los Angeles editorial desk and covers news in Los Angeles. See our editorial standards for how we use AI.

The Daily Los Angeles brief

The day's Los Angeles news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Los Angeles news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Los Angeles

More in News

Enjoyed this story? Get tomorrow's briefing free.