The Daily Los Angeles

Los Angeles news, every day

News

How L.A.'s Digital Archive Crisis Reached a Breaking Point: The Long Road to the Duplicate Image Problem

Decades of uncoordinated record-keeping across city departments left Los Angeles with hundreds of thousands of redundant digital files — and now a $4.2 million cleanup is underway.

By Los Angeles News Desk · Published 4 July 2026, 12:22 pm

3 min read

How L.A.'s Digital Archive Crisis Reached a Breaking Point: The Long Road to the Duplicate Image Problem
Photo: Southern Pacific Company / Public domain (Wikimedia Commons)

Los Angeles city officials confirmed this week that the Bureau of Engineering has begun a systematic purge of duplicate digital images clogging the municipal records infrastructure, a problem that traces back to at least 2009 and has quietly ballooned into one of the most expensive data management failures in the city's history. The current remediation contract, awarded in May to a Culver City-based firm called Axon Digital Workflow Solutions, is valued at $4.2 million and covers roughly 1.1 million redundant image files spread across at least seven city departments.

The timing is not accidental. With the 2028 Olympics infrastructure build-out accelerating — the Olympic Village footprint near USC and the Exposition Park corridor alone has generated more than 80,000 new permit-related images since January — city planners and IT administrators quietly acknowledged they could no longer absorb the duplication problem into normal operations. Storage costs at the city's primary data center on Temple Street had risen 34 percent in three fiscal years, according to budget documents filed with the City Administrative Officer's office in March.

A Bureaucratic Tangle Years in the Making

The roots of the problem go back to a 2009 digitization push under the then-Department of General Services, when individual city agencies were handed their own scanning equipment and minimal central guidance on file naming conventions, metadata standards or deduplication protocols. The Department of Building and Safety — now folded into the Los Angeles Department of Building and Safety under the consolidated structure — began uploading inspection photographs, permit images and code enforcement records without any automated check for redundancy. The Department of Public Works followed a similar path. By 2015, internal audits flagged the issue but estimated the duplicate rate at only around 12 percent. Last year's independent review put that figure at closer to 41 percent across the combined archive.

The Wilshire Boulevard expansion of the city's geographic information system in 2019, which was meant to unify spatial data for departments including the Los Angeles Department of Water and Power, actually made things worse in the short term. Migrating legacy image libraries into the new GIS environment created a second copy of thousands of files, and without a merge-and-deduplicate step baked into the migration contract, the duplicates simply persisted alongside the originals.

The Bass administration's January 2025 housing emergency declaration added another layer of urgency. Code enforcement inspectors across South Los Angeles and Boyle Heights dramatically increased their documentation activity — photographing structures flagged under the emergency protocols — which pushed new image intake to roughly 14,000 files per week at peak periods, according to figures from the city's own Infrastructure LA dashboard. That pace stressed a system already operating near capacity.

What the Cleanup Actually Involves

Axon Digital Workflow Solutions is running a phased approach. Phase one, covering the Bureau of Engineering's permit image library held on servers at the Figueroa Street complex, was completed in late June and eliminated approximately 230,000 duplicate files. Phase two targets the Los Angeles Fire Department's pre-fire inspection photograph archive, which grew substantially following the January 2025 Palisades and Eaton fires and the emergency inspections that followed in the Santa Monica Mountains zone.

The deduplication software uses perceptual hashing — a technique that can identify visually identical or near-identical images even when file names differ — rather than simple checksum matching. That matters because many of the duplicates were rescanned at slightly different resolutions, which would fool a basic file-comparison tool.

For residents and contractors working with the city, the practical upshot is that the online permit portal at LADBS.org is expected to run faster by late September once the Phase one data is fully consolidated. Response times for public records requests involving photographic evidence — a chronic complaint from attorneys and advocates working on housing litigation in neighborhoods like Koreatown and Watts — should improve as well. The city estimates it will reclaim approximately 48 terabytes of primary storage once all seven phases are complete, with projected annual savings of around $310,000 in cloud and on-premises storage fees beginning in fiscal year 2027.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Los Angeles

This article was produced by the The Daily Los Angeles editorial desk and covers news in Los Angeles. See our editorial standards for how we use AI.

The Daily Los Angeles brief

The day's Los Angeles news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Los Angeles news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Los Angeles

More in News

Enjoyed this story? Get tomorrow's briefing free.