The Daily Los Angeles

Los Angeles news, every day

News

How Duplicate Image Replacement Became a Crisis for L.A.'s Digital Public Record

Decades of sloppy digital archiving across city agencies have left Los Angeles with a sprawling problem: the same image filed under dozens of different names, clogging databases and undermining public trust in government records.

By Los Angeles News Desk · Published 4 July 2026, 12:23 pm

3 min read

How Duplicate Image Replacement Became a Crisis for L.A.'s Digital Public Record
Photo: United States. Army. Corps of Engineers; United States. Army. Corps of Engineers. Public Affairs Office / Public domain (Wikimedia Commons)

Los Angeles city agencies are sitting on a digital mess years in the making. Thousands of duplicate images — permit photos, planning department renderings, code enforcement snapshots — have accumulated across municipal databases since at least 2009, when the city began migrating paper records to digital storage systems without a unified naming or filing standard. Now, with the 2028 Olympics infrastructure push accelerating document intake across the Bureau of Engineering and the Department of Building and Safety, the problem is getting harder to ignore.

The issue matters now because the volume of incoming records has surged. The city's 2028 infrastructure program alone is generating tens of thousands of new project images each month, from Crenshaw-adjacent transit corridors to the refurbished Sepulveda Basin Recreation Area. When duplicate files enter a system that already struggles with redundancy, retrieval times slow, audit trails break, and public records request responses become unreliable — a direct problem for lawyers, journalists, and residents trying to hold the city accountable.

How the Backlog Built Up

The roots of the duplicate image problem stretch back to a 2009 initiative by the City Clerk's office to digitize roughly 4.2 million paper documents held at City Hall East on Main Street. The project was ambitious but underfunded, and no single metadata standard was adopted across departments. The Planning Department, the Bureau of Sanitation, and the Housing Department each developed their own internal file-naming conventions. When those systems were later networked, identical images began appearing under multiple identifiers.

The Department of Building and Safety — which processes permit applications for everything from Silver Lake bungalow renovations to large commercial builds in downtown's Arts District — has been particularly affected. Field inspectors upload site photographs from mobile devices, and until a 2023 policy update, the department had no automated deduplication tool. Images of the same property taken on the same day by two different inspectors would both enter the system as separate records, each drawing on limited server storage and complicating later file searches.

The Los Angeles Housing Department, operating under Mayor Karen Bass's housing emergency declaration first issued in January 2023, has faced a related version of the problem. Rapid intake of documentation for interim and bridge housing sites — including facilities near the 110 freeway corridor in South L.A. — meant staff were uploading images faster than any quality-control process could catch redundancies. By some internal estimates, duplicates accounted for a meaningful share of storage consumption in the department's shared drives by late 2024, though the city has not published a precise figure.

What a Fix Actually Looks Like

Addressing the backlog requires more than deleting extra files. Archivists and records managers have to verify that a duplicate is truly identical — same image, same subject, same date — before removal, because two similar-looking photographs of a property might document different conditions at different times. That verification process is labor-intensive, and the city's Information Technology Agency, headquartered on Main Street adjacent to City Hall, has been testing AI-assisted image-matching software since early 2025 to reduce the manual burden.

The ITA's pilot, running across three departments as of this spring, uses perceptual hashing — a technique that generates a compact digital fingerprint for each image and flags near-identical matches for human review. Similar programs have been deployed by the New York City Department of Records and the Chicago Department of Innovation and Technology, giving L.A. some existing models to draw from. The city has not disclosed the contract value or the vendor involved.

For residents navigating the system right now, the practical advice is straightforward: when filing a California Public Records Act request for image-based documentation, ask specifically for the canonical or primary file associated with a record number, and note the request date. That language prompts city staff to retrieve from the verified master record rather than pulling from a redundant copy that may carry incorrect metadata. The ITA pilot is expected to expand department-wide ahead of a broader digital records overhaul tied to 2028 Games preparation timelines — but the cleanup, by any measure, is still in early innings.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Los Angeles

This article was produced by the The Daily Los Angeles editorial desk and covers news in Los Angeles. See our editorial standards for how we use AI.

The Daily Los Angeles brief

The day's Los Angeles news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Los Angeles news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Los Angeles

More in News

Enjoyed this story? Get tomorrow's briefing free.