The Daily Los Angeles

Los Angeles news, every day

News

How L.A.'s Digital Archives Ended Up Flooded With Duplicate Images — and Who's Now Trying to Fix It

Decades of mismatched file systems, agency mergers, and emergency scanning drives left the city's public records infrastructure buried under millions of redundant photographs and documents.

By Los Angeles News Desk · Published 4 July 2026, 11:40 am

3 min read

How L.A.'s Digital Archives Ended Up Flooded With Duplicate Images — and Who's Now Trying to Fix It
Photo: Photo by RDNE Stock project on Pexels

Los Angeles city departments are sitting on an estimated tens of millions of redundant digital image files — the accumulated mess of three decades of uncoordinated scanning campaigns, multiple incompatible records systems, and disaster-response digitization pushes that treated speed as the only priority. The problem, long acknowledged internally but rarely addressed with any urgency, is now forcing a reckoning as the city accelerates infrastructure work tied to the 2028 Olympics and as Mayor Karen Bass's housing emergency declaration requires faster, cleaner access to permitting and property records.

The core issue is straightforward: when agencies digitized paper records, they rarely checked whether a version of the file already existed. A building permit for a Boyle Heights property might appear four or five times across the city's document management systems, each scan assigned a different file name, a different resolution, and sometimes a different department tag. The Los Angeles Department of Building and Safety, which handles hundreds of thousands of permit records, merged several legacy databases after the 2017 consolidation of its inspection software platforms. That process left large volumes of overlapping image files that have never been fully reconciled.

A Problem That Predates the Digital Era

The roots of the duplication problem stretch back to the mid-1990s, when city departments first began moving paper archives to digital formats under a series of unconnected initiatives. Each department contracted separately. The Los Angeles Public Library's photo archive, which includes the Los Angeles Herald Examiner Collection housed at the Central Library on West Fifth Street downtown, went through at least two separate digitization cycles — once in the early 2000s and again in 2014 under a California State Library grant. Library staff have acknowledged publicly in past budget presentations that portions of that archive contain duplicate entries, though a precise count has not been published.

The Los Angeles City Clerk's office, which maintains the official repository for council records and public documents, has been working since 2022 to implement a deduplication protocol under its Digital Records Management Initiative. That program, funded at roughly $3.4 million in the fiscal year 2023-24 budget cycle according to city budget documents, was intended to address exactly this category of redundancy. Progress has been incremental. Staff turnover and the competing demands of digitizing active emergency records — particularly documents related to the January 2025 Palisades and Eaton fires — pulled resources away from the cleanup effort.

The Palisades fire response is itself a case study in how duplication compounds under pressure. Within weeks of that disaster, multiple city and county agencies — the Los Angeles County Assessor, the Department of Building and Safety, and the Los Angeles Fire Department — were simultaneously scanning the same damaged property records from Altadena to Pacific Palisades. The urgency was legitimate. Homeowners needed documentation to file insurance claims and apply for federal relief. But the parallel scanning created fresh waves of duplicate images entering systems that were already overloaded.

What Comes Next for City Records

The timeline is pressing. Olympic venue construction and the infrastructure upgrades supporting events from SoFi Stadium in Inglewood to venues near the Los Angeles Memorial Coliseum in Exposition Park require rapid permitting turnaround. Duplicate image files slow automated permit-processing pipelines, force manual review, and create legal uncertainty when two versions of the same document carry different metadata. City officials have pointed to the Digital Services initiative under the Information Technology Agency as the lead program for resolving these conflicts, though a formal deduplication completion date has not been announced publicly.

For residents, the practical advice is narrow but real: if you are filing a permit application, a public records request, or an insurance-related document pull for properties in fire-affected areas like Altadena or Pacific Palisades, ask the receiving agency to confirm which database version of any historical record is being treated as authoritative. The city's 311 service can route document-specific questions to the relevant department. The problem did not arrive overnight, and it will not be cleaned up on a single deadline — but knowing it exists is the first step toward working around it.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Los Angeles

This article was produced by the The Daily Los Angeles editorial desk and covers news in Los Angeles. See our editorial standards for how we use AI.

The Daily Los Angeles brief

The day's Los Angeles news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Los Angeles news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Los Angeles

More in News

Enjoyed this story? Get tomorrow's briefing free.