Los Angeles city departments have been quietly grappling with a sprawling digital housekeeping failure: thousands of duplicate images stored across overlapping municipal systems, driving up cloud storage costs, slowing public-facing portals, and complicating the records requests that residents, journalists, and lawyers file every year. The problem did not arrive overnight.
The roots stretch back to the mid-2000s, when the city began digitising paper archives in earnest. Each department — the Los Angeles Department of Building and Safety, the Bureau of Engineering, the Department of Public Works — ran its own intake process, often scanning the same permit photographs, inspection records, and site images into separate servers with no shared catalogue. By the time the Mayor's Office of Technology and Innovation began auditing interdepartmental storage in 2019, the redundancy rate in some databases had climbed above 30 percent, according to internal assessments cited in city budget discussions that year.
From Paper Files to Pixel Sprawl
The shift to digital was never coordinated. The 2028 Olympic infrastructure push has since added fresh urgency: city planners working on venues from SoFi Stadium in Inglewood to the proposed athletes' village near UCLA need clean, verified image records to satisfy federal and international oversight requirements. Duplicate files create version-control nightmares — two photographs of the same construction site, taken a day apart, can carry conflicting metadata, and without a deduplication protocol, workers pulling records may act on outdated information.
The problem is also visible at the street level. The Los Angeles Bureau of Street Services, which manages roughly 6,500 miles of roadway, uses field photographs to document pothole repairs, graffiti removal, and post-storm damage. Crews uploading images from tablets in the field have historically synced to multiple backup systems simultaneously, a safeguard that made sense when single-point failures were common but now generates duplicate records at scale. The bureau's digital asset library had not undergone a full deduplication audit as recently as fiscal year 2023-24, according to budget documents released by the city controller's office.
The Department of City Planning faces a related challenge. Its public-facing GeoHub portal — accessible at geohub.lacity.org — pulls parcel imagery and zoning maps from at least three underlying data sources. Residents in Echo Park and Boyle Heights who have used the portal to review permit histories on properties affected by Mayor Karen Bass's housing emergency declaration have encountered duplicate document thumbnails that return the same image under different file names. The city's Information Technology Agency acknowledged the issue in a March 2025 service bulletin, noting that a migration to a unified content management platform was scheduled for completion by the end of calendar year 2025 — a deadline that has since slipped.
What a Fix Actually Requires
Deduplication at municipal scale is not simply a matter of running a software pass. City IT staff must reconcile files across systems that were never designed to talk to each other, verify that no unique record is deleted in the process, and maintain chain-of-custody documentation for any file that might surface in litigation. The Los Angeles City Archives, housed at 555 Ramirez Street downtown, holds physical precedent records that complicate the picture further: some digitised images were scanned multiple times by different contractors between 2008 and 2016, each scan treated as a new asset.
The cost of inaction is measurable. Cloud storage pricing for the enterprise tiers the city uses runs roughly $0.02 to $0.05 per gigabyte per month, and estimates from the ITA's 2024-25 budget submission put the city's total unstructured data footprint — the category that includes image files — at several hundred terabytes. Even a 20 percent reduction in duplicate image volume would translate to meaningful annual savings at those rates.
The ITA has said it intends to complete a phased deduplication project across six pilot departments before the 2028 Games begin. For residents trying to pull records through the city's development portal today, the practical advice is straightforward: if a document search returns visually identical thumbnails under different file identifiers, request the underlying file metadata from the relevant department clerk before relying on either copy. The city's 311 service line remains the fastest intake point for those requests.