Los Angeles city agencies are sitting on a digital mess years in the making. Thousands of duplicate images — permit photos, planning department renderings, code enforcement snapshots — have accumulated across municipal databases since at least 2009, when the city began migrating paper records to digital storage systems without a unified naming or filing standard. Now, with the 2028 Olympics infrastructure push accelerating document intake across the Bureau of Engineering and the Department of Building and Safety, the problem is getting harder to ignore.
The issue matters now because the volume of incoming records has surged. The city's 2028 infrastructure program alone is generating tens of thousands of new project images each month, from Crenshaw-adjacent transit corridors to the refurbished Sepulveda Basin Recreation Area. When duplicate files enter a system that already struggles with redundancy, retrieval times slow, audit trails break, and public records request responses become unreliable — a direct problem for lawyers, journalists, and residents trying to hold the city accountable.
How the Backlog Built Up
The roots of the duplicate image problem stretch back to a 2009 initiative by the City Clerk's office to digitize roughly 4.2 million paper documents held at City Hall East on Main Street. The project was ambitious but underfunded, and no single metadata standard was adopted across departments. The Planning Department, the Bureau of Sanitation, and the Housing Department each developed their own internal file-naming conventions. When those systems were later networked, identical images began appearing under multiple identifiers.
The Department of Building and Safety — which processes permit applications for everything from Silver Lake bungalow renovations to large commercial builds in downtown's Arts District — has been particularly affected. Field inspectors upload site photographs from mobile devices, and until a 2023 policy update, the department had no automated deduplication tool. Images of the same property taken on the same day by two different inspectors would both enter the system as separate records, each drawing on limited server storage and complicating later file searches.
The Los Angeles Housing Department, operating under Mayor Karen Bass's housing emergency declaration first issued in January 2023, has faced a related version of the problem. Rapid intake of documentation for interim and bridge housing sites — including facilities near the 110 freeway corridor in South L.A. — meant staff were uploading images faster than any quality-control process could catch redundancies. By some internal estimates, duplicates accounted for a meaningful share of storage consumption in the department's shared drives by late 2024, though the city has not published a precise figure.
What a Fix Actually Looks Like
Addressing the backlog requires more than deleting extra files. Archivists and records managers have to verify that a duplicate is truly identical — same image, same subject, same date — before removal, because two similar-looking photographs of a property might document different conditions at different times. That verification process is labor-intensive, and the city's Information Technology Agency, headquartered on Main Street adjacent to City Hall, has been testing AI-assisted image-matching software since early 2025 to reduce the manual burden.
The ITA's pilot, running across three departments as of this spring, uses perceptual hashing — a technique that generates a compact digital fingerprint for each image and flags near-identical matches for human review. Similar programs have been deployed by the New York City Department of Records and the Chicago Department of Innovation and Technology, giving L.A. some existing models to draw from. The city has not disclosed the contract value or the vendor involved.
For residents navigating the system right now, the practical advice is straightforward: when filing a California Public Records Act request for image-based documentation, ask specifically for the canonical or primary file associated with a record number, and note the request date. That language prompts city staff to retrieve from the verified master record rather than pulling from a redundant copy that may carry incorrect metadata. The ITA pilot is expected to expand department-wide ahead of a broader digital records overhaul tied to 2028 Games preparation timelines — but the cleanup, by any measure, is still in early innings.