Los Angeles city agencies are sitting on an estimated tens of thousands of duplicate digital images—scanned permits, property photographs, infrastructure inspection photos, and public event documentation—spread across at least a dozen incompatible storage systems, a problem that grew quietly for more than a decade before landing on the agenda of the city's Information Technology Agency this spring.
The issue matters right now for a specific reason: the 2028 Olympics infrastructure buildout is generating an unprecedented volume of new visual documentation, from Venice Beach venue renderings to Exposition Park construction progress shots, and officials want a clean, searchable archive before that material compounds an already messy record. A duplicate image costs the city nothing to create and almost nothing to store individually, but across petabytes of data the redundancy inflates storage contracts, slows retrieval systems, and—in the context of the Bass administration's housing emergency—has already delayed property inspection records being matched to code enforcement cases in South Los Angeles and Boyle Heights.
How the Problem Accumulated
The roots go back to at least 2012, when the city's Bureau of Engineering began digitizing paper records without a unified naming convention. Different bureaus adopted different software platforms. The Department of Building and Safety used one vendor; the Bureau of Street Services used another. By the time the COVID-19 pandemic hit in March 2020, in-person processing had stopped and remote scanning operations at facilities including the Piper Technical Center on Ramirez Street downtown created a second wave of uploads, many of them re-scans of documents that already existed in older databases. Staff working from home had inconsistent access to the legacy systems, so the path of least resistance was to scan again and upload fresh.
The Los Angeles Homeless Services Authority, which processes tens of thousands of case files annually, ran into a version of the same problem when it migrated data between systems in 2021 and 2022. Property photographs attached to encampment documentation—images used to verify clearance of sites from Skid Row to the Sepulveda Basin—appeared in multiple case records, making audits harder and slowing response tracking under the mayor's Inside Safe program, which launched in January 2023.
The city's IT Agency, headquartered at City Hall East on Main Street, began a formal audit of image duplication in February 2026. The scope expanded quickly. Staff found redundant files not just in building records but in the city's public-facing GeoHub data portal, where aerial survey images from the Bureau of Engineering had been uploaded multiple times under different project codes. The duplication rate in some datasets exceeded 30 percent, according to a preliminary internal review completed in April 2026, though that figure has not yet been published in a public report.
What Comes Next for City Archives
The IT Agency is piloting a perceptual hashing tool—software that generates a fingerprint for each image and flags near-identical copies regardless of filename or upload date—across three departments first: Building and Safety, the Bureau of Engineering, and the city's Recreation and Parks department, which manages photo documentation for more than 400 facilities including Griffith Park and the Balboa Sports Complex in Encino. If the pilot, scheduled to run through September 2026, proves out, a citywide rollout is planned before the end of the fiscal year.
For residents and journalists trying to access public records, the practical advice for now is straightforward: if a California Public Records Act request for photographic documentation returns a large file with apparent duplicates, requesters can specifically ask the responding department to confirm whether deduplication has been applied. That language, included in a request, puts agencies on notice and often speeds the process. The city clerk's office on Spring Street handles appeals when records requests are disputed.
The cleanup is unglamorous work. But with Olympic venue construction documentation set to multiply city image archives severalfold over the next two years, officials who have studied the problem say the window to fix the underlying architecture is closing fast.