Los Angeles city agencies are sitting on tens of thousands of duplicate digital images — redundant photographs, scanned documents, and graphics files stored across at least a dozen separate departmental servers — a problem that crept up over more than two decades of piecemeal technology investment and is now demanding a coordinated response as the city prepares for the scrutiny that comes with hosting the 2028 Summer Olympics.
The issue matters right now because L.A. is actively building out the digital infrastructure that will support everything from Olympic venue promotion to the Mayor's homelessness dashboard under the ongoing housing emergency declaration. Duplicate and mismatched image files slow those systems, inflate cloud storage costs, and — in the worst cases — push outdated or legally restricted photographs back into public-facing platforms.
A Decades-Long Accumulation
The roots of the problem stretch back to the early 2000s, when individual city departments began digitizing records independently, with no shared naming convention or central repository. The Los Angeles Department of City Planning, which manages images tied to permit applications and environmental impact reports, built its own archive. The Bureau of Engineering maintained a separate photo library for infrastructure projects along corridors like the I-405 widening and the Sixth Street Viaduct reconstruction. The Los Angeles Housing Department, handling thousands of case files generated under Mayor Karen Bass's April 2023 emergency declaration on homelessness, accumulated its own parallel image store.
By the time city IT managers began auditing these systems in late 2024, the duplication rate in some departments reportedly exceeded 30 percent of total stored files, according to internal assessments reviewed by city council staff — though the city has not published an official system-wide figure. Each redundant file costs money to store, index, and back up. Cloud storage contracts the city holds with major vendors run to millions of dollars annually across all departments, and storage bloat is a line item that budget analysts on Spring Street have flagged repeatedly during recent fiscal cycles.
The entertainment industry's ongoing disruption from AI tools added another layer. Studios and production companies, many of them headquartered in Burbank and Culver City, began licensing city-held images for promotional and archival use starting around 2021. When duplicate versions of the same photograph carried different metadata — different licensing dates, different credited photographers — rights clearance became a genuine legal headache for the city attorney's office.
The Push Toward a Unified System
The practical answer, city technology staff have concluded, is a duplicate-image replacement protocol: a standardized workflow that identifies redundant files using perceptual hashing algorithms, flags the lowest-quality or worst-documented version, and routes it for deletion or archival storage, while replacing any live links with a pointer to a single canonical version.
The Los Angeles County Metropolitan Transportation Authority — Metro — piloted a version of this approach in 2023 when it consolidated its project photo libraries ahead of the opening of the Crenshaw/LAX Line extension. The process took roughly eight months and eliminated an estimated 22,000 duplicate image records from Metro's digital asset management system, according to a project summary the agency published on its website in early 2024.
For city departments still working through the transition, the practical steps are well-established. IT teams typically run an initial audit using open-source or commercial deduplication software, establish a master asset management platform with enforced naming schemas, and retrain staff on upload procedures. The Getty Conservation Institute, based on Wilshire Boulevard, has published guidance on exactly this kind of digital preservation workflow for public institutions — guidance that several L.A. city librarians have cited in internal memos as a relevant framework.
The timeline now is tied directly to Olympic preparation. City planners want consolidated, clean digital asset libraries operational across key departments by mid-2027, leaving roughly a year of buffer before the Games begin in July 2028. Whether the budget to complete that work survives the city's next round of fiscal negotiations — council members are expected to revisit capital technology spending in the fall 2026 budget cycle — will determine how much of the cleanup actually gets finished before the world starts watching.