LA's Duplicate Image Problem: The Key Decisions Officials Must Make Now
City agencies and private developers are sitting on thousands of redundant digital records—and a reckoning over how to fix it is coming fast.
City agencies and private developers are sitting on thousands of redundant digital records—and a reckoning over how to fix it is coming fast.

Los Angeles city departments are facing a quiet but expensive crisis buried inside their own servers: tens of thousands of duplicate digital images cluttering permit files, infrastructure databases, and emergency-response archives—records that slow down the very systems the city is betting on to run the 2028 Olympics and accelerate Mayor Karen Bass's homelessness housing push. The question city technology officials must answer before the end of summer is deceptively simple: delete, migrate, or rebuild?
The timing matters. The Bureau of Engineering is in the middle of digitising decades of paper infrastructure records along the Sepulveda Boulevard transit corridor, while the Los Angeles Homeless Services Authority is expanding its Comprehensive Homeless Strategy database to track shelter placements and interim housing units in real time. Both efforts are generating fresh image files daily—blueprints, site photos, intake documents—and both are colliding with legacy systems that were never designed to flag or eliminate duplicates automatically. Without a decision soon, the redundancy compounds.
Storage is not free. The city's Information Technology Agency reported in its fiscal year 2025-26 budget documents that municipal cloud storage costs exceeded $14 million annually, a figure that does not account for the staff time spent manually reconciling mismatched or repeated files. Industry benchmarks suggest that between 20 and 30 percent of enterprise image repositories contain redundant files, which would put the city's wasted storage footprint in a range that independent technology analysts describe as significant for a government operation of this scale.
At Los Angeles World Airports, which manages LAX and Van Nuys Airport, the problem is particularly acute in the credentialing and security-camera archive systems, where images of the same access point are sometimes stored under multiple file trees after system migrations. The Port of Los Angeles in San Pedro faces a parallel issue inside its cargo-tracking and vessel-inspection photo databases, which expanded rapidly after trade volumes surged through 2024 and into 2025. Neither agency has publicly announced a remediation timeline.
The Entertainment Technology Center at USC has been studying AI-driven deduplication tools since 2023, partly because the same problem is wrecking post-production pipelines across studios in Burbank and Culver City. Researchers there have found that hash-based matching—essentially giving every image a unique digital fingerprint and flagging identical ones automatically—can cut redundant files by more than 40 percent in the first pass. The same methodology is now being pitched to city agencies, but adoption requires a policy decision about what gets deleted permanently versus archived offline.
Three choices are on the table, and city officials cannot avoid them indefinitely. First, they can authorise automated deletion of confirmed duplicates—fast and cheap, but legally risky if a deleted file later turns out to be the only surviving copy of a permitted structure that burns in a wildfire. Second, they can move duplicates to cold storage, keeping them accessible but off the active servers that affect day-to-day system performance; the cost drops sharply but does not go to zero. Third, they can rebuild their file intake processes from scratch, requiring new software procurement and retraining for thousands of city employees across departments from the Department of Building and Safety to the Los Angeles Fire Department.
The LAFD's Patsaouras Transit Plaza command-and-control upgrade, currently underway near Union Station, is the most immediate forcing function. That project requires clean, non-duplicated image records to feed its AI-assisted dispatch tools. Project managers have set an internal deadline of November 2026 to have the image library sorted, which gives the Information Technology Agency roughly four months to pick an approach and start executing it.
For residents and businesses filing permits in neighborhoods like Boyle Heights and Highland Park—areas at the center of the Bass administration's interim housing construction push—the downstream effect is practical: cleaner image databases mean faster permit approvals, fewer requests for resubmission, and less time waiting for inspectors who are currently cross-referencing duplicate files by hand. The fix is technical. The consequences of not fixing it are not.
How does this story make you feel?
Spread the word
About this article
Published by The Daily Los Angeles
Daily brief
Free, in your inbox before 7am. Weekdays.
More in News


