LA's Hidden Digital Drain: The Numbers Behind the City's Duplicate Image Problem
From city archives to Olympic planning files, redundant digital assets are costing Los Angeles agencies millions in storage and staff hours.
From city archives to Olympic planning files, redundant digital assets are costing Los Angeles agencies millions in storage and staff hours.

Los Angeles city departments are collectively storing hundreds of millions of duplicate image files across fragmented digital systems, a problem that auditors and IT administrators have been flagging for years but that has grown measurably worse since the January 2025 wildfires forced emergency digitisation of thousands of paper records. The scale of the redundancy is significant: industry benchmarks suggest that large municipal governments typically carry duplicate rates of between 30 and 45 percent across unmanaged digital asset repositories, translating directly into inflated cloud storage contracts and wasted labour hours.
The timing matters because Los Angeles is under unusual pressure on multiple fronts. Mayor Karen Bass's housing emergency declaration has pushed the city's Housing and Community Investment Department to digitise inspection records and property photos at an accelerated pace. Simultaneously, the LA 2028 Olympics infrastructure office is building a sprawling digital archive of venue design files, contractor submissions and site photography — a project that analysts warn is particularly vulnerable to duplicate proliferation when dozens of contractors upload assets through different portals with no centralised deduplication protocol.
The problem is concentrated in a handful of key operations. The LA Department of Building and Safety, headquartered on South Spring Street in Downtown, processes roughly 45,000 permit applications per month, each of which can carry multiple attached photographs. Without automated deduplication, the same site photo frequently appears in three or four separate case files. The city's GIS division at the Information Technology Agency on South Main Street maintains aerial and satellite imagery layers that, according to procurement documents reviewed by The Daily Los Angeles, are re-ingested from vendor feeds without systematic checks for frames already held in the archive.
The Los Angeles County Metropolitan Transportation Authority faces a parallel issue. Metro's capital projects division, managing more than $23 billion in active construction across the Purple Line Extension and the East San Fernando Valley Light Rail, generates thousands of inspection images weekly. Storage contracts for unstructured data at agencies of Metro's scale typically run between $0.02 and $0.05 per gigabyte per month on commercial cloud platforms — costs that compound quickly when duplicate rates push stored volumes 30 percent higher than necessary.
The Los Angeles Fire Department accelerated its aerial drone imaging programme after the Palisades and Eaton fires, capturing high-resolution footage across hillside neighbourhoods from Brentwood to Altadena. The LAFD's digital operations unit has acknowledged internally that its post-incident archive workflow lacks a deduplication step before files are transferred to long-term storage, according to city budget justification documents attached to the department's fiscal year 2025-26 technology request.
The core data point driving urgency is straightforward. A 2024 analysis by the nonprofit Center for Digital Government found that mid-to-large US municipalities could recover between 18 and 28 percent of digital storage expenditure by deploying perceptual hashing tools — software that identifies visually identical or near-identical images even when file names, timestamps and metadata differ. For a city the size of Los Angeles, which spent approximately $180 million on information technology infrastructure in fiscal year 2024-25, even a conservative 15 percent storage recovery on the image-heavy portion of that budget represents millions of dollars annually.
The practical barrier has been procurement fragmentation. The City of Los Angeles operates under a decentralised IT model in which each department negotiates its own software agreements. That means the Bureau of Engineering on South Figueroa Street and the Department of Recreation and Parks, which manages photo archives for 480 parks across the city, are running different asset management platforms with no shared deduplication layer between them.
City Council's Budget and Finance Committee is scheduled to review a consolidated digital infrastructure proposal in September 2026, ahead of the fiscal year 2027 budget cycle. Advocates for the reform argue that the 2028 Olympics deadline — venues must be fully certified by late 2027 — creates a forcing function that prior reform efforts lacked. Departments that want to integrate their digital asset systems with the LA28 organising committee's unified project portal will need to clean their archives first. That means running deduplication passes before the September review if they want to be included in the pilot programme.
For residents, the consequence is less abstract than it sounds: duplicated records slow public-records requests, inflate the city's technology budget, and — in the case of fire-risk mapping imagery — can create confusion about which version of a survey file is authoritative when emergency planners need answers fast.
How does this story make you feel?
Spread the word
About this article
Published by The Daily Los Angeles
Daily brief
Free, in your inbox before 7am. Weekdays.
More in News


