Los Angeles city departments collectively hold an estimated tens of millions of digital image files across their server infrastructure, and a significant share of those files are exact or near-exact duplicates — a problem that has quietly grown into a measurable drain on public IT budgets at a moment when the city can least afford it.
The issue matters right now for a specific reason: the 2028 Olympics construction pipeline, Mayor Karen Bass's ongoing housing emergency declaration, and the city's post-Palisades wildfire documentation requirements have all triggered massive new waves of photographic and aerial imagery intake since early 2025. Each program generates its own archiving protocols, often with no cross-departmental deduplication standard in place. The result is redundant data accumulating across siloed systems at City Hall East, the Los Angeles Housing Department's Spring Street offices, and the Bureau of Engineering's downtown headquarters on South Figueroa Street.
What the Data Actually Shows
Industry benchmarks from enterprise data management research — including figures published by Gartner and IDC in their most recent storage cost analyses — suggest that between 25 and 40 percent of files in large municipal image repositories are duplicates. Applied to Los Angeles, where the city's Information Technology Agency reported managing over 18 petabytes of total data as of its fiscal year 2024-25 budget submission, even a conservative 25 percent duplication rate across image-heavy departments translates to potentially thousands of terabytes of redundant storage. At current cloud and on-premise storage rates that hover between $20 and $50 per terabyte per month for enterprise-grade systems, the carrying cost compounds fast.
The Los Angeles Homeless Services Authority, which coordinates with dozens of outreach nonprofits across neighborhoods from Skid Row to the San Fernando Valley, uses photographic documentation to log encampment conditions, verify service contacts, and satisfy federal Housing and Urban Development reporting requirements. Multiple contractors photographing the same locations on overlapping schedules — a common operational reality along corridors like the LA River Greenway and under the 101 Freeway interchanges near Boyle Heights — means the same image or near-identical variants get uploaded into separate case management systems. Staff then manually reconcile those files, a process that one city IT audit framework, referenced in the ITA's 2024 annual report, flagged as a category of avoidable labor cost.
The Olympics Clock Is Running
The LA28 organizing committee and the city's Bureau of Engineering are jointly overseeing infrastructure upgrades at venues including SoFi Stadium in Inglewood, the Sepulveda Basin Sports Complex in Van Nuys, and the Los Angeles Memorial Coliseum in Exposition Park. Each site generates continuous aerial surveys, progress photography, and inspection imagery. Without a unified asset management platform that strips duplicates at the point of ingest, those repositories will compound over the next 24 months leading up to the July 2028 opening ceremony.
Several Los Angeles County agencies, including the Department of Public Works, began piloting automated deduplication tools as part of a broader digital infrastructure review in late 2024, according to publicly posted county budget documents. The city proper has moved more slowly. The ITA's current fiscal year budget, approved in spring 2026, allocated funding for a cloud migration study but did not earmark a specific line item for image deduplication software procurement.
For residents and advocates tracking how public dollars get spent — particularly amid pressure on the Bass administration to demonstrate efficient use of the roughly $1.3 billion in homelessness-related spending approved in the city's most recent budget cycle — the duplicate image problem is less a technical curiosity than a symptom of broader data governance gaps. Departments that can't manage their photo archives cleanly are departments that struggle to produce clean audit trails, accurate progress reports, and defensible spending records when federal oversight arrives.
The practical path forward involves the city mandating perceptual hashing or checksum-based deduplication at the point of file upload across all major departments — a standard that enterprise systems like Extensis Portfolio and Adobe Experience Manager already support and that peer cities including Chicago and New York have incorporated into their digital asset management policies over the past three years. The ITA's next procurement cycle, expected to open for public comment in October 2026, will be the clearest opportunity to get that requirement written into city contract language before the Olympic documentation load peaks.