Los Angeles city agencies collectively store an estimated 47 terabytes of duplicate image files across their digital infrastructure — redundant photographs that consume server space, slow down workflows, and quietly drain budgets that could go toward sidewalk repairs or shelter beds. That figure comes from a technology audit completed in May 2026 by the city's Information Technology Agency, which found that duplicate and near-duplicate images account for roughly 23 percent of all static media files held on municipal servers.
The timing matters. With the 2028 Summer Olympics less than two years out, the city is accelerating a broad digital modernization push. The Los Angeles City Administrative Office has flagged data storage costs as one of the fastest-growing line items in the municipal technology budget, and bloated image libraries are a significant driver. Meanwhile, the Bureau of Engineering and the Department of City Planning are both racing to digitize decades of paper records and site photography to support Mayor Karen Bass's housing emergency declaration — a process that generates new image files daily and compounds the duplication problem if not managed carefully.
Where the Numbers Come From
The ITA audit examined servers at six departments: the Los Angeles Police Department's evidence management unit at the Valley Bureau in Van Nuys, the Department of Public Works, the Los Angeles Housing Department on Figueroa Street downtown, the Bureau of Engineering's offices near Civic Center, the Department of Recreation and Parks, and the nascent LA28 Olympic digital asset office in El Segundo. Across those six, auditors identified more than 2.1 million image files flagged as probable duplicates using perceptual hash comparison — a technique that detects visually identical or near-identical photos even when file names differ.
Storage costs in the city's current municipal cloud contract, renegotiated in March 2025 with a vendor through the General Services department, run approximately $0.023 per gigabyte per month for warm storage. At that rate, the 47 terabytes of duplicate imagery costs the city roughly $13,000 a month — about $156,000 annually — for files that serve no archival purpose. That is not a catastrophic sum for a city with a multi-billion-dollar budget, but technology officers note it is emblematic of a broader inefficiency: the actual cost, they argue, is in staff hours spent manually searching through redundant files, not just in raw storage fees.
The LAPD's evidence photo archive presents the most acute case. Body camera footage and crime scene photography have expanded exponentially since the department's 2021 body-worn camera rollout, and secondary upload errors have seeded the archive with thousands of duplicate still frames. Deduplication there is not simply a cost question — it is a chain-of-custody and legal-discovery issue that defense attorneys in cases heard at the Clara Shortridge Foltz Criminal Justice Center have raised in at least a handful of motions over the past 18 months, according to court records filed in Los Angeles Superior Court.
What the City Plans to Do
The ITA is piloting an automated duplicate-detection and replacement workflow at two departments — Recreation and Parks and the Housing Department — starting August 1, 2026. The system flags probable duplicates for human review rather than deleting them outright, a safeguard insisted upon by the City Attorney's office to protect litigation holds. If the pilot meets its targets, a citywide rollout is scheduled for the first quarter of 2027, roughly 18 months before the Olympic torch arrives.
For the LA28 digital asset library, which will eventually house official Games photography distributed to international media partners, the stakes are reputational as much as operational. Duplicate or misidentified images pushed to wire services during the Games would be embarrassing at a moment when the city is under global scrutiny. The El Segundo office has already contracted with a digital asset management firm to implement deduplication protocols before the library goes live in early 2027.
City officials have told the ITA to produce a cost-benefit report by September 30, 2026. If the pilot data supports a broad rollout, the full deduplication program could free up enough server capacity to delay a planned $2.4 million storage infrastructure expansion — money that could be redirected, at least in part, to the technology needs of the housing emergency response.