Los Angeles city departments collectively store an estimated hundreds of terabytes of digital image data, and a growing share of it is junk — exact or near-exact duplicates that inflate storage contracts, slow retrieval systems, and, in some cases, muddy legal records. A review of publicly available procurement documents and technology audits from comparable U.S. municipal systems suggests duplicate image files can account for anywhere from 20 to 40 percent of total image storage volume in large government repositories, a figure that translates directly into unnecessary vendor costs.
The timing matters for Los Angeles specifically. The city is in the middle of a capital spending surge tied to the 2028 Summer Olympics, which requires detailed photographic documentation of construction progress at venues from SoFi Stadium in Inglewood to the Los Angeles Memorial Coliseum in Exposition Park. At the same time, Mayor Karen Bass's ongoing housing emergency declaration has pushed city departments to digitize inspection records, shelter intake forms, and before-and-after site photography at an accelerated pace. Both pipelines generate massive image inventories — and both are vulnerable to the duplication problem that archivists and IT managers have flagged for years.
What the Numbers Actually Show
Cloud storage for enterprise-grade municipal systems runs roughly $0.023 per gigabyte per month on standard tiers from major providers, according to published pricing schedules from Amazon Web Services and Microsoft Azure as of mid-2026. That sounds trivial until the scale lands. A department maintaining 50 terabytes of image files, with 30 percent of those files being duplicates, is paying for approximately 15 terabytes of redundant data every single month. Annualized, that figure reaches into tens of thousands of dollars for a single department — and Los Angeles has dozens of departments managing independent image archives.
The Los Angeles Department of Building and Safety, which operates permit and inspection portals including its public-facing ePlans system, has publicly acknowledged the challenge of managing growing digital backlogs. The department processed more than 200,000 permit applications in fiscal year 2024-25, each potentially carrying multiple attached image files submitted by applicants, inspectors, and contractors. Without automated deduplication protocols, identical site photographs submitted under slightly different file names are stored as distinct objects, each consuming full storage allocation.
The LAPD's digital evidence management system presents a related but legally higher-stakes version of the same problem. Body-worn camera footage and crime scene photography archives are subject to strict California retention requirements under Penal Code provisions and department policy. Duplicate files in that context are not just wasteful — they can complicate chain-of-custody documentation and create ambiguity in discovery proceedings. The city's Information Technology Agency, headquartered on Spring Street in downtown Los Angeles, has jurisdiction over citywide data governance standards but enforcement across individual department systems remains inconsistent, based on published audit reports from the City Controller's office.
What Comes Next for City IT
Automated duplicate image detection tools have matured considerably. Perceptual hashing algorithms — which identify visually identical or near-identical images regardless of file name, format, or minor compression differences — now run at speeds that can process millions of files in hours rather than days. Several vendors active in the Los Angeles municipal procurement market, including those on the city's approved IT services contract lists managed through the Bureau of Contract Administration, offer deduplication as a module within broader digital asset management platforms.
For the Olympic documentation effort in particular, the deadline pressure is real. Construction photography for venues under the LA28 organization's oversight is expected to accelerate significantly through 2026 and 2027 as completion timelines tighten. Getting image governance right before that volume peaks is materially cheaper than cleaning up afterward. Storage costs compound; legal exposure from mismanaged evidence files does not shrink on its own.
City departments that have not already implemented deduplication scanning should treat the current fiscal year's IT budget cycle as the practical window. The alternative is paying, month after month, for the same photograph stored twice.