How Los Angeles Got Buried in Its Own Photos: The Long Road to the Duplicate Image Crisis
Years of fragmented digital recordkeeping across city agencies have left LA with a sprawling, redundant image archive — and a growing push to clean it up.
Years of fragmented digital recordkeeping across city agencies have left LA with a sprawling, redundant image archive — and a growing push to clean it up.
Los Angeles city agencies collectively manage hundreds of thousands of digital image files spread across at least a dozen separate database systems, and a significant portion of those files are duplicates — the same photograph stored multiple times under different names, in different folders, on different servers. That is the conclusion city IT officials have been working from since an internal audit commissioned by the Office of the City Administrative Officer in late 2024 flagged the problem as both a storage cost issue and a public records liability.
The timing matters. With the 2028 Summer Olympics less than two years away and Mayor Karen Bass's housing emergency generating a daily avalanche of site photography, construction documentation, and planning imagery, the city's digital infrastructure is under more strain than at any prior point in its history. Duplicate image files aren't just a housekeeping annoyance — they slow records requests, inflate cloud storage invoices, and in at least one documented case involving the Los Angeles Department of Building and Safety, created conflicting versions of permit documentation photographs that complicated a code enforcement proceeding in the Boyle Heights neighborhood.
The roots of the problem go back to the mid-2000s, when individual departments began digitizing their photo archives independently, with no citywide standard for file naming, metadata tagging, or storage location. The Los Angeles Department of Transportation, the Bureau of Engineering, the Department of Recreation and Parks — each built its own system. When the city moved incrementally toward cloud infrastructure starting around 2018, those siloed archives were often migrated wholesale rather than consolidated, carrying their redundancies with them into Amazon Web Services and Microsoft Azure environments.
The Los Angeles Housing Department, which has been producing documentation photography at high volume under the Bass administration's Inside Safe program — a street-to-shelter initiative that has operated at dozens of sites including locations along Skid Row on San Julian Street and encampments near the 101 freeway corridor in Hollywood — began generating duplicate images almost immediately when field teams uploaded photos through two separate mobile apps that fed into two separate servers. Neither app checked whether an image already existed in the system before saving a new copy.
By the time the 2024 audit circulated internally, city IT staff had identified cloud storage costs related to image redundancy running into six figures annually, though the precise figure has not been released publicly. Storage costs for municipal cloud environments nationally have risen sharply since 2022; industry benchmarks published by the Government Technology association put average per-gigabyte costs for municipal cloud contracts in major U.S. cities at roughly $0.08 to $0.23 per gigabyte per month depending on tier and contract terms.
Consolidation efforts are now underway, centered on a project managed out of the city's Information Technology Agency, headquartered on Main Street in downtown Los Angeles. The ITA has been piloting a deduplication protocol using perceptual hashing — a technique that identifies visually identical or near-identical images even when file names differ — across the Bureau of Engineering's archive first, treating it as a proof-of-concept before any wider rollout.
The pilot launched in the first quarter of 2026 and is expected to produce a full report by September. Meanwhile, the 2028 Olympics planning apparatus — coordinated partly through LA28, the organizing committee based in Westwood — has its own image management requirements, and city officials have flagged the risk that Olympic-era documentation photography will compound existing redundancy problems if a unified protocol isn't in place before construction documentation ramps up further.
For Angelenos, the practical consequence of inaction is slower responses to public records requests that involve photographs — requests that have increased substantially since the January 2025 wildfires, when residents and insurers began filing document requests tied to damage assessment imagery held by city agencies. Anyone expecting photographic records from the city should anticipate delays that ITA officials have publicly acknowledged stem in part from the difficulty of locating the authoritative version of a given image across multiple storage environments. The deduplication pilot is the first systematic attempt to change that.
How does this story make you feel?
Spread the word
About this article
Published by The Daily Los Angeles
Daily brief
Free, in your inbox before 7am. Weekdays.
More in News


