Los Angeles city archivists have spent the past 18 months systematically purging duplicate images from the municipal digital records system — a sprawling effort touching everything from building permit photographs filed at the Department of Building and Safety on South Spring Street to aerial survey files held at the Bureau of Engineering in downtown Civic Center. The work, folded under the city's broader Digital LA Initiative launched in late 2024, has cleared more than 400,000 redundant image files from public-facing databases, according to figures the city's Information Technology Agency presented to the City Council's Budget and Finance Committee in April 2026.
The timing is not incidental. With the 2028 Summer Olympics deadline compressing infrastructure and planning timelines across the basin, the city cannot afford bloated, error-prone databases slowing permit reviews, emergency-response mapping, or the real-time traffic and venue logistics systems being built out for Games operations. Duplicate images — photos filed twice under different case numbers, aerial tiles duplicated across survey runs — slow search results, inflate storage costs, and occasionally surface the wrong version of a record in legal disputes. For a city managing wildfire-risk mapping in the hills above Brentwood and Altadena at the same time as it tracks homeless encampment clearance for Mayor Karen Bass's Inside Safe program, clean data infrastructure is not an abstraction.
What LA Is Doing Differently
The city's approach relies on a perceptual hashing system — software that generates a fingerprint for each image and flags near-identical files even when they differ by compression artefact or timestamp metadata. The Getty Conservation Institute in Los Feliz has consulted informally on image-integrity standards for archival materials, though the bulk of the technical work sits with the ITA's data governance team. The Bureau of Engineering has run parallel deduplication passes on its GIS-linked photo libraries, which include street-level surveys for the Olympic infrastructure corridors along the Metro Crenshaw/LAX line extension and the redevelopment footprint around SoFi Stadium in Inglewood.
Storage savings from the deduplication program are projected to reach roughly $2.1 million annually by the end of fiscal year 2027, based on the city's own cost-per-terabyte contracts for cloud archival services — contracts renegotiated with Microsoft Azure in January 2026. The ITA has also reduced the median image-retrieval time on its public records portal from 11 seconds to under four seconds for permit-related searches, a change that contractors filing documentation on Hyperion Avenue corridor projects noticed in March.
How Other Cities Compare
Amsterdam moved earlier. The city's Stadsarchief — its municipal archive — began a deduplication and image-rights audit program in 2022, working through roughly 1.3 million digitised photographs accumulated since the 1990s. By 2025, Amsterdam had reduced its digital image repository by about 18 percent and published its deduplication methodology as open-source documentation adopted by archivists in Copenhagen and Vienna.
Seoul's Smart City Data Hub, operated through the Seoul Digital Foundation, took a different approach: rather than removing duplicates, it layers them into versioned records so planners can track how a site looked across multiple survey dates — useful for a dense city where construction timelines move fast. Chicago adopted a hybrid model through its Department of Innovation and Technology in 2023, prioritising deduplication for public-safety and building-inspection imagery while leaving planning and cultural heritage records intact for version comparison.
Los Angeles is closer to Chicago's model than Amsterdam's, with city staff drawing a deliberate distinction between operational records — where duplicates create liability — and archival collections, where historical variants have preservation value. The LA City Archives at 555 Ramirez Street in the Arts District is not currently part of the deduplication sweep, a decision that has drawn some criticism from open-records advocates who argue consistency matters more than category.
The ITA expects to complete the first full pass through operational image databases by December 2026, ahead of the Olympic planning crunch that will accelerate permit processing across venues in Hollywood, Long Beach, and the San Fernando Valley. Residents and contractors who use the city's online permit portal can already see faster results; those filing new records are advised to use the updated submission guidelines posted on the ITA website since May 2026, which specify maximum file sizes and accepted formats to reduce accidental duplication at the point of entry.