Los Angeles city agencies collectively store an estimated tens of millions of digital image files across municipal servers, and a growing share of that data is pure duplication — the same photograph saved twice, three times, sometimes dozens of times under different filenames. The hidden price tag attached to that redundancy has quietly become a budget conversation in several departments, coming into sharper focus as the city prepares its technology infrastructure for the 2028 Olympic and Paralympic Games.
The issue sits at the intersection of two pressures Los Angeles cannot ignore right now. The city is spending heavily on digital infrastructure upgrades tied to Olympic readiness, and at the same time Mayor Karen Bass's housing emergency declaration has pushed city agencies to account for every dollar in operational overhead. Duplicate image storage is not glamorous, but it consumes real server capacity, real energy, and real money that administrators would rather redirect.
What the Data Actually Shows
Cloud storage pricing from major providers currently runs between roughly $0.02 and $0.023 per gigabyte per month for standard-tier storage. A single high-resolution image file from a modern camera can reach 25 to 50 megabytes. An archive of 500,000 duplicated images — a figure well within the range reported by large municipal systems in comparable U.S. cities — translates to somewhere between 12 and 25 terabytes of wasted capacity. At commercial rates, that represents a recurring annual cost running into hundreds of thousands of dollars before personnel time is factored in.
The Los Angeles County Metropolitan Transportation Authority, which maintains photo documentation for infrastructure projects across its network of more than 100 rail and bus rapid transit stations, has acknowledged in public budget documents that digital asset management inefficiencies exist system-wide, though the agency has not published a specific duplicate-image count. The Los Angeles City Clerk's office, which archives photographic records of city proceedings and public works projects, operates under a mandate from the California Public Records Act to retain digital documentation — a legal requirement that has, over two decades of digital record-keeping, produced significant file overlap as staff upload materials from multiple sources without deduplication protocols.
The Los Angeles Public Library system, which spans 73 branch locations from Chatsworth to San Pedro, digitized roughly 4 million items across its Photo Collection and California Index over the past decade. Librarians and archivists at the Central Library on Fifth Street in Downtown LA have described deduplication as an ongoing operational challenge, though the library has not released current redundancy figures. Getty Images, headquartered in Seattle but with a major editorial operation in Los Angeles that feeds local news outlets, uses automated perceptual hashing — software that detects visually identical or near-identical images by comparing pixel patterns — as a standard workflow tool. Smaller organizations across the city have not adopted similar tools at scale.
Why It Matters Beyond the Budget Line
The practical consequences extend beyond storage costs. When the Los Angeles Fire Department or the Department of Public Works needs to quickly retrieve photographic documentation during an emergency — a wildfire bearing down on the hillside communities of Altadena or Sylmar, for instance — bloated archives with redundant files slow retrieval and complicate version control. During the January 2025 Palisades and Eaton fires, multiple city agencies were simultaneously pulling and sharing aerial and ground-level imagery, and after-action reviews flagged digital asset coordination as an area needing improvement.
The technology to address the problem is not new. Perceptual hashing algorithms, reverse image matching, and automated deduplication software have been commercially available and widely deployed in private-sector media companies since at least 2015. The Los Angeles County Office of Digital Services, established in 2021, has a stated mandate to modernize data management across county departments, but deduplication has not featured prominently in its public-facing project pipeline as of mid-2026.
For city agencies and cultural institutions looking to get ahead of the problem, archivists recommend starting with an audit of the five largest file repositories before investing in enterprise deduplication software. With Olympic media operations expected to generate millions of new image files beginning in summer 2028, the window to clean up existing archives before the volume compounds is closing. The time to run the audit is now, not two years from now when Exposition Park is full of television cameras.