The Daily Los Angeles

Los Angeles news, every day

News

LA's Digital Archive Crisis: The Hidden Cost of Duplicate Images Clogging City Systems

Redundant image files are quietly consuming terabytes of government and institutional storage across Los Angeles — and the bill is growing.

By Los Angeles News Desk · Published 4 July 2026, 12:23 pm

4 min read

LA's Digital Archive Crisis: The Hidden Cost of Duplicate Images Clogging City Systems
Photo: Photo by Price Huang on Unsplash

Los Angeles city agencies, cultural institutions, and public-sector contractors are sitting on hundreds of thousands of duplicate image files spread across aging server infrastructure — a problem that has drawn increasing scrutiny from technology auditors as the city accelerates its digital buildout ahead of the 2028 Olympics. The numbers are not trivial. Storage analysts working with municipal procurement data have estimated that redundant digital assets — images that have been scanned, re-uploaded, or duplicated without a systematic deduplication protocol — can account for anywhere from 18 to 35 percent of total file storage in large government environments, a range consistent with findings published by the Government Technology research group in its 2024 public-sector IT report.

The timing matters. The city of Los Angeles is mid-stream on a multi-agency digital transformation push tied directly to 2028 infrastructure planning and Mayor Karen Bass's ongoing housing emergency response, both of which require high-volume document scanning, photographic records, and architectural imaging. Every duplicated site photo or redundant permit scan sitting on a city server costs real money in cloud storage fees — fees the city pays to vendors including Amazon Web Services and Microsoft Azure under contracts routed through the LA Department of Technology.

Where the Problem Lives in Los Angeles

The Los Angeles City Archives, housed in the Piper Technical Center on North Spring Street in Lincoln Heights, manages tens of millions of physical and digitised records. Staff there have been working through a backlog of historical digitisation since at least 2021. Separately, the LA County Department of Arts and Culture, which oversees public collections including digital assets tied to programs at the Grand Park performance space and venues across the San Fernando Valley, has flagged image metadata inconsistencies in internal workflow reviews cited in county budget documents. Neither institution has published a specific cost figure attributable solely to duplication, but storage consumption reports from comparable municipal archives — including those cited in a 2023 Urban Libraries Council study — have found that unmanaged duplication typically inflates storage requirements by 20 to 30 percent above actual unique-file need.

The Los Angeles Unified School District, which runs its own data infrastructure for roughly 575,000 students across more than 1,000 schools, presents another layer of the problem. LAUSD's instructional technology division has been expanding digital content libraries since 2022, pulling in licensed images, scanned worksheets, and curriculum assets. District procurement records show recurring line items for expanded cloud storage capacity, though the district has not publicly broken out what share of that expenditure is attributable to duplicate content.

What Deduplication Actually Costs — and Saves

The mechanics of duplicate image replacement are less glamorous than the dollar figures attached to them. Automated deduplication software — tools like Cloudberry, Veeam, or open-source alternatives deployed on Linux-based servers — typically runs between $0.03 and $0.12 per gigabyte of storage reclaimed, depending on contract scale and whether the work is handled in-house or outsourced. For a city the size of Los Angeles, where the Department of Technology oversees an infrastructure portfolio valued at over $300 million according to the city's adopted 2025-26 budget summary, even a conservative 20 percent reduction in redundant image storage across departments could translate to six-figure annual savings in cloud licensing fees alone.

The practical obstacle is not the technology. It is workflow. When multiple city departments — say, the Bureau of Engineering on South Figueroa Street and the Planning Department on West 1st Street — independently scan the same permit drawings or site photographs without a shared asset management system, duplicates accumulate by default. Without a centralised digital asset management platform mandating hash-based deduplication at the point of upload, the problem compounds faster than any cleanup effort can address it.

The city's IT roadmap, submitted to the City Council's Budget and Finance Committee earlier this year, includes language about consolidating redundant data systems ahead of the Olympic influx of media, logistics, and public-safety data expected between 2027 and 2028. Technology policy analysts watching the process say the image duplication question is likely to become a procurement line item within the next budget cycle — one that city departments would do well to get ahead of before the data volumes tied to Olympic planning make the cleanup exponentially harder.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Los Angeles

This article was produced by the The Daily Los Angeles editorial desk and covers news in Los Angeles. See our editorial standards for how we use AI.

The Daily Los Angeles brief

The day's Los Angeles news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Los Angeles news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Los Angeles and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Los Angeles

More in News

Enjoyed this story? Get tomorrow's briefing free.