Los Angeles city agencies, public libraries, and entertainment-adjacent archives are confronting a problem that has quietly ballooned since the COVID-era digitization push: hundreds of thousands of duplicate images clogging servers, distorting public records, and eating through storage budgets at a pace that administrators say is no longer sustainable. The immediate question is not whether to act, but which replacement and deduplication framework to adopt — and who pays for it.
The stakes are higher than they might appear. With the 2028 Olympics infrastructure build accelerating across sites from SoFi Stadium in Inglewood to the Los Angeles Memorial Coliseum, and with Mayor Karen Bass's ongoing housing emergency generating a near-daily flood of permit records, site photographs, and inspection imagery, the volume of official digital assets is growing faster than any current policy can manage. A decision delayed into 2027 will cost significantly more to untangle.
Where the Backlog Lives
The Los Angeles Public Library system, which operates 73 branch locations including the Central Library on West 5th Street downtown, has been digitizing historical photograph collections since at least 2018. Librarians have flagged that deduplication — the process of identifying and replacing lower-quality or redundant versions of the same image with a single authoritative file — has not kept pace with ingestion. The result is a catalog that contains multiple versions of the same image at different resolutions, different color corrections, and sometimes different metadata, making accurate public search unreliable.
The Los Angeles County Department of Public Works, which manages permit and inspection photography across unincorporated areas stretching from the San Gabriel Valley to the Santa Monica Mountains, faces a related but distinct challenge. Field inspectors upload images directly from mobile devices, and without an automated hash-matching system to catch duplicates at the point of upload, the county's document management platforms accumulate redundant files that inflate storage costs and complicate legal discovery requests. Storage rates for enterprise cloud infrastructure have climbed steadily — industry benchmarks put per-terabyte annual costs for compliant government cloud storage between $200 and $600 depending on redundancy and access tier, though any figure specific to L.A. County contracts would require a public records request to verify.
The Getty Conservation Institute, headquartered on the Getty Center Drive campus above Brentwood, has done more than most to establish best practices for image asset governance in cultural institutions. Its guidance on digital preservation, updated in 2023, recommends periodic audits using perceptual hashing algorithms to flag near-duplicate images — not just exact byte-for-byte matches — before replacement decisions are made. That distinction matters because a slightly cropped or differently compressed version of an archival photograph may carry its own preservation value.
The Decisions That Cannot Wait
Three choices will define how this plays out over the next 18 months. First, city and county agencies need to settle on a deduplication standard — whether to replace lower-quality duplicates automatically or route them through a human review queue. Automation is faster and cheaper; human review is safer for irreplaceable assets. Most archivists recommend a tiered approach, with automated replacement for files below a quality threshold and mandatory review for anything flagged as historically significant.
Second, the question of who owns the replacement decision for images that appear in both a city archive and a private collection — a recurring issue in neighborhoods like Boyle Heights and Leimert Park, where community-generated photography has been donated to multiple institutions simultaneously — needs a formal inter-agency protocol. Right now, no such protocol exists at the city level.
Third, funding. The Bass administration's digital infrastructure priorities have centered on the Las Iniciativas broadband expansion and the HomelessnessAI pilot in Skid Row, leaving image asset governance without a dedicated budget line. An appropriation request submitted to the City Council's Budget and Finance Committee before the August recess would give the program a real shot at 2026-27 funding.
For Angelenos who interact with public records — journalists, researchers, attorneys, community organizers pulling permit histories in neighborhoods facing development pressure — the practical advice is simple: file California Public Records Act requests now for image-heavy document sets you may need, before any replacement cycle potentially removes or reclassifies files. Once a deduplication sweep runs, recovering superseded versions becomes significantly harder.