Los Angeles city agencies are deep into a multi-year effort to identify and replace duplicate images embedded across thousands of public-facing digital records — a problem that festered quietly through the early 2020s and only became impossible to ignore when storage costs and system slowdowns started affecting front-line services. The cleanup, now coordinated partly through the city's Information Technology Agency on Spring Street, has touched everything from permit databases maintained by the Department of Building and Safety to the digital evidence repositories used by the LAPD.
The issue matters right now because the city is simultaneously under pressure to digitize decades of paper-based records tied to Mayor Karen Bass's housing emergency declaration, prepare infrastructure for the 2028 Olympics, and expand digital services to residents dealing with ongoing wildfire recovery paperwork in neighborhoods like Altadena and Pacific Palisades. Every terabyte consumed by a duplicated inspection photograph or a redundant zoning map is storage that cannot serve those functions.
How the Duplication Problem Grew
The roots of the problem go back roughly fifteen years, to a period when city departments were migrating independently from paper to digital without a unified file-management standard. The Department of City Planning, headquartered in the Figueroa Plaza complex near downtown, ran its own document management system. The Bureau of Engineering operated another. The LAPD's records division maintained a third. When those systems were eventually linked or migrated to cloud-based platforms, automated ingestion scripts pulled the same image files multiple times — once from the legacy system, once from a transitional backup, sometimes a third time from a scanned archive. Nobody deleted the originals.
The problem compounded during the COVID-19 pandemic. Between March 2020 and the end of 2021, dozens of city divisions accelerated remote-access upgrades, generating new rounds of bulk file transfers. Staff working from home uploaded documents without standardized naming conventions. A single building inspection report for a property in, say, Boyle Heights might exist in four slightly different versions, each carrying the same attached JPEG of a foundation crack, none of them flagged as redundant by software that wasn't looking for image-level duplication.
By 2023, the city's ITA had begun piloting a deduplication protocol using hash-matching software — a standard technique that assigns a unique fingerprint to each image file and flags identical copies regardless of filename or folder location. The pilot ran first in the Department of Building and Safety's electronic plan check system, known as ePlan, which processes thousands of permit applications per year for properties across the city's 503 square miles. Early results surfaced redundancy rates that concerned department administrators enough to expand the program citywide, according to budget documents submitted to the City Council in fiscal year 2024-25.
What Comes Next for City Systems
The current phase of the project involves not just deleting duplicate images but replacing broken or missing image references in legacy records — a more painstaking task. When a duplicate is removed, any record that pointed to that file needs to be updated to point to a single canonical version. For older databases built on 1990s-era architecture, that means manual review queues. The ITA contracted with two technology vendors for this phase, and the work is expected to run through at least mid-2027, well into the lead-up to Olympic preparations that require city permitting systems to handle construction documentation at elevated volumes.
For ordinary Angelenos, the most practical implication is reliability of public records portals. Residents in communities like Watts and El Sereno who rely on the city's online permit lookup tool to research properties — whether for tenant rights purposes or contractor verification — have historically encountered broken image links or mismatched photographs. As the deduplication project advances, those errors should become less frequent. The ITA has published a public-facing project update page and directs residents with records access issues to the 311 service line as an interim step while the cleanup continues.