Skip to main content
The Daily Ballarat

Ballarat news, every day

News

How Ballarat's Digital Archives Ended Up Full of Ghost Images — and What Happens Next

A years-long accumulation of duplicated, broken and orphaned image files has quietly degraded the city's public digital records, and the reckoning is only now beginning.

How we report this

Our reporters are based in Ballarat and cover local government, business and community. We are independently owned and editorially independent. Read our editorial standards →

By Ballarat News Desk · Published 5 July 2026, 4:57 am · 4 min read ·

Updated 5 July 2026, 1:57 pm

How Ballarat's Digital Archives Ended Up Full of Ghost Images — and What Happens Next
Photo: Photo by Jigar Patel on Pexels

Ballarat's publicly accessible digital image repositories — spread across council websites, the Art Gallery of Ballarat's online collection portal, and the Sovereign Hill digital heritage archive — contain thousands of duplicate entries that have made reliable search and retrieval increasingly difficult. The problem has been building for close to a decade, and a coordinated remediation effort is now underway across several institutions.

The issue matters now because Ballarat is in the middle of a significant push to digitise and promote its gold-rush heritage identity, with state and federal cultural funding tied to public access metrics. If the underlying image infrastructure is unreliable — returning duplicate thumbnails, broken links or conflicting metadata — those access numbers become meaningless, and the funding case weakens.

How the Duplication Problem Accumulated

The roots are straightforward. Between roughly 2014 and 2022, multiple content management system migrations — including a major overhaul of the City of Ballarat's website platform around 2019 — left behind orphaned image files each time. Staff uploading content to the new system frequently re-uploaded assets that already existed in legacy folders, because search tools within the CMS were not reliably indexed. The Art Gallery of Ballarat, located on Lydiard Street North, underwent a parallel digitisation sprint around 2020 as part of a broader Victorian collections access initiative, and a similar pattern emerged: items scanned from the permanent collection were sometimes logged under two or three catalogue entries simultaneously.

Sovereign Hill, on Bradshaw Street in the city's south, built its own digital learning portal during the COVID-19 period to serve school groups locked out of the physical site. Image assets drawn from its photographic and artefact collection were uploaded rapidly under deadline pressure, with duplicate detection either disabled or absent from the workflow. By the time on-site programming resumed in 2022, the portal held an estimated several hundred duplicate image records — a figure the organisation has since been working to resolve internally, according to publicly available documentation on its digital learning program.

The Ballarat Heritage Office, which operates under the City of Ballarat and manages records connected to properties across the central heritage precinct — including the Victorian-era streetscapes of Sturt Street and Armstrong Street — has faced a related but distinct problem: photographic survey images taken during heritage assessments were stored on shared network drives without consistent naming conventions, meaning multiple copies of the same property photograph accumulated across different assessment years without any automated deduplication.

The Cost of Inaction Is Becoming Clearer

A 2024 audit of the Victorian Collections platform — the state-managed aggregator that draws on records from institutions including the Art Gallery of Ballarat — identified duplicate image records as one of three primary data quality issues affecting regional contributors. The audit was published by Museums Victoria as part of its collections stewardship reporting. Nationally, the Trove platform maintained by the National Library of Australia has published guidance noting that duplicate digital objects degrade discoverability scores and affect the weighting of search results for regional collections.

The practical cost is not abstract. The City of Ballarat allocated funding in its 2025-26 capital budget for digital infrastructure improvements, though the specific line items relating to content management were not broken out separately in the publicly released budget summary document. Staff time spent manually identifying and removing duplicate image records across multiple platforms represents an ongoing operational drain that several institutions have flagged in annual reports without attaching a dollar figure.

The remediation work now underway involves a phased approach: automated hash-matching tools to identify pixel-identical duplicates, followed by manual review of near-duplicates where metadata differs. For residents and researchers using services like the Art Gallery of Ballarat's online collection or the Sovereign Hill learning portal, the immediate practical advice is straightforward — if a search returns confusing or contradictory results for the same item, use the catalogue reference number rather than image title to locate authoritative records. Institutions are updating their public-facing guidance pages as the cleanup progresses, and the City of Ballarat's heritage team has indicated on its website that updated property photograph records for the Sturt Street precinct should be accessible through the standard heritage inquiry portal by the end of the 2026 calendar year.

Spread the word

Your reaction

Bookmark this story to your reading list.

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Ballarat

This article was produced by the The Daily Ballarat editorial desk and covers news in Ballarat. See our editorial standards for how we use AI.

The Daily Ballarat brief

The day's Ballarat news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Ballarat and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Ballarat news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Ballarat and accept our Privacy Policy. Unsubscribe anytime.

More from Ballarat

More from Ballarat

Enjoyed this story? Get tomorrow's briefing free.