Dozens of Ballarat residents have come forward this week to describe confusion, frustration and genuine concern after learning that the City of Ballarat's digital heritage image repository — a collection built over more than a decade to preserve the region's gold-rush history — has been compromised by a large-scale duplicate file problem that archivists are still working to untangle.
The issue surfaced publicly in late June when the Ballarat Heritage Office sent letters to community members who had donated digital scans of photographs, documents and artefacts, advising them that the repository had accumulated significant volumes of duplicate images across multiple upload cycles. Some donors were told their original contributions could not be verified as distinct from later copies.
Community members who spoke generally to The Daily Ballarat this week — without being named in this report at their request — described a consistent picture: they had contributed scans through the council's online portal over the past two to three years, often as part of outreach programs run through the Ballarat Library on Doveton Street and the Mining Exchange precinct on Lydiard Street North. Several said they had never received confirmation that their original files had been catalogued separately from duplicates created by the system itself.
One long-time Bakery Hill resident described spending nearly 40 hours scanning a collection of 19th-century mining lease documents her family had preserved, only to be told she may need to resubmit the material once the de-duplication process is complete. Another contributor, who said she had donated images through a program run at the Ballarat & Clarendon College community engagement office, said she was concerned the metadata linking photographs to their original family sources had been lost or scrambled during repeated file migrations.
Scale of the Problem Still Being Assessed
The City of Ballarat had not responded to a request for comment by publication time Saturday. The Ballarat Heritage Office's letter to donors, a copy of which was provided to this masthead, stated that the de-duplication process was expected to take until at least September 2026 to complete. The letter did not specify how many files were affected.
Archival industry guidance from the Australian Society of Archivists recommends that digital repositories run automated de-duplication checks at the point of ingestion — a standard that, if applied consistently, should prevent bulk accumulation of duplicate files. The cost of retrospective de-duplication on large collections can run into tens of thousands of dollars depending on collection size and the complexity of the metadata structure involved.
The Ballarat Historical Society, which operates out of the former Mechanics' Institute building on Sturt Street, confirmed it had been in contact with the Heritage Office but declined to characterise the conversations publicly while the assessment is ongoing.
For residents who donated material, the practical advice from archivists is consistent: retain all original physical items and do not destroy source documents on the assumption they are safely digitised. Anyone who donated through council-run programs between 2022 and 2025 should contact the Ballarat Heritage Office directly to request a record of what files were received under their name and whether those records remain individually identifiable in the system.
The broader lesson, according to digital preservation professionals, is that community-sourced heritage digitisation programs require clear file ingestion protocols from the outset — not just storage space and goodwill. Ballarat's gold heritage identity has long rested on the quality of its physical and documentary record. Whether this episode sets that record back depends entirely on how thoroughly the current audit is conducted and communicated.