Ballarat's cultural institutions are sitting on a storage problem years in the making. Across organisations including the Art Gallery of Ballarat on Lydiard Street North and the Ballarat Heritage Office, collections managers have been quietly working through thousands of duplicate digital image files — the accumulated debris of at least three separate digitisation waves stretching back to the early 2000s.
The issue matters now because funding is attached to getting it right. The Victorian Government's Regional Collections Access Program, which channels grants to institutions outside Melbourne, requires recipient organisations to demonstrate clean, deduplicated catalogues before additional capital rounds open. For Ballarat, where Sovereign Hill, the Ballarat Historical Society and the Ballarat Mechanics' Institute all maintain separate image libraries, the scale of the duplication problem has become a genuine administrative bottleneck.
Three Digitisation Waves, Three Different Systems
The roots of the problem trace to 2003, when a State Library Victoria-backed initiative pushed regional galleries to scan physical holdings onto CD-ROM and then early network drives. A second push came around 2011, when the federal government's digitisation component of the Cultural Connections grants program funded newer flatbed and overhead scanning equipment for institutions across central Victoria. A third wave followed the 2015 rollout of Collections Victoria, which encouraged institutions to upload assets to a shared discovery portal but provided no mandatory deduplication standard before upload.
Each wave used different file naming conventions, resolution benchmarks and metadata schemas. A single photograph of the 1854 Eureka Stockade site, for example, might exist as a 72 dpi JPEG from 2003, a 300 dpi TIFF from 2011 and a compressed PNG uploaded to Collections Victoria in 2016 — each filed under a different catalogue number and carrying slightly different descriptive metadata. Multiply that across tens of thousands of heritage images and the storage and retrieval costs compound quickly.
The Ballarat Mechanics' Institute, operating out of its building on Sturt Street since 1859, reportedly flagged the duplication problem to peak bodies as early as 2018, but no coordinated remediation program was funded until recently. The Art Gallery of Ballarat similarly identified redundant files during a 2022 internal audit connected to its building redevelopment project.
The Cost of Doing Nothing
Digital storage is not free. Cloud archiving for cultural collections at institutional grade — the kind that meets Public Records Victoria standards for long-term preservation — typically runs at between $800 and $2,400 per terabyte annually depending on redundancy tier, according to publicly available pricing schedules from providers used in the government sector. When an institution is storing three or four copies of the same image across different systems, those costs multiply proportionally.
Beyond storage, the retrieval problem is arguably more damaging. Researchers visiting Sovereign Hill's research centre on Bradshaw Street, or historians working through the Ballarat Heritage Office on Mair Street, can spend hours chasing the same image across multiple catalogue entries before confirming they are looking at the same object. That friction slows scholarship and reduces the practical value of digitisation investment that, in some cases, cost regional institutions tens of thousands of dollars per project.
The broader Victorian context adds urgency. Sydney's record-breaking June heat, the hottest since 1859 according to reporting this week, is a reminder that physical collections in climate-stressed regional facilities face real preservation risk — making accurate, clean digital surrogates more important, not less.
Institutions in Ballarat working through duplicate remediation are being advised to adopt the SPECTRUM 5.0 collections management standard, which includes a formal deduplication and merge protocol, before submitting to the next Regional Collections Access round. The practical steps include establishing a single master file hierarchy, retiring orphaned catalogue records and documenting provenance for each canonical image. The Ballarat Regional Library Corporation, which operates branches including the Ballarat Library on Doveton Street North, has offered to host a joint working session for smaller organisations that lack in-house collections staff to manage the process independently. Dates have not yet been publicly confirmed.