All posts by Paul

Keeping our Data Consistent

The consistency and integrity of data are essential for any digital archive. For the past few months we have therefore been running a series of programs to test the consistency of our file system and database and to identify any other problems. This work started when we decided to develop a program to test all the checksums in our file system. The idea was to run the program every few months in order to identify any checksums that had changed since the last run.

Part of a checksum report.

In addition, the program would test the checksums in the file system against the checksums in the database so that we could be sure that they were synchronised. The program took a few weeks to develop and has now been run several times. Each run produces a report which shows any checksum changes in the file system and the database. Happily, only a few checksums have been flagged up in the reports so far, and there have usually been good reasons for the changes.
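
As an illustration only, here is a minimal Python sketch of the kind of check described above. It is not the program itself: the choice of SHA-256, the function names (`file_checksum`, `scan_checksums`, `build_report`) and the use of plain dictionaries to stand in for the previous run and the database are all assumptions made for the example.

```python
import hashlib
from pathlib import Path


def file_checksum(path, algorithm="sha256", chunk_size=1 << 20):
    """Hash a file in chunks so that large files need not fit in memory.

    SHA-256 is an assumption; the archive may use a different algorithm.
    """
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def scan_checksums(root):
    """Return {relative path: checksum} for every file under root."""
    root = Path(root)
    return {
        p.relative_to(root).as_posix(): file_checksum(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def build_report(previous, current, database):
    """Compare the current scan with the last run and with the database.

    All three arguments are hypothetical {path: checksum} dictionaries.
    """
    return {
        # Files present in both runs whose checksum is now different.
        "changed_since_last_run": sorted(
            k for k in current if k in previous and previous[k] != current[k]
        ),
        # Files whose checksum on disk disagrees with the database record.
        "database_mismatch": sorted(
            k for k in current if k in database and database[k] != current[k]
        ),
        # Files on disk with no database record at all.
        "missing_from_database": sorted(k for k in current if k not in database),
    }
```

In this sketch, each run would save the result of `scan_checksums` so that the next run can pass it in as `previous`. Any path listed in the report is a prompt for investigation rather than proof of corruption, since a file may have been changed for a good reason.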

