Guides for Researchers
How to deal with non-digital data
The benefits of digitising data
What is it?
Not all research data are digital. Most researchers keep handwritten laboratory notebooks, journals and other materials such as surveys, paintings, fossils, minerals and tissue. However, non-digital data can be converted to a digital form in a variety of ways.
Why digitise data
- Increased efficiency: with a well-defined digitisation and document imaging plan, stakeholders can easily share, compare, exchange and access documents and collaborate on them, which reduces wasted time and effort.
- Cost efficiency: the cost of printing and paperwork can be excessive, and digitising reduces it.
- Ease of access: digitised objects can be accessed through the cloud or a network from any device with an internet connection, anytime and anywhere.
- Security: file permissions and access rights can be defined for specific users or groups if needed. This increases security and maintains the confidentiality of the document.
- Long-term preservation: physical objects and information stored on, for example, paper are subject to degradation, and degrade further every time they are handled. A digital copy reduces the need to handle the original.
- Data recovery: in the event of a natural or man-made disaster, physical objects (data) can easily be lost or damaged. Digital copies held in the cloud or in managed services are not subject to the same threats, which reduces the risk.
Digitising versus born-digital
Golden copy
Retention period
Typically, basic research data and related material should be retained for a minimum of 10 years after the study has been completed. Clinical research data should be preserved for at least 20 years after the study has been completed.
Please keep in mind that national legislation or codes of conduct may impose different periods.
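The retention periods above can be turned into a simple date calculation. The following is a minimal sketch only: the function name, the category labels "basic" and "clinical", and the handling of 29 February are assumptions made for illustration, and the periods should be replaced by whatever your national legislation or code of conduct requires.

```python
from datetime import date

# Minimum retention periods in years, taken from the guidance above.
# National legislation or codes of conduct may impose different values.
MIN_RETENTION_YEARS = {"basic": 10, "clinical": 20}


def earliest_disposal_date(study_completed: date, data_type: str) -> date:
    """Return the first date on which the minimum retention period has elapsed."""
    years = MIN_RETENTION_YEARS[data_type]
    try:
        return study_completed.replace(year=study_completed.year + years)
    except ValueError:
        # 29 February has no counterpart in a non-leap year; use 1 March (assumption).
        return date(study_completed.year + years, 3, 1)


print(earliest_disposal_date(date(2024, 6, 30), "basic"))     # 2034-06-30
print(earliest_disposal_date(date(2024, 6, 30), "clinical"))  # 2044-06-30
```

The result is the earliest date on which disposal could be considered, not a date on which the data must be destroyed.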
Text digitising
Anything stored on paper can be scanned easily. For best results, scan your paper document as an image file, preferably TIFF, or as a PDF. A PDF can be made searchable by means of Optical Character Recognition (OCR), a process that converts text contained in images into editable text.
If you are scanning sensitive data, we recommend scanning to an encrypted USB drive rather than to your network drive. However, you should transfer the data to a secured network storage space as soon as possible, then store the USB drive securely until you are ready to ask your IT support to perform secure data deletion.
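When transferring scans from the USB drive to network storage, it can help to confirm that each copy is complete before the drive is handed over for deletion. The sketch below is one possible way to do this with a SHA-256 checksum; the checksum step, the function names and the folder and file names are assumptions for illustration and are not part of the guidance above. It does not perform secure deletion, which remains a task for your IT support.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1024 * 1024), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copy_and_verify(source: Path, destination_dir: Path) -> Path:
    """Copy source into destination_dir and confirm the copy is identical."""
    destination_dir.mkdir(parents=True, exist_ok=True)
    target = destination_dir / source.name
    shutil.copy2(source, target)
    if sha256_of(source) != sha256_of(target):
        raise IOError(f"Checksum mismatch for {source.name}")
    return target


# Demonstration with temporary folders standing in for the USB drive
# and the secured network storage (both hypothetical locations).
with tempfile.TemporaryDirectory() as tmp:
    usb = Path(tmp) / "usb"
    network = Path(tmp) / "network"
    usb.mkdir()
    scan = usb / "consent_form_001.tif"
    scan.write_bytes(b"example scan contents")
    copied = copy_and_verify(scan, network)
    print(copied.name, "verified")
```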
Video and audio digitising
Real-world objects
Those who carry out material structure studies, or restoration of museum pieces, often work with real objects such as biological material, fabrics, fossils and paintings. Digital technology allows them to observe these items in several ways.
Cell and tissue specimens may exist as intact structures or may have been sectioned and adhered to glass slides. Intact specimens can be 3D scanned using various forms of tomography, with resolutions ranging from sub-nanometre to millimetre. This typically yields a 3D digital object which can then be sectioned virtually. In many cases, these methods also reveal physical characteristics or experimental signals that could not otherwise be observed. Sections are commonly photographed directly to create a digital record, as are paintings.
Other types of specimens, such as fossils and minerals, can also be subjected to tomographic imaging to produce 3D virtual objects. Where the experimental subject is very large, such as a building or an entire landscape, more expansive methods such as SONAR, RADAR and LIDAR are available.
If the real-world object is not easy to scan, the only remaining option may be to take one or more digital photographs at the highest resolution possible. Each image should be checked to make sure that it is of sufficient quality, provides enough detail, and is an accurate reflection of the real-world object.
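One way to judge whether a photograph provides enough detail is to estimate how many pixels cover the smallest feature you need to see. The sketch below is a rough illustration: the function names are hypothetical, it assumes the object fills the full width of the image, and the default threshold of two pixels per feature is an assumption rather than a recommended standard. It says nothing about focus, lighting or colour accuracy, which still need to be checked by eye.

```python
def pixels_per_mm(image_width_px: int, object_width_mm: float) -> float:
    """Sampling density when the object spans the full image width."""
    return image_width_px / object_width_mm


def resolves_feature(image_width_px: int, object_width_mm: float,
                     smallest_feature_mm: float,
                     min_pixels_per_feature: float = 2.0) -> bool:
    """True if the smallest feature of interest covers enough pixels."""
    density = pixels_per_mm(image_width_px, object_width_mm)
    return density * smallest_feature_mm >= min_pixels_per_feature


# A 6000-pixel-wide photograph of a 300 mm wide painting: 20 pixels per mm.
print(pixels_per_mm(6000, 300.0))            # 20.0
print(resolves_feature(6000, 300.0, 0.5))    # True: 0.5 mm spans 10 pixels
print(resolves_feature(6000, 300.0, 0.05))   # False: 0.05 mm spans 1 pixel
```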
Advice for managing data in hard copy
- Appoint an archive administrator and a deputy, and define clear key management duties.
- Make sure that only the archive administrator and their deputy have access to the physical part of the archive.
- Make sure that there is a clear procedure in place for archiving and reusing documents.
- A good archive is well organised. Use archive boxes and label them with the project name and number. Show information such as the name of the project leader and a serial number. For more information on documenting a paper archive, see the EAD standard for Encoded Archival Description.
- Create a corresponding digital database and log the most relevant information. Indicate the conditions under which the contents of the archive can be accessed, as well as those governing their use. Keep a clear record of who has used the archive.
- Separate anonymised data from informed consent forms, for example, by keeping the forms in a locked cabinet in the archive.
- Keep the archive clean and up to date. Make sure that the contents of the physical archive match the records in the corresponding digital and hard-copy databases.
- Make sure that the procedures governing the removal and destruction of archive materials are clear.
- Records storage areas should be secure, clean, organised, safe, dry and accessible. If possible, the records storage should be a fireproof room. Such rooms are expensive, but very effective if set up properly, with fire resistance for a minimum of four hours, a fire detection system, temperature and humidity control, dust-free conditions, and a secure locking device or controlled access system.
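The advice above on keeping a corresponding digital database, recording who has used the archive, and checking that the physical archive matches the records could be sketched as follows. This is an illustrative outline only: the table layout, column names, box serial numbers and the reconcile function are assumptions, and SQLite is used simply because it ships with Python.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # use a file path for a persistent database
conn.executescript("""
    CREATE TABLE box (
        serial_number     TEXT PRIMARY KEY,
        project_name      TEXT NOT NULL,
        project_number    TEXT NOT NULL,
        project_leader    TEXT NOT NULL,
        access_conditions TEXT NOT NULL
    );
    CREATE TABLE access_log (
        serial_number TEXT NOT NULL REFERENCES box(serial_number),
        used_by       TEXT NOT NULL,
        used_on       TEXT NOT NULL  -- ISO 8601 date
    );
""")

# Example records (all values are made up for the illustration).
conn.executemany(
    "INSERT INTO box VALUES (?, ?, ?, ?, ?)",
    [
        ("BOX-001", "Soil survey", "P-17", "A. Researcher", "Project team only"),
        ("BOX-002", "Soil survey", "P-17", "A. Researcher", "Project team only"),
    ],
)
conn.execute(
    "INSERT INTO access_log VALUES (?, ?, ?)",
    ("BOX-001", "Deputy administrator", "2024-03-05"),
)


def reconcile(conn: sqlite3.Connection,
              boxes_on_shelf: list[str]) -> tuple[list[str], list[str]]:
    """Compare database records with the boxes actually found in the archive.

    Returns (recorded but not on the shelf, on the shelf but not recorded).
    """
    recorded = {row[0] for row in conn.execute("SELECT serial_number FROM box")}
    on_shelf = set(boxes_on_shelf)
    return sorted(recorded - on_shelf), sorted(on_shelf - recorded)


missing, unrecorded = reconcile(conn, ["BOX-001", "BOX-003"])
print(missing)     # ['BOX-002']
print(unrecorded)  # ['BOX-003']
```

Running such a check periodically gives the archive administrator a short list of discrepancies to investigate.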
References
- 10 Advantages of Digitization and Data Capture You Must Know. Aptara Corp.
- Archiving non-digital data. Radboud University.
- Born-digital records and metadata. The National Archives.
- Good research practice: Principles and guidelines. Medical Research Council.
- What is a golden copy? The University of Edinburgh.
- Working with data: Non-digital data. University of Bath Library.