When conducting research, it is important to not only keep certain information confidential but to also retain data securely and make it available when needed. In order to effectively do this, there are a number of best practices you can follow:
You should store and backup the raw data that you collect. You should also store documents and analyses that you've produced or conducted. The following are high value data to store:
It's advisable to create a file naming system when creating and storing data. Consider creating a naming convention including:
Example: learningassessment_wilson_111623_testscores_2.csv
You may ask yourself what format you should store your data in. This can vary depending on the type of data you are storing. This chart from Ohio State University, provides a good breakdown of recommended formats.
Type of Data | Recommended Formats |
---|---|
Text |
|
Tables, spreadsheets, and databases |
|
Image Files |
|
Sound Files |
|
Video Files |
|
Databases |
|
Geospatial Data |
|
Web Data |
|
Web Archive |
|
Multidimensional Arrays |
|
E-books |
|
(File Format Basics, Ohio State University)
In general, you want to opt to using more open, preservable formats that aren't tied to single programs. For example, you could use:
Metadata provides documentation and context that allows your data to be re-usable. This includes file-level information and project information. It's important to collect metadata throughout the research process. Metadata often includes:
You can store metadata in a file naming schema, readme file, data dictionary, research notebooks, Standard Operation Procedures (SOPs), or project reports.