Double-Zipping Files and Folders

From ADA Public Wiki
m (Dahaddican moved page Double-Zipping Files and Folders to Zipping and Encrypting Files and Folders: Update required since no longer need to Double-Zip all data files and some Supporting Documents if they are encrypted with a password since Data...)

Revision as of 22:37, 17 September 2019

Double-Zipping is required to keep files compatible with Dataverse and consistent with one another. It is a known issue that Dataverse Version 4.6.1 cannot directly ingest certain file types (SAS, SPSS, Stata, Excel and CSV) without removing some of their formatting. Dataverse Version 4.6.1 also adds an ‘Explorer’ function button to directly ingested files, which enables functionality that the ADA does not currently want. In addition, Dataverse strips away the first layer of zipping during the ingest process. For these reasons, all data files, and any supporting documentation files in the aforementioned formats, must be Double-Zipped prior to ingest to Dataverse. This leaves a single layer of zip on the files after ingest, so their integrity is retained and they remain in their original format without the Explorer function.

If there is an excessive number of files to upload, multiple files can be ingested as a single downloadable folder. For the same reasons as above, this folder must also be uploaded as a Double-Zipped folder.
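The Double-Zipping described above can also be illustrated in script form. The following is a minimal sketch only, using Python's standard `zipfile` module rather than the 7-Zip software the ADA recommends; the function names `zip_path` and `double_zip` and the `_double.zip` naming are hypothetical and are not an ADA convention. It assumes the output folder sits outside the folder being zipped.

```python
import zipfile
from pathlib import Path


def zip_path(source: Path, archive: Path) -> Path:
    """Write a single file, or every file under a folder, into a new zip archive."""
    with zipfile.ZipFile(archive, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        if source.is_dir():
            # Keep the folder name as the top-level entry inside the archive.
            for item in sorted(source.rglob("*")):
                if item.is_file():
                    zf.write(item, arcname=item.relative_to(source.parent))
        else:
            zf.write(source, arcname=source.name)
    return archive


def double_zip(source: Path, out_dir: Path) -> Path:
    """Zip `source`, then zip the resulting archive a second time.

    Hypothetical helper: the outer layer is the one Dataverse is described
    as stripping on ingest, which leaves the inner zip attached to the files.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    inner = out_dir / f"{source.name}.zip"
    outer = out_dir / f"{source.name}_double.zip"  # assumed naming, not an ADA rule
    zip_path(source, inner)
    zip_path(inner, outer)
    inner.unlink()  # only the double-zipped archive is kept for upload
    return outer
```

For example, `double_zip(Path("survey.sav"), Path("upload"))` would write `upload/survey.sav_double.zip`, whose only entry is `survey.sav.zip`. The same call works for a folder of files.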

Files and/or folders that are not correctly prepared may not be ingested correctly by Dataverse. Furthermore, any files and/or folders that ADA Staff find to have been uploaded incorrectly during their Quality Assurance checks will need to be rectified prior to publishing, which adds unnecessary delays to the publishing of the Dataverse.

7-Zip Software

The ADA recommends that all data files and certain supporting documents be encrypted using the 7-Zip open source software, which is free and is used by ADA Staff. Encryption ensures that the files and folders are protected from unauthorised disclosure during the Dataverse upload process. The software creates a container called an ‘archive’ that holds the files requiring protection. That archive can then be encrypted and password protected. Copies of the software can be obtained via the links at https://www.7-zip.org/.
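For those who prefer to script the encryption step, the sketch below shows one way the 7-Zip command line could be called from Python. It is an illustration under assumptions, not the ADA's documented procedure: it assumes the `7z` executable is installed and on the PATH, and the choice of the `.7z` archive format with header encryption (`-mhe=on`) is an assumption, as are the function names.

```python
import subprocess
from pathlib import Path
from typing import List


def build_7z_command(archive: Path, inputs: List[Path], password: str) -> List[str]:
    """Build a 7-Zip command that creates a password-protected archive.

    a        add files to an archive
    -t7z     use the 7z archive format (assumed; not specified by the ADA)
    -p...    set the password used for encryption
    -mhe=on  also encrypt the archive headers, which hides the file names
    """
    return [
        "7z", "a", "-t7z", f"-p{password}", "-mhe=on",
        str(archive), *[str(p) for p in inputs],
    ]


def encrypt_with_7z(archive: Path, inputs: List[Path], password: str) -> None:
    """Run the command; raises CalledProcessError if 7-Zip reports a failure."""
    subprocess.run(build_7z_command(archive, inputs, password), check=True)
```

Note that a password passed on the command line can be visible to other users of the same machine while the command runs, so the 7-Zip graphical interface may be preferable for sensitive material.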

How to Double-Zip

For instructions on how to Double-Zip files and folders using the 7-Zip software, refer to the page detailed below.