Introduction: Data sharing in clinical research is critical for increasing knowledge discovery. Data and software tools should be FAIR: Findable, Accessible, Inter-operable and Re-usable. Many bottlenecks exist in the process of a clinical investigator using shared data including data acquisition and statistical analysis. The objective of this project is to develop a structure for sharing data and providing rapid automated statistical analysis through creation of a pre-packaged, open-source software container.
Methods: We use the open source software container technologies VirtualBox and Vagrant to create a template for sharing clinical data and analysis scripts as a single container. We use a timer to record the time necessary to setup and initialize the software container and view the results.
Results: We have created a template for sharing data and analysis scripts together using open source software container technologies VirtualBox and Vagrant. We found the time needed to initialize the container to be 5 minutes and 36 seconds for a macOS-based machine and 7 minutes and 2 seconds for a Windows-based machine. Containers can be downloaded and executed from any Mac or Windows computer allowing both the reuse of and interaction with the data. This greatly reduces the time and effort needed to obtain and analyze clinical data.
Conclusion: Reducing the time and effort needed to obtain and analyze clinical data increases the time available for data exploration and the discovery of new knowledge. This can be effectively achieved using software containers and virtualization.
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.
Mattingly, William A. PhD; Furmanek, Stephen; Sinclair, Christopher M.; and Wiemken, Timothy L. PhD
"Distributing Data and Analysis Software Containers For Better Data Sharing in Clinical Research,"
The University of Louisville Journal of Respiratory Infections: Vol. 1
, Article 6.
Available at: https://ir.library.louisville.edu/jri/vol1/iss4/6
Community Health and Preventive Medicine Commons, Epidemiology Commons, Health Information Technology Commons, Influenza Humans Commons, Influenza Virus Vaccines Commons, International Public Health Commons, Translational Medical Research Commons