Cribbs, A.P., Luna-Valero, S., George, C. et al. (10 more authors) (Submitted: 2019) CGAT-core: a python framework for building scalable, reproducible computational biology workflows. F1000Research. ISSN 2046-1402 (Submitted)
Abstract
In the genomics era computational biologists regularly need to process, analyse and integrate large and complex biomedical datasets. Analysis inevitably involves multiple dependent steps, resulting in complex pipelines or workflows, often with several branches. Large data volumes mean that processing needs to be quick and efficient and scientific rigour requires that analysis be consistent and fully reproducible. We have developed CGAT-core, a python package for the rapid construction of complex computational workflows. CGAT-core seamlessly handles parallelisation across high performance computing clusters, integration of Conda environments, full parameterisation, database integration and logging. To illustrate our workflow framework, we present a pipeline for the analysis of RNAseq data using pseudo-alignment.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2019 Cribbs AP et al. This is an open access article distributed under the terms of the Creative Commons Attribution Licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
Keywords: | workflow; pipeline; python; genomics |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Science (Sheffield) > School of Biosciences (Sheffield) > Department of Molecular Biology and Biotechnology (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 15 Apr 2019 14:52 |
Last Modified: | 15 Apr 2019 14:52 |
Status: | Submitted |
Publisher: | F1000Research |
Identification Number: | 10.12688/f1000research.18674.1 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:145016 |
Download
Filename: 44778928-1b92-4178-ab21-8bff2aa086af_18674_-_adam_cribbs.pdf
Licence: CC-BY 4.0