Skip to content

Introduction

What is a workflow system and why use one?

Every computational workflow consists of multiple steps, starting with raw data and ending with summary data to plot and calculate statistics.

Workflow systems help you automate and manage the inputs, outputs, and commands for the analysis, thereby making it easier to maintain, reproduce, and share your workflow! Read more about workflow systems in this Nature Toolbox review and in the paper Streamlining Data-Intensive Biology with Workflow Systems.

What is Snakemake?

Snakemake is a Python-based workflow system (see 2012 publication). The name "Snakemake" comes from the fact that it's written in (and can be extended by) the Python programming language.

Snakemake works by looking at a file, called a "Snakefile", that contains rules for creating output files. Generally, each rule is defined as a step in the workflow. Snakemake uses the rules and command line options to figure out how the rules relate to each other so it can manage the workflow steps.

Let's get started!


Last update: May 13, 2021