How to Run the Workflow
Follow these instructions to run the Nextflow Casanovo workflow from your system. Your system must remain on for as long as the workflow is running: even when the steps execute on a remote system (such as AWS Batch), your system still orchestrates them.
Important
You must set up your system to run Nextflow first. All commands are typed at the command line. See How to Install the Workflow for setup instructions, including how to get a command line.
Follow these steps to run a workflow:
Create a directory that will be the “home” directory for this search. Example commands:
cd
mkdir my-nextflow-run
cd my-nextflow-run
This will create a directory named my-nextflow-run in your home directory and move into that directory.
Copy in or create a pipeline.config file. A template can be found at: https://raw.githubusercontent.com/mriffle/nf-ms-dda-casanovo/main/resources/pipeline.config
Example command:
wget https://raw.githubusercontent.com/mriffle/nf-ms-dda-casanovo/main/resources/pipeline.config
You may edit this config file in two ways:
Command Line:
nano pipeline.config
Use the commands displayed at the bottom of the window to save the file and close the editor when you are done. They will be Control-O and <Enter> to save and Control-X to exit.
GUI Editor in your Operating System:
MacOS or Linux: You can directly edit ~/my-nextflow-run/pipeline.config using your favorite GUI editor.
Windows: The file is a little tricky to find. In your file open dialogue, type in \\wsl$\ and hit Enter. This should reveal an Ubuntu-22.04 directory (or something close to it). Go into that and double-click on home, then double-click on your username, then my-nextflow-run. The pipeline.config file should be present and you can edit it like a normal file.
Important
For a complete description of all parameters see Workflow Parameters.
Copy in or create a Casanovo params file. A template can be found at: https://raw.githubusercontent.com/mriffle/nf-ms-dda-casanovo/main/resources/casanovo.yaml
This is the file used to configure Casanovo. It contains many settings that may be reviewed and changed for your particular data.
Example command:
wget https://raw.githubusercontent.com/mriffle/nf-ms-dda-casanovo/main/resources/casanovo.yaml
You may edit this config file in two ways:
Command Line:
nano casanovo.yaml
Use the commands displayed at the bottom of the window to save the file and close the editor when you are done. They will be Control-O and <Enter> to save and Control-X to exit.
GUI Editor in your Operating System:
MacOS or Linux: You can directly edit ~/my-nextflow-run/casanovo.yaml using your favorite GUI editor.
Windows: The file is a little tricky to find. In your file open dialogue, type in \\wsl$\ and hit Enter. This should reveal an Ubuntu-22.04 directory (or something close to it). Go into that and double-click on home, then double-click on your username, then my-nextflow-run. The casanovo.yaml file should be present and you can edit it like a normal file.
Run the workflow.
Nextflow workflows may be run with a variety of executors. Executors are what run the actual steps of the pipeline; that is, they are the systems on which steps like Casanovo and msconvert will be run. Examples of executors are your local computer, a computer cluster, or AWS Batch. The example below describes how to run the workflow using your local system or AWS Batch as the executor.
A good first step is to ensure you have the latest version of the workflow. Execute this command:
nextflow pull -r main mriffle/nf-ms-dda-casanovo
Then, to run the steps of the workflow on your local computer, execute this command:
nextflow run -resume -r main mriffle/nf-ms-dda-casanovo -c pipeline.config
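Before launching, it can help to confirm that the files created in the earlier steps are actually in the current directory. The following is a minimal sketch, not part of the workflow itself: the function name check_inputs is hypothetical, and it assumes both files use the names from the examples above and sit in the directory you launch from.

```shell
# Preflight check (hypothetical helper): confirm that the config files
# from the earlier steps exist in the current directory and are non-empty.
# The file names are the ones used in this guide; adjust if yours differ.
check_inputs() {
    missing=0
    for f in pipeline.config casanovo.yaml; do
        if [ ! -s "$f" ]; then
            echo "missing or empty: $f" >&2
            missing=1
        fi
    done
    return "$missing"
}
```

With the function defined in your shell, a launch could then be written as: check_inputs && nextflow run -resume -r main mriffle/nf-ms-dda-casanovo -c pipeline.config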
Note
It is important to launch only one workflow at a time if you are running on your local computer. Launching multiple workflows at once will result in multiple instances of the programs running at once. To run multiple workflows at once on your local system, you will need to implement a queuing system such as Slurm and use the Slurm executor.
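If you want a safeguard against accidentally starting a second local run, one option is a simple lock directory. This is only a sketch under stated assumptions: the function names and the lock location are hypothetical, nothing here is provided by Nextflow or the workflow, and a run that is killed may leave the lock behind, in which case the directory has to be removed by hand.

```shell
# Hypothetical guard against launching two local workflows at once.
# mkdir either creates the directory or fails if it already exists,
# so it can serve as a simple lock.
LOCK_DIR="${LOCK_DIR:-${TMPDIR:-/tmp}/nf-casanovo.lock}"   # assumed location

acquire_lock() {
    if mkdir "$LOCK_DIR" 2>/dev/null; then
        return 0
    fi
    echo "another workflow appears to be running (lock: $LOCK_DIR)" >&2
    return 1
}

release_lock() {
    rmdir "$LOCK_DIR" 2>/dev/null
}
```

A guarded launch could then look like: acquire_lock && { nextflow run -resume -r main mriffle/nf-ms-dda-casanovo -c pipeline.config; release_lock; }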
Alternatively, to run the workflow using AWS Batch, execute a command similar to:
nextflow run -resume -r main -profile aws mriffle/nf-ms-dda-casanovo -bucket-dir s3://bucket/dir -c pipeline.config
Important
You must set up an AWS Batch cluster before running on AWS Batch. See How to Setup and Configure AWS Batch for more details about how to set up an AWS Batch cluster and the resulting parameters to set in your pipeline.config file.
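Because s3://bucket/dir in the AWS Batch command is a placeholder, it is easy to launch with a mistyped location. The sketch below prints the full command for a given S3 location so you can review it before running it. The function name aws_run_command is hypothetical, and the prefix check is only a basic sanity test; it does not verify that the bucket exists or that you can write to it.

```shell
# Hypothetical helper: print the AWS Batch launch command for a given
# S3 work directory. Only the flags shown in this guide are used.
aws_run_command() {
    bucket_dir="$1"
    case "$bucket_dir" in
        s3://?*) ;;   # looks like an S3 URI, continue
        *) echo "bucket dir must start with s3://" >&2; return 1 ;;
    esac
    printf 'nextflow run -resume -r main -profile aws mriffle/nf-ms-dda-casanovo -bucket-dir %s -c pipeline.config\n' "$bucket_dir"
}
```

Calling aws_run_command with your own S3 location prints the command; once it looks right, paste it at the command line to start the run.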
Retrieve results.
Your results will appear in the results subdirectory of your current directory. See Output & Results for more information about the results that are generated.
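To see what a run produced, you can list the files under the results directory. This is a small sketch: the function name show_results is hypothetical, and it assumes you are still in the directory the workflow was launched from.

```shell
# Hypothetical helper: list every file under the results directory
# (or another directory passed as the first argument), sorted by path.
show_results() {
    dir="${1:-results}"
    if [ ! -d "$dir" ]; then
        echo "no $dir directory here; was the workflow run from this directory?" >&2
        return 1
    fi
    find "$dir" -type f | sort
}
```

Running show_results with no arguments lists the contents of results; passing a different path lists that directory instead.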