Welcome to Scribd!

Skip carousel

Making reproducible workflows with Nextflow features like portability, scalability and platform-agnostic execution

Uploaded by

chandrika

0% found this document useful (0 votes)

10 views14 pages

Original Title

nextflow

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

10 views14 pages

Making reproducible workflows with Nextflow features like portability, scalability and platform-agnostic execution

Uploaded by

chandrika

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 14

Search inside document

Making

reproducible workflows with

Nextflow features
Generalisable

Portable

Scalable

Platform-agnostic

Based on Groovy and Java

Large active community in e.g. nf-core

Concepts and nomenclature

Channels contain data, e.g. input files

Processes run some kind of code, e.g. a script or a command-line program

Tasks are instances of a process, one per process input

Anatomy of a process
process GET_SRA_BY_ACCESSION {

input:
val(sample)

output:
path("${sample}.fastq.gz")

script:
"""
fastq-dump ${sample} > ${sample}.fastq.gz
"""
}
Anatomy of a process
process GET_SRA_BY_ACCESSION {

input:
val(sample)

output:
tuple val(sample), path("${sample}.fastq.gz")

script:
"""
fastq-dump ${sample} > ${sample}.fastq.gz
"""
}
Anatomy of a process
process GET_SRA_BY_ACCESSION {

cpus 2
memory '8 GB'

input:
val(sample)

output:
tuple val(sample), path("${sample}.fastq.gz")

script:
"""
fastq-dump ${sample} > ${sample}.fastq.gz
"""
}
Anatomy of a process
process GET_SRA_BY_ACCESSION {

cpus 2
memory '8 GB'

conda 'sra-tools=2.11.0'
container 'ncbi/sra-tools:2.11.0'

input:
val(sample)

output:
tuple val(sample), path("${sample}.fastq.gz")

script:
"""
fastq-dump ${sample} > ${sample}.fastq.gz
"""
}
Anatomy of a process
process GET_SRA_BY_ACCESSION {

cpus 2
memory '8 GB'

conda 'sra-tools=2.11.0'
container 'ncbi/sra-tools:2.11.0'

input:
val(sample)

output:
tuple val(sample), path("${sample}.fastq.gz")

script:
"""
fastq-dump ${sample} -X {params.depth} > ${sample}.fastq.gz
"""
}
Anatomy of a workflow
workflow {

// Define SRA input data channel

ch_sra_ids = Channel.fromList ( ["SRR935090", "SRR935091", "SRR935092"] )

// Define the workflow

GET_SRA_BY_ACCESSION (
ch_sra_ids
)
}
Anatomy of a workflow
workflow {

// Define SRA input data channel

ch_sra_ids = Channel.fromList ( ["SRR935090", "SRR935091", "SRR935092"] )

// Define the workflow

GET_SRA_BY_ACCESSION (
ch_sra_ids
)
RUN_FASTQC (
GET_SRA_BY_ACCESSION.out
)
}
Anatomy of a workflow
workflow {

// Define SRA input data channel

ch_sra_ids = Channel.fromList ( ["SRR935090", "SRR935091", "SRR935092"] )

// Define the workflow

GET_SRA_BY_ACCESSION (
ch_sra_ids
)
RUN_FASTQC (
GET_SRA_BY_ACCESSION.out
)
RUN_MULTIQC (
RUN_FASTQC.out.collect()
)
}
Executing Nextflow
# Execute a workflow
$ nextflow run main.nf

# Re-run using cached results

$ nextflow run main.nf -resume

# Execute with a specific configuration file

$ nextflow run main.nf -c nextflow.config

# Supply a custom parameter

$ nextflow run main.nf --my_param "my value"

# Use Docker or Singularity

$ nextflow run main.nf -with-docker
$ nextflow run main.nf -with-singularity

# Use a pre-defined configuration profile

$ nextflow run main.nf -profile uppmax
Differences between Snakemake and Nextflow

Snakemake Nextflow

Language Python Groovy

Data Everything is a file Can use both files and values
Execution Working directory Each job in its own directory
Philosophy "Pull" "Push"
Dry-runs Yes No
Track code changes No Yes

Question: But, which one is the best?

Answer: Both - it's mostly up to personal preference!

Questions?

50 Recipes for Programming Node.js
From Everand
50 Recipes for Programming Node.js
Jamie Munro
Rating: 3 out of 5 stars
3/5 (4)
Create An Spark Streaming App: 1. Architecture and Abstraction
Document8 pages
Create An Spark Streaming App: 1. Architecture and Abstraction
Ngô Hoàng
No ratings yet
Fast Data Processing Systems with SMACK Stack
From Everand
Fast Data Processing Systems with SMACK Stack
Raúl Estrada
No ratings yet
More Linux Commands - A Practical Reference
Document3 pages
More Linux Commands - A Practical Reference
grnarayanan.rama9925
No ratings yet
Spark Scala Crash Course Guide
Document21 pages
Spark Scala Crash Course Guide
SlavimirVesić
No ratings yet
Cluster Setup
Document4 pages
Cluster Setup
Suhaimi Mie
No ratings yet
Premake4 Lua
Document5 pages
Premake4 Lua
api-216952651
No ratings yet
Ece4750 Cheat Sheet
Document2 pages
Ece4750 Cheat Sheet
WangAlex
No ratings yet
List Any Four Advantages of PHP ?
Document15 pages
List Any Four Advantages of PHP ?
Tanmay Gore
No ratings yet
Unix Shell Scripting
Document10 pages
Unix Shell Scripting
mona257
No ratings yet
Parallel Processing with Runspaces
Document12 pages
Parallel Processing with Runspaces
Sean Kelley
No ratings yet
Advanced Research Techniques
Document35 pages
Advanced Research Techniques
Shiva Lord
No ratings yet
Sqlmap Cheat Sheet for SQL Injection Testing
Document12 pages
Sqlmap Cheat Sheet for SQL Injection Testing
Stavros T.
No ratings yet
Hadoop Training in Hyderabad
Document49 pages
Hadoop Training in Hyderabad
kellytechnologies
No ratings yet
400 Troubleshoots Sage-ERP-X3 - Configuration-Console Wiki
Document6 pages
400 Troubleshoots Sage-ERP-X3 - Configuration-Console Wiki
fsussan
No ratings yet
Penetration Testing Tools and COmmands
Document3 pages
Penetration Testing Tools and COmmands
qooarx
No ratings yet
Pyspark-1 6 1
Document32 pages
Pyspark-1 6 1
Matthew Reach
No ratings yet
Cadence To Matlab Tutorial
Document3 pages
Cadence To Matlab Tutorial
petricli
No ratings yet
Advanced Research Techniques Oracle
Document33 pages
Advanced Research Techniques Oracle
rockerabc123
No ratings yet
HIP2019-Advanced XXE Exploitation
Document30 pages
HIP2019-Advanced XXE Exploitation
Jesse Ou
No ratings yet
OSCP Methodology Dropbox Paper
Document15 pages
OSCP Methodology Dropbox Paper
Wiraki Ahemad
No ratings yet
Script Reference: Define Artillery Test Config & Scenarios
Document9 pages
Script Reference: Define Artillery Test Config & Scenarios
kjnjkn
No ratings yet
Program Structure:: BEGIN (Awk-Commands)
Document10 pages
Program Structure:: BEGIN (Awk-Commands)
VenuGopal Kavuluru
No ratings yet
Webservices - Axis: 1.1. Table of Contents
Document18 pages
Webservices - Axis: 1.1. Table of Contents
arnaldo.ayala
No ratings yet
Linux File Processing
Document4 pages
Linux File Processing
Maher Mechi
No ratings yet
Introduction To Wireless Simulations: Shao-Cheng Wang
Document22 pages
Introduction To Wireless Simulations: Shao-Cheng Wang
Khadidja IZNASNI
No ratings yet
Windows Powershell in Action
Document45 pages
Windows Powershell in Action
Vincent Martinez
No ratings yet
Database Name Change
Document4 pages
Database Name Change
babitae
No ratings yet
Ubuntu
Document56 pages
Ubuntu
Hường Đinh Thị
No ratings yet
Axis Reference Guide
Document19 pages
Axis Reference Guide
V P
No ratings yet
Check Proxy Will Be Working or Not
Document5 pages
Check Proxy Will Be Working or Not
Ramesh Vanka
No ratings yet
Golismero Cheat Sheet
Document9 pages
Golismero Cheat Sheet
Javier Ruiz
No ratings yet
Apache Spark Streaming Presentation
Document28 pages
Apache Spark Streaming Presentation
SlavimirVesić
100% (1)
JSON Presentation
Document26 pages
JSON Presentation
Gilbert Einstein
No ratings yet
Index: S.No Name of Program Date Remark
Document27 pages
Index: S.No Name of Program Date Remark
Aman Gupta
No ratings yet
Cheat Sheet
Document143 pages
Cheat Sheet
test slebew
No ratings yet
119CS0169 NS Lab 4
Document6 pages
119CS0169 NS Lab 4
Darshan Singh
No ratings yet
Docker Provider: Example Usage
Document34 pages
Docker Provider: Example Usage
Balakrishna manjula
No ratings yet
DBA Sheet v7.0
Document547 pages
DBA Sheet v7.0
Sunil J Shet
No ratings yet
Vps Colab Mining
Document6 pages
Vps Colab Mining
anak bawang
No ratings yet
NS Examples For Wireless Scenario
Document44 pages
NS Examples For Wireless Scenario
Satish Kumar
No ratings yet
Scala and Spark Tutorial: Big Data Frameworks
Document29 pages
Scala and Spark Tutorial: Big Data Frameworks
Mauricio Alejandro Arenas Arriagada
No ratings yet
Sqlmap
Document5 pages
Sqlmap
samin
No ratings yet
Spark Kafka Streaming
Document6 pages
Spark Kafka Streaming
ahmed_sft
No ratings yet
LINUX FUNDAMENTAL COMMANDS SUMMARY
Document5 pages
LINUX FUNDAMENTAL COMMANDS SUMMARY
Datta Tayde
No ratings yet
Search: PHP: Objetos - Manual
Document19 pages
Search: PHP: Objetos - Manual
perexwi
No ratings yet
Node Classification with GNNs on Cora Dataset
Document135 pages
Node Classification with GNNs on Cora Dataset
Gopi Chand
No ratings yet
A Brief Intro To: Scala
Document56 pages
A Brief Intro To: Scala
Tooba Sami Arifeen
No ratings yet
An implementation of RSA and ElGamal PKCs using Java BigInteger class
Document20 pages
An implementation of RSA and ElGamal PKCs using Java BigInteger class
Venkat
No ratings yet
Lecture 4
Document37 pages
Lecture 4
Hạnh Hoàng Đình
No ratings yet
PHP Sample Question Paper with Answers
Document13 pages
PHP Sample Question Paper with Answers
Rameshwar
No ratings yet
Develop Web Apps Faster with Grails
Document43 pages
Develop Web Apps Faster with Grails
Alfredo Tovar
No ratings yet
1 Borghello
Document18 pages
1 Borghello
juan
No ratings yet
TripleO OpenStack
Document36 pages
TripleO OpenStack
baba roshan
No ratings yet
64 Prerna Jain Dspractassg11
Document8 pages
64 Prerna Jain Dspractassg11
Rahul Lavhande
No ratings yet
Lab1 Shell Interface
Document7 pages
Lab1 Shell Interface
douheng
No ratings yet
Archetype
Document6 pages
Archetype
sacascasc
No ratings yet
Uncovering Oracle's Hidden Utility ORADEBUG
Document26 pages
Uncovering Oracle's Hidden Utility ORADEBUG
Palash Sarkar
No ratings yet
Oracle Dataguard Monitoring Scripts
Document6 pages
Oracle Dataguard Monitoring Scripts
knugroho1982
No ratings yet
CN Simulation
Document13 pages
CN Simulation
vvce21cse0119
No ratings yet
PLAZA 3.0: An Access Point For Plant Comparative Genomics
Document8 pages
PLAZA 3.0: An Access Point For Plant Comparative Genomics
chandrika
No ratings yet
References: Pairing Region of The Xic
Document14 pages
References: Pairing Region of The Xic
chandrika
No ratings yet
Untitled Diagram
Document1 page
Untitled Diagram
chandrika
No ratings yet
Export
Document3 pages
Export
chandrika
No ratings yet
O Ficheirinho de Tostas
Document1 page
O Ficheirinho de Tostas
chandrika
No ratings yet
Eucgr.H04893 Eximage
Document1 page
Eucgr.H04893 Eximage
chandrika
No ratings yet
Untitled List
Document2 pages
Untitled List
chandrika
No ratings yet
Sample Paper
Document2 pages
Sample Paper
chandrika
No ratings yet
Niwana Wethatama 4
Document94 pages
Niwana Wethatama 4
chandrika
No ratings yet
Ethdala Daruwa
Document181 pages
Ethdala Daruwa
chandrika
No ratings yet
SolidWorks2018 PDF
Document1 page
SolidWorks2018 PDF
Awan D'almighty
No ratings yet
Difference Between Modbus and Profibus
Document2 pages
Difference Between Modbus and Profibus
Sureshraja9977
100% (2)
Sheet 2
Document15 pages
Sheet 2
Moaz Ahmed
No ratings yet
ASMO 2021 Technical Guide
Document2 pages
ASMO 2021 Technical Guide
Nancy Kawilarang
No ratings yet
Zebradesigner XML Release Notes v2509427
Document13 pages
Zebradesigner XML Release Notes v2509427
atacara
No ratings yet
Pip Hunter Ea Manual For PC Deriv
Document29 pages
Pip Hunter Ea Manual For PC Deriv
sudarmono
No ratings yet
BALVBUFDEL
Document2 pages
BALVBUFDEL
saravanamuthusamy
No ratings yet
Device Load Monitor With Programmable Meter For Energy Audit
Document3 pages
Device Load Monitor With Programmable Meter For Energy Audit
Mandeep G Kashyap
No ratings yet
DG-Stomp E
Document42 pages
DG-Stomp E
Gugu Batunga
No ratings yet
WMQ Services Healthcheck
Document29 pages
WMQ Services Healthcheck
Manuel Vega
No ratings yet
The Microprocessor and Its Architecture
Document26 pages
The Microprocessor and Its Architecture
bassam
No ratings yet
FLYJET Setup Manual ENG
Document21 pages
FLYJET Setup Manual ENG
tutankamon20010
No ratings yet
Grade 9 Assignment 2
Document7 pages
Grade 9 Assignment 2
Vaansh Dhirwani
No ratings yet
Date Deviates
Document7 pages
Date Deviates
Balaji Selvaraj
No ratings yet
2 4 Pillers of Iot
Document15 pages
2 4 Pillers of Iot
Shanaya chauhan
No ratings yet
Outwppaysb
Document3 pages
Outwppaysb
zafer nadeem
No ratings yet
AndeSight STD v5.2 IDE User Manual UM257 V1.1
Document658 pages
AndeSight STD v5.2 IDE User Manual UM257 V1.1
NPI Rentwise
No ratings yet
Ex-Functional Interfaces in Java
Document11 pages
Ex-Functional Interfaces in Java
Ravi Kumar
No ratings yet
Oferta Generala Hama - Martie 2020
Document116 pages
Oferta Generala Hama - Martie 2020
Bogdan Bold
No ratings yet
FortiOS v400 Release Notes
Document30 pages
FortiOS v400 Release Notes
JohnGranados
No ratings yet
OOP1
Document23 pages
OOP1
Ахметов Данияр
No ratings yet
The Spoken Tutorial Project
Document2 pages
The Spoken Tutorial Project
Dinesh
No ratings yet
System Software Lesson Plan
Document3 pages
System Software Lesson Plan
Mani Kantan
No ratings yet
CEA 201 - Full Flashcards - Quizlet
Document12 pages
CEA 201 - Full Flashcards - Quizlet
Tiến Thuận Nguyễn
No ratings yet
ACRONIME Calculatoare (Adi Pascu Si Ivan)
Document249 pages
ACRONIME Calculatoare (Adi Pascu Si Ivan)
bibliografie
100% (3)
Compiler Design
Document2 pages
Compiler Design
Nandhana SS
No ratings yet
William Stallings Computer Organization and Architecture 8 Edition
Document17 pages
William Stallings Computer Organization and Architecture 8 Edition
SalMan Zahoor
No ratings yet
Wings Ebiz Installation Guide - General
Document16 pages
Wings Ebiz Installation Guide - General
Roshan naidu
No ratings yet