Skip to content

Commit 79a7230

Browse files
committed
edits to ep 2
1 parent d2ba594 commit 79a7230

1 file changed

Lines changed: 5 additions & 6 deletions

File tree

docs/02-data-prelude.md

Lines changed: 5 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -89,13 +89,12 @@ strains Cit+. Ultimately, we will use R to answer these questions:
8989
## How VCF files are generated
9090

9191
Publicly accessible sequencing files in FASTQ formats can be downloaded
92-
from NCBI SRA. However, at FASTQ files contain unaligned sequences of
93-
varying quality, and requires clean up and alignment steps for variants
94-
to be called from the reference genome.
92+
from NCBI SRA. However, FASTQ files contain unaligned sequences of
93+
varying quality, and require clean up and alignment steps before variants
94+
can be called from the reference genome.
95+
96+
There are five steps we must take to transform raw FASTQ files into variant calls (VCF files). At each of the five steps we will be using specialized, non-R based bioinformatics tools:
9597

96-
Five steps are taken to transform FASTQ files to variant calls contained
97-
in VCF files and at each step, specialized non-R based bioinformatics
98-
tools that are used:
9998

10099
<figure markdown>
101100
![image](figures/variant_calling_workflow.png){ width="250" }

0 commit comments

Comments
 (0)