Skip to content

Commit

Permalink
Add changes for 5b7b490
Browse files Browse the repository at this point in the history
  • Loading branch information
actions-user committed Sep 2, 2024
1 parent 3b5d484 commit 092003c
Show file tree
Hide file tree
Showing 15 changed files with 192 additions and 111 deletions.
27 changes: 6 additions & 21 deletions 1- Library_preparation.html → 1- Library preparation.html
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/theme.css?v=19f00094" />


<link rel="canonical" href="https://fabianandradelozano.github.io/NGS-QC-Course/1- Library_preparation.html" />
<link rel="canonical" href="https://fabianandradelozano.github.io/NGS-QC-Course/1- Library preparation.html" />
<!--[if lt IE 9]>
<script src="_static/js/html5shiv.min.js"></script>
<![endif]-->
Expand All @@ -22,9 +22,7 @@
<script src="_static/js/theme.js"></script>
<link rel="author" title="About these documents" href="about.html" />
<link rel="index" title="Index" href="genindex.html" />
<link rel="search" title="Search" href="search.html" />
<link rel="next" title="2 Main Sequencing Technologies" href="2-%20Sequencing_technologies.html" />
<link rel="prev" title="About the course" href="about.html" />
<link rel="search" title="Search" href="search.html" />
</head>

<body class="wy-body-for-nav">
Expand All @@ -46,20 +44,10 @@
</form>
</div>
</div><div class="wy-menu wy-menu-vertical" data-spy="affix" role="navigation" aria-label="Navigation menu">
<ul class="current">
<ul>
<li class="toctree-l1"><a class="reference internal" href="about.html">About the course</a></li>
<li class="toctree-l1 current"><a class="current reference internal" href="#">1 Library preparation</a><ul>
<li class="toctree-l2"><a class="reference internal" href="#main-causes-of-poor-quality-data">Main Causes of poor quality data</a></li>
<li class="toctree-l2"><a class="reference internal" href="#id1">Library preparation</a></li>
<li class="toctree-l2"><a class="reference internal" href="#library-preparation-bias">Library preparation bias</a><ul>
<li class="toctree-l3"><a class="reference internal" href="#dna-library-bias">DNA library bias</a></li>
<li class="toctree-l3"><a class="reference internal" href="#rna-library-bias">RNA library bias</a></li>
</ul>
</li>
</ul>
</li>
<li class="toctree-l1"><a class="reference internal" href="2-%20Sequencing_technologies.html">2 Main Sequencing Technologies</a></li>
<li class="toctree-l1"><a class="reference internal" href="3-%20Quality%20Control%20and%20Preprocessing.html">3 Quality Control and Preprocessing</a></li>
<li class="toctree-l1"><a class="reference internal" href="4-%20Quality%20of%20the%20mapping.html">4 Quality of the Mapping</a></li>
</ul>

</div>
Expand All @@ -78,7 +66,7 @@
<li><a href="index.html" class="icon icon-home" aria-label="Home"></a></li>
<li class="breadcrumb-item active">1 Library preparation</li>
<li class="wy-breadcrumbs-aside">
<a href="_sources/1- Library_preparation.rst.txt" rel="nofollow"> View page source</a>
<a href="_sources/1- Library preparation.rst.txt" rel="nofollow"> View page source</a>
</li>
</ul>
<hr/>
Expand Down Expand Up @@ -240,10 +228,7 @@ <h3>RNA library bias<a class="headerlink" href="#rna-library-bias" title="Link t

</div>
</div>
<footer><div class="rst-footer-buttons" role="navigation" aria-label="Footer">
<a href="about.html" class="btn btn-neutral float-left" title="About the course" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left" aria-hidden="true"></span> Previous</a>
<a href="2-%20Sequencing_technologies.html" class="btn btn-neutral float-right" title="2 Main Sequencing Technologies" accesskey="n" rel="next">Next <span class="fa fa-arrow-circle-right" aria-hidden="true"></span></a>
</div>
<footer>

<hr/>

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@
<link rel="stylesheet" type="text/css" href="_static/css/theme.css?v=19f00094" />


<link rel="canonical" href="https://fabianandradelozano.github.io/NGS-QC-Course/2- Sequencing_technologies.html" />
<link rel="canonical" href="https://fabianandradelozano.github.io/NGS-QC-Course/2- Sequencing technologies.html" />
<!--[if lt IE 9]>
<script src="_static/js/html5shiv.min.js"></script>
<![endif]-->
Expand All @@ -22,9 +22,7 @@
<script src="_static/js/theme.js"></script>
<link rel="author" title="About these documents" href="about.html" />
<link rel="index" title="Index" href="genindex.html" />
<link rel="search" title="Search" href="search.html" />
<link rel="next" title="3 Quality Control and Preprocessing" href="3-%20Quality%20Control%20and%20Preprocessing.html" />
<link rel="prev" title="1 Library preparation" href="1-%20Library_preparation.html" />
<link rel="search" title="Search" href="search.html" />
</head>

<body class="wy-body-for-nav">
Expand All @@ -46,20 +44,10 @@
</form>
</div>
</div><div class="wy-menu wy-menu-vertical" data-spy="affix" role="navigation" aria-label="Navigation menu">
<ul class="current">
<ul>
<li class="toctree-l1"><a class="reference internal" href="about.html">About the course</a></li>
<li class="toctree-l1"><a class="reference internal" href="1-%20Library_preparation.html">1 Library preparation</a></li>
<li class="toctree-l1 current"><a class="current reference internal" href="#">2 Main Sequencing Technologies</a><ul>
<li class="toctree-l2"><a class="reference internal" href="#short-reads-sequencing-illumina">Short Reads sequencing (Illumina)</a><ul>
<li class="toctree-l3"><a class="reference internal" href="#single-end">Single end</a></li>
<li class="toctree-l3"><a class="reference internal" href="#paired-end">Paired end</a></li>
</ul>
</li>
<li class="toctree-l2"><a class="reference internal" href="#long-read-sequencing-nanopore">Long read sequencing (Nanopore)</a></li>
<li class="toctree-l2"><a class="reference internal" href="#fastq-format-and-phred-quality-score">FASTQ format and Phred quality score</a></li>
</ul>
</li>
<li class="toctree-l1"><a class="reference internal" href="3-%20Quality%20Control%20and%20Preprocessing.html">3 Quality Control and Preprocessing</a></li>
<li class="toctree-l1"><a class="reference internal" href="4-%20Quality%20of%20the%20mapping.html">4 Quality of the Mapping</a></li>
</ul>

</div>
Expand All @@ -78,7 +66,7 @@
<li><a href="index.html" class="icon icon-home" aria-label="Home"></a></li>
<li class="breadcrumb-item active">2 Main Sequencing Technologies</li>
<li class="wy-breadcrumbs-aside">
<a href="_sources/2- Sequencing_technologies.rst.txt" rel="nofollow"> View page source</a>
<a href="_sources/2- Sequencing technologies.rst.txt" rel="nofollow"> View page source</a>
</li>
</ul>
<hr/>
Expand Down Expand Up @@ -169,10 +157,7 @@ <h2>FASTQ format and Phred quality score<a class="headerlink" href="#fastq-forma

</div>
</div>
<footer><div class="rst-footer-buttons" role="navigation" aria-label="Footer">
<a href="1-%20Library_preparation.html" class="btn btn-neutral float-left" title="1 Library preparation" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left" aria-hidden="true"></span> Previous</a>
<a href="3-%20Quality%20Control%20and%20Preprocessing.html" class="btn btn-neutral float-right" title="3 Quality Control and Preprocessing" accesskey="n" rel="next">Next <span class="fa fa-arrow-circle-right" aria-hidden="true"></span></a>
</div>
<footer>

<hr/>

Expand Down
49 changes: 27 additions & 22 deletions 3- Quality Control and Preprocessing.html
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,8 @@
<link rel="author" title="About these documents" href="about.html" />
<link rel="index" title="Index" href="genindex.html" />
<link rel="search" title="Search" href="search.html" />
<link rel="prev" title="2 Main Sequencing Technologies" href="2-%20Sequencing_technologies.html" />
<link rel="next" title="4 Quality of the Mapping" href="4-%20Quality%20of%20the%20mapping.html" />
<link rel="prev" title="About the course" href="about.html" />
</head>

<body class="wy-body-for-nav">
Expand All @@ -47,8 +48,6 @@
</div><div class="wy-menu wy-menu-vertical" data-spy="affix" role="navigation" aria-label="Navigation menu">
<ul class="current">
<li class="toctree-l1"><a class="reference internal" href="about.html">About the course</a></li>
<li class="toctree-l1"><a class="reference internal" href="1-%20Library_preparation.html">1 Library preparation</a></li>
<li class="toctree-l1"><a class="reference internal" href="2-%20Sequencing_technologies.html">2 Main Sequencing Technologies</a></li>
<li class="toctree-l1 current"><a class="current reference internal" href="#">3 Quality Control and Preprocessing</a><ul>
<li class="toctree-l2"><a class="reference internal" href="#illumina">Illumina</a><ul>
<li class="toctree-l3"><a class="reference internal" href="#quality-control">Quality Control</a><ul>
Expand All @@ -61,6 +60,7 @@
</li>
</ul>
</li>
<li class="toctree-l1"><a class="reference internal" href="4-%20Quality%20of%20the%20mapping.html">4 Quality of the Mapping</a></li>
</ul>

</div>
Expand Down Expand Up @@ -112,28 +112,31 @@ <h4>FASTQC<a class="headerlink" href="#fastqc" title="Link to this heading"><
</div></blockquote>
<p>The html report generated for each file its divided in the following modules:</p>
<blockquote>
<div><p>#1. <strong>Basic Statistics</strong>: display the information related with the file, number and leght of the sequences, and overall %GC.
#2. <strong>Per base sequence quality</strong>: shows how the quality score (y axis) varys throughout the sequence reads (x axis). For each position a BoxWhisker is displayed, the red line represents the median and the blue the mean. Commonly the quality score tend to decrease at the end of the reads, because the polymerase tends to make more mistakes as the read progresses.</p>
<blockquote>
<div><p>is the median os any base is less than 25 a warning will arise.</p>
</div></blockquote>
<p>#3. <strong>Per tile sequence quality</strong>: shows the quality score distribution for each tile in the flowcell.
#4. <strong>Per sequence quality score</strong>: shows the distribution of the quality scores for all the reads in the file. If a huge amount of reads subset have a poor average quality this could indicate a systematic problem.
#5. <strong>Per base sequence content</strong>: proportion of each base position for the four nucleotides. A strong bias in the nucleotide composition could indicate a problem in the library preparation.
#6. <strong>Per sequence GC content</strong>: GC content distribution for all the reads in the file, and compared to a modelled normal distribution of human GC content.</p>
</div></blockquote>
<div><ol class="arabic simple">
<li><p><strong>Basic Statistics</strong>: display the information related with the file, number and leght of the sequences, and overall %GC.</p></li>
<li><dl class="simple">
<dt><strong>Per base sequence quality</strong>: shows how the quality score (y axis) varys throughout the sequence reads (x axis). For each position a BoxWhisker is displayed, the red line represents the median and the blue the mean. Commonly the quality score tend to decrease at the end of the reads, because the polymerase tends to make more mistakes as the read progresses.</dt><dd><p>is the median os any base is less than 25 a warning will arise.</p>
</dd>
</dl>
</li>
<li><p><strong>Per tile sequence quality</strong>: shows the quality score distribution for each tile in the flowcell.</p></li>
<li><p><strong>Per sequence quality score</strong>: shows the distribution of the quality scores for all the reads in the file. If a huge amount of reads subset have a poor average quality this could indicate a systematic problem.</p></li>
<li><p><strong>Per base sequence content</strong>: proportion of each base position for the four nucleotides. A strong bias in the nucleotide composition could indicate a problem in the library preparation.</p></li>
<li><p><strong>Per sequence GC content</strong>: GC content distribution for all the reads in the file, and compared to a modelled normal distribution of human GC content.</p></li>
</ol>
<div class="admonition danger">
<p class="admonition-title">Danger</p>
<blockquote>
<div><p>If the GC content is not close to the normal distribution, this could indicate a contamination or a problem in the library preparation.
<p>If the GC content is not close to the normal distribution, this could indicate a contamination or a problem in the library preparation.
Also, depending on the organism the GC content could vary, so it is important to know the GC content of the organism of interest (so avoid comparison with reference curve).</p>
</div></blockquote>
<p>#7. <strong>Per Base N content</strong>: If the sequencer is unable to determine the base in a position, it will be represented as an ‘N’. This section shows the distribution of Ns in the reads.
#8. <strong>Sequence Lenght Distribution</strong>: distribution of fragment sizes, for delimited size lenght (number of cycles) a peak only at one size is observed.
#9. <strong>Duplicate Sequences</strong>: shows the number of duplicated sequences in the file. a high level of duplication could indicate a enrichment bias (i.e. PCR amplification). Low level of duplication may indicate a very high level of coverage of the target sequence.
#10. <strong>Overrepresented sequences</strong>: show in a single sequence is very overrepresented in the file. This could indicate a contamination or a problem in the library preparation.
#11. <a href="#id1"><span class="problematic" id="id2">**</span></a>Adapter content <a href="#id3"><span class="problematic" id="id4">**</span></a>: shows the presence of adapter sequences in the reads. If there is presence of adapters, the reads should be trimmed before further analysis.</p>
</div>
<ol class="arabic simple" start="7">
<li><p><strong>Per Base N content</strong>: If the sequencer is unable to determine the base in a position, it will be represented as an ‘N’. This section shows the distribution of Ns in the reads.</p></li>
<li><p><strong>Sequence Lenght Distribution</strong>: distribution of fragment sizes, for delimited size lenght (number of cycles) a peak only at one size is observed.</p></li>
<li><p><strong>Duplicate Sequences</strong>: shows the number of duplicated sequences in the file. a high level of duplication could indicate a enrichment bias (i.e. PCR amplification). Low level of duplication may indicate a very high level of coverage of the target sequence.</p></li>
<li><p><strong>Overrepresented sequences</strong>: show in a single sequence is very overrepresented in the file. This could indicate a contamination or a problem in the library preparation.</p></li>
<li><p><a href="#id1"><span class="problematic" id="id2">**</span></a>Adapter content <a href="#id3"><span class="problematic" id="id4">**</span></a>: shows the presence of adapter sequences in the reads. If there is presence of adapters, the reads should be trimmed before further analysis.</p></li>
</ol>
</div></blockquote>
<div class="admonition seealso">
<p class="admonition-title">See also</p>
<p>For more information about FASTQC modules interpretation visit the <a class="reference external" href="https://www.bioinformatics.babraham.ac.uk/projects/fastqc/Help/3%20Analysis%20Modules/">FASTQC</a> website.</p>
Expand Down Expand Up @@ -197,6 +200,7 @@ <h3>Pre-processing<a class="headerlink" href="#pre-processing" title="Link to th
</ul>
<p>After preprocessing our reads, its important to check again the Quality. Fastp generates both htm and json report for asses the quality of our reads.
The json reports could be aggregated with MULTIQC.</p>
<p>Example of fastp report.</p>
</section>
</section>
</section>
Expand All @@ -205,7 +209,8 @@ <h3>Pre-processing<a class="headerlink" href="#pre-processing" title="Link to th
</div>
</div>
<footer><div class="rst-footer-buttons" role="navigation" aria-label="Footer">
<a href="2-%20Sequencing_technologies.html" class="btn btn-neutral float-left" title="2 Main Sequencing Technologies" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left" aria-hidden="true"></span> Previous</a>
<a href="about.html" class="btn btn-neutral float-left" title="About the course" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left" aria-hidden="true"></span> Previous</a>
<a href="4-%20Quality%20of%20the%20mapping.html" class="btn btn-neutral float-right" title="4 Quality of the Mapping" accesskey="n" rel="next">Next <span class="fa fa-arrow-circle-right" aria-hidden="true"></span></a>
</div>

<hr/>
Expand Down
Loading

0 comments on commit 092003c

Please sign in to comment.