FastQCFastQC Report
Tue 14 Feb 2017
H2GJMBGX2_n01_mb20.fastq.gz

Summary

[OK]Basic Statistics

MeasureValue
FilenameH2GJMBGX2_n01_mb20.fastq.gz
File typeConventional base calls
EncodingSanger / Illumina 1.9
Total Sequences22126455
Sequences flagged as poor quality0
Sequence length50
%GC44

[OK]Per base sequence quality

Per base quality graph

[OK]Per tile sequence quality

Per base quality graph

[OK]Per sequence quality scores

Per Sequence quality graph

[FAIL]Per base sequence content

Per base sequence content

[FAIL]Per sequence GC content

Per sequence GC content graph

[OK]Per base N content

N content graph

[OK]Sequence Length Distribution

Sequence length distribution

[WARN]Sequence Duplication Levels

Duplication level graph

[FAIL]Overrepresented sequences

SequenceCountPercentagePossible Source
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATCTCGTATGC291242013.162614616756278TruSeq Adapter, Index 9 (100% over 50bp)
GATCGGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATCTCGTATG711820.32170539745295845TruSeq Adapter, Index 9 (100% over 45bp)
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGCTCTCGTATGC342210.15466101551287811TruSeq Adapter, Index 9 (98% over 50bp)
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATATCGTATGC267360.12083273167798458TruSeq Adapter, Index 9 (98% over 50bp)

[OK]Adapter Content

Adapter graph

[FAIL]Kmer Content

Kmer graph

SequenceCountPValueObs/Exp MaxMax Obs/Exp Position
GATCGGA3290650.043.4601481
ATCGGAA3344600.043.413912
CGTATGC3224650.043.39142244
TCGGAAG3336300.043.303943
CGGAAGA3333800.043.229014
ACACGTC3425700.041.7920813
GCACACG3433500.041.72633711
CACACGT3438300.041.6550212
ACGTCTG3426000.041.6452415
CGTCTGA3420750.041.6273716
CACGTCT3434750.041.6194814
GAAGAGC3477150.041.451486
GATCGGG88150.041.3053861
GAGCACA3485650.041.199519
GTCACGA3403300.041.13960329
AGCACAC3489700.041.1123510
ACGATCA3402200.041.1070932
GATCTCG3272650.041.10650639
CGATCAG3391250.041.1048933
AGTCACG3412550.041.10222628