FastQC Report
Mon 21 Dec 2020
HV33TBGXG_n01_AKM166.fastq.gz

[OK] Basic Statistics

Measure                            Value
Filename                           HV33TBGXG_n01_AKM166.fastq.gz
File type                          Conventional base calls
Encoding                           Sanger / Illumina 1.9
Total Sequences                    14959222
Sequences flagged as poor quality  0
Sequence length                    76
%GC                                38

[OK] Per base sequence quality

Per base quality graph

[OK] Per tile sequence quality

Per tile quality graph

[OK] Per sequence quality scores

Per sequence quality graph

[WARN] Per base sequence content

Per base sequence content graph

[FAIL] Per sequence GC content

Per sequence GC content graph

[OK] Per base N content

N content graph

[OK] Sequence Length Distribution

Sequence length distribution graph

[OK] Sequence Duplication Levels

Duplication level graph

[FAIL] Overrepresented sequences

Sequence                                            Count    Percentage           Possible Source
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCAGATCATCTCGTATGC  1566141  10.469401416731431   TruSeq Adapter, Index 7 (100% over 50bp)
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCAGATCATCGCGTATGC  50277    0.3360936818773062   TruSeq Adapter, Index 7 (98% over 50bp)
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCAGATCATCTCGGATGC  21887    0.1463110848946556   TruSeq Adapter, Index 7 (98% over 50bp)
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCAGATCATCTCGTTTGC  16690    0.11156997335824016  TruSeq Adapter, Index 7 (98% over 50bp)

[OK] Adapter Content

Adapter graph

[FAIL] Kmer Content

Kmer graph

Sequence  Count   PValue  Obs/Exp Max  Max Obs/Exp Position
CGTATGC   181425  0.0     68.11457     44
TATGCCG   180820  0.0     67.977425    46
GTATGCC   182500  0.0     67.95671     45
TCGTATG   177180  0.0     67.9436      43
GCACACG   202125  0.0     67.894325    11
ATGCCGT   176545  0.0     67.860504    47
TGCCGTC   175940  0.0     67.80099     48
CTCGTAT   177765  0.0     67.78066     42
ACGTCTG   201575  0.0     67.773026    15
ACACGTC   201995  0.0     67.768265    13
AGTCACC   199950  0.0     67.71959     28
GCCGTCT   173700  0.0     67.71318     49
CACACGT   202745  0.0     67.70914     12
CACGTCT   202185  0.0     67.59969     14
CAGTCAC   201495  0.0     67.53009     27
CGTCTGA   202380  0.0     67.50216     16
CCAGTCA   201285  0.0     67.36831     26
GAACTCC   201530  0.0     67.34098     21
ACTCCAG   201070  0.0     67.32702     23
AGCACAC   204415  0.0     67.26038     10
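
The Max Obs/Exp Position values seem to line up with where each 7-mer falls in the top overrepresented sequence, which would suggest (though not prove) that the Kmer Content failure is driven by the same adapter-derived reads. The sketch below checks that alignment under two assumptions: positions are 1-based, and the affected reads begin at the first base of the 50 bp sequence. K-mers that run past base 50 are compared only on the overlapping bases. The names are illustrative and not part of FastQC.

```python
# Top overrepresented sequence from this report (50 bp).
ADAPTER = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACCAGATCATCTCGTATGC"

# K-mer -> Max Obs/Exp Position, as listed in the Kmer Content table.
reported_positions = {
    "CGTATGC": 44, "TATGCCG": 46, "GTATGCC": 45, "TCGTATG": 43,
    "GCACACG": 11, "ATGCCGT": 47, "TGCCGTC": 48, "CTCGTAT": 42,
    "ACGTCTG": 15, "ACACGTC": 13, "AGTCACC": 28, "GCCGTCT": 49,
    "CACACGT": 12, "CACGTCT": 14, "CAGTCAC": 27, "CGTCTGA": 16,
    "CCAGTCA": 26, "GAACTCC": 21, "ACTCCAG": 23, "AGCACAC": 10,
}


def consistent_with_adapter(kmer, position, adapter=ADAPTER):
    """True if kmer matches the adapter starting at the 1-based position.

    Only the bases that fall inside the adapter are compared, so a k-mer
    that extends past the last adapter base is checked on its prefix.
    """
    window = adapter[position - 1:position - 1 + len(kmer)]
    return len(window) > 0 and kmer.startswith(window)


matches = {
    kmer: consistent_with_adapter(kmer, pos)
    for kmer, pos in reported_positions.items()
}

for kmer, ok in matches.items():
    print(kmer, reported_positions[kmer], "match" if ok else "no match")
```

K-mers near position 50 are only partially checked, so agreement there is weaker evidence than for k-mers that lie fully inside the 50 bp sequence.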