FastQCFastQC Report
Mon 16 Mar 2020
HKVH7DRXX_n01_dpr122.fastq.gz

Summary

[OK]Basic Statistics

MeasureValue
FilenameHKVH7DRXX_n01_dpr122.fastq.gz
File typeConventional base calls
EncodingSanger / Illumina 1.9
Total Sequences15047646
Sequences flagged as poor quality0
Sequence length20
%GC58

[OK]Per base sequence quality

Per base quality graph

[OK]Per tile sequence quality

Per base quality graph

[OK]Per sequence quality scores

Per Sequence quality graph

[FAIL]Per base sequence content

Per base sequence content

[OK]Per sequence GC content

Per sequence GC content graph

[OK]Per base N content

N content graph

[OK]Sequence Length Distribution

Sequence length distribution

[FAIL]Sequence Duplication Levels

Duplication level graph

[WARN]Overrepresented sequences

SequenceCountPercentagePossible Source
GGGGGGGGGGGGGGGGGGGG389720.25899067535214476No Hit
GGGTGTGTTGTGTTTTTGTG266230.17692468310325749No Hit

[WARN]Adapter Content

Can't analyse adapters as read length is too short

[FAIL]Kmer Content

Kmer graph

SequenceCountPValueObs/Exp MaxMax Obs/Exp Position
CTCCTTT8300.014.0083371
CCTCGTG13450.014.0083371
CCTCGGT8300.014.0083371
TCGCGTG950.014.0083371
CCGCTGT3300.014.0083371
TGCCTGG17700.014.0083371
CCCCGTT3200.014.0083371
TCCCGGG7600.014.0083371
GCCCGTG13450.014.0083371
CGCCGTT401.7471018E-414.0083371
CTGAGTG300.00591625314.0083361
CTCCTTG17100.014.0083361
CGCCTTT1350.014.0083361
TTCCGTT2500.014.0083361
CTCCTGT21700.014.0083361
CTCCTGG32750.014.0083361
CGCCTGG8900.014.0083361
TTCCGGG4950.014.0083361
TGCCGGG4900.014.0083361
GTCCGTT5050.014.0083361