FastQCFastQC Report
Mon 16 Mar 2020
HKVH7DRXX_n01_dpr109.fastq.gz

Summary

[OK]Basic Statistics

MeasureValue
FilenameHKVH7DRXX_n01_dpr109.fastq.gz
File typeConventional base calls
EncodingSanger / Illumina 1.9
Total Sequences6558178
Sequences flagged as poor quality0
Sequence length20
%GC56

[OK]Per base sequence quality

Per base quality graph

[OK]Per tile sequence quality

Per base quality graph

[OK]Per sequence quality scores

Per Sequence quality graph

[FAIL]Per base sequence content

Per base sequence content

[OK]Per sequence GC content

Per sequence GC content graph

[OK]Per base N content

N content graph

[OK]Sequence Length Distribution

Sequence length distribution

[FAIL]Sequence Duplication Levels

Duplication level graph

[WARN]Overrepresented sequences

SequenceCountPercentagePossible Source
GGGGGGGGGGGGGGGGGGGG115870.17668016940070855No Hit
GTGTGTGTGTGTGTGTGTGT106980.16312457514876844No Hit
GGGTTTGGGTTTGGGTTTGG76690.11693796661206818No Hit

[WARN]Adapter Content

Can't analyse adapters as read length is too short

[FAIL]Kmer Content

Kmer graph

SequenceCountPValueObs/Exp MaxMax Obs/Exp Position
GTCCGTT2850.014.0089511
CTCCTTT4400.014.008951
CTCCTTG8050.014.008951
CGCCTTG1100.014.008951
TTCCGTT950.014.008951
TGCCGTG1250.014.008951
TTCCGGG1050.014.008951
CCTCGTT3100.014.008951
GGCCGTT950.014.008951
CCGCTTT704.8676156E-914.008951
CCTCGGT3700.014.008951
CCTCGGG4300.014.008951
CCGCTGT1350.014.008951
TGCCTGG6700.014.008951
CCCCGTT1400.014.008951
CCTCTGT9450.014.008951
TCTCGGT1750.014.008951
GCTCGGG4550.014.008951
CTTCGTT2200.014.008951
CCCCTTT3500.014.008951