FastQCFastQC Report
Mon 16 Mar 2020
HKVH7DRXX_n01_dpr104.fastq.gz

Summary

[OK]Basic Statistics

MeasureValue
FilenameHKVH7DRXX_n01_dpr104.fastq.gz
File typeConventional base calls
EncodingSanger / Illumina 1.9
Total Sequences2483597
Sequences flagged as poor quality0
Sequence length20
%GC49

[OK]Per base sequence quality

Per base quality graph

[OK]Per tile sequence quality

Per base quality graph

[OK]Per sequence quality scores

Per Sequence quality graph

[FAIL]Per base sequence content

Per base sequence content

[OK]Per sequence GC content

Per sequence GC content graph

[OK]Per base N content

N content graph

[OK]Sequence Length Distribution

Sequence length distribution

[WARN]Sequence Duplication Levels

Duplication level graph

[WARN]Overrepresented sequences

SequenceCountPercentagePossible Source
GTGTGTGTGTGTGTGTGTGT50720.2042199277902172No Hit
GGGTTTGGGTTTGGGTTTGG36160.14559527974949238No Hit
GGGGGGGGGGGGGGGGGGGG29890.12034963804514179No Hit

[WARN]Adapter Content

Can't analyse adapters as read length is too short

[FAIL]Kmer Content

Kmer graph

SequenceCountPValueObs/Exp MaxMax Obs/Exp Position
GTCCGTT950.014.0082011
GTCCGGG950.014.0082011
GGCCTTT950.014.0082011
CTGCTGT1900.014.0082011
GCCCGTG950.014.0082011
CTTCTTT2950.014.0082011
CTCCTTT1050.014.00821
CTCCTTG1950.014.00821
CTCCTGT1850.014.00821
TGCCGTG401.7463205E-414.00821
CTCCTGG3050.014.00821
CGCCTGG801.4915713E-1014.00821
TTCCGGG559.111809E-714.00821
CCTCGTG1050.014.00821
GGCCGTG453.0191786E-514.00821
GTCCGGT505.2373834E-614.00821
CCTCGGT505.2373834E-614.00821
CCGCTGG350.001014032814.00821
GCGCGTT401.7463205E-414.00821
GCGCGTG401.7463205E-414.00821