FastQC Report
Fri 11 May 2018
HW73GBGX5_n01_arr5_r1.fastq.gz

Summary

[OK] Basic Statistics

Measure                            Value
Filename                           HW73GBGX5_n01_arr5_r1.fastq.gz
File type                          Conventional base calls
Encoding                           Sanger / Illumina 1.9
Total Sequences                    13286874
Sequences flagged as poor quality  0
Sequence length                    75
%GC                                47

[OK] Per base sequence quality

Per base quality graph

[OK] Per tile sequence quality

Per tile quality graph

[OK] Per sequence quality scores

Per Sequence quality graph

[FAIL] Per base sequence content

Per base sequence content

[WARN] Per sequence GC content

Per sequence GC content graph

[OK] Per base N content

N content graph

[OK] Sequence Length Distribution

Sequence length distribution

[FAIL] Sequence Duplication Levels

Duplication level graph

[WARN] Overrepresented sequences

Sequence                                                                     Count  Percentage           Possible Source
CTCAGGTAGTGGTTGTCGGGCAGCAGCACGGGGCCGTCGCCGATGGGGGTGTTCTGCTGGTAGTGGTCGGCGAGC  26927  0.20265865394674473  No Hit
CTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTC  24371  0.18342162347592067  No Hit
CGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAA  14774  0.11119244451328432  No Hit
CTTTGCTCAGGGCGGACTGGGTGCTCAGGTAGTGGTTGTCGGGCAGCAGCACGGGGCCGTCGCCGATGGGGGTGT  14589  0.10980009293382327  No Hit
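
The Percentage column can be cross-checked against the Count column and the Total Sequences value from Basic Statistics. The following is a minimal sketch of that arithmetic, assuming the percentage is simply count divided by total reads, times 100. The function name `percentage_of_total` is illustrative and not part of FastQC.

```python
# Values copied from this report.
TOTAL_SEQUENCES = 13286874  # "Total Sequences" in Basic Statistics

# Counts of the four overrepresented sequences, in report order.
OVERREPRESENTED_COUNTS = [26927, 24371, 14774, 14589]


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of all reads in the file.

    Assumption: percentage = 100 * count / total.
    """
    return 100.0 * count / total


for count in OVERREPRESENTED_COUNTS:
    print(f"{count:>6}  {percentage_of_total(count):.5f}")
```

Under that assumption the recomputed values agree with the reported percentages to the precision shown, and together the four sequences account for roughly 0.6% of all reads.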

[OK] Adapter Content

Adapter graph

[FAIL] Kmer Content

Kmer graph

Sequence  Count  PValue  Obs/Exp Max  Max Obs/Exp Position
GTGTGCG    1950  0.0     16.449047    8
GTGCGAG    2020  0.0     16.391504    10
CCCCCTA     820  0.0     14.306898    46
TGTGCGA    2355  0.0     13.913191    9
CAGGTAG   16440  0.0     12.902647    3
CGGGAAT    7550  0.0     12.792416    1
GCCCGTA    1970  0.0     12.788766    67
GTCGGGC   16455  0.0     12.660173    15
TGCGAGT    2780  0.0     12.530753    11
CAGGGCG    9575  0.0     12.499227    8
CGCGGGG    1630  0.0     12.485497    1
CTGACAT    3800  0.0     12.3451605   1
AGGTAGT   17585  0.0     12.258526    4
AGGGCGG    9595  0.0     12.257544    9
GACTCTA    5950  0.0     12.231116    4
TAAGCGG    2995  0.0     12.20734     20
GGAGTAC   14955  0.0     12.200419    3
ACCCGTA    2685  0.0     12.076008    32
TCAGGTA   17660  0.0     12.069889    2
AGAGGGG    2620  0.0     11.980638    32
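
Several of the enriched 7-mers are substrings of the overrepresented sequences listed earlier in this report, which suggests that part of the Kmer Content failure reflects those same sequences. The sketch below locates a few of the 7-mers in the four overrepresented reads. It is only a consistency check on the reported data, not a reimplementation of how FastQC computes Obs/Exp, and the names `first_position`, `OVERREPRESENTED` and `KMERS` are illustrative.

```python
# Overrepresented sequences copied from this report, in report order.
OVERREPRESENTED = [
    "CTCAGGTAGTGGTTGTCGGGCAGCAGCACGGGGCCGTCGCCGATGGGGGTGTTCTGCTGGTAGTGGTCGGCGAGC",
    "CTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTC",
    "CGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAA",
    "CTTTGCTCAGGGCGGACTGGGTGCTCAGGTAGTGGTTGTCGGGCAGCAGCACGGGGCCGTCGCCGATGGGGGTGT",
]

# A subset of the 7-mers from the Kmer Content table.
KMERS = ["TCAGGTA", "CAGGTAG", "AGGTAGT", "GTCGGGC", "GGAGTAC", "CAGGGCG", "AGGGCGG"]


def first_position(kmer, read):
    """Return the 1-based start of the first occurrence of kmer in read.

    Returns None when the k-mer does not occur in the read.
    """
    index = read.find(kmer)
    return index + 1 if index != -1 else None


for kmer in KMERS:
    # One entry per overrepresented sequence, in report order.
    hits = [first_position(kmer, read) for read in OVERREPRESENTED]
    print(kmer, hits)
```

In the first overrepresented sequence, TCAGGTA, CAGGTAG, AGGTAGT and GTCGGGC start at positions 2, 3, 4 and 15, which are the same positions at which the table reports their maximum Obs/Exp.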