FastQCFastQC Report
Thu 8 Feb 2018
H2LMCBGX5_n01_bzip3_r1_input_biotin_30.fastq.gz

Summary

[OK]Basic Statistics

MeasureValue
FilenameH2LMCBGX5_n01_bzip3_r1_input_biotin_30.fastq.gz
File typeConventional base calls
EncodingSanger / Illumina 1.9
Total Sequences39891883
Sequences flagged as poor quality0
Sequence length75
%GC41

[OK]Per base sequence quality

Per base quality graph

[OK]Per tile sequence quality

Per base quality graph

[OK]Per sequence quality scores

Per Sequence quality graph

[FAIL]Per base sequence content

Per base sequence content

[FAIL]Per sequence GC content

Per sequence GC content graph

[OK]Per base N content

N content graph

[OK]Sequence Length Distribution

Sequence length distribution

[FAIL]Sequence Duplication Levels

Duplication level graph

[FAIL]Overrepresented sequences

SequenceCountPercentagePossible Source
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCAGATCATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAGG8576572.1499536635059315TruSeq Adapter, Index 7 (100% over 63bp)
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCAGATCATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAATG433760.10873389957551013TruSeq Adapter, Index 7 (100% over 63bp)

[OK]Adapter Content

Adapter graph

[FAIL]Kmer Content

Kmer graph

SequenceCountPValueObs/Exp MaxMax Obs/Exp Position
ATGCCGT1709300.063.2712147
CGTATGC1781600.063.10070444
GTATGCC1802600.063.0263645
TGCCGTC1687250.063.0125448
TATGCCG1789850.062.9109446
GCCGTCT1599300.062.70414749
CTCGTAT1675350.061.56247342
TCGTATG1698450.061.54971743
CCGTCTT1682450.060.1407850
TCTCGTA1709950.060.12228441
CATCTCG1818400.060.03967739
ATCTCGT1772600.059.93137740
CTGCTTG1579350.059.16469657
CAGATCA2359250.058.58073434
TCACCAG2385200.058.5758630
AGTCACC2394350.058.28447328
CACCAGA2396800.058.22441531
CGTCTTC1673050.058.1790251
GTCACCA2398850.058.1711429
CAGTCAC2411950.057.96300527