FastQC Report
Wed 4 Feb 2015
lane5_Undetermined_L005_R1_001.fastq.gz

[OK] Basic Statistics

Measure                            Value
Filename                           lane5_Undetermined_L005_R1_001.fastq.gz
File type                          Conventional base calls
Encoding                           Sanger / Illumina 1.9
Total Sequences                    4562195
Sequences flagged as poor quality  0
Sequence length                    51
%GC                                40

[OK] Per base sequence quality

Per base quality graph

[OK] Per tile sequence quality

Per tile quality graph

[OK] Per sequence quality scores

Per Sequence quality graph

[WARN] Per base sequence content

Per base sequence content

[OK] Per sequence GC content

Per sequence GC content graph

[OK] Per base N content

N content graph

[OK] Sequence Length Distribution

Sequence length distribution

[WARN] Sequence Duplication Levels

Duplication level graph

[WARN] Overrepresented sequences

Sequence                                             Count  Percentage           Possible Source
GATCGGAAGAGCACACGTCTGAACTCCAGTCACACTTGAATCTCGTATGCC  12896  0.28267095115399493  TruSeq Adapter, Index 8 (100% over 51bp)
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTAGCATCTCGTATGCCG  7050   0.154530878228572    TruSeq Adapter, Index 20 (97% over 38bp)
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGGAGCATCTCGTATGCCG  5585   0.12241914253993966  TruSeq Adapter, Index 2 (97% over 36bp)
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTTGTAATCTCGTATGCC  5228   0.11459396189772686  TruSeq Adapter, Index 12 (100% over 51bp)
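
The Percentage column appears to be each sequence's count divided by the Total Sequences value from Basic Statistics, times 100. The sketch below recomputes it under that assumption. The names `TOTAL_SEQUENCES`, `rows`, and `percentage` are illustrative and not part of FastQC.

```python
# Total Sequences, copied from the Basic Statistics table of this report.
TOTAL_SEQUENCES = 4562195

# (count, reported percentage) pairs copied from the Overrepresented sequences table.
rows = [
    (12896, 0.28267095115399493),
    (7050, 0.154530878228572),
    (5585, 0.12241914253993966),
    (5228, 0.11459396189772686),
]


def percentage(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


for count, reported in rows:
    recomputed = percentage(count)
    # Flag any row whose recomputed value drifts from the reported one.
    status = "ok" if abs(recomputed - reported) < 1e-6 else "MISMATCH"
    print(f"{count:>6}  reported={reported:.6f}  recomputed={recomputed:.6f}  {status}")
```

Each listed sequence is well under 1% of the reads, so the four adapter hits shown here together make up only a small fraction of this file.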

[OK] Adapter Content

Adapter graph

[FAIL] Kmer Content

Kmer graph

Sequence  Count  PValue  Obs/Exp Max  Max Obs/Exp Position
GCACACG   10050  0.0     42.59713     11
GATCGGA   10375  0.0     42.36352     1
ACGTCTG   10080  0.0     42.148525    15
ACACGTC   10100  0.0     42.130417    13
CACACGT   10155  0.0     42.091682    12
CACGTCT   10090  0.0     42.03857     14
CGTCTGA   10065  0.0     42.00446     16
TCGGAAG   10505  0.0     41.976128    3
AGCACAC   10345  0.0     41.447136    10
CGGAAGA   10710  0.0     41.13697     4
AGAGCAC   10530  0.0     41.120228    8
GAACTCC   10065  0.0     40.849686    21
ATCGGAA   10900  0.0     40.356544    2
TGAACTC   10470  0.0     39.784115    20
CTCCAGT   9615   0.0     39.740097    24
CAGTCAC   8195   0.0     39.461693    27
CTGAACT   10610  0.0     39.4259719   19
AACTCCA   10295  0.0     39.28365     22
GAGCACA   11030  0.0     39.178734    9
CCAGTCA   9060   0.0     39.01812     26
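
One way to read the Kmer Content failure is to check whether the enriched 7-mers line up with the adapter-derived reads listed under Overrepresented sequences. The sketch below looks up each k-mer in the top overrepresented sequence and compares its 1-based start offset with the reported Max Obs/Exp Position. The names `ADAPTER_READ`, `REPORTED`, and `first_position` are illustrative, and treating the reported position as a 1-based start offset is an assumption.

```python
# Top overrepresented sequence in this report (the TruSeq Adapter, Index 8 hit).
ADAPTER_READ = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACACTTGAATCTCGTATGCC"

# k-mer -> reported "Max Obs/Exp Position", copied from the Kmer Content table.
REPORTED = {
    "GCACACG": 11, "GATCGGA": 1, "ACGTCTG": 15, "ACACGTC": 13,
    "CACACGT": 12, "CACGTCT": 14, "CGTCTGA": 16, "TCGGAAG": 3,
    "AGCACAC": 10, "CGGAAGA": 4, "AGAGCAC": 8, "GAACTCC": 21,
    "ATCGGAA": 2, "TGAACTC": 20, "CTCCAGT": 24, "CAGTCAC": 27,
    "CTGAACT": 19, "AACTCCA": 22, "GAGCACA": 9, "CCAGTCA": 26,
}


def first_position(kmer, read):
    """Return the 1-based index of the first occurrence of kmer in read, or None."""
    idx = read.find(kmer)
    return idx + 1 if idx != -1 else None


# Collect k-mers whose location in the read differs from the reported position.
mismatches = {
    kmer: (first_position(kmer, ADAPTER_READ), pos)
    for kmer, pos in REPORTED.items()
    if first_position(kmer, ADAPTER_READ) != pos
}

print(f"{len(REPORTED) - len(mismatches)} of {len(REPORTED)} k-mers match their reported position")
```

If no mismatches are found, the enriched k-mers tile the start of the adapter sequence, which would be consistent with the Kmer Content failure being driven by the same adapter-containing reads rather than by a separate problem. This is suggestive only; it does not rule out other sources of k-mer bias.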