Basic Statistics
Measure | Value |
---|---|
Filename | SE338_CGATGT_L004_R1_001.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 5986702 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGATGTATCTCGTATGCC | 2093281 | 34.965511896199274 | TruSeq Adapter, Index 2 (100% over 51bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGATGAATCTCGTATGCC | 7755 | 0.12953709738684172 | TruSeq Adapter, Index 2 (98% over 51bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CACACGT | 234985 | 0.0 | 44.52282 | 12 |
GCACACG | 235190 | 0.0 | 44.504097 | 11 |
AGCACAC | 235375 | 0.0 | 44.490143 | 10 |
AGAGCAC | 236075 | 0.0 | 44.480194 | 8 |
ACACGTC | 234960 | 0.0 | 44.468197 | 13 |
GAGCACA | 236005 | 0.0 | 44.46765 | 9 |
CACGTCT | 234790 | 0.0 | 44.463985 | 14 |
GATCGGA | 236040 | 0.0 | 44.45547 | 1 |
TCGGAAG | 236115 | 0.0 | 44.45112 | 3 |
GAAGAGC | 236910 | 0.0 | 44.417423 | 6 |
CGGAAGA | 236730 | 0.0 | 44.404057 | 4 |
ACGTCTG | 234975 | 0.0 | 44.399303 | 15 |
AAGAGCA | 237175 | 0.0 | 44.382023 | 7 |
ATCGGAA | 236520 | 0.0 | 44.354347 | 2 |
CGTCTGA | 235110 | 0.0 | 44.34032 | 16 |
GGAAGAG | 238010 | 0.0 | 44.320263 | 5 |
TCTGAAC | 234655 | 0.0 | 44.2704 | 18 |
CTGAACT | 234705 | 0.0 | 44.23967 | 19 |
GAACTCC | 233780 | 0.0 | 44.224983 | 21 |
GTATGCC | 232860 | 0.0 | 44.202038 | 45 |