Basic Statistics
Measure | Value |
---|---|
Filename | MK237_GTGAAA_L002_R1_001.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 10473451 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGAAACGATCTCGTATG | 1077942 | 10.29213771086531 | TruSeq Adapter, Index 19 (97% over 40bp) |
GATTGGAAGAGCACACGTCTGAACTCCAGTCACGTGAAACGATCTCGTATG | 35691 | 0.34077592953841096 | TruSeq Adapter, Index 19 (97% over 40bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGAAATGATCTCGTATG | 15594 | 0.14889075243680425 | TruSeq Adapter, Index 19 (97% over 39bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGAAACGATTTCGTATG | 11495 | 0.10975370009369403 | TruSeq Adapter, Index 19 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CACACGT | 141060 | 0.0 | 44.37811 | 12 |
ACGTCTG | 140180 | 0.0 | 44.36623 | 15 |
GCACACG | 141245 | 0.0 | 44.339092 | 11 |
CGTCTGA | 140380 | 0.0 | 44.251743 | 16 |
CACGTCT | 140590 | 0.0 | 44.235245 | 14 |
ACACGTC | 140835 | 0.0 | 44.215797 | 13 |
AGCACAC | 142415 | 0.0 | 44.086983 | 10 |
GTCTGAA | 141835 | 0.0 | 43.954807 | 17 |
GATCGGA | 132785 | 0.0 | 43.89733 | 1 |
TCTGAAC | 141615 | 0.0 | 43.78026 | 18 |
CACGTGA | 137180 | 0.0 | 43.756016 | 31 |
CGGAAGA | 136960 | 0.0 | 43.743416 | 4 |
TCGTATG | 135625 | 0.0 | 43.74285 | 45 |
GTCACGT | 138375 | 0.0 | 43.73134 | 29 |
AGAGCAC | 144395 | 0.0 | 43.728603 | 8 |
CGTGAAA | 136285 | 0.0 | 43.71543 | 33 |
ACGTGAA | 137285 | 0.0 | 43.672157 | 32 |
CTGAACT | 141665 | 0.0 | 43.66476 | 19 |
TCACGTG | 138150 | 0.0 | 43.632797 | 30 |
CAGTCAC | 141075 | 0.0 | 43.626842 | 27 |