Basic Statistics
Measure | Value |
---|---|
Filename | C5B16ACXX l05n01 m48-2.3410000000cc92.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 22381281 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 50 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGAGTGGATATCTCGTAT | 69141 | 0.3089233364256496 | TruSeq Adapter, Index 7 (97% over 36bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACGAGTGGATATCTCGTA | 65383 | 0.29213251913507543 | TruSeq Adapter, Index 7 (97% over 36bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGTCACG | 18665 | 0.0 | 38.180904 | 28-29 |
TATGCCG | 19475 | 0.0 | 36.237705 | 48-49 |
CGTATGC | 19785 | 0.0 | 35.788174 | 46-47 |
CACACGT | 20230 | 0.0 | 35.481865 | 12-13 |
ACGAGTG | 20135 | 0.0 | 35.134422 | 32-33 |
CTCGTAT | 20515 | 0.0 | 34.29513 | 44-45 |
AGATCGG | 21185 | 0.0 | 33.572952 | 1 |
CACGTCT | 21520 | 0.0 | 33.28888 | 14-15 |
TCACGAG | 22030 | 0.0 | 32.522278 | 30-31 |
CGCGTAT | 7505 | 0.0 | 32.18099 | 1 |
AGCACAC | 23985 | 0.0 | 30.193544 | 10-11 |
GTGGATA | 24335 | 0.0 | 28.904688 | 36-37 |
CGTCTGA | 25000 | 0.0 | 28.7872 | 16-17 |
GGATATC | 25050 | 0.0 | 28.194153 | 38-39 |
GCGTATT | 8690 | 0.0 | 27.980597 | 2 |
CCAGTCA | 26810 | 0.0 | 26.836525 | 26-27 |
ATATCTC | 26350 | 0.0 | 26.72197 | 40-41 |
GAGTGGA | 26775 | 0.0 | 26.59864 | 34-35 |
ACATGCG | 7600 | 0.0 | 26.429617 | 4 |
CCGTCTT | 27040 | 0.0 | 26.187593 | 52-53 |