Basic Statistics
Measure | Value |
---|---|
Filename | H2GJMBGX2_n01_mb20.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 22126455 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATCTCGTATGC | 2912420 | 13.162614616756278 | TruSeq Adapter, Index 9 (100% over 50bp) |
GATCGGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATCTCGTATG | 71182 | 0.32170539745295845 | TruSeq Adapter, Index 9 (100% over 45bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGCTCTCGTATGC | 34221 | 0.15466101551287811 | TruSeq Adapter, Index 9 (98% over 50bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATATCGTATGC | 26736 | 0.12083273167798458 | TruSeq Adapter, Index 9 (98% over 50bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGGA | 329065 | 0.0 | 43.460148 | 1 |
ATCGGAA | 334460 | 0.0 | 43.41391 | 2 |
CGTATGC | 322465 | 0.0 | 43.391422 | 44 |
TCGGAAG | 333630 | 0.0 | 43.30394 | 3 |
CGGAAGA | 333380 | 0.0 | 43.22901 | 4 |
ACACGTC | 342570 | 0.0 | 41.79208 | 13 |
GCACACG | 343350 | 0.0 | 41.726337 | 11 |
CACACGT | 343830 | 0.0 | 41.65502 | 12 |
ACGTCTG | 342600 | 0.0 | 41.64524 | 15 |
CGTCTGA | 342075 | 0.0 | 41.62737 | 16 |
CACGTCT | 343475 | 0.0 | 41.61948 | 14 |
GAAGAGC | 347715 | 0.0 | 41.45148 | 6 |
GATCGGG | 8815 | 0.0 | 41.305386 | 1 |
GAGCACA | 348565 | 0.0 | 41.19951 | 9 |
GTCACGA | 340330 | 0.0 | 41.139603 | 29 |
AGCACAC | 348970 | 0.0 | 41.11235 | 10 |
ACGATCA | 340220 | 0.0 | 41.10709 | 32 |
GATCTCG | 327265 | 0.0 | 41.106506 | 39 |
CGATCAG | 339125 | 0.0 | 41.10489 | 33 |
AGTCACG | 341255 | 0.0 | 41.102226 | 28 |