Basic Statistics
Measure | Value |
---|---|
Filename | H5YHGBGXY_n01_crf4roots_05.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 21808299 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGG | 63294 | 0.2902289628365789 | No Hit |
GGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACA | 42165 | 0.19334382750346554 | No Hit |
GCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTG | 24419 | 0.1119711353920817 | No Hit |
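
The Percentage column above appears to be Count divided by Total Sequences (21808299, from Basic Statistics), and the three sequences look like shifted windows over one underlying template. The following is a minimal sketch that checks both observations using only values from this report; the helper names `percentage` and `shift_between` are illustrative and not part of FastQC.

```python
# Values copied from this report; helper names are hypothetical.
TOTAL_SEQUENCES = 21_808_299  # "Total Sequences" in Basic Statistics

OVERREPRESENTED = [
    ("GGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGG", 63294),
    ("GGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACA", 42165),
    ("GCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTG", 24419),
]


def percentage(count, total):
    """Share of all reads, in percent."""
    return 100.0 * count / total


def shift_between(upstream, downstream):
    """Smallest shift s >= 1 such that upstream[s:] is a prefix of downstream.

    Returns None if the two sequences do not overlap this way.
    """
    for s in range(1, len(upstream)):
        if downstream.startswith(upstream[s:]):
            return s
    return None


for seq, count in OVERREPRESENTED:
    print(f"{seq[:12]}...  {count:>6}  {percentage(count, TOTAL_SEQUENCES):.4f}%")

# Compare each sequence with the one listed before it (rows run
# downstream-to-upstream in the table, so walk the list in reverse).
for (up, _), (down, _) in zip(OVERREPRESENTED[::-1], OVERREPRESENTED[-2::-1]):
    print(f"{up[:12]}... -> {down[:12]}...  shift = {shift_between(up, down)}")
```

If the shifts come back as small integers, the three rows are consistent with reads starting at neighbouring positions on the same abundant template. That does not identify the template; FastQC itself reports "No Hit" for all three.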
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CACGAGT | 27400 | 0.0 | 17.767654 | 68 |
GTCCGCA | 2025 | 0.0 | 16.246965 | 34 |
ACGAGTT | 31140 | 0.0 | 15.656081 | 69 |
CGTCCGC | 2065 | 0.0 | 15.593127 | 33 |
GGCCACG | 34090 | 0.0 | 14.342468 | 65 |
GCCACGA | 34735 | 0.0 | 14.156684 | 66 |
CCACGAG | 35170 | 0.0 | 14.011441 | 67 |
GGATAAC | 36110 | 0.0 | 13.830516 | 1 |
TATGCCG | 2545 | 0.0 | 13.752374 | 48 |
CGCACAT | 2515 | 0.0 | 13.220869 | 37 |
ACGTCCG | 2490 | 0.0 | 13.072031 | 32 |
AACGGCC | 37245 | 0.0 | 13.052321 | 62 |
ACGGCCA | 37505 | 0.0 | 12.999135 | 63 |
GAACGGC | 37910 | 0.0 | 12.804753 | 61 |
GTGAACG | 38860 | 0.0 | 12.581839 | 59 |
CGGCCAC | 39310 | 0.0 | 12.491314 | 64 |
TGAACGG | 39075 | 0.0 | 12.467769 | 60 |
CCGCACA | 2630 | 0.0 | 12.376608 | 36 |
ACATGGC | 40080 | 0.0 | 12.357851 | 6 |
CGCGGGG | 1505 | 0.0 | 12.333484 | 1 |
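
Many of the 7-mers above peak at consecutive positions and overlap by six bases, which suggests they come from a few longer enriched motifs. The sketch below chains such k-mers into longer stretches. It assumes the "Max Obs/Exp Position" values are single base positions rather than binned ranges, and `chain_kmers` is a hypothetical helper, not a FastQC function.

```python
# (k-mer, Max Obs/Exp Position) pairs copied from the Kmer Content table.
KMERS = [
    ("CACGAGT", 68), ("GTCCGCA", 34), ("ACGAGTT", 69), ("CGTCCGC", 33),
    ("GGCCACG", 65), ("GCCACGA", 66), ("CCACGAG", 67), ("GGATAAC", 1),
    ("TATGCCG", 48), ("CGCACAT", 37), ("ACGTCCG", 32), ("AACGGCC", 62),
    ("ACGGCCA", 63), ("GAACGGC", 61), ("GTGAACG", 59), ("CGGCCAC", 64),
    ("TGAACGG", 60), ("CCGCACA", 36), ("ACATGGC", 6), ("CGCGGGG", 1),
]


def chain_kmers(kmers):
    """Merge k-mers that sit at consecutive positions and overlap by k-1 bases.

    Returns a list of dicts with the start position, the position of the
    last merged k-mer, and the merged sequence.
    """
    contigs = []
    for kmer, pos in sorted(kmers, key=lambda kp: kp[1]):
        for contig in contigs:
            next_pos = contig["end"] + 1
            overlap = contig["seq"][-(len(kmer) - 1):]
            if pos == next_pos and overlap == kmer[:-1]:
                contig["seq"] += kmer[-1]  # extend by the one new base
                contig["end"] = pos
                break
        else:
            # No existing stretch can be extended: start a new one.
            contigs.append({"start": pos, "end": pos, "seq": kmer})
    return contigs


for contig in chain_kmers(KMERS):
    print(f'{contig["start"]:>2}-{contig["end"]:<2}  {contig["seq"]}')
```

K-mers that share a start position or have no neighbour at the next position are kept as separate single-k-mer entries, so every row of the table appears in the output.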