Basic Statistics
Measure | Value |
---|---|
Filename | H5YHGBGXY_n01_crf4roots_01.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 20752854 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGG | 81168 | 0.39111728921718425 | No Hit |
GGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACA | 51300 | 0.24719491593782716 | No Hit |
GCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTG | 30524 | 0.14708338429018003 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CACGAGT | 30190 | 0.0 | 20.345345 | 68 |
ACGAGTT | 33295 | 0.0 | 18.468794 | 69 |
TATGCCG | 2425 | 0.0 | 18.041052 | 48 |
GCCACGA | 38430 | 0.0 | 16.119549 | 66 |
GGCCACG | 38430 | 0.0 | 16.028631 | 65 |
CCACGAG | 38530 | 0.0 | 15.986877 | 67 |
CGTATGC | 2895 | 0.0 | 14.991138 | 46 |
GTATGCC | 3015 | 0.0 | 14.974936 | 47 |
GGATAAC | 42865 | 0.0 | 14.787322 | 1 |
ACGGCCA | 42635 | 0.0 | 14.546235 | 63 |
AACGGCC | 43210 | 0.0 | 14.287868 | 62 |
CGGCCAC | 43485 | 0.0 | 14.165351 | 64 |
GAACGGC | 44205 | 0.0 | 13.902658 | 61 |
GCGCGCT | 1790 | 0.0 | 13.694897 | 1 |
TGAACGG | 45115 | 0.0 | 13.583478 | 60 |
GTGAACG | 45240 | 0.0 | 13.5459795 | 59 |
ACATGGC | 47765 | 0.0 | 13.20578 | 6 |
CGTGAAC | 46885 | 0.0 | 13.123087 | 58 |
CATGGCC | 47635 | 0.0 | 13.092644 | 7 |
CGAGTTC | 48435 | 0.0 | 13.02094 | 70 |