Basic Statistics
Measure | Value |
---|---|
Filename | H5YHGBGXY_n01_crf4roots_11.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 16805061 |
Sequences flagged as poor quality | 0 |
Sequence length (bp) | 76 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGG | 40285 | 0.2397194511819981 | No Hit |
GGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACA | 29391 | 0.17489374183170175 | No Hit |
GCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTG | 18694 | 0.11124029838392137 | No Hit |
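
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics, times 100. The report does not state this formula, so it is an assumption here, but the figures are consistent with it. A minimal sketch to check this, where the helper name `percent_of_total` is hypothetical and not part of any tool:

```python
# Total Sequences, copied from the Basic Statistics table.
TOTAL_SEQUENCES = 16805061


def percent_of_total(count: int, total: int = TOTAL_SEQUENCES) -> float:
    """Return `count` as a percentage of `total` (hypothetical helper)."""
    return 100.0 * count / total


# Counts copied from the Overrepresented sequences table, in row order.
counts = [40285, 29391, 18694]
percentages = [percent_of_total(c) for c in counts]

for count, pct in zip(counts, percentages):
    print(f"{count}\t{pct:.4f}%")
```

Together the three listed sequences account for roughly half a percent of all reads.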
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CACGAGT | 17895 | 0.0 | 16.81947 | 68 |
ACGAGTT | 21075 | 0.0 | 14.347841 | 69 |
GGCCACG | 23490 | 0.0 | 12.947662 | 65 |
CCACGAG | 23670 | 0.0 | 12.908114 | 67 |
GCGGGGA | 3110 | 0.0 | 12.837972 | 1 |
GCCACGA | 23775 | 0.0 | 12.821667 | 66 |
GCGAGGA | 13895 | 0.0 | 12.300229 | 1 |
GTATGCC | 1785 | 0.0 | 12.157029 | 47 |
AACGGCC | 25720 | 0.0 | 11.852242 | 62 |
GGATAAC | 26510 | 0.0 | 11.850444 | 1 |
ACGGCCA | 25825 | 0.0 | 11.831158 | 63 |
CGGCCAC | 26550 | 0.0 | 11.481754 | 64 |
TGAACGG | 26535 | 0.0 | 11.448504 | 60 |
GAACGGC | 26455 | 0.0 | 11.430207 | 61 |
ACATGGC | 27285 | 0.0 | 11.327734 | 6 |
GGGGCCC | 680 | 0.0 | 11.322808 | 70 |
GTGAACG | 26880 | 0.0 | 11.301565 | 59 |
GGCCATC | 26825 | 0.0 | 11.272415 | 10 |
CGCGGGG | 1170 | 0.0 | 11.075614 | 1 |
ATGCGCT | 27185 | 0.0 | 11.020738 | 29 |