Basic Statistics
Measure | Value |
---|---|
Filename | HMJHVBGXB_n01_undetermined.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 35164198 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG | 819899 | 2.3316300289288554 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTAT | 309531 | 0.8802447307343679 | TruSeq Adapter, Index 2 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGAGCAC | 101750 | 0.0 | 61.407593 | 8 |
CGGAAGA | 108880 | 0.0 | 57.722767 | 4 |
TCGGAAG | 108945 | 0.0 | 57.527866 | 3 |
ATCGGAA | 110295 | 0.0 | 56.84589 | 2 |
GATCGGA | 109125 | 0.0 | 55.256004 | 1 |
GAGCACA | 115660 | 0.0 | 54.02644 | 9 |
AAGAGCA | 126645 | 0.0 | 49.876194 | 7 |
GAAGAGC | 131050 | 0.0 | 47.92946 | 6 |
GGAAGAG | 141525 | 0.0 | 44.640358 | 5 |
TATCTCG | 41630 | 0.0 | 42.9608 | 40-41 |
CTCGTAT | 49950 | 0.0 | 42.39876 | 44-45 |
TCTCGTA | 50345 | 0.0 | 41.942257 | 42-43 |
ATGCCGT | 55325 | 0.0 | 40.738583 | 48-49 |
CGCTCAT | 49015 | 0.0 | 39.354256 | 34-35 |
ATCTCGT | 53225 | 0.0 | 39.079433 | 42-43 |
CACCGCT | 49215 | 0.0 | 38.605015 | 30-31 |
CGTCTGA | 80900 | 0.0 | 38.195797 | 16-17 |
CCGCTCA | 50515 | 0.0 | 38.000443 | 32-33 |
TCACCGC | 50240 | 0.0 | 37.982845 | 30-31 |
GTCACCG | 50675 | 0.0 | 37.85032 | 28-29 |