Basic Statistics
Measure | Value |
---|---|
Filename | HTVCNBGX9_n01_undetermined.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 25498540 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 52 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG | 2375407 | 9.315854946989122 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTAT | 27779 | 0.10894349245094033 | TruSeq Adapter, Index 2 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGAGCAC | 33500 | 0.0 | 47.28548 | 8 |
CGGAAGA | 35195 | 0.0 | 45.390503 | 4 |
TCGGAAG | 34920 | 0.0 | 45.326176 | 3 |
ATCGGAA | 35325 | 0.0 | 45.088146 | 2 |
GATCGGA | 35050 | 0.0 | 44.338985 | 1 |
TATCTCG | 6810 | 0.0 | 34.822517 | 40-41 |
CTCGTAT | 10010 | 0.0 | 34.500633 | 44-45 |
GAGCACA | 46585 | 0.0 | 34.41966 | 9 |
TCTCGTA | 10560 | 0.0 | 34.35249 | 42-43 |
ATGCCGT | 12155 | 0.0 | 32.843063 | 48-49 |
AAGAGCA | 51985 | 0.0 | 31.727116 | 7 |
CGTCTGA | 24975 | 0.0 | 31.071121 | 16-17 |
ACGTCTG | 25220 | 0.0 | 30.655083 | 14-15 |
ACACGTC | 25525 | 0.0 | 30.266888 | 12-13 |
CACGTCT | 25450 | 0.0 | 30.247864 | 14-15 |
GAAGAGC | 54565 | 0.0 | 29.66894 | 6 |
CACACGT | 26610 | 0.0 | 29.272867 | 12-13 |
ATCTCGT | 11725 | 0.0 | 28.93333 | 42-43 |
GGAAGAG | 58780 | 0.0 | 27.847414 | 5 |
GCACACG | 28350 | 0.0 | 27.485598 | 10-11 |