Basic Statistics
Measure | Value |
---|---|
Filename | HGGHHBGX2_n01_9s-19.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 11114235 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTTTCGGAATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 93719 | 0.8432339247820475 | TruSeq Adapter, Index 21 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 12325 | 0.0 | 62.67401 | 48 |
CGTATGC | 12470 | 0.0 | 61.943295 | 46 |
GTATGCC | 12720 | 0.0 | 60.917355 | 47 |
TCGTATG | 12735 | 0.0 | 59.97465 | 45 |
ATGCCGT | 12805 | 0.0 | 59.73353 | 49 |
AGTCACG | 14000 | 0.0 | 59.212425 | 28 |
ACGTCTG | 17005 | 0.0 | 58.60657 | 15 |
GTCACGT | 14045 | 0.0 | 58.506905 | 29 |
CGTTTCG | 13955 | 0.0 | 58.21626 | 33 |
GCACACG | 17130 | 0.0 | 58.037945 | 11 |
CTCGTAT | 12995 | 0.0 | 57.659214 | 44 |
GCCGTCT | 13055 | 0.0 | 57.50819 | 51 |
CACACGT | 17520 | 0.0 | 56.80507 | 12 |
CACGTCT | 17735 | 0.0 | 56.17453 | 14 |
CACGTTT | 14525 | 0.0 | 56.145447 | 31 |
GTTTCGG | 14885 | 0.0 | 54.764862 | 34 |
AGCACAC | 18385 | 0.0 | 54.16972 | 10 |
ACACGTC | 18495 | 0.0 | 53.792072 | 13 |
TCCAGTC | 18260 | 0.0 | 53.690895 | 25 |
ACTCCAG | 18505 | 0.0 | 53.389687 | 23 |