Basic Statistics
Measure | Value |
---|---|
Filename | HGHK3BGX9_n01_AHm21.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 21874632 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGGCTATGATCTCGTAT | 229262 | 1.0480724887166102 | TruSeq Adapter, Index 7 (97% over 36bp) |
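
The Percentage column appears to be the sequence count divided by Total Sequences from Basic Statistics. A minimal sketch to reproduce it, using only the numbers in this report (the function name `percent_of_total` is hypothetical, not part of FastQC):

```python
# Values copied from this report.
total_sequences = 21874632   # "Total Sequences" in Basic Statistics
adapter_count = 229262       # "Count" for the overrepresented sequence


def percent_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (hypothetical helper)."""
    return 100.0 * count / total


percentage = percent_of_total(adapter_count, total_sequences)
print(f"{percentage:.4f}%")
```

The result agrees with the reported 1.0480724887166102 to within floating-point rounding, so roughly 1 read in 95 begins with this adapter-like sequence.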
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 27985 | 0.0 | 44.16554 | 48-49 |
CGTATGC | 28230 | 0.0 | 43.81003 | 46-47 |
CTCGTAT | 27815 | 0.0 | 43.328457 | 44-45 |
TCGTATG | 27890 | 0.0 | 41.91746 | 44-45 |
ATCTCGT | 28885 | 0.0 | 41.22328 | 42-43 |
TCTCGTA | 28300 | 0.0 | 39.96032 | 42-43 |
CGGCTAT | 30875 | 0.0 | 39.907616 | 34-35 |
GCCGTCT | 30985 | 0.0 | 37.795425 | 50-51 |
CCGTCTT | 32805 | 0.0 | 37.65308 | 52-53 |
TCACCGG | 32975 | 0.0 | 37.64842 | 30-31 |
TGCCGTC | 33020 | 0.0 | 37.472816 | 50-51 |
GAGCACA | 63125 | 0.0 | 37.367027 | 9 |
GTATGCC | 31485 | 0.0 | 37.22891 | 46-47 |
ATGCCGT | 31585 | 0.0 | 37.101364 | 48-49 |
TATGATC | 32480 | 0.0 | 36.76848 | 38-39 |
ACCGGCT | 34285 | 0.0 | 36.205624 | 32-33 |
ATCGGAA | 65460 | 0.0 | 35.656067 | 2 |
GTCACCG | 33045 | 0.0 | 35.507175 | 28-29 |
AGAGCAC | 66460 | 0.0 | 35.449127 | 8 |
CGGAAGA | 66830 | 0.0 | 35.089474 | 4 |
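
Many of the k-mers above look like fragments of the overrepresented sequence listed earlier. The sketch below is one way to check this, assuming a plain substring search is an adequate test. The variable names are illustrative, and the sequences are copied from the two tables in this report.

```python
# Overrepresented sequence from this report (50 bases).
adapter = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGGCTATGATCTCGTAT"

# The 20 k-mers from the Kmer Content table, in table order.
kmers = [
    "TATGCCG", "CGTATGC", "CTCGTAT", "TCGTATG", "ATCTCGT",
    "TCTCGTA", "CGGCTAT", "GCCGTCT", "CCGTCTT", "TCACCGG",
    "TGCCGTC", "GAGCACA", "GTATGCC", "ATGCCGT", "TATGATC",
    "ACCGGCT", "ATCGGAA", "GTCACCG", "AGAGCAC", "CGGAAGA",
]

# 1-based start position of each k-mer that occurs inside the sequence.
inside = {k: adapter.find(k) + 1 for k in kmers if k in adapter}

# K-mers with no match inside the 50 reported bases.
outside = [k for k in kmers if k not in adapter]

for kmer, pos in sorted(inside.items(), key=lambda item: item[1]):
    print(f"{kmer} starts at base {pos} of the overrepresented sequence")
print(f"{len(outside)} k-mers not found in the 50-base sequence: {outside}")
```

For the k-mers that match, the computed start position falls on or within the Max Obs/Exp Position given in the table (for example, ATCGGAA at base 2 and GAGCACA at base 9). The remaining k-mers peak at positions 44 and later, so they may extend past the 50 bases shown in the Overrepresented sequences table. This check cannot confirm their origin.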