Basic Statistics
Measure | Value |
---|---|
Filename | HGHC5BGX9_n01_AHm11.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 30213640 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTAATGCGCATCTCGTAT | 75661 | 0.25042000897607836 | TruSeq Adapter, Index 3 (97% over 36bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 11650 | 0.0 | 37.978607 | 48-49 |
CGTATGC | 11540 | 0.0 | 37.80554 | 46-47 |
TAATGCG | 13740 | 0.0 | 37.56033 | 34-35 |
TCGTATG | 10765 | 0.0 | 36.13687 | 44-45 |
CTCGTAT | 11840 | 0.0 | 35.563763 | 44-45 |
AATGCGC | 14910 | 0.0 | 32.303288 | 34-35 |
ATCTCGT | 14005 | 0.0 | 30.523901 | 42-43 |
TCTCGTA | 13010 | 0.0 | 30.19318 | 42-43 |
ATGCGCA | 17260 | 0.0 | 29.693666 | 36-37 |
GCGCATC | 16910 | 0.0 | 28.91758 | 38-39 |
TGCGCAT | 17145 | 0.0 | 27.482569 | 36-37 |
CGCGTAT | 1230 | 0.0 | 25.680128 | 44-45 |
CTAATGC | 19305 | 0.0 | 25.145876 | 32-33 |
GTATGCC | 16830 | 0.0 | 24.864166 | 46-47 |
CCGTCTT | 16650 | 0.0 | 24.833527 | 52-53 |
ATGCCGT | 16850 | 0.0 | 23.974884 | 48-49 |
GCCGTCT | 16110 | 0.0 | 23.395542 | 50-51 |
CATCTCG | 17605 | 0.0 | 23.13538 | 40-41 |
ACTAATG | 22625 | 0.0 | 23.114494 | 32-33 |
CGCATCT | 18625 | 0.0 | 22.786493 | 38-39 |