Basic Statistics
Measure | Value |
---|---|
Filename | HGHC5BGX9_n01_AHm13.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 11878160 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTAATGCGCATCTCGTAT | 109468 | 0.9215905493780182 | TruSeq Adapter, Index 3 (97% over 36bp) |
CAAAGACATAATGGATTCCAACACTGTGTCAAGTTTCCAGGTAGATTGCT | 15246 | 0.12835321295554195 | No Hit |
CAACGATTCAAGTGACCCGCTTGTTGTTGCCGCGAGTATCATTGGGATCT | 12329 | 0.10379553735595413 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTATGC | 14590 | 0.0 | 44.568806 | 46-47 |
CTCGTAT | 13835 | 0.0 | 44.511814 | 44-45 |
TATGCCG | 14760 | 0.0 | 44.48992 | 48-49 |
TAATGCG | 17525 | 0.0 | 44.05705 | 34-35 |
ATCTCGT | 14625 | 0.0 | 42.350998 | 42-43 |
TCGTATG | 13735 | 0.0 | 42.103893 | 44-45 |
GCGCATC | 17230 | 0.0 | 41.84731 | 38-39 |
ATGCGCA | 18505 | 0.0 | 41.274483 | 36-37 |
CGCGTAT | 1345 | 0.0 | 40.612293 | 44-45 |
TCTCGTA | 14280 | 0.0 | 40.26415 | 42-43 |
GAGCACA | 36530 | 0.0 | 39.878994 | 9 |
AATGCGC | 18005 | 0.0 | 39.78274 | 34-35 |
CCGTCTT | 15400 | 0.0 | 39.55723 | 52-53 |
CGGAAGA | 37390 | 0.0 | 38.771683 | 4 |
TGCGCAT | 17960 | 0.0 | 38.771473 | 36-37 |
AGAGCAC | 38065 | 0.0 | 38.37067 | 8 |
TCGGAAG | 38165 | 0.0 | 38.109463 | 3 |
GCCGTCT | 14755 | 0.0 | 37.84156 | 50-51 |
TGCCGTC | 16680 | 0.0 | 37.81705 | 50-51 |
CATCTCG | 15710 | 0.0 | 37.581657 | 40-41 |