Basic Statistics
Measure | Value |
---|---|
Filename | HHGHNAFX2_n01_DS2219-1.dk.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 6396295 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGATGTATCGGGGGTG | 24289 | 0.37973545622895755 | TruSeq Adapter, Index 2 (97% over 42bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGATGTATCGGGGATG | 10916 | 0.17066129689140352 | TruSeq Adapter, Index 2 (97% over 42bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGATGTATCGGGGTTG | 9385 | 0.14672556534681405 | TruSeq Adapter, Index 2 (97% over 42bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGATGTATCGCGTATG | 7501 | 0.1172710139229038 | TruSeq Adapter, Index 2 (97% over 49bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATGCCGG | 3545 | 0.0 | 42.65016 | 48 |
GGGTGGA | 7180 | 0.0 | 40.117012 | 60 |
GCGTGGA | 3010 | 0.0 | 39.53353 | 60 |
CGGGGAT | 2255 | 0.0 | 38.49092 | 43 |
TCGGGGA | 2305 | 0.0 | 38.26333 | 42 |
AGTCGGA | 1480 | 0.0 | 37.864956 | 1 |
TCGCGTG | 1230 | 0.0 | 36.9906 | 48 |
GGGCGTG | 2975 | 0.0 | 36.82227 | 58 |
TGCCGGG | 3700 | 0.0 | 36.22849 | 49 |
CGCGGAT | 1590 | 0.0 | 36.09939 | 43 |
GCCGGGG | 3620 | 0.0 | 35.96562 | 50 |
CGCGTGT | 1260 | 0.0 | 35.554337 | 49 |
CCGGGGT | 3485 | 0.0 | 35.249866 | 51 |
TCGCGGA | 1650 | 0.0 | 35.210915 | 42 |
TGGCGGG | 8010 | 0.0 | 34.43078 | 49 |
CGGGGTG | 9945 | 0.0 | 34.382935 | 52 |
GATGCCG | 2450 | 0.0 | 33.998837 | 47 |
GGATGCC | 2675 | 0.0 | 33.75585 | 46 |
AAAAAGG | 14945 | 0.0 | 33.58199 | 70 |
GTGGAAA | 12325 | 0.0 | 33.394394 | 62 |