Basic Statistics
Measure | Value |
---|---|
Filename | HH5H2BGX9_n01_KO_DI222_M1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 18891547 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 49 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATTACTCGATCTCGTAT | 131659 | 0.6969201622291705 | TruSeq Adapter, Index 27 (97% over 39bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 17645 | 0.0 | 42.1784 | 48-49 |
CTCGTAT | 17280 | 0.0 | 40.579742 | 44-45 |
TCGTATG | 18540 | 0.0 | 40.152157 | 44-45 |
CGTATGC | 18725 | 0.0 | 39.881702 | 46-47 |
TACTCGA | 18630 | 0.0 | 39.422874 | 36-37 |
ACTCGAT | 18160 | 0.0 | 38.822315 | 36-37 |
TCTCGTA | 18070 | 0.0 | 37.63658 | 42-43 |
ATGCCGT | 20450 | 0.0 | 36.323395 | 48-49 |
TTACTCG | 20815 | 0.0 | 36.162586 | 34-35 |
ATCTCGT | 18695 | 0.0 | 35.692677 | 42-43 |
GCCGTCT | 21210 | 0.0 | 35.049057 | 50-51 |
TGCCGTC | 21275 | 0.0 | 34.975456 | 50-51 |
TCGATCT | 19180 | 0.0 | 34.76537 | 38-39 |
CGATCTC | 19315 | 0.0 | 34.53481 | 40-41 |
CTCGATC | 19470 | 0.0 | 34.418236 | 38-39 |
GTATGCC | 21930 | 0.0 | 34.009823 | 46-47 |
CCGTCTT | 21960 | 0.0 | 33.79552 | 52-53 |
GATCTCG | 20090 | 0.0 | 33.344368 | 40-41 |
ATTACTC | 23600 | 0.0 | 31.93533 | 34-35 |
ACATTAC | 23800 | 0.0 | 31.666842 | 32-33 |