Basic Statistics
Measure | Value |
---|---|
Filename | H5CCJBGXG_n01_AKM102.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 15981996 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 39 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTTGTAATCTCGTATGC | 2009999 | 12.57664562048445 | TruSeq Adapter, Index 12 (100% over 50bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTTGTAATATCGTATGC | 23105 | 0.14456892618418876 | TruSeq Adapter, Index 12 (98% over 50bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTTGTAATGTCGTATGC | 20377 | 0.127499719058871 | TruSeq Adapter, Index 12 (98% over 50bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTATGC | 225465 | 0.0 | 68.093796 | 44 |
GTATGCC | 225725 | 0.0 | 68.071396 | 45 |
CTCGTAT | 219740 | 0.0 | 68.01836 | 42 |
TCGTATG | 225105 | 0.0 | 67.99745 | 43 |
TATGCCG | 225405 | 0.0 | 67.986084 | 46 |
GCCGTCT | 223620 | 0.0 | 67.86694 | 49 |
ATGCCGT | 225265 | 0.0 | 67.84787 | 47 |
CCTTGTA | 227900 | 0.0 | 67.83334 | 33 |
AGTCACC | 231765 | 0.0 | 67.76958 | 28 |
GTCACCT | 230595 | 0.0 | 67.7598 | 29 |
TGCCGTC | 225170 | 0.0 | 67.731514 | 48 |
CACCTTG | 230000 | 0.0 | 67.66183 | 31 |
GTAATCT | 221260 | 0.0 | 67.6154 | 37 |
ACCTTGT | 229045 | 0.0 | 67.571754 | 32 |
ACGTCTG | 235085 | 0.0 | 67.54666 | 15 |
CAGTCAC | 233690 | 0.0 | 67.53332 | 27 |
TCTCGTA | 221720 | 0.0 | 67.48902 | 41 |
CGTCTGA | 235310 | 0.0 | 67.473366 | 16 |
CCGTCTT | 224985 | 0.0 | 67.47096 | 50 |
CCAGTCA | 233275 | 0.0 | 67.455864 | 26 |