Basic Statistics
Measure | Value |
---|---|
Filename | HGVKWBGX2_n01_sl40_bzip1ab_c01.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 19726837 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATCACGATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAGG | 1722254 | 8.7305126513693 | TruSeq Adapter, Index 1 (100% over 63bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATCACGATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAAG | 133509 | 0.6767886813278784 | TruSeq Adapter, Index 1 (100% over 63bp) |
CCCCCACCCACGAGGAGCATCGTGGAAAAAGAAGACGTTCCAACCACGTCTTCAAAGCAAGTGGATTGATGTGAT | 27224 | 0.13800489150896314 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GCCGTCT | 223905 | 0.0 | 67.48135 | 49 |
GTATGCC | 224520 | 0.0 | 67.46655 | 45 |
CGTATGC | 224865 | 0.0 | 67.46617 | 44 |
TATGCCG | 224370 | 0.0 | 67.46006 | 46 |
ATGCCGT | 224360 | 0.0 | 67.446495 | 47 |
TCTCGTA | 220105 | 0.0 | 67.44127 | 41 |
TGCCGTC | 224200 | 0.0 | 67.42761 | 48 |
ATCTCGT | 219730 | 0.0 | 67.38752 | 40 |
CACGATC | 214720 | 0.0 | 67.36095 | 36 |
CCGTCTT | 224145 | 0.0 | 67.34771 | 50 |
TGCTTGA | 224595 | 0.0 | 67.3394 | 58 |
GCTTGAA | 224625 | 0.0 | 67.32597 | 59 |
TCACGAT | 218060 | 0.0 | 67.292854 | 35 |
CTTGAAA | 225885 | 0.0 | 67.170044 | 60 |
ATCACGA | 219400 | 0.0 | 67.1581 | 34 |
GTCACAT | 225720 | 0.0 | 66.989174 | 29 |
TCTGAAC | 233025 | 0.0 | 66.30507 | 18 |
CGTCTGA | 233645 | 0.0 | 66.11795 | 16 |
ACGTCTG | 233985 | 0.0 | 66.01826 | 15 |
TCGTATG | 231440 | 0.0 | 65.79827 | 43 |