Basic Statistics
Measure | Value |
---|---|
Filename | HGVKWBGX2_n01_sl40_bzip1ab_i16.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 24235548 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAGG | 1079977 | 4.456169095082975 | TruSeq Adapter, Index 16 (98% over 63bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAAG | 92768 | 0.38277657266095244 | TruSeq Adapter, Index 16 (98% over 63bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CATCTCG | 135950 | 0.0 | 67.431076 | 39 |
GCCGTCT | 138795 | 0.0 | 67.35584 | 49 |
CGTATGC | 140345 | 0.0 | 67.33532 | 44 |
TATGCCG | 139910 | 0.0 | 67.3214 | 46 |
TCTCGTA | 137860 | 0.0 | 67.30273 | 41 |
GTATGCC | 140235 | 0.0 | 67.29726 | 45 |
ATGCCGT | 139690 | 0.0 | 67.2651 | 47 |
ATCTCGT | 138020 | 0.0 | 67.246506 | 40 |
TGCCGTC | 139445 | 0.0 | 67.23359 | 48 |
CCGTCCA | 136845 | 0.0 | 67.18525 | 34 |
CCGTCTT | 139235 | 0.0 | 67.08586 | 50 |
CCCGTCC | 137280 | 0.0 | 67.057945 | 33 |
TGCTTGA | 139250 | 0.0 | 67.04203 | 58 |
GCTTGAA | 139650 | 0.0 | 66.996315 | 59 |
CACCCGT | 139695 | 0.0 | 66.96773 | 31 |
TCACCCG | 140200 | 0.0 | 66.951645 | 30 |
ACCCGTC | 139040 | 0.0 | 66.93625 | 32 |
GTCACCC | 141305 | 0.0 | 66.80527 | 29 |
AGTCACC | 142860 | 0.0 | 66.78777 | 28 |
CTTGAAA | 140545 | 0.0 | 66.71031 | 60 |