Basic Statistics
Measure | Value |
---|---|
Filename | HGVKWBGX2_n01_sl40_bzip1ab_i18.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 20206293 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 905082 | 4.479208531718312 | TruSeq Adapter, Index 18 (97% over 40bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTCCGCACATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAG | 79270 | 0.3923035264310975 | TruSeq Adapter, Index 18 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTATGC | 117155 | 0.0 | 67.31044 | 46 |
TATGCCG | 116655 | 0.0 | 67.29217 | 48 |
GTATGCC | 117020 | 0.0 | 67.29192 | 47 |
GCCGTCT | 116190 | 0.0 | 67.26871 | 51 |
TCCGCAC | 115850 | 0.0 | 67.26644 | 35 |
TGCCGTC | 116455 | 0.0 | 67.22195 | 50 |
ACGTCCG | 116775 | 0.0 | 67.21058 | 32 |
ATGCCGT | 116785 | 0.0 | 67.20054 | 49 |
CATCTCG | 115225 | 0.0 | 67.173225 | 41 |
CGTCCGC | 116455 | 0.0 | 67.149216 | 33 |
TCTCGTA | 116870 | 0.0 | 67.13379 | 43 |
CACGTCC | 117590 | 0.0 | 67.08768 | 31 |
GTCACGT | 119135 | 0.0 | 67.03227 | 29 |
ATCTCGT | 116535 | 0.0 | 66.971695 | 42 |
TCACGTC | 118435 | 0.0 | 66.931366 | 30 |
CACATCT | 115575 | 0.0 | 66.91012 | 39 |
CCGTCTT | 117175 | 0.0 | 66.87448 | 52 |
TGCTTGA | 115770 | 0.0 | 66.67636 | 60 |
GCTTGAA | 116330 | 0.0 | 66.66928 | 61 |
CTTGAAA | 117580 | 0.0 | 66.31398 | 62 |