Basic Statistics
Measure | Value |
---|---|
Filename | HGVKWBGX2_n01_sl40_bzip1ab_i15.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 28018989 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATGTCAGAATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 1123071 | 4.008249548190336 | TruSeq Adapter, Index 15 (97% over 40bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATGTCAGAATCTCGTATGCCGTCTTCTGCGTGAAAAAAAAAA | 58726 | 0.20959357241619245 | TruSeq Adapter, Index 15 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GTATGCC | 152425 | 0.0 | 67.31866 | 47 |
AATCTCG | 162385 | 0.0 | 67.27694 | 41 |
TCTCGTA | 158075 | 0.0 | 67.27348 | 43 |
CGTATGC | 151890 | 0.0 | 67.25226 | 46 |
GCCGTCT | 145645 | 0.0 | 67.23099 | 51 |
TATGCCG | 148600 | 0.0 | 67.230415 | 48 |
ATGCCGT | 148750 | 0.0 | 67.22096 | 49 |
ATCTCGT | 161155 | 0.0 | 67.21018 | 42 |
TGCCGTC | 147760 | 0.0 | 67.15585 | 50 |
ATGTCAG | 173220 | 0.0 | 67.0321 | 34 |
ACATGTC | 174300 | 0.0 | 66.95486 | 32 |
CCGTCTT | 149570 | 0.0 | 66.93228 | 52 |
GTCACAT | 177650 | 0.0 | 66.83417 | 29 |
GCTTGAA | 139380 | 0.0 | 66.80775 | 61 |
TGCTTGA | 137775 | 0.0 | 66.59742 | 60 |
CTTGAAA | 143810 | 0.0 | 66.42053 | 62 |
TCTGAAC | 184035 | 0.0 | 66.2534 | 18 |
CGTCTGA | 184465 | 0.0 | 66.11485 | 16 |
ACGTCTG | 184445 | 0.0 | 66.09863 | 15 |
ACACGTC | 185375 | 0.0 | 65.95288 | 13 |