Basic Statistics
Measure | Value |
---|---|
Filename | HVHYTBGX5_n01_ir640_4bw_12_1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 14212923 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 53 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGAAACGATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 31913 | 0.22453509387196424 | TruSeq Adapter, Index 19 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GTCACGT | 9675 | 0.0 | 56.482445 | 29 |
CTCGTAT | 9275 | 0.0 | 55.68328 | 44 |
AGTCACG | 9900 | 0.0 | 55.33854 | 28 |
TCGTATG | 9905 | 0.0 | 54.82336 | 45 |
CGTATGC | 10070 | 0.0 | 53.78821 | 46 |
TATGCCG | 10095 | 0.0 | 53.17581 | 48 |
ACGTCTG | 10755 | 0.0 | 51.675934 | 15 |
CACGTGA | 11010 | 0.0 | 49.790432 | 31 |
CACGTCT | 11210 | 0.0 | 49.732697 | 14 |
TCACGTG | 11150 | 0.0 | 49.258083 | 30 |
TGAAACG | 11130 | 0.0 | 48.974636 | 35 |
GAAACGA | 11130 | 0.0 | 48.540512 | 36 |
GTATGCC | 11365 | 0.0 | 47.567505 | 47 |
CAGTCAC | 11595 | 0.0 | 47.338367 | 27 |
ATGCCGT | 11680 | 0.0 | 46.28433 | 49 |
GCCGTCT | 11820 | 0.0 | 44.451733 | 51 |
GTCTGAA | 13170 | 0.0 | 42.278637 | 17 |
GCTTGAA | 11995 | 0.0 | 42.221195 | 61 |
TCTCGTA | 12180 | 0.0 | 42.147575 | 43 |
CCGTCTT | 12670 | 0.0 | 41.632797 | 52 |