Basic Statistics
Measure | Value |
---|---|
Filename | HW7NHBGX9_n01_undetermined.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 33051797 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 51 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG | 2962917 | 8.964465683968712 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATTACTCGATCTCGTAT | 48857 | 0.14781949677350373 | TruSeq Adapter, Index 27 (97% over 39bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGAGCAC | 48455 | 0.0 | 57.56625 | 8 |
TCGGAAG | 52260 | 0.0 | 54.478348 | 3 |
CGGAAGA | 53065 | 0.0 | 53.982788 | 4 |
ATCGGAA | 53540 | 0.0 | 53.53174 | 2 |
GATCGGA | 53290 | 0.0 | 52.56286 | 1 |
GAGCACA | 67385 | 0.0 | 41.67769 | 9 |
AAGAGCA | 72720 | 0.0 | 39.629383 | 7 |
GAAGAGC | 76230 | 0.0 | 37.37918 | 6 |
ACTCGAT | 13170 | 0.0 | 36.172623 | 36-37 |
TCTCGTA | 12470 | 0.0 | 35.62408 | 42-43 |
TACTCGA | 13585 | 0.0 | 34.84035 | 36-37 |
GGAAGAG | 83150 | 0.0 | 34.770313 | 5 |
CTCGTAT | 12790 | 0.0 | 34.698483 | 44-45 |
CGTCTGA | 40440 | 0.0 | 33.68357 | 16-17 |
ATGCCGT | 16015 | 0.0 | 33.58482 | 48-49 |
ACACGTC | 40630 | 0.0 | 33.58217 | 12-13 |
ACGTCTG | 40635 | 0.0 | 33.538376 | 14-15 |
CACGTCT | 40635 | 0.0 | 33.351597 | 14-15 |
CACACGT | 41450 | 0.0 | 33.032257 | 12-13 |
TCGATCT | 12805 | 0.0 | 32.3717 | 38-39 |