Basic Statistics
Measure | Value |
---|---|
Filename | H3HF7AFXY_n01_dna2-1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 12230949 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTTAGGCATCTCGTATGC | 207865 | 1.6995001777866952 | TruSeq Adapter, Index 3 (100% over 50bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACTTAGGCATCTCGTATG | 18884 | 0.1543952149583814 | TruSeq Adapter, Index 3 (100% over 49bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTGCTTG | 32385 | 0.0 | 50.363758 | 57 |
TGCCGTC | 34745 | 0.0 | 48.40434 | 48 |
GCCGTCT | 34255 | 0.0 | 48.176342 | 49 |
CCGTCTT | 34615 | 0.0 | 47.97845 | 50 |
AAAAGGG | 28640 | 0.0 | 47.892323 | 70 |
TATGCCG | 35785 | 0.0 | 47.51579 | 46 |
ATGCCGT | 35690 | 0.0 | 47.398064 | 47 |
TGCTTGA | 34725 | 0.0 | 47.221897 | 58 |
CATCTCG | 34390 | 0.0 | 46.080627 | 39 |
GCTTGAA | 36530 | 0.0 | 45.61678 | 59 |
GTATGCC | 37760 | 0.0 | 45.160294 | 45 |
GCATCTC | 35295 | 0.0 | 45.017696 | 38 |
AGGCATC | 36960 | 0.0 | 44.48583 | 36 |
TAGGCAT | 38415 | 0.0 | 44.295002 | 35 |
GGCATCT | 36045 | 0.0 | 44.24606 | 37 |
CGTATGC | 38780 | 0.0 | 43.92735 | 44 |
ACTTAGG | 41135 | 0.0 | 43.654892 | 32 |
CTTAGGC | 41030 | 0.0 | 43.630127 | 33 |
CGTCTTC | 38100 | 0.0 | 43.507015 | 51 |
CTCGTAT | 37880 | 0.0 | 43.446075 | 42 |