Basic Statistics
Measure | Value |
---|---|
Filename | HGGLTBGX9_n01_5_RAV1_BIO_5_R1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 13954515 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGATCTCGTATGC | 435385 | 3.120029610488075 | TruSeq Adapter, Index 5 (100% over 50bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGATGTCGTATGC | 58106 | 0.41639569701992507 | TruSeq Adapter, Index 5 (98% over 50bp) |
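
The Percentage column above appears to be each Count divided by Total Sequences from Basic Statistics, times 100. The sketch below recomputes it under that assumption. The names `TOTAL_SEQUENCES`, `overrepresented`, and `percentage` are illustrative and not part of FastQC.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 13954515

overrepresented = {
    "GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGATCTCGTATGC": 435385,
    "GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGATGTCGTATGC": 58106,
}


def percentage(count, total=TOTAL_SEQUENCES):
    """Share of all reads, in percent (assumed definition of the column)."""
    return count / total * 100


for seq, count in overrepresented.items():
    # Print a short prefix of the sequence to keep the output readable.
    print(f"{seq[:20]}...\t{count}\t{percentage(count):.4f}%")
```

Together the two entries account for roughly 3.5% of the reads in this file.
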
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTATGC | 55315 | 0.0 | 66.88434 | 44 |
TATGCCG | 55245 | 0.0 | 66.86041 | 46 |
ATGCCGT | 55070 | 0.0 | 66.740944 | 47 |
GCCGTCT | 54710 | 0.0 | 66.6768 | 49 |
GTATGCC | 55700 | 0.0 | 66.64173 | 45 |
TGCCGTC | 54950 | 0.0 | 66.60572 | 48 |
ATCTCGT | 49280 | 0.0 | 66.12421 | 40 |
CCGTCTT | 55620 | 0.0 | 65.96297 | 50 |
TGCTTGA | 56355 | 0.0 | 64.878334 | 58 |
GTCACAC | 58425 | 0.0 | 64.72004 | 29 |
GCTTGAA | 57210 | 0.0 | 64.355484 | 59 |
TCGTATG | 57245 | 0.0 | 63.768585 | 43 |
ACGTCTG | 60725 | 0.0 | 63.707897 | 15 |
CGTCTGA | 60685 | 0.0 | 63.692223 | 16 |
CTGAACT | 60795 | 0.0 | 63.608955 | 19 |
ACACGTC | 61275 | 0.0 | 63.240677 | 13 |
GCACACG | 61275 | 0.0 | 63.183792 | 11 |
CACACGT | 61345 | 0.0 | 63.150944 | 12 |
TCTCGTA | 51805 | 0.0 | 63.03281 | 41 |
CTTGAAA | 58865 | 0.0 | 62.908752 | 60 |
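
Many of the k-mers above look like fragments of the first overrepresented sequence. The sketch below checks this, assuming that "Max Obs/Exp Position" is the 1-based start of the k-mer within the read and that the 50 bp overrepresented sequence begins at read position 1. K-mers that would run past position 50 are skipped rather than guessed at. `REFERENCE`, `KMERS`, and `check_kmers` are illustrative names.

```python
# First entry of the Overrepresented sequences table (50 bp).
REFERENCE = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGATCTCGTATGC"

# (k-mer, Max Obs/Exp Position) pairs copied from the Kmer Content table.
KMERS = [
    ("CGTATGC", 44), ("TATGCCG", 46), ("ATGCCGT", 47), ("GCCGTCT", 49),
    ("GTATGCC", 45), ("TGCCGTC", 48), ("ATCTCGT", 40), ("CCGTCTT", 50),
    ("TGCTTGA", 58), ("GTCACAC", 29), ("GCTTGAA", 59), ("TCGTATG", 43),
    ("ACGTCTG", 15), ("CGTCTGA", 16), ("CTGAACT", 19), ("ACACGTC", 13),
    ("GCACACG", 11), ("CACACGT", 12), ("TCTCGTA", 41), ("CTTGAAA", 60),
]


def check_kmers(reference, kmers):
    """Compare each k-mer with the reference slice at its reported position.

    Only k-mers that fit entirely inside the reference are checked.
    Returns a list of (kmer, position, matches) tuples.
    """
    results = []
    for kmer, pos in kmers:
        start = pos - 1  # convert the 1-based position to a 0-based index
        end = start + len(kmer)
        if end > len(reference):
            continue  # extends beyond the 50 bp reference; cannot be checked
        results.append((kmer, pos, reference[start:end] == kmer))
    return results


results = check_kmers(REFERENCE, KMERS)
for kmer, pos, ok in sorted(results, key=lambda r: r[1]):
    print(f"{pos:>2}  {kmer}  {'match' if ok else 'mismatch'}")
```

Under these assumptions, every k-mer that fits inside the 50 bp window matches the adapter-derived sequence at its reported position. The remaining k-mers start at position 45 or later and extend beyond the sequence listed in the table.
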