Basic Statistics
Measure | Value |
---|---|
Filename | HGGHHBGX2_n01_4w-19.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 14700124 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTCAACAATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 92366 | 0.6283348358149904 | TruSeq Adapter, Index 13 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 12575 | 0.0 | 59.728657 | 48 |
CGTATGC | 12725 | 0.0 | 59.45394 | 46 |
GTATGCC | 13210 | 0.0 | 57.145008 | 47 |
ATGCCGT | 13335 | 0.0 | 56.40332 | 49 |
TCGTATG | 13700 | 0.0 | 54.868324 | 45 |
GCACACG | 17270 | 0.0 | 54.629765 | 11 |
ACGTCTG | 17470 | 0.0 | 53.963764 | 15 |
CACACGT | 17650 | 0.0 | 53.551323 | 12 |
CTCGTAT | 13780 | 0.0 | 52.797203 | 44 |
CACGTCT | 18390 | 0.0 | 51.39523 | 14 |
ACACGTC | 18540 | 0.0 | 51.017666 | 13 |
GCCGTCT | 14905 | 0.0 | 50.55766 | 51 |
AGCACAC | 18980 | 0.0 | 49.889824 | 10 |
TGCCGTC | 15645 | 0.0 | 48.07645 | 50 |
TCTCGTA | 15135 | 0.0 | 48.047764 | 43 |
ACTCCAG | 19800 | 0.0 | 47.597515 | 23 |
TCCAGTC | 19745 | 0.0 | 47.502335 | 25 |
CAGTCAC | 17075 | 0.0 | 47.092026 | 27 |
CTGAACT | 20270 | 0.0 | 46.477802 | 19 |
GTCACAG | 16965 | 0.0 | 45.97465 | 29 |