Basic Statistics
Measure | Value |
---|---|
Filename | HFHGMBGXC_n01_RA1_1sorted.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 8275258 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 52 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGGCCTTATCTCGTAT | 743636 | 8.986257588585152 | TruSeq Adapter, Index 20 (97% over 44bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGGCCTTATCTCGTA | 700844 | 8.469149844029031 | TruSeq Adapter, Index 20 (97% over 44bp) |
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGGCCTTATCTCGTATG | 15442 | 0.18660445390343117 | TruSeq Adapter, Index 20 (97% over 43bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AAAAAAG | 89100 | 0.0 | 61.881393 | 70 |
AGATCGG | 88140 | 0.0 | 60.64777 | 1 |
TTACCTA | 1770 | 0.0 | 48.843563 | 41 |
AAAAAGG | 8405 | 0.0 | 41.639854 | 70 |
ACTCTAG | 2855 | 0.0 | 37.640495 | 5 |
TACCTAC | 2300 | 0.0 | 37.13222 | 42 |
TACTCTA | 2970 | 0.0 | 36.300453 | 4 |
ATTACCT | 1920 | 0.0 | 36.094395 | 40 |
CGTATGC | 157735 | 0.0 | 34.78713 | 46 |
TCGTATG | 157800 | 0.0 | 34.782093 | 45 |
TATGCCG | 157500 | 0.0 | 34.78062 | 48 |
CTCGTAT | 156705 | 0.0 | 34.770508 | 44 |
GTATGCC | 158090 | 0.0 | 34.6603 | 47 |
TATCTCG | 156630 | 0.0 | 34.614666 | 41 |
TTGAAAA | 159125 | 0.0 | 34.551342 | 63 |
GCTTGAA | 158700 | 0.0 | 34.527615 | 61 |
TGAAAAA | 159705 | 0.0 | 34.52208 | 64 |
CCGTCTT | 158770 | 0.0 | 34.4703 | 52 |
CCTACAA | 2540 | 0.0 | 34.450474 | 44 |
CTTGAAA | 159280 | 0.0 | 34.438618 | 62 |