Basic Statistics
Measure | Value |
---|---|
Filename | HFHGMBGXC_n01_KN1_1sorted.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 6208869 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 53 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTCAACAATCTCGTAT | 458736 | 7.388398756681773 | TruSeq Adapter, Index 13 (97% over 40bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTCAACAATCTCGTA | 451505 | 7.271936322058011 | TruSeq Adapter, Index 13 (97% over 40bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTCAACAATATCGTA | 6585 | 0.1060579632135901 | TruSeq Adapter, Index 13 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AAAAAAG | 56090 | 0.0 | 61.780033 | 70 |
AGATCGG | 59445 | 0.0 | 61.337185 | 1 |
AGTCTAA | 1655 | 0.0 | 46.735332 | 29 |
CCTACAA | 3420 | 0.0 | 44.212715 | 44 |
AACCTAC | 1690 | 0.0 | 42.45738 | 42 |
GTCTAAG | 1010 | 0.0 | 41.92939 | 30 |
TACTCTA | 2210 | 0.0 | 41.657 | 4 |
TATAAAA | 3555 | 0.0 | 40.26565 | 64 |
ACTCTAG | 2280 | 0.0 | 40.22453 | 5 |
ACCTACA | 2630 | 0.0 | 40.19178 | 43 |
CTACAAT | 3740 | 0.0 | 40.149044 | 45 |
GCTATAA | 1185 | 0.0 | 39.281166 | 62 |
TACAATG | 2735 | 0.0 | 38.90472 | 46 |
CAGTCTA | 2820 | 0.0 | 38.845997 | 28 |
AGATATA | 2685 | 0.0 | 38.322834 | 61 |
TCTAGTT | 2365 | 0.0 | 38.183704 | 7 |
ACAATGC | 2865 | 0.0 | 37.383446 | 47 |
GTTACCT | 2450 | 0.0 | 37.00094 | 14 |
CCCTCTT | 5710 | 0.0 | 36.715145 | 53 |
TTACCTC | 2485 | 0.0 | 36.479797 | 15 |