Basic Statistics
Measure | Value |
---|---|
Filename | HFHN2BGXB_n01_ALm201.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 17899310 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 49 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTGAAGCTATCTCGTAT | 42779 | 0.23899803958923557 | TruSeq Adapter, Index 19 (97% over 38bp) |
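
The Percentage column appears to be the sequence count as a share of Total Sequences from Basic Statistics. A minimal sketch of that arithmetic, assuming this interpretation holds (the function name `percent_of_total` is illustrative and not part of FastQC):

```python
# Values copied from the report tables above.
TOTAL_SEQUENCES = 17_899_310   # "Total Sequences" in Basic Statistics
ADAPTER_COUNT = 42_779         # Count for the TruSeq Adapter, Index 19 hit


def percent_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    if total <= 0:
        raise ValueError("total must be positive")
    return count / total * 100


if __name__ == "__main__":
    pct = percent_of_total(ADAPTER_COUNT, TOTAL_SEQUENCES)
    # Print with a few decimals for comparison with the Percentage column.
    print(f"{pct:.6f}%")
```
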
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TCGTATG | 6470 | 0.0 | 35.061005 | 44-45 |
CGTATGC | 6730 | 0.0 | 34.019917 | 46-47 |
TATGCCG | 6730 | 0.0 | 33.843987 | 48-49 |
CTCGTAT | 7050 | 0.0 | 31.940704 | 44-45 |
TATCTCG | 7615 | 0.0 | 29.388205 | 40-41 |
TCTCGTA | 8210 | 0.0 | 27.284174 | 42-43 |
ATCTCGT | 9085 | 0.0 | 24.891684 | 42-43 |
ATGCCGT | 10025 | 0.0 | 22.933426 | 48-49 |
GCTATCT | 9915 | 0.0 | 22.643105 | 38-39 |
GTATGCC | 10510 | 0.0 | 21.89739 | 46-47 |
GCCGTCT | 10855 | 0.0 | 21.245014 | 50-51 |
CCGTCTT | 11195 | 0.0 | 20.960619 | 52-53 |
AGCTATC | 11095 | 0.0 | 20.320566 | 38-39 |
CTATCTC | 11925 | 0.0 | 19.045464 | 40-41 |
TGCCGTC | 12760 | 0.0 | 18.38967 | 50-51 |
CGTCTTC | 13625 | 0.0 | 17.30948 | 52-53 |
AAGCTAT | 13845 | 0.0 | 16.799147 | 36-37 |
GAAGCTA | 14640 | 0.0 | 16.178997 | 36-37 |
AGGGGGG | 13645 | 0.0 | 15.019992 | 74-75 |
GAGCACA | 33090 | 0.0 | 14.365626 | 9 |
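
Several of the enriched k-mers look like fragments of the overrepresented TruSeq adapter sequence reported above. The sketch below is one hedged way to check that by plain substring search. It does not reproduce FastQC's Obs/Exp calculation; the helper name `locate_kmers` is illustrative, and reading the reported positions as 1-based is an assumption.

```python
from typing import Dict, List, Optional

# Overrepresented sequence copied from the report (TruSeq Adapter, Index 19 hit).
OVERREPRESENTED = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTGAAGCTATCTCGTAT"

# K-mers copied from the Kmer Content table, in table order.
KMERS: List[str] = [
    "TCGTATG", "CGTATGC", "TATGCCG", "CTCGTAT", "TATCTCG",
    "TCTCGTA", "ATCTCGT", "ATGCCGT", "GCTATCT", "GTATGCC",
    "GCCGTCT", "CCGTCTT", "AGCTATC", "CTATCTC", "TGCCGTC",
    "CGTCTTC", "AAGCTAT", "GAAGCTA", "AGGGGGG", "GAGCACA",
]


def locate_kmers(sequence: str, kmers: List[str]) -> Dict[str, Optional[int]]:
    """Map each k-mer to the 1-based start of its first occurrence in
    sequence, or None if the k-mer does not occur in it."""
    positions: Dict[str, Optional[int]] = {}
    for kmer in kmers:
        index = sequence.find(kmer)  # 0-based index, -1 if absent
        positions[kmer] = index + 1 if index >= 0 else None
    return positions


if __name__ == "__main__":
    for kmer, pos in locate_kmers(OVERREPRESENTED, KMERS).items():
        where = f"starts at base {pos}" if pos is not None else "not found"
        print(f"{kmer}: {where}")
```

K-mers that are not found may still come from the adapter, since the overrepresented sequence listed in the report covers only the first 50 bases of the read. The search here only sees that 50-base window.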