Basic Statistics
Measure | Value |
---|---|
Filename | HGHC5BGX9_n01_AHm05.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 29457885 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 49 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTGAAGCTATCTCGTAT | 35054 | 0.1189970019911477 | TruSeq Adapter, Index 19 (97% over 38bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTATGC | 7645 | 0.0 | 29.232758 | 46-47 |
TATGCCG | 7650 | 0.0 | 29.213531 | 48-49 |
TCGTATG | 7230 | 0.0 | 27.855679 | 44-45 |
CTCGTAT | 7985 | 0.0 | 26.917198 | 44-45 |
TATCTCG | 8635 | 0.0 | 22.773092 | 40-41 |
TCTCGTA | 9430 | 0.0 | 20.802896 | 42-43 |
ATCTCGT | 10900 | 0.0 | 19.609724 | 42-43 |
CCGTCTT | 13950 | 0.0 | 16.429014 | 52-53 |
ATGCCGT | 13095 | 0.0 | 15.778641 | 48-49 |
TACCGTA | 2975 | 0.0 | 15.6466055 | 9 |
GCTATCT | 12810 | 0.0 | 15.406581 | 38-39 |
GTATGCC | 13895 | 0.0 | 14.955714 | 46-47 |
AGCTATC | 14560 | 0.0 | 14.843433 | 38-39 |
GCCGTCT | 14480 | 0.0 | 14.51544 | 50-51 |
TGCCGTC | 16805 | 0.0 | 13.779122 | 50-51 |
CTATCTC | 16320 | 0.0 | 13.66468 | 40-41 |
CTACCGT | 3895 | 0.0 | 13.536194 | 8 |
CACGTCT | 20165 | 0.0 | 11.600829 | 14-15 |
CGTCTTC | 18425 | 0.0 | 11.588057 | 52-53 |
ACGTCTG | 18115 | 0.0 | 11.523955 | 14-15 |