Basic Statistics
Measure | Value |
---|---|
Filename | HLYHHAFXX_n01_ah8584d-062817.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 7886347 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 38 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGATCTCGTATGC | 59826 | 0.7586021766478194 | TruSeq Adapter, Index 5 (100% over 50bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGATGTCGTATGC | 23836 | 0.3022438652521884 | TruSeq Adapter, Index 5 (98% over 50bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GAGCACA | 30615 | 0.0 | 64.63128 | 9 |
AGAGCAC | 31765 | 0.0 | 62.545593 | 8 |
TCGGAAG | 34590 | 0.0 | 57.55435 | 3 |
CGGAAGA | 34865 | 0.0 | 57.263924 | 4 |
GATCGGA | 32990 | 0.0 | 56.189857 | 1 |
GAAGAGC | 36090 | 0.0 | 55.017765 | 6 |
ATCGGAA | 36150 | 0.0 | 54.874226 | 2 |
GGAAGAG | 37155 | 0.0 | 53.427628 | 5 |
AAGAGCA | 37795 | 0.0 | 52.9441 | 7 |
GCCGTCT | 12565 | 0.0 | 41.21487 | 48-49 |
TGCCGTC | 14120 | 0.0 | 40.982883 | 48-49 |
CACACAG | 23075 | 0.0 | 40.95109 | 30-31 |
TATGCCG | 15525 | 0.0 | 40.456375 | 46-47 |
GTCACAC | 23390 | 0.0 | 40.338417 | 28-29 |
GTATGCC | 15940 | 0.0 | 40.191284 | 44-45 |
CACAGTG | 22865 | 0.0 | 40.14392 | 32-33 |
ATGCCGT | 15000 | 0.0 | 40.098637 | 46-47 |
CGTATGC | 15740 | 0.0 | 40.068127 | 44-45 |
GATCTCG | 11640 | 0.0 | 39.521866 | 38-39 |
CAGTCAC | 24600 | 0.0 | 39.12609 | 26-27 |