Basic Statistics
Measure | Value |
---|---|
Filename | H2GKGBGX2_n01_tmd10.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 18713800 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 49 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG | 174977 | 0.9350158706409173 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGAGTGGATATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 127011 | 0.6787023479998717 | TruSeq Adapter, Index 7 (97% over 36bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 15840 | 0.0 | 63.160458 | 48 |
ATGCCGT | 16570 | 0.0 | 60.398705 | 49 |
GTATGCC | 17915 | 0.0 | 56.36455 | 47 |
CGTATGC | 18720 | 0.0 | 53.628036 | 46 |
TGCCGTC | 18850 | 0.0 | 53.093616 | 50 |
AGGGGGG | 9240 | 0.0 | 52.8081 | 1 |
TATCTCG | 18180 | 0.0 | 52.563854 | 41 |
GCCGTCT | 20740 | 0.0 | 48.24007 | 51 |
TCGTATG | 22195 | 0.0 | 45.355774 | 45 |
ATATCTC | 22755 | 0.0 | 42.238213 | 40 |
TGGATAT | 24755 | 0.0 | 39.313477 | 37 |
CTCGTAT | 25465 | 0.0 | 39.15224 | 44 |
CCGTCTT | 26290 | 0.0 | 38.200615 | 52 |
GTGGATA | 25680 | 0.0 | 37.81679 | 36 |
GGATATC | 25820 | 0.0 | 37.464268 | 38 |
GATATCT | 25870 | 0.0 | 37.218502 | 39 |
CTTGAAA | 27745 | 0.0 | 36.32029 | 62 |
TGAAAAA | 28795 | 0.0 | 35.247383 | 64 |
CTGCTTG | 30365 | 0.0 | 33.049667 | 59 |
TCTCGTA | 30435 | 0.0 | 32.656708 | 43 |