Basic Statistics
Measure | Value |
---|---|
Filename | MK207_TGACCA_L007_R1_001.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 2804948 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 49 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTGACCAATCTCGTATGCC | 157219 | 5.605059345128679 | TruSeq Adapter, Index 4 (100% over 51bp) |
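The Percentage column appears to be the sequence count divided by Total Sequences from the Basic Statistics table. The sketch below recomputes it from the two reported values as a sanity check; the function name `percentage_of_total` is hypothetical and not part of FastQC.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
total_sequences = 2804948
adapter_count = 157219


def percentage_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


adapter_percentage = percentage_of_total(adapter_count, total_sequences)
print(f"{adapter_percentage:.4f}%")  # prints 5.6051%
```

If the assumption holds, roughly 1 read in 18 consists entirely of the TruSeq Index 4 adapter.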
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CACACGT | 19920 | 0.0 | 43.990337 | 12 |
GCACACG | 19945 | 0.0 | 43.92392 | 11 |
CACGTCT | 19805 | 0.0 | 43.81411 | 14 |
GATCGGA | 20125 | 0.0 | 43.737473 | 1 |
ACACGTC | 19895 | 0.0 | 43.72898 | 13 |
AGCACAC | 20090 | 0.0 | 43.68451 | 10 |
CGTCTGA | 19860 | 0.0 | 43.534172 | 16 |
AGAGCAC | 20355 | 0.0 | 43.50262 | 8 |
TCGGAAG | 20235 | 0.0 | 43.496098 | 3 |
ACGTCTG | 19945 | 0.0 | 43.461445 | 15 |
GAGCACA | 20440 | 0.0 | 43.244667 | 9 |
ATCGGAA | 20385 | 0.0 | 43.188614 | 2 |
TCTGAAC | 20040 | 0.0 | 43.131153 | 18 |
GTATGCC | 18935 | 0.0 | 43.11367 | 45 |
AGTCACT | 19700 | 0.0 | 43.04803 | 28 |
CTCGTAT | 19000 | 0.0 | 43.0468 | 42 |
GTCTGAA | 20190 | 0.0 | 43.034336 | 17 |
CTGAACT | 20035 | 0.0 | 43.029625 | 19 |
CAGTCAC | 19965 | 0.0 | 42.893635 | 27 |
TCCAGTC | 20035 | 0.0 | 42.856075 | 25 |
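The enriched 7-mers above look like substrings of the overrepresented TruSeq adapter read, each peaking at the read position where it sits in that sequence. The sketch below checks this for a subset of rows, assuming the Max Obs/Exp Position column is a 1-based read position; the helper names `kmer_start_positions` and `explained_by_adapter` are hypothetical.

```python
# Overrepresented sequence copied from the report (TruSeq Adapter, Index 4).
ADAPTER = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACTGACCAATCTCGTATGCC"

# (k-mer, Max Obs/Exp Position) pairs: a subset of the Kmer Content rows.
KMER_ROWS = [
    ("GATCGGA", 1),
    ("CACACGT", 12),
    ("TCCAGTC", 25),
    ("AGTCACT", 28),
    ("CTCGTAT", 42),
    ("GTATGCC", 45),
]


def kmer_start_positions(kmer: str, sequence: str) -> list[int]:
    """Return every 1-based start position of kmer within sequence."""
    positions = []
    start = sequence.find(kmer)
    while start != -1:
        positions.append(start + 1)  # convert 0-based index to 1-based position
        start = sequence.find(kmer, start + 1)
    return positions


def explained_by_adapter(rows, adapter):
    """Map each k-mer to True if it occurs in adapter at its reported position."""
    return {
        kmer: position in kmer_start_positions(kmer, adapter)
        for kmer, position in rows
    }


result = explained_by_adapter(KMER_ROWS, ADAPTER)
for kmer, matches in result.items():
    print(kmer, "matches adapter position" if matches else "not explained by adapter")
```

Every k-mer in this subset matches, which suggests the Kmer Content warning reflects the same adapter contamination rather than a separate bias.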