Basic Statistics
Measure | Value |
---|---|
Filename | HGGLTBGX9_n01_15_INPUT_BZIP3_5_R1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 15932519 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 45 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATGTCAGAATCTCGTAT | 183760 | 1.1533643863848522 | TruSeq Adapter, Index 15 (97% over 40bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATGTCAGAATCTCGTTT | 33285 | 0.20891235089692972 | TruSeq Adapter, Index 15 (97% over 40bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATGTCAGAATCTCGGAT | 21860 | 0.13720366503250364 | TruSeq Adapter, Index 15 (97% over 40bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACATGTCAGAATCTCGTA | 20607 | 0.12933924635520597 | TruSeq Adapter, Index 15 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GTGCGTG | 4005 | 0.0 | 57.493664 | 59 |
CGTATGC | 18150 | 0.0 | 56.022556 | 46 |
ATCTCGT | 29765 | 0.0 | 55.987507 | 42 |
TCTCGTA | 24480 | 0.0 | 55.821476 | 43 |
GTATGCC | 18555 | 0.0 | 55.3648 | 47 |
TATGCCG | 16820 | 0.0 | 55.310303 | 48 |
GTCACAT | 47075 | 0.0 | 55.17779 | 29 |
GCGTGAA | 14775 | 0.0 | 55.06864 | 61 |
ATGTCAG | 46560 | 0.0 | 55.05422 | 34 |
ACATGTC | 46970 | 0.0 | 54.830338 | 32 |
ATGCCGT | 17645 | 0.0 | 54.50624 | 49 |
CATGTCA | 47235 | 0.0 | 54.4486 | 33 |
TGCCGTC | 18535 | 0.0 | 53.550488 | 50 |
CTGCGTG | 10660 | 0.0 | 53.14792 | 59 |
CTGAACT | 50165 | 0.0 | 53.069626 | 19 |
GCCGTCT | 17355 | 0.0 | 52.915573 | 51 |
CGTCTGA | 50510 | 0.0 | 52.620197 | 16 |
ACGTCTG | 50850 | 0.0 | 52.281963 | 15 |
TCTGCGT | 11175 | 0.0 | 52.076443 | 58 |
CTCCAGT | 50965 | 0.0 | 51.90956 | 24 |