Basic Statistics
Measure | Value |
---|---|
Filename | HGGLTBGX9_n01_6_BZIP3_BIO_10_R1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 17174150 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 43 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGCCAATATCTCGTATGC | 632807 | 3.6846481485255453 | TruSeq Adapter, Index 6 (100% over 50bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACGCCAATATCTCGTATG | 133382 | 0.7766439678237351 | TruSeq Adapter, Index 6 (100% over 49bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AAAAGGG | 72570 | 0.0 | 57.818558 | 70 |
TATGCCG | 86360 | 0.0 | 55.2126 | 46 |
GTATGCC | 86980 | 0.0 | 54.883583 | 45 |
CGTATGC | 86830 | 0.0 | 54.875515 | 44 |
ATGCCGT | 86985 | 0.0 | 54.73896 | 47 |
GCCGTCT | 86885 | 0.0 | 54.67035 | 49 |
TGCCGTC | 87070 | 0.0 | 54.61671 | 48 |
TCTCGTA | 88205 | 0.0 | 53.94165 | 41 |
CCGTCTT | 88520 | 0.0 | 53.739014 | 50 |
CTCGTAT | 88800 | 0.0 | 53.61854 | 42 |
TCGTATG | 89140 | 0.0 | 53.533073 | 43 |
TCACGCC | 87590 | 0.0 | 53.455963 | 30 |
CGCCAAT | 87265 | 0.0 | 53.12085 | 33 |
ATCTCGT | 89640 | 0.0 | 53.09935 | 40 |
AGTCACG | 89790 | 0.0 | 53.085423 | 28 |
ACGTCTG | 91025 | 0.0 | 53.01278 | 15 |
CGTCTGA | 91240 | 0.0 | 52.853043 | 16 |
CACGCCA | 88525 | 0.0 | 52.79173 | 31 |
ACACGTC | 91795 | 0.0 | 52.755054 | 13 |
GCCAATA | 88040 | 0.0 | 52.643288 | 34 |