Basic Statistics
Measure | Value |
---|---|
Filename | HFHGMBGXC_n01_NLS_1Nsorted.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 6015292 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 54 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGATCTCGTATG | 144315 | 2.3991354035681063 | TruSeq Adapter, Index 5 (100% over 49bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGATCTCGTATGC | 109872 | 1.8265447462899556 | TruSeq Adapter, Index 5 (100% over 50bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AAAAGGG | 13205 | 0.0 | 61.304096 | 70 |
AGATCGG | 22505 | 0.0 | 55.4971 | 1 |
CTCGTAT | 27730 | 0.0 | 38.762115 | 43 |
CGTCTGA | 32290 | 0.0 | 38.653034 | 17 |
ACGTCTG | 32420 | 0.0 | 38.54123 | 16 |
TCGTATG | 28630 | 0.0 | 38.50908 | 44 |
TATGCCG | 28710 | 0.0 | 38.45054 | 47 |
CGTATGC | 28875 | 0.0 | 38.32779 | 45 |
CACGTCT | 32745 | 0.0 | 38.148643 | 15 |
GTCTGAA | 32730 | 0.0 | 38.123035 | 18 |
GTATGCC | 29070 | 0.0 | 38.11885 | 46 |
AGTCACA | 30010 | 0.0 | 37.63411 | 29 |
GTCACAC | 29945 | 0.0 | 37.633987 | 30 |
CAGTCAC | 30135 | 0.0 | 37.50123 | 28 |
CACACGT | 33495 | 0.0 | 37.37835 | 13 |
CACAGTG | 30145 | 0.0 | 37.19823 | 34 |
AGTGATC | 29000 | 0.0 | 37.088123 | 37 |
CAGTGAT | 29810 | 0.0 | 37.031082 | 36 |
ACACGTC | 33835 | 0.0 | 37.023125 | 14 |
ACACAGT | 30505 | 0.0 | 36.80513 | 33 |