Basic Statistics
Measure | Value |
---|---|
Filename | HNLNGBGXF_n01_DG4474_ADI2_mRNA.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 33493426 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 49 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTAT | 86190 | 0.2573340810223475 | TruSeq Adapter, Index 2 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 13405 | 0.0 | 36.656338 | 48-49 |
TCGTATG | 14415 | 0.0 | 34.351562 | 44-45 |
CTCGTAT | 13680 | 0.0 | 34.235435 | 44-45 |
TATCTCG | 14205 | 0.0 | 33.48848 | 40-41 |
CGTATGC | 15155 | 0.0 | 32.987637 | 46-47 |
TCTCGTA | 15565 | 0.0 | 29.64684 | 42-43 |
ACCGCTC | 19405 | 0.0 | 28.455338 | 32-33 |
ATGCCGT | 17685 | 0.0 | 27.288145 | 48-49 |
ATCTCGT | 17620 | 0.0 | 26.98441 | 42-43 |
CGCTCAT | 20575 | 0.0 | 26.802591 | 34-35 |
TCACCGC | 21615 | 0.0 | 25.743805 | 30-31 |
CCGCTCA | 22890 | 0.0 | 24.413528 | 32-33 |
CACCGCT | 23080 | 0.0 | 24.294945 | 30-31 |
GTCACCG | 23355 | 0.0 | 23.917427 | 28-29 |
TGCCGTC | 19565 | 0.0 | 23.731342 | 50-51 |
GTATGCC | 21040 | 0.0 | 23.693092 | 46-47 |
GCCGTCT | 18500 | 0.0 | 23.030645 | 50-51 |
CCGTCTT | 18885 | 0.0 | 22.699446 | 52-53 |
CGTCTGA | 25455 | 0.0 | 22.401546 | 16-17 |
ATCGGAA | 53120 | 0.0 | 21.400002 | 2 |