Basic Statistics
Measure | Value |
---|---|
Filename | HHV3KAFX2_n02_1966-lungLR-rep1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 891062 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 43 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTAGCATGTTGTTTAGTCTCTGTGTTTATGAATTTCCAGTTTTCTTCTTG | 1066 | 0.11963252837625216 | No Hit |
GTTCTATACCACTGTGGTTGGAAAAGATGGTTGATTTGATTTCAGTCCTC | 988 | 0.11087892873896542 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GTGATCG | 110 | 2.4465634E-4 | 32.930115 | 9 |
TATACTG | 440 | 0.0 | 29.635439 | 5 |
TCAGGCG | 160 | 6.861867E-5 | 27.167345 | 9 |
TACAGCC | 590 | 0.0 | 27.013857 | 7 |
ATACAGC | 645 | 0.0 | 26.955233 | 6 |
CGAAAGC | 380 | 0.0 | 26.692225 | 3 |
TGCGGTC | 110 | 0.0097060455 | 26.344093 | 8 |
GCGAAAG | 415 | 0.0 | 26.188334 | 2 |
AGCGAAA | 385 | 1.6370905E-11 | 24.47199 | 1 |
GTAGCAT | 1410 | 0.0 | 24.158247 | 1 |
TATAATA | 990 | 0.0 | 23.419601 | 2 |
GCTATAT | 220 | 2.2344573E-5 | 23.060148 | 1 |
CTACATC | 365 | 5.7279976E-9 | 21.834343 | 3 |
GGTTATA | 400 | 6.9303496E-10 | 21.742424 | 1 |
GTATAAT | 370 | 6.673872E-9 | 21.546545 | 1 |
GTATTAG | 210 | 4.323772E-4 | 20.70707 | 1 |
CATTCTA | 1090 | 0.0 | 20.606293 | 2 |
GTCCTAT | 1340 | 0.0 | 20.55254 | 1 |
TCCTATG | 1435 | 0.0 | 20.196344 | 2 |
TCGCTGA | 295 | 9.621712E-6 | 19.64644 | 4 |