Basic Statistics
Measure | Value |
---|---|
Filename | HHV3KAFX2_n01_1968-lungLL-rep1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 821668 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
TCCATTACCTCTACCTGCAGAACAACTTCATAACGGAGCTCCCCGTGGAG | 842 | 0.10247447874323935 | No Hit |
GTAGCATGTTGTTTAGTCTCTGTGTTTATGAATTTCCAGTTTTCTTCTTG | 840 | 0.10223107143031006 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGTGCGA | 60 | 6.8045174E-6 | 60.416668 | 8 |
GTTACGT | 110 | 9.9564204E-8 | 46.13636 | 1 |
AGTCGAA | 80 | 3.7340687E-5 | 45.3125 | 2 |
CGACCCG | 70 | 0.001056531 | 41.42857 | 145 |
AAGTCGA | 90 | 7.481275E-5 | 40.27778 | 1 |
CCGCGTC | 75 | 0.0014832319 | 38.666668 | 145 |
TATACTG | 265 | 0.0 | 38.301888 | 5 |
CGGAGCG | 135 | 4.969952E-7 | 37.592594 | 145 |
GCTACTA | 180 | 3.1977834E-9 | 36.25 | 9 |
GTTGACA | 650 | 0.0 | 35.692307 | 1 |
GTGCGAT | 125 | 1.2648135E-5 | 34.8 | 9 |
GTCGAAG | 105 | 1.8534291E-4 | 34.523808 | 3 |
ACATGGT | 715 | 0.0 | 33.46154 | 5 |
AGCGAAA | 305 | 0.0 | 33.27869 | 1 |
CCGACTA | 240 | 4.3655746E-11 | 33.229168 | 4 |
TATACTA | 110 | 2.4357735E-4 | 32.954544 | 5 |
GGTTCGA | 120 | 4.0574223E-4 | 30.208334 | 8 |
GTATATC | 170 | 3.010853E-6 | 29.852942 | 1 |
ATAATGC | 100 | 0.0060738344 | 29.0 | 3 |
TTACGTG | 180 | 4.6986243E-6 | 28.194445 | 2 |