Basic Statistics
Measure | Value |
---|---|
Filename | HHV3KAFX2_n01_1972-lungLR-rep1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 844607 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GTAGCATGTTGTTTAGTCTCTGTGTTTATGAATTTCCAGTTTTCTTCTTG | 1135 | 0.1343820261967992 | No Hit |
GTTATGTGTTGGCCTTTGGCTCAAGTCATGAATTTAGGGTCCTGGAATTG | 1006 | 0.11910865053214098 | No Hit |
TTACCCACCCAGCATGGAGCAACTAAATATGTAGAGCATATATGAACAAA | 937 | 0.11093917052546333 | No Hit |
GTATTATTTTTTCATTTACCTCAAAGTATTTCTGTGTTCCTATTTTGGTT | 878 | 0.10395367312844908 | No Hit |
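
The Percentage column above appears to be each sequence's Count as a share of Total Sequences (844607, from Basic Statistics). The short sketch below reproduces that arithmetic as a sanity check. The names `TOTAL_SEQUENCES`, `OVERREPRESENTED`, and `percentage_of_total` are illustrative and not part of the report. The formula is an assumption inferred from the table values, not taken from the tool's source.

```python
import math

# Total Sequences, copied from the Basic Statistics table.
TOTAL_SEQUENCES = 844607

# (Count, reported Percentage) pairs copied from the Overrepresented sequences table.
OVERREPRESENTED = [
    (1135, 0.1343820261967992),
    (1006, 0.11910865053214098),
    (937, 0.11093917052546333),
    (878, 0.10395367312844908),
]


def percentage_of_total(count: int, total: int) -> float:
    """Return count as a percentage of total (assumed formula: 100 * count / total)."""
    return 100.0 * count / total


# Compare the recomputed value with the reported one, allowing for rounding noise.
results = []
for count, reported in OVERREPRESENTED:
    computed = percentage_of_total(count, TOTAL_SEQUENCES)
    matches = math.isclose(computed, reported, abs_tol=1e-9)
    results.append(matches)
    print(f"count={count} computed={computed:.6f} reported={reported:.6f} match={matches}")
```

Each of the four rows agrees with the reported percentage to within floating-point tolerance, so the column can be regenerated from Count and Total Sequences if the report is re-parsed.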
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GTTACGT | 110 | 0.0 | 85.68182 | 1 |
TTACGTG | 150 | 0.0 | 62.833332 | 2 |
TACGTGT | 235 | 0.0 | 40.106384 | 3 |
CGTGTTG | 245 | 0.0 | 38.469387 | 5 |
GTTATGT | 525 | 0.0 | 34.52381 | 1 |
ATATAGC | 90 | 0.003627915 | 32.22222 | 3 |
GTTGACA | 760 | 0.0 | 31.48026 | 1 |
GCTACTA | 165 | 2.3858483E-6 | 30.757576 | 9 |
ACGTGTT | 315 | 1.8189894E-12 | 29.920635 | 4 |
GCTATAT | 125 | 5.1531714E-4 | 29.0 | 1 |
GTTACGG | 20 | 0.006076645 | 29.0 | 110-114 |
GATATAC | 175 | 3.7737009E-6 | 29.0 | 3 |
TGCGTTA | 125 | 5.1531714E-4 | 29.0 | 8 |
TATACTG | 225 | 2.8159775E-8 | 28.999998 | 5 |
ACATGGT | 830 | 0.0 | 27.951807 | 5 |
GCGTTAA | 135 | 8.081667E-4 | 26.851852 | 9 |
CATTCTA | 765 | 0.0 | 26.535948 | 2 |
AGTCCGC | 220 | 7.48365E-7 | 26.363638 | 145 |
GGAACCG | 110 | 0.00967075 | 26.363638 | 5 |
GTATTAT | 915 | 0.0 | 26.147541 | 1 |