Basic Statistics
Measure | Value |
---|---|
Filename | HCCY2BGXF_n01_bZIP44_Col-B_p1.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 11829783 |
Sequences flagged as poor quality | 0 |
Sequence length | 76 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGAAGAGCACACGTCTGAACTCCAGTCACATTACTCGATCTCGTATGCC | 25273 | 0.21363874552897547 | TruSeq Adapter, Index 27 (97% over 36bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATTACTCGATCTCGTAT | 24383 | 0.20611536154128948 | TruSeq Adapter, Index 27 (97% over 39bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 9265 | 0.0 | 28.105387 | 45 |
ATGCCGT | 9520 | 0.0 | 27.279034 | 46 |
GTATGCC | 9770 | 0.0 | 26.831768 | 44 |
GCCGTCT | 9215 | 0.0 | 26.624699 | 48 |
TGCCGTC | 9630 | 0.0 | 26.49496 | 47 |
CGTATGC | 9840 | 0.0 | 26.42748 | 43 |
TCGTATG | 10525 | 0.0 | 24.175436 | 42 |
CTCGTAT | 9580 | 0.0 | 24.0028 | 41 |
CCGTCTT | 10940 | 0.0 | 22.906448 | 49 |
CTGCTTG | 11160 | 0.0 | 19.914598 | 56 |
TCTCGTA | 10595 | 0.0 | 19.85343 | 40 |
GATCTCG | 10165 | 0.0 | 19.832485 | 38 |
CGATCTC | 10835 | 0.0 | 18.929132 | 37 |
CGTCTTC | 13495 | 0.0 | 18.15462 | 50 |
GCTTGAA | 12835 | 0.0 | 18.05195 | 58 |
TTACTCG | 15300 | 0.0 | 17.820015 | 32 |
CTTCTGC | 12780 | 0.0 | 17.77362 | 53 |
ATCTCGT | 11660 | 0.0 | 17.529774 | 39 |
TGCTTGA | 13310 | 0.0 | 17.092175 | 57 |
AAGGGGG | 2350 | 0.0 | 16.233818 | 70 |