ANGSD: Analysis of next generation Sequencing Data
Latest tar.gz version is (0.938/0.939 on github), see Change_log for changes, and download it here.
Mismatch: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
No edit summary |
||
(2 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
Angsd can output mismatch from the reference given: | Angsd can output mismatch count matrix from the reference given: | ||
;- Distance from be beginning of the read (posi) | ;- Distance from be beginning of the read (posi) | ||
Line 9: | Line 9: | ||
<pre> | <pre> | ||
./angsd -i my.cram -ref hg19.fa -doMisMatch | ./angsd -i my.cram -ref hg19.fa -doMisMatch 1 | ||
</pre> | </pre> | ||
Line 36: | Line 36: | ||
0 24 15 1 2 0 2 79 2 | 0 24 15 1 2 0 2 79 2 | ||
</pre> | </pre> | ||
column 1 and colum two are the distance from each end of the read | |||
qs is the quality score in numeric phred scale | |||
Strand is which sequencing strand zero and one | |||
Ref is the reference A,C,G,T = 0,1,2,3 | |||
The last 4 column is the count of bases acroos all reads where the position from the beginning is posi, distance from the end is isop, qualityscore is qs, strand is 0 or one. |
Latest revision as of 22:28, 2 March 2022
Angsd can output mismatch count matrix from the reference given:
- - Distance from be beginning of the read (posi)
- - Distance from the end of the read (isop)
- - Strand
- - Qscore
Run like:
./angsd -i my.cram -ref hg19.fa -doMisMatch 1
Output looks like:
posi isop qs strand Ref A C G T 0 24 13 0 1 0 2 0 0 0 24 13 0 3 0 0 0 1 0 24 13 1 0 1 0 0 0 0 24 13 1 3 0 0 0 1 0 24 14 0 0 152 1 4 0 0 24 14 0 1 0 128 0 0 0 24 14 0 2 11 2 196 1 0 24 14 0 3 1 0 2 12 0 24 14 1 0 20 1 2 2 0 24 14 1 1 1 200 1 16 0 24 14 1 2 0 5 118 6 0 24 14 1 3 0 6 3 164 0 24 15 0 0 186 1 2 0 0 24 15 0 1 4 91 0 0 0 24 15 0 2 8 0 90 0 0 24 15 0 3 0 1 2 7 0 24 15 1 0 14 1 0 0 0 24 15 1 1 0 93 1 2 0 24 15 1 2 0 2 79 2
column 1 and colum two are the distance from each end of the read
qs is the quality score in numeric phred scale
Strand is which sequencing strand zero and one
Ref is the reference A,C,G,T = 0,1,2,3
The last 4 column is the count of bases acroos all reads where the position from the beginning is posi, distance from the end is isop, qualityscore is qs, strand is 0 or one.