ANGSD: Analysis of next generation Sequencing Data

Latest tar.gz version is (0.938/0.939 on github), see Change_log for changes, and download it here.

Major Minor: Difference between revisions

Revision as of 15:04, 11 October 2012

Inferring Major and Minor alleles

Arguments

-doMajorMinor [int]=0

The inference method is chosen based on the data input.

From alignment data

-doMajorMinor 2
-doCount 1

If you input sequencing data, such as the bam format, you can infer the major and minor alleles by picking the two most frequently observed bases across individuals. This is the approach from here: citation. To use this approach, choose -doMajorMinor 2 together with -doCount 1.
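The count-based rule can be sketched in a few lines of Python. This is only an illustration of "pick the two most frequently observed bases across individuals", not ANGSD's own code: the function name `infer_major_minor_counts`, the per-individual dictionary input, and the alphabetical tie-break are all assumptions made for the example.

```python
from collections import Counter

BASES = "ACGT"

def infer_major_minor_counts(per_individual_counts):
    """Return (major, minor) as the two most frequently observed bases.

    per_individual_counts: one dict per individual mapping a base to the
    number of reads showing that base at the site (hypothetical input layout).
    """
    total = Counter()
    for counts in per_individual_counts:
        total.update(counts)  # pool the base counts across individuals
    # Rank bases by pooled count, highest first; ties are broken
    # alphabetically, which is an arbitrary choice made for this sketch.
    ranked = sorted(BASES, key=lambda base: (-total[base], base))
    return ranked[0], ranked[1]

site = [{"A": 5, "C": 1}, {"A": 3, "G": 4}, {"G": 2}]
print(infer_major_minor_counts(site))  # pooled counts: A=8, G=6, C=1
```

Because the counts are pooled before ranking, an individual with very high depth contributes more to the choice than a low-depth individual.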

From genotype likelihood data

-doMajorMinor 1

From either sequencing data, such as bam files, or genotype likelihood data, such as glfv3, the major and minor alleles can be inferred directly from the likelihoods. We use a maximum likelihood approach to choose the major and minor alleles. Details can be found here: Skotte2012.
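One way such a maximum likelihood choice could look is sketched below in Python. It is a hedged illustration under stated assumptions, not the method as published or as implemented in ANGSD. It assumes that each candidate allele pair is scored by weighting the three genotypes 0.25 / 0.5 / 0.25, and that the allele whose homozygote is better supported is labelled major. The function name `infer_major_minor_ml` and the dictionary input layout are hypothetical; see Skotte2012 for the actual formulation.

```python
import math
from itertools import combinations

BASES = "ACGT"

def infer_major_minor_ml(gls):
    """Pick (major, minor) from genotype likelihoods.

    gls: one dict per individual mapping each of the 10 genotypes
    ('AA', 'AC', ..., 'TT', letters sorted) to a positive likelihood
    (hypothetical input layout).
    """
    best_pair, best_ll = None, -math.inf
    for a, b in combinations(BASES, 2):  # the 6 unordered allele pairs
        # Assumed scoring: genotype weights 0.25 / 0.5 / 0.25 for aa / ab / bb.
        ll = sum(
            math.log(0.25 * gl[a + a] + 0.5 * gl[a + b] + 0.25 * gl[b + b])
            for gl in gls
        )
        if ll > best_ll:
            best_pair, best_ll = (a, b), ll
    a, b = best_pair
    # Assumed orientation rule: the better supported homozygote is the major.
    ll_a = sum(math.log(gl[a + a]) for gl in gls)
    ll_b = sum(math.log(gl[b + b]) for gl in gls)
    return (a, b) if ll_a >= ll_b else (b, a)

def toy_gl(best_genotype, high=0.9, low=0.01):
    """Build a toy likelihood dict that favours one genotype."""
    genotypes = [x + y for i, x in enumerate(BASES) for y in BASES[i:]]
    return {g: (high if g == best_genotype else low) for g in genotypes}

individuals = [toy_gl("GG"), toy_gl("GG"), toy_gl("GG"), toy_gl("AG")]
print(infer_major_minor_ml(individuals))
```

In this toy input, three individuals look like GG homozygotes and one like an AG heterozygote, so the pair A/G scores highest and G is labelled major.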

From genotype probability data

-doMajorMinor 3

Currently, only genotype probability data in beagle output format is supported. This format already contains the major and minor allele information.