ANGSD: Analysis of next generation Sequencing Data

Latest tar.gz version is (0.938/0.939 on github), see Change_log for changes, and download it here.

PCA: Difference between revisions

From angsd
Jump to navigation Jump to search
No edit summary
No edit summary
Line 12: Line 12:


= single read sampling =
= single read sampling =
Both PCA and MDS can be performed based on sampling of a single read at each site. This can work even with very low depth data e.g. <1X. This method can be [[PCA_MDS found here]]. However, it requires low error rate and polymorphic sites need to be inferred (or provided by user)
Both PCA and MDS can be performed based on sampling of a single read at each site. This can work even with very low depth data e.g. <1X. This method can be found here:[[PCA_MDS]]. However, it requires low error rate and polymorphic sites need to be inferred (or provided by user)

Revision as of 12:23, 27 April 2016

Genotype likelihood approach

Matteo Fumagallis methods for doing PCA/Covariance based on ANGSD output files:

Fumagalli, M, Vieira, FG, Korneliussen, TS, Linderoth, T, Huerta-Sánchez, E, Albrechtsen, A, Nielsen, R (2013). Quantifying population genetic differentiation from next-generation sequencing data. Genetics, 195, 3:979-92.


This works without the need to call SNPs or genotypes based on genotype likelihoods

The main documentation for this is found here: https://github.com/mfumagalli/ngsTools and here https://github.com/mfumagalli/ngsTools#ngscovar

single read sampling

Both PCA and MDS can be performed based on sampling of a single read at each site. This can work even with very low depth data e.g. <1X. This method can be found here:PCA_MDS. However, it requires low error rate and polymorphic sites need to be inferred (or provided by user)