ANGSD: Analysis of next generation Sequencing Data
Latest tar.gz version is (0.938/0.939 on github), see Change_log for changes, and download it here.
Angsd structure: Difference between revisions
Line 16: | Line 16: | ||
Depending on the input format the information in the struct will span different regions. For the Beagle files the region size is defined by '''-chunkSize'''. The region size for BAM input can very quite alot since this is dependent on number of reads in the different BAM files. A thorough description of the BAM reading can be found in [[BAM reading details]] | Depending on the input format the information in the struct will span different regions. For the Beagle files the region size is defined by '''-chunkSize'''. The region size for BAM input can very quite alot since this is dependent on number of reads in the different BAM files. A thorough description of the BAM reading can be found in [[BAM reading details]] | ||
==Analysis== | |||
==History== | ==History== | ||
Angsd started back in 2009 as a simple program for estimating allele frequency, based on this we incrementally added new functionality. A rough timeline can be found in the AUTHORS file in the program bundle | Angsd started back in 2009 as a simple program for estimating allele frequency, based on this we incrementally added new functionality. A rough timeline can be found in the AUTHORS file in the program bundle |
Revision as of 18:54, 4 December 2013
This page contains a short description of the overall structure of angsd.
It can be divided into:
- Filereading
- Analysis
Input formats
We allow for many different input formats, many of these are deprecated and have little use, but we have decided to keep these.
- BAM files
- soap alignment files
- Beagle files
- glfv3 (Binary and text)
The data for the different input formats are encapsulated in a struct 'funkyPars', and this struct is sent to the 'analysis' classes, which will do analysis and populate the struct with results (eg based on the sequencing data we calculate genotype likelihoods and attach this to the struct).
Depending on the input format the information in the struct will span different regions. For the Beagle files the region size is defined by -chunkSize. The region size for BAM input can very quite alot since this is dependent on number of reads in the different BAM files. A thorough description of the BAM reading can be found in BAM reading details
Analysis
History
Angsd started back in 2009 as a simple program for estimating allele frequency, based on this we incrementally added new functionality. A rough timeline can be found in the AUTHORS file in the program bundle