ANGSD: Analysis of next generation Sequencing Data
Latest tar.gz version is (0.938/0.939 on github), see Change_log for changes, and download it here.
Angsd structure: Difference between revisions
No edit summary |
No edit summary |
||
(5 intermediate revisions by the same user not shown) | |||
Line 11: | Line 11: | ||
# Beagle files | # Beagle files | ||
# glfv3 (Binary and text) | # glfv3 (Binary and text) | ||
The data for the different input formats are encapsulated in a struct 'funkyPars', and this struct is sent to the 'analysis' classes, which will do analysis and populate the struct with results (eg based on the sequencing data we calculate genotype likelihoods and attach this to the struct). | |||
Depending on the input format the information in the struct will span different regions. For the Beagle files the region size is defined by '''-chunkSize'''. The region size for BAM input can very quite alot since this is dependent on number of reads in the different BAM files. A thorough description of the BAM reading can be found in [[BAM reading details]] | |||
==Analysis== | |||
We have defined the following '''abstract base class''' called general.h | |||
<pre> | |||
class general{ | |||
public: | |||
static aHead *header;//contains the header of a single bam; | |||
static std::map<char *,int,ltstr> *revMap; | |||
int index; | |||
static int tot_index; | |||
// virtuel general() | |||
virtual void run(funkyPars *f)=0; | |||
virtual void print( funkyPars *f)=0; | |||
// virtual void printArg(const char *fname)=0; <-maybe include | |||
virtual void clean(funkyPars *f)=0; | |||
general(){index=tot_index++;}; | |||
virtual ~general(){}; | |||
}; | |||
</pre> | |||
==History== | ==History== | ||
Angsd started back in 2009 as a simple program for estimating allele frequency, based on this we incrementally added new functionality. A rough timeline can be found in the AUTHORS file in the program bundle | Angsd started back in 2009 as a simple program for estimating allele frequency, based on this we incrementally added new functionality. A rough timeline can be found in the AUTHORS file in the program bundle |
Latest revision as of 19:03, 4 December 2013
This page contains a short description of the overall structure of angsd.
It can be divided into:
- Filereading
- Analysis
We allow for many different input formats, many of these are deprecated and have little use, but we have decided to keep these.
- BAM files
- soap alignment files
- Beagle files
- glfv3 (Binary and text)
The data for the different input formats are encapsulated in a struct 'funkyPars', and this struct is sent to the 'analysis' classes, which will do analysis and populate the struct with results (eg based on the sequencing data we calculate genotype likelihoods and attach this to the struct).
Depending on the input format the information in the struct will span different regions. For the Beagle files the region size is defined by -chunkSize. The region size for BAM input can very quite alot since this is dependent on number of reads in the different BAM files. A thorough description of the BAM reading can be found in BAM reading details
Analysis
We have defined the following abstract base class called general.h
class general{ public: static aHead *header;//contains the header of a single bam; static std::map<char *,int,ltstr> *revMap; int index; static int tot_index; // virtuel general() virtual void run(funkyPars *f)=0; virtual void print( funkyPars *f)=0; // virtual void printArg(const char *fname)=0; <-maybe include virtual void clean(funkyPars *f)=0; general(){index=tot_index++;}; virtual ~general(){}; };
History
Angsd started back in 2009 as a simple program for estimating allele frequency, based on this we incrementally added new functionality. A rough timeline can be found in the AUTHORS file in the program bundle