Relatedness: Difference between revisions

Revision as of 16:28, 12 July 2016

NGSrelate - estimation of IBD probabilities

Estimating kinship coefficients requires population allele frequencies. These can be estimated from the data if you have multiple individuals. For some populations, for example most human populations, publicly available data exist. If you can obtain population allele frequencies or have many samples from your population, we recommend NGSrelate, which works with ANGSD output. From the estimated IBD probabilities you can then infer the relationship. The table below gives the expected IBD sharing probabilities, assuming no inbreeding, where $K_{0}$, $K_{1}$ and $K_{2}$ are the probabilities of sharing 0, 1 and 2 alleles IBD.

Relationship	$K_{0}$	$K_{1}$	$K_{2}$
mono-zygotic twin	$0$	$0$	$1$
Parent-Offspring	$0$	$1$	$0$
Full siblings	$0.25$	$0.5$	$0.25$
Half siblings	$0.5$	$0.5$	$0$
First cousins	$0.75$	$0.25$	$0$
Unrelated	$1$	$0$	$0$
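
As a rough illustration of how estimated IBD probabilities can be matched against the table above, here is a minimal R sketch. The relation theta = K1/4 + K2/2 is the standard link between IBD sharing probabilities and the kinship coefficient; it is not stated on this page. The helper `closestRelationship` and the example estimate are hypothetical, and picking the nearest row by Euclidean distance is only one simple way to do the matching, not what NGSrelate does.

```r
# Expected IBD sharing probabilities (K0, K1, K2), copied from the table above
ibd <- rbind(
    "mono-zygotic twin" = c(0,    0,    1),
    "Parent-Offspring"  = c(0,    1,    0),
    "Full siblings"     = c(0.25, 0.5,  0.25),
    "Half siblings"     = c(0.5,  0.5,  0),
    "First cousins"     = c(0.75, 0.25, 0),
    "Unrelated"         = c(1,    0,    0)
)
colnames(ibd) <- c("K0", "K1", "K2")

# Kinship coefficient implied by each row (standard relation, assumed here)
kinship <- ibd[, "K1"] / 4 + ibd[, "K2"] / 2
print(cbind(ibd, kinship))

# Hypothetical helper: name of the table row closest to an estimate k
closestRelationship <- function(k) {
    d <- apply(ibd, 1, function(r) sum((r - k)^2))
    names(which.min(d))
}

# Made-up estimate for illustration only
closestRelationship(c(0.27, 0.49, 0.24))
```

Note that parent-offspring and full siblings have the same kinship coefficient, so the three IBD probabilities carry more information than the kinship coefficient alone.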

NGSrelate has its own website: http://www.popgen.dk/software/index.php/NgsRelate

IBS/genotype distribution

If you do not have population allele frequencies then you cannot estimate kinship coefficients. However, you can still make some claims about the relationship of your samples based on IBS patterns. Below is an example of the IBS pattern between two individuals, where we ignore the allele types. G is the genotype, which counts for example the number of derived or non-reference alleles. Basically it is the 2D SFS where there is just 1 individual in each of the two populations. The letters A to I label the nine cells of the table.

		ind2
ind1	$G=0$	$G=1$	$G=2$
$G=0$	$A$	$D$	$G$
$G=1$	$B$	$E$	$H$
$G=2$	$C$	$F$	$I$
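
To make the cell labels concrete, the sketch below tabulates a pair of genotype vectors into the 3x3 pattern. The genotype vectors are made-up hard calls used only for illustration; ANGSD itself works from genotype likelihoods rather than called genotypes, so this shows the bookkeeping, not the estimation method.

```r
# Hypothetical called genotypes (number of non-reference alleles) at 10 sites
g1 <- c(0, 1, 2, 1, 0, 1, 2, 0, 1, 1)  # ind1
g2 <- c(0, 1, 1, 2, 0, 0, 2, 1, 1, 2)  # ind2

# Fix the factor levels so all 9 cells appear even if a combination is unobserved
ibs <- table(ind1 = factor(g1, levels = 0:2),
             ind2 = factor(g2, levels = 0:2))
print(ibs)

# R stores the table column by column, which matches the lettering above:
# A, B, C run down the first column, D, E, F down the second, G, H, I down the third
cells <- setNames(as.vector(ibs), LETTERS[1:9])
print(cells)
```

Here C and G are the two cells where the individuals are opposite homozygotes, and E is the cell where both are heterozygous.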

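Here are some useful ratios of IBS counts that can be used to say something about relatedness, assuming no inbreeding. R1 denotes the ratio $\frac{E}{B+C+D+F+G+H}$; its expected value depends on the allele frequency, so for most relationships the table gives bounds rather than a single value.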

Relationship	Expected ratio	Expected ratio (R1)	Expected ratio (R1)
mono-zygotic twin	$B,C,D,F,G,H=0$	$\frac{E}{B+C+D+F+G+H}=\infty$	-
Parent-Offspring	$C,G=0$	$\frac{E}{B+C+D+F+G+H}=0.5$	-
Full siblings	$\frac{E}{C+G}>2$	$\frac{E}{B+C+D+F+G+H}>0.5$	$\frac{E}{B+C+D+F+G+H}<10/13$
Half siblings	$\frac{E}{C+G}>2$	$\frac{E}{B+C+D+F+G+H}<4/9$	$\frac{E}{B+C+D+F+G+H}>1/6$
First cousins	$\frac{E}{C+G}>2$	$\frac{E}{B+C+D+F+G+H}<8/19$	$\frac{E}{B+C+D+F+G+H}>1/14$
Unrelated	$\frac{E}{C+G}=2$	$\frac{E}{B+C+D+F+G+H}<4/10$	$\frac{E}{B+C+D+F+G+H}>0$
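
The two ratios can be computed directly from a 3x3 IBS matrix. The sketch below uses the expected pattern for an unrelated pair at allele frequency 0.5 as input; `ibsRatios` is a hypothetical helper name, and the matrix layout (rows for ind1, columns for ind2, in the order G = 0, 1, 2) is assumed to follow the cell table above.

```r
# Expected joint genotype proportions for an unrelated pair at allele frequency 0.5
# (rows: ind1 G = 0, 1, 2; columns: ind2 G = 0, 1, 2)
ibs <- rbind(
    c(0.0625, 0.125, 0.0625),
    c(0.1250, 0.250, 0.1250),
    c(0.0625, 0.125, 0.0625)
)

# Hypothetical helper: the two ratios used in the table above
ibsRatios <- function(m) {
    E <- m[2, 2]                       # both individuals heterozygous
    CG <- m[3, 1] + m[1, 3]            # opposite homozygotes (cells C and G)
    offDiag <- sum(m) - sum(diag(m))   # B + C + D + F + G + H
    c(E_over_CG = E / CG, R1 = E / offDiag)
}

ibsRatios(ibs)
```

For this input the first ratio is 2 and R1 is 0.4, matching the "Unrelated" row at the upper end of its R1 range. With real data the cells are counts or estimated proportions summed over sites, so some sampling noise around these expectations should be expected.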

How to get the IBS pattern

You can estimate the IBS pattern in ANGSD with either the 2D SFS method or the genotype distribution method.

The two methods are very similar and differ in one respect. The SFS method uses ancestral information or a reference to infer the 2 alleles at each position. The genotype distribution method infers neither the major nor the minor allele, and instead uses all 10 possible genotype likelihoods.

R code to get expectations

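# R code to get the expected IBS pattern
## k is the vector of the 3 IBD sharing probabilities (K0, K1, K2)
## f is the allele frequency
getEst<-function(k=c(1,0,0),f=0.5){
    p<-f
    q<-1-f
    ## joint genotype probabilities given 0 alleles shared IBD
    m0<-rbind(
        c(p^4,2*p^3*q,p^2*q^2),
        c(2*p^3*q,4*p^2*q^2,2*p*q^3),
        c(p^2*q^2,2*q^3*p,q^4)
        )
    ## joint genotype probabilities given 1 allele shared IBD
    m1<-rbind(
        c(p^3,p^2*q,0),
        c(p^2*q,p^2*q+q^2*p,p*q^2),
        c(0,q^2*p,q^3)
        )
    ## joint genotype probabilities given 2 alleles shared IBD
    m2<-rbind(
        c(p^2,0,0),
        c(0,2*p*q,0),
        c(0,0,q^2)
        )
    ## mixture weighted by the IBD sharing probabilities
    return(k[1]*m0+k[2]*m1+k[3]*m2)
}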

getEst(k=c(1,0,0),f=0.5)
       [,1]  [,2]   [,3]
[1,] 0.0625 0.125 0.0625
[2,] 0.1250 0.250 0.1250
[3,] 0.0625 0.125 0.0625
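
The same function can be used to check where the R1 bounds in the ratio table come from. The sketch below repeats `getEst` so that it runs on its own, then sweeps the allele frequency for several relationships. The helper `r1`, the frequency grid, and the list `rel` are choices made for this illustration, not part of the original code.

```r
# getEst repeated from above so this block is self-contained
getEst <- function(k = c(1, 0, 0), f = 0.5) {
    p <- f
    q <- 1 - f
    m0 <- rbind(c(p^4, 2*p^3*q, p^2*q^2),
                c(2*p^3*q, 4*p^2*q^2, 2*p*q^3),
                c(p^2*q^2, 2*q^3*p, q^4))
    m1 <- rbind(c(p^3, p^2*q, 0),
                c(p^2*q, p^2*q + q^2*p, p*q^2),
                c(0, q^2*p, q^3))
    m2 <- rbind(c(p^2, 0, 0),
                c(0, 2*p*q, 0),
                c(0, 0, q^2))
    k[1]*m0 + k[2]*m1 + k[3]*m2
}

# R1 = E / (B + C + D + F + G + H): heterozygous-heterozygous over all off-diagonal cells
r1 <- function(m) m[2, 2] / (sum(m) - sum(diag(m)))

# IBD sharing probabilities from the first table on this page
rel <- list(
    "Full siblings" = c(0.25, 0.5, 0.25),
    "Half siblings" = c(0.5, 0.5, 0),
    "First cousins" = c(0.75, 0.25, 0),
    "Unrelated"     = c(1, 0, 0)
)

# Allele frequencies 0.01, 0.02, ..., 0.5 (the pattern is symmetric around 0.5)
freqs <- (1:50) / 100

# Smallest and largest R1 over the grid for each relationship
r1Range <- t(sapply(rel, function(k) {
    range(sapply(freqs, function(f) r1(getEst(k, f))))
}))
colnames(r1Range) <- c("min", "max")
print(round(r1Range, 3))
```

Under these expectations the maximum is reached at f = 0.5 and equals 10/13, 4/9, 8/19 and 4/10 for the four relationships, while the minimum approaches 0.5, 1/6, 1/14 and 0 as the allele frequency goes to 0.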

[Figure: ratioRel.png]
