Processing math: 100%

Calculating and Inferring Relatedness Coefficients with BGmisc

Introduction

This vignette demonstrates how to quantify relatedness using two functions from the BGmisc package: - calculateRelatedness computes the relatedness coefficient based on known genealogical structure, and - inferRelatedness infers the relatedness coefficient from observed phenotypic correlations under a fixed ACE model.

The relatedness coefficient r indexes the proportion of alleles shared identically by descent (IBD) between two individuals. This value ranges from 0 (no shared alleles by descent) to 1 (a perfect genetic match, which occurs when comparing an individual to themselves, their identical twin, or their clone). Values can be interpreted in the context of standard relationships: e.g., full siblings are expected to have r=0.5, half siblings r=0.25, and first cousins r=0.125.

Calculating Relatedness Coefficient

The calculateRelatedness function offers a method to compute the relatedness coefficient based on shared ancestry. The function computes r based on generational distance to one or more shared ancestors, according to Wright’s (1922) formulation:

rbc=(12)n+n+1(1+fa)

Here, n and n are the number of generations from each descendant to a common ancestor a, and fa is the inbreeding coefficient of a, assumed to be zero unless specified otherwise.

library(BGmisc)
# Example usage:
# For full siblings, the relatedness coefficient is expected to be 0.5:
calculateRelatedness(generations = 1, full = TRUE)
#> [1] 0.5
# For half siblings, the relatedness coefficient is expected to be 0.25:
calculateRelatedness(generations = 1, full = FALSE)
#> [1] 0.25

These examples illustrate how relatedness changes based on whether the siblings share both parents (full) or only one (half). When full = TRUE, each sibling is one generation from the shared pair of parents, yielding r=0.5. When full = FALSE, they share only one parent, yielding r=0.25.

Inferring Relatedness Coefficient

The inferRelatedness function solves for the relatedness coefficient r implied by an observed phenotypic correlation under a fixed ACE variance decomposition. Specifically, it inverts the equation:

obsR=ra2+sharedCc2

to obtain:

r=obsRsharedCc2a2

where: - obsR is the observed phenotypic correlation between two individuals or groups. - aceA and aceC represent the proportions of variance due to additive genetic and shared environmental influences, respectively. - sharedC is the shared-environment analog to the relatedness coefficient: it indicates what proportion of the shared environmental variance applies to this pair (e.g., 1 for siblings raised together, 0 for siblings raised apart).

# Example usage:
# Infer the relatedness coefficient:
inferRelatedness(obsR = 0.5, aceA = 0.9, aceC = 0, sharedC = 0)
#> [1] 0.5555556

In this example, the observed correlation is 0.5, and no shared environmental variance is assumed. Given that additive genetic variance accounts for 90% of trait variance, the inferred relatedness coefficient is approximately 0.556. This reflects the proportion of genetic overlap that would be required to produce the observed similarity under these assumptions.

# Now assume shared environment is fully shared:
inferRelatedness(obsR = 0.5, aceA = 0.45, aceC = 0.45, sharedC = 1)
#> [1] 0.1111111

In this case, the observed phenotypic correlation is still 0.5, and both additive genetic and shared environmental components are assumed to explain 45% of the variance. Because the shared environment is fully shared between individuals (sharedC = 1), much of the observed similarity is attributed to C, leaving only a small portion attributable to genetic relatedness. The function returns an inferred relatedness coefficient of approximately 0.11 — that is, the amount of additive genetic overlap required (under this model) to produce the remaining unexplained correlation after accounting for shared environmental similarity.