Proceedings | Open | Published:

# Using the maternal-fetal genotype incompatibility test to assess non-inherited maternal HLA-DRB1 antigen coding alleles as rheumatoid arthritis risk factors

*BMC Proceedings***volume 1**, Article number: S124 (2007)

## Abstract

Non-inherited maternal antigens encoded by specific *HLA-DRB1* alleles (NIMA) have been implicated as a rheumatoid arthritis (RA) risk factor. Using genotype data from North American Rheumatoid Arthritis Consortium study participants and the maternal-fetal genotype incompatibility (MFG) test, we find evidence for offspring allelic effects but no evidence for NIMA as a RA risk factor. We discuss possible reasons why our result conflicts with several previous studies (including one of our own) that used RA patients from northern Europe.

## Background

Rheumatoid arthritis (RA) is highly associated with *HLA-DRB1* *0101, *0102, *0401, *0404, *0405, *0408, *0423, *10, *1001, *1402, and *1406 alleles that encode a shared epitope (SE) [1]. However, as many as 30% of the patients do *not* carry SE coding alleles [1] and an association between non-inherited maternal SE *HLA-DRB1* antigen coding alleles (NIMA) and RA also has been observed [2–5]. One explanation for the latter finding is that NIMA may be involved in RA pathogenesis through microchimera formation in offspring who do not have SE coding alleles [2]. However, other studies have not found a significant NIMA effect [6–8].

Hsieh et al. [3] demonstrated that the maternal-fetal genotype incompatibility (MFG) test [9, 10] allows the joint estimation of the offspring allelic and NIMA effects as a RA risk factor and is robust to population stratification. The current study applies the MFG test to *HLA-DRB1* genotype data from the North American Rheumatoid Arthritis Consortium (NARAC) study to test NIMA as a risk factor for RA.

## Methods

### Statistical modeling: the MFG test

The MFG test models the joint distribution of parental and offspring genotypes given the number of affected offspring in nuclear families as

where for family *i* = 1,..., *N*, ${G}_{c}^{(i)}=\left({G}_{c1}^{(i)},\cdots ,{G}_{c{n}_{i}}^{(i)}\right)$ denote the genotypes of the *n*_{
i
}offspring, in which *k*_{
i
}are affected; ${G}_{p}^{(i)}=\left({G}_{m}^{(i)},{G}_{f}^{(i)}\right)$ denote parental genotypes, in which ${G}_{m}^{(i)}$ and ${G}_{f}^{(i)}$ are maternal and paternal genotypes, respectively; ${D}^{(i)}=\left({D}_{1}^{(i)},\cdots ,{D}_{{k}_{i}}^{(i)}\right)$ denotes the event that *k*_{
i
}offspring are affected [4, 5]. Phenotypes of the unaffected offspring are not used in the likelihood.

Using Bayes' theorem and assuming siblings' phenotypes are independent given parental genotypes, likelihood Eq. (1) is parameterized in terms of penetrance functions and population mating type frequencies [3, 9]. As in Hsieh et al. [3], we model the penetrances as

pr(*D*|*G*_{
c
}, *G*_{
p
}) = *p* × *μ*^{I[M] }× *ρ*_{1}^{I[Z= 1] }× *ρ*_{2}^{I[Z= 2] }, (2)

where *I* [·] is the indicator function, *M* is the event that the offspring does not inherit an SE coding allele carried by the mother (i.e., NIMA), and *Z* denotes the number of SE coding alleles present in the offspring genotype. The parameter *p* denotes the population baseline disease incidence rate, which ultimately cancels from Eq. (1). The parameter *μ* is the relative risk due to NIMA; *ρ*_{1} and *ρ*_{2} are the relative risks when one or two copies of the SE coding allele are present in the offspring genotype, respectively, relative to zero copies. Note that the relative risks (*μ*, *ρ*_{1}, *ρ*_{2}) range from 0 to infinity, and each have a null value of 1. The numerical maximization of the log-likelihood is better conditioned when we use the natural logarithm of the relative risks as parameters so we actually estimate *α* = log(*μ*), *β*_{1} = log(*ρ*_{1}), and *β*_{2} = log(*ρ*_{2}) [9]. Each of these parameters, *α*, *β*_{1}, and *β*_{2}, are therefore defined on the real line from negative infinity to infinity and each have a null value of zero.

Because we assume that all SE coding alleles confer the same risk to RA susceptibility (as was assumed in earlier studies [2–8]), the model reduces to two alleles that we denote as *S* for the SE coding risk allele and *N* for the non SE coding allele. Thus, there are three possible genotypes (*S*/*S*, *S*/*N*, and *N*/*N*). The penetrance function as expressed by Eq. (2) is short-hand for three mutually exclusive maternal-offspring genotype combinations in which the indicator functions determine which relative risk corresponds to the genotype combination. These mutually exclusive combinations are: the offspring carries two copies of the S allele; the offspring carries one copy of the S allele; and the offspring does not carry S allele but the mother does. Each of these combinations has an associated risk that we define in a standard manner, that is, as relative to a reference category in which both the offspring and mother are *N*/*N*.

Assuming mating symmetry under the null hypothesis, there are six possible mating types [10]. Neither Hardy-Weinberg equilibrium nor random mating is assumed. Data from families with incomplete parental genotypes are included in Eq. (1) by assuming the genotypes are missing at random and summing over all possible parental genotypes [9, 11]. Hypothesis tests of offspring allelic or of NIMA effects use a likelihood-ratio test statistic, whose distribution is asymptotically chi-square.

### The NARAC data set

Of the four GAW15 RA study samples, only the NARAC data set was suitable for this analysis because at least some parental *HLA-DRB1* genotypes must be available to test for NIMA effects. The ethnic background of the NARAC families is representative of United States and Canadian families affected with RA (predominately Caucasian with African, Hispanic, Native American, and Asian minorities).

Before recoding the alleles into *S* and *N*, we used the pedigree trimming option of Mendel (version 6.5.0) to remove individuals without *HLA-DRB1* genotypes who are not necessary to define the relationships among genotyped individuals [12, 13]. We then tested for genotyping errors using the mistyping analysis available in Mendel version 6.5.0 [13, 14] and found that the *HLA-DRB1* error rate was less than 1%. Individuals' genotypes that were in error were omitted and when several family members could be in error, the entire family's *HLA-DRB1* genotypes were omitted. Most of the families are nuclear families; however there are a few extended families. For each extended pedigree, a single nuclear family was selected using the following protocol: whenever a nuclear family has complete parental genotypes available, they are selected over those families with zero or one parent genotyped. Whenever multiple nuclear families within the extended pedigree have the same parental genotype availability, one of the nuclear families is randomly selected. At this point, there were 708 nuclear families.

About half the families have neither parent genotyped. These families provide very little power for detecting maternal-fetal genotype interactions [11] and were omitted. The remaining number of nuclear families was 318. We then recoded the alleles. The resolution of 119 individuals' *HLA-DRB1* genotypes is too low to allow certain classification of their alleles as *S* or *N*. For example the allele *01 could be *0101 (*S*) or *0103 (*N*). These ambiguous genotypes were treated as missing in the analysis.

The final data set had 263 nuclear families with at least one parent and one affected offspring genotyped. The distribution of affected offspring and genotyped parents is given in Table 1. There are a total of 517 affected offspring (393 female and 124 male) in these families. To test whether we introduced a bias by our family selection scheme, we compared the genotype distribution of affected offspring from the families used in the analysis to the genotype distribution of affected offspring from the excluded families. Specifically, we randomly selected one affected offspring per included family and one affected offspring per excluded family and compared the distributions using a Fisher exact test. We found that the genotype distributions were not significantly different (*p* = 0.946).

## Results

### Inference

We fit three models by placing different constraints on the parameters. These models are compared by constructing likelihood-ratio test statistics. Comparison of Models 1 (*α* = *β*_{1} = *β*_{2} = 0) and 2 (*α* = 0) in Table 2 reveals a highly significant offspring SE allelic effect on RA risk (${\chi}_{2}^{2}$ = 128.1, *p* = 1.5 × 10^{-28}). The relative risk for individuals with S/N is *ρ*_{1} = 4.3 and the relative risk for individuals with S/S is *ρ*_{2} = 12.7, similar in magnitude to the results of earlier association studies (see for example [15, 16]). Comparison of Models 2 (*α* = 0) and 3 (no parameter restrictions) shows there is no evidence for the NIMA effect on RA risk in the presence of offspring allelic effects (${\chi}_{1}^{2}$ = 0.0172, *p* = 0.8956).

### Power

One possible explanation for our failure to find evidence for the NIMA effect is that our sample size is too small. The earlier studies found the MFG effect to be ~4, approximately equal to the S/N risk to offspring [2–5]. We simulated 1000 data sets and found that there is 80% power to detect an MFG effect size of 3.5 with 63 families each having two affected offspring and completely genotyped parents. Our actual study sample has power to detect even smaller effect sizes because the additional 200 families with only one parent genotyped substantially increase the power [11].

## Discussion

MFG incompatibility results when specific maternal-fetal genotype combinations produce an adverse effect on the developing fetus that ultimately increases offspring disease risk. The exposure of fetuses who do not carry SE coding alleles to a maternal SE antigen is an example of a putative MFG incompatibility event and so can be examined using the MFG test.

The highly significant offspring allelic effects are consistent with numerous studies [3, 15, 16]. However, we did not find any evidence of a NIMA effect. Although we can not exclude the possibility that small NIMA effects were missed, low power is not a reason for the failure to detect moderate NIMA effects. Our simulation results show that we could detect a NIMA effect that is less than the effect size observed in earlier studies [2–5]. Although the NARAC families are ethnically diverse, population stratification also is not a concern as the MFG test is robust to its effects [3, 10, 11].

There were originally 708 nuclear families available in the NARAC data set. After excluding families without parental genotypes and individuals with low-resolution *HLA-DRB1* genotypes, only 263 nuclear families were used in our analysis. However when the genotypes are missing at random, it has been shown that MFG test produces unbiased parameter estimates and accurate hypothesis tests when only approximately 25% of the families in the sample have both parental genotypes [11]. The missing-at-random assumption is reasonable in this study. Therefore, the remaining 263 families provide valid, unbiased results even though more than half of the original NARAC families are excluded.

The following reasons might account for the failure to find a significant MFG incompatibility effect: a) the previous studies with positive findings [2–5] could represent type I errors, or the maternal or offspring effects at other *HLA* loci could have been misattributed to a *HLA-DRB1* NIMA effect (an issue of model misspecification); b) *HLA-DRB1* is difficult to accurately genotype and mistyping could lead to both false-positive and negative NIMA results; c) there could be population differences in the susceptibility to microchimera; d) differences in ascertainment or diagnostic criteria used in the different studies might also contribute to the variability in the results. The NARAC families are multi-case families and these families may have a different genetic background than single-case families [5].

## References

- 1.
Newton JL, Harney SM, Wordsworth BP, Brown MA: A review of the MHC genetics of rheumatoid arthritis. Genes Immun. 2004, 5: 151-157. 10.1038/sj.gene.6364045.

- 2.
Harney S, Newton J, Milicic A, Brown MA, Wordsworth BP: Non-inherited maternal HLA alleles are associated with rheumatoid arthritis. Rheumatology. 2003, 42: 171-174. 10.1093/rheumatology/keg059.

- 3.
Hsieh H, Palmer CGS, Harney S, Newton JL, Woodsworth P, Brown MA, Sinsheimer JS: The v-MFG test: Investigating maternal, offspring, and maternal-fetal genetic incompatibilities effects on disease and viability. Genet Epidemiol. 2006, 30: 333-347. 10.1002/gepi.20148.

- 4.
ten Wolde S, Breedveld FC, de Vires RR, D'Amaro J, Rubenstein P, Schreuder GMTh, Claas FHJ, van Rood JJ: Influence of non-inherited maternal HLA antigens on occurrence of rheumatoid arthritis. Lancet. 1993, 341: 200-202. 10.1016/0140-6736(93)90065-O.

- 5.
van der Horst-Bruinsma IE, Haxes JM, Schreuder GM, Radstake TR, Barrera P, van de Putte LB, Mustamu D, van Schaardenburg D, Breedveld FC, de Vries RR: Influence of non-inherited maternal HLA-DR antigens on susceptibility to rheumatoid arthritis. Ann Rheum Dis. 1998, 57: 672-675.

- 6.
Silman AJ, Hay EM, Worthington J, Thomson W, Pepper L, Davidson J, Dyer PA, Ollier WE: Lack of influence of non-inherited maternal HLA-DR alleles on susceptibility to rheumatoid arthritis. Ann Rheum Dis. 1995, 54: 311-313.

- 7.
Barrera P, Balsa A, Alves H, Westhovens R, Maenaut K, Cornélis F, Fritz P, Bardin T, de Almeida G, Lopes-Vaz A, Pascual Salcedo D, de la Concha EG, Radstake TR, van de Putte LB, Migliorini P, Prud'homme JF, Charron D, Spyropoulou M, Mendes A, Spaepen M, Martinez M, Lepage V, Stravopoulos C: Noninherited maternal antigens do not play a role in rheumatoid arthritis susceptibility in Europe. European Consortium on Rheumatoid Arthritis Families. Arthritis Rheum. 2000, 43: 758-764. 10.1002/1529-0131(200004)43:4<758::AID-ANR6>3.0.CO;2-6.

- 8.
Barrera P, Balsa A, Alves H, Westhovens R, Maenaut K, Cornélis F, Fritz P, Bardin T, Ceu Maia M, Lopes-Vaz A, Pascual Salcedo D, de la Concha E, Radstake T, van de Putte LB, Migliorini P, Prudhomme JF, Charron D, Spyropoulou M, Mendes A, Spaepen M, Martinez M, Stavropoulos C, European Consortium on Rheumatoid Arthritis Families: Noninherited maternal antigens do not increase the susceptibility for familial rheumatoid arthritis. European Consortium on Rheumatoid Arthritis Families. J Rheumatol. 2001, 28: 968-974.

- 9.
Kraft P, Palmer CGS, Woodward JA, Turunen JA, Minassian S, Paunio T, Lonnqvist J, Peltonen L, Sinsheimer JS: RHD Maternal-Fetal genotype incompatibility and schizophrenia: Extending the MFG test to include multiple siblings and birth order. Eur J Hum Gen. 2004, 12: 192-198. 10.1038/sj.ejhg.5201129.

- 10.
Sinsheimer JS, Palmer CGS, Woodward JA: Detecting genotype combinations that increase risk for disease: The Maternal-Fetal genotype incompatibility test. Gen Epidemiol. 2003, 24: 1-13. 10.1002/gepi.10211.

- 11.
Hsieh HJ, Palmer CG, Sinsheimer JS: Allowing for missing data at highly polymorphic genes when testing for maternal, offspring and maternal-fetal genotype incompatibility effects. Hum Hered. 2006, 62: 165-174. 10.1159/000096444.

- 12.
Lange K, Sinsheimer JS: The pedigree trimming problem. Hum Hered. 2004, 58: 108-111. 10.1159/000083031.

- 13.
Lange K, Cantor R, Horvath S, Perola M, Sabatti C, Sinsheimer J, Sobel E: Mendel version 4.0: A complete package for the exact genetic analysis of discrete traits in pedigree and population data sets. Amer J Hum Gen. 2001, 69 (S): A1886-

- 14.
Sobel E, Papp JC, Lange K: Detection and integration of genotyping errors in statistical genetics. Amer J Hum Gen. 2002, 70: 496-508. 10.1086/338920.

- 15.
Hall FC, Weeks DE, Camilleri JP, Williams LA, Amos N, Darke C, Gibson K, Pile K, Wordsworth BP, Jessop JD: Influence of the HLA-DRB1 locus on susceptibility and severity in rheumatoid arthritis. QJM. 1996, 89: 821-829.

- 16.
Meyer JM, Evans TI, Small RE, Redford TW, Han J, Singh R, Moxley G: HLA-DRB1 genotype influence risk for and severity of rheumatoid arthritis. J Rheumatol. 1999, 26: 1024-1034.

## Acknowledgements

This investigation was supported in part by NIH grants GM53275 and HL28481.

This article has been published as part of *BMC Proceedings* Volume 1 Supplement 1, 2007: Genetic Analysis Workshop 15: Gene Expression Analysis and Approaches to Detecting Multiple Functional Loci. The full contents of the supplement are available online at http://www.biomedcentral.com/1753-6561/1?issue=S1.

## Author information

## Additional information

### Competing interests

The author(s) declare that they have no competing interests.

## Rights and permissions

## About this article

#### Published

#### DOI

### Keywords

- Rheumatoid Arthritis
- Nuclear Family
- Parental Genotype
- Shared Epitope
- Rheumatoid Arthritis Risk