Skip to main content

Assessing genotype × environment interaction in linkage mapping using affected sib pairs


Rheumatoid arthritis (RA) is a complex disease that involves both environmental and genetic factors. Elucidation of the basic etiologic factors involved in RA is essential for preventing and treating this disease. However, the etiology of RA, like that of other complex diseases, is largely unknown. In the present study, we conducted autosomal multipoint linkage scans using affected sib pairs by incorporating the smoking status into analysis. We divided the affected sib pairs into three subgroups based on smoking status (ever, current, or never). Interactions between the susceptibility genes and smoking could then be assessed through linkage mapping. Results suggested that the genetic effect of chromosome 6p21.2-3 in concordant current smoker pairs was about two-fold greater than that of the concordant non-current smoker pairs or discordant pairs. With incorporation of smoking status, additional regions with evidence of linkage were identified, including chromosomes 4q and 20q; while evidence of linkage remained in the regions of chromosomes 6p, 8p, and 9p. The interaction effects varied in different regions. Results from our analyses suggested that incorporating smoking status into linkage analyses could increase the statistical power of the multipoint linkage approach applied here and help elucidate the etiology of RA.


Numerous epidemiologic studies have shown that both genetic and environmental factors contribute to the development of rheumatoid arthritis (RA). Few studies have proceeded further to study how environmental and genetic factors might interact in individuals with RA [1]. To prevent and treat RA, it is necessary to understand the basic etiologic factors involved. Up to now, the most solid evidence for environmental influences exists for smoking; however, to the best of our knowledge, no previous studies have incorporated the effects of smoking into a genome-wide search for the susceptibility loci of RA. Because a gene × smoking interaction is likely to play an important role in the etiology of RA, incorporating smoking data to allow for interaction in linkage analysis would allow the interaction to be assessed and increase power for localizing a disease gene [2].

Recently, Liang et al. [35] proposed a robust multipoint linkage analysis approach using affected sib pairs by incorporating the information from environmental factors, which simultaneously tests for statistical interaction between the susceptibility locus and the environmental factors. It also provides estimates of the genetic effects stratified by the environmental factor and location of the disease locus τ, along with sampling uncertainty, to help investigators to narrow down chromosomal regions of interest. The value (1+C i )/2 (where the genetic effect for stratum i is denoted by "C i ") characterizes the probability of an affected sib pair in stratum i sharing the same allele at τ from the parent. The genetic effects from the susceptibility locus for all stratified groups could be estimated, and the significance levels of these genetic linkage effects could be assessed at the estimated putative disease locus. Further, the significance of the interaction between a gene and smoking can be assessed through testing the null hypothesis, where all C values are considered equal. Hence, we adopted this multipoint linkage approach to assess the gene × smoking interaction for RA in the present study.



A total of 1096 affected sib pairs from 757 multiplex families in the North American Rheumatoid Arthritis Consortium (NARAC) study were included in the study. Only 615 or 627 sib pairs (depending on the chromosomal regions) had genotype information, and thus these were used for our analysis. The NARAC multiplex families contain 8017 individuals, most of whom are Caucasians (90.6%). We performed the analyses using the entire data set and the subset of Caucasians and found that the results from both data sets were virtually identical. We therefore reported only the results from the entire data set here. A total of 375 microsatellite markers were used in the analyses. There were 615 affected sib pairs available for chromosomes 1–11, 13–16, and 19–22, and 627 affected sib pairs available for chromosomes 12, 17, and 18. The smoking variables included "ever smoker" and "current smoker." Due to the missingness of smoking variables, the total number of affected sib pairs included in the analysis varied from 585 to 597, depending on which smoking variable was used, and which chromosomal region was studied.

Several studies have shown that the association between current heavy smokers and RA was striking, while the association between "ever smokers" and RA was modest (e.g., [6]). To understand the etiology of RA, it is therefore helpful to examine the interactions between these two smoking statuses and the trait locus of RA. The affected sib pairs were stratified according to their smoking status. The gene × smoking effect was examined separately for the two smoking variables. The three groups of affected sib pairs stratified by "ever smoked" status were (never smoked, never smoked) pairs, (never smoked, ever smoked) pairs, and (ever smoked, ever smoked) pairs; they were (non-current smoker, non-current smoker) pairs, (non-current smoker, current smoker) pairs, and (current smoker, current smoker) pairs when stratified by the "current smoking" status. For chromosomes 12, 17, and 18, the numbers of (never smoked, never smoked), (never smoked, ever smoked), and (ever smoked, ever smoked) affected sib pairs were 163, 206, and 225, respectively; there were 160, 203, and 222, respectively, for the rest of chromosomes. In addition, the numbers of affected sib pairs for (non-current smoker, non-current smoker), (non-current smoker, current smoker), and (current smoker, current smoker) were 425, 138, and 34, respectively, for chromosomes 12, 17, and 18, and 417, 137, and 34, respectively, for the rest of chromosomes. Among the 425 (417) concordant "non-current smoker" pairs, 160 (163) of them were concordant "never smoked" pairs, while 34 (34) out of 225 (222) "ever smoked" pairs were "current smoker" pairs. There were 636 (about 39.9%) former smokers who were "ever smokers", yet were not "current smokers." Five affected sibs (about 0.31%) mistakenly reported that they never smoked, yet were current smokers. The difference in numbers of the affected sib pairs defined by never/ever smoked and non-current/current smokers was made by these 641 (636+5) affected sibs.

Statistical methods

The parameters C0, C1, and C2 were the genetic effects for the three groups stratified by one of the smoking statuses, respectively. The GeneHunter program was used to calculate identity-by-decent (IBD) sharing of affected sib pairs. The GeneFinder program was applied to obtain the estimates of τ and C i , i = 0, 1, 2, and their 95% confidence intervals, as well as to calculate the p-values of the genetic effects to test whether C values were all equal (that is, if the gene × environment interaction was present). In addition, we compared these results with the results from analyses excluding environmental factors.


For comparison, we demonstrated the results from the autosomal-wide scan in which smoking status was not incorporated in Table 1. As illustrated in Table 2 and Figure 1a, after stratifying on the status of "ever smoked," the susceptibility disease locus on chromosome 6 remained in the same region as that identified without incorporating "ever smoked" status, around 45.6 cM on chromosome 6. The genetic effects from the three "ever smoked" groups remained statistically significant and were similar at this locus, with C ^ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGdbWqgaqcaaaa@2DCB@ 0 = 0.21 (p = 0.00012), C ^ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGdbWqgaqcaaaa@2DCB@ 1 = 0.20 (p = 0.000031) and C ^ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGdbWqgaqcaaaa@2DCB@ 2 = 0.22 (p = 2.32 × 10-6). Therefore, the interaction between the susceptibility locus and "ever smoked" status was not statistically significant (p = 0.95). Nevertheless, the interaction of the gene by "ever smoked" status was observed at 25.84 cM on chromosome 8 (p = 0.0055), at 23.9 cM on chromosome 13 (p = 0.026), at 55.12 cM on chromosome 15 (p = 0.029), and at 44.85 cM on chromosome 17 (p = 0.017). The (never smoked, never smoked) and (never smoked, ever smoked) pairs showed significant genetic effects ( C ^ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGdbWqgaqcaaaa@2DCB@ 0 = 0.22, p = 0.00047; C ^ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGdbWqgaqcaaaa@2DCB@ 1 = 0.10, p = 0.027) at the susceptibility locus identified on chromosome 8, but not on chromosomes 13, 15, or 17, indicating that the genetic effect of "ever smoked" status varied from region to region, and the interaction could still exist when the genetic effect for each stratum in the region was not statistically significant.

Table 1 Autosomal-wide linkage mapping without incorporating a smoking variable
Table 2 Incorporating "ever smoked" status into linkage mapping using affected sib pairs
Figure 1
figure 1

a, Estimated IBD stratified by "ever smoked" vs. overall estimated IBD for chromosome 6; b, estimated IBD stratified by "current smoker" vs. overall estimated IBD for chromosome 6.

When stratified by "current smoking" status (Table 3), the susceptibility disease locus on chromosome 6 remained at the same location, and the interaction between this locus and current smoking status was statistically significant (p = 0.023). The genetic effect from the (current smoker, current smoker) group was estimated to be 0.43 (p = 0.0018), about two-fold higher than those from the (non-current smoker, non-current smoker) ( C ^ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGdbWqgaqcaaaa@2DCB@ 0 = 0.20, p = 1.30 × 10-8) and (non-current smoker, current smoker) ( C ^ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGdbWqgaqcaaaa@2DCB@ 1 = 0.20, p = 0.00068) groups as illustrated in Figure 1b.

Table 3 Incorporating "current smoker" status into linkage mapping using affected sib pairs

Other regions showed a significant interaction between the susceptibility locus and current smoking status, including the locations of 160.2, 26.3, 38.8, 26.9, 38.0, 83.0, 74.5, 81.1, 88.5, and 38.4 cM on chromosomes 4, 8, 9, 10, 13, 14, 18, 19, 20, and 21, respectively. The gene × smoking interactions were most striking on chromosomes 8 (p = 0.0086) and 9 (p = 0.00052). Among them, the (non-current smoker, non-current smoker) group showed statistically significant genetic effects on chromosomes 8p and 9p with C ^ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGdbWqgaqcaaaa@2DCB@ 0 = 0.12 (p = 0.00095) and 0.11 (p = 0.0060), respectively. The (non-current smoker, current smoker) group showed a statistically significant genetic effect on chromosome 4q ( C ^ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGdbWqgaqcaaaa@2DCB@ 1 = 0.12, p = 0.044), and the (current smoker, current smoker) group showed a statistically significant genetic effect on chromosomes 20q ( C ^ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGdbWqgaqcaaaa@2DCB@ 2 = 0.21, p = 0.052).


The etiology of RA is multifactorial, with genetic factors contributing between 30 and 50% of the total risk [6]. We conducted genome-wide multipoint linkage scans by stratifying on the status of "ever smoked" or "current smoker" to assess the interaction between smoking and the susceptibility locus for RA. The "current smoker" status significantly interacted with the locus identified in the region of 6p21.2-3, while "ever smoked" did not. The genetic effect in this region for "current smoker" pairs was about two-fold of that for other types of pairs, suggesting that smoking might trigger genes with immunologic significance in this region more strongly in current smokers than it does for "non-current" smokers.

It is estimated that about one-third of the total proportion of the genetic risk arises from the major histocompatibility complex that lies in the 6p21.2-3 region, which is known to contain more than 220 genes [7]. Our results suggested that the genes in this region interact with current smoking status. In addition, by incorporating smoking status into our analyses, we also identified other regions with statistically significant genetic effects and interaction effects between current smoking status and the susceptibility locus, including regions on chromosomes 4q, 8p, 9p, and 20q. Among them, the concordant non-current smoker pairs showed stronger genetic effects on chromosomes 8p and 9p, while the concordant current smoker pairs showed stronger genetic effects on chromosome 20q. These findings help investigators to dissect the etiology and underlying genetic mechanism of RA, which is critical for designing new tools for suppressing RA pathogenesis before the onset of disease.


  1. Klareskon L, Padyukov L, Lorentzen J, Alfredsson L: Mechanisms of disease: genetic susceptibility and environmental triggers in the development of rheumatoid arthritis. Nat Clin Pract Rheum. 2006, 2: 425-433. 10.1038/ncprheum0249.

    Article  Google Scholar 

  2. Gauderman WJ, Siegmund KD: Gene-environment interaction and affected sib pairs linkage analysis. Hum Hered. 2001, 52: 34-46. 10.1159/000053352.

    Article  PubMed  CAS  Google Scholar 

  3. Liang KY, Chiu YF, Beaty TH: A robust identity-by-descent procedure using affected sib pairs: multipoint mapping for complex diseases. Hum Hered. 2001, 51: 64-78. 10.1159/000022961.

    Article  PubMed  CAS  Google Scholar 

  4. Liang KY, Chiu YF, Beaty TH, Wjst M: Multipoint analysis using affected sib pairs: incorporating linkage evidence from unlinked regions. Genet Epidemiol. 2001, 21: 105-122. 10.1002/gepi.1021.

    Article  PubMed  CAS  Google Scholar 

  5. Chiu YF, Liang KY: Conditional multipoint linkage analysis using affected sib pairs: an alternative approach. Genet Epidemiol. 2004, 26: 108-115. 10.1002/gepi.10305.

    Article  PubMed  Google Scholar 

  6. Hutchinson D, Shepstone L, Moots R, Lear JT, Lynch MP: Heavy cigarette smoking is strongly associated with rheumatoid arthritis (RA), particularly in patients without a family history of RA. Ann Rheum Dis. 2001, 60: 223-227. 10.1136/ard.60.3.223.

    Article  PubMed Central  PubMed  CAS  Google Scholar 

  7. Deighton CM, Walker DJ, Griffiths ID, Roberts DF: The contribution of HLS to rheumatoid arthritis. Clin Genet. 1989, 36: 178-182.

    Article  PubMed  CAS  Google Scholar 

Download references


We thank Ms. Karen Klein (Research Support Core, WFUHS), Ms. Rhonda Hawks, and Ms. Patricia Feeney for their editorial contributions to this manuscript. This work was supported by grant BS-096-pp08 from the National Health Research Institutes and Academia Sinica grant GRC 94B001-1.

This article has been published as part of BMC Proceedings Volume 1 Supplement 1, 2007: Genetic Analysis Workshop 15: Gene Expression Analysis and Approaches to Detecting Multiple Functional Loci. The full contents of the supplement are available online at

Author information

Authors and Affiliations


Corresponding author

Correspondence to Yen-Feng Chiu.

Additional information

Competing interests

The author(s) declare that they have no competing interests.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Chen, YS., Chiu, YF., Kao, HY. et al. Assessing genotype × environment interaction in linkage mapping using affected sib pairs. BMC Proc 1 (Suppl 1), S71 (2007).

Download citation

  • Published:

  • DOI: