Abstract
Individuals differ widely in their immune responses, with age, sex and genetic factors having major roles in this inherent variability1,2,3,4,5,6. However, the variables that drive such differences in cytokine secretion—a crucial component of the host response to immune challenges—remain poorly defined. Here we investigated 136 variables and identified smoking, cytomegalovirus latent infection and body mass index as major contributors to variability in cytokine response, with effects of comparable magnitudes with age, sex and genetics. We find that smoking influences both innate and adaptive immune responses. Notably, its effect on innate responses is quickly lost after smoking cessation and is specifically associated with plasma levels of CEACAM6, whereas its effect on adaptive responses persists long after individuals quit smoking and is associated with epigenetic memory. This is supported by the association of the past smoking effect on cytokine responses with DNA methylation at specific signal trans-activators and regulators of metabolism. Our findings identify three novel variables associated with cytokine secretion variability and reveal roles for smoking in the short- and long-term regulation of immune responses. These results have potential clinical implications for the risk of developing infections, cancers or autoimmune diseases.
Similar content being viewed by others
Main
High levels of variability exist among individuals and populations in relation to responses to immune challenges2,7. This has been highlighted by the COVID-19 pandemic through the diverse clinical outcomes observed after infection with SARS-CoV-26,8. Variables such as age, sex and genetics have a major effect on the way individuals respond to infection2,3,4,5,6,9,10. However, such immune variability is generally not considered in the design of treatments or vaccines, and there is a need to better identify the variables associated with immune response variation11.
The Milieu Intérieur project was developed to assess the factors that contribute to variable ‘healthy’ immune responses12. The cohort is equilibrated in terms of age and sex and comprises individuals of a homogenous genetic background, to facilitate identification of novel immune determinants, in addition to age, sex and genetic variants. The Milieu Intérieur project has already advanced our understanding of the variables that regulate immune homeostasis. In particular, by quantifying the effects of age, sex, genetics and cellular composition on the transcript levels of immune-related genes4, and the effects of age, sex, cytomegalovirus (CMV) latent infection and smoking on blood leukocyte composition3.
To identify new environmental factors associated with variability in response to immune stimulation, we focused on cytokine protein secretion as an immune response phenotype. The concentrations of 13 disease- and medically-relevant cytokines (CXCL5, CSF2, IFNγ, IL-1β, TNF, IL-2, IL-6, IL-8, IL-10, IL-12p70, IL-13, IL-17 and IL-23) were measured with Luminex technology, after 22 h of standardized whole-blood stimulation with 11 immune agonists for the 1,000 Milieu Intérieur donors (Supplementary Table 1), as well as in a non-stimulated control (null condition). The stimulations are classified into 4 categories: microbial (Bacillus Calmette-Guérin (BCG), Escherichia coli (E. coli), lipopolysaccharide (LPS) and Candida albicans (C. albicans)) and viral (influenza and polyinosinic–polycytidylic acid (poly I:C)) agents, which are predominantly recognized by receptors on innate immune cells; T cell activators (Staphylococcus aureus enterotoxin B superantigen (SEB) and anti-CD3 and anti-CD28 antibodies (anti-CD3 + CD28)), which induce adaptive immune responses; and cytokines (TNF, IL-1β and IFNγ).
Smoking, CMV and BMI associations
Principal component analysis (PCA) (Extended Data Fig. 1) and heat maps (Extended Data Fig. 2) of the 13 cytokines in the 12 immune stimulations highlight the specific cytokines that were induced in each independent condition. Hierarchical clustering of the standardized log mean differences of the cytokine levels (Fig. 1a) clearly distinguishes groups that broadly correspond to stimulation type. Immune responses induced by innate (E. coli and LPS) and adaptive (SEB and anti-CD3 + CD28) stimulations cluster separately, and show greater inter-individual variability in the measured cytokines compared with the other stimulation types.
From the electronic case report form (eCRF), we compiled 136 socio-demographic, environmental, clinical and nutritional variables (Supplementary Table 2) and tested their association with the induced cytokines in each stimulation through likelihood ratio tests (LRTs), using age, sex and experimental batch as covariates. Eleven variables are associated with at least one cytokine in at least one stimulation (Benjamini–Yekutieli adjusted P value <0.01) (Fig. 1b). These are related mostly to body mass index (BMI) in SEB and BCG stimulations, CMV latent infection in adaptive immune stimulations and smoking, which shows the most associations across stimulations.
Smoking-related variables are associated with IL-2 and IL-13 in SEB and anti-CD3 + CD28 stimulations, and they are associated with CXCL5 in innate immune stimulations (Fig. 1c). These observations are supported by previous findings showing that smoking favours inflammation and compromises immunity to bacterial infection13, and that IL-2 and IL-13 are involved in modulating effects of exposure to tobacco14,15. CMV latent infection is associated with CSF2, IFNγ and TNF upon adaptive immune stimulations, in line with our previous work showing strong associations between CMV seropositivity and increased numbers of T cell effector memory subsets3. We also observed that BMI-related variables are associated with CXCL5 after BCG stimulation, and with IL-2 after SEB stimulation, which is consistent with the dysregulation of CXCL5 and IL-2 in obesity16,17. As potential interactions may exist between our tested variables and age, we performed the same analysis considering age and smoking interactions in the models. The results are very similar to the ones obtained without considering interactions, and some smoking-related variables are associated with even higher significance in SEB, anti-CD3 + CD28, E. coli and LPS stimulation conditions (Extended Data Fig. 3). Notably, by including these interactions, smoking-related variables are significantly associated with IL-2 responses after BCG stimulation. This IL-2 response may reflect a long-lived antigen-specific T cell response to BCG vaccination, which all of the cohort received at birth owing to mandatory BCG vaccination in France prior to 2007, further strengthening the associations of smoking with T cell immunity. Individual effects of age and sex have also been tested, and corresponding LRT results and effects sizes are shown on Extended Data Figs. 4 and 5. In addition, as human leukocyte antigen (HLA) is a well-known determinant of immune response variability, which is mostly relevant for antigen-specific responses, we tested associations between previously identified HLA types3 and induced cytokine responses following the same procedure that we used for the other donor variables. We detected only one significant association, between the major histocompatibility complex class II, DQ beta 1 HLA.DBQ1.1P and IL-6 in the non-stimulated control condition. However no associations were observed with induced cytokine responses after stimulation.
The smoking effect is persistent
To assess the biological effect of smoking on cytokine secretion, we plotted the effect sizes for the smoking variables from the linear models. We observed that current smoking affects both innate and adaptive immune responses (Fig. 2a and Extended Data Fig. 6). Smoking is associated with stronger induction of CXCL5 after E. coli stimulation, and stronger induction of IL-2 and IL-13 after SEB stimulation (Benjamini–Yekutieli adjusted P value of LRT < 0.001). The smoking-related variables—number of years smoking, number of years since last smoked, and total number of cigarettes—show consistent associations (Fig. 2b,c).
Of note, in contrast to current smokers (Fig. 2a), past smokers show no significant increase in CXCL5 secretion after innate immune stimulation, whereas they show an increase in IL-2 and IL-13 secretion after adaptive immune stimulation, compared with non-smokers (Fig. 2d). Box plots of cytokine concentrations show that the levels of CXCL5 in past smokers and non-smokers are not significantly different, but differ from those in current smokers after innate immune stimulations (Fig. 2e and Extended Data Fig. 6a,d). Conversely, the levels of IL-2 in current smokers and past smokers are not significantly different but differ from those in non-smokers in adaptive immune stimulations (Fig. 2f and Extended Data Fig. 6b,e). Cytokine production for both current and past smokers correlates with the number of years smoking in adaptive immune stimulations, but this is not the case for past smokers in innate immune stimulations (Fig. 2g and Extended Data Fig. 6c,f). These results collectively show a short-term effect of smoking on innate immune responses, and a long-term effect of smoking on adaptive immune responses.
Immune cells and plasma proteins
As induced cytokine levels are associated with numbers of specific subsets of circulating immune cells, we tested whether the smoking–cytokine associations remain when considering these cell subsets in our models (Fig. 3 and Extended Data Fig. 7a). The associations of CMV serostatus with the levels of CSF2, IFNγ and TNF in SEB stimulation are lost when specific memory T cell numbers are used as covariates (Extended Data Fig. 7a), which is consistent with the reported association of these cells with CMV latent infection3, and suggests that the effects of CMV latent infection on cytokines are mediated by changes in blood cell composition. The same analysis performed on the smoking variable for the innate immune stimulations show no clear cell subsets affecting the association of smoking with CXCL5 levels (Fig. 3a). Conversely, upon adaptive immune stimulations, the numbers of multiple B cell and regulatory T cell subsets eliminate the association of smoking with protein levels (Fig. 3a), suggesting that these subsets mediate the smoking effect in adaptive immune responses. We also assessed a role for 326 soluble blood proteins measured in the plasma of a subset of 400 donors18. We identified that the levels of carcinoembryonic antigen-related cell adhesion molecule 6 (CEACAM6), when included in the models, eliminates the association of smoking with CXCL5 after innate immune stimulations, suggesting that CEACAM6 is involved in CXCL5 regulation in smokers (Fig. 3b). We interpret this effect as a biological interaction rather than a shift in immune cell populations, owing to the short stimulation period of 22 h and because the regression analysis including cell numbers as covariates (Fig. 3a) did not identify any cellular associations with the bacteria-induced CXCL5–smoking association.
DNA methylation changes
To test the hypothesis that an epigenetic mechanism mediates the persistent effect of smoking on adaptive immune responses, we determined whether smoking affected cytokine levels through DNA methylation changes. We measured baseline DNA methylation at more than 850,000 CpG sites using the MethylationEPIC array and identified 2,416 CpG sites that are directly associated with smoking—that is, not mediated by blood cell composition—in the Milieu Intérieur cohort19. Among those sites, 129 are significantly associated with IL-2 in SEB stimulation (Benjamini–Yekutieli adjusted P value of the LRT < 0.001). We tested whether the DNA methylation levels of these CpG sites could be related to the association of smoking with cytokine levels after SEB stimulation. We observed that 11 CpGs, when passed as covariates in the models, eliminate the association of smoking with IL-2 and IL-13 (Benjamini–Yekutieli adjusted P value of LRT < 0.001) (Extended Data Fig. 7b). Among these, 3 relate to the aryl hydrocarbon receptor repressor (AHRR) gene. The other CpG sites correspond to F2RL3 and G-protein-coupled receptor (GPR15), retinoic acid receptor (RARA) and serine protease (PRSS23) genes. At all of these loci, we observe a strong DNA hypomethylation in current smokers compared with non-smokers, with past smokers showing an intermediate state of methylation (Fig. 4a). Consistently, in past smokers, the number of years individuals smoked (Fig. 4b), the total number of cigarettes they smoked (Fig. 4c) and IL-2 levels in SEB stimulation (Fig. 4e) negatively correlate with the level of DNA methylation of these genes, whereas the number of years since they stopped smoking generally positively correlate with their level of DNA methylation (Fig. 4d). These results provide evidence that a persistent effect of smoking on adaptive immune responses is associated with DNA methylation at signal trans-activators and metabolism regulators.
Genomic associations
As some genetic variants manifest their regulatory roles upon stimulation only, our study is particularly well suited to identify response protein quantitative trait loci (pQTLs). We tested a total of 5,699,237 high-quality imputed single nucleotide polymorphisms (SNPs) for associations with the cytokines induced in each stimulation, adjusting for age, sex, technical variables and major immune cell population counts (Supplementary Table 3) and report 44 reponse pQTLs (Table 1). The Somalogic and Olink databases are the main resources of plasma pQTLs, which have identified pQTLs for some of our tested cytokines at steady state20,21,22,23,24. However, to our knowledge, only one study—the 500FG study—tested for response pQTLs for some of our tested cytokines in whole blood25. Among the common tested cytokine–stimulation couples, both studies identify pQTLs for IL-6 and IL-1β in poly I:C, TNF in C. albicans and IL-1β in LPS. Of note, the pQTL (rs3775291) we identify for IL-1β and IL-6 in poly I:C is located in the TLR3 exon locus, whereas the pQTLs reported by the 500FG study for these cytokines (rs28393318 for IL-1β and rs6831581 for IL-6) are located in the TLR1–TLR6–TLR10 (TLR1/6/10) locus. We performed conditional analysis between the two loci by passing rs28393318 as a covariate in our pQTL identification, and show that the association we report between IL-1β and rs3775291 is maintained, indicating that it is independent of the one reported in the 500FG study (Supplementary Table 4). Among the potential new trans-pQTLs we identified, 3 are for CXCL5. One was identified upon BCG stimulation and is located in the TLR1/6/10 locus. Another one was identified in C. albicans stimulation and is located in the CR1 locus. The third was observed in poly I:C, IL-1β and IFNγ stimulations, and is in an intron of JMJD1C, which encodes a candidate histone H3K27me3 demethylase. We also identified a trans-pQTL for IL-2 in anti-CD3 + CD28 stimulation, which is close to the immunoglobulin Fc receptor IIa (FCGR2A) gene—identified as an eQTL by the GTEx consortium26—that has been associated with multiple autoimmune diseases27 and conditions the response to anti-CD3 + CD28 stimulation28. Another trans-pQTL of interest is for IL-1β, IL-12, TNF, IL-6 and IFNγ in poly I:C stimulation, which is located in a TLR3 exon and associated with age-related macular degeneration29 and resistance to viral infections30, consistent with poly I:C stimulation acting through TLR3. Other trans associations were identified for IL-1β in LPS stimulation, with the corresponding variant rs3764613, located in a PPP5C enhancer, and for IL-6 in BCG stimulation, for which the corresponding variant—rs72636686—is in an intron of the KAZN gene, which encodes a cell adhesion protein. We also assessed potential interactions between the identified SNPs and smoking status. No significant associations were observed for E. coli, LPS, SEB and anti-CD3 + CD28 stimulations, but BCG stimulation showed significant genetic-smoking interaction (effect size = 1.58 [1.42–1.75], Benjamini–Yekutieli adjusted P value = 3.8 × 10−13) (Extended Data Fig. 8a,b) between rs72636686 and smoking status for IL-8 levels. This interaction shows that smoking status can remove differences in response to some immune stimulation between individuals of different genotypes (Extended Data Fig. 8c), supporting modulation of genotype effects by the smoking status.
Cytokine variance explained
To assess the relative effect of the smoking associations identified, we quantified how inter-individual variance in each induced cytokine may be explained by each associated variable. The genetic and epigenetic loci, major immune cell subsets, age, sex, BMI, CMV latent infection and smoking status were combined in the same model and their respective effects on induced cytokine levels were computed (Fig. 5a–h). We observed that smoking explained between 4 and 9% of the inter-individual variance of the associated cytokines, a level equivalent to age, sex or genetic effects when present.
Discussion
Here we show that smoking status, CMV latent infection and BMI, in addition to age, sex, genetic variation, DNA methylation levels and immune cell subsets, are the variables most associated with variation in cytokine secretion upon immune challenge. Our approach to study induced immune responses to diverse stimuli was validated by the identification of new environmental and genetic determinants for 11 of the 13 cytokines studied, associations that were detected across the different stimulation conditions. The effects of smoking are of particular interest, as they have effects on both innate and adaptive immune responses (Extended Data Fig. 10c). The variance explained by the smoking status for some cytokines upon stimulation reaches a level equivalent to those of age, sex and genetic variants, all of which are known to have implications for disease risk. Current smokers showed an increased inflammatory response following bacterial stimulation, which is quickly lost upon smoking cessation. Conversely, the smoking effects on T cell responses persist years after individuals quit smoking.
The association of the smoking effect with long-lived B and T cell subsets and DNA methylation at signal trans-activators and metabolism regulators highlights a mechanism for the persistent effects in the adaptive response. AHRR, F2RL3, GP15, PRSS23 and RARA CpG sites were previously identified as candidate smoking-related loci in whole blood31,32. Of note, the CpG site most associated with smoking—within the AHRR gene—is a quantitative biomarker of smoking cessation33 and is associated with increased risk for chronic obstructive pulmonary disease and lung cancer34. Although the interplay of AHRR in the AHR pathway is complex and depends on the cell-type specific balancing of AHR and AHRR, the repressive activity of AHRR is due to displacement or inhibition of AHR binding to xenobiotic response elements in the promoter of its targets35. Decreased expression of xenobiotic metabolizing genes may, in turn, compromise the ability of the body to metabolize harmful agents, potentially leading to impaired lung function. Our work thus suggests that smoking can induce specific epigenetic modifications that could subsequently lead to altered immune responses.
By contrast, we did not identify putative cellular mediators for the increased inflammatory response to innate stimulation in smokers. We found a strong association between induced levels of CXCL5 with circulating levels of CEACAM6, a glycosylphosphatidylinositol cell surface protein that belongs to the CEACAM immunoglobulin supergene family. CXCL5 regulates neutrophil trafficking to the lung via CXCR2 and has been implicated in asthma and multiple cancers36. CEACAM6 is expressed on the surface of neutrophils, macrophages and lung and intestinal epithelial cells and its levels are increased in multiple cancers, such as breast and gastric cancer where it has been proposed as a clinical biomarker37. Although previous studies have suggested that levels of CXCL538 and CEACAM639 may be increased in smokers, most studies have focused on pulmonary sites and in patients with respiratory disease (such as cancer, chronic obstructive pulmonary disease and asthma), which can confound the results. Our study identifies a strong link between these previously proposed disease biomarkers and response to immune challenges in smokers versus non-smokers. Furthermore, our findings in healthy donors open avenues for further exploration into understanding how smoking acts as a risk factor for cancers beyond the lungs.
Although our study has revealed novel effects of environmental variables on immune responses, it does present some limitations. One is the absence of a replication cohort to validate these findings. Furthermore, our analyses were conducted on a population of similar genetic background, but ongoing efforts are underway to include other populations from diverse ancestries. Future studies will also aim to identify the transcriptional regulatory networks40 that underly the persistent effect of smoking on adaptive immune responses. These findings provide new understanding on the effects of smoking on human health, and the role of modifiable environmental effects on immune response variability.
Methods
Human samples
Human samples came from the Milieu Intérieur cohort, which was approved by the Comité de Protection des Personnes–Ouest 6 on 13 June 2012, and by the French Agence Nationale de Sécurité du Médicament (ANSM) on 22 June 2012. The study is sponsored by Institut Pasteur (Pasteur ID-RCB Number: 2012-A00238-35) and was conducted as a single-centre interventional study without an investigational product. The original protocol was registered under ClinicalTrials.gov (study no. NCT01699893). The samples and data used in this study were formally established as the Milieu Intérieur biocollection (NCT03905993), with approvals by the Comité de Protection des Personnes–Sud Méditerranée and the Commission Nationale de l’Informatique et des Libertés (CNIL) on 11 April 2018. Donors gave written informed consent. The 1,000 donors of the Milieu Intérieur cohort were recruited by BioTrial to be composed of healthy individuals of the same genetic background (Western European) and to have 100 women and 100 men from each decade of life between 20 and 69 years of age. Donors were selected based on various inclusion and exclusion criteria that were previously described12. In brief, donors were required to have no history or evidence of severe, chronic or recurrent pathological conditions, neurological or psychiatric disorders, alcohol abuse, recent use of drugs, recent vaccine administration and recent use of immune modulatory agents. To avoid the influence of hormonal fluctuations in women, pregnant and peri-menopausal women were not included. To avoid genetic stratification in the study population, the recruitment of donors was restricted to individuals whose parents and grandparents were born in Metropolitan France. Additionally, we formally checked how the genetic background of the donors could affect cytokine levels by performing association tests between the first 20 genetic principal components out of the PCA on the individual genotypes and each of the induced cytokines in each stimulation. Although PC1 had significant association with IL-10 (Benjamini–Yekutieli adjusted P value < 0.05), we found that the first 20 principal components showed no significant associations with cytokine responses at the P value threshold (Benjamini–Yekutieli adjusted P value < 0.01) we use throughout this study. To illustrate the homogeneity of the genetic structure of the 1,000 individuals of the Milieu Intérieur cohort, a PCA was performed with EIGENSTRAT41 on 261,827 independent SNPs and 1,723 individuals, which include the 1,000 Milieu Intérieur donors together with 723 individuals from a selection of 36 populations originating from North Africa, the Near East, as well as western and northern Europe42 is shown, similarly to what was previously performed3. PC1 versus PC2, PC1 versus PC3 and PC2 versus PC3 are displayed as well as a bar plot of the variance explained by the first 20 components of the PCA (Extended Data Fig. 9b). Unless otherwise stated, all displayed results have been performed on the 955 individuals of the cohort who gave consent to share their data publicly, in order to ensure easy reproducibility of the results.
TruCulture whole-blood stimulations
TruCulture whole-blood stimulations were performed in a standardized way as previously described4,43. Briefly, tubes were prepared in batch with the indicated stimulus, resuspended in a volume of 2 ml buffered medium, and maintained at −20 °C until time of use. Stimuli used in this study were LPS derived from E. coli O111:B4 (Invivogen), E. coli O111:B4 (Invivogen), C. albicans (Invivogen), vaccine-grade poly I:C (Invivogen), live Bacillus Calmette-Guerin (Immucyst, Sanofi Pasteur), live H1N1 attenuated influenza A/PR8 (IAV) (Charles River), SEB (Bernhard Nocht Institute), CD3 + CD28 (R&D Systems and Beckman Coulter), and cytokines TNF (Miltenyi Biotech), IL-1β (Peprotech) and IFNγ (Boehringer Ingelheim). One millilitre of whole blood was distributed into each of the prewarmed TruCulture tubes, inserted into a dry block incubator, and maintained at 37 °C room air for 22 h. At the end of the incubation period, tubes were opened, and a valve was inserted in order to separate the sedimented cells from the supernatant and to stop the stimulation reaction. Liquid supernatants were aliquoted and immediately frozen at −80 °C until the time of use.
Luminex multi-analyte profiling
Supernatants from TruCulture tubes were analysed by Rules Based Medicine using the Luminex xMAP technology. Samples were analysed according to the Clinical Laboratory Improvement Amendments (CLIA) guidelines. The lower limit of quantification (LLOQ) was determined as previously described43, and is the lowest concentration of an analyte in a sample that can be reliably detected and at which the total error meets CLIA requirements for laboratory accuracy. The 13 cytokines (CXCL5, CSF2, IFNγ, IL-1β, TNF, IL-2, IL-6, IL-8, IL-10, IL-12p70, IL-13, IL-17 and IL-23), which were measured in this study, were selected to best capture broad immune response variability. Among 109 analytes initially tested, these are the ones that captured the maximum variance following stimulation with the 4 stimuli (LPS, BCG, poly I:C and SEB) that showed the most distinct immune responses among 27 stimuli tested on a subset of 25 individuals of the Milieu Intérieur cohort.
Principal components analysis
The PCA in Extended Data Fig. 1 was created in R 4.2.1 using the FactoMineR 2.8 package. The data were log-transformed and by default scaled to unit and missing values were imputed by the mean of the variable.
Cytokine induction visualization
Cytokines were considered induced if the absolute value of their median concentration in the stimulated condition was 30%-fold of their concentration in the null condition. Standardized log mean differences were computed as follows (mean(concentration of the cytokine in the stimulated condition) − mean(concentration of the cytokine in the null condition))/s.d.((concentration of the cytokine in the stimulated condition) − (concentration of the cytokine in the null condition)) and the corresponding heat map was generated with heatmaply 1.0.0 and dendextend 1.13.12 with ‘complete’ clustering method and ‘euclidean’ distance in R version 4.2.1.
Identification of CD3 + CD28 non-responders
Levels of cytokines that we focused on are low to undetectable in the non-stimulated condition, and cytokine induction is generally homogenous within this healthy population of individuals, with no clear distinguishable groups of responders and non-responders, except for anti-CD3 + CD28 stimulation (Extended Data Fig. 2). For the anti-CD3 + CD28 stimulation, we identified through k-means clustering a group of 705 individuals that responded to the stimulation and a group of 295 individual did not. This lack of response of 295 individuals is explained by a FcγRIIA polymorphism (rs1801274) that was previously described as preventing response to this anti-CD3 + CD28 stimulation28 (Extended Data Fig. 9). All statistical analyses on anti-CD3 + CD28 stimulations in this study were thus performed on the responders only.
eCRF criteria association tests with induced cytokines
Variables were extracted from the eCRF filled by the donors with the help of a physician. To limit biases in associations, categorical variables had to have at least 5% of individuals in at least half of the categorical levels to be considered for association tests. Such categoricalvariables or numerical ones were tested for associations with the log-transformed induced cytokine levels in each stimulation through LRTs, using age, sex and the technical variable batchID (corresponding to two batches of TruCulture tubes produced at different periods of time) as covariates: the LRT compared the models lm(cytokine ~ variable + age + sex + batchID) with lm(cytokine ~ age + sex + batchID), followed by Benjamini–Yekutieli multiple testing correction applied to the whole heat maps, so taking into account the tests made for the 136 CRF variables with all the induced cytokines in a specific stimulation. For Extended Data Figs. 4 and 5, the models compared were lm(cytokine ~ age + sex + batchID) with lm(cytokine ~ sex + batchID) for age and lm(cytokine ~ sex + batchID) with lm(cytokine ~ age + batchID) for sex. P values of association tests were represented using ggplot2 3.2.1 in R 3.6.0. Adjusted P values on the box plots were computed with the wilcox.test function, correcting for multiple testing. Versions of the box plots and scatter plots made on the residuals after regression on age, sex and batchID are displayed on Extended Data Fig. 6d–f.
Effect size plots
Linear regression models were estimated in each stimulation using the log-transformed induced cytokine levels as outcome and age, sex, batchID, and the covariates of interest (for example, smoking status) as predictor variables. Interactions with the covariates of interest were considered when indicated. Exponential of the regression coefficient estimates, and their 95% confidence interval were plotted. When the covariate of interest is of categorical nature, each level of the variable is shown independently, considering the one specified as the reference. When the P value of the t-test testing if the coefficient estimate is different from zero is <0.01, it is plotted in black, otherwise it is plotted in grey. If the LRT comparing the regression with and without the variable of interest in the model with a Chi-square test is significant with a Benjamini–Yekutieli adjusted P value < 0.01, a red star is added above the effect size value and interval.
Cell subset association tests
Acquisition of flow cytometry data was detailed previously3. Association tests with log-transformed values of induced cytokines in each stimulation were performed as for the eCRF criteria association tests using log-transformed raw counts of cell subsets for each donor. P values of significance are indicated with asterisks as follows: *P < 0.05; **P < 0.01; ***P < 0.001.
DNA methylation association tests
CpG methylation profiles were generated using the Infinium MethylationEPIC BeadChip (Illumina) on genomic DNA treated with sodium bisulfite (Zymo Research) for 958 individuals of the Milieu Intérieur cohort as described19. Associations between the DNA methylation levels for the CpG sites located within 1 Mb of each cytokine gene transcription start site (TSS) and the levels of log-transformed induced cytokines in each stimulation, adjusting for age, sex, technical variable batchID and major immune cell population counts for each stimulation, were tested through LRT and identified CpG sites weakly associated with IL-17 in LPS (cg09582880), IL-2 in C. albicans (cg17850932 and cg25065535) and IL-8 in influenza (cg16468729) stimulations (FDR adjusted P value of LRT < 0.05) (Extended Data Fig. 10). These effects were mild compared with the identified associated genetic variants and the other associated variables identified in this study but are considered in the final global models (Fig. 5). CpG sites with DNA methylation levels that are directly affected by smoking have been selected as described19.
Heat maps showing effects of covariates
To test if the levels of some covariates, such as cell subsets, plasma proteins or DNA methylation probes, could modify the observed association of a variable, such as smoking status, with the log-transformed induced levels of cytokines in each stimulation, we compared with a LRT for each cytokine in each stimulation the model considering both the variable of interest and the covariate of interest (with interactions) plus the usual covariates age, sex and the technical covariate batchID, with a model containing all the covariates but not the variable of interest, followed by a Benjamini–Yekutieli multiple testing adjustment on the whole heat maps. For example, for Fig. 3a, the variable of interest was smoking status and the covariate of interest was each cell subset, so we compared lm(cytokine ~ smoking status × cell subset + age + sex + batchID) with lm(cytokine ~ cell subset + age + sex + batchID). When the LRT is significant, it means adding the variable of interest to the model improves the fit to the cytokine levels. For BMI-related variables, these do not improve the fit to both IL-2 and CXCL5 when T cell subsets are passed as covariates, showing that our approach is powered to identify cellular associations with effects on CXCL5 levels when present.
pQTL analyses
Protocols and quality-control filters for genome-wide SNP genotyping are detailed in ref. 3. In brief, the 1,000 Milieu Intérieur donors were genotyped on both the HumanOmniExpress-24 and the HumanExome-12 BeadChips (Illumina), which include 719,665 SNPs and 245,766 exonic SNPs, respectively. Average concordance rate between the two genotyping arrays was 99.99%. The final dataset included 732,341 high-quality polymorphic SNPs. After genotype imputation and quality-control filters, 11,395,554 SNPs were further filtered for minor allele frequencies > 5%, yielding a dataset composed of 1,000 donors and 5,699,237 SNPs for pQTL mapping. pQTL analyses were performed using the MatrixEQTL44 2.2 R package. SNPs were considered as cis-acting pQTLs if they were located within 1 Mb of the TSS of the gene, otherwise they were considered as trans-pQTLs. Protein expression data of the 1,000 individuals were log-transformed prior to pQTL analysis. Bonferonni correction for multiple testing (adjusted P value < 0.05) was applied to the results. We used detection thresholds of 10−3 for cis-pQTLs and 10−5 for trans-pQTLs and age, sex and the technical covariate batchID, as well as a main associated cell subset (monocytes for LPS, E. coli and C. albicans stimulations, CD4pos for SEB, CD8posEMRA for anti-CD3 + CD28, CD45pos for BCG, cDC3 for poly I:C, CD3pos for influenza, CD45pos for TNF, none for null, IL-1β and IFNγ) as covariates. SNPs associated with IFNγ in IFNγ stimulation, with IL-1β in IL-1β stimulation and with TNF in TNF stimulation were disregarded because each of these cytokines were respectively added to the TruCulture tubes and thus do not reflect endogenous secretion. To test the novelty of our pQTL results, we studied the SomaLogic plasma protein pQTL database20, for both cis- and trans-pQTLs listed in Table 1. This dataset allowed testing associations for CXCL5, IFNγ, IL-1β, IL-2, IL-6, IL-10 and IL-12a. Significant associations were identified between the variants rs352045 (cis), rs2393969 (trans), rs10822168 (trans) and the protein CXCL5 (respective FDR adjusted P = 3.02 × 10−10, P = 0.01 and P = 0.022), between rs35345753 (cis), rs62449491 (cis) and IL-6 (respective FDR adjusted P = 4.17 × 10−3 and P = 0.017) and between rs3775291 (trans) and IL-12A (FDR adjusted P = 0.049). To test associations for SNPs in linkage disequilibrium with the SNPs originally referenced in Table 1, we used a dataset of linkage disequilibrium from the ensemble database with similar ancestries as the Milieu Intérieur cohort (1000GENOMES:phase_3:CEU: Utah residents with Northern and Western European ancestry). To be inclusive, SNPs with a r2 > 0.2 were selected as associated alleles and underwent the same analysis as the one performed with the SNPs of reference. SNPs that came out as significant are those in linkage disequilibrium with the SNP referenced in Table 1 that is significantly associated with the corresponding protein. In addition, we also screened eQTL results. We compared our pQTL results with the eQTLs reported in our previous work based on nanostring transcriptomic data for common cytokines (CSF2, IFNγ, IL-1β, TNF, IL-2, IL-6, IL-8, IL-10, IL-12p70, IL-13, IL-17 and IL-23) and stimulations (E. coli, C. albicans, influenza, BCG, and SEB)4, which identified 2 main loci: the TLR1/6/10 locus and the CR1 locus. Association of variants referenced in Table 1 were found in the GTEx consortium database for rs1518110 and IL-10 (FDR adjusted P = 4.3 × 10−9), for rs352045 (cis) and CXCL5 (FDR adjusted P = 9.2 × 10−23) in whole blood and for rs143060887 (cis) and IL-12A (FDR adjusted P = 0.000076). Significant associations between rs352045 and CXCL5 and between rs1518110 and IL-10 were also found in the eQTLgen catalogue.
Computation of variance explained
For each stimulation, all the variables associated with at least one induced cytokine were considered to compute the percentage of each induced cytokine variance explained by each associated variable (q value < 0.05) with the R package relaimpo 2.2.3 and plotted with the R package ggplot2 3.2.1. The R2 contribution averaged over orderings among regressors was computed using the lmg type in the calc.relimp function of the relaimpo R package. For this analysis log-transformed induced cytokine data and log-transformed raw counts of cell subsets were used, as well as data for cis- and trans-associated SNPs and methylation probes. For each stimulation, all associated cis-pQTLs (rs352045, rs143060887, rs62449491 and rs1518110 for LPS; rs352045 for anti-CD3 + CD28, rs352045, rs62449491 and rs113845942 for poly I:C; rs352045 and rs35345753 for influenza), and trans-pQTLs (rs3764613 for LPS; rs4833095 for E. coli; rs11936050 for SEB; rs1801274 for anti-CD3 + CD28; rs4833095, rs72636686 and rs10013453 for BCG; rs10779330 and rs11117956 for C. albicans; rs3775291 and rs10822168 for poly I:C), as well as methylation probes (cg09582880 for LPS; cg25065535 for C. albicans, cg17850932 for poly I:C and cg16468729 for influenza) and a main associated cell subset (monocytes for LPS, E. coli and C. albicans stimulations, CD4pos for SEB, CD8posEMRA for anti-CD3 + CD28, CD45pos for BCG, cDC3 for poly I:C, CD3pos for influenza) were considered in the models.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
SNP array data can be accessed from the European Genome-Phenome Archive EGA under accession EGAS00001002460, the Infinium MethylationEPIC raw intensity data can be accessed via https://dataset.owey.io/doi/10.48802/owey.f83a-1042. Cytokine data are provided in Supplementary Data 1 and eCRF data are provided in Supplementary Data 2. Source data are provided with this paper.
Code availability
All the scripts related to this study are available on Github https://github.com/ViolaineSaint-Andre/ and have a tagged copy on Zenodo: https://doi.org/10.5281/zenodo.10198929.
References
Brodin, P. et al. Variation in the human immune system is largely driven by non-heritable influences. Cell 160, 37–47 (2015).
Quach, H. et al. Genetic adaptation and neandertal admixture shaped the immune system of human populations. Cell 167, 643–656.e17 (2016).
Patin, E. et al. Natural variation in the parameters of innate immune cells is preferentially driven by genetic factors. Nat. Immunol. 19, 302–314 (2018).
Piasecka, B. et al. Distinctive roles of age, sex, and genetics in shaping transcriptional variation of human immune responses to microbial challenges. Proc. Natl Acad. Sci. USA 115, E488–E497 (2018).
Bakker, O. B. et al. Integration of multi-omics data and deep phenotyping enables prediction of cytokine responses. Nat. Immunol. 19, 776–786 (2018).
Salje, H. et al. Estimating the burden of SARS-CoV-2 in France. Science 369, 208–211 (2020).
Nédélec, Y. et al. Genetic ancestry and natural selection drive population differences in immune responses to pathogens. Cell 167, 657–669.e21 (2016).
Hadjadj, J. et al. Impaired type I interferon activity and inflammatory responses in severe COVID-19 patients. Science 369, 718–724 (2020).
Brodin, P. Immune determinants of COVID-19 disease presentation and severity. Nat. Med. 27, 28–33 (2021).
Li, Y. et al. Inter-individual variability and genetic influences on cytokine responses to bacteria and fungi. Nat. Med. 22, 952–960 (2016).
Liston, A., Carr, E. J. & Linterman, M. A. Shaping variation in the human immune system. Trends Immunol. 37, 637–646 (2016).
Thomas, S. et al. The Milieu Intérieur study—an integrative approach for study of human immunological variance. Clin. Immunol. 157, 277–293 (2015).
Lugade, A. A. et al. Cigarette smoke exposure exacerbates lung inflammation and compromises immunity to bacterial infection. J. Immunol. 192, 5226–5235 (2014).
Mertens, T. C. J. et al. Cigarette smoke differentially affects IL-13-induced gene expression in human airway epithelial cells. Physiol. Rep. 5, e13347 (2017).
Butov, D., Kuzhko, M., Makeeva, N. & Butova, T. Smoking as a risk factor which activates IL-2 gene polymorphism in patients with MDR-TB. Eur. Respir. J. 46, PA4535 (2015).
Chavey, C. et al. CXC ligand 5 is an adipose-tissue derived factor that links obesity to insulin resistance. Cell Metab. 9, 339–349 (2009).
Vargas, R. et al. Increased C-reactive protein and decreased Interleukin-2 content in serum from obese individuals with or without insulin resistance: associations with leukocyte count and insulin and adiponectin content. Diabetes Metab. Syndr. Clin. Res. Rev. 10, S34–S41 (2016).
Caron, B. et al. Integrative genetic and immune cell analysis of plasma proteins in healthy donors identifies novel associations involving primary immune deficiency genes. Genome Med. 14, 28 (2022).
Bergstedt, J. et al. The immune factors driving DNA methylation variation in human blood. Nat. Commun. 13, 5895 (2022).
Sun, B. B. et al. Genomic atlas of the human plasma proteome. Nature 558, 73–79 (2018).
Ferkingstad, E. et al. Large-scale integration of the plasma proteome with genetics and disease. Nat. Genet. 53, 1712–1721 (2021).
Pietzner, M. et al. Mapping the proteo-genomic convergence of human diseases. Science 374, eabj1541 (2021).
Gudjonsson, A. et al. A genome-wide association study of serum proteins reveals shared loci with common diseases. Nat. Commun. 13, 480 (2022).
Koprulu, M. et al. Proteogenomic links to human metabolic diseases. Nat. Metab. 5, 516–528 (2023).
Li, Y. et al. A functional genomics approach to understand variation in cytokine production in humans. Cell 167, 1099–1110.e14 (2016).
Ardlie, K. G. et al. Human genomics. The genotype–tissue expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Zhang, C., Wang, W., Zhang, H., Wei, L. & Guo, S. Association of FCGR2A rs1801274 polymorphism with susceptibility to autoimmune diseases: a meta-analysis. Oncotarget 7, 39436–39443 (2016).
Stein, M. M., Hrusch, C. L., Sperling, A. I. & Ober, C. Effects of an FcγRIIA polymorphism on leukocyte gene expression and cytokine responses to anti-CD3 and anti-CD28 antibodies. Genes Immun. 20, 462–472 (2019).
Ma, L. et al. Association of toll-like receptor 3 polymorphism rs3775291 with age-related macular degeneration: a systematic review and meta-analysis. Sci. Rep. 6, 19718 (2016).
Sironi, M. et al. A common polymorphism in TLR3 confers natural resistance to HIV-1 infection. J. Immunol. 188, 818–823 (2012).
Su, D. et al. Distinct epigenetic effects of tobacco smoking in whole blood and among leukocyte subtypes. PLoS ONE 11, e0166486 (2016).
Christiansen, C. et al. Novel DNA methylation signatures of tobacco smoking with trans-ethnic effects. Clin. Epigenetics 13, 36 (2021).
Philibert, R. et al. Reversion of AHRR demethylation is a quantitative biomarker of smoking cessation. Front. Psychiatry https://doi.org/10.3389/fpsyt.2016.00055 (2016).
Bojesen, S. E., Timpson, N., Relton, C., Davey Smith, G. & Nordestgaard, B. G. AHRR (cg05575921) hypomethylation marks smoking behaviour, morbidity and mortality. Thorax 72, 646–653 (2017).
Haarmann-Stemmann, T. & Abel, J. The arylhydrocarbon receptor repressor (AhRR): structure, expression, and function. Biol. Chem. 387, 1195–1199 (2006).
Sokulsky, L. A. et al. A critical role for the CXCL3/CXCL5/CXCR2 neutrophilic chemotactic axis in the regulation of type 2 responses in a model of rhinoviral-induced asthma exacerbation. J. Immunol. 205, 2468–2478 (2020).
Ru, G. Q. et al. CEACAM6 is a prognostic biomarker and potential therapeutic target for gastric carcinoma. Oncotarget 8, 83673–83683 (2017).
Chen, J. et al. The elevated CXCL5 levels in circulation are associated with lung function decline in COPD patients and cigarette smoking-induced mouse model of COPD. Ann. Med. 51, 314–329 (2019).
Wu, C.-Y. et al. CEACAM6 as a novel therapeutic target to boost HO-1-mediated antioxidant defense in COPD. Am. J. Respir. Crit. Care Med. 207, 1576–1590 (2023).
Saint-André, V. Computational biology approaches for mapping transcriptional regulatory networks. Comput. Struct. Biotechnol. J. 19, 4884–4895 (2021).
Patterson, N., Price, A. L. & Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).
Behar, D. M. et al. The genome-wide structure of the Jewish people. Nature 466, 238–242 (2010).
Duffy, D. et al. Functional analysis via standardized whole-blood stimulation systems defines the boundaries of a healthy immune response to complex stimuli. Immunity 40, 436–450 (2014).
Shabalin, A. A. Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics 28, 1353–1358 (2012).
Acknowledgements
This programme is managed by the Agence Nationale de la Recherche. This work benefited from support of the French government’s Invest in the Future programme; reference ANR-10-LABX-69-01. A. Bertrand is a student from the FIRE PhD programme funded by the Bettencourt Schueller Foundation and the EURIP Graduate Program (ANR-17-EURE-0012).
Author information
Authors and Affiliations
Consortia
Contributions
Conceptualization: M.L.A., L.Q.-M, D.D., V.S.-A. and E.P. Methodology: V.S.-A., A. Biton and D.D. Software: V.S.-A. and A. Biton Validation: V.S.-A., A. Biton and D.D. Formal analysis: V.S.-A. and A. Biton. Investigation: V.S.-A., B.C., C.P., A. Bertrand and D.D. Resources: M.R., J.B., E.P., L.Q.-M. and D.D. Data curation: V.S.-A.,V.R., B.C., E.P. and D.D. Writing, original Draft: V.S.-A. and D.D. Writing, review and editing: V.S.-A, D.D., A. Biton, M.R., J.B., E.P. and L.Q.-M. Visualization: V.S.-A. and D.D. Supervision: D.D., L.Q.-M., V.S.-A. and E.P. Project administration: D.D. Funding acquisition: M.L.A., L.Q.-M., D.D. and the Milieu Intérieur Consortium.
Corresponding authors
Ethics declarations
Competing interests
M.L.A. is an employee of Octant Biosciences, CA, USA. D.D. previously received grant support from Rules Based Medicine. The other authors declare no competing interests.
Peer review
Peer review information
Nature thanks Sam Janes and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer review reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data figures and tables
Extended Data Fig. 1 Principal component analysis (PCA) of individuals for the 13 cytokines in the 12 stimulation conditions.
Each dot represents one individual in a specified stimulation condition. Contributions of the cytokines to the first two dimensions are represented with arrows.
Extended Data Fig. 2 Heatmaps showing cytokine concentration levels in each stimulation condition for the 1,000 individuals of the Milieu Intérieur cohort.
Hierarchical clustering is performed with Ward method and euclidean distance on both lines and columns.
Extended Data Fig. 3 Heatmaps showing -log10(BY adj. pval of likelihood ratio test) of association for the eCRF variables associated with at least one cytokine in each stimulation considering smoking status and age interactions in the compared models.
These are coloured according to the colour key on the side of each heatmap and stars are shown depending on the strength of association (*P < 0.05; **P < 0.01; ***P < 0.001).
Extended Data Fig. 4 Heatmaps showing associations of age and sex variables corrected for sex or age respectively and for batchId, on induced cytokines in each stimulation condition.
Significance of the BY adj-pval likelihood ratio test is marked with stars (*P < 0.05; **P < 0.01; ***P < 0.001).
Extended Data Fig. 5 Effect of sex and age on on induced cytokines.
Two-sided effect size plots made from n = 955 independent individuals with 0.95 confidence interval for sex (a) and age (b) corrected for age or sex respectively and batchId, on induced cytokines in each stimulation condition for the 1,000 individuals of the Milieu Intérieur cohort. Significant effect sizes are in black, others are in grey. Significance of the BY adj-pval likelihood ratio test is marked with stars (*P < 0.05; **P < 0.01; ***P < 0.001).
Extended Data Fig. 6 Smoking effect on innate and adaptive immune responses, represented by LPS and CD3 + CD28 stimulations respectively and same plots on residuals, after regression on age, sex and batchId.
a,b, CXCL5 concentration following LPS stimulation (a) and IL-2 concentration following anti-CD3 + CD28 stimulation (b) for never, past and current smokers. Boxplots represent n = 955 independent individuals. The centre line shows the median, hinges represent the 25th and 75th percentiles whiskers extend from the hinge to the largest or smallest value no further than 1.5 interquartile range. Significance of two-sided Wilcoxon rank sum tests adjusting for multiple comparisons is indicated with stars above the boxes on the left (between never and past smokers), in the middle (between never and current smokers) and on the right (between past and current smokers) (N.S, not significant; * P < 0.05, **P < 0.01, ***P < 0.001). P-values are from left to right, for CXCL5 in LPS: 0.74, 4.9e-13 and 4.7e-10 and for IL-2 in anti-CD3 + CD28: 3.8e-03, 2.4e-05 and 0.17. c,CXCL5 concentrations following LPS stimulation or IL-2 and IL-13 concentrations following anti-CD3 + CD28 stimulation versus numbers of years smoking for current smokers (red) or past smokers (blue). Grey areas depict the 0.95 confidence intervals of the linear regression lines. d–f, Residuals of CXCL5 following E. coli and LPS stimulations (d) or of IL-2 and IL-13 following SEB and anti-CD3 + CD28 stimulations (e) after regression on age, sex and batchId variables. Boxplots represent n = 955 independent individuals. The centre line shows the median, hinges represent the 25th and 75th percentiles, and whiskers extend from the hinge to the largest or smallest value no further than the 1.5 interquartile range.Significance of two-sided Wilcoxon Rank Sum tests adjusting for multiple comparisons is indicated with stars above the boxes on the left (between never and past smokers), in the middle (between never and current smokers) and on the right (between past and current smokers) (N.S, not significant; *P < 0.05, **P < 0.01, ***P < 0.001). P-values are from left to right, for CXCL5 in E.coli: 0.38, 7.1e-10 and 9.9e-07; for CXCL5 in LPS: 0.34, 7.5e-12 and 5.6e-08; for IL-2 in SEB: 1.4e-03,1.2e-07 and 4.2e-02 and for IL-2 in anti-CD3 + CD28: 0.026, 6.7e-09 and 9.8e-04. f, CXCL5 residuals following E.coli or LPS stimulation or IL-2 and IL-13 concentrations following SEB or anti-CD3 + CD28 stimulation versus the numbers of years smoking for current smokers (red) or past smokers (blue). Grey areas depict the 0.95 confidence intervals of the linear regression lines.
Extended Data Fig. 7 Effect s of CMV status on induced cytokines is modified by blood cell subsets and effect of smoking on induced cytokines is modified by specific CpGs associated with induced levels of IL2 following SEB stimulation.
a, Heatmap showing associations of the cytomegalovirus (CMV) serostatus with each induced cytokine following SEB stimulation, with either no cellular covariate (top line) or each of 76 cell subsets passed as covariates in the models. Significance of the BY adj-pval likelihood ratio test is marked with stars (*P < 0.05, **P < 0.01, ***P < 0.001). b, Heatmap showing associations of the smoking status with each induced cytokine following SEB stimulation when either no methylation probe (top line) or each individual methylation probe associated with IL-2 concentration levels in SEB stimulation is passed as a covariate in the models. Significance of the BY adj-pval likelihood ratio test is marked with stars (*P < 0.05, **P < 0.01, ***P < 0.001).
Extended Data Fig. 8 Smoking status and SNPs interactions.
a, Heatmaps showing -log10(BY adj.pval of likelihood ratio test) of interactions between genetic variants listed in Table 1 and smoking status for each induced cytokine following BCG stimulation considering age, sex and batchId as covariates. These are colored according to the color key on the side of the heatmap and stars are shown depending on the strength of association.*P < 0.05, **P <0.01, ***P < 0.001. b, Two-sided effect size plot made representing n = 955 independent individuals with 0.95 confidence interval for the interaction of the SNP rs72636686 with the smoking status, corrected for age, sex and batchId, on induced cytokines following BCG stimulation. Significant effect sizes (p-val < 0.01) are in black, others are in grey. Those that also have BY adj.pval of the likelihood ratio test <0.01 are labelled with a red star. c, Boxplot representing n = 955 independent individuals for IL8 levels depending on the genotype for rs72636686 and the smoking status. The centre line shows the median, hinges represent the 25th and 75th percentiles, and whiskers extend from the hinge to the largest or smallest value no further than the 1.5 interquartile range. Significance of two-sided Wilcoxon rank sum tests adjusting for multiple comparisons are indicated above the compared data.
Extended Data Fig. 9 IL-2 protein levels depending on FcγRIIA polymorphism rs1801274 and genetic PCA showing homogeneity of the genetic background of Milieu Intérieur individuals.
a, Boxplots representing n = 1,000 independent individuals for IL-2 concentration levels depending on the genotype of the donors for the rs1801274 genetic variant before and after K-means clustering (2 groups) selection of responders. The centre line shows the median, hinges represent the 25th and 75th percentiles, and whiskers extend from the hinge to the largest or smallest value no further than the 1.5 interquartile range. Significance of two-sided Wilcoxon rank sum tests adjusting for multiple comparisons is indicated above the boxes (N.S, not significant; *P < 0.05, **P < 0.01, ***P < 0.001). P-values are from left to right on the top boxplot: <2.22e-16, <2.22e-16 and <2.22e-16 and on the bottom boxplot: <2.22e-16, 0.35 and 0.15. b, PCA performed on 261,827 independent SNPs and 1,723 individuals, which include the 1,000 MI donnors together with 723 individuals from a selection of 36 populations from North Africa, the Near East, as well as western and northern Europe. PC1 versus PC2 (top left), PC1 versus PC3 (bottom left) and PC2 versus PC3 (top right) are displayed as well as a barplot (bottom right) of the variance explained by the first 20 components of the PCA.
Extended Data Fig. 10 Association of DNA methylation probes with their cognate cytokines.
a, DNA methylation probes located within 1 Mb of the TSS of each cytokine that show associations (Benjamini-Hochberg false discovery rate adj-pval of likelihood ratio test <0.05) with the concentration levels of these cytokines in the specified stimulation condition (*P < 0.05). b, Plots showing the corresponding concentration levels depending on the methylation levels of the probes. Adj-r2 values of the linear regressions and p-statistics of two-sided Pearson correlations are reported on each graph.
Supplementary information
Supplementary Information
Graphical abstract
Supplementary Table 1
Proteomic data: Luminex data for the concentration levels of 13 cytokines in 12 stimulation conditions for the donors of the Milieu Intérieur cohort that gave consent to make their data public.
Supplementary Table 2
eCRF data
Supplementary Table 3
pQTL summary statistics
Supplementary Table 4
Test of independence of rs3775291 and rs28393318 for IL-1β pQTL in poly I:C stimulation
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Saint-André, V., Charbit, B., Biton, A. et al. Smoking changes adaptive immunity with persistent effects. Nature 626, 827–835 (2024). https://doi.org/10.1038/s41586-023-06968-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41586-023-06968-8
This article is cited by
-
Transcriptomic analysis reveals distinct effects of cigarette smoke on murine airspace and bone-marrow derived macrophages
Respiratory Research (2024)
-
Smoking’s lasting effect on the immune system
Nature (2024)
-
Smoking scars the immune system for years after quitting
Nature (2024)