Article
Open access
Published: 04 April 2024

Reassessing human MHC-I genetic diversity in T cell studies

Roderick C. Slieker^1,2,
Daniël O. Warmerdam³,
Maarten H. Vermeer⁴,
Remco van Doorn^4,5,
Mirjam H. M. Heemskerk⁶ &
…
Ferenc A. Scheeren⁴

Scientific Reports volume 14, Article number: 7966 (2024) Cite this article

1501 Accesses
3 Altmetric
Metrics details

Subjects

Abstract

The Major Histocompatibility Complex class I (MHC-I) system plays a vital role in immune responses by presenting antigens to T cells. Allele specific technologies, including recombinant MHC-I technologies, have been extensively used in T cell analyses for COVID-19 patients and are currently used in the development of immunotherapies for cancer. However, the immense diversity of MHC-I alleles presents challenges. The genetic diversity serves as the foundation of personalized medicine, yet it also poses a potential risk of exacerbating healthcare disparities based on MHC-I alleles. To assess potential biases, we analysed (pre)clinical publications focusing on COVID-19 studies and T cell receptor (TCR)-based clinical trials. Our findings reveal an underrepresentation of MHC-I alleles associated with Asian, Australian, and African descent. Ensuring diverse representation is vital for advancing personalized medicine and global healthcare equity, transcending genetic diversity. Addressing this disparity is essential to unlock the full potential of T cells for enhancing diagnosis and treatment across all individuals.

Investigating the genetic makeup of the major histocompatibility complex (MHC) in the United Arab Emirates population through next-generation sequencing

Article Open access 09 February 2024

Systematic identification of minor histocompatibility antigens predicts outcomes of allogeneic hematopoietic cell transplantation

Article 21 August 2024

Interpretable GWAS by linking clinical phenotypes to quantifiable immune repertoire components

Article Open access 20 October 2024

Introduction

The MHC-I system is a family of proteins expressed on the surface of cells and is involved in the recognition and presentation of peptides to the immune system. It consists of a polymorphic heavy chain, a constant light chain called Beta-2 Microglobulin (β2M) and an 8–13 amino-acid peptide ligand^1,2. The peptide binding groove of MHC-I heavy chain accommodates these peptides, and the properties of the pockets within the groove are important for peptide presentation. MHC-I comprises three major Human Leukocyte Antigen (HLA) families: HLA-A, HLA-B, and HLA-C, each consisting of numerous alleles. Among these, HLA-A and HLA-B exhibit the greatest diversity, while HLA-C shows less variation³. HLA-C is associated with multiple additional receptors, such as Killer Immunoglobulin-like Receptors (KIRs) that can be expressed on T cells, adding complexity to the system and complicating diagnostics of HLA-C restricted T cells⁴. Consequently, these complexities contribute to the tendency to overlook HLA-C in research studies. The diversity in the MHC-I heavy chain directly influences peptide presentation by altering the properties of the pockets in the peptide binding groove. This genetic polymorphism results in changes in the size, shape, and electrostatic properties of the pockets, which in turn affect the binding affinity and specificity of the MHC-I molecule for different peptides, thereby directly influencing the peptide repertoire. Although peptides may have overlap between similar MHC-I heavy chains, each allele has an unique repertoire of peptides⁵. Hence, understanding that while the overall structure of MHC-I remains largely consistent, the specific composition of alleles and variations in anchor residues give rise to unique peptide-binding specificities within each population, resulting in exceptionally high sequence diversity. This leads to an extensive array of alleles, totalling over 37,000 variants, some of which are exclusive to particular ancestral populations^6,7,8,9. This diversity in MHC-I alleles enhances the population's ability to mount effective immune responses against a given pathogen by increasing an individual’s chance of eliciting a suitable immune defence. Thus, MHC-I diversity helps to protect against pandemics. For example, the HLA-B*15:01 allele is more prevalent in Southeast Asian populations compared to European populations, highlighting the geographic variability in HLA alleles⁸. However, this diversity also represents a challenge in the biomedical domain due to its potential to reinforce existing disparities, potentially leading to unequal healthcare based on an individual’s MHC-I alleles.

The relevance of MHC-I alleles in diagnostics and immunotherapy has surged in recent years, offering potential applications in disease diagnosis and treatment^{10,11,12,13,14,15,16}. The groundwork for this technology was laid in 1996 with the introduction of recombinant MHC-I technology, enabling the visualization of antigen-specific cells¹⁷. This methodology involves the synthesis of MHC-I molecules via synthetic DNA sequences in a laboratory setting. These recombinant soluble MHC-I monomers, complexed with specific peptides, are then multimerized and fluorescently labelled, commonly referred to as tetramers or multimers. These multimerized peptide MHC-I complexes play a crucial role in immune response monitoring by facilitating the specific binding of antigen-specific T cells, allowing the visualization, and tracking of T cell responses over time. They have applications in diagnostics, aiding in the identification of antigen-specific T cells and differentiating between vaccination and natural infection based on pathogen protein coverage. Recent studies have extensively investigated T cells in COVID-19 patients^18,19,20. Furthermore, recombinant MHC-I technology has significantly impacted cancer immunotherapy. This technology leverages the fundamental link between TCR and peptide-HLA complexes, enabling the precise targeting of cancer cells. Customized recombinant peptide MHC-I complexes help identify specific cancer antigens, paving the way for personalized treatment strategies. Additionally, they play a pivotal role in TCR-based therapies by facilitating the precise targeting of cancer cells by engineered T cells. Clinical trials utilizing HLA-restricted TCRs are currently underway, either by introducing TCRs into patient T cells or employing recombinant TCR fusion proteins fused to anti-CD3, resulting in bispecific T cell engagers^{10,11,12,13,14}. These therapies are designed to target specific immune responses mediated by predetermined HLA alleles. Moreover, recombinant MHC-I technology serves as a peptide-specific platform for inducing T cell proliferation in an antigen-specific manner^21,22. In summary, the versatility and effectiveness of recombinant MHC-I technology position it as a cornerstone in the ongoing battle against cancer through immunotherapy.

Given the extensive polymorphism among HLA genes and their connections to population genetics, we set out to investigate whether there are biases in the HLA alleles studied in medical research. Our analysis focused on the utilization of MHC-I alleles in both clinical and preclinical publications, particularly those related to COVID-19 and clinical trials focused on TCR-based immunotherapies. Our findings reveal a notable underrepresentation of alleles found in people from Asian, Australian, and African descent, suggesting a widespread allele bias in medical research and clinical therapeutic development.

Results

This study conducted a comprehensive search for articles published between August 2020 and April 2023, focusing on T-cell research related to Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) and Human Leukocyte Antigen Class-I (HLA-I), representing the human MHC-I, aiming to assess the breadth of HLA alleles studied in medical research. SARS-CoV-2 was chosen as a model infection due to the extensive utilization of MHC-I technology in research on the immune response to the virus and its global impact, affecting all continents and countries. Out of 615 articles identified, 74 were included, focused on allele-specific analyses, in specific MHC-I technology. In the 74 studies we considered, individual epitopes, the specific portion of the peptide sequence that is recognized and bound by the MHC-I molecule, against COVID-19 were determined by using mono-allelic MHC-I multimer qualitative binding (Table S1, Fig. S1). A total of 22 unique MHC-I alleles were used with epitopes of SARS-CoV-2. MHC-I allele frequencies have a widespread variation across human populations, however the geography on the distribution of MHC-I alleles resembles geographical location of populations²³. For this reason, we analysed the frequencies of the MHC-I allele usage in literature in relation to the frequency of occurrence in a specific continental or intra-continental group. The following continental and intra-continental groups were included in our analysis: South Asia, North-East Asia, South-East Asia, Sub-Saharan Africa, Oceania, North Africa, Western Asia, South and Central America, North America, and Europe.

The HLA-A*02:01 allele was included in the majority of the studies, i.e. 55 of the 74 studies included for analysis (74.3%), followed by HLA-A*24:02 (N = 31, 41.9%), HLA-A*01:01 (N = 24, 32.4%), HLA-B*07:02 (N = 24, 32.4%), HLA-A*03:01 (N = 21, 28.4%, Fig. 1a). For studied alleles, there was a difference in frequency across geographical populations (Fig. 1b). Indeed, there was a strong positive correlation between the frequency of alleles used across studies and the allele frequency in Europe (r = 0.70, P = 2.7·10⁻⁴), North America (r = 0.59, P = 3.9·10^–3) and South and Central America (r = 0.62, P = 2.0·10⁻³) (Fig. 1c). We observed a weak correlation or absent correlation for North Africa (r = 0.38, P = 0.08), Western Asia (r = 0.31, P = 0.16), North-East Asia (r = 0.42, P = 0.05), Australia (r = 0.39, P = 0.07), Sub-Saharan Africa (r = 0.02, P = 0.92), South-East Asia (r = 0.31, P = 0.16), South Asia (r = 0.20, P = 0.38) and Oceania (r = 0.34, P = 0.12). Overall, the HLA-A alleles were the most studied alleles (Fig. 2a) and showed the best coverage for Europe (69.8%), while the lowest coverage for Sub-Sharan Africa (36.1%). For the HLA-B alleles, a similar pattern was observed with a high frequency in Europe but low in Sub-Sharan Africa. The B-alleles particularly showed a low coverage in North America (Fig. 2b), but this was mainly driven by the low frequency in Mexico (5.1%) versus USA-based studies (41.2%, Fig. 2b). HLA-C alleles were not extensively researched, accounting for only 4% of the studies, despite their relative high population coverage across continents (Fig. 2c).

Our analysis was repeated based on an independent dataset obtained from a systematic review of T-cell epitopes defined from the proteome of SARS-CoV-2, describing 1349 MHC-I epitopes²⁰. This validation in a second COVID-19 dataset obtained from this systematic review of T-cell epitopes is essential to ensure the robustness and generalizability of the initial analysis and to minimize the risk of bias. The acquired alleles were not restricted to MHC-I technology, and this systematic review provides a description and explanation of the diverse range of technologies utilized. This analysis gave a similar distribution of investigated alleles was observed, with most studies including HLA-A*02:01 (N = 34, 79.1%), followed by HLA-A*24:02 (N = 16, 37.2%) and HLA-A*01:01 (N = 14, 32.6%). Again, the highest correlations were observed for Europe followed by the Americas (Fig. S3).

Next, we analysed the clinical trials that make use of TCR-based immunotherapy. TCRs can only recognize and bind to a specific peptide presented by a particular MHC-I allele. Therefore, TCR-based immunotherapy treatments are designed to target specific peptides presented by one predetermined MHC allele^24,25. For this reason, TCR-based immunotherapies are HLA-restricted therapies. By examining these clinical trials, we can assess for which specific HLA alleles therapies are designed. This analysis can help to guide future research and clinical development efforts towards more personalized and effective treatments for patients with specific genetic backgrounds. Using the clinicaltrials.gov website we found 126 studies in which TCR transfer was clinically used (N = 118) or recombinant TCR fusion proteins were used (N = 8). The latter are all HLA-A2 restricted. The allele coverage of these TCRs showed clear over-representation of HLA-A2 in these clinical trials (Fig. 3). Seven studies mention that personalized TCRs will be developed, however the coverage of alleles was not mentioned and thus it was difficult to determine how large the allele diversity will be in these clinical studies. The focus on HLA-A*2, and in specific A*02:01, means that a large population is excluded in these clinical trials. For example, within the American population, people with an African American or Asian genetic descent will have an almost 50% lower chance to enrol in these TCR-based immunotherapy trials^26,27.

Discussion

These results demonstrate that the preclinical and clinical analyses of antigen-specific T cell diagnostics and the clinical development of HLA-I restricted therapies, such as TCR-based immunotherapies, show an underrepresentation of people with an Asian, African, Australian, and Oceanian descent. We provide data for both the COVID-19 outbreak and TCR-based therapies. However, we believe that the underrepresentation of specific HLA alleles is not confined to only these two fields; rather, they serve as examples representing the broader scope of the field. Within the clinical setting there is a strong bias towards the use of HLA-A*2. This raises concerns about the inclusivity and generalizability of findings within the preclinical and clinical analyses of antigen-specific T cell research and diagnostics that rely on MHC-I technologies. Conversely, populations with European and American (North, South, and Central) descent exhibit robust representation in these T cell-focused investigations and clinical trials.

This lack of MHC-I allele diversity within T cell research and clinical setting is multifaceted, rooted in historical, methodological, and systematic factors. Historically, since the HLA-A02:01 allele is present in 50% of the Caucasian population, the initial analyses of T cell responses have been focused on the HLA-A02:01 allele^17,28. Early reagents, such as HLA-I restricted T cell clones and later recombinant HLA-A*02:01 were focused on the European population resulting in a skewed research field. Additionally, peptide affinity predictions for specific alleles rely on data obtained through wet lab experiments, such as mass spectrometry based ligandome²⁹. The accuracy of these predictions improves with the availability of more data. Less characterized alleles have a poorer performance for the in-silico peptide predictions³⁰. As a result, researchers gravitate toward well-characterized alleles, reinforcing a feedback loop that perpetuates the imbalance. Increased availability of diverse recombinant MHC-I alleles in combination with high-quality peptide databases needed for high-accuracy in silico predictions would enhance the diversity in biomedical research needed for T cell analyses in a diverse population. Additionally novel technologies allowing HLA-unbiased TCR are also developed and very important in ensuring diverse HLA representation^31,32.

The ramification of this underrepresentation involves critical aspects. Firstly, the COVID-19 vaccine and clinical trial landscapes exhibited overrepresentation of white non-Hispanic participants, mirroring trends in cancer immunotherapy research^33,34. Given the influence of MHC-I allele diversity on disease outcome of pathogens such as SARS-CoV-2, vaccine responses, and effectiveness, the bias hampers generalizability and obstructs insights into diverse population responses to interventions^35,36,37,38. In the context of COVID-19, variations in HLA genes influence an individual's susceptibility to the virus, severity of the disease, and response to treatments or vaccines^35,39,40,41. Therefore, investigating the impact of HLA on patients with COVID-19 is essential for both clinical management and public health strategies.

Secondly, while pivotal scientific breakthroughs hold importance, the next step entails integrating human genetic diversity into research paradigms. Within the field of genome editing, genetic data from people with a diverse ancestry is essential to determine the CRISPR off-targets, and thereby assess safety and efficacy⁴². Originally the human genome project consisted of 70% of only one person with a blended ancestry. The remaining 30% came from 19 individuals of European ancestry⁴³, resulting in a very limited genomic diversity. Although currently the vast majority of genomics studies have been conducted in individuals of European descent, the human genome field is now taking the lead by rapidly increasing the number of reference genomes from individuals with diverse ancestry^44,45. This inclusion of genomic diversity is not only essential for genome editing but ensures that the benefits of genomic medicine are accessible to all. Just as the inclusion of diverse genetic ancestry is pivotal in genome editing, genomic medicine's broad applicability hinges on embracing genetic diversity. Incorporating genetic variability needs to become a cornerstone in the progress of immune diagnostics and immunotherapy, mirroring the trajectory of genomic medicine.

The significance of MHC-I alleles in COVID-19 outcomes underscores the necessity of comprehensive representation in COVID-19 T-cell studies, compelling the inclusion of individuals with underrepresented genetic makeup. Similarly, the overrepresentation of HLA-A*2 in TCR-based immunotherapies should intensify the urgency for equitable representation. The over-representation of the HLA-A*2 allele impedes the versatility of TCR application and reveals a skewed MHC-I allele representation in therapeutic contexts. Bridging this gap requires increased awareness and strategic funding. An example of such endeavour is the Cancer Grand Challenge of 2023 (https://cancergrandchallenges.org/challenges), which addresses disparities in cancer research across diverse populations.

Beyond the scientific field, the implications of this underrepresentation extend to inequities in therapy development and healthcare availability, demonstrating the necessity to engage diverse communities in biomedical science. The ethical goal to include MHC-I genetic diversity aligns with the scientific goal, as the biological associations of specific MHC-I alleles underscore the complexity that demands comprehensive understanding. The absence of equal representation poses formidable barriers to advancing T cell-based therapies in the era of personalized medicine.

Material and methods

Article inclusion

Articles published in peer-reviewed journals before April 2023 were included. Articles were identified using the following search term:

(“SARS-CoV-2” OR “COVID-19”) AND “T-cell” AND (“tetramer” OR “multimer”) AND “HLA-A”.

We used specifically Google Scholar given that the use of tetramers is often not described in the abstract. Indeed, the search term above yielded only 6 results in PubMed versus 615 articles on Scholar. Out of the 615 articles, 74 were suitable for inclusion, given that 7 were non-English, 323 did not report on COVID-19 but only mentioned it in the text, 177 articles were different types of articles including reviews and opinions and 34 were on a COVID-19-related topic (Fig. S1, Table S1). Regarding the latter, we focused on allele-specific analyses, and we did not include experiments, in which there was no active discrimination between the 6 different MHC-I alleles expressed in one person. Clinical trials were obtained from the website https://clinicaltrials.gov/. Trials were identified using the following search term: “TCR-T cell” AND “TCR therapy” AND “TCR-CD3 therapy” AND “TCR”. We selected only TCR transfer trials and trials that used a recombinant TCR fusion protein, such as Tebentafusp. Studies were included also when they only reported the antigen, for example A*02.

Allele frequencies in different populations

Allele frequencies in different geographic locations were obtained from the Allele Frequency Net Database (http://www.allelefrequencies.net/, access date July 2023⁸). Country of each study was assigned to regions as defined by the Allele Frequency Net Database (https://www.allelefrequencies.net/datasets.asp).

Replication set

The analysis was repeated in an independent dataset obtained from a systemic review of T-cell epitopes defined from the proteome of SARS-CoV-2, describing 1349 MHC-I epitopes²⁰. We only included epitopes that were predicted for one specific allele and excluded epitopes for potentially two or more different MHC-I alleles. These epitopes were functionality tested using the following assays: ELISA, HTMA, multimer staining, cytotoxicity, AIM, ICS, ELISPOT and, or proliferation. Alleles investigated in each study included in the systematic review were extracted and frequencies of alleles across studies determined.

Statistical analysis

Total coverage of a geographical region was calculated as described previously⁴⁶. For each study on The Allele Frequency Net Database, we calculated the sum of all identified MHC-I class allele frequencies. When the sum of the allele frequency exceeded one, observed allele frequencies were scaled based on the sum value. When below one, it was assumed that there was an additional unmeasured allele. Next, alleles that were included in articles were summed to get a measure of the coverage of a population by the current studies on COVID-19. Median allele frequencies were calculated for each region, by taking the median allele frequency of an allele across studies. Median allele frequencies for each country are given in Table S2. The median allele frequency per region was plotted against the fraction of studies that studied a specific allele. Correlations between study frequency and allele frequency were determined based on Pearson correlation and a P-value below 0.05 was considered significant. Figures were produced using R4.3.0 in combination with ggplot2 (v3.4.3) and patchwork (v1.1.3). Geographical maps were produced with ggplot2 using the map data function; https://ggplot2.tidyverse.org/reference/map_data.html

Data availability

All data generated or analysed during this study are included in this published article and its supplementary information files.

Code availability

All R code used in the current study is available from GitHub: https://github.com/roderickslieker/HLA_Disparity

References

Abelin, J. G. et al. Mass spectrometry profiling of HLA-associated peptidomes in mono-allelic cells enables more accurate epitope prediction. Immunity 46, 315–326 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bouvier, M. & Wiley, D. C. Importance of peptide amino and carboxyl termini to the stability of MHC class I molecules. Science 265, 398–402 (1994).
Article ADS CAS PubMed Google Scholar
Robinson, J. et al. The IPD and IMGT/HLA database: Allele variant databases. Nucl. Acids Res. 43, D423-431 (2015).
Article CAS PubMed Google Scholar
Parham, P. MHC class I molecules and KIRs in human history, health and survival. Nat. Rev. Immunol. 5, 201–214 (2005).
Article CAS PubMed Google Scholar
Pearson, H. et al. MHC class I-associated peptides derive from selective regions of the human genome. J. Clin. Investig. 126, 4690–4701 (2016).
Article PubMed PubMed Central Google Scholar
Gourraud, P. A. et al. HLA diversity in the 1000 genomes dataset. Plos One 9, e97282 (2014).
Article ADS PubMed PubMed Central Google Scholar
Robinson, J. et al. IPD-IMGT/HLA database. Nucl. Acids Res. 48, D948–D955 (2020).
CAS PubMed Google Scholar
Gonzalez-Galarza, F. F. et al. Allele frequency net database (AFND) 2020 update: Gold-standard data classification, open access genotype data and new query tools. Nucl. Acids Res. 48, D783–D788 (2020).
CAS PubMed Google Scholar
Horton, R. et al. Gene map of the extended human MHC. Nat. Rev. Genet. 5, 889–899 (2004).
Article CAS PubMed Google Scholar
Nathan, P. et al. Overall survival benefit with Tebentafusp in metastatic uveal melanoma. N. Engl. J. Med. 385, 1196–1206 (2021).
Article CAS PubMed Google Scholar
Tran, E. et al. T-cell transfer therapy targeting mutant KRAS in cancer. N. Engl. J. Med. 375, 2255–2262 (2016).
Article CAS PubMed PubMed Central Google Scholar
Jahn, L. et al. TCR-based therapy for multiple myeloma and other B-cell malignancies targeting intracellular transcription factor BOB1. Blood 129, 1284–1295 (2017).
Article CAS PubMed Google Scholar
Rapoport, A. P. et al. NY-ESO-1-specific TCR-engineered T cells mediate sustained antigen-specific antitumor effects in myeloma. Nat. Med. 21, 914–921 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hong, D. S. et al. Autologous T cell therapy for MAGE-A4(+) solid cancers in HLA-A*02(+) patients: A phase 1 trial. Nat. Med. 29, 104–114 (2023).
Article CAS PubMed PubMed Central Google Scholar
Hadrup, S. R. et al. Parallel detection of antigen-specific T-cell responses by multidimensional encoding of MHC multimers. Nat. Methods 6, 520–526 (2009).
Article CAS PubMed Google Scholar
Gangaev, A. et al. Identification and characterization of a SARS-CoV-2 specific CD8(+) T cell response with immunodominant features. Nat. Commun. 12, 2593 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Altman, J. D. et al. Phenotypic analysis of antigen-specific T lymphocytes. Science 274, 94–96 (1996).
Article ADS CAS PubMed Google Scholar
Quadeer, A. A., Ahmed, S. F. & McKay, M. R. Landscape of epitopes targeted by T cells in 852 individuals recovered from COVID-19: Meta-analysis, immunoprevalence, and web platform. Cell Rep. Med. 2, 100312 (2021).
Article CAS PubMed PubMed Central Google Scholar
Saini, S. K. et al. SARS-CoV-2 genome-wide T cell epitope mapping reveals immunodominance and substantial CD8(+) T cell activation in COVID-19 patients. Sci. Immunol. https://doi.org/10.1126/sciimmunol.abf7550 (2021).
Article PubMed PubMed Central Google Scholar
Jin, X., Liu, X. & Shen, C. A systemic review of T-cell epitopes defined from the proteome of SARS-CoV-2. Virus Res. 324, 199024 (2023).
Article CAS PubMed Google Scholar
Tvingsholm, S. A. et al. TCR-engaging scaffolds selectively expand antigen-specific T-cells with a favorable phenotype for adoptive cell therapy. J. Immunother. Cancer 11, e006847 (2023).
Article PubMed PubMed Central Google Scholar
Weiss, L. et al. Direct in vivo activation of T cells with nanosized immunofilaments inhibits tumor growth and metastasis. ACS Nano 17, 12101–12117 (2023).
Article CAS PubMed PubMed Central Google Scholar
Arrieta-Bolanos, E., Hernandez-Zaragoza, D. I. & Barquera, R. An HLA map of the world: A comparison of HLA frequencies in 200 worldwide populations reveals diverse patterns for class I and class II. Front. Genet. 14, 866407 (2023).
Article CAS PubMed PubMed Central Google Scholar
Morgan, R. A. et al. Cancer regression in patients after transfer of genetically engineered lymphocytes. Science 314, 126–129 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Oliveira, G. & Wu, C. J. Dynamics and specificities of T cells in cancer immunotherapy. Nat. Rev. Cancer 23, 295–316 (2023).
Article CAS PubMed PubMed Central Google Scholar
Ellis, J. M. et al. Frequencies of HLA-A2 alleles in five U.S. population groups. Predominance of A*02011 and identification of HLA-A*0231. Hum. Immunol. 61, 334–340 (2000).
Article CAS PubMed Google Scholar
Cao, K. et al. Analysis of the frequencies of HLA-A, B, and C alleles and haplotypes in the five major ethnic groups of the United States reveals high levels of diversity in these loci and contrasting distribution patterns in these populations. Hum. Immunol. 62, 1009–1030 (2001).
Article CAS PubMed Google Scholar
Spits, H., Breuning, M., Ivanyi, P., Russo, C. & de Vries, J. E. In vitro-isolated human cytotoxic T-lymphocyte clones detect variations in serologically defined HLA antigens. Immunogenetics 16, 503–512 (1982).
Article CAS PubMed Google Scholar
Sarkizova, S. et al. A large peptidome dataset improves HLA class I epitope prediction across most of the human population. Nat. Biotechnol. 38, 199–209 (2020).
Article CAS PubMed Google Scholar
Luo, Y. et al. A high-resolution HLA reference panel capturing global population diversity enables multi-ancestry fine-mapping in HIV host response. Nat. Genet. 53, 1504–1516 (2021).
Article CAS PubMed PubMed Central Google Scholar
Cattaneo, C. M. et al. Identification of patient-specific CD4(+) and CD8(+) T cell neoantigens through HLA-unbiased genetic screens. Nat. Biotechnol. 41, 783–787 (2023).
Article CAS PubMed PubMed Central Google Scholar
O’Brien, H. et al. Breaking the performance ceiling for neoantigen immunogenicity prediction. Nat. Cancer 4, 1618–1621 (2023).
Article CAS PubMed Google Scholar
Hamel, L. M. et al. Barriers to clinical trial enrollment in racial and ethnic minority patients with cancer. Cancer Control 23, 327–337 (2016).
Article PubMed Google Scholar
Khalil, L. et al. Racial and ethnic diversity in SARS-CoV-2 vaccine clinical trials conducted in the United States. Vaccines (Basel) 10, 290 (2022).
Article CAS PubMed Google Scholar
Augusto, D. G. et al. A common allele of HLA is associated with asymptomatic SARS-CoV-2 infection. Nature 620, 128–136 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Hovhannisyan, A. et al. HLA-C*04:01 affects HLA class I heterozygosity and predicted affinity to SARS-CoV-2 peptides, and in combination with age and sex of armenian patients contributes to COVID-19 severity. Front. Immunol. 13, 769900 (2022).
Article CAS PubMed PubMed Central Google Scholar
Langton, D. J. et al. The influence of HLA genotype on the severity of COVID-19 infection. HLA 98, 14–22 (2021).
Article CAS PubMed PubMed Central Google Scholar
Genetic Analysis of Psoriasis C et al. A genome-wide association study identifies new psoriasis susceptibility loci and an interaction between HLA-C and ERAP1. Nat. Genet. 42, 985–990 (2010).
Article Google Scholar
Chen, Y. M. et al. Epidemiological and genetic correlates of severe acute respiratory syndrome coronavirus infection in the hospital with the highest nosocomial infection rate in Taiwan in 2003. J. Clin. Microbiol. 44, 359–365 (2006).
Article PubMed PubMed Central Google Scholar
Ng, M. H. et al. Association of human-leukocyte-antigen class I (B*0703) and class II (DRB1*0301) genotypes with susceptibility and resistance to the development of severe acute respiratory syndrome. J. Infect. Dis. 190, 515–518 (2004).
Article CAS PubMed Google Scholar
Correale, P. et al. HLA-B*44 and C*01 prevalence correlates with Covid19 spreading across Italy. Int. J. Mol. Sci. 21, 5205 (2020).
Article CAS PubMed PubMed Central Google Scholar
Cancellieri, S. et al. Human genetic diversity alters off-target outcomes of therapeutic gene editing. Nat. Genet. 55, 34–43 (2023).
Article CAS PubMed Google Scholar
Lander, E. S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
Article ADS CAS PubMed Google Scholar
Liao, W. W. et al. A draft human pangenome reference. Nature 617, 312–324 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Fatumo, S. et al. A roadmap to increase diversity in genomic studies. Nat. Med. 28, 243 (2022).
Article CAS PubMed PubMed Central Google Scholar
Bui, H.-H. et al. Predicting population coverage of T-cell epitope-based diagnostics and vaccines. BMC bioinformatics 7, 153 (2006).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to thank A. van Duijn and N.F.C.C. de Miranda, S.B. Coffelt and L.T. Morton for helpful discussion and critical input.

Author information

Authors and Affiliations

Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, The Netherlands
Roderick C. Slieker
Leiden Center for Computational Oncology, Leiden University Medical Center, Leiden, The Netherlands
Roderick C. Slieker
Centre for Future Affordable & Sustainable Therapy Development (FAST), The Hague, The Netherlands
Daniël O. Warmerdam
Department of Dermatology, Leiden University Medical Center, Leiden, The Netherlands
Maarten H. Vermeer, Remco van Doorn & Ferenc A. Scheeren
Department of Dermatology, Netherlands Cancer Institute, Amsterdam, the Netherlands
Remco van Doorn
Department of Hematology, Leiden University Medical Center, Leiden, The Netherlands
Mirjam H. M. Heemskerk

Authors

Roderick C. Slieker
View author publications
You can also search for this author in PubMed Google Scholar
Daniël O. Warmerdam
View author publications
You can also search for this author in PubMed Google Scholar
Maarten H. Vermeer
View author publications
You can also search for this author in PubMed Google Scholar
Remco van Doorn
View author publications
You can also search for this author in PubMed Google Scholar
Mirjam H. M. Heemskerk
View author publications
You can also search for this author in PubMed Google Scholar
Ferenc A. Scheeren
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.C.S and F.A.S. collected and analysed the data. R.C.S performed the statistical analyses. All authors were involved in the critical input and review of the manuscript. All authors read and approved the manuscript.

Corresponding author

Correspondence to Ferenc A. Scheeren.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Supplementary Table S1.

Supplementary Table S2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Slieker, R.C., Warmerdam, D.O., Vermeer, M.H. et al. Reassessing human MHC-I genetic diversity in T cell studies. Sci Rep 14, 7966 (2024). https://doi.org/10.1038/s41598-024-58777-2

Download citation

Received: 01 November 2023
Accepted: 03 April 2024
Published: 04 April 2024
DOI: https://doi.org/10.1038/s41598-024-58777-2

Reassessing human MHC-I genetic diversity in T cell studies

Subjects

Abstract

Similar content being viewed by others

Investigating the genetic makeup of the major histocompatibility complex (MHC) in the United Arab Emirates population through next-generation sequencing

Systematic identification of minor histocompatibility antigens predicts outcomes of allogeneic hematopoietic cell transplantation

Interpretable GWAS by linking clinical phenotypes to quantifiable immune repertoire components

Introduction

Results

Discussion

Material and methods

Article inclusion

Allele frequencies in different populations

Replication set

Statistical analysis

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Figures.

Supplementary Table S1.

Supplementary Table S2.

Rights and permissions

About this article

Cite this article

Search

Quick links

Subjects

Abstract

Similar content being viewed by others

Investigating the genetic makeup of the major histocompatibility complex (MHC) in the United Arab Emirates population through next-generation sequencing

Systematic identification of minor histocompatibility antigens predicts outcomes of allogeneic hematopoietic cell transplantation

Interpretable GWAS by linking clinical phenotypes to quantifiable immune repertoire components

Introduction

Results

Discussion

Material and methods

Article inclusion

Allele frequencies in different populations

Replication set

Statistical analysis

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Figures.

Supplementary Table S1.

Supplementary Table S2.

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links