High-throughput discovery of MHC class I- and II-restricted T cell epitopes using synthetic cellular circuits

Kohlgruber, Ayano C.; Dezfulian, Mohammad H.; Sie, Brandon M.; Wang, Charlotte I.; Kula, Tomasz; Laserson, Uri; Larman, H. Benjamin; Elledge, Stephen J.

doi:10.1038/s41587-024-02248-6

Article
Open access
Published: 02 July 2024

Abstract

Antigen discovery technologies have largely focused on major histocompatibility complex (MHC) class I-restricted human T cell receptors (TCRs), leaving methods for MHC class II-restricted and mouse TCR reactivities relatively undeveloped. Here we present TCR mapping of antigenic peptides (TCR-MAP), an antigen discovery method that uses a synthetic TCR-stimulated circuit in immortalized T cells to activate sortase-mediated tagging of engineered antigen-presenting cells (APCs) expressing processed peptides on MHCs. Live, tagged APCs can be directly purified for deconvolution by sequencing, enabling TCRs with unknown specificity to be queried against barcoded peptide libraries in a pooled screening context. TCR-MAP accurately captures self-reactivities or viral reactivities with high throughput and sensitivity for both MHC class I-restricted and class II-restricted TCRs. We elucidate problematic cross-reactivities of clinical TCRs targeting the cancer/testis melanoma-associated antigen A3 and discover targets of myocarditis-inciting autoreactive T cells in mice. TCR-MAP has the potential to accelerate T cell antigen discovery efforts in the context of cancer, infectious disease and autoimmunity.

Main

CD8⁺ and CD4⁺ T cells recognize peptides presented on cell-surface major histocompatibility complex (MHC) class I and class II molecules, respectively, to survey the intracellular and extracellular landscape for pathogens and assess cellular health¹. The mechanism by which T cells recognize specific peptide–MHC (pMHC) combinations is through their membrane-bound T cell receptor (TCR). TCR recognition of the cognate pMHC results in T cell activation and induction of various effector functions that are critical for mounting an adaptive immune response.

Knowledge of the antigens that T cells recognize through their TCR is foundational for our understanding of how and why they are involved in the pathology of human diseases such as cancer and autoimmunity. Moreover, harnessing their exquisite specificity holds therapeutic potential and is the basis for successful vaccines and adoptive T cell therapies. However, the inherent diversity of the TCR repertoire, MHC alleles and the peptides that can be presented on any given MHC molecule makes the task of mapping T cell antigens a complex problem. In addition, TCRs can be polyspecific, recognizing multiple pMHC combinations, and typically have lower affinity for their antigens (micromolar) compared to antibody–ligand interactions (picomolar to nanomolar)^2,3. Over the past decade, we and others have developed strategies to determine TCR specificities against tumors, self-antigens, pathogens and allergens^4,5,6,7,8,9,10,11,12,13,14 but few methods have demonstrated success in assessing TCR reactivities at the proteome scale, hindering high-throughput and unbiased antigen discovery efforts^13,15. Furthermore, antigen discovery technologies have largely focused on class I-restricted human TCRs, and equivalent methods to interrogate class II-restricted CD4⁺ T cells or assess mouse TCR reactivities have not kept pace^5,6,7,8. Ultimately, a universal method that has utility for both human T cell antigen discovery efforts and other frequently used preclinical model organisms, such as mice, and that can be applied to class I-restricted and class II-restricted TCRs is highly desirable.

Our lab previously developed T-Scan^13,15, which can perform genome-wide analysis of TCR specificities across both viral and human proteomes. T-Scan works by using patient or donor T cells programmed with the TCR of interest in a screen against a library that expresses protein fragments presented on MHC molecules in target cells containing a granzyme reporter^13,15. Upon T cell recognition of target cells expressing the cognate antigen, granzyme B is cytosolically delivered to target cells, which activates the fluorescent reporter for harvesting by cell sorting.

A limitation of this system is the need to obtain fresh primary T cells for each screen. In addition, the assay kills the target cells, limiting the opportunity for further enrichment and subsequent rescreening for signal amplification. Thus, we were motivated to devise an improved screening method that did not rely directly on patient T cells and killing of target cells as part of the recognition assay.

Here we describe a new, cell-based T cell antigen discovery method called TCR mapping of antigenic peptides (TCR-MAP). TCR-MAP enables TCRs with unknown specificity to be queried against a large, peptide tiling library of antigen-presenting cells (APCs) expressing processed peptides on patient-specific or mouse-specific MHC alleles. This system relies on a synthetic circuit expressed in Jurkat cell lines transduced with the TCRs of interest. Upon T cell recognition of the cognate pMHC, the bacterial transpeptidase sortase A (SrtA) is induced and expressed on the cell surface of Jurkats and covalently biotinylates the reciprocal cognate APC. This new method is high throughput and can capture unbiased reactivities against the complete human, mouse or viral proteome or any genetically encoded peptide library of choice. Moreover, it is highly sensitive and can reproducibly discover both high-affinity and low-affinity TCR antigens. We demonstrate the utility of TCR-MAP for antigen discovery efforts for both CD8⁺ and CD4⁺ T cells and for TCRs derived from humans or mice. Application of this technology has the potential to enhance T cell antigen discovery efforts in the context of cancer, infectious disease and autoimmunity.

Results

TCR-MAP captures human MHC class I HLA-A2 and class II pMHC–TCR interactions

To establish a highly sensitive and specific reporter system that would capture cognate TCR–pMHC interactions, we selected a previously reported proximity labeling strategy using the Staphylococcus aureus transpeptidase SrtA, which covalently transfers substrates containing the polypeptide motif LPXTG to nearby N-terminal oligoglycine residues^16,17. We designed a two-cell method consisting of immortalized Jurkat T cells expressing a genetically fused mouse CD40 ligand–SrtA (mCD40L–SrtA) construct under an inducible nuclear factor of activated T cells (NFAT) promoter to serve as the donor (SrtA-Jurkats) (Extended Data Fig. 1a) and human leukocyte antigen (HLA) class I-null HEK-293T APCs transduced with an N-terminal oligoglycine-tagged mouse CD40 receptor (G₅-mCD40) as the SrtA substrate acceptor (G₅-targets)^17,18 (Fig. 1a). Jurkat cells can be easily engineered to express TCRs of interest and additional TCR signaling components such as the CD8 coreceptor. Target cells can be further transduced with the desired MHC alleles and antigens encoded as peptide fragments or full-length proteins (FL-ORF (open reading frame)) for presentation. Upon T cell activation, mCD40L–SrtA is induced on the Jurkat cell surface and catalyzes the transfer of exogenously added LPETG–biotin substrates onto cognate target cells expressing the G₅-mCD40 acceptor (Fig. 1a and Extended Data Fig. 1b). This method, which we call TCR-MAP, relies on a TCR-stimulated circuit in immortalized T cells to activate sortase-mediated biotinylation of cognate APCs.

**Fig. 1: TCR-MAP efficiently and selectively identifies cognate human TCR–pMHC interactions.**

To examine the specificity of TCR-MAP, we introduced the NLV3 TCR, specific for the human cytomegalovirus (CMV)-derived pp65 epitope, NLVPMVATV, into TCRβ-null SrtA-Jurkats and cultured them with HLA-A2⁺ G₅-target cells that were pulsed with either DMSO or the pp65 epitope^13,19. As assessed by flow cytometry, biotinylation of HLA-A2⁺ G₅-targets occurred only in the presence of the cognate pp65 peptide (Fig. 1b). We next sought to determine whether genetically encoded peptide fragments or FL-ORFs could be processed and presented by the target cells for recognition by NLV3 TCR⁺ Jurkats. We expressed either a 56-aa (amino acid) fragment that contained the NLV epitope or the pp65 FL-ORF (561 aa) in HLA-A2⁺ G₅-targets and found that both constructs resulted in biotinylation of HLA-A2⁺ target cells (Fig. 1c). The biotin signal was specific to target cells expressing the correct HLA class I-restricting allele and not HLA-A1⁺ G₅-targets that expressed the nonrestricting allele (Fig. 1c). To assess the utility of the system beyond viral antigen–TCR pairs, we generated HLA-A2⁺ G₅-target cells that encoded a 90-aa fragment from the cancer/testis antigen 1B (CTAG1B/NY-ESO-1) protein containing the IG4 TCR-specific epitope, SLLMWITQC²⁰. SrtA-Jurkats transduced with the IG4 TCR specifically biotinylated HLA-A2⁺ G₅-target cells only when the CTAG1B antigen (either the 90-aa peptide or the 180-aa FL-ORF) was expressed and showed no reactivity to the controls with no antigen or irrelevant antigens (Fig. 1d).

After establishing TCR-MAP for class I-restricted TCR–pMHC pairs, we next sought to engineer an equivalent method for class II-restricted TCRs based on the TScan-II peptide delivery strategy¹⁵. First, to ensure efficient processing and presentation of class II antigens, we transduced target cells with CIITA, the MHC class II transactivator, and CTSS, which encodes cathepsin S, the protease required for class II invariant chain processing^15,21,22,23,24. Next, we used transient CRISPR (clustered regularly interspaced short palindromic repeats) with Cas9 nucleofection to mutate all class II alleles using single guide RNAs (sgRNAs) targeting the HLA-DR, HLA-DP and HLA-DQ class II loci. Lastly, to direct antigens to the MHC class II-containing cellular compartments, oligonucleotides encoding peptide antigens were genetically fused downstream of a truncated N-terminal sequence of CD74 (invariant chain) for lentiviral expression in target cells^5,8,15,25 (Fig. 1e). SrtA-Jurkat cells were modified to express the CD4 coreceptor and either F24 or Ob1A12 TCRs, which recognize peptide antigens derived from the human immunodeficiency virus (HIV) Gag polyprotein (DR11⁺/Gag293₂₉₉–₃₁₂/RFYKTLRAEQASQE) or myelin basic protein (DR15⁺/MBP₈₅–₉₉/ENPVVHFFKNIVTPR), respectively^26,27,28. In alignment with the results obtained for HLA-A2-restricted TCR responses, G₅-target cells expressing the correct HLA class II allele and antigen were specifically recognized and biotinylated (Fig. 1f). These data demonstrate that TCR-MAP is a specific and sensitive system to capture cognate pMHC–TCR interactions for both class I-restricted (HLA-A2) and class II-restricted T cells in humans.

TCR-MAP captures mouse MHC class I-restricted (H2-K^b) and class II-restricted (H2-IA^b) pMHC–TCR interactions

Given the success of TCR-MAP in distinguishing cognate antigen–TCR specificities from humans, we next sought to test the method’s ability to assess mouse TCRs using model TCR–antigen pairs. We chose the well-characterized OT-I TCR–SIINFEKL epitope pair and established G₅-target cells that coexpressed the mouse H2-K^b MHC class I allele with full-length ovalbumin (OVA) or the minimal SIINFEKL epitope. We cotransduced our TCRβ-null SrtA-Jurkats with the mouse CD8 coreceptor and the OT-I TCR, which recognizes the 8-aa SIINFEKL epitope derived from OVA (Fig. 2a). Coculture of the two engineered cell lines showed robust biotinylation of G₅-target cells, verifying that ectopic expression of mouse MHC alleles, coreceptors and TCRs in immortalized human cell lines was sufficient for antigen recognition (Fig. 2b). To study how TCR affinity for an antigen impacts the overall signal-to-noise ratio of TCR-MAP, we next tested the reactivity of the OT-I TCR⁺ SrtA-Jurkats against mutant variants of the known SIINFEKL (N4) epitope²⁹. We selected five variants with equivalent binding affinity to H2-K^b but which differed in their overall ability to stimulate OT-I T cells²⁹. We observed a direct correlation between target cell biotinylation and TCR reactivity for the SIINFEKL variant series tested (Fig. 2c).

**Fig. 2: TCR-MAP quantitatively captures class I-restricted and class II-restricted cognate mouse TCR–pMHC interactions according to TCR affinity.**

With the success of the OT-I/SIINFEKL system, we next asked whether a parallel approach could be taken to establish TCR-MAP for class II-restricted mouse CD4⁺ TCR–antigen pairs. To achieve this goal, we first transduced the HLA class II-null, CIITA⁺CTSS⁺ G₅-target cells with the mouse H2-IA^b class II allele and full-length OVA fused downstream of the truncated invariant chain construct (CD74–OVA). Next, we transduced TCRβ-null SrtA-Jurkats with the OT-II TCR and a chimeric CD4 coreceptor composed of the extracellular mouse CD4 domain fused to the transmembrane and cytoplasmic region of human CD4 (Fig. 2d). Coculture of the two engineered target and Jurkat cell lines demonstrated significant biotin transfer that was specific for the restricting H2-IA^b allele-expressing target cells but not the nonrestricting H2-IA^d allele expressors or target cells that did not express the antigen (Fig. 2e). As expected, peptide pulsing and expression of a 90-aa tile that contained the known OT-II epitope (KISQAVHAAHAEINEAG; CD74–OVA (90-mer)) also resulted in target cell biotinylation that was restricted to H2-IA^b (Fig. 2e), although the degree to which APCs were biotinylated varied depending on the length of the genetically encoded OVA antigen. Lastly, to assess the sensitivity and ability of the engineered mouse class II system for self-antigen discovery, we generated SrtA-Jurkats that expressed two distinct mouse TCRs that were each reactive against the same antigen, mouse peptidyl arginine deiminase 4 (PAD4), but which differed in their affinity for the cognate target³⁰. Similar to the SIINFEKL variant experiment, TCR-MAP nicely phenocopied the known TCR affinities for PAD4; the SrtA-Jurkats transduced with the higher-affinity anti-PAD4 TCR (clone 6MNO) exhibited significantly stronger G₅-target cell biotinylation and Jurkat cell activation through CD69 upregulation when compared to the known lower-affinity anti-PAD4 TCR (clone 6MNG) (Fig. 2f). Overall, our data demonstrate that TCR-MAP is readily adaptable to study mouse class I-restricted (H2-K^b) and class II-restricted (H2-IA^b) TCR interactions and the magnitude of the signal is determined by the strength of the TCR–ligand interaction.

Virome-wide screens and TCR binding footprints with TCR-MAP

Mapping the exact specificity of a given TCR in terms of both the antigen recognized and its nuanced binding footprint is important for our understanding of how and what pathogens or autoantigens are recognized by T cells. Having shown that TCR-MAP cleanly discriminates cognate versus noncognate interactions and sensitively captures a wide range of TCR affinities against genetically encoded peptides, we reasoned that the method would be able to identify T cell antigens from complex, oligonucleotide libraries and map epitope binding footprints of TCRs.

We first empirically tested several conditions to determine the optimal signal-to-noise ratio of TCR-MAP, including effector-to-target cell ratios, concentration of the LPETG–biotin substrate and coculture timing (Extended Data Fig. 2a–c). We found that the biotin signal persisted on G₅-target cells even up to 24 h after coculture and that maintaining low Jurkat cell-to-target ratios improved the signal-to-noise ratio of TCR-MAP. Next, to mimic a pooled genetic screening situation where cognate target cells would be a small fraction of a large library, we performed spike-in experiments where we labeled cognate G₅-target cells with cell trace violet and added them at a frequency of <1% to controls with either no antigen or irrelevant antigens and queried their relative enrichment with a single TCR or multiple pooled TCRs (Extended Data Fig. 2d). Even when TCRs were multiplexed, cognate G₅-targets were enriched within the top 1% of all biotinylated targets. Lastly, we determined whether cognate target cells could be enriched using magnetic beads and found that cognate G₅-target cells were enriched 10–30-fold relative to the input frequencies (Extended Data Fig. 2e).

To test the sensitivity of TCR-MAP against a large library, we performed proof-of-concept screens using the well-characterized NLV3 TCR against the viral peptidome library consisting of ~100,000 unique oligonucleotides encoding 56-aa fragments offset by 28 aa across all proteins derived from viral species known to infect humans¹³. This library contains two 56-aa tiles encoding the pp65 epitope to serve as positive controls. After library introduction and puromycin selection, viral peptidome-expressing HLA-A2⁺ target cells were cultured with NLV3 TCR⁺ SrtA-Jurkat cells in the presence of LPETG–biotin. We sorted the top 1% of biotinylated target cells on the basis of streptavidin fluorophore labeling and identified the enriched antigens by Illumina sequencing by comparing the relative abundance of each oligonucleotide before and after the sort (Fig. 3a). Notably, even in the context of a large, diverse library of ~100,000 unique fragments, the top two enriched oligonucleotide tiles were the adjacent 56-aa fragments that encoded the CMV-derived pp65 epitope, NLVPMVATV (Fig. 3b).
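The comparison of oligonucleotide abundance before and after the sort can be illustrated with a minimal sketch. It assumes per-tile read counts are already available as dictionaries and scores each tile by the log2 ratio of its read fraction after versus before sorting. The pseudocount, the function name `log2_enrichment` and the toy counts are assumptions made for illustration; they are not the authors' analysis pipeline or data.

```python
import math


def log2_enrichment(pre_counts, post_counts, pseudocount=1.0):
    """Return {tile_id: log2(post_fraction / pre_fraction)} for every tile.

    A pseudocount (an assumption of this sketch) avoids division by zero
    for tiles that are absent from one of the two samples.
    """
    tiles = set(pre_counts) | set(post_counts)
    pre_total = sum(pre_counts.get(t, 0) + pseudocount for t in tiles)
    post_total = sum(post_counts.get(t, 0) + pseudocount for t in tiles)
    scores = {}
    for t in tiles:
        pre_frac = (pre_counts.get(t, 0) + pseudocount) / pre_total
        post_frac = (post_counts.get(t, 0) + pseudocount) / post_total
        scores[t] = math.log2(post_frac / pre_frac)
    return scores


# Toy read counts, invented for illustration (not data from the screen).
pre = {"pp65_tile_A": 100, "pp65_tile_B": 120, "other_1": 110, "other_2": 90}
post = {"pp65_tile_A": 900, "pp65_tile_B": 800, "other_1": 20, "other_2": 15}

scores = log2_enrichment(pre, post)
ranked = sorted(scores, key=scores.get, reverse=True)
print(ranked[:2])  # the two tiles with the highest enrichment scores
```

In a real screen the ranking would be computed across roughly 100,000 tiles and replicate sorts, and a statistical model would normally replace the raw ratio.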

**Fig. 3: TCR-MAP accurately identifies cognate T cell antigens from complex, pooled library screens and generates high-resolution epitope binding footprints of TCRs.**

Having shown that sorting the top 1% of biotinylated targets results in sensitive identification of cognate antigens, we next tested whether rounds of streptavidin magnetic bead purification could also enrich for cognate antigens. We screened the NLV3 TCR against a 56-aa fragment library tiling the entire CMV proteome (5,764 unique peptides) and found that two rounds of enrichment were sufficient to capture the four fragments in the library that contained the antigenic epitope with a signal-to-noise ratio comparable to sorting (Extended Data Fig. 3a,b). Antigen discovery screens using TCR-MAP can, thus, be performed without reliance on flow cytometry sorters.

Lastly, to test whether we could build high-resolution TCR–peptide binding footprints using TCR-MAP, we screened a comprehensive saturation mutagenesis library for the OT-I TCR–SIINFEKL antigen pair (Fig. 3c). This single mutant library was composed of 56-aa tiles where each residue of the SIINFEKL epitope and the two amino acids immediately upstream and downstream of it was substituted with each of the 19 alternative amino acids (Fig. 3c). We compared the enrichment of each mutant in the library to the wild-type (WT) SIINFEKL peptide tiles and generated a critical binding interface heatmap based on the relative enrichment or depletion scores (Fig. 3d). As expected, the vast majority of substitutions to the SIINFEKL epitope resulted in abrogated or decreased OT-I TCR recognition (Fig. 3d). However, substitutions to amino acids with similar chemical properties were tolerated at several positions, such as positions 7 and 8 (Fig. 3d). Because we previously showed that TCR-MAP can capture a wide range of OT-I TCR affinities to mutant variants of SIINFEKL (Fig. 2c), we correlated the saturation mutagenesis footprint values obtained from the screen to the relative ability of individual SIINFEKL variants to stimulate OT-I TCRs. The results of the SIINFEKL saturation mutagenesis screen showed strong concordance with the known antigenic stimulatory capacity of the SIINFEKL variants (Fig. 3e). Thus, TCR-MAP is a powerful method that can isolate and decode cognate antigen-expressing target cells from a complex, pooled library setting and capture TCR–epitope binding footprints using saturation mutagenesis antigen libraries with high sensitivity and accuracy.
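To make the size of such a single-mutant library concrete, the following sketch enumerates every single substitution across the epitope plus two flanking residues on each side. The flanking residues `LE` and `TE` are an assumption taken for illustration and should be checked against the OVA sequence actually used. The sketch omits the surrounding 56-aa tile context and the DNA encoding, and `single_mutants` is a hypothetical helper name.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def single_mutants(epitope, upstream="", downstream=""):
    """List (position, wild_type, substitution, variant) for every single
    amino acid substitution across upstream + epitope + downstream."""
    region = upstream + epitope + downstream
    variants = []
    for pos, wt in enumerate(region):
        for aa in AMINO_ACIDS:
            if aa != wt:  # 19 alternatives per position
                mutated = region[:pos] + aa + region[pos + 1:]
                variants.append((pos, wt, aa, mutated))
    return variants


# Flanking residues are an illustrative assumption, not taken from the text.
variants = single_mutants("SIINFEKL", upstream="LE", downstream="TE")
print(len(variants))  # 12 positions x 19 substitutions = 228
```

Each variant's enrichment relative to the wild-type tiles can then be arranged as a 12-by-20 grid to draw a footprint heatmap of the kind described above.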

Genome-wide TCR-MAP screens using single or pooled TCRs

Beyond TCR specificities to foreign sources such as viruses, self-antigens comprise another class of important antigenic targets. Particularly in cancer, boosting T cell responses against cancer-expressed testis antigens has shown promising antitumor responses clinically, and identifying TCRs with high on-tumor and low off-tumor specificity is an active area of research. However, the task of mapping self-reactivities is difficult because of the vast size of the human proteome from which antigens can arise and because of the often-low TCR affinities to self-peptides³.

To test whether TCR-MAP would aid in T cell antigen discovery of self-reactive TCRs at a genome scale, we performed TCR-MAP screens using the class I-restricted CTAG TCR, IG4, against the human peptidome library^3,15,31. In addition to full-length ORF proteins tiled with 90-aa peptides offset by 22 aa, this library includes additional N-terminal fragments to increase coverage of the beginning sequence of ORFs, additional protein isoforms and human endogenous retroviral proteins for a comprehensive library containing ~580,000 unique peptides (Fig. 4a). When IG4 TCR⁺ SrtA-Jurkats were screened against HLA-A2⁺ G₅-target cells expressing the human peptidome library, the top enriched peptide tiles were the CTAG1B and CTAG1A fragments that contained the known antigenic 9-mer epitope (Fig. 4a,b). In addition, a single peptide tile from hypothetical protein XM_002346349 was significantly enriched with a similar magnitude (Fig. 4a). We found that the 90-aa fragment of XM_002346349 also contained the antigenic SLLMWITQC epitope (Fig. 4b), highlighting the utility and sensitivity of TCR-MAP to identify cognate and cross-reactive antigens from a large, complex peptidome library.
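The tiling scheme (fixed-length peptides at a fixed offset) can be sketched in a few lines. This is an illustration under stated assumptions rather than the library design code: the defaults mirror the 90-aa tiles offset by 22 aa described above, the extra tile anchored at the C terminus is an assumption about how the protein end is covered, and the additional N-terminal fragments, isoforms and retroviral proteins are not modeled. `tile_protein` and the placeholder sequence are hypothetical.

```python
def tile_protein(seq, tile_len=90, step=22):
    """Split a protein sequence into overlapping tiles.

    Returns a list of (start_index, tile_sequence) pairs. Proteins shorter
    than one tile are returned whole.
    """
    if len(seq) <= tile_len:
        return [(0, seq)]
    starts = list(range(0, len(seq) - tile_len + 1, step))
    last = len(seq) - tile_len
    if starts[-1] != last:
        # Assumption of this sketch: add one tile flush with the C terminus
        # so that the final residues are always covered.
        starts.append(last)
    return [(s, seq[s:s + tile_len]) for s in starts]


protein = "ACDEFGHIKLMNPQRSTVWY" * 9  # 180-aa placeholder, not a real protein
tiles = tile_protein(protein)
print([start for start, _ in tiles])  # [0, 22, 44, 66, 88, 90]
```

Calling `tile_protein(seq, tile_len=56, step=28)` gives the geometry of the 56-aa libraries; because adjacent tiles overlap, a short epitope is usually encoded by more than one tile, which is why neighboring tiles tend to score together.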

**Fig. 4: Genome-wide TCR-MAP screens can be performed using single or pooled TCRs.**

A major goal of T cell antigen discovery efforts is to map the reactivities of multiple TCRs at a time. To increase the throughput of our screens, we wondered whether SrtA-Jurkats expressing different TCR sequences could be pooled together without compromising the signal-to-noise ratio of the approach. We performed antigen discovery screens against the human peptidome combining SrtA-Jurkat cells expressing the cancer/testis antigen-reactive IG4 or DMF5 TCRs (A2⁺/melanocyte antigen (MLANA)/EAAGIGILTV), with three additional HLA-A2-restricted TCR clonotypes that do not exhibit reactivity to self-antigens contained in the human peptidome library (Fig. 4c). As expected, the CTAG1B, CTAG1A and XM_002346349 peptide fragments containing the IG4 TCR epitope, SLLMWITQC, showed enrichment comparable to that observed when the IG4 TCR was screened alone using TCR-MAP (Fig. 4a). A peptide fragment from MLANA containing the antigen for the DMF5 TCR, EAAGIGILTV, was also strongly enriched (Fig. 4c). Proteome-wide T cell antigen discovery screens via TCR-MAP can not only be used to search for TCR reactivities for one TCR at a time but can also be multiplexed to assess five or more TCRs, thus increasing the throughput of the system.

Thus far, we demonstrated the ability of TCR-MAP to determine TCR reactivities in the context of single HLA allele-expressing target cells. However, the cells of our body can express up to eight different HLA class I or class II alleles. Moreover, promiscuous class II-restricted CD4⁺ T cells have been known to exhibit both cross-reactive (recognition of different antigens on a common HLA class II allele) and/or cross-restrictive behavior (single TCR recognizing two different antigens on two different HLA class II alleles)^32,33,34. We next screened TCR3898-2, which recognizes an epitope from CTAG1B (DR4⁺/CTAG1B₁₂₁–₁₃₀/VLLKEFTVSG), against CIITA⁺CTSS⁺ G₅-target cells expressing six different HLA class II alleles, including the restricting HLA-DRB1*04:01 allele, transduced with the invariant chain fused human peptidome library (Extended Data Fig. 4a). As expected, several 90-aa fragments from the human peptidome library containing the known antigenic epitope from CTAG1A, the close family member CTAG2 and TTN scored (Extended Data Fig. 4a)¹⁵. Notably, several additional 90-aa fragments from new protein sources including TAOK (a serine/threonine protein kinase), CENPF (centromere protein F), ZNF321 (zinc finger protein 321) and MSANTD3 (Myb/SANT-like DNA-binding domain-containing protein 3) and a single N-terminal peptide (ENST00000412481.1) were also enriched in the screen (Extended Data Fig. 4a). To validate these hits, we generated G₅-target cells that expressed each tile individually and tested for Jurkat cell activation by CD69 upregulation. Although TCR3898-2 was previously characterized to not exhibit cross-reactivity or cross-restriction³¹, the peptide fragments did indeed validate (Extended Data Fig. 4b,d). Moreover, individual fragments showed excellent concordance with fold enrichment values from the human peptidome screen, demonstrating that TCR-MAP sensitively captures antigen hierarchies of varying TCR strength (Extended Data Fig. 4c). To assess which of the validated fragments were restricted specifically to the HLA-DRB1*04:01 allele relative to the other expressed HLA class II alleles, we generated target cells that coexpressed HLA-DRB1*04:01 and each of the individual 90-aa fragments (Extended Data Fig. 4b,d). From the Jurkat cell activation data, we concluded that peptide tiles from CTAG2, TTN, CTAG1A, CENPF and MSANTD3 were restricted to HLA-DRB1*04:01, while antigenic peptides from ENST, TAOK and ZNF321 were likely presented by one of the other HLA class II alleles. In alignment with these results, the known LKEF sequence motif recognized by the TCR3898-2 CD4⁺ TCR or a very similar epitope sequence was contained only in the validated DR4-restricted peptide tiles (Extended Data Fig. 4d)^15,31. TCR-MAP is a powerful tool to detect cognate antigen reactivities that may be cross-reactive and/or cross-restrictive.

TCR-MAP discovers autoantigens of mouse TCRs

Although genome-scale antigen discovery methods have been described for the study of human TCRs, fewer high-throughput approaches exist to query TCRs from mice^6,35. We next wondered whether TCR-MAP could be applied to discover T cell antigens for class I-restricted (H2-K^b) and class II-restricted (H2-IA^b) mouse TCRs. To test this, we selected the well-characterized 2C TCR that has a known H2-L^d-associated peptide antigen, QLSPFPFDL (QL9), derived from the enzyme α-ketoglutarate dehydrogenase (OGDH)^36,37. G₅-target cells expressing mouse H2-L^d were transduced with a mouse peptidome library encoding 56-aa fragments and screened against 2C TCR⁺ SrtA-Jurkat reporter cell lines. To our surprise, three overlapping peptide fragments derived from a high-affinity copper transporter membrane protein, SLC31A1, were the only tiles that significantly enriched (Fig. 5a). To narrow down the antigenic epitope within the validated hits, we performed NetMHC³⁸ analysis on the overlapping polypeptide sequence from the adjacent peptide tiles from SLC31A1 and assessed the presence of H2-L^d binders (Fig. 5b). We tested the top predicted H2-L^d 8-aa and 9-aa binders by performing peptide pulse experiments and determined the 9-mer peptide, MPMTFYFDF, to be the reactive antigen for 2C TCR⁺ Jurkats (Fig. 5b). Notably, this epitope contained the FD motif, which is required for recognition by 2C TCR³⁹. These results highlight the utility of comprehensive T cell antigen discovery screens using TCR-MAP as TCR cross-reactivities may be difficult to predict using homology-based searches with minimal motifs.
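Narrowing a scoring region down to candidate minimal epitopes starts with listing every 8-mer and 9-mer it contains. The sketch below does only that enumeration step, with an optional filter for a required motif; it does not reproduce the NetMHC binding prediction used for ranking. The function name `candidate_kmers` and the `GGS`/`SGG` flanks around the reported 9-mer are placeholders invented for illustration, not the SLC31A1 sequence.

```python
def candidate_kmers(region, lengths=(8, 9), required_motif=None):
    """Enumerate unique k-mers of the given lengths from a region,
    optionally keeping only those that contain a required motif."""
    seen = []
    for k in lengths:
        for i in range(len(region) - k + 1):
            kmer = region[i:i + k]
            if required_motif and required_motif not in kmer:
                continue
            if kmer not in seen:  # preserve first-seen order
                seen.append(kmer)
    return seen


# Placeholder flanks (GGS / SGG) are invented; only the central 9-mer
# comes from the text.
region = "GGS" + "MPMTFYFDF" + "SGG"
candidates = candidate_kmers(region, required_motif="FD")
print(len(candidates), "MPMTFYFDF" in candidates)
```

The resulting candidate list is what would be handed to a binding predictor and then to peptide pulse experiments for confirmation.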

**Fig. 5: Genome-wide screens using TCR-MAP discover autoantigen reactivities of mouse TCRs.**

Although we validated SLC31A1 as an antigen for the 2C TCR, we wondered why the previously characterized antigen, OGDH, did not score in our screen. We generated H2-L^d target cells that expressed OGDH or SLC31A1 as 56-aa fragments and compared their relative reactivities (Fig. 5c). The peptide fragment from SLC31A1 activated 2C TCR⁺ SrtA-Jurkats with >3-fold greater potency than the peptide tile from OGDH (Fig. 5c). Further supporting the observed reactivities, epitope binding affinity predicted by NetMHC revealed that the epitope from SLC31A1 bound H2-L^d with ~200-fold greater affinity than that from OGDH (Fig. 5d). Thus, from our unbiased mouse peptidome screens, we identified a previously uncharacterized mouse self-antigen from SLC31A1 as a new and potent target of the 2C TCR.

Knowledge of self-antigen reactivities of regulatory T cells (Treg) is an important aspect of understanding how they suppress inflammation. To determine whether TCR-MAP could deconvolute class II-restricted Treg specificities, we performed mouse peptidome screens using a Treg TCR (clone MNO) that recognizes the PAD4₉₂–₁₀₅ epitope, VRVSYYGPKTSPVQ (Fig. 2f)³⁰. G₅-targets were transduced with H2-IA^b and an invariant chain fused mouse peptidome library, cocultured with mouse MNO TCR⁺ SrtA-Jurkat reporters and screened. The top-scoring hit from the class II mouse peptidome screen was a 56-aa fragment spanning positions 74–103 of PAD4 (Extended Data Fig. 5a). These data support the ability of TCR-MAP to discover self-antigen targets of class I-restricted and class II-restricted mouse TCRs.

Immune-related adverse events (irAEs) that arise in response to immune checkpoint inhibitors pose challenges for many patients during cancer treatment⁴⁰. Myocarditis is a rare but deadly form of irAE where the pathogenesis is driven by clonally expanded CD8⁺ T cells that infiltrate the heart^41,42. Although TCR reactivities to heart-specific proteins such as α-myosin have been reported, the antigenic source for other dominant T cell clonotypes present in cardiac and skeletal muscle during a mouse model of checkpoint-induced myocarditis remains unknown⁴². To rapidly identify pathogenic myocarditis T cell reactivities, we pooled SrtA-Jurkat reporter cells that expressed five different highly expanded TCRs whose relevance, if any, to the myocarditis was not known and screened them against mouse peptidome-positive H2-K^b/D^b+ G₅-targets (Fig. 5e). From the screen, two 56-aa fragments scored and subsequent validation experiments helped to deconvolute the reactivities to a single TCR clonotype, TCR4 (Fig. 5f). NetMHC analysis of the top-scoring tiles identified several predicted peptide binders for creatine kinase S type (CKMT2) and RIKEN cDNA on H2-K^b and H2-D^b+ (Fig. 5g). TCR4 showed reactivity to three of the eight peptides, where a common motif sequence, XXVRXPKL, was present in the validated epitopes (Fig. 5g). When we looked at the tissue distribution of CKMT2, we found high expression of the gene exclusively in the heart (Fig. 5h)⁴³. The reactivity of TCR4 against CKMT2 strongly suggests that this clonotype contributes to the pathology of myocarditis through recognition of self-antigens expressed in cardiac tissue. Through TCR multiplexing, we demonstrate that TCR-MAP can quickly identify new reactivities of TCRs of unknown etiology and provide detail to better understand pathological mechanisms.
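A shared motif such as XXVRXPKL can be checked against peptide sequences with a simple pattern scan, where X stands for any residue. The sketch below is one way to do this with regular expressions; the helper names and the three test peptides are invented for illustration and are not the validated CKMT2 or RIKEN epitopes.

```python
import re


def motif_regex(motif):
    """Compile a motif such as 'XXVRXPKL', where X matches any residue.

    The pattern is wrapped in a lookahead so that overlapping occurrences
    are all reported by finditer.
    """
    body = "".join("[A-Z]" if c == "X" else re.escape(c) for c in motif)
    return re.compile("(?=(" + body + "))")


def scan(motif, sequence):
    """Return (start_index, matched_peptide) for each motif occurrence."""
    return [(m.start(), m.group(1)) for m in motif_regex(motif).finditer(sequence)]


# Invented 8-mers for illustration only.
peptides = ["AAVRDPKL", "GSVRTPKL", "AAVKDPKL"]
hits = [p for p in peptides if scan("XXVRXPKL", p)]
print(hits)  # ['AAVRDPKL', 'GSVRTPKL']
```

The same `scan` function can be run across longer tiles or whole proteins to list additional sequences that carry the motif, which remain candidates only until tested experimentally.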

TCR-MAP predicts adverse cross-reactivities of clinical TCRs

Adoptive T cell therapy is a promising clinical strategy to eliminate tumors^44,45. One approach within the cell therapy space has been to engineer high-affinity TCRs targeting self-antigens expressed by cancer cells^46,47. While TCRs can be engineered to improve antigen recognition, undesirable on-tumor and off-tumor reactivities have been observed upon such affinity enhancement strategies, leading to complications during treatment^48,49,50. In one example, TCR engineering to enhance recognition of the HLA-A*01:01-restricted melanoma-associated antigen A3 (MAGEA3) showed promising ex vivo efficacy for several affinity-enhanced TCRs tested⁵¹. However, one of the affinity-enhanced clonotypes, the a3a TCR, was found to cross-react with a peptide from the cardiac protein titin (TTN), leading to cardiogenic shock and death of two patients during treatment for melanoma^51,52,53. Therefore, there is a need not only to comprehensively map candidate TCRs for reactivities against self-targets that may be of clinical benefit but also to screen against off-target antigens that may yield adverse events before clinical use.

On the basis of the ability of TCR-MAP to accurately define cross-reactive self-antigens for a given TCR (Fig. 4), we sought to test whether we could comprehensively map the autoreactive landscape of the a3a TCR. We performed human peptidome screens by culturing HLA-A*01:01⁺ G₅-target cells with SrtA-Jurkat cells transduced with a3a TCR and sorted the top 1% of biotinylated G₅-target cells to characterize the full self-reactivity profile of the a3a TCR (Fig. 6a). Among the top enriched hits from the screen were previously detected antigens for the pre-enhanced TCR including MAGEA3, MAGEA6, FAT2 (a protocadherin) and PLD5 (a phospholipase) (Fig. 6a)¹³. Several overlapping fragments from TTN showed significant enrichment, as did previously uncharacterized reactivities to MAGEB18 and a predicted exon splice junction in the calcium-responsive transcription factor gene (Fig. 6a). Close inspection of the enriched peptide tile sequences from the screen uncovered the critical EXDPIXXXY motif that is likely recognized by the a3a TCR⁵¹ (Fig. 6a).

**Fig. 6: Therapeutic application of TCR-MAP to predict adverse cross-reactivities of clinical TCR.**

EpitopeID detection of cross-reactive peptides

To further examine whether all possible cross-reactivities of the a3a TCR were discovered, we performed saturation mutagenesis screens to generate a high-resolution TCR footprint (Fig. 6b). Using the saturation mutagenesis enrichment scores, we developed an algorithm called EpitopeID that predicts a rank order of peptides in the human proteome that may be recognized by the a3a TCR (Methods). EpitopeID leverages the fold enrichment values to generate a scoring matrix based on the relative reactivity of a given peptide variant and performs in silico analysis to score the relative stimulatory potential of 9-mer peptides derived from the human proteome (Fig. 6c). Further filtering of potential antigens for HLA-A*01:01 binders was conducted using NetMHC³⁸. From this analysis, the top five predicted peptides were derived from the proteins that scored in the human peptidome screens. Two other peptides that weakly scored in the screen were in the top 65 predicted off-targets, PLD5 (44) and FAT2 (64). In addition, our analysis uncovered four peptides from ZNF609, ALCAM, MAGEA2 and FGD5 that scored highly but were not hits in the human peptidome screen. We synthesized peptides for each of the untested epitopes and performed peptide pulsing validation experiments. Only the positive control peptide derived from MAGEA3, EVDPIGHLY, was able to stimulate the a3a TCR⁺ Jurkat cells (Fig. 6c), suggesting that the screen recovered the majority of cross-reactive antigens from the human peptidome. Thus, TCR-MAP has utility for preclinical studies where reducing the risk of engineered TCRs is a primary goal.

Discussion

Here we characterized an antigen discovery method called TCR-MAP that uses a TCR-stimulated circuit in immortalized T cells to activate sortase-mediated tagging of engineered APCs expressing genetically encoded peptides on MHCs of interest. We demonstrated that TCR-MAP accurately captures self-reactivities or viral reactivities with high throughput and sensitivity for a diverse set of MHC class I-restricted (CD8⁺) and class II-restricted (CD4⁺) T cells and for TCRs derived from humans or mice.

TCR-MAP has several advantages over previously reported antigen discovery methods to facilitate large-scale T cell antigen discovery efforts. Firstly, the reagents and tools used to perform TCR-MAP are readily available in all labs that do mammalian cell culture and do not require specialized protocols such as barcoded tetramer generation or equipment such as microfluidic devices^9,54,55,56. Secondly, the system uses target cells that can process long polypeptides or full-length proteins expressed in the cytosol or MHC class II loading compartments for presentation on many different MHC class I or class II molecules, respectively. This negates the need for a priori knowledge of antigenic epitopes and enables synthesis of highly customizable and scalable genetic libraries where the identity of the antigen is unknown^7,57,58,59. Thirdly, TCR-MAP takes advantage of immortalized cell lines for antigen discovery efforts, which avoids donor-to-donor variability associated with using primary T cells for cytotoxicity or activation assays and circumvents the need for obtaining patient T cells for antigen discovery or subsequent validation efforts^8,12,13,15. In addition, engineering of immortalized cell lines allows for species-agnostic T cell antigen discovery and we highlighted here the ability of the method to accurately identify cognate TCR reactivities for both humans and mice. Fourthly, the reporter of cognate T cell reactivities is the biotinylation of G₅-target cells, whereby the signal is quantitative and proportional to the strength of the antigen recognition and avoids cytotoxic effects on the target cells^12,13,15. This opens the possibility of enriching live target cells and growing them for further signal amplification through several rounds of enrichment and subsequent rescreening.
Thus, these last two points highlight some of the strengths inherent in TCR-MAP and outline the design advancements that were incorporated in our second-generation T cell antigen discovery method, building on our previous T-Scan technologies. Lastly, we demonstrated that the method can accurately identify cognate antigens from genetic screens where single or pooled TCRs are used. Moreover, detailed TCR binding footprints can be generated and used to assess potential cross-reactivities. TCR-MAP is, thus, a sensitive and convenient antigen discovery method to deconvolute TCR specificities at scale and has broad application across species.

While powerful, there are several aspects of T cell antigen discovery that the current design of TCR-MAP does not address. Firstly, although we demonstrated that TCR-MAP can accurately capture low-affinity and high-affinity antigens when several TCRs are combined, larger-scale antigen screening efforts that combine multiple TCR clonotypes with varying antigen affinities would benefit from further optimization of the effector-to-target cell ratios used in a screening context and the maximal number of clonotypes to pool on the basis of known cognate TCRs as internal controls. Secondly, without further engineering of target cells, TCR-MAP is unlikely to capture T cell reactivities to post-translationally modified (PTM) peptides. Future work expressing enzymes that catalyze modifications such as phosphorylation (by kinases) or citrullination (by peptidyl arginine deiminases) in target cells may enable PTM antigen discovery efforts. Lastly, we did not demonstrate the ability of TCR-MAP to deconvolute T cell reactivities against nonpeptide antigens such as lipids (for CD1d-restricted invariant natural killer T cells) or metabolites (for MR1-restricted mucosal-associated invariant T cells). Owing to the flexibility of the TCR-MAP method, however, additional engineering of target cells to express the restricting MHC allele of interest coupled with genetic screening strategies to activate lipid or vitamin B metabolic pathways may expand the possibility of unbiased T cell antigen discovery for T cell subsets not restricted by MHC classes I and II.

There is a growing need to understand the pathological mechanisms that drive irAEs in patients who receive immune checkpoint blockade treatment. Using multiplexed TCR-MAP screens against our mouse proteome library, we were able to rapidly identify cardiac self-reactivity against CKMT2 for an expanded TCR clonotype derived from a mouse model of myocarditis. It will be interesting to assess whether CKMT2 also serves as an autoantigen for expanded CD8 TCRs in human patients with myocarditis. Thus, TCR-MAP holds promise as a tool for T cell antigen discovery efforts in mouse models, which can then inform new insights and generate hypotheses for evaluation in human disease.

The clonal selection theory proposes that each individual lymphocyte bears a single type of receptor with unique specificity². However, we now appreciate that TCRs can exhibit some level of cross-reactivity and cross-restriction, thereby increasing the total number of foreign antigens that T cells can respond to^2,32,33,34,60. As adoptive T cell therapies gain ground as a cancer treatment option, there is a growing need to test the safety profile of clinical TCRs before infusion into patients. By vetting the a3a MAGEA3-specific TCR using TCR-MAP, we discovered several cross-reactive antigens that were unknown at the time of clinical use. Knowledge of the a3a MAGEA3 TCR specificity against self-proteins such as TTN may have motivated greater precaution and further engineering before patient treatment that could have saved lives. Systematic characterization of T cell reactivity profiles using comprehensive antigen mapping technologies such as TCR-MAP will undoubtedly help to reduce the risk of therapeutic TCRs moving forward.

Lastly, we demonstrated that the EpitopeID algorithm has utility in predicting off-target effects of TCR specificity. Many approaches have been proposed to predict the epitope specificity of TCRs⁶¹. Computational approaches for these predictions commonly use receptor–peptide interaction databases such as VDJdb, IEDB and PIRD and/or additional information such as from CDR3 sequencing. In contrast, EpitopeID is a functional readout of the saturation mutagenesis TCR footprint using endogenous antigen presentation, which provides an empirical, high-resolution starting point from which cross-reactivity predictions can be made. Furthermore, by querying TCR footprints against a proteome of interest rather than against all possible k-mers, we cast a wide net in terms of search space for potential cross-reactors while also imposing a logical constraint to sequences that are likely to be observed. In our study, TCR-MAP saturation mutagenesis footprinting, coupled with EpitopeID, predicted the top five scoring peptides in the human peptidome screen itself. It also predicted two further cross-reactive epitopes, from PLD5 and FAT2, within the top 65 predictions. One explanation for why these cross-reactivities did not receive a higher score is that the algorithm was trained on the effects of single-amino acid changes. It is therefore possible that changes owing to pairs of amino acids can act in a cooperative, nonlinear fashion to influence the TCR loop interaction structure in currently unpredictable ways⁶². Future efforts that use mutagenesis matrices that contain all possible double-amino acid substitutions might, thus, further improve TCR reactivity predictions^8,13,47.

Methods

Cell culture

HEK-293T (CRL-3216) and TCRβ-null Jurkat (J.RT3-T3.5) cell lines were obtained from the American Type Culture Collection (ATCC). HEK-293T cells were cultured in DMEM (Gibco, 11995065) with 10% FBS (HyClone) and 1% penicillin–streptomycin (Invitrogen, 15140-122). TCRβ-null Jurkat cells were cultured in RPMI (Gibco, A10491-01) with 10% FBS (HyClone) and 1% penicillin–streptomycin (Invitrogen, 15140-122). All cell lines were regularly tested for mycoplasma and were all negative. Cells were obtained directly from ATCC and, thus, were not authenticated.

Generation of TCR-MAP target cell lines

HEK-293T cells were transfected with sgRNAs targeting conserved sequences across the HLA-A, HLA-B and HLA-C loci to generate HLA class I-null target cells as described previously¹³. To generate target cells for MHC class II antigen presentation, HEK-293T cells were first transduced with a lentiviral vector containing an EF1α promoter driving expression of CIITA (UniProt P33076) and CTSS (UniProt P25774). Cells were sorted for high HLA II expression. Cas9 protein (Thermo Fisher Scientific, A36499) was complexed with 900 pmol of sgRNAs targeting HLA-DPB1, HLA-DPA1, HLA-DQB1, HLA-DQA1, HLA-DRA, HLA-DRB1 and HLA-DRB5 (Invitrogen TrueGuide Synthetic gRNAs, A35510) in Opti-MEM medium and incubated for 5 min at room temperature to form Cas9 ribonucleoproteins and added to cells (Thermo Fisher Scientific, Lipofectamine CRISPRMAX Cas9 Transfection Reagent, CMAX00003). After incubation for 48 h, the cells were assessed for HLA II expression and cells exhibiting diminished cell-surface HLA class II molecules were then single-cell cloned by sorting into 96-well plates. Both the HLA class I-null and HLA class I/II-null HEK-293T cell lines were transduced with lentivirus containing EF1α-G₅-mCD40-neomycin constructs. HLA-A*02:01, HLA-A*01:01, HLA-DRB1*11:01, HLA-DRB1*15:01, HLA-DRB1*04:01, HLA-DRB1*01:02, HLA-DRA*01:01, HLA-DPB1*04:01;DPA*01:03, HLA-DPB1*04:02;DPA*01:03, HLA-DQB1*03:01;DQA*03:01 and HLA-DQB1*05:01;DQA*01:01 sequences were obtained from the IPD-IMGT/HLA database⁶³ and synthesized as gBlocks (IDT). Mouse MHC allele sequences were obtained from UniProt and synthesized as gBlocks (IDT): H2-K^b (UniProt P01901), H2-L^d (UniProt P01897), H2-IA^d (UniProt P01921) and H2-IA^b (UniProt P14483). Human and mouse MHC alleles were cloned into pDONR221. For expression, they were Gateway-cloned into pHAGE-EF1α-DEST expression vectors with variable antibiotic selection and fluorophore markers.

Generation of TCR-MAP Jurkat cell lines

TCRβ-null Jurkat cells were spinfected with lentivirus at varying concentrations to achieve a multiplicity of infection (MOI) < 1 and to introduce CD4 or CD8 coreceptors, the NFAT-SrtA reporter and TCRs of interest. A total of 1 × 10⁶ cells were spun with 8 μg ml⁻¹ polybrene (Millipore, TR-1003-G) and lentivirus for 30 min at 800g in 12-well plates. Cells were incubated at 37 °C and the virus was washed off after 24 h. At 48 h after spinfection, the cell-surface expression of constructs was tested by flow cytometry. Human CD4 (UniProt P01730) and CD8 (P01732 and P10966) and mouse CD8 (UniProt P01731 and P10300) coreceptors were synthesized as gBlocks (IDT) and cloned into pDONR221 (Gateway 12536017). CD8β and CD8α receptors were separated by a porcine teschovirus 1 2A (P2A) peptide⁶⁴. The CD4 coreceptor used for mouse class II antigen discovery was engineered by fusing the mouse extracellular domain of CD4 (UniProt P06332) with the human transmembrane and cytoplasmic tail of CD4. Codon-optimized TCRβ and TCRα variable sequences (V-CDR3-J domains) of TCRs used in the study were fused to mouse TCR constant regions, gene-synthesized (Twist Biosciences) and subcloned in pHAGE-EF1α-DEST-PGK-Bsd expression vectors. Affinity-matured a3a MAGEA3 TCRs were designed using human TCR constant regions and cloned into pHAGE-EF1α-DEST-PGK-ZsGreen vectors for expression. Each TCR was encoded as a single construct containing TCRα P2A TCRβ with either the mouse or human TCR constant region.
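As a rough guide to the MOI target in this section, lentiviral integration is often approximated as a Poisson process. The sketch below uses that approximation, which is an assumption introduced here for illustration and is not part of the protocol above; the function names are hypothetical.

```python
import math


def infected_fraction(moi):
    """Fraction of cells with at least one integration (Poisson assumption)."""
    return 1.0 - math.exp(-moi)


def single_integrant_fraction(moi):
    """Among infected cells, fraction carrying exactly one integration."""
    return moi * math.exp(-moi) / infected_fraction(moi)


# Compare a low MOI (reporter cell lines) with a high MOI (library infections).
for moi in (0.3, 3.0):
    print(moi, round(infected_fraction(moi), 3), round(single_integrant_fraction(moi), 3))
```

Under this model, a low MOI leaves most cells uninfected but keeps most transduced cells at a single integration, whereas an MOI of 3–5 infects nearly every cell with several integrations each.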

Generation of TCR-MAP reporter constructs

Plasmids encoding G₅–Myc–mCD40 and Flag–SrtA–mCD40L were kindly provided by Gabriel Victora¹⁷. G₅–Myc–mCD40 was subsequently cloned into the pHAGE-EF1α-DEST-PGK-neomycin vector. Flag–SrtA–mCD40L was cloned into inducible expression vectors containing leucine zipper (ZIP) domains and NFAT response elements in various combinations upstream of a minimal interleukin 2 (IL-2) promoter (Extended Data Fig. 1a).

Lentiviral production

HEK-293T cells were transfected with second-generation lentiviral packaging plasmids pMD2.G (Addgene, cat. no. 12259) and psPAX2 (Addgene, cat. no. 12260), encoding VSV-G, Tat, Rev and Gag-Pol. Transfection was performed using PolyJet In Vitro DNA Transfection Reagent (SignaGen, SL100688) according to the manufacturer’s protocol. Viral supernatants were collected 48 and 72 h after transfection, passed through a 0.45-µm filter and added to cells.

SrtA substrates

Biotin–aminohexanoic acid–LPETGS (C-terminal amide, 95% purity) was purchased from LifeTein (custom synthesis) and stock solutions were prepared in PBS at 20 mM as previously reported¹⁷.

Peptide pulsing and endogenous antigen expression cocultures

Small-scale validation experiments testing TCR reactivities against various antigens were performed in 96-well plates adding 100,000 G₅-target cells with 100,000 SrtA-Jurkat cells in the presence of 50 μM of LPETG–biotin substrate. After 6–16-h incubation at 37 °C, cells were washed twice with PBS supplemented with 0.5% BSA and 2 mM phosphate-buffered EDTA (PBE) to remove excess LPETG–biotin before analysis by flow cytometry. Peptides (GenScript custom peptide synthesis) used for peptide pulsing experiments were added to G₅-target cells at a concentration of 1 μM for 1 h before cells were washed and cultured with SrtA-Jurkat cells. For endogenous expression of antigens, 56-mer or 90-mer peptide fragments were reverse-translated and synthesized as gBlocks (IDT) with 5′ and 3′ BP recombination sites. Peptide fragments were Gateway-cloned into pHAGE-CMV-Nflag-HA-DEST-IRES-Puro or pHAGE-CMV-CD74₁–₈₀-DEST-PGK-Puro expression vectors for class I or class II antigen presentation experiments, respectively. Peptides and antigens used for TCR3898-2 validation are reported in Extended Data Fig. 4. All other peptides and antigens expressed in target cells were as follows:

Pp65 56-mer: RLKAESTVAPEEDTDEDSDNEIHNPAVFTWPPWQAGILARNLVPMVATVQSGARA*

CTAG1B 90-mer: APPLPVPGVLLKEFTVSGNILTIRLTAADHRQLQLSISSCLQQLSLLMWITQCFLPVFLAQPPSGQRR*

HIV gag 56-mer: SILDIRQGPKEPFRDYVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKA

OT-II OVA 90-mer: LMAMGITDVFSSSANLSGISSAESLKISQAVHAAHAEINEAGREVVGSAEAGVDAASVSEEFRADHPFLFCIKHIATNAVLFFGRCVSP*

PAD4 56-mer: EVTLQVKAASSRTDDEKVRVSYYGPKTSPVQALIYITGVELSLSADVTRTGRVKPA

OGDH 56-mer: EEEVAITRIEQLSPFPFDLLLKEAQKYPNAELAWCQEEHKNQGYYDYVKPRLRTTI

SLC31A1 56-mer: HSHGGGDSMMMMPMTFYFDFKNVNLLFSGLVINTPGEMAGAFVAVFLLAMFYEGLK

SIINFEKL (WT 8-mer): SIINFEKL. Variants of SIINFEKL used contained the indicated substitution from the WT sequence.

OT-II peptide: KISQAVHAAHAEINEAG

PAD4 peptide: VRVSYYGPKTSPVQ

SLC31A1 predicted peptide binders: PMTFYFDF, MPMTFYFDF and MMMPMTFYF

MAGEA3 peptide: EVDPIGHLY

ZNF609 peptide: EMDPILWYR

ALCAM peptide: EMDPVTQLY

MAGEA2B peptide: EVVPISHLY

MAGEA2 peptide: EVVPISHLY

FGD5 peptide: EVGPIFHLY

Flow cytometry

Cells were stained for at least 30 min in PBE with antibodies and then washed two times in PBE. Samples were acquired using LSR-II (BD Biosciences) or CytoFLEX (Beckman Coulter) flow cytometers and data were analyzed using FlowJo (version 10.8.2) software. All antibodies or cell-surface staining reagents were from BioLegend and were used at 0.5−1 μl per million cells (APC anti-mouse CD40, clone 3/23; BV421 streptavidin, 405226; PE or BV421 anti-human CD69, clone FN50; APC anti-mouse CD154 (CD40L), clone SA047C3; PE anti-biotin, 1D4-C5; APC anti-human HLA-A, HLA-B and HLA-C, clone W6/32; APC anti-human HLA-DR, HLA-DP and HLA-DQ, clone Tu39; BV421 anti-mouse MHC class II, clone M5/11.15.2; PE anti-human CD4, clone RPA-T4; FITC anti-mouse CD4, clone RM4-5; BV785 anti-human CD8, clone SK1; BV421 anti-mouse CD8, clone 53-6.7; APC anti-mouse H2Kb, clone AF6-88.5).

Fluorescence-activated cell sorting-based TCR-MAP screens

For TCR-MAP screens, 35 μl of antibody (anti-mouse CD40, clone 3/23 and BV421 streptavidin) in a total volume of 1 ml was used per 80 million cells. Staining was conducted for 30 min at 4 °C; cells were then washed in PBE before sorting. Sorting was performed on a Sony MA900 instrument where the top 1% of biotinylated target cells were isolated.

Magnetic bead enrichment-based TCR-MAP screens

CMV-specific TCR-MAP screens were performed by culturing HLA-A*02:01⁺ G₅-target cells with NLV3 TCR⁺ SrtA-Jurkats at a ratio of 1:1 for 8–12 h in the presence of 50 μM LPETG–biotin. Screens were performed at 1,000× library representation with three biological replicates. Cells were collected using enzyme-free dissociation medium (PBS supplemented with 2 mM EDTA) and washed three times in PBE before performing streptavidin microbead magnetic column isolation according to the manufacturer’s protocol (Miltenyi Biotec, 130-048-101). Bound cells were eluted and plated in 10-cm plates and grown for 1 week, after which further cocultures and enrichments were performed. Isolated target cells were saved after each round of enrichment for genomic DNA (gDNA) isolation and library preparation for sequencing.

CMV-specific and virome-wide peptidome libraries

The CMV-focused and virome-wide libraries were described previously^13,65. Each library was cloned into the pHAGE-CMV-Nflag-HA-DEST-IRES-Puro lentiviral vector by Gateway cloning. This vector enables peptide fragments to be uniformly expressed by identical start codons followed by an N-terminal Flag and HA tag. At least 100× library representation was maintained at each step of cloning. HLA-A*02:01⁺ G₅-target cells were infected with lentivirus containing the CMV-focused or virome-wide library at an MOI of 3–5 to achieve 1,000× library representation and selected with 1 μg ml⁻¹ puromycin for 3 days before use in TCR-MAP screens.

Saturation mutagenesis library

The SIINFEKL epitope was mutagenized in the context of a 56-mer (LPFASGTMSMLVLLPDEVSGLEQLESIINFEKLTEWTSSNVMEERKIKVYLPRMKME) and an 8-mer (SIINFEKL). Each amino acid in the SIINFEKL epitope (bold) was substituted to each of the other 19 amino acids. As a control, the two adjacent amino acids outside the SIINFEKL epitope in the 56-aa version (underlined) were also substituted to each of the other 19 amino acids. This set contained 228 mutants and two WT epitope controls. For the MAGEA3 saturation mutagenesis library, the MAGEA3 epitope was mutagenized in the context of a 56-mer (VIFSKASSSLQLVFGIELMEVDPIGHLYIFATCLGLSYDGLLGDNQIMPKAGLLIIV) and a 9-mer (EVDPIGHLY). Each amino acid in the MAGEA3 epitope (bold) was substituted to each of the other 19 amino acids. As a control, the two adjacent amino acids outside the MAGEA3 epitope in the 56-aa version (underlined) were also substituted to each of the other 19 amino acids. These 247 mutants were combined with the two WT epitopes. Each peptide was reverse-translated with nonrare human codons in two different nucleic acid sequences for a total of 460 and 498 oligo sequences. The SIINFEKL and MAGEA3 mutagenesis libraries were combined with 1,034 unrelated peptides (saturation mutagenesis tiles for the BMLF epitope derived from Epstein–Barr virus (EBV) and the MLANA cancer/testis antigen epitope) for a total library size of 1,992 oligo tiles. The 5′ (ACCCGTCACCGGCCA) and 3′ adaptors (GGGCTCGCCACGTCG) were added and oligonucleotides were synthesized by Twist Bioscience. The library was PCR-amplified using primers complementary to the adaptor sequences with overhangs encoding BP recombination sites. The amplified library was cloned into the pDONR221 vector using BP Clonase (Thermo Fisher Scientific) and Gateway-cloned into the pHAGE-CMV-Nflag-HA-DEST-IRES-Puro lentiviral expression vector. At least 100× library representation was maintained during all cloning steps. 
H2-K^b⁺ or HLA-A*01:01⁺ G₅-target cells were infected with lentivirus containing the saturation mutagenesis library at an MOI of <1 to achieve 1,000× library representation and selected with 1 μg ml⁻¹ puromycin for 3 days before use in TCR-MAP screens to test the OT-I or a3a TCRs, respectively.
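The library arithmetic in this section can be checked with a short sketch. It assumes that "the two adjacent amino acids outside the epitope" means two residues on each side of the epitope, a reading inferred here from the reported mutant counts (12 × 19 = 228 and 13 × 19 = 247) rather than stated in the text. The function name is hypothetical.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def single_substitution_variants(tile, epitope, flank=2):
    """All single-residue variants of `tile` at epitope and flanking positions."""
    start = tile.index(epitope)  # raises ValueError if the epitope is absent
    first = max(0, start - flank)
    last = min(len(tile), start + len(epitope) + flank)
    variants = []
    for pos in range(first, last):
        for aa in AMINO_ACIDS:
            if aa != tile[pos]:
                variants.append(tile[:pos] + aa + tile[pos + 1:])
    return variants


OVA_TILE = "LPFASGTMSMLVLLPDEVSGLEQLESIINFEKLTEWTSSNVMEERKIKVYLPRMKME"
MAGEA3_TILE = "VIFSKASSSLQLVFGIELMEVDPIGHLYIFATCLGLSYDGLLGDNQIMPKAGLLIIV"

ova_variants = single_substitution_variants(OVA_TILE, "SIINFEKL")
magea3_variants = single_substitution_variants(MAGEA3_TILE, "EVDPIGHLY")

# Two WT controls per set and two codon encodings per peptide.
ova_oligos = 2 * (len(ova_variants) + 2)
magea3_oligos = 2 * (len(magea3_variants) + 2)
total_oligos = ova_oligos + magea3_oligos + 1034  # plus the unrelated tiles

print(len(ova_variants), len(magea3_variants), ova_oligos, magea3_oligos, total_oligos)
```

Under this reading, the counts reproduce the 460 and 498 oligo sequences and the total library size of 1,992 reported above.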

Human 90-mer peptidome library

The human peptidome 90-mer library consisted of 90-aa tiles with 22-aa offset covering the entire human proteome, as previously described¹⁵. In addition, GENCODE protein-coding transcripts (version 29) were subjected to the basic local alignment search tool (BLAST) against the ORFeome library; of the 500 nonredundant ORFs that remained, 90-aa tiles offset by 22-aa were generated. Additional 90-aa tiles offset by 22-aa were generated from ORFs encoded by endogenous retroviral elements (ERVs) from the GEVE database, recurrent cancer mutations from the COSMIC database and hotspot mutations⁶⁶. Aspartate residues were placed before the beginning of each methionine of each 29-aa N-terminal peptide and concatenated with two other N-terminal fragments to generate additional representation of the N terminus of human ORFs. The 5′ (GGAATTCCGCTGCGT) and 3′ adaptors (CAGGGAAGAGCTCGA) were added and oligonucleotides were synthesized by Twist Bioscience. The library was PCR-amplified using primers complementary to the adaptor sequences with overhangs encoding BP recombination sites. The amplified library was cloned into the pDONR221 vector using BP Clonase (Thermo Fisher Scientific) and Gateway-cloned into the pHAGE-CMV-Nflag-HA-DEST-IRES-Puro or pHAGE-CD74_1–80-DEST-PGK-Puro lentiviral expression vectors for class I or class II antigen discovery efforts, respectively. At least 100× library representation was maintained during all cloning steps. HLA-A*02:01⁺, HLA-A*01:01⁺ or DR⁺DP⁺DQ⁺ G₅-target cells were infected with lentivirus containing the human peptidome library at an MOI of 3–5 to achieve 1,000× library representation and selected with 1 μg ml⁻¹ puromycin for 3 days before use in TCR-MAP screens to test the IG4, a3a or 3898-2 TCRs, respectively.
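The tiling scheme described at the start of this section can be sketched as follows. This is a minimal illustration and not the library design code: it assumes that "offset" means the distance between consecutive tile start positions and that a final tile is anchored to the C terminus so that the end of each ORF is covered. The function name and default parameters are hypothetical.

```python
def tile_protein(seq, window=90, step=22):
    """Split a protein sequence into fixed-length tiles with a fixed start offset."""
    if len(seq) <= window:
        return [seq]  # short ORFs yield a single, shorter tile
    last_start = len(seq) - window
    starts = list(range(0, last_start + 1, step))
    if starts[-1] != last_start:
        starts.append(last_start)  # assumption: add a C-terminal tile
    return [seq[s:s + window] for s in starts]


# A 200-aa toy ORF gives tiles starting at 0, 22, 44, 66, 88 and 110.
example_tiles = tile_protein("M" * 200)
print(len(example_tiles))
```

The same function with `step=1` and `window=N` yields the overlapping N-mers used when a proteome is scanned residue by residue.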

Mouse 56-mer peptidome library screens

Proteins used for the mouse 56-mer peptidome library were obtained from the following UniProt proteomes: UP000000589 (C57BL/6J), UP000002494 (Brown Norway), UP000002474 (LCMV Armstrong), UP000008479 (Murine polyomavirus A2), UP000158963 (Mus musculus polyomavirus 2) and UP000129308 (Mus musculus papillomavirus type 1). Peptidome tiles were generated by randomly sampling from the de Bruijn graph representation of the mouse proteome dataset to try to achieve the most uniform distribution of k-mers possible. For each protein, additional C-terminal 56-mer tiles were included. Peptide fragments were reverse-translated, adaptors were appended to the 5′ (AGGAATTCCGCTGCGT) and 3′ (ATGGTCACAGCTGTGC) ends and oligonucleotides were synthesized by Twist Bioscience. The library was PCR-amplified using primers complementary to the adaptor sequences with overhangs encoding BP recombination sites. The amplified library was cloned into the pDONR221 vector using BP Clonase (Thermo Fisher Scientific) and Gateway-cloned into the pHAGE-CMV-Nflag-HA-DEST-IRES-Puro or pHAGE-CD74_1–80-DEST-PGK-Puro lentiviral expression vectors for mouse class I or class II antigen discovery efforts, respectively. At least 100× library representation was maintained during all cloning steps. H2-K^b⁺, H2-L^d⁺ or H2-IA^b⁺ G₅-target cells were infected with lentivirus containing the mouse peptidome library at an MOI of 3–5 to achieve 1,000× library representation and selected with 1 μg ml⁻¹ puromycin for 3 days before use in TCR-MAP screens to test the myocarditis, 2C or 6MNO TCRs, respectively.

Library preparation for sequencing

gDNA was extracted from sorted or magnetic enrichment-purified cells using the GeneJET Genomic DNA Purification Kit (Thermo Fisher Scientific). Input library cells with at least 40× representation were collected and prepared for each screen. Antigen libraries were prepared for Illumina sequencing following a previously published protocol^13,15. Samples were sequenced on Illumina MiSeq or NextSeq using the standard Illumina sequencing primers.

Saturation mutagenesis scoring matrix analysis

For analysis of the saturation mutagenesis footprints, we developed an algorithm called EpitopeID⁶⁷. Overall, the effect of EpitopeID is that the data from a position specific scoring matrix (PSSM), such as that generated by a saturation mutagenesis screen, can be used to compute a numeric score for any sequence of interest, where the numeric score serves as a prediction for how that queried sequence would perform if screened. For a motif length of N, with each residue being substituted to the 19 other amino acids, a PSSM of size 20 × N would be generated. This results in the value in each cell of this PSSM representing the measured TCR reactivity relative to WT. For any amino acid sequence of length N, the score S would be computed by the following equation:

$$S=\mathop{\sum }\limits_{i}^{20}\mathop{\sum }\limits_{j}^{N}{s}_{{ij}}$$

where i represents the identity of the amino acid, j represents the position in the queried amino acid sequence and s_ij represents the saturation mutagenesis screen score relative to WT for the peptide with substitution i at position j. Furthermore, p_j represents the ‘position weight’ of position j in the epitope and a_i represents the ‘residue weight’ for substituting amino acid i at position j. Screen score s_ij ranges from 0 to 1, with 0 indicating no TCR activation and 1 indicating activation greater than or equal to WT peptide. To define the amino acid interval that contains an epitope, we compute ${q}_{j}=\frac{{-s}_{{{\rm{WT}}{j}}}\,+\,{\sum }_{i}^{20}\left(1-{s}_{{ij}}\right)}{20-1}$ for a given position j, which can be thought of as the average decrease in activation observed by substituting the residue at position j of the epitope to one of the 19 other amino acids. Peptide positions at which there is a large decrease in activation upon substitution are likely to be critical residues for the TCR–peptide interaction. The amino acid interval containing the epitope of interest was defined as the continuous range containing all positions with q_j ≥ 0.5 × max(q₁, …, q_N). Amino acid positions outside of the interval containing the epitope were given a position weight of p_j = 0. Amino acid positions inside the interval containing the epitope were given a position weight of ${p}_{j}=\frac{{q}_{j}}{\max \left({q}_{1},\ldots, {q}_{N}\right)}$. The implementation of EpitopeID in this paper used a uniform amino acid weight for all substitutions across all positions; however, the algorithm is capable of accounting for a customized amino acid weight matrix. To discover potential TCR cross-reactivities, human ORFs from our genome-wide human peptidome library were computationally tiled using a window size of N and an increment of one amino acid between tiles to represent all available k-mers of length N. 
Each N-mer was then scored using the above equation to assess its potential similarity to the motif tested by saturation mutagenesis. For each human N-mer sequence, this score S was normalized by the theoretical maximum for that PSSM and reported as a percentage. Human peptides containing the best-scoring N-mers from this analysis were then considered for subsequent validation.
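The scoring procedure described above can be sketched in a few functions. This is an illustrative reimplementation under stated assumptions and not the EpitopeID script itself: q_j is computed from the verbal description (mean decrease over the 19 non-WT residues), each position contributes its position weight multiplied by the screen score of the query residue, residue weights are uniform, and the score is normalized by the best attainable weighted score. All function names and the toy footprint are hypothetical and the toy values are not measured data.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def position_weights(pssm, wt):
    """pssm[j][aa] holds the screen score (0 to 1) for residue aa at position j."""
    q = []
    for j, wt_aa in enumerate(wt):
        others = [aa for aa in AMINO_ACIDS if aa != wt_aa]
        q.append(sum(1.0 - pssm[j][aa] for aa in others) / len(others))
    q_max = max(q)
    if q_max == 0:
        return [0.0] * len(q)
    critical = [j for j, value in enumerate(q) if value >= 0.5 * q_max]
    first, last = critical[0], critical[-1]  # continuous interval
    return [q[j] / q_max if first <= j <= last else 0.0 for j in range(len(q))]


def score_percent(query, pssm, weights):
    """Weighted score of `query` as a percentage of the theoretical maximum."""
    raw = sum(w * pssm[j].get(aa, 0.0) for j, (aa, w) in enumerate(zip(query, weights)))
    best = sum(w * max(pssm[j].values()) for j, w in enumerate(weights))
    return 100.0 * raw / best if best else 0.0


def rank_proteome(proteins, pssm, wt, top=10):
    """Score every N-mer (step of one residue) and return the best hits."""
    weights = position_weights(pssm, wt)
    n = len(wt)
    hits = []
    for name, seq in proteins.items():
        for start in range(len(seq) - n + 1):
            kmer = seq[start:start + n]
            hits.append((score_percent(kmer, pssm, weights), name, start, kmer))
    return sorted(hits, reverse=True)[:top]


def toy_pssm(wt, critical):
    """Invented footprint: substitutions abolish activation only at critical positions."""
    return [
        {aa: 0.0 if (j in critical and aa != wt_aa) else 1.0 for aa in AMINO_ACIDS}
        for j, wt_aa in enumerate(wt)
    ]


WT = "ACDEF"  # toy motif, not a peptide from this study
pssm = toy_pssm(WT, critical={1, 2, 3})
weights = position_weights(pssm, WT)
ranked = rank_proteome({"toy1": "MMWCDEWMM", "toy2": "MMACDKFMM"}, pssm, WT, top=2)
print(weights)
print(ranked)
```

In the toy example, the tolerant flanking positions receive zero weight, so a sequence that preserves the three critical residues scores as highly as WT regardless of its flanks.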

Statistical analysis

Statistical tests were conducted using Prism (GraphPad version 9.0) software. Unpaired, two-tailed Student’s t-tests and one-way analyses of variance (ANOVAs) with Tukey–Kramer multiple-comparison tests to further examine pairwise differences were used. The statistical analyses performed for the various experiments are outlined in the figure legends.

Sequence alignment and analysis

Read processing and alignment were performed with Cutadapt⁶⁸ and Bowtie 2 (ref. ⁶⁹), respectively. The fractional abundance of each antigen in each screen replicate was divided by the fractional abundance in the presort input library to calculate the fold enrichment of the peptide tile. MAGeCK version 0.5.8 was used to assign P values to peptide fragments in TCR-MAP screens, whereby different codons used for a peptide were treated as the sgRNAs and amino acid sequences were used as the genes. Screen figures were generated using DataGraph (version 4.7).
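The fold-enrichment calculation described above can be sketched as follows. This is a minimal illustration with invented counts. Keying oligos by peptide and codon version, and skipping oligos absent from the input library, are assumptions made here; the function names are hypothetical.

```python
from collections import defaultdict


def fractional_abundance(counts):
    """Convert raw read counts per oligo into fractions of the sample total."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no reads in sample")
    return {oligo: n / total for oligo, n in counts.items()}


def fold_enrichment(sorted_counts, input_counts):
    """Fractional abundance after sorting divided by that in the presort input."""
    sorted_frac = fractional_abundance(sorted_counts)
    input_frac = fractional_abundance(input_counts)
    return {
        oligo: sorted_frac.get(oligo, 0.0) / frac
        for oligo, frac in input_frac.items()
        if frac > 0  # assumption: oligos absent from the input are skipped
    }


def group_by_peptide(enrichment):
    """Collect enrichments of the codon variants that encode the same peptide."""
    grouped = defaultdict(list)
    for (peptide, _codon_version), value in sorted(enrichment.items()):
        grouped[peptide].append(value)
    return dict(grouped)


# Toy counts keyed by (peptide, codon version); the numbers are invented.
input_counts = {("pepA", 1): 100, ("pepA", 2): 100, ("pepB", 1): 100, ("pepB", 2): 100}
sorted_counts = {("pepA", 1): 60, ("pepA", 2): 30, ("pepB", 1): 5, ("pepB", 2): 5}

enrichment = fold_enrichment(sorted_counts, input_counts)
per_peptide = group_by_peptide(enrichment)
print(per_peptide)
```

Grouping codon variants under their peptide mirrors the way codon versions are treated as sgRNAs and amino acid sequences as genes when P values are assigned.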

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Plasmids and cell lines generated in this study are available upon reasonable request and are subject to a material transfer agreement (MTA) from the lead contact. The MTA template and conditions for its use are provided as Supplementary Note 1. Normal tissue FPKM data were obtained from The Human Protein Atlas (https://www.proteinatlas.org/about/download). Source data are provided with this paper.

Code availability

A custom EpitopeID script⁶⁷ to perform saturation mutagenesis scoring matrix analysis is available at https://doi.org/10.5281/zenodo.8103914 (see Methods for details).

References

1. Davis, M. M. et al. Ligand recognition by αβ T cell receptors. Annu. Rev. Immunol. 16, 523–544 (1998).
2. Sewell, A. K. Why must T cells be cross-reactive? Nat. Rev. Immunol. 12, 669–677 (2012).
3. Stone, J. D., Chervin, A. S. & Kranz, D. M. T-cell receptor binding affinities and kinetics: impact on T-cell activity and specificity. Immunology 126, 165–176 (2009).
4. Joglekar, A. V. & Li, G. T cell antigen discovery. Nat. Methods 18, 873–880 (2021).
5. Lee, M. N. & Meyerson, M. Antigen identification for HLA class I- and HLA class II-restricted T cell receptors using cytokine-capturing antigen-presenting cells. Sci. Immunol. 6, eabf4001 (2021).
6. Kisielow, J., Obermair, F. J. & Kopf, M. Deciphering CD4⁺ T cell specificity using novel MHC–TCR chimeric receptors. Nat. Immunol. 20, 652–662 (2019).
7. Birnbaum, M. E. et al. Deconstructing the peptide–MHC specificity of T cell recognition. Cell 157, 1073–1087 (2014).
8. Cattaneo, C. M. et al. Identification of patient-specific CD4⁺ and CD8⁺ T cell neoantigens through HLA-unbiased genetic screens. Nat. Biotechnol. 41, 783–787 (2023).
9. Bentzen, A. K. et al. Large-scale detection of antigen-specific T cells using peptide–MHC-I multimers labeled with DNA barcodes. Nat. Biotechnol. 34, 1037–1045 (2016).
10. Joglekar, A. V. et al. T cell antigen discovery via signaling and antigen-presenting bifunctional receptors. Nat. Methods 16, 1–13 (2019).
11. Li, G. et al. T cell antigen discovery via trogocytosis. Nat. Methods 16, 1–13 (2019).
12. Sharma, G. & Holt, R. A. T-cell epitope discovery technologies. Hum. Immunol. 75, 514–519 (2014).
13. Kula, T. et al. T-Scan: a genome-wide method for the systematic discovery of T cell epitopes. Cell 178, 1016–1028 (2019).
14. Wang, R. F., Wang, X., Atwood, A. C., Topalian, S. L. & Rosenberg, S. A. Cloning genes encoding MHC class II-restricted antigens: mutated CDC27 as a tumor antigen. Science 284, 1351–1354 (1999).
15. Dezfulian, M. H. et al. TScan-II: a genome-scale platform for the de novo identification of CD4⁺ T cell epitopes. Cell 186, 5569–5586 (2023).
16. Mazmanian, S. K., Liu, G., Ton-That, H. & Schneewind, O. Staphylococcus aureus sortase, an enzyme that anchors surface proteins to the cell wall. Science 285, 760–763 (1999).
17. Pasqual, G. et al. Monitoring T cell–dendritic cell interactions in vivo by intercellular enzymatic labelling. Nature 553, 1–22 (2018).
18. Skerka, C., Decker, E. L. & Zipfel, P. F. A regulatory element in the human interleukin 2 gene promoter is a binding site for the zinc finger proteins Sp1 and EGR-1. J. Biol. Chem. 270, 22500–22506 (1995).
19. Schub, A., Schuster, I. G., Hammerschmidt, W. & Moosmann, A. CMV-specific TCR-transgenic T cells for immunotherapy. J. Immunol. 183, 6819–6830 (2009).
20. Robbins, P. F. et al. Single and dual amino acid substitutions in TCR CDRs can enhance antigen-specific T cell functions. J. Immunol. 180, 6116–6131 (2008).
21. Driessen, C. et al. Cathepsin S controls the trafficking and maturation of MHC class II molecules in dendritic cells. J. Cell Biol. 147, 775–790 (1999).
22. Steimle, V., Siegrist, C. A., Mottet, A., Lisowska-Grospierre, B. & Mach, B. Regulation of MHC class II expression by interferon-γ mediated by the transactivator gene CIITA. Science 265, 106–109 (1994).
23. Roche, P. A. & Furuta, K. The ins and outs of MHC class II-mediated antigen processing and presentation. Nat. Rev. Immunol. 15, 203–216 (2015).
24. Wang, H. Y. et al. Tumor-specific human CD4⁺ regulatory T cells and their ligands: implications for immunotherapy. Immunity 20, 107–118 (2004).
25. Rosskopf, S. et al. Creation of an engineered APC system to explore and optimize the presentation of immunodominant peptides of major allergens. Sci. Rep. 6, 1–16 (2016).
26. Ota, K. et al. T-cell recognition of an immunodominant myelin basic protein epitope in multiple sclerosis. Nature 346, 183–187 (1990).
27. Benati, D. et al. Public T cell receptors confer high-avidity CD4 responses to HIV controllers. J. Clin. Invest. 126, 2093–2108 (2016).
28. Wucherpfennig, K. W. et al. Clonal expansion and persistence of human T cells specific for an immunodominant myelin basic protein peptide. J. Immunol. 152, 5581–5592 (1994).
29. Zehn, D., Lee, S. Y. & Bevan, M. J. Complete but curtailed T-cell response to very low-affinity antigen. Nature 458, 211–214 (2009).
30. Stadinski, B. D. et al. Hydrophobic CDR3 residues promote the development of self-reactive T cells. Nat. Immunol. 17, 946–955 (2016).
31. Poncette, L., Chen, X., Lorenz, F. K. M. & Blankenstein, T. Effective NY-ESO-1-specific MHC II-restricted T cell receptors from antigen-negative hosts enhance tumor regression. J. Clin. Invest. 129, 324–335 (2018).
32. Wang, J. et al. HLA-DR15 molecules jointly shape an autoreactive T cell repertoire in multiple sclerosis. Cell 183, 1264–1281 (2020).
33. Lang, H. L. E. et al. A functional and structural basis for TCR cross-reactivity in multiple sclerosis. Nat. Immunol. 3, 940–943 (2002).
34. Yousef, S. et al. TCR bias and HLA cross-restriction are strategies of human brain-infiltrating JC virus-specific CD4⁺ T cells during viral infection. J. Immunol. 189, 3618–3630 (2012).
35. Graham, D. B. et al. Antigen discovery and specification of immunodominance hierarchies for MHCII-restricted epitopes. Nat. Med. 24, 1–17 (2018).
36. Udaka, K., Tsomides, T. J. & Eisen, H. N. A naturally occurring peptide recognized by alloreactive CD8⁺ cytotoxic T lymphocytes in association with a class I MHC protein. Cell 69, 989–998 (1992).
37. Udaka, K., Tsomides, T. J., Walden, P., Fukusen, N. & Eisen, H. N. A ubiquitous protein is the source of naturally occurring peptides that are recognized by a CD8⁺ T-cell clone. Proc. Natl Acad. Sci. USA 90, 11272–11276 (1993).
38. Andreatta, M. & Nielsen, M. Gapped sequence alignment using artificial neural networks: application to the MHC class I system. Bioinformatics 32, 511–517 (2016).
39. Speir, J. A. et al. Structural basis of 2C TCR allorecognition of H-2L^d peptide complexes. Immunity 8, 553–562 (1998).
40. Darnell, E. P., Mooradian, M. J., Baruch, E. N., Yilmaz, M. & Reynolds, K. L. Immune-related adverse events (irAEs): diagnosis, management, and clinical pearls. Curr. Oncol. Rep. 22, 39 (2020).
41. Wang, D. Y. et al. Fatal toxic effects associated with immune checkpoint inhibitors: a systematic review and meta-analysis. JAMA Oncol. 4, 1721–1728 (2018).
42. Axelrod, M. L. et al. T cells specific for α-myosin drive immunotherapy-related myocarditis. Nature 611, 818–826 (2022).
43. Fagerberg, L. et al. Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics. Mol. Cell. Proteomics 13, 397–406 (2014).
44. Restifo, N. P., Dudley, M. E. & Rosenberg, S. A. Adoptive immunotherapy for cancer: harnessing the T cell response. Nat. Rev. Immunol. 12, 269–281 (2012).
45. Liu, Y. et al. TCR-T immunotherapy: the challenges and solutions. Front. Oncol. 11, 794183 (2022).
46. Johnson, L. A. et al. Gene therapy with human and mouse T-cell receptors mediates cancer regression and targets normal tissues expressing cognate antigen. Blood 114, 535–546 (2009).
47. Varela-Rohena, A. et al. Genetic engineering of T cells for adoptive immunotherapy. Immunol. Res. 42, 166–181 (2008).
48. Morgan, R. A. et al. Cancer regression and neurological toxicity following anti-MAGE-A3 TCR gene therapy. J. Immunother. 36, 133–151 (2013).
49. Parkhurst, M. R. et al. T cells targeting carcinoembryonic antigen can mediate regression of metastatic colorectal cancer but induce severe transient colitis. Mol. Ther. 19, 620–626 (2011).
50. van den Berg, J. H. et al. Case report of a fatal serious adverse event upon administration of T cells transduced with a MART-1-specific T-cell receptor. Mol. Ther. 23, 1541–1550 (2015).
51. Cameron, B. J. et al. Identification of a titin-derived HLA-A1-presented peptide as a cross-reactive target for engineered MAGE A3-directed T cells. Sci. Transl. Med. 5, 197ra103 (2013).
52. Linette, G. P. et al. Cardiovascular toxicity and titin cross-reactivity of affinity-enhanced T cells in myeloma and melanoma. Blood 122, 863–871 (2013).
53. Zhao, X. et al. Tuning T cell receptor sensitivity through catch bond engineering. Science 376, eabl5282 (2022).
54. Dahotre, S. N., Chang, Y. M., Romanov, A. M. & Kwong, G. A. DNA-barcoded pMHC tetramers for detection of single antigen-specific T cells by digital PCR. Anal. Chem. 91, 2695–2700 (2019).
55. Segaliny, A. I. et al. Functional TCR T cell screening using single-cell droplet microfluidics. Lab Chip 18, 3733–3749 (2018).
56. Ng, A. H. C. et al. MATE-Seq: microfluidic antigen–TCR engagement sequencing. Lab Chip 19, 3011–3021 (2019).
57. Gee, M. H. et al. Antigen identification for orphan T cell receptors expressed on tumor-infiltrating lymphocytes. Cell 172, 549–563 (2017).
58. Wen, F., Esteban, O. & Zhao, H. Rapid identification of CD4⁺ T-cell epitopes using yeast displaying pathogen-derived peptide library. J. Immunol. Methods 336, 37–44 (2008).
59. Saligrama, N. et al. Opposing T cell responses in experimental autoimmune encephalomyelitis. Nature 572, 481–487 (2019).
60. Attaf, M., Huseby, E. & Sewell, A. K. αβ T cell receptors as predictors of health and disease. Cell. Mol. Immunol. 12, 391–399 (2015).
61. Meysman, P. et al. Benchmarking solutions to the T-cell receptor epitope prediction problem: IMMREP22 workshop report. Immunoinformatics 9, 100024 (2023).
62. Riley, T. P. et al. T cell receptor cross-reactivity expanded by dramatic peptide/MHC adaptability. Nat. Chem. Biol. 14, 934–942 (2018).
63. Robinson, J. et al. The IPD and IMGT/HLA database: allele variant databases. Nucleic Acids Res. 43, D423–D431 (2015).
64. Kim, J. H. et al. High cleavage efficiency of a 2A peptide derived from porcine teschovirus-1 in human cell lines, zebrafish and mice. PLoS ONE 6, e18556 (2011).
65. Xu, G. J. et al. Systematic autoantigen analysis identifies a distinct subtype of scleroderma with coincident cancer. Proc. Natl Acad. Sci. USA 113, E7526–E7534 (2016).
66. Chang, M. T. et al. Identifying recurrent mutations in cancer reveals widespread lineage diversity and mutational specificity. Nat. Biotechnol. 34, 155–163 (2016).
67. Sie, B. EpitopeID (degronID). Zenodo https://doi.org/10.5281/zenodo.8103914 (2022).
68. Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 17, 10–12 (2011).
69. Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).

Acknowledgements

We would like to thank the members of the Elledge lab for helpful discussion and advice, P. Bruno, Chen, G. Chen and G. Zhu for thoughts and comments on the manuscript and G. Victora for sharing the LIPSTIC plasmids. This study was supported by the following sources: The Mark Foundation for Cancer Research of the Life Science Research Foundation (A.C.K.), the Department of Defense (BC171184 to S.J.E.) and the National Institutes of Health (R01CA234600 to S.J.E.). S.J.E. is a member of the Ludwig Institute for Cancer Research at Harvard and is an Investigator with the Howard Hughes Medical Institute.

Author information

Authors and Affiliations

Division of Genetics, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
Ayano C. Kohlgruber, Mohammad H. Dezfulian, Brandon M. Sie, Charlotte I. Wang, Tomasz Kula & Stephen J. Elledge
Department of Genetics, Harvard University Medical School, Boston, MA, USA
Ayano C. Kohlgruber, Mohammad H. Dezfulian, Brandon M. Sie, Charlotte I. Wang, Tomasz Kula & Stephen J. Elledge
Division of Immunology, Boston Children’s Hospital, Boston, MA, USA
Ayano C. Kohlgruber
Department of Pathology, Massachusetts General Hospital, Boston, MA, USA
Charlotte I. Wang
Society of Fellows, Harvard University, Cambridge, MA, USA
Tomasz Kula
Department of Genetics and Genomic Sciences and Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Uri Laserson
Institute for Cell Engineering, Division of Immunology, Department of Pathology, Johns Hopkins School of Medicine, Baltimore, MD, USA
H. Benjamin Larman
Howard Hughes Medical Institute, Chevy Chase, MD, USA
Stephen J. Elledge

Authors

Ayano C. Kohlgruber
Mohammad H. Dezfulian
Brandon M. Sie
Charlotte I. Wang
Tomasz Kula
Uri Laserson
H. Benjamin Larman
Stephen J. Elledge

Contributions

A.C.K. and S.J.E. conceptualized and designed the study. A.C.K. performed the analysis and experiments. M.H.D., C.I.W. and T.K. contributed to essential reagents and materials for the study. M.H.D. developed the MHC class II antigen presentation strategy and designed the class II peptidome library. C.I.W. and B.M.S. wrote scripts to aid in data analysis. H.B.L. contributed the mouse peptidome library. U.L. along with H.B.L. designed and provided the mouse peptidome tiling library used in the study. A.C.K., B.M.S. and S.J.E. wrote the paper. S.J.E. supervised the work and provided funding.

Corresponding author

Correspondence to Stephen J. Elledge.

Ethics declarations

Competing interests

T.K. is a founder of and holds equity in T-Scan Therapeutics and ImmuneID. S.J.E. is a founder of and holds equity in T-Scan Therapeutics, MAZE Therapeutics, ImmuneID and Mirimus, serves on the scientific advisory boards of Homology Medicines, ImmuneID, MAZE Therapeutics and T-Scan Therapeutics and is an advisor for MPM Capital, none of which affect this work. H.B.L. is a founder of and holds equity in Infinity Bioscience, Alchemab and ImmuneID and is an advisor to T-Scan Therapeutics. The other authors declare no competing interests.

Peer review

Peer review information

Nature Biotechnology thanks Anthony Purcell and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Design of an inducible SrtA expression construct for TCR-MAP.

(a) Vector designs for the different inducible NFAT-SrtA reporters tested. (b) Jurkat cells transduced with the various NFAT-SrtA reporters were stimulated for 6 h (top) or 1, 4, 6, 8, 24 h (bottom) with either 1 μg/ml anti-CD3 antibody (top & bottom) or PMA/I (top). Percentage of murine CD40L expressed on the cell surface of Jurkats was assessed by flow cytometry. Error bars indicate the mean and SD. Data in b are representative of n = 3 independent biological replicates. PMA, phorbol 12-myristate 13-acetate. I, ionomycin.

Extended Data Fig. 2 Optimization of the TCR-MAP platform.

(a) Streptavidin cell surface labeling of HLA-A*02:01⁺ G₅-target cells transduced with (green) or without (gray) the CTAG1B antigen at the indicated effector-to-target cell ratios with IG4 TCR⁺ SrtA-Jurkats for 6 h using an LPETG-biotin concentration of 50 μM. Reported fold change values were calculated based on the ratio of streptavidin fluorophore MFI of the CTAG1B 90-aa peptide expressing G₅-targets over streptavidin fluorophore MFI of no antigen controls. (b) Streptavidin cell surface labeling of HLA-A*02:01⁺ G₅-target cells transduced with (orange) or without (gray) the CTAG1B antigen at the indicated concentration of LPETG-biotin substrate added to a co-culture with IG4 TCR⁺ SrtA-Jurkats for 6 h at an effector-to-target ratio of 1:1. Reported fold change values (right) were calculated based on the ratio of streptavidin fluorophore MFI of the CTAG1B 90-aa tile expressing G₅-targets over streptavidin fluorophore MFI of no antigen controls on the bottom. (c) Quantification of HLA-DRB1*11:01⁺ CIITA⁺CTSS⁺ G₅-target cell biotinylation after transduction with an HIV gag 56-aa fragment containing the known antigen for the F24 TCR. LPETG-biotin substrate (50 μM) was added at the indicated times. Reported fold change values were calculated based on the ratio of the mean of antigen-expressing G₅-target cell biotinylation over the mean of no antigen G₅-target cell biotinylation. Co-cultures were performed at an effector-to-target ratio of 1:1. (d) Representative flow plots showing target cell biotinylation of HLA-A*02:01⁺ G₅-target cells transduced with (red) or without (gray) the CTAG1B antigen. CTV-labeled antigen⁺ G₅-target cells were spiked into non-antigen controls at a frequency of 1% and co-cultured with IG4 TCR⁺ SrtA-Jurkats alone or combined with four other non-cognate TCRs at a final effector-to-target ratio of 1:1 for 6 h using an LPETG-biotin concentration of 50 μM.
Quantification of the total frequency of CTV-labeled antigen⁺ G₅-targets relative to the whole population (light gray) or gated on the top 1% of biotinylated G₅-targets (dark gray) is reported for all TCR pooling conditions. Reported fold change values were calculated based on the ratio of the frequency of CTV⁺ G₅-target cells in the top 1% of biotinylated cells over their frequency in the total input population. (e) Representative flow plots showing the pre- and post-magnetic bead enrichment of target cells. Pp65 antigen⁺ HLA-A*02:01⁺ G₅-target cells were spiked into no antigen control cells at a frequency of <1% and co-cultured with NLV3 TCR⁺ SrtA-Jurkats for 6 h using an LPETG-biotin concentration of 50 μM at an effector-to-target ratio of 1:1 or 1:2, as indicated. Cells from the co-culture were labeled with streptavidin-conjugated magnetic beads and run over a magnetic enrichment column (Miltenyi). Cells bound to the column were eluted and analyzed by flow cytometry to quantify cognate antigen (CTV⁺) enrichment. Reported fold change values were calculated based on the ratio of the frequency of CTV⁺ G₅-target cells bound to the column versus their frequency in the input population. Each dot in c–e represents a different biological replicate, where error bars in a–e indicate SD. Data in a–e are representative of n = 3 independent biological replicates. MFI, mean fluorescence intensity. CTV, cell trace violet.

Extended Data Fig. 3 Magnetic bead enrichment is capable of isolating cognate antigens using TCR-MAP.

TCR-MAP screen schematic and results of NLV3 TCR⁺ SrtA-Jurkats screened against a CMV-specific viral peptidome library in HLA-A*02:01⁺ G₅-target cells. Biotinylated G₅-target cells were either FACS isolated (a) or labeled with streptavidin conjugated magnetic beads and run over a magnetic enrichment column for a total of two consecutive purifications for isolation (b). Cellular barcodes from isolated cells are PCR amplified from gDNA. Downstream NGS and analysis of enriched reads relative to the input library enables calculation of adjusted p values to call enriched peptides by the TCR queried. The results of the screen are plotted such that each dot represents one peptide with the y axis indicating the negative log10 p adjusted values by Mageck and the x axis reporting the geometric mean of the enrichment of the peptide across three replicates. Fold enrichment is defined as the ratio of the abundance of the peptide in the sorted population relative to the input library. Peptides highlighted in red contain the known cognate antigen for the NLV3 TCR. CMV, cytomegalovirus. FACS, fluorescence-activated cell sorting. gDNA, genomic DNA.

Extended Data Fig. 4 TCR-MAP identifies cross-reactive and cross-restrictive antigenic hits for CTAG-reactive CD4 TCR.

(a) Human genome-wide TCR-MAP screen results of the TCR3898-2 TCR. Each dot represents one peptide with the y axis plotting the negative log10 p adjusted values by Mageck and the x axis calling the geometric mean of the enrichment of the peptide across three replicates. Fold enrichment is defined as the ratio of the abundance of the peptide in the sorted population relative to the input library. Peptides highlighted in blue are known antigens of the TCR3898-2 TCR, while yellow indicates new antigens that validated. (b) Determination of CD69 upregulation of TCR3898-2 TCRs to the indicated antigens in CIITA⁺CTSS⁺ G₅-target cells that express six different HLA-DR, -DP, and -DQ alleles or only the HLA-DRB1*04:01 allele to assess HLA-restriction patterns. The relative correlation of the fold enrichment of the peptide fragment vs the CD69 upregulation of TCR3898-2 TCRs by that same epitope sequence is plotted in (c) and the r² value reported. (d) Representative flow cytometry plots of 3898-2 TCR⁺ SrtA-Jurkat cell activation by assessing CD69 upregulation in response to the indicated peptides expressed in CIITA⁺CTSS⁺ G₅-target cells expressing all six DR⁺DP⁺DQ⁺ alleles or only the HLA-DRB1*04:01 allele. * p = 0.0266 (left) and 0.0397 (right), **** p < 0.0001 for each group relative to no antigen control by two-tailed t test. Each dot in b represents a different biological replicate, where error bars indicate the mean and SD. Data in e are representative of n = 3 independent biological replicates. HLA, human leukocyte antigen. CIITA, class II transactivator. CTSS, cathepsin S.

Extended Data Fig. 5 TCR-MAP accurately identifies Treg TCR reactivities.

(a) Mouse genome-wide TCR-MAP screen results of PAD4-reactive 6MNO TCR⁺ SrtA-Jurkats. 56-aa peptide fragments covering the murine proteome were expressed in H2-IAb⁺ CIITA⁺CTSS⁺ G₅-target cells. SrtA-Jurkat cells were added to achieve a final effector-to-target ratio of 1:1. Each dot represents one peptide with the y axis reporting the negative log10 p adjusted values by Mageck and the x axis indicating the geometric mean of the enrichment of the peptide across three replicates. Fold enrichment is defined as the ratio of the abundance of the peptide in the sorted population relative to the input library. Peptides highlighted in red indicate validated hits from the screen. PAD4, peptidylarginine deiminase 4.

Supplementary information

Reporting Summary

Supplementary Note 1

MTA template.

Source data

Source Data Fig. 3b

Peptide tiles enriched in the NLV3 TCR viral peptidome screen.

Source Data Fig. 3d

Results of the SIINFEKL saturation mutagenesis footprint screens with the OT-I TCR.

Source Data Fig. 4a

Peptide tiles enriched in the IG4 TCR human peptidome screen.

Source Data Fig. 4c

Peptide tiles enriched in the pooled IG4, DMF5 and three irrelevant A2 restricted TCR screens against the human peptidome.

Source Data Fig. 5a

Peptide tiles enriched in the 2C TCR mouse peptidome screen.

Source Data Fig. 5e

Peptide tiles enriched in the pooled myocarditis TCR screen.

Source Data Fig. 6a

Peptide tiles enriched in the MAGEA3 a3a TCR human peptidome screen.

Source Data Fig. 6b,c

List of peptides and their EpitopeID scores when analyzed against the 9-aa human peptidome library. Peptides were then filtered for HLA-A*01:01 binders.

Source Data Extended Data Fig. 3

Results of the NLV3 TCR screened against the CMV peptidome.

Source Data Extended Data Fig. 4

Peptide tiles enriched in the TCR3898-2 human peptidome screen.

Source Data Extended Data Fig. 5

Results of the 6MNO TCR screen against the mouse peptidome.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Kohlgruber, A.C., Dezfulian, M.H., Sie, B.M. et al. High-throughput discovery of MHC class I- and II-restricted T cell epitopes using synthetic cellular circuits. Nat Biotechnol (2024). https://doi.org/10.1038/s41587-024-02248-6

Received: 05 November 2023
Accepted: 16 April 2024
Published: 02 July 2024
DOI: https://doi.org/10.1038/s41587-024-02248-6
Springer Nature America, Inc.
