Abstract
Respiratory syncytial virus (RSV) is a significant cause of lower respiratory tract infection in infants and elderly. To understand the evolution of neutralizing epitopes on the RSV glycoprotein (G) and fusion (F) proteins, we conducted a multi-year surveillance program (OUTSMART-RSV) in the US. Analysis of 1,146 RSV samples from 2015–2017 revealed a slight shift in prevalence from RSV A (58.7%) to B (53.7%) between the two seasons. RSV B was more prevalent in elderly (52.9% and 73.4%). Approximately 1% of the samples contained both RSV A and B viruses. All RSV A isolates were ON1 and almost all the B isolates were BA9 genotypes. Compared with the 2013 reference sequences, changes at the F antigenic sites of RSV B were greater than RSV A, which mainly occurred at antigenic sites V (L172Q/S173L at 99.6%), Ø (I206M/Q209K at 18.6%) and IV (E463D at 7%) of RSV B F. Sequence diversities in the G protein second hypervariable region were observed in the duplicated regions for RSV A and B, and at the G stop codon resulting in extension of 7 amino acids (22.1%) for RSV B in 2016–17. Thus, RSV surface glycoproteins are continuously evolving, and continued surveillance is important for the clinical evaluation of immunoprophylactic products.
Similar content being viewed by others
Introduction
RSV is the most common cause of lower respiratory tract infections (LRTI) among infants and young children worldwide, resulting in annual epidemics worldwide1. In the United States (US) alone, there are estimated 132,000 to 172,000 RSV-associated hospitalizations annually involving children <5years-old, predominantly in infants2. The only approved anti-RSV immunoprophylaxis is Palivizumab3,4, a humanized monoclonal antibody (mAb) directed against the RSV F protein, which is licensed only for infants at highest risk. Although Palivizumab has been shown to reduce hospitalization by 50% in high-risk infants5, prevention of RSV illnesses in all infants remains a major public health priority. RSV is also an important causative agent of LRTI in the elderly. In the US, among the annual 687,000 hospitalizations and 74,000 deaths caused by pneumonia in the elderly, approximately 2–9% of these are caused by RSV6,7.
RSV belongs to the Paramyxoviridae family. It is a non-segmented, negative-sense, single-stranded, enveloped RNA virus. RSV virions contain three glycoproteins, F, G and SH. The F and G proteins are the two major integral membrane proteins that play critical roles in viral entry and egress, and are the major antigens for eliciting neutralizing antibody responses. Although the G protein is dispensable for RSV replication in vitro, G deletion mutants are highly attenuated in vivo8, suggesting a critical role of the G protein in infecting human airway epithelial cells. RSV is classified into two subtypes, RSV A and RSV B, based on the genetic characteristics of the G gene9. The C-terminal region of the G protein (the second hypervariable region, HVR2) has been used as the basis for genotyping10. RSV A ON1 genotype, characterized by a 72 nucleotide (nt) (23 amino acid residues, AA) duplication in the HVR2, is currently the predominant RSV A genotype. It was first reported in Canada in 201211 and had been detected in at least 25 countries by 201712,13,14,15. RSV B BA genotype, characterized by a 60 nt (20 AA) duplication in the HVR2, was first reported in Argentina in 1999 and has become the predominant RSV B genotype globally16. The emergence of these two distinct RSV subtypes independently has been hypothesized to be due to immune pressure on the G protein selecting for these novel viruses17,18,19.
The RSV F protein is essential for viral entry into cells as it mediates fusion between virion and host cell membranes. Fusion requires a conformational change from the pre-fusion to the post-fusion form of F20. Targeting the transition from the pre-fusion F to the post-fusion F form can prevent the fusion process and block RSV infection in vitro and in vivo18,21,22,23. Several mAbs and small molecules targeting the pre-fusion form of F are currently under clinical development and could become effective prophylactic or therapeutic interventions against RSV18,21,22,23. The F protein contains several antigenic sites (Ø, I-V)24,25. Two antigenic sites, II (target of Palivizumab) and IV (target of 101F and MAb19), are present in both the pre- and post-fusion F proteins26,27. However, the pre-fusion and post-fusion F proteins also harbor their own respective antigenic sites such as site Ø (target of MEDI889718) in pre-fusion form of F and site I in post-fusion form of F, respectively28. Monoclonal antibodies such as Motavizumab against site II29,30 and Suptavumab against site V31,32 have also been tested clinically. Although the F protein is generally thought to be conserved, variability in the F protein leading to viruses that can escape neutralization by anti-RSV mAbs has been described30,33,34,35,36.
To understand the RSV molecular epidemiology and the potential impact of genetic variability on vaccine and antiviral drug development, we collected and sequenced RSV-positive samples from the US and Puerto Rico as part of the OUTSMART-RSV (Observational United States Targeted Surveillance of Monoclonal Antibody Resistance and Testing of Respiratory Syncytial Virus) surveillance program.
Results
Differential distribution of RSV A and RSV B subtypes
A total of 1,146 RSV positive samples from subjects ranging in age from birth to 97 years were subtyped and revealed that both RSV A and B co-circulated in both the 2015–16 and 2016–17 winter RSV seasons. RSV A (58.7%) was more prevalent than RSV B (40.0%) in the 2015–16 season whereas RSV B (53.7%) was more prevalent than RSV A (45.3%) in the 2016–17 season. In both seasons, a small number of samples contained both RSV A and B [n = 4 (1.3%) in 2015–16 and n = 8 (1.0%) in 2016–17] (Table 1).
The majority (60.1%) of the samples were from infants <1 year-old with the remainder of samples from subjects between 1 and 97 years. There were more male than female patients in both RSV seasons overall. Noticeably, during the 2015–16 season when RSV A was more prevalent, RSV A was detected in more younger patients, 60.7% in <1 year-old, 62.2% in 1 to 2 years-old, 56.0% in 3 to 59 years-old, but lower in >60 years-old (41.2%). In contrast, during the 2016–17 season when RSV B was more prevalent, RSV A was still most prevalent in the age group 1 to 2 years-old (58.7%). The percentage of RSV B positive samples in other age groups were 52.9% in <1 year-old, 63.2% in 3 to 59 years-old and 73.4% > 60 years-old, respectively (Table 1). Interestingly, RSV B in subjects >60 years-old was higher than RSV A in both seasons (52.9%, in 2015–16; 73.4%, in 2016–17), suggesting preferential RSV B infection in older adults.
Phylogenetic analyses of RSV A and B fusion proteins
A total of 1,158F protein sequences (561 RSV A, 573 RSV B and 12 RSV A/B co-infections) from the two RSV seasons were aligned against the reference sequences derived from two 2013 clinical isolates RSVA/13–005275 (GenBank accession no. KX858757) and RSVB/13–001273 (GenBank accession no. KX858756). AA variation and phylogenetic analysis of the F protein sequences for RSV A F and RSV B F are summarized in Fig. 1. Overall, there was more genetic diversity in the RSV B F than the RSV A F. Out of the 574 AA residues in the entire F open reading frame (ORF), RSV A F showed variations in 45 (7.8% in 2015–16) and 63 (11.0% in 2016–17) positions in different viruses (ranging from 1–4 changes in any single virus compared with the reference strain). On the other hand, RSV B F had 56 (9.8% in 2015–16) and 91 (15.9% in 2016–17) amino acid positions in different viruses (ranging from 1–11 changes in any single virus relative to the reference strain) during these two seasons, respectively (Fig. 1A).
Since the extracellular region of the mature F protein (AA 24–109, 137–524) is the target of multiple anti-RSV mAb and vaccines in clinical development24,37, we compared the OUTSMART-RSV samples with the reference sequences for the extracellular region. Specifically, 161 out of the 186 RSV A F (86.6%) of the 2015–16 OUTSMART-RSV samples and 322/387 (83.2%) of the 2016–17 samples were identical to the mature F protein reference strain A/13-005275. Sequence diversity was noted in the signal peptide (AA 1–23) and cleaved P27 (AA 110–136) but none of the AA changes in the entire ORF had frequencies greater than 10% among the RSV A viruses compared with the 2013 RSV A reference strain (Fig. 1A). Phylogenetic analysis of the RSV A samples from both RSV seasons relative to their geography indicated that these variations in RSV A were neither temporally nor geographically distinct (Fig. 1B).
In contrast, RSV B F sequences from 2015–16 strains showed that the majority (88.3%, 113/128) contained A103V, L172Q and S173L changes at site V compared with the 2013 RSV B reference (Fig. 1A). These three changes likely emerged in the 2014–2015 RSV season as they are not seen in RSV F sequences before 2013 in the NCBI database (data not shown). Almost all samples (99.6%, 455/457) from 2016–17 had these three substitutions. Other AA changes in the RSV B F with a frequency greater than 10% included N169S (11.5%), P312H (14.5%) and S377N (14.5%) in the mature protein of RSV samples collected from the first season. Unlike the A103V/L172Q/S173L changes that were present in RSV B samples in the second RSV season, P312H and S377N had decreased from 14.5% in the first season to 0% and ~1% in the second season, while K191R (18.6%), I206M (18.6%) and Q209R (18.6%) appeared at a frequency greater than 10% (Fig. 1A).
Phylogenetic analysis of the RSV B F proteins showed that several AA changes occurred together. In addition to the A103V/L172Q/S173L changes present in almost all the F sequences, N169S, P312H/S377N and K191R/I206M/Q209R formed distinct clades (Fig. 1B). The samples containing P312H/S377N were almost all from the 2015–16 season (pink) and K191R/I206M/Q209R were exclusively from the 2016–17 season (purple). There was no clear correlation of the AA substitutions with geographic locations, suggesting that these different strains were not confined in specific regions in the US (Fig. 1B).
Emergence of novel F protein epitopes in site Ø, I – V
The RSV F protein contains six major antigenic sites (Ø, I-V)24,25,38. AA variations at these sites with frequency >1% in their respective seasons of the OUTSMART samples are labelled on the F protein structures for both the pre-fusion and post-fusion forms (Fig. 2, left panels). There were 10 and 16 amino acid variations across the antigenic sites of RSV A F protein over the two RSV seasons, respectively. Changes present at a frequency >1% included the K68N and I206T in site Ø and S276N at site II of RSV A F. In contrast, there were 19 and 34 different amino acid changes in the antigenic sites of RSV B F proteins for these two seasons, 15 of which had frequencies >1% at different antigenic sites (Fig. 2). These include double I206M/Q209R and single Q209K or Q209L substitutions at site Ø, P312H, S377N at site I, S276N at site II, L462Q, E463D/K, N466T/S, L467F at site IV, and S169S, L172Q, S173L, K191R and I291V changes at site V. AA at positions 169, 172, and 173 are in close proximity to one another in the pre-fusion F structure; AA 191 is distal to the other 3 AA in the post-fusion structure and closer to site Ø (Fig. 2, left panel). The I206M/Q209R at site Ø and K191R at site V emerged together during the second OUTSMART-RSV season. L172Q/S173L at the antigenic site V was observed in most of the 2015–16 samples (89.3%) and increased to 99.6% in the following 2016–17 season. The L172Q/S173L double substitutions were not present in RSV F sequences prior to 2014 in the NCBI database (data not shown). Several novel viruses appeared in 2016–17 including the strains carrying the I206M/Q209R double substitutions at site Ø along with a triple AA change (K191R/ L172Q/S173L) at site V. Interestingly, P312H and S377N at site III appeared in 2015–16 at 14.5% but disappeared in the following season. In summary, these data show that the RSV F protein is generally conserved, but there is evidence for continued changes at key antigenic sites on the F protein.
Phylogenetic analyses of the G protein second hyper-variable region of RSV A and B
Phylogenetic analysis of 836 samples from the 2016–17 RSV season using reference sequences of known G genotypes revealed that all RSV subtype A isolates were ON1 genotype and almost all RSV B isolates were of genotype BA9 except for one BA10 isolate (Fig. 3A). These findings are consistent with previously published reports that RSV ON1 and RSV BA9 genotypes have emerged as the predominant strains in the US and throughout the world12,13,14,15. Annotating the phylogenetic trees with the regions of the clinical samples collected showed that some sequences clustered together by geography. For example, some samples from the West (red) are clustered in clades in the left lower corner of the RSV A G tree. Overall, there is no clear geographic distribution differences among all the isolates.
Sequence analysis revealed multiple forms of the G protein with different lengths that were more frequent in RSV B than RSV A isolates (Fig. 3A). For RSV A, a single nucleotide mutation in the stop codon TGA to TTA (L) was observed in 2 samples (2/387, 0.5%) resulting in an extension of the G protein by 1 AA. In contrast, 101 of 457 (22.1%) RSV B samples had a mutation in the stop codon TAA to CAA (Q) resulting in the G protein extension by 7 AA (Q/K)-R-L-Q-S-Y-(H/A) (green dots in Fig. 3A). In addition, three RSV B samples (3/457, 0.7%) had a premature stop codon (GAG- > TAG) that shortened the G protein by 7 AA (deleting “P-S-T-S-N-S-T”, red dots in Fig. 3A).
All RSV A and B samples had the characteristic ON1 and BA genotype duplications in the G HVR2 with either 23 AA (AA 261–283 and 285–307 for RSV A)11 or 20 AA (AA 240–259 and 260–279 for RSV B)38. These duplicated regions have several highly conserved sequence motifs that can be used as markers to evaluate genetic diversity and viral evolution. One of the conserved motifs, “GYPSPSQ” of RSV A G in the first duplicated region, is identical to the corresponding sequence of the second duplicated region of the 2013 reference strain (Fig. 3B). Although 37.3% isolates had the same signature motif defined as “YP-YP” in both duplicated regions, other motif variations including “YL-YL” (25.3%), “YL-YP” (11.1%) and “HL-YP” (1.0%) were observed, suggesting continued evolution in this key neutralizing region of the G protein.
Similarly, the G protein of the RSV B reference strain contained conserved sequences in the duplicated regions, “STVLDTT” in the first duplicated region and “SIALDTT” in the second duplicated region (Fig. 3B). The majority of RSV B isolates (49.9%) contained the “TV-IA” sequence; “TV-IV” (12.3%) is the next major cluster, only 1.1% samples contained the same “TV-TV” sequence. Other combinations and mixed residues were also observed, suggesting constant evolutionary changes in the G gene. There are no clear geographic distribution differences among all the sequences. These data suggest that immune pressure on the G protein has led to continued evolution of the second hypervariable region with multiple strains now in circulation.
Discussion
The evolution of RSV strains is a dynamic process that involves periodic alternation in the dominance of subtypes A and B as well as sequence diversity in the G attachment protein10. In the OUTSMART-RSV study we showed that RSV A infection was more prevalent in subjects younger than 60 years old (<1, 1–2 and 3–59 years) in the 2015–16 season compared with the 2016–17 season (Table 1). However, RSV B was more prevalent in subjects greater than >60 years old in both seasons (52.9% in 2016–2017 and 73.4% in 2016–17). The difference in prevalence of RSV B in 1 to 2 year-old (62/150, 41.3%) versus >60 year-old group (47/64, 73.4%) was statistically significant (p < 0.0001). This data suggest that pre-existing immunity to RSV B in older people may be poorer than RSV A or due to the lower immunity to antigenically drifed comptaporary RSV B strains in the elderly39. Although the sample size of the older adult group was smaller than the infant group and 1 to 2 year-old group, sequence analysis of 81 isolates collected from the elderly provides a valuable set of data for comparison with the future isolates as we will continue characterizing RSV A and B isolates from different age groups in on-going global surveillance studies.
Immune pressure and lack of proof-reading capabilities of the viral RNA-dependent polymerase of RSV likely drive RSV evolution. This evolution is reflected in the sequence diversities in key antigenic regions of the F and G proteins. RSV genotypes have gone through many dynamic changes over the past several decades, wherein newly emergent genotypes replaces the older ones40,41. Both RSV A ON1 and RSV B BA genotypes represented by the duplicated sequence in the C-terminal G protein emerged independently at different times and became predominant RSV genotypes globally16,42. These genotypes have been suggested to alter viral antigenic properties to facilitate viral infection by evading the host immune response43, to enable virus spread efficiently or conferring a fitness advantage during circulation12.
The fast spread of the RSV A ON1 and RSV B BA genotypes has been accompanied with diversifications of the G protein within the genotypes such as protein length variation and sequence diversity in the duplication regions44,45. The biological and immunological significance of these extra amino acids in the G protein extracellular domain remains to be determined. The first reported RSV ON1 genotype (ON138–0111A) in 2012 had an identical 23-AA duplication (261–283 and 285–307)11. Similarly, the first reported RSV BA genotype (BA3833/99B) isolated in 1999 had an identical 20-AA duplication (240–259 and 260–279)38. The divergent sequences observed in these two duplicated regions of the G protein have been hypothesized to either occur in a convergent fashion40 or via multiple independent duplication events.
Although the F protein of RSV is more conserved compared with the G protein, variations at key antigenic sites have been identified in the F proteins of RSV A and B samples in this study (Fig. 2). Noticeably, the RSV B containing A103V/L172Q/S173L in the F protein emerged around 2014 has become the predominant circulating RSV B strain in the US. These substitutions were also reported in the RSV B samples during a similar surveillance period in China46, suggesting global circulation of this new RSV strain. Antigenic site Ø is the binding site for an mAb (MEDI8897) targeting the prefusion form F18,25,47 that is currently being evaluated in infants in Phase II clinical study. For the first time, we identified the emergence of the RSV B I206M/Q209R double changes at a frequency of 18.6%. The I206M/Q209R change at site Ø is also accompanied with a K191R change at site V in addition to the A103V/L172Q/S173L changes, suggesting a potential structural or functional relationship of these amino acids. The I206M/Q209R variant has been shown to be susceptible to MEDI8897 neutralization35. The other three RSV A and four RSV B variants with distinct polymorphisms in the MEDI8897 mAb binding region identified in this study were at low frequency (<2%) and did not affect viral susceptibility to MEDI889735. Various changes at antigenic site IV at 1–7% frequency and site I at <1% were also identified. These data indicate that none of the antigenic sites are invariant despite high sequence conservation of the F protein.
It is unclear from a structural prospective why some of the changes at the antigenic sites involved multiple residues, such as L172Q/S173L at site V and the A103V change outside of the described antigenic sites. Site V is an attractive target for a mAb and these changes could alter viral antigenicity and affect their susceptibility to mAb targeting this region48. Some of the changes occurred in both RSV A and RSV B at the same positions, such as S276N at site II, S377N at site III, which could be due to natural antibodies targeting both A and B subtypes. Other changes appear to come and go, such as S276N in RSV B that increased from 0.8% to 5.9% from the first season to the second season, while S377N in RSV B decreased from 14.5% to 1.1%, and N169S decreased from 11.5% to 1.1%. This data indicates that the 276 and 377 positions might be immunodominant sites for both RSV A and B strains. The changes at F protein antigenic sites may allow viruses to escape neutralization by anti-RSV antibodies from prior RSV infection, which can be examined by testing human sera against the viruses with and without an antigenic change.
The monitoring of the F and G protein antigenic site changes of RSV A and B strains also has implications on RSV vaccine development. In the past, RSV vaccine development has been mainly based on RSV A2 strain, which is a GA1 genotype49,50,51,52. Given the antigenic site change in the F proteins of RSV B isolates sometimes occurring with a frequency of 100% and the diversity of the G protein, it raises the question if the A2 strain or its F protein alone is appropriate or sufficient to be developed as a vaccine. In addition, there are a number of RSV F-based subunit vaccines in development that use either the pre-fusion or post-fusion forms of the F protein24. Characterizing the location of these naturally occurring polymorphisms in sites Ø - V may provide insights into the best antigen(s) to use in an F-based vaccine to protect against both contemporary RSV A and B subtypes.
In conclusion, the OUTSMART-RSV and international surveillance studies enable close monitoring of the molecular evolution of RSV. Characterizing antigenic changes in the RSV F and G proteins and shifts in susceptibility to anti-RSV therapy, will provide important information to help facilitate the development of prophylactic vaccines or therapeutic interventions.
Materials and Methods
RSV Sample Collection
The OUTSMART study collected RSV-positive samples from 25 RSVAlert® laboratories from 4 US census regions and Puerto Rico during the winter seasons of 2015–2017. Fourteen (2015–16, season 1) and 25 (2016–17, season 2) regional laboratories in the US and Puerto Rico provided 525 and 1041 RSV positive samples, respectively. A total of 1,146 samples (310 from season 1 and 836 from season 2) were included in the sequencing analysis. All nasal samples were placed in either viral transport medium (VTM) or universal transport medium (UTM) and stored at −70°C. They were shipped frozen on dry ice to a central biorepository, Fisher BioServices, Inc. (Rockville, MD) before being transferred to MedImmune (Mountain View, CA) for sequencing and data analysis.
The nasal sample collection was coordinated by IQVIA who has agreements with the sites participating in OUTSMART and are part of RSVAlert network (https://rsvalert.com). The samples are “diagnostic retains” that are completely de-identified and cannot be linked to the source individuals. Written informed consent was not obtained because sampling was non-invasive. The use of the materials for RSV sequencing was approved by AstraZeneca’s Biological Sample governance committee. All human samples including methods were performed in accordance with the relevant guidelines and regulations.
Sequencing of RSV G and F Genes
For the 2015–16 samples, RNA was extracted directly from 200 µL of RSV positive nasal samples using viral RNA Mini Kit (Qiagen, Valencia, CA). The RNA was eluted in 60 µL of RNase-free water, and then subjected to RT-PCR amplification of the entire F gene and G HVR2 in separated amplicons (primer sequence and RT-PCR conditions available upon request). Sanger sequencing and contig assemblies were performed at Sequetech Inc. (Mountain View, CA).
For the 2016–17 samples, viral RNA was extracted from 400 μL aliquots of RSV positive nasal specimens and eluted in 100 µL using the Nuclisens easyMAG® instruments (bioMerieux, Durham, NC). The RSV G HVR2 and the F genes were amplified using the purified RNA using the SuperScript III One-Step RT-PCR System (Invitrogen, CA) with forward primer, RSV_F5109–5129Y (5′ AGTGTTCAAYTTYGTWCCYTG 3′) and reverse primer RSV_R7654–7634 (5′ YTACCATTCAAGCAATGACCTC 3′), which were designed to amplify the RSV A and B genomes harboring the G HVR2 and the full-length F gene in a single 2.5 kb amplicon. 10 μL aliquots of amplified RT-PCR product were purified and normalized with the SequalPrep™ Normalization Plate (96) Kit (Invitrogen, CA) and eluted with 20 μL of elution buffer. Approximately 5 μL (~1 ng) of normalized PCR product was used for Nextera Library preparation according to the manufacture’s protocol (Illumina Nextera XT DNA Library Preparation Kit). Each sample was tagged with a unique barcode (96–384 per run), followed by the 2 × 250 bp paired-end sequencing protocol on the Illumina MiSeq instrument. Contigs were constructed from the de-multiplexed MiSeq reads using Geneious software (Version 10.0.9, Biomatters Inc. Newark, NJ). Curated assemblies were validated and annotated by visual inspection and quality control before sequence analysis.
Subtyping and Genotyping Analysis
The sequences were examined for quality and coverage and assigned an RSV subtype based on the alignment to RSV A and RSV B reference sequences. To make a RSV A and B co-infection call, the ratio of minimum coverage (depth) of the less predominant strain to predominant strain was set at ≥5%. Samples that did not generate at least 1000 mapped reads with at least 4-fold depth of coverage of both F and G genes were marked as “quality/quantity not sufficient” (QNS) and were excluded from the analysis.
Genotypes were assigned based on a best match in a nucleotide BLAST alignment of the second hypervariable region of the G gene against a database of reference sequences with known genotypes53. The assigned genotypes were confirmed by inspecting the clustering of sequences observed in a phylogenetic tree built from a joint set of all study sequences and the reference database sequences.
AA Sequence Analysis of the RSV F and G HVR2
The F gene sequences in FASTA format were translated to AA sequences and aligned to the reference F sequences derived from Netherlands RSVA/13–005275 (GenBank accession no. KX858757) and Netherlands RSVB/13-001273 (GenBank accession no. KX858756). Amino acid variations were determined from pairwise alignments of sample sequences to the references, which were reported as AA variant tables for the RSV A F and RSV B F proteins. Similarly, the G HVR2 sequences were aligned against reference sequences derived from HVR2 of Netherlands RSVA/13-005275 (GenBank accession no. KX858754) and Netherlands RSVB/13-001273 (GenBank accession no. KX858755) to generate AA variant tables. Due to incomplete sequences of some samples from the 2015–16 season, only G HRV2 of season 2 were included in the analysis.
Evolutionary analyses were conducted in MEGA754. RSV F and G gene sequences were translated into protein sequences and aligned using Muscle along with the reference sequences KX858757.1 and KX858756.1 for RSV A F and RSV B F, and KX858754.1 and KX858755.1 for RSV A G and B G, respectively. The evolutionary history was inferred by using the Maximum Likelihood method based on the JTT matrix-based model. Phylogenetic trees were visualized and annotated using ITOL v355. Trees were rooted to their respective reference sequences and annotated with metadata described in figure legends.
F proteins are visualized with PyMOL Molecular Graphics System, Version 2.0 (Schrödinger, LLC). Protein data bank (PDB) files 5UDE18 and 3RRR26 are used for the pre-fusion F and post-fusion F forms. Amino acid variations in the antigenic sites with frequency >1% are highlighted in black.
References
Shi, T. et al. Global, regional, and national disease burden estimates of acute lower respiratory infections due to respiratory syncytial virus in young children in 2015: a systematic review and modelling study. Lancet 390, 946–958, https://doi.org/10.1016/S0140-6736(17)30938-8 (2017).
Stockman, L. J., Curns, A. T., Anderson, L. J. & Fischer-Langley, G. Respiratory syncytial virus-associated hospitalizations among infants and young children in the United States, 1997–2006. Pediatr Infect Dis J 31, 5–9, https://doi.org/10.1097/INF.0b013e31822e68e6 (2012).
American Academy of Pediatrics Committee on Infectious, D. & American Academy of Pediatrics Bronchiolitis Guidelines, C. Updated guidance for palivizumab prophylaxis among infants and young children at increased risk of hospitalization for respiratory syncytial virus infection. Pediatrics 134, e620–638, https://doi.org/10.1542/peds.2014-1666 (2014).
Gutfraind, A., Galvani, A. P. & Meyers, L. A. Efficacy and optimization of palivizumab injection regimens against respiratory syncytial virus infection. JAMA Pediatr 169, 341–348, https://doi.org/10.1001/jamapediatrics.2014.3804 (2015).
Palivizumab, A. Humanized Respiratory Syncytial Virus Monoclonal Antibody, Reduces Hospitalization From Respiratory Syncytial Virus Infection in High-risk Infants. Pediatrics 102, 531–537 (1998).
Han, L. L., Alexander, J. P. & Anderson, L. J. Respiratory syncytial virus pneumonia among the elderly: an assessment of disease burden. J Infect Dis 179, 25–30, https://doi.org/10.1086/314567 (1999).
Falsey, A. R. & Walsh, E. E. Respiratory syncytial virus infection in elderly adults. Drugs Aging 22, 577–587 (2005).
Widjojoatmodjo, M. N. et al. A highly attenuated recombinant human respiratory syncytial virus lacking the G protein induces long-lasting protection in cotton rats. Virol J 7, 114, https://doi.org/10.1186/1743-422X-7-114 (2010).
Sullender, W. M. Respiratory syncytial virus genetic and antigenic diversity. Clinical microbiology reviews 13, 1–15, table of contents (2000).
Peret, T. C. et al. Circulation patterns of group A and B human respiratory syncytial virus genotypes in 5 communities in North America. J Infect Dis 181, 1891–1896, https://doi.org/10.1086/315508 (2000).
Eshaghi, A. et al. Genetic variability of human respiratory syncytial virus A strains circulating in Ontario: a novel genotype with a 72 nucleotide G gene duplication. Plos One 7, e32807, https://doi.org/10.1371/journal.pone.0032807 (2012).
Duvvuri, V. R. et al. Genetic diversity and evolutionary insights of respiratory syncytial virus A ON1 genotype: global and local transmission dynamics. Sci Rep 5, 14268, https://doi.org/10.1038/srep14268 (2015).
Zheng, Y. et al. Prevailing genotype distribution and characteristics of human respiratory syncytial virus in northeastern China. J Med Virol 89, 222–233, https://doi.org/10.1002/jmv.24640 (2017).
Fan, R. et al. Respiratory syncytial virus subtype ON1/NA1/BA9 predominates in hospitalized children with lower respiratory tract infections. J Med Virol 89, 213–221, https://doi.org/10.1002/jmv.24619 (2017).
Otieno, J. R. et al. Spread and Evolution of Respiratory Syncytial Virus A Genotype ON1, Coastal Kenya, 2010-2015. Emerg Infect Dis 23, 264–271, https://doi.org/10.3201/eid2302.161149 (2017).
Trento, A. et al. Ten years of global evolution of the human respiratory syncytial virus BA genotype with a 60-nucleotide duplication in the G protein gene. J Virol 84, 7500–7512, https://doi.org/10.1128/JVI.00345-10 (2010).
Hotard, A. L., Laikhter, E., Brooks, K., Hartert, T. V. & Moore, M. L. Functional Analysis of the 60-Nucleotide Duplication in the Respiratory Syncytial Virus Buenos Aires Strain Attachment Glycoprotein. J Virol 89, 8258–8266, https://doi.org/10.1128/JVI.01045-15 (2015).
Zhu, Q. et al. A highly potent extended half-life antibody as a potential RSV vaccine surrogate for all infants. Sci Transl Med 9, https://doi.org/10.1126/scitranslmed.aaj1928 (2017).
Melero, J. A. & Moore, M. L. Influence of respiratory syncytial virus strain differences on pathogenesis and immunity. Curr Top Microbiol Immunol 372, 59–82, https://doi.org/10.1007/978-3-642-38919-1_3 (2013).
McLellan, J. S. et al. Structure of RSV fusion glycoprotein trimer bound to a prefusion-specific neutralizing antibody. Science 340, 1113–1117, https://doi.org/10.1126/science.1234914 (2013).
Samuel, D. et al. GS-5806 inhibits pre- to postfusion conformational changes of the respiratory syncytial virus fusion protein. Antimicrob Agents Chemother 59, 7109–7112, https://doi.org/10.1128/AAC.00761-15 (2015).
DeVincenzo, J. P. et al. Activity of Oral ALS-008176 in a Respiratory Syncytial Virus Challenge Study. N Engl J Med 373, 2048–2058, https://doi.org/10.1056/NEJMoa1413275 (2015).
Mejias, A., Garcia-Maurino, C., Rodriguez-Fernandez, R., Peeples, M. E. & Ramilo, O. Development and clinical applications of novel antibodies for prevention and treatment of respiratory syncytial virus infection. Vaccine 35, 496–502, https://doi.org/10.1016/j.vaccine.2016.09.026 (2017).
Villafana, T., Falloon, J., Griffin, M. P., Zhu, Q. & Esser, M. T. Passive and active immunization against respiratory syncytial virus for the young and old. Expert Rev Vaccines 16, 1–13, https://doi.org/10.1080/14760584.2017.1333425 (2017).
Griffin, M. P. et al. Safety, Tolerability, and Pharmacokinetics of MEDI8897, the Respiratory Syncytial Virus Prefusion F-Targeting Monoclonal Antibody with an Extended Half-Life, in Healthy Adults. Antimicrob Agents Chemother 61, https://doi.org/10.1128/AAC.01714-16 (2017).
McLellan, J. S., Yang, Y., Graham, B. S. & Kwong, P. D. Structure of respiratory syncytial virus fusion glycoprotein in the postfusion conformation reveals preservation of neutralizing epitopes. J Virol 85, 7788–7796, https://doi.org/10.1128/JVI.00555-11 (2011).
Hause, A. M. et al. Sequence variability of the respiratory syncytial virus (RSV) fusion gene among contemporary and historical genotypes of RSV/A and RSV/B. Plos One 12, e0175792, https://doi.org/10.1371/journal.pone.0175792 (2017).
McLellan, J. S., Ray, W. C. & Peeples, M. E. Structure and function of respiratory syncytial virus surface glycoproteins. Curr Top Microbiol Immunol 372, 83–104, https://doi.org/10.1007/978-3-642-38919-1_4 (2013).
Sivapalasingam, S. et al. Phase 1 Study Evaluating Safety, Tolerability, Pharmacokinetics and Immunogenicity of REGN2222 in Healthy Adults: A New Human Monoclonal RSV-F Antibody for RSV Prevention. Open Forum Infectious Diseases 2, https://doi.org/10.1093/ofid/ofv133.628 (2015).
Broadbent, L., Groves, H., Shields, M. D. & Power, U. F. Respiratory syncytial virus, an ongoing medical dilemma: an expert commentary on respiratory syncytial virus prophylactic and therapeutic pharmaceuticals currently in clinical trials. Influenza and other respiratory viruses 9, 169–178, https://doi.org/10.1111/irv.12313 (2015).
Zhu, Q. et al. Analysis of respiratory syncytial virus preclinical and clinical variants resistant to neutralization by monoclonal antibodies palivizumab and/or motavizumab. The Journal of infectious diseases 203, 674–682, https://doi.org/10.1093/infdis/jiq100 (2011).
Zhao, X. & Sullender, W. M. In vivo selection of respiratory syncytial viruses resistant to palivizumab. J Virol 79, 3962–3968, https://doi.org/10.1128/JVI.79.7.3962-3968.2005 (2005).
DeVincenzo, J. P. et al. Surveillance of clinical isolates of respiratory syncytial virus for palivizumab (Synagis)-resistant mutants. J Infect Dis 190, 975–978, https://doi.org/10.1086/423213 (2004).
Adams, O. et al. Palivizumab-resistant human respiratory syncytial virus infection in infancy. Clin Infect Dis 51, 185–188, https://doi.org/10.1086/653534 (2010).
Zhu, Q. et al. Prevalence and significance of substitutions in the fusion protein of respiratory syncytial virus resulting in neutralization escape from antibody MEDI8897. J Infect Dis, https://doi.org/10.1093/infdis/jiy189 (2018).
McCarthy, M., Villafana, T., Stillman, E. & Esser, M. T. Respiratory syncytial virus protein structure, function and implications for subunit vaccine development. Future Virology 9, 753–767 (2014).
Rossey, I. et al. Potent single-domain antibodies that arrest respiratory syncytial virus fusion protein in its prefusion state. Nat Commun 8, 14158, https://doi.org/10.1038/ncomms14158 (2017).
Trento, A. et al. Major changes in the G protein of human respiratory syncytial virus isolates introduced by a duplication of 60 nucleotides. J Gen Virol 84, 3115–3120, https://doi.org/10.1099/vir.0.19357-0 (2003).
Falsey, A. R. et al. Respiratory syncytial virus and other respiratory viral infections in older adults with moderate to severe influenza-like illness. J Infect Dis 209, 1873–1881, https://doi.org/10.1093/infdis/jit839 (2014).
Schobel, S. A. et al. Respiratory Syncytial Virus whole-genome sequencing identifies convergent evolution of sequence duplication in the C-terminus of the G gene. Sci Rep 6, 26311, https://doi.org/10.1038/srep26311 (2016).
Otieno, J. R. et al. Molecular Evolutionary Dynamics of Respiratory Syncytial Virus Group A in Recurrent Epidemics in Coastal Kenya. J Virol 90, 4990–5002, https://doi.org/10.1128/JVI.03105-15 (2016).
Kim, Y. J. et al. Rapid replacement of human respiratory syncytial virus A with the ON1 genotype having 72 nucleotide duplication in G gene. Infect Genet Evol 26, 103–112, https://doi.org/10.1016/j.meegid.2014.05.007 (2014).
van Niekerk, S. & Venter, M. Replacement of previously circulating respiratory syncytial virus subtype B strains with the BA genotype in South Africa. J Virol 85, 8789–8797, https://doi.org/10.1128/JVI.02623-10 (2011).
Zhang, R. F. et al. Human respiratory syncytial virus in children with acute respiratory tract infections in China. J Clin Microbiol 48, 4193–4199, https://doi.org/10.1128/JCM.00179-10 (2010).
Choudhary, M. L., Anand, S. P., Wadhwa, B. S. & Chadha, M. S. Genetic variability of human respiratory syncytial virus in Pune, Western India. Infect Genet Evol 20, 369–377, https://doi.org/10.1016/j.meegid.2013.09.025 (2013).
Chen, X. et al. Genetic variations in the fusion protein of respiratory syncytial virus isolated from children hospitalized with community-acquired pneumonia in China. Sci Rep 8, 4491, https://doi.org/10.1038/s41598-018-22826-4 (2018).
Domachowske, J. B. et al. Safety, Tolerability, and Pharmacokinetics of MEDI8897, an Extended Half-Life Single-Dose Respiratory Syncytial Virus Prefusion F-Targeting Monoclonal Antibody Administered as a Single Dose to Healthy Preterm Infants. Pediatr Infect Dis J, https://doi.org/10.1097/INF.0000000000001916 (2018).
Jorquera, P. A. & Tripp, R. A. Respiratory syncytial virus: prospects for new and emerging therapeutics. Expert Rev Respir Med 11, 609–615, https://doi.org/10.1080/17476348.2017.1338567 (2017).
Schickli, J. H., Kaur, J. & Tang, R. S. Nonclinical phenotypic and genotypic analyses of a Phase 1 pediatric respiratory syncytial virus vaccine candidate MEDI-559 (rA2cp248/404/1030DeltaSH) at permissive and non-permissive temperatures. Virus Res 169, 38–47, https://doi.org/10.1016/j.virusres.2012.06.027 (2012).
Raghunandan, R. et al. An insect cell derived respiratory syncytial virus (RSV) F nanoparticle vaccine induces antigenic site II antibodies and protects against RSV challenge in cotton rats by active and passive immunization. Vaccine 32, 6485–6492, https://doi.org/10.1016/j.vaccine.2014.09.030 (2014).
Glenn, G. M. et al. A Randomized, Blinded, Controlled, Dose-Ranging Study of a Respiratory Syncytial Virus Recombinant Fusion (F) Nanoparticle Vaccine in Healthy Women of Childbearing Age. J Infect Dis 213, 411–422, https://doi.org/10.1093/infdis/jiv406 (2016).
Langley, J. M. et al. A Randomized, Controlled, Observer-Blinded Phase 1 Study of the Safety and Immunogenicity of a Respiratory Syncytial Virus Vaccine With or Without Alum Adjuvant. J Infect Dis 215, 24–33, https://doi.org/10.1093/infdis/jiw453 (2017).
Tabatabai, J., Prifert, C., Pfeil, J., Grulich-Henn, J. & Schnitzler, P. Novel respiratory syncytial virus (RSV) genotype ON1 predominates in Germany during winter season 2012–13. Plos One 9, e109191, https://doi.org/10.1371/journal.pone.0109191 (2014).
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol Biol Evol 33, 1870–1874, https://doi.org/10.1093/molbev/msw054 (2016).
Letunic, I. & Bork, P. Interactive tree of life (iTOL)v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res 44, W242–245, https://doi.org/10.1093/nar/gkw290 (2016).
Acknowledgements
We thank all the members of the RSVAlert® and OUTSMART-RSV laboratories for providing the RSV-positive samples; Fisher BioServ, IQVIA-central labs, LakePharma for their contributions for sample storage and testing; Marla Chu for technical assistance; David Wen and Patrick McTamney for their help with the F protein graphics; Sandrine Buisson for reviewing the manuscript; Fiona Fernandes, Li Yu, Tonya Villafana, JoAnn A. Suzich for helpful discussions.
Author information
Authors and Affiliations
Contributions
B.L. sequenced half samples, analysed data and wrote the paper. H.L. sequenced half samples. D.E.T. drew the phylogenetic trees. A.T. developed sequence analysis tools. Y.Q. analyzed metadata and calculated statistics. A.R. coordinated sample collection and did metadata analysis. M.T.E. perceived the idea, coordinated the study and wrote the paper. H.J. guided the study method development, data analysis and wrote the paper.
Corresponding author
Ethics declarations
Competing Interests
All authors are employees of MedImmune/AstraZeneca and are the share holders of AstraZeneca. This does not alter our adherence to Scientific Reports’ policies on sharing data and materials. Medimmune/AstraZeneca had no direct roles in study design, data collection, analysis, and manuscript preparation.
Additional information
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bin Lu, Liu, H., Tabor, D.E. et al. Emergence of new antigenic epitopes in the glycoproteins of human respiratory syncytial virus collected from a US surveillance study, 2015–17. Sci Rep 9, 3898 (2019). https://doi.org/10.1038/s41598-019-40387-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-019-40387-y
- Springer Nature Limited
This article is cited by
-
Differences Between RSV A and RSV B Subgroups and Implications for Pharmaceutical Preventive Measures
Infectious Diseases and Therapy (2024)
-
Antibody therapies set to transform respiratory syncytial virus prevention for babies
Nature (2023)
-
Prevalence of viral pathogens in a sample of hospitalized Egyptian children with acute lower respiratory tract infections: a two-year prospective study
Bulletin of the National Research Centre (2022)
-
Genetic diversity and molecular evolution of human respiratory syncytial virus A and B
Scientific Reports (2021)
-
Distinct patterns of within-host virus populations between two subgroups of human respiratory syncytial virus
Nature Communications (2021)