Glycoprotein N serves multiple crucial functions in the HCMV life cycle. As a component of the gC-II complex, it is directly involved in virus attachment to the host cell and subsequent spread of infection . The gC-II complex participates in the initial interactions between the virus and cellular receptors, facilitating the complex process of viral entry. The presence of gpUL73 on the cell surface of infected cells further emphasizes its role in cell-to-cell spread of the virus, which represents an important mechanism for viral dissemination while potentially avoiding exposure to neutralizing antibodies in the extracellular environment .
The UL73 locus exhibits remarkable genetic diversity, making it one of the most polymorphic regions in the HCMV genome. While certain genes in HCMV show relatively low nucleotide variation (5-10%), the UL73 gene possesses highly hypervariable regions with approximately 50% variability . This extraordinary level of polymorphism has led to the classification of UL73 into seven distinct genotypes, formally designated as gN1, gN2, gN3a, gN3b, gN4a, gN4b, and gN4c . These genotypes differ primarily in their nucleotide sequences, resulting in amino acid variations that potentially affect the protein's structure, function, and antigenicity.
Sequencing and phylogenetic analysis of UL73 from numerous clinical isolates have confirmed the existence of these genotypes and elucidated their evolutionary relationships . The identification of the gN-3b subgroup as a novel variant within the gN-3 genotype demonstrates the ongoing process of discovery and refinement in our understanding of UL73 diversity . These distinct genetic variants may have evolved in response to various selective pressures, including host immune responses and adaptation to different cellular environments. The persistence of multiple genotypes in global HCMV populations suggests that this diversity confers evolutionary advantages to the virus.
Analysis of the evolutionary forces acting on UL73 has provided valuable insights into the mechanisms driving its genetic diversity. Research examining the number of non-synonymous (dN) and synonymous (dS) nucleotide substitutions and their ratio (dN/dS) among the gN genotypes has revealed intriguing patterns of selection . While the four major variants appear to have evolved primarily through neutral (random) selection processes, evidence indicates that the gN-3 and gN-4 genotypes are maintained by positive selective pressure . This positive selection suggests that amino acid changes in these variants confer advantages that promote their persistence in viral populations.
The selective forces acting on UL73 likely reflect the protein's exposure to host immune responses, particularly neutralizing antibodies. The high degree of polymorphism observed in the exposed regions of gpUL73 may enable the virus to evade recognition by antibodies targeting this glycoprotein, thereby facilitating viral persistence and potential reinfection in seropositive individuals . This immune evasion strategy would explain the maintenance of multiple genotypes within host populations and the ongoing evolution of new variants. Additionally, the different UL73 genotypes may possess varying functional properties that optimize viral fitness in different cellular contexts or host genetic backgrounds.
The global distribution of UL73 genotypes has been extensively investigated through the analysis of HCMV clinical isolates from diverse geographical regions. Studies have identified four main regions of gN prevalence: Europe, China, Australia, and Northern America . Interestingly, the distribution of gN variants appears to be relatively uniform across these regions, with no significant differences in genotype frequency between geographical areas . This widespread distribution suggests that all major UL73 genotypes have been maintained throughout the global spread of HCMV, indicating their fundamental importance to viral fitness.
The lack of geographical clustering for specific genotypes contrasts with the patterns observed for some other polymorphic viral genes and may reflect the ancient origin of these variants. The maintenance of all genotypes across diverse human populations suggests that none has a decisive advantage that would lead to its fixation in the viral population. Instead, the coexistence of multiple genotypes may represent a balanced polymorphism maintained by frequency-dependent selection, where the fitness of each variant depends on its relative frequency in the population.
Recombinant Human cytomegalovirus Envelope glycoprotein N (UL73) represents an important tool for research, diagnostic, and potentially therapeutic applications. Commercially available recombinant UL73 proteins have been developed for various research purposes . These recombinant proteins enable the generation and characterization of UL73-specific antibodies, which are essential tools for studying the expression, localization, and function of this glycoprotein in infected cells . Such antibodies facilitate techniques like immunoblotting and immunofluorescence, which have been instrumental in characterizing gpUL73 properties.
The use of recombinant UL73 in diagnostic applications leverages the genetic diversity of this glycoprotein for HCMV strain identification and epidemiological studies. Researchers have developed rapid and low-cost methods for genetic grouping of HCMV clinical isolates based on restriction fragment length polymorphism (RFLP) analysis of PCR-amplified UL73 sequences . This technique enables the distinction of all four major gN genomic variants and their subtypes, facilitating large-scale epidemiological studies and routine clinical genotyping. The correlation between certain UL73 genotypes and clinical outcomes further enhances the diagnostic value of UL73 genotyping in predicting disease severity and guiding treatment decisions.
Genotyping of UL73 has become an important approach for characterizing HCMV strains in clinical and research settings. A nested PCR approach with two pairs of primers has been successfully used to amplify UL73 gene fragments from various clinical specimens . In one study, researchers performed a nested PCR with two pairs of primers to amplify the UL73 gene fragment, followed by sequencing of positive products . Phylogenetic analysis was then conducted using the neighbor-joining method with 1,000 bootstrap replications for branch support .
This methodology has enabled the successful genotyping of UL73 in various clinical samples, including those from congenitally infected infants. In one study of symptomatic congenitally CMV-infected neonates, researchers were able to successfully genotype UL73 in 28/42 (66.7%) of subjects . This relatively high success rate demonstrates the feasibility of UL73 genotyping in clinical practice, despite the challenges associated with detecting and characterizing viral genetic material in clinical specimens with potentially low viral loads.
The genetic diversity of UL73 has significant implications for HCMV pathogenesis and clinical outcomes. Studies have identified associations between specific UL73 genotypes and disease manifestations, particularly in congenital CMV infection. Research examining the relationship between UL73 genotypes and clinical findings in congenitally infected infants found a significant association between the gN1 genotype and neurologic findings at birth (P = .029) . Logistic regression analysis, adjusted for treatment with CMV-specific hyperimmune globulin during pregnancy, yielded an odds ratio of 7.0 (95% CI, 1.1–45.9; P = .043), indicating that infants infected with gN1 strains had a substantially higher risk of neurological complications .
Further supporting the clinical relevance of gN1, infants with this genotype exhibited significantly lower platelet counts at birth compared to those infected with other gN variants (120,400 cells/L vs 238,550 cells/L; P = .037) . Thrombocytopenia is a recognized manifestation of severe congenital CMV infection and may serve as a marker for more extensive disease. Although the association between gN1 and abnormal imaging findings did not reach statistical significance (P = .081), the trend suggests a potential relationship that warrants further investigation in larger cohorts .
Congenital CMV infection represents a significant cause of long-term neurodevelopmental sequelae, and identifying viral factors that influence disease severity is crucial for risk stratification and management. The dominance of specific UL73 genotypes in symptomatic congenital infections provides valuable insights into the potential role of this glycoprotein in determining clinical outcomes. In one study of symptomatic congenitally CMV-infected neonates, gN1 was the most prevalent genotype, detected in 42.9% of cases . This preponderance of gN1 in symptomatic cases, combined with its association with neurological findings, suggests that this variant may possess unique properties that enhance its pathogenicity in the developing nervous system.
The distribution of UL73 genotypes in congenitally infected infants has been characterized in several studies. In one cohort of 42 symptomatic congenitally CMV-infected neonates, researchers successfully genotyped UL73 in 28 subjects and found the following distribution: gN1 (12/28, 42.9%), gN3a (6/28, 21.4%), gN3b (3/28, 10.7%), gN4a (3/28, 10.7%), gN4b (2/28, 7.1%), gN4c (1/28, 3.6%), and gN2 (1/28, 3.6%) . The presence of all seven genotypes in this cohort, albeit with varying frequencies, underscores the extensive diversity of UL73 even within a relatively small sample population.
HCMV infections frequently involve multiple viral strains, as evidenced by the detection of different genotypes within the same host. Studies examining CMV genetic diversity through genotyping of multiple genes, including UL73, have revealed that mixed infections are common even in immunocompetent individuals . In a study of healthy adolescent females with primary CMV infection, mixed infection (presence of multiple viral strains) was documented in all subjects within three months after primary infection . At the time of primary infection, 5 of 12 (42%) urine samples had multiple virus strains, indicating that acquisition of diverse viral populations can occur early in the course of infection .
The presence of multiple UL73 genotypes within a single individual may result from simultaneous infection with different strains or sequential acquisition of new strains over time. The detection of different genotypes in different compartments (urine, saliva, plasma) within the same individual further complicates the picture, suggesting compartmentalized infection or selective replication of particular variants in specific tissues . This compartmentalization may reflect differences in tissue tropism among UL73 variants, potentially related to their varying abilities to interact with cell-type-specific receptors or evade local immune responses.
Longitudinal studies tracking CMV genotypes over time have revealed dynamic changes in the predominant UL73 variants within infected individuals. Changes in genotypes over time were observed in all subjects in one study of primary CMV infection, indicating ongoing evolution of the viral population . These changes may result from several processes, including differential growth of co-infecting strains, recombination between strains, de novo mutation, or reinfection with new strains. The temporal dynamics of UL73 genotypes may also reflect changes in immune pressure, with certain variants being selected for their ability to evade emerging immune responses.
The observation that 50% of vaccine recipients in one study were infected with the gB1 genotype (the vaccine strain) raises intriguing questions about potential interactions between vaccination and subsequent natural infection . While this finding could simply reflect the prevalence of gB1 in the population, it might also suggest that vaccination modifies the dynamics of natural infection, potentially through selection pressure exerted by vaccine-induced immunity. The complex interplay between vaccination, natural infection, and viral diversity underscores the challenges in developing effective strategies for preventing and controlling CMV infection.
The characterization of UL73 genotypes in clinical and research settings employs various molecular techniques. Nested PCR amplification of the UL73 gene fragment, followed by sequencing, represents the gold standard for genotype determination . Phylogenetic analysis of the resulting sequences, typically using the neighbor-joining method, allows for accurate classification by comparison with reference sequences . This approach provides the most detailed information about genetic variations but may be resource-intensive for large-scale studies.
For epidemiological investigations and routine clinical applications, researchers have developed more streamlined methods. A particularly valuable approach involves RFLP analysis of PCR-amplified UL73 sequences using specific restriction enzymes (SacI, ScaI, and SalI) . This technique enables distinction of all major gN genomic variants and their subtypes without the need for sequencing, providing a rapid and cost-effective method for genetic grouping of HCMV clinical isolates . The development of such practical genotyping methods has facilitated larger-scale studies of UL73 diversity and its clinical implications.
KEGG: vg:3077417
UL73 exhibits significant amino acid sequence variation among HCMV isolates, particularly in the N-terminal portion. Phylogenetic analysis has established 8 distinct genotypes of UL73, characterized by high levels of intergenotypic diversity and low levels of intragenotypic diversity. These genotypes are strongly linked to UL74 genotypes, with only 7 recombinants noted among 243 genome sequences analyzed. This linkage pattern suggests a key role for homologous recombination in HCMV evolution and potential functional constraints on these interacting proteins .
The glycoprotein N (UL73) forms a complex with glycoprotein M (UL100) that elicits antibody responses during HCMV infection. This gM/gN complex, along with UL74/gO, constitutes part of the viral entry complex and plays critical roles in viral exocytosis, cellular tropism, and modulation of antibody neutralization. The complex formation likely explains the evolutionary conservation of linkage patterns between UL73 and UL74 genotypes, reflecting functional interdependence between these viral components .
Researchers typically employ immunoblot and immunofluorescence techniques using UL73-specific antisera. Common experimental approaches include calcium chloride-mediated transfection of approximately 2 μg of UL73-encoding plasmid DNA (such as pcDNA3.1 or pEF-1myc/his vectors) into HEK293T cells, with expression confirmation through immunofluorescent reactivity with specific monoclonal antibodies. For detailed localization studies, imaging is often performed using transfected Cos-7 cells processed through standardized immunofluorescence protocols .
Several complementary methodologies have proven effective for studying UL73 polymorphisms:
Sequence-based genotyping: Extraction and phylogenetic analysis of UL73 nucleotide sequences from complete genome sequences has confirmed the existence of 8 genotypes with distinctive evolutionary patterns .
Motif-based identification: Researchers have developed discriminative sequence motifs for each UL73 genotype. These include short motifs (approximately 12 nucleotides) near the 5' end of each UL73 genotype and longer, non-redundant motifs that provide improved discrimination. For accurate genotyping, threshold requirements (>25 reads and >5% of total reads) help eliminate false positives .
Comparative analysis: Analysis of datasets after purging human reads significantly improves genotype discrimination accuracy, particularly when working with clinical samples that contain significant proportions of non-viral reads .
Multiple gene confirmation: Confirming genotype assignments across multiple hypervariable genes (such as UL73, UL74, RL12, UL146) provides more reliable strain identification in complex multi-strain infections .
Multiple-strain HCMV infections pose significant analytical challenges, with some individuals harboring up to 5 different strains simultaneously. The following approaches have proven effective:
Genotype threshold establishment: A genotype is typically considered present when represented by >25 reads and >5% of the total number of reads detected for all genotypes of that gene .
Multi-gene concordance: The number of strains is best scored as the greatest number of genotypes detected using long motifs for at least 2 genes, improving confidence in strain identification .
Minority strain detection: Using specific sequence motifs can identify minority strains, though those present at <5% may not reliably score with standard thresholds .
The table below illustrates genotype distribution patterns observed in clinical samples with multiple HCMV strains:
| Donor | Dataset | Strains | UL73 Genotypes | UL74 Genotypes | Other Notable Genotypes |
|---|---|---|---|---|---|
| 141 | 141R16 | 2 | 5, 6 | 2, 3A | UL146: 4B; UL139: 3, 8 |
| 173 | 173R16 | 5 | 1, 3, 5 | 1, 3A, 4A, 4B | UL146: 1A, 4B |
| 174 | 174L16 | 3 | 1, 4 | 1, 2, 4A | UL146: 2B, 3A |
| 259 | 259L16 | 3 | 1, 4, 6 | 1, 2, 4B | UL146: 4B |
| 288 | 288R4 | 3 | 1 | 1, 4B, 4D | UL146: 2B, 4B |
Table derived from study data analyzing UL73 genotypes in multiple-strain HCMV infections
Given that different UL73/gN genotypes may influence viral properties, several experimental approaches are valuable:
Recombinant expression systems: Cloning the UL73 coding sequence from different genotypes into expression vectors (pcDNA3.1 or pEF-1myc/his) allows comparative functional studies in cell culture models .
Co-expression studies: Co-transfection of cells with plasmids encoding UL73 and UL100 (gM) enables investigation of complex formation efficiency between different genotype combinations .
Viral tropism assays: Since the gM/gN complex affects cellular tropism, comparative infection studies using recombinant viruses with different UL73 genotypes can reveal functional differences in cell entry and replication efficiency.
Immune evasion analysis: As UL73/gN appears related to immune evasion mechanisms, neutralization assays comparing antibody effectiveness against different UL73 genotypes provide insights into how polymorphism affects immune recognition .
The extensive O-glycosylation of gN (creating a broad 39-53 kDa band in purified virus versus 15-18 kDa in transiently expressed protein) has significant implications for functional studies :
Glycosylation analysis: O-glycosidase digestion of purified gN allows investigation of glycosylation patterns across different genotypes and their impact on protein function.
Epitope accessibility: Glycosylation may shield important epitopes, necessitating careful design of deglycosylation protocols for antibody production and immunological studies.
Interaction mapping: Differential glycosylation may affect gN interactions with gM and other viral components, requiring studies with both native and deglycosylated forms to map functional domains.
Expression system selection: Researchers must select expression systems capable of appropriate post-translational modifications, as bacterial systems will not reproduce the O-glycosylation pattern observed in mammalian cells .
Several methodological challenges must be addressed:
Time-course analysis: Since gN is expressed with true-late kinetics, precise time-course experiments post-infection are necessary to capture its expression and localization dynamics .
Protein-protein interaction studies: Investigating gN interactions with other viral components requires techniques such as co-immunoprecipitation, proximity ligation assays, or FRET to detect transient or weak interactions.
Subcellular localization: The dual localization of gN in perinuclear granular formations and on the cell surface necessitates careful subcellular fractionation and high-resolution imaging to track its trafficking through the infected cell .
Structural constraints: The transmembrane nature of gN creates challenges for structural biology approaches, potentially requiring detergent solubilization strategies that maintain native conformation.