Human DGCR6 (DiGeorge Critical Region 6) exists in two functional copies within the human genome, both located on chromosome 22q11. The original copy (DGCR6) is positioned within a low copy repeat termed sc11.1a, while its paralog (DGCR6L) is found within the duplicate locus sc11.1b . Both genes share the same genomic structure, containing five exons of equal length with conserved intron/exon boundaries .
When investigating DGCR6, researchers should note that the putative initiator methionine is located further upstream than originally described in early publications, which had incorrectly identified a frameshift mutation . The corrected full-length coding sequence encodes proteins that are 220 amino acids in length for both DGCR6 and DGCR6L .
Distinguishing between DGCR6 and DGCR6L presents a significant methodological challenge due to their high sequence similarity (97% identity at both nucleotide and protein levels). The most reliable approach utilizes single nucleotide differences between the two genes. Specifically:
PCR-RFLP analysis targeting a C/T difference at position 167/168 (DGCR6/DGCR6L) that creates a PvuII restriction site in DGCR6 but not in DGCR6L can effectively differentiate the genes .
Amplification of regions containing the seven amino acid differences can allow for sequence-based discrimination.
When analyzing expression patterns, researchers should be aware that both genes demonstrate widespread expression, though DGCR6L appears absent in adult skeletal muscle and small intestine tissue samples . For more precise discrimination in experimental settings, designing PCR primers that span unique regions of each gene and validating them in hemizygous deletion patients (VCFS/DGS) who have only one copy of each gene can provide reliable differentiation .
While the precise function of DGCR6 remains unknown, structural analysis reveals significant evolutionary conservation suggesting important biological roles. The human DGCR6 protein shares approximately 92% and 77% amino acid identity with mouse and chicken orthologs respectively, indicating strong selective pressure for functional conservation across vertebrates .
Comparative analysis with its Drosophila homolog gdl (gonadal) shows 30-35% identity, with the highest homology occurring between amino acid positions 140-188 of the human protein . This region likely represents a functionally significant domain that has been maintained throughout evolution.
The table below summarizes key structural comparisons between DGCR6 proteins across species:
Species Comparison | Amino Acid Identity | Notable Features |
---|---|---|
DGCR6 vs. DGCR6L (human) | 97% | 7 amino acid differences |
Human DGCR6 vs. Mouse | 92% | Conserved initiation codon position |
Human DGCR6 vs. Chicken | 77% | Maintained functional domains |
Vertebrate DGCR6 vs. Drosophila gdl | 30-35% | Highest homology between positions 140-188 |
To investigate the function of DGCR6, researchers should consider experimental approaches that examine protein-protein interactions, subcellular localization, and targeted gene manipulation in model systems .
Expression analysis of DGCR6 and DGCR6L reveals that both genes are widely expressed across human tissues, though with subtle but potentially significant differences in their expression patterns. Research methodologies for investigating these expression differences have utilized tissue-specific cDNA panels and PCR-based approaches exploiting the nucleotide differences between the two genes .
Key findings regarding expression patterns include:
DGCR6 appears to be universally expressed across all tissues examined .
DGCR6L shows widespread expression but appears absent in adult skeletal muscle and small intestine .
Both genes show expression in fetal tissues, suggesting developmental roles .
When designing expression studies, researchers should be aware that tissue-specific cDNAs used in previous research were synthesized from pooled samples of multiple unrelated individuals (ranging from 3 for brain and heart to 550 for peripheral blood leukocytes), which provides robust representation of normal expression patterns .
For developmental expression analysis, reference can be made to mouse studies which demonstrated high levels of Dgcr6 expression in brain, neural tube, pharyngeal arches and nasal process at embryonic day 11.5—regions implicated in the etiology of VCFS/DGS .
The evolutionary history of DGCR6 duplication represents a fascinating case study in gene retention and divergence. Through FISH mapping in various ape species combined with sequence analysis across primate lineages, researchers have determined that the duplication that generated DGCR6 and DGCR6L is at least 12 million years old and may even predate the divergence of Catarrhines from Platyrrhines (approximately 35 million years ago) .
To investigate evolutionary dynamics of duplicated genes like DGCR6, researchers should employ:
Comparative genomic approaches across primate species
Analysis of synonymous versus non-synonymous substitution rates
Examination of regulatory region divergence
The maintenance of both DGCR6 paralogs suggests selective evolutionary pressure toward functional preservation of both copies. This phenomenon aligns with models of paralog retention where asymmetric mutations accumulate in duplicated genes, altering their efficacy or specificity of function . The slight differences in expression patterns between DGCR6 and DGCR6L support this model of subfunctionalization .
Interestingly, the PRODH gene was also duplicated within the same region, but unlike DGCR6, its paralog accumulated mutations and diverged significantly, highlighting different evolutionary trajectories for duplicated genes in the same genomic neighborhood .
A distinctive feature of the human DGCR6 locus is the integration of a full-length HERV-K provirus into the sc11.1a locus after the divergence of chimpanzees and humans . This human-specific insertion represents a significant genomic difference that may potentially influence regulation or function of DGCR6 in humans compared to other primates.
Researchers investigating primate-specific aspects of DGCR6 should employ:
Comparative genomic sequence analysis across primate species
Functional analysis of the HERV-K element's potential impact on DGCR6 expression
Evolutionary dating techniques to precisely define when genomic changes occurred
The q11 region of human chromosome 22 is notably rich in low copy repeat families, making it predisposed to rearrangements that can cause congenital anomaly disorders . The expansion of these low copy repeats has likely contributed to both genome rearrangements and gene amplifications in the region, including the duplication that generated the two DGCR6 copies .
Both DGCR6 and DGCR6L are deleted in most individuals with velo-cardio-facial syndrome/DiGeorge syndrome (VCFS/DGS), as they map immediately adjacent and internal to the low copy repeats (LCR22) that mediate the deletions associated with these disorders . The potential contribution of DGCR6 deletion to disease pathology represents an important area of investigation.
When researching DGCR6's role in VCFS/DGS pathology, scientists should consider:
Expression analysis in developmentally relevant tissues affected in VCFS/DGS
The impact of gene dosage reduction on downstream molecular pathways
The potential functional redundancy between DGCR6 and DGCR6L
Mouse studies have shown that Dgcr6 is expressed at high levels in brain, neural tube, pharyngeal arches, and the nasal process during embryonic development—regions implicated in the etiology of VCFS/DGS . This expression pattern supports the hypothesis that reduced dosage of the DGCR6 genes could contribute to the phenotypes associated with these disorders .
Methodologically, researchers investigating DGCR6's role in disease should utilize:
Animal models with targeted deletions
Patient-derived cells to examine gene dosage effects
Molecular pathway analysis to identify downstream consequences of DGCR6 deficiency
The originally published cDNA sequence of DGCR6 contained a frameshift mutation compared to the EST database and corrected sequences . Rather than representing a sequencing error, this may represent a null allele in the population . Similarly, a DGCR6L cDNA with alternative splicing due to a G to A mutation at the 5′ splice donor between exons 3 and 4 has been identified .
These observations suggest that since the sc11.1 duplication resulted in two copies of the DGCR6 gene, null mutations may have been tolerated and fixed in the population due to functional redundancy between the genes .
For comprehensive genetic variation analysis, researchers should:
Sequence both DGCR6 and DGCR6L in diverse population cohorts
Functionally characterize identified variants
Correlate genotypes with potential phenotypic consequences
Investigating DGCR6 function requires multifaceted experimental approaches given its unknown precise role and the presence of two paralogs in humans. Researchers should consider these methodological strategies:
Gene Editing Approaches:
CRISPR/Cas9-mediated knockout of DGCR6, DGCR6L, or both in cell models
Creation of knock-in models with tagged versions for localization studies
Generation of specific amino acid substitutions to test functional hypotheses
Expression Analysis:
RT-PCR approaches exploiting nucleotide differences between paralogs
Single-cell RNA sequencing to identify cell type-specific expression patterns
Spatial transcriptomics to map expression in developmental contexts
Functional Assays:
Protein interaction studies using immunoprecipitation or proximity labeling
Subcellular localization analysis via immunofluorescence
Phenotypic screening following gene manipulation
Animal Models:
When designing experiments, researchers should consider that the Drosophila homolog gdl is expressed specifically during gametogenesis and in adult reproductive organs , which might suggest specialized functions in certain cell types despite the widespread expression pattern observed in mammals.
Discriminating between the specific functions of DGCR6 and DGCR6L presents a significant challenge due to their high sequence similarity. Methodological approaches to address this challenge include:
Paralog-Specific Knockdown/Knockout:
Rescue Experiments:
Knockout both genes followed by selective rescue with either DGCR6 or DGCR6L
Create chimeric proteins to identify which domains contribute to unique functions
Expression System Analysis:
Protein Structure-Function Analysis:
When performing these experiments, researchers should consider that functional differences might be subtle or context-dependent, as evolutionary pressure has maintained both genes despite their high similarity .
Despite progress in characterizing DGCR6, several fundamental questions remain unresolved:
Function Determination: The precise molecular and cellular functions of DGCR6 and DGCR6L remain unknown despite their evolutionary conservation . Future research should focus on protein interaction networks and subcellular localization to elucidate function.
Developmental Roles: While expression patterns suggest involvement in embryonic development (particularly in structures affected in VCFS/DGS) , the specific developmental processes regulated by DGCR6 require further investigation.
Paralog-Specific Functions: The evolutionary maintenance of both DGCR6 and DGCR6L suggests non-redundant functions , but the nature of these specialized roles remains unclear.
Disease Contribution: The extent to which DGCR6/DGCR6L deletion contributes to VCFS/DGS phenotypes has not been fully determined . Separating their contribution from other genes in the 22q11 deletion region represents a significant challenge.
Regulatory Mechanisms: The regulation of DGCR6 expression, including potential impacts of the HERV-K insertion in humans , remains to be characterized.
To address these questions, researchers should combine genomic, proteomic, and functional approaches while leveraging emerging technologies such as single-cell analyses and advanced imaging techniques.
Emerging technologies offer promising approaches to resolve longstanding questions about DGCR6:
Single-Cell Omics:
Single-cell RNA sequencing can reveal cell type-specific expression patterns
Single-cell ATAC-seq can identify regulatory elements controlling expression
Spatial transcriptomics can map expression in developmental and tissue contexts
Proteomics Approaches:
Proximity labeling methods (BioID, TurboID) can identify protein interaction partners
Mass spectrometry-based approaches to detect post-translational modifications
Structural biology techniques to determine protein structure
Advanced Genetic Engineering:
Base editing or prime editing for precise modification of specific nucleotides
Conditional knockouts to study temporal requirements
Tissue-specific manipulation to address systemic effects
Patient-Derived Models:
iPSC models from VCFS/DGS patients to study developmental impacts
Organoid systems to model tissue-specific effects of DGCR6 deletion
CRISPR-based reversion of deletions to assess phenotypic rescue
Computational Approaches:
Phylogenetic analysis across diverse species to identify conserved functional domains
Protein structure prediction using AI tools like AlphaFold
Network analysis to position DGCR6 within biological pathways
The integration of these technologies, combined with traditional genetic and biochemical approaches, holds promise for resolving the biological functions of DGCR6 and its contribution to human disease. Researchers should consider multi-disciplinary collaborations to leverage these diverse methodological approaches.
DiGeorge Syndrome Critical Region Gene 6 (DGCR6) is a protein-coding gene located on chromosome 22q11.2. This gene is associated with DiGeorge syndrome, a disorder caused by the deletion of a small segment of chromosome 22. The syndrome is also known as CATCH 22, which stands for Cardiac defects, Abnormal facies, Thymic hypoplasia, Cleft palate, and Hypocalcemia, all of which are common features of the disorder .
DGCR6 shares homology with the Drosophila melanogaster gonadal protein, which is involved in gonadal and germ cell development, and with the gamma-1 subunit of human laminin, which plays a role in cell attachment and migration . The gene is thought to be involved in the migration of neural crest cells into the third and fourth pharyngeal pouches during embryonic development .
Microdeletions in the 22q11.2 region, where DGCR6 is located, are implicated in DiGeorge syndrome and other related disorders such as Velocardiofacial syndrome . These deletions can lead to a variety of developmental anomalies, including congenital heart defects, immune deficiencies, and facial dysmorphisms . Additionally, DGCR6 has been suggested as a candidate gene for involvement in schizophrenia .
Human recombinant DGCR6 is a form of the protein that is produced through recombinant DNA technology. This involves inserting the DGCR6 gene into a suitable expression system, such as bacteria or yeast, to produce the protein in large quantities. Recombinant proteins are valuable for research and therapeutic purposes, as they allow scientists to study the protein’s function and role in disease more effectively.
Research on DGCR6 and its recombinant form is ongoing, with studies focusing on its role in neural crest cell migration and its potential involvement in various diseases. Understanding the function of DGCR6 could lead to new insights into the mechanisms underlying DiGeorge syndrome and related disorders, as well as potential therapeutic targets for these conditions.