Recombinant Human Putative uncharacterized protein C18orf62 (C18orf62) is a transmembrane protein encoded by the SMIM21 gene (NCBI GeneID: 284274) located on chromosome 18q23. It is a synthetic version of the human protein produced via cell-free expression systems for research purposes. While its biological function remains uncharacterized, it is implicated in studies related to genetic disorders and membrane protein biology.
C18orf62 is located within a critical genomic region (18q22.3–18q23) implicated in syndromic congenital disorders, including:
Congenital vertical talus (CVT): Linked to deletions in 18q, though C18orf62’s role remains uncertain .
Aural atresia (CAA): Overlapping deletions in 18q22.3–18q23 include TSHZ1 (a known causal gene) and C18orf62, but no direct causality has been established for C18orf62 .
Membrane Localization: Classified as an integral membrane protein, suggesting potential roles in cellular signaling or transport .
Experimental Use: The recombinant protein is primarily used to study protein-protein interactions, membrane biology, or as a control in assays .
Structural Studies: High-resolution crystallography or cryo-EM to elucidate its transmembrane topology.
Functional Screens: CRISPR/Cas9 knockout models to assess phenotypic effects in zebrafish or mice.
Disease Modeling: Investigating its role in 18q-deletion syndromes through patient-derived iPSCs or organoids.
C18orf62 (chromosome 18 open reading frame 62) is a human protein now designated as SMIM21 (Small Integral Membrane Protein 21). It is classified as a putative uncharacterized protein with aliases including MGC126049 . The protein contains the amino acid sequence RNHSRIQGVSEDWKRANSIFRNFLRLKSSRNTAEAE, which has been used to develop antibodies for research purposes . Despite its designation as an "uncharacterized protein," evidence suggests potential roles in neuropsychiatric conditions, particularly mood disorders associated with chromosome 18q deletions .
C18orf62 is located on the long arm of chromosome 18 (18q). More specifically, it is situated within a critical region associated with mood disorders in patients with distal 18q deletions. Studies examining patients with 18q deletions have identified a critical region that includes C18orf62 along with two other genes, ZADH2 and TSHZ1 . This region is likely within the most distal 30 Mb of chromosome 18, as 95% of individuals with 18q deletions studied were hemizygous for some portion of this region .
Detection of C18orf62 expression can be accomplished using immunohistochemistry techniques. Commercially available antibodies for C18orf62/SMIM21 have been validated for immunohistochemistry in both fresh and paraffin-embedded human tissue samples . The recommended dilution range for these applications is 1:200 - 1:500 . While comprehensive tissue expression profiling data is limited in the available literature, researchers can use these immunohistochemical methods to investigate expression patterns across various human tissues of interest.
C18orf62 has been implicated in mood disorders through studies of patients with distal 18q deletions. In one study examining 36 participants with hemizygous distal deletions of chromosome 18q, 16 individuals were identified with lifetime mood disorders (14 with unipolar depression and 2 with bipolar disorders) . Analysis of these patients defined a critical region that included three genes: C18orf62, ZADH2, and TSHZ1 . This suggests that hemizygosity for C18orf62 may contribute to the pathogenesis of mood disorders, although the specific molecular mechanisms remain to be elucidated.
Patients with 18q deletions who developed mood disorders showed several distinguishing characteristics compared to those without mood disorders. The mood-disordered patients had:
These findings suggest that the neuropsychiatric phenotype associated with C18orf62 hemizygosity may be complex and modulated by other factors affecting cognitive function and neurodevelopment.
The distal region of chromosome 18q is associated with multiple phenotypes beyond mood disorders. Studies have identified critical regions for several clinical features:
| Phenotype | Critical Region Borders | Size of Region (Mb) | Penetrance |
|---|---|---|---|
| Kidney malformation | 70,079,559-73,287,604 | 3.21 | 25% |
| Dysmyelination | 71,669,548-73,287,604 | 1.62 | 100% |
| Growth hormone response failure | 71,669,548-73,287,604 | 1.62 | 90% |
| Aural atresia | Not specified | Not specified | 78% |
While the exact position of C18orf62 relative to these critical regions isn't specified in the available data, the overlap between these regions suggests potential roles for C18orf62 or nearby genes in multiple developmental processes .
Commercially available polyclonal antibodies for C18orf62/SMIM21 have been developed against a recombinant protein corresponding to the amino acid sequence RNHSRIQGVSEDWKRANSIFRNFLRLKSSRNTAEAE . These rabbit IgG antibodies have been affinity purified and validated for:
Immunohistochemistry (fresh samples)
Immunohistochemistry-Paraffin (embedded samples)
The recommended dilution range is 1:200 - 1:500 . The specificity of these antibodies has been verified on a protein array containing the target protein plus 383 other non-specific proteins, suggesting high specificity for research applications .
Stable cell lines overexpressing C18orf62 can be generated using lentiviral expression systems. The general methodology involves:
Subcloning the C18orf62 gene into appropriate lentiviral vectors
Generating lentiviral particles containing the C18orf62 construct
Infecting appropriate host cell lines (HEK293 is commonly used, but CHO and other cell types are also viable options)
Selecting clones that consistently express C18orf62 at high levels
Commercially available C18orf62-overexpressing stable cell lines have been developed using HEK293 cells, which can serve as positive controls or research tools for functional studies .
Quantitative real-time PCR (qRT-PCR) can be employed to measure C18orf62 expression levels in research samples. While not specifically described for C18orf62, the methodology would be similar to that used for other genes in the chromosome 18q region:
Design TaqMan probe/primer sets from genomic sequence data in regions of interest
Use a real-time PCR detection system (such as Bio-Rad iCycler iQ)
Quantify target sequence by measuring threshold cycle number (Ct value)
Use a standard curve and housekeeping gene as an internal control reference
Calculate relative expression levels based on fractional Ct values
This approach allows precise quantification of C18orf62 expression in experimental samples compared to controls.
Given the limited characterization of C18orf62/SMIM21, bioinformatic approaches offer valuable insights into potential functions:
Sequence homology analysis: Compare the amino acid sequence with characterized proteins to identify functional domains
Secondary structure prediction: Determine potential membrane-spanning regions consistent with its designation as a Small Integral Membrane Protein
Protein-protein interaction prediction: Identify potential binding partners based on sequence motifs
Expression correlation analysis: Examine co-expression patterns with genes of known function across tissue types and disease states
Evolutionary conservation analysis: Assess conservation across species to identify functionally important regions
These approaches can generate testable hypotheses about C18orf62 function that can guide experimental design.
Based on the association of C18orf62 with mood disorders, several cellular pathways warrant investigation:
Neurotransmitter signaling pathways: Given the mood disorder connection, serotonergic, dopaminergic, or noradrenergic signaling might be affected
Stress response pathways: Mood disorders often involve dysregulation of stress response mechanisms
Synaptic plasticity: Proteins involved in mood regulation often affect synaptic function
Neuroendocrine signaling: The association with growth hormone response failure in the same chromosomal region suggests potential roles in endocrine regulation
Developmental pathways: The multiple developmental phenotypes associated with the region (kidney malformation, dysmyelination) suggest potential roles in tissue development
Experimental approaches that investigate these pathways may reveal the functional significance of C18orf62.
CRISPR-Cas9 genome editing provides powerful approaches to study C18orf62 function:
Gene knockout: Create complete loss-of-function models by introducing frameshift mutations or large deletions
Knockin reporters: Insert fluorescent tags to monitor endogenous protein localization and expression
Domain-specific mutations: Introduce specific mutations to assess the function of predicted domains
Transcriptional modulation: Use CRISPRa (activation) or CRISPRi (interference) to modulate expression without altering the coding sequence
Humanized models: Introduce human C18orf62 into model organisms to study function in vivo
When designing CRISPR studies, researchers should consider the potential for off-target effects and validate editing efficiency through sequencing.
Several challenges complicate research on uncharacterized proteins like C18orf62:
Limited baseline knowledge: Without established functions or pathways, experimental design must be largely exploratory
Potential redundancy: Functional redundancy with other proteins may mask phenotypes in single-gene manipulation studies
Context-dependent functions: The protein may function differently across tissues or developmental stages
Technical limitations: Lack of validated reagents specific to the protein may hinder experimental approaches
Publication bias: Negative results may go unpublished, leading to information gaps in the field
Researchers must adopt flexible, multi-dimensional approaches and collaborate across disciplines to overcome these challenges.
Integrative multi-omics approaches offer powerful means to characterize C18orf62:
Transcriptomics: RNA-seq analysis comparing wild-type and C18orf62-manipulated samples can identify affected pathways
Proteomics: Mass spectrometry-based approaches can identify binding partners and post-translational modifications
Metabolomics: Profiling metabolic changes in response to C18orf62 manipulation may reveal functional roles
Epigenomics: Analysis of chromatin accessibility and modification in relation to C18orf62 expression
Systems biology integration: Computational integration of multiple data types can reveal emergent properties not evident in single-omics approaches
These approaches, while technically and computationally challenging, provide comprehensive insights into protein function that overcome limitations of individual techniques.