The protein FAM118A, also known as Family With Sequence Similarity 118, Member A, is a protein-coding gene in humans . It is also referred to as C22ORF8 . Research suggests FAM118A's involvement in various biological processes, including tumor biology and genetic control of transcription . Studies have explored its expression patterns in different tissues and its potential as a target in cancer research .
| Name | Description |
|---|---|
| Gene Name | FAM118A |
| Alias | Family With Sequence Similarity 118, Member A |
| Alternative Names | C22ORF8 |
| NCBI Gene ID | 55007 |
| Protein | F118A_HUMAN |
FAM118A demonstrates varied expression across different tissues, cell lines, and tumor samples . Studies indicate that FAM118A is consistently expressed in Glioblastoma stem cell (GSC) cultures but not in Neural stem cell (NSC) cultures . Its expression has been confirmed in clinical samples from The Cancer Genome Atlas (TCGA) and Repository for Molecular Brain Neoplasia Data (REMBRANDT) databases .
Functional association data reveals that FAM118A is linked to a variety of biological entities, spanning molecular profiles, organisms, chemicals, functional terms, diseases, phenotypes, cell lines, cell types, tissues, genes, proteins, microRNAs, and sequence features .
Research indicates that FAM118A is connected to Glioblastoma (GBM), a type of brain cancer . In GSC cultures, FAM118A is consistently expressed, suggesting a potential role in tumor maintenance or progression . Analysis of gene expression in GBM samples from TCGA showed that FAM118A, along with other genes, can affect patient survival . While some studies have found FAM118A to be upregulated in GSC cultures, others have observed downregulation, indicating complex regulatory mechanisms .
Genome-wide expression quantitative trait loci (eQTL) analyses have identified connections between genetic variants and FAM118A expression levels in whole blood samples . These findings contribute to understanding the genetic control of FAM118A transcription and its potential involvement in various disorders .
FAM118A interacts with various proteins and is involved in protein-protein interaction networks . These interactions may influence signaling pathways and cellular processes relevant to both normal physiology and disease .
FAM118A (Family with sequence similarity 118 member A) is a lesser-known protein-coding gene that belongs to the SIRim subfamily of SIR2 proteins . While FAM118A's precise cellular functions are still being elucidated, recent research has linked it to several important biological processes. It has been identified as an evolutionary homolog of bacterial SIR2 antiphage proteins, suggesting potential immune-related functions . FAM118A has also been renamed SIRanc2 in some research contexts, reflecting its ancestral SIR domain .
Unlike its better-characterized family member FAM118B (which plays a role in Cajal body formation and inhibits cell proliferation), FAM118A's specific cellular functions remain under investigation . Recent experimental evidence suggests FAM118A may have regulatory roles that impact gene expression, as it has been identified as an expression quantitative trait locus (eQTL) transcript in multiple studies .
FAM118A belongs to the SIRim subfamily of SIR2 domain-containing proteins, distinct from classical sirtuins . Phylogenetic analysis has revealed that humans possess nine SIR2 proteins: seven are well-documented sirtuins involved in histone deacetylation, while FAM118A and FAM118B form a separate subfamily (SIRanc/SIRanc2) with poorly documented functions .
The evolutionary relationship between FAM118A/B and bacterial antiphage systems suggests these proteins may have immune-related functions that differ from classical sirtuins . Structural and sequence analysis has confirmed this phylogenetic partition, indicating that different subfamilies of SIR2 proteins likely serve distinct biological functions .
To study the relationship between FAM118A and other SIR2 proteins, researchers have used multiple sequence alignment with Muscle v5.1 and profile HMM construction using the HMMER package . Through iterative profile HMM searches, researchers have identified 8,697 sirims across different domains of life, providing valuable context for understanding FAM118A's evolutionary origins and potential functions .
While comprehensive tissue expression data for FAM118A isn't directly provided in the search results, researchers have documented allelic expression patterns of FAM118A in human primary T cells . Studies have shown that FAM118A exhibits allelic expression imbalance (ASE), where the two alleles of the gene are expressed at different levels .
In expression studies utilizing RNA-seq approaches, FAM118A has been identified as a positive control for eQTL transcripts, indicating its consistent expression patterns make it suitable as a reference in transcriptomic studies . The gene contains multiple heterozygous SNPs that can be used to track allele-specific expression, with studies documenting 21 heterozygous SNPs with sufficient read depth for ASE analysis .
For researchers interested in studying FAM118A expression patterns, it's worth noting that 77.5% of the heterozygous SNPs with read depth >50 are intronic SNPs, indicating that sequence capture approaches retain sufficient read depth even in introns when capturing cDNA from poly(A)-selected RNA .
Measuring allelic expression imbalance (ASE) of FAM118A requires specialized methodologies that have been validated in the literature. One validated approach is the combination of high-throughput sequencing with robust statistical analysis . Here's a methodological framework based on published research:
Sample preparation: Isolate total RNA from samples and prepare cDNA using fragmentation followed by Superscript III RT kit (or equivalent) .
Sequencing approach: Utilize paired-end (PE) sequencing rather than single-end (SE) sequencing for improved mapping efficiency. Studies have shown that PE approaches map 19% more quality reads than SE approaches for ASE analysis .
Quality control measures: Apply stringent quality control procedures to avoid false positives, particularly from mapping biases caused by indels .
Statistical analysis: Determine imbalance using statistical tests that compare observed allelic ratios to the expected 1:1 ratio. Statistical significance thresholds of P<0.001 or P<0.01 have been used successfully in published research .
Validation: Confirm ASE findings using complementary approaches such as C-BASE (Colony-Based Allele-Specific Expression), which involves PCR amplification of regions containing heterozygous SNPs, cloning into vectors, and screening colonies using TaqMan genotyping assays .
The read depth is critical for statistical power in ASE analysis, with recommendations suggesting depths as high as 1000 for capturing subtle levels of allelic imbalance . For FAM118A specifically, researchers have successfully measured ASE using heterozygous SNPs including rs2064068 .
Recent research has revealed intriguing connections between FAM118A (SIRanc2) and innate immune function, based on its evolutionary relationship with bacterial antiphage systems . While direct experimental validation of FAM118A's immune function is still emerging, its homolog FAM118B (SIRanc) has been demonstrated to play a role in the TLR signaling pathway .
Specifically, FAM118B (SIRanc) has been shown to be involved in signaling downstream of multiple Toll-like receptors (TLRs), with knock-down experiments in human monocyte-derived macrophages showing impaired nitric oxide (NO) production following TLR stimulation . This suggests a general role in TLR signaling pathways.
For researchers investigating FAM118A's potential immune functions, the methodological approaches used for FAM118B provide a valuable template:
Generate knock-down or knockout cell lines using CRISPR-Cas or RNA interference techniques.
Stimulate cells with various TLR agonists (TLR1/2, TLR3, TLR7/8, TLR9).
Measure downstream readouts including NO production and transcription of inflammatory cytokines like IL-8 .
It's worth noting that SIRanc's role appears to be specific to certain aspects of TLR signaling. For example, while it impacts NO production, it doesn't affect TNF production, explaining why it wasn't identified in previous CRISPR screens that used TNF as a readout . This selective involvement suggests FAM118A may similarly have specific roles within immune signaling pathways that require carefully designed experimental readouts to detect.
Recent studies have identified FAM118A as differentially expressed in the context of substance use disorders, particularly related to days of abstinence from cocaine use . Research has shown that FAM118A is differentially expressed when comparing high versus intermediate days of abstinence at 9 months in individuals with cocaine use disorder .
The methodological approach used to identify this association involved:
Collection of longitudinal samples from individuals with cocaine use disorder
Application of an optimized linear mixed model for differential expression analysis
Correlation of gene expression patterns with clinical variables including days of abstinence, perceived stress, and craving
While FAM118A's specific mechanistic role in addiction and abstinence remains to be fully characterized, its differential expression in this context suggests potential involvement in neuroadaptive processes associated with prolonged abstinence. Interestingly, the related protein FAM118B has been linked to bipolar disorder and addiction through genome-wide association studies (GWAS) , potentially indicating a shared genetic architecture between these conditions that may involve FAM118A-related pathways.
For researchers studying FAM118A in the context of addiction, integrating transcriptomic approaches with detailed phenotypic characterization appears to be a productive methodological strategy. The association with abstinence duration specifically suggests FAM118A may be involved in neuroplastic changes that occur during recovery from substance use disorders.
Based on standard recombinant protein production methodologies, researchers working with FAM118A should consider the following approaches:
For validation of recombinant FAM118A, researchers should confirm protein identity using mass spectrometry and assess protein folding using circular dichroism or thermal shift assays. Functional validation will depend on the specific activities of FAM118A being investigated.
To investigate the evolutionary history and relationships of FAM118A, researchers have employed sophisticated computational methodologies . Based on published approaches, the following methods are recommended:
Homology detection and alignment:
Iterative search process:
Phylogenetic analysis:
Include related protein families (e.g., sirtuins, SIR2 domain proteins)
Incorporate proteins from diverse taxonomic groups to establish evolutionary context
This methodology has successfully identified 8,697 SIRim proteins across domains of life, providing valuable evolutionary context for FAM118A . The approach revealed that FAM118A and FAM118B form a distinct subfamily (SIRanc/SIRanc2) within the broader SIR2 protein family, with potential immune functions that differ from classical sirtuins .
Research on FAM118A has validated several techniques for studying allelic expression, with RNA sequencing and C-BASE (Colony-Based Allele-Specific Expression) emerging as particularly effective approaches . The following methodological framework is recommended:
RNA sequencing approach:
Use paired-end (PE) rather than single-end (SE) sequencing for 19% improvement in mapping efficiency
Implement stringent quality control procedures to eliminate mapping biases
Apply statistical tests comparing observed allelic ratios to expected 1:1 ratio
Ensure sufficient read depth (>50 minimum, up to 1000 for subtle imbalances)
C-BASE validation methodology:
Design PCR primers against conserved sequences near heterozygous SNPs (e.g., rs2064068 for FAM118A)
Amplify 200-230bp products from both genomic DNA and cDNA
Ligate purified PCR products into a suitable vector (e.g., pCR4-TOPO Vector)
Screen colonies using TaqMan genotyping assays to determine allelic distribution
When implementing these techniques, researchers should be aware of potential technical biases that can affect ASE measurements, including:
RNA fragmentation biases
Gel purification biases during library preparation
Mapping issues related to indels
The C-BASE validation approach is particularly valuable as it provides an orthogonal method to confirm RNA-seq findings and can detect smaller allelic imbalances with increased colony sampling.
FAM118A has been included in disease-gene association mining efforts, though specific disease associations are still emerging . The protein has been studied in the context of substance use disorders, with differential expression observed in relation to cocaine abstinence periods . This suggests potential involvement in addiction-related neuroadaptations.
Additionally, FAM118A's evolutionary relationship with bacterial antiphage systems and membership in the SIRim subfamily hints at possible immune-related functions . Its homolog FAM118B (SIRanc) has been shown to play roles in TLR signaling pathways crucial for innate immunity , suggesting FAM118A may similarly impact immune responses relevant to infectious or inflammatory diseases.
For researchers investigating FAM118A's disease associations, integrating the following approaches is recommended:
Genetic association studies examining FAM118A variants in disease cohorts
Expression studies comparing FAM118A levels in diseased versus healthy tissues
Functional studies exploring how FAM118A variants or expression changes affect cellular processes
The disease-gene associations for FAM118A are currently being derived through multiple approaches including automatic text mining of biomedical literature, manually curated database annotations, cancer mutation data, and genome-wide association studies .
For researchers planning Mendelian randomization (MR) studies involving FAM118A or its associated pathways, several methodological considerations should be addressed based on established guidelines :
Variant selection:
Choose genetic variants robustly associated with FAM118A expression (eQTLs)
Prioritize variants in or near the FAM118A gene
Verify that selected variants meet MR assumptions (relevance, independence, exclusion restriction)
Data sources:
Analytical approaches:
Implement robust statistical methods to address potential horizontal pleiotropy
Consider colocalization analyses to determine if the same causal variant affects both FAM118A expression and the outcome of interest
Apply sensitivity analyses including MR-Egger, weighted median, and weighted mode approaches
Interpretation caveats:
Researchers should note that MR studies involving FAM118A would be particularly valuable for exploring its potential causal role in addiction processes and abstinence outcomes, given the observational associations identified in previous studies .
Researchers studying FAM118A face several technical challenges that require specific methodological solutions:
Limited functional characterization:
Protein expression and purification:
Challenge: Recombinant expression of novel human proteins can present solubility and folding issues.
Solution: Test multiple expression systems (bacterial, insect, mammalian) and fusion tags. Consider truncation constructs based on predicted domains.
Specificity of antibodies:
Challenge: Commercial antibodies for lesser-studied proteins like FAM118A may lack specificity.
Solution: Validate antibodies using knockdown/knockout controls and recombinant protein standards. Consider epitope-tagged expression constructs as alternatives.
Allelic expression analysis:
Evolutionary analysis complexity:
By addressing these challenges with appropriate methodological solutions, researchers can advance our understanding of FAM118A's structure, function, and biological significance.
When encountering contradictory findings about FAM118A, researchers should apply a systematic approach to interpretation and reconciliation:
Context-dependent functions:
Consider that FAM118A may have different functions in different cell types, developmental stages, or physiological conditions.
Design experiments that directly compare FAM118A's behavior across these contexts.
Methodological differences:
Evaluate whether contradictory findings stem from differences in experimental approaches, models, or technical limitations.
Reproduce key experiments using standardized methods to directly compare results.
Genetic variation effects:
Data integration strategies:
Collaboration approaches:
Establish collaborations between groups with contradictory findings to directly compare methodologies and samples.
Consider consortium approaches for larger, more definitive studies with standardized protocols.
When publishing research on FAM118A, transparency about methodological details and limitations is crucial for enabling proper interpretation of potentially contradictory findings in the field.