CG8111 (Drosophila melanogaster Transmembrane Protein 43 Homolog) is a conserved transmembrane protein sharing structural and functional homology with human TMEM43 . It serves as a critical model for studying arrhythmogenic right ventricular cardiomyopathy (ARVC5), a severe cardiac disorder linked to the human TMEM43 p.S358L mutation . The Drosophila homolog (CG8111 p.S333) mirrors this mutation and is used to investigate its pathogenic mechanisms . Recombinant CG8111 is produced via bacterial expression systems (e.g., E. coli) and is widely used in biochemical and disease modeling studies .
CG8111 is a 376-amino acid protein with four transmembrane domains (TMs) and a large acidic loop between TM1 and TM2 . Key structural features include:
Membrane Topology: N- and C-termini face the cytoplasm/nucleoplasm, while the loop domain resides in the ER lumen .
Conservation: Shares ~34% identity with human TMEM43, including a critical serine residue (p.S333 in CG8111; p.S358 in TMEM43) essential for hydrogen bonding between helices .
Localization: Expressed in the ER membrane and nuclear envelope, with ubiquitous expression across all developmental stages .
| Feature | CG8111 | Human TMEM43 |
|---|---|---|
| Amino Acids | 376 | 376 |
| Transmembrane Domains | 4 | 4 |
| Critical Residue | Serine-333 (p.S333) | Serine-358 (p.S358) |
| Localization | ER membrane, nuclear envelope | ER membrane, nuclear envelope |
CG8111 has been instrumental in studying ARVC5 pathogenesis:
Overexpression Studies: Transgenic flies expressing CG8111 p.S333L exhibit cardiac arrhythmias, growth defects, and premature death .
Metabolic Dysregulation:
Conservation Across Species: Similar metabolic impairments observed in murine Tmem43 models and human ARVC5 patients .
CG8111 is commercially produced as a recombinant protein for research use:
CG8111 is the Drosophila melanogaster homolog of human Transmembrane protein 43 (TMEM43). The Drosophila CG8111 protein exhibits approximately 34% identity with its human counterpart and shares key structural features . CG8111 is located on chromosome 3 at position 66A19 in Drosophila and encodes a 376 amino acid protein. Orthology analysis through DIOPT-based searches via FlyBase has identified CG8111 homologs across diverse model organisms, including mammals, amphibians, fish, and insects, suggesting evolutionary conservation of this protein across metazoan species .
Both CG8111 and TMEM43 contain four conserved transmembrane domains with a larger acidic loop between TM1 and TM2. The conservation extends to specific functional residues, with serine-358 in human TMEM43 being homologous to serine-333 in Drosophila CG8111, and this serine is phylogenetically conserved throughout all known metazoan TMEM43 homologues .
CG8111 is expressed throughout all developmental stages of Drosophila melanogaster. Western blot analyses using specific antibodies against CG8111 have confirmed protein expression in embryos, larvae, pupae, and adults . This widespread temporal expression pattern is consistent with genome-wide microarray expression studies that demonstrated the presence of CG8111 transcripts at all developmental stages across various tissues . The ubiquitous and persistent expression pattern suggests that CG8111 may have fundamental biological functions throughout the Drosophila life cycle, although knockout studies indicate it is not essential for viability under laboratory conditions.
CG8111 localizes primarily to the endoplasmic reticulum (ER) and nuclear envelope compartments in Drosophila cells. This has been demonstrated through experiments using HA-tagged CG8111 fusion proteins expressed in transgenic Drosophila using the UAS/GAL4 system . The localization was confirmed by co-localization studies with the ER chaperone protein Calnexin. Notably, CG8111 was not detected in other cellular membranes, such as those involved in intracellular trafficking pathways or the plasma membrane .
The membrane topology of CG8111 has been characterized using redox-sensitive GFP (roGFP) fusion constructs. Studies with roGFP fused to either the N- or C-terminus of CG8111, as well as constructs with roGFP inserted between TM1 and TM2, have revealed that both the N- and C-termini face the cytoplasm/nucleoplasm, while the loop domain between TM1 and TM2 is located in the ER lumen . This membrane topology is identical to that of human TMEM43, further supporting the functional conservation between these homologs.
CRISPR/Cas9-mediated genome editing has been successfully employed to generate CG8111 knockout models in Drosophila melanogaster. The process involves designing guide RNAs (gRNAs) targeting specific regions of the CG8111 gene and introducing them into vasa-Cas9 transgenic Drosophila . Two effective approaches have been documented:
In vitro transcription of six different gRNAs targeting CG8111, followed by microinjection into vasa-Cas9 transgenic embryos.
Utilization of a transgenic fly line from the "Transgenic RNAi project" (TKO) collection, which expresses a single sgRNA targeting CG8111 .
These approaches have yielded multiple CG8111 mutant alleles with different characteristics. Two particularly useful alleles for detailed analyses include:
CG8111*18: Contains a small deletion causing a frameshift with a premature opal stop at codon 18.
CG8111Δ110-319: Carries a larger deletion that eliminates most of the gene's open reading frame .
Western blot analysis using specific antibodies against CG8111 can confirm the absence of the protein in homozygous knockout mutants, validating the success of the CRISPR/Cas9-mediated mutagenesis approach.
Several transgenic systems have been developed to study CG8111 function and mutations in Drosophila:
UAS/GAL4 expression system: This inducible system allows for tissue-specific and temporal control of CG8111 expression. Transgenic lines carrying constructs with either wild-type CG8111 or mutant variants (e.g., CG8111 p.S333L) under UAS control can be crossed with various GAL4 driver lines to express the protein in specific tissues or ubiquitously .
Site-specific integration: Transgenes are integrated at specific genomic sites to ensure comparable expression levels between different constructs, eliminating position effects that might confound experimental results .
Tagged fusion proteins: CG8111 has been successfully tagged with various epitopes or reporter proteins:
These transgenic systems have proven valuable for investigating the effects of specific mutations, such as p.S333L, which corresponds to the pathogenic p.S358L variant in human TMEM43 associated with arrhythmogenic right ventricular cardiomyopathy type 5 (ARVC5) .
CG8111 knockout Drosophila exhibit surprisingly normal development and physiological function under laboratory conditions. Detailed phenotypic analyses of homozygous CG8111 knockout flies have revealed:
Normal development and complete viability
Normal longevity comparable to control flies
No detectable abnormalities in feeding behavior
Normal cardiac function without arrhythmias or other impairments
These findings suggest that CG8111 is dispensable for Drosophila development and viability under standard laboratory conditions, similar to observations made in Tmem43 knockout mice . The absence of cardiac dysfunction in CG8111 knockout flies aligns with data from mouse models, suggesting that loss of TMEM43/CG8111 function alone is insufficient to cause cardiac phenotypes associated with ARVC5 in humans.
In contrast to the relatively benign effects of CG8111 knockout, the p.S333L mutation (corresponding to human TMEM43 p.S358L) causes severe developmental and physiological defects when overexpressed in Drosophila:
| Effect | CG8111 Wildtype Overexpression | CG8111 p.S333L Overexpression |
|---|---|---|
| Viability | Normal viability and fertility | Lethality during larval and pupal stages |
| Development | Normal morphology | >60% die in second or third larval stage; ~30% develop into pupae but fail to hatch |
| Body size | Normal | Reduced growth and smaller body size |
| Body weight | Normal | Lower body weight |
| Cardiac function | Normal | Cardiac arrhythmias |
| Survival | Normal lifespan | Premature death |
The ubiquitous overexpression of CG8111 p.S333L using the tubulin-GAL4 driver results in complete penetrance of the lethal phenotype, with no adult flies emerging . These findings suggest that the p.S333L mutation confers a toxic gain-of-function effect rather than simply disrupting normal CG8111 function.
When expressed specifically in cardiac tissues using heart-specific GAL4 drivers, CG8111 p.S333L causes cardiac dysfunction including arrhythmias, resembling aspects of the ARVC5 phenotype in humans with the TMEM43 p.S358L mutation .
Structural analyses of CG8111 have provided important insights into its molecular architecture and function:
Transmembrane topology: Both Drosophila CG8111 and human TMEM43 contain four conserved transmembrane domains (TM) with a larger acidic loop between TM1 and TM2 .
AlphaFold predictions: Computational modeling using AlphaFold has revealed the structural organization of CG8111, supporting earlier predictions by Bengtsson et al. regarding TMEM43 structure .
Critical serine residue: The AlphaFold models predict that serine-333 in CG8111 (homologous to serine-358 in TMEM43) is essential for hydrogen bond formation between helices 3 and 4, explaining why mutation of this residue is particularly disruptive .
Membrane orientation: Experimental studies using roGFP fusion constructs have confirmed that both the N- and C-termini of CG8111 face the cytoplasm/nucleoplasm, while the loop domain between TM1 and TM2 is located in the ER lumen . This membrane topology is identical to that of human TMEM43.
These structural insights help explain why the S333L mutation in CG8111 (corresponding to S358L in human TMEM43) disrupts protein function, as it potentially eliminates critical hydrogen bonds between transmembrane helices.
Comprehensive metabolomic analyses have revealed significant alterations in metabolic profiles of Drosophila expressing the CG8111 p.S333L mutation:
| Metabolite Category | Change in CG8111 p.S333L vs. Wildtype | Notes |
|---|---|---|
| Fatty acids | Significantly increased | Suggests dysregulation of lipid metabolism |
| Branched-chain amino acids (BCAA) | Significantly increased | BCAAs are also elevated in failing human myocardium |
| Fructose | Significantly decreased | Indicates altered carbohydrate metabolism |
Principal component analysis (PCA) of NMR data confirmed genotype-specific separation of metabolite profiles between controls, CG8111 wildtype overexpressors, and CG8111 p.S333L overexpressors . These metabolic alterations suggest that the p.S333L mutation disrupts energy homeostasis and lipid metabolism in Drosophila, potentially contributing to the observed growth defects and premature death. The increase in branched-chain amino acids is particularly notable, as this metabolic signature has also been observed in failing human myocardium associated with heart failure .
Proteomic analyses of Drosophila expressing CG8111 p.S333L have identified over 300 differentially regulated proteins compared to flies expressing wildtype CG8111 . Gene Ontology (GO) analysis of proteins with increased abundance in CG8111 p.S333L-expressing larvae revealed several significantly affected biological processes:
Branched-chain amino acid metabolism: Proteins involved in BCAA metabolism, such as 3-hydroxyisobutyrate dehydrogenase, showed altered expression.
Lipid metabolism: Lipases and other lipid-processing enzymes were differentially regulated.
Fatty acid metabolism: Proteins like Acetyl-coenzyme A synthetase showed altered abundance .
These proteomic findings strongly corroborate the metabolomic data, suggesting that dysregulation of energy and lipid metabolism is a central consequence of the CG8111 p.S333L mutation. The proteomic changes may provide mechanistic insights into how the mutation leads to developmental defects, cardiac dysfunction, and premature death in the Drosophila model.
The Drosophila CG8111 model offers several valuable insights for human TMEM43-related disease research, particularly for arrhythmogenic right ventricular cardiomyopathy type 5 (ARVC5):
Mechanistic understanding: The observation that CG8111 knockout flies develop normally while flies expressing the p.S333L mutant exhibit severe phenotypes suggests that the human TMEM43 p.S358L mutation likely causes disease through a toxic gain-of-function rather than loss-of-function mechanism .
Metabolic dysregulation: The metabolomic and proteomic alterations in CG8111 p.S333L flies reveal disruption of energy homeostasis and lipid metabolism, which may contribute to the pathophysiology of ARVC5. This is consistent with the fibrofatty replacement observed in human ARVC5 patients .
Evolutionary conservation: The finding that the corresponding mutations in flies and humans cause similar cardiac phenotypes (arrhythmias) despite the evolutionary distance suggests fundamental conservation of TMEM43/CG8111 function in cardiac physiology .
Drug screening platform: The Drosophila model provides a relatively high-throughput system for testing potential therapeutic compounds that might mitigate the effects of TMEM43 mutations.
Genetic modifier screens: Drosophila's genetic tractability allows for identification of genetic modifiers that enhance or suppress the CG8111 p.S333L phenotype, potentially revealing new therapeutic targets for ARVC5.
Several experimental approaches can be employed to elucidate the molecular mechanisms underlying CG8111 p.S333L pathogenicity:
Structure-function studies: Systematic replacement of S333 with various amino acids to determine which physicochemical properties at this position are critical for normal CG8111 function .
Interactome analysis: Identification of CG8111 protein interaction partners in wildtype versus p.S333L contexts to determine if the mutation alters protein-protein interactions.
Cell biology approaches: Investigation of ER stress, unfolded protein response, and other cellular pathways that might be activated by the mutant protein.
Tissue-specific expression: Targeted expression of CG8111 p.S333L in different tissues to identify which cell types are most sensitive to the mutation's effects.
Rescue experiments: Testing whether human TMEM43 can functionally replace CG8111 in Drosophila, and whether wildtype TMEM43 can rescue phenotypes caused by CG8111 p.S333L expression.
Comparative analysis: Parallel studies in multiple model systems (Drosophila, mice, human cells) to identify conserved mechanisms of TMEM43/CG8111 function and dysfunction.
These approaches can provide complementary insights into how the CG8111 p.S333L mutation disrupts cellular function and leads to pathology, potentially informing therapeutic strategies for human TMEM43-related diseases.
When designing experiments with recombinant CG8111, several technical considerations should be addressed:
Expression system selection: CG8111 has been successfully expressed in Drosophila S2 cells and Sf21 insect cells . The selection of an appropriate expression system should consider protein folding requirements, post-translational modifications, and the specific experimental goals.
Protein tagging strategies: Various tags (HA, GFP, roGFP) have been successfully used with CG8111 . Consider the potential impact of the tag on protein function, localization, and stability. C-terminal tagging appears to preserve CG8111 function better than N-terminal tagging.
Membrane protein purification: As CG8111 is a transmembrane protein, specialized detergent-based extraction methods are required for biochemical studies. Optimization of detergent type and concentration is critical for maintaining protein structure and function.
Transgene design: For in vivo studies in Drosophila, consider using site-specific integration (e.g., phiC31 integrase system) to ensure comparable expression levels between wildtype and mutant constructs .
Controls: Include appropriate controls, such as untagged proteins, empty vectors, and multiple independent transgenic lines, to distinguish genuine phenotypes from artifacts.
Tissue-specific expression: The UAS/GAL4 system allows for spatiotemporal control of CG8111 expression. Selection of appropriate GAL4 drivers can target expression to specific tissues of interest, such as heart, muscles, or neurons .
Optimizing metabolomic and proteomic approaches for CG8111 research requires careful consideration of several factors:
For both approaches, careful experimental design with appropriate biological and technical replicates is essential. Comparison between multiple genotypes (knockout, wildtype overexpression, and p.S333L overexpression) can provide comprehensive insights into CG8111 function and dysfunction .
Integration of metabolomic and proteomic data through pathway analysis can reveal biological processes affected by CG8111 mutations, as demonstrated by the identification of branched-chain amino acid metabolism and lipid metabolism dysregulation in p.S333L mutants .