Recombinant Arabidopsis thaliana CYSTM1 family protein A (At2g41420), also known as WIH2, is a proline-rich uncharacterized protein expressed in Arabidopsis thaliana. It belongs to the cysteine-rich and transmembrane domain-containing protein family (CYSTM1) and is primarily used in research for studying plant molecular mechanisms, protein interactions, and functional genomics. The recombinant version is produced in heterologous systems like E. coli, yeast, or mammalian cells, enabling structural and biochemical analyses .
| Attribute | Value |
|---|---|
| Gene ID | At2g41420 |
| Synonyms | WIH2, T26J13.1, F13H10.3 |
| Protein Length | 98 amino acids (1–98) [1–98aa] |
| UniProt Accession | Q8S8M0 |
| Pfam Domain | Proline-rich family protein |
Primary Structure: Contains proline-rich motifs and cysteine-rich domains, suggesting roles in protein-protein interactions or cell wall integrity .
Expression: Truncated versions (e.g., AtAC261-388) are often used for functional studies, though full-length recombinants are also available .
While At2g41420 is annotated as a proline-rich protein, its specific biological function remains uncharacterized. Limited studies suggest potential roles in:
Cell Wall Dynamics: Proline-rich proteins often stabilize cell walls or mediate stress responses .
Protein Interactions: Cysteine-rich domains may facilitate interactions with other proteins or membranes .
| Protein | Gene ID | Function |
|---|---|---|
| CYSTM1 Family Protein B | At3g57160 | Cysteine-rich TM module stress protein |
| WIH1 | At2g41420 (homolog) | Proline-rich family protein |
ELISA/Western Blot: Used as a control or antigen in immunoassays .
Structural Studies: Full-length recombinants enable X-ray crystallography or NMR .
Most PCM proteins, including CYSTM1 family protein A, primarily localize to the plasma membrane. This localization has been confirmed through transient expression assays in Nicotiana benthamiana using fluorescent protein tags . The membrane association is mediated by the CYSTM domain, which is predicted to form a transmembrane segment. The conserved cysteines in this domain may facilitate interactions with other membrane components or proteins, potentially through the formation of disulfide bridges or other forms of protein-protein interactions .
CYSTM1 family protein A is responsive to various pathogen challenges and defense-related signals, particularly salicylic acid (SA). RNA-seq analysis has shown that the gene is significantly upregulated in response to SA treatment . The expression behavior broadly follows patterns seen within its subgroup of PCM genes, showing both overlaps and differences with members of other subgroups. This differential expression is consistent with varying overrepresentation of different transcription factor binding DNA motifs in the promoters of the PCM genes .
CYSTM domain-containing proteins are present in diverse species across eukaryotic organisms, suggesting conserved functional importance. While the exact molecular mechanism remains unclear, these proteins appear to play roles in stress tolerance. Proposed mechanisms include:
Altering the redox potential of membranes
Quenching radical species to protect the plant
Affecting membrane-associated protein functions
Facilitating protein-protein interactions through their conserved cysteines
The cysteines may serve as interaction sites for ligands or other PCM proteins, potentially resulting in homo- or heterodimerization. Some family members, such as PCM4/PCC1, interact with components of signaling complexes at the plasma membrane, which may lead to post-translational control of multiple protein targets involved in diverse biological processes including light signaling, development, and immunity .
Recombinant CYSTM1 family protein A can be produced through in vitro E. coli expression systems. Commercial versions typically include an N-terminal 10xHis-tag for purification purposes . For proper storage, it is recommended to keep the protein at -20°C, or at -80°C for extended storage. Working aliquots should be stored at 4°C for up to one week, and repeated freezing and thawing should be avoided .
Alternatively, plant-based expression systems can be used, particularly for proteins that require plant-specific post-translational modifications. The protein can be expressed in Arabidopsis seeds using the appropriate promoter (such as the 12S1 promoter) and the 3'UTR of seed storage protein genes to enhance accumulation .
Overexpression studies have revealed that PCM proteins, including CYSTM1 family members, have significant impacts on plant performance. Key findings include:
Enhanced protection against biotrophic pathogens: All eight PCM family members, when overexpressed in Arabidopsis, conferred enhanced resistance against the biotrophic oomycete pathogen Hyaloperonospora arabidopsidis Noco2 .
Subgroup-specific effects: Overexpression of PCM subgroup I genes specifically conferred enhanced resistance to the hemi-biotrophic bacterial pathogen Pseudomonas syringae pv. tomato DC3000 .
Developmental impacts: PCM-overexpressing lines showed altered expression of genes related to light signaling and development. Notably, PCM-overexpressing seedlings displayed elongated hypocotyl growth, suggesting a connection between disease resistance and photomorphogenesis .
Absence of typical defense gene induction: Interestingly, overexpression of PCMs led to the induction of genes associated with light responses and development, but not to typical defense-associated responses, suggesting a novel mechanism of action .
These findings suggest that CYSTM1 and related proteins may function at the interface of immunity and development, possibly through effects on membrane structure or the activity of interacting proteins at the plasma membrane.
S-acylation (also known as palmitoylation) is a post-translational modification that can affect protein localization, stability, and activity. While specific S-acylation data for CYSTM1 is limited, research on related membrane proteins in Arabidopsis suggests that S-acylation can significantly impact receptor activity.
For instance, the P2K1 receptor, which mediates extracellular ATP-induced immune signaling, undergoes S-acylation that affects its temporal dynamics through autophosphorylation and protein degradation . The potential S-acylation sites can be predicted using software such as GPS-lipid 1.0 (http://lipid.biocuckoo.org/webserver.php).
To experimentally determine S-acylation status, researchers typically:
Generate mutant forms where cysteine (C) sites are individually mutated to serine (S)
Express these mutants in Arabidopsis protoplasts
Determine S-acylation status through parallel assays with or without the hydroxylamine thioester-cleavage step
Given the importance of cysteine residues in CYSTM1, S-acylation could be a critical regulatory mechanism affecting its function in immunity and development.
Several approaches have proven effective for studying CYSTM1 and related proteins:
Transient expression assays: Fluorescent protein fusions (e.g., YFP) can be used to visualize subcellular localization in Nicotiana benthamiana leaves.
Yeast expression systems: Yeast two-hybrid or split-ubiquitin systems have been used to demonstrate homo- or heterodimerization of PCM family members.
Co-immunoprecipitation: For identifying protein interaction partners at the plasma membrane.
Recombinant protein binding assays: His-tagged and GST-tagged proteins can be used in pull-down assays with Ni-NTA and GS4B beads to study direct protein interactions .
S-acylation assays: Parallel assays with or without hydroxylamine thioester-cleavage steps can determine S-acylation status.
Membrane fractionation: Differential centrifugation can be used to isolate membrane fractions and confirm protein localization.
| Method | Application | Advantages | Limitations |
|---|---|---|---|
| Transient expression | Localization studies | Rapid results, in planta | May not reflect native conditions |
| Co-IP | Protein interactions | Captures in vivo complexes | May detect indirect interactions |
| Yeast systems | Dimerization studies | Controlled conditions | Non-plant environment |
| Pull-down assays | Direct binding | Confirms direct interactions | Uses purified proteins |
| S-acylation assays | Post-translational modifications | Identifies specific sites | Labor intensive |
A promising approach for enhancing recombinant protein production in plants involves using the 3'UTR of seed storage protein (SSP) genes. Research has shown that:
The 3'UTR of SSP genes are essential for SSP accumulation and can significantly increase recombinant protein yields in Arabidopsis.
Fusion of the 3'UTR of SSP genes (such as 12S1) to the 3' ends of DNA sequences encoding recombinant proteins enables massive accumulation of recombinant proteins with retained enzymatic activity in Arabidopsis seeds .
This method does not require altering the intracellular localization of recombinant proteins, allowing proteins to maintain their correct localization and functionality under near-native cellular conditions.
The approach has been successfully applied to various proteins, including enzymes like peroxisomal malate dehydrogenase 1 (pMDH1) and biopharmaceutical candidates like human Interferon Lambda-3 .
For CYSTM1 family protein A, this approach could be particularly valuable since it would allow production of the protein in its native plant environment, potentially preserving important post-translational modifications and structural features.
The CYSTM domain is highly conserved across eukaryotic organisms, suggesting fundamental importance in cellular function. CYSTM domain-containing proteins are present in diverse species including Arabidopsis, Caenorhabditis elegans, Candida albicans, Homo sapiens, Mus musculus, Oryza sativa, Saccharomysces cerevisae, and Zea mays .
Evolutionary analysis of these proteins can reveal:
Conserved functional domains and critical amino acid residues
Species-specific adaptations in response to different pathogens
Expansion or contraction of the gene family in different lineages
Potential neofunctionalization or subfunctionalization after gene duplication events
For researchers studying CYSTM1, phylogenetic trees can be generated using tools like PLAZA v4.0 (https://bioinformatics.psb.ugent.be/plaza/) with the PCM1 gene as a query . Such analyses help contextualize the role of CYSTM1 in Arabidopsis within the broader evolutionary history of this protein family across plants and other organisms.
Bacterial Expression System:
Clone the full-length CYSTM1 coding sequence into an appropriate expression vector (e.g., with an N-terminal His-tag).
Transform into an E. coli expression strain.
Induce protein expression with IPTG.
Lyse cells and purify using Ni-NTA affinity chromatography.
Verify protein purity by SDS-PAGE and Western blotting.
Store purified protein at -20°C or -80°C for extended storage .
Plant-Based Expression System:
Construct expression vectors containing:
The 12S1 promoter (approximately 1671 bp upstream of the 12S1 gene)
The CYSTM1 coding sequence
The 3'UTR of 12S1 gene for enhanced protein accumulation
Optional: His-tag or other purification tags
Clone these components using Gateway® cloning technology:
Transform the expression vector into Agrobacterium tumefaciens strain C58C1 rif.
Transform Arabidopsis plants using the floral dip method.
Select transformants on medium containing 25 μg/mL hygromycin B.
Evaluate protein accumulation in at least ten independent T3 progeny .
To study the role of CYSTM1 in plant immunity, researchers can employ several approaches:
Pathogen Challenge Assays:
Generate CYSTM1 overexpression and knockout/knockdown lines.
Challenge plants with pathogens such as:
Hyaloperonospora arabidopsidis Noco2 (biotrophic oomycete)
Pseudomonas syringae pv. tomato DC3000 (hemi-biotrophic bacteria)
Assess disease resistance by measuring:
Pathogen growth/sporulation
Disease symptoms
Expression of defense marker genes
Growth Conditions for Arabidopsis:
Stratify seeds at 4°C in the dark for three days
Germinate on ½ MS sterilized plates in growth chambers
Maintain under long day conditions (21°C; 16 hr light/8 hr dark)
For leaf tissue experiments, use plants grown on soil for 5 weeks after germination
Genetic Analysis:
To prevent carry-over mutations from generations of homozygous mutants, use leaves from homozygous mutant plants segregated from heterozygous plants
Molecular Characterization:
RNA-seq analysis can identify genes differentially expressed between wild-type and CYSTM1-overexpressing or CYSTM1-knockout plants
Use tools like DESeq2 for identifying differentially expressed genes
Gene ontology (GO) analysis can reveal biological processes affected by CYSTM1 manipulation
Subcellular Localization:
Generate fusion proteins with fluorescent tags (e.g., YFP-CYSTM1).
Express in plant systems through:
Transient expression in Nicotiana benthamiana leaves
Stable transformation in Arabidopsis
Visualize using confocal microscopy.
Use appropriate markers for co-localization studies (e.g., plasma membrane markers).
Protein-Protein Interaction Studies:
Yeast Two-Hybrid (Y2H):
Clone CYSTM1 into bait and prey vectors
Screen for interactions with known immunity-related proteins
Verify potential interactions through directed Y2H assays
Co-Immunoprecipitation (Co-IP):
Express tagged versions of CYSTM1 in plants
Immunoprecipitate using tag-specific antibodies
Identify interacting proteins by mass spectrometry
Pull-Down Assays:
Bimolecular Fluorescence Complementation (BiFC):
Fuse CYSTM1 and potential interactors to split YFP fragments
Co-express in plant cells
Visualize reconstituted fluorescence as evidence of interaction
While CYSTM1 itself is not a transcription factor, understanding its regulation and downstream effects can benefit from ChIP-seq and transcriptomic analyses:
RNA-seq Analysis Protocol:
Extract RNA from appropriate tissues (e.g., leaves treated with SA or pathogen challenge).
Prepare RNA-seq libraries following standard protocols.
Sequence using appropriate platforms (e.g., Illumina).
Map reads to the Arabidopsis genome (TAIR version 10) using TopHat.
Summarize aligned reads over annotated gene models using HTseq-count.
Identify differentially expressed genes using a generalized linear model or DESeq2 .
ChIP-seq Protocol for Studying Transcription Factors Regulating CYSTM1:
Grow Arabidopsis plants under appropriate conditions:
Perform chromatin immunoprecipitation using antibodies against transcription factors of interest.
Prepare ChIP-seq libraries and sequence.
Analyze data to identify binding sites in the CYSTM1 promoter region.
Validate binding using techniques such as electrophoretic mobility shift assay (EMSA) or luciferase reporter assays.