Uncharacterized protein antibodies are designed to detect proteins with unknown or partially characterized roles. They are often used in discovery-phase research, such as identifying novel interactors in disease pathways or studying mitochondrial components . For example, C17orf80, a mitochondrial nucleoid-associated protein, was recently characterized using proximity-based proteomics and immunofluorescence microscopy . Similarly, FAM47E, linked to PRMT5 regulation, was identified via yeast two-hybrid screening .
The development of reliable antibodies for uncharacterized proteins faces significant hurdles:
Low Sensitivity: Antibodies for proteins like FAM47E often require overexpression due to low endogenous levels .
Cross-Reactivity: Unvalidated antibodies may bind non-target proteins, leading to false positives .
Epitope Uncertainty: Structural unpredictability complicates epitope mapping, as seen in mitochondrial membrane proteins .
Modern techniques mitigate these challenges:
Knockout (KO) Validation: CRISPR-generated KO cell lines confirm antibody specificity by eliminating target protein expression .
Proximity Labeling: BioID and APEX2 tag proteins to map interactomes in situ .
Recombinant Production: Sequence-defined antibodies reduce lot-to-lot variability .
Uncharacterized protein antibodies are pivotal in:
Mitochondrial Biology: C17orf80 antibodies revealed its role in nucleoid stability under replication stress .
Cancer Pathways: FAM47E-PRMT5 antibodies linked PRMT5 stabilization to oncogenic signaling .
Antibody Crisis Mitigation: Standardizing antibody characterization prevents reproducibility failures .
When working with uncharacterized proteins, antibody selection presents a significant challenge due to limited commercial availability. In studies like the FAME protein characterization, researchers tested multiple antibodies (four in total) before identifying one that provided consistent and specific results in immunohistochemistry . The methodological approach should include:
Survey all commercially available antibodies targeting your protein of interest
Test multiple antibodies from different vendors and with different clonality (monoclonal vs. polyclonal)
Include essential controls in your validation process, particularly knockout/knockdown tissues or cells
Consider raising custom antibodies if commercial options prove inadequate
Evaluate antibodies in multiple applications (IHC, WB, IF) as performance may vary significantly across techniques
The FAME protein study exemplifies this challenge, where only one antibody (Santa Cruz mouse monoclonal SC-398907) out of four tested provided reliable results in immunohistochemistry but failed in western blot applications for detecting endogenous protein .
Proper validation of antibodies against uncharacterized proteins requires multiple stringent controls:
Genetic negative controls: Tissues or cells lacking the target protein (knockout/knockdown) represent the gold standard for specificity validation, as demonstrated in the FAME study where knockout animals showed no signal with the antibody
Overexpression systems: Testing the antibody against cells overexpressing your protein of interest can help establish sensitivity thresholds, as seen in the FAME study when researchers validated antibody functionality via FAME-EGFP fusion in cultured cells
Orthogonal techniques: Correlation of protein detection with RNA expression data provides additional validation, as utilized in The Human Protein Atlas reliability scoring system
Non-denaturing vs. denaturing conditions: Testing antibody performance in different conditions can reveal application-specific limitations, as noted with the FAME antibody which performed better in non-denaturing conditions
Cross-species reactivity: Verification of whether the antibody recognizes orthologs from different species, noting the limitation in the FAME study where the antibody was not validated for human FAME
Determining subcellular localization provides crucial initial insights into potential protein function. Multiple complementary approaches should be employed:
Immunofluorescence with validated antibodies: Direct visualization in fixed cells, as used for C17orf80 localization to mitochondria
Fusion protein strategies: Creating fusion proteins with fluorescent tags (GFP, mCherry) can track localization in living cells
Epitope tagging: Adding small epitope tags (FLAG, HA, Myc) that can be detected with well-characterized commercial antibodies, particularly valuable when direct antibodies against your protein are unavailable or underperforming
Subcellular fractionation: Biochemical separation of cellular compartments followed by immunoblotting to determine protein distribution
Proximity labeling approaches: Methods like BioID or APEX can identify neighboring proteins, providing localization context as utilized in identifying C17orf80 as a mitochondrial protein
The approach used for FAME protein determined it localizes to plasma membranes and small cytoplasmic vesicles through FAME-EGFP fusion proteins in HEK293T cells, which was complemented by immunohistochemistry showing enrichment in kidney proximal tubules .
Investigating protein interactions provides critical functional insights for uncharacterized proteins. Multiple complementary methods should be employed:
Co-immunoprecipitation (Co-IP): Using validated antibodies to capture protein complexes, a straightforward approach when specific antibodies are available
Epitope-tagged pull-downs: When direct antibodies are unavailable, fusion of an epitope tag allows complex isolation using well-characterized commercial antibodies
GST-fusion protein pull-downs: Fusion to glutathione S-transferase enables complex purification using glutathione-coated beads without antibodies
Proximity labeling proteomics: Methods like BioID or APEX can identify proteins in close spatial proximity, as used to identify C17orf80 as a Twinkle-proximal protein
Two-hybrid systems: Genetic screening approaches to identify interaction partners
Surface plasmon resonance: Allows real-time monitoring of protein interactions with potential binding partners
Protein arrays: High-throughput screening of potential interaction partners using immobilized protein collections
The C17orf80 study exemplifies a successful multi-method approach, initially identified through proximity labeling mass spectrometry near nucleoid components and subsequently confirmed through interaction proteomics and biochemical assays to associate with mitochondrial DNA nucleoids .
This common challenge was explicitly noted in the FAME protein study, where researchers could detect overexpressed FAME-EGFP fusion proteins but not endogenous FAME in western blots . Methodological approaches include:
Sample enrichment strategies:
Subcellular fractionation to concentrate the compartment where your protein localizes
Immunoprecipitation to concentrate the protein before western blot
Tissue selection focusing on highest expression sites (for FAME, kidney tissue was enriched)
Technical optimizations:
Increase protein loading amounts
Extend primary antibody incubation (overnight at 4°C)
Test various blocking agents (BSA vs. milk)
Evaluate enhanced chemiluminescence (ECL) substrates with different sensitivities
Consider non-denaturing conditions if the antibody recognizes conformational epitopes
Alternative detection methods:
Consider more sensitive detection systems (fluorescent secondary antibodies)
Explore proximity ligation assays for protein detection in situ
Researchers working with FAME concluded the antibody could only detect protein above a certain threshold concentration, preferably in non-denaturing conditions, and that endogenous levels in total kidney extract were too low because the protein was produced only in specific cell types .
When antibodies produce inconsistent or contradictory staining patterns, a systematic analytical approach is needed:
Validate with genetic controls: Test antibodies on tissues from knockout/knockdown models, as performed in the FAME study
Compare multiple independent antibodies: Consistency between antibodies targeting different epitopes increases confidence, as employed in The Human Protein Atlas reliability scoring
Correlate with transcript data: Compare protein detection with RNA-seq data from the same tissues to assess consistency, a method used in The Human Protein Atlas validation
Consider fixation variables: Test multiple fixation methods as epitope accessibility can vary dramatically
Evaluate regional expression differences: As noted in the FAME study, the antibody did not stain all proximal tubules uniformly, possibly due to regional differences in expression levels or section planes
Complementary detection methods: Implement epitope-tagged constructs or reporter gene knock-ins to confirm expression patterns
The Human Protein Atlas employs a systematic reliability scoring approach based on consistency between antibodies, correlation with RNA-seq data, and support from external databases like UniProtKB/Swiss-Prot .
Epitope tagging represents a powerful approach when direct antibodies against an uncharacterized protein prove inadequate:
Selection of appropriate tag:
Small epitope tags (FLAG, HA, Myc) minimize structural interference
Fluorescent protein fusions (GFP, mCherry) enable live-cell imaging
Enzyme tags (HRP, APEX) can be used for proximity labeling
Purification tags (His, GST) facilitate protein isolation
Tag position considerations:
N-terminal vs. C-terminal placement based on predicted protein topology
Internal tagging at permissive sites when termini are functionally important
Multiple tagging strategies to verify consistent results
Functional validation:
Confirm tagged protein retains native functions and localization
Compare overexpression phenotypes with knockdown effects
Application advantages:
Track protein movement in living cells
Isolate protein complexes with established high-affinity purification systems
Use well-characterized commercial antibodies against the tag
The FAME study employed FAME-EGFP fusion proteins to establish subcellular localization to plasma membranes and cytoplasmic vesicles, while also using this system to validate antibody specificity .
Determining protein function requires integrating multiple experimental approaches:
Subcellular localization: Initial clues from immunofluorescence or tagged protein localization
Protein-protein interaction networks:
Immunoprecipitation followed by mass spectrometry
Proximity labeling (BioID, APEX) to identify spatial neighbors
Correlation with known functional complexes
Post-translational modification analysis:
Phospho-specific antibodies to detect activation states
Detection of other modifications (ubiquitination, SUMOylation)
Dynamic behavior analysis:
Tracking protein relocalization under various cellular conditions
Quantifying expression changes during differentiation or stress
Functional perturbation studies:
Antibody-mediated inhibition in cell-free systems or microinjection
Correlation of knockout/knockdown phenotypes with protein distribution
Both FAME and C17orf80 studies exemplify this integrated approach, combining localization data with interaction networks and knockout phenotypes to suggest functions in metabolism for FAME and mitochondrial DNA maintenance for C17orf80 .
Differential antibody performance across applications is common and requires careful interpretation:
Application-specific epitope accessibility:
Fixation-sensitive epitopes may be accessible in IF but not FFPE IHC
Denaturation-sensitive epitopes may work in IHC but fail in western blot
Concentration effects:
Systematic validation approach:
Document conditions where the antibody performs reliably
Utilize complementary detection methods for applications where performance is poor
Consider epitope tags for applications where direct antibodies fail
Binding affinity and avidity factors:
Higher antibody concentrations for tissues with lower expression
Extended incubation times for weaker interactions
The Human Protein Atlas explicitly addresses this challenge through application-specific reliability scores and transparent documentation of antibody performance across different techniques .
Establishing antibody specificity for uncharacterized proteins requires multiple lines of evidence:
Genetic validation: Testing on knockout/knockdown tissues/cells represents the gold standard
Multi-antibody concordance: Consistent results from independent antibodies targeting different epitopes
Blocking peptide competition: Specific signal should be competitively reduced by the immunizing peptide
Orthogonal validation: Correlation with RNA expression data and tagged protein localization
Expected molecular weight: Consistency with predicted size (accounting for post-translational modifications)
The Human Protein Atlas employs a formal reliability scoring system incorporating these principles, particularly emphasizing concordance between antibodies and correlation with RNA-seq data .
Immunoprecipitation of low-abundance proteins presents significant challenges, requiring optimized protocols:
Starting material optimization:
Increase input material quantity
Use tissues/cells with highest expression based on transcript data
Consider subcellular fractionation to enrich compartments containing your protein
Cross-linking strategies:
Implement reversible cross-linking to stabilize transient interactions
Optimize cross-linker concentration and conditions to preserve complex integrity
Antibody considerations:
Test multiple antibodies for IP efficiency
Consider direct antibody conjugation to beads to reduce background
Employ epitope tagging when direct antibodies perform poorly
Buffer optimization:
Test detergent types and concentrations to balance solubilization with complex preservation
Adjust salt concentration to minimize non-specific interactions
Include appropriate protease and phosphatase inhibitors
Detection enhancement:
Use highly sensitive detection methods for western blot
Consider silver staining or fluorescent protein detection for higher sensitivity
Both the FAME and C17orf80 studies successfully employed epitope tagging approaches to overcome detection limitations of endogenous proteins .
Post-translational modifications provide critical functional insights for uncharacterized proteins:
Immunoprecipitation-based approaches:
Isolate the protein using validated antibodies or epitope tags
Probe with modification-specific antibodies (phospho, ubiquitin, SUMO)
Submit for mass spectrometry analysis to identify modification sites
Mobility shift analysis:
Compare migration patterns before and after treatment with phosphatases, deglycosylases
Use Phos-tag or similar systems to enhance separation of phosphorylated species
Site-directed mutagenesis:
Create point mutations at predicted modification sites
Compare localization and function of wild-type vs. mutant proteins
Modification-specific antibodies:
Generate or obtain antibodies specific to the modified form
Validate specificity using appropriate controls (phosphatase treatment, mutants)
The FAME study employed this approach to investigate myristoylation, using inhibitors of N-myristoyltransferases (IMP-1088 and DDD85646) to verify a predicted myristoylation site .