Protein Info for GFF52 in Variovorax sp. SCN45
Annotation: Polyketide synthase modules and related proteins
These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.
Best Hits
Predicted SEED Role
"Malonyl CoA-acyl carrier protein transacylase (EC 2.3.1.39)" in subsystem Fatty Acid Biosynthesis FASII or mycolic acid synthesis
MetaCyc Pathways
- superpathway of fatty acids biosynthesis (E. coli) (48/53 steps found)
- superpathway of fatty acid biosynthesis II (plant) (38/43 steps found)
- superpathway of fatty acid biosynthesis I (E. coli) (14/16 steps found)
- fatty acid biosynthesis initiation (type II) (3/3 steps found)
- superpathway of fatty acid biosynthesis initiation (4/5 steps found)
- fatty acid biosynthesis initiation (mitochondria) (2/4 steps found)
- fatty acid biosynthesis initiation (plant mitochondria) (1/4 steps found)
- mycobactin biosynthesis (4/11 steps found)
- pederin biosynthesis (2/14 steps found)
- bryostatin biosynthesis (2/19 steps found)
- mupirocin biosynthesis (1/26 steps found)
- corallopyronin A biosynthesis (2/30 steps found)
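
The step counts in the list above are easier to compare as fractions. The following is a minimal sketch, not part of the original page, that parses each "(found/total steps found)" entry and ranks the pathways by the share of steps found. The names `PATHWAY_LINES`, `parse_pathway`, and `rank_by_coverage` are hypothetical and introduced only for illustration.

```python
import re

# Pathway entries copied verbatim from the MetaCyc list above.
PATHWAY_LINES = [
    "- superpathway of fatty acids biosynthesis (E. coli) (48/53 steps found)",
    "- superpathway of fatty acid biosynthesis II (plant) (38/43 steps found)",
    "- superpathway of fatty acid biosynthesis I (E. coli) (14/16 steps found)",
    "- fatty acid biosynthesis initiation (type II) (3/3 steps found)",
    "- superpathway of fatty acid biosynthesis initiation (4/5 steps found)",
    "- fatty acid biosynthesis initiation (mitochondria) (2/4 steps found)",
    "- fatty acid biosynthesis initiation (plant mitochondria) (1/4 steps found)",
    "- mycobactin biosynthesis (4/11 steps found)",
    "- pederin biosynthesis (2/14 steps found)",
    "- bryostatin biosynthesis (2/19 steps found)",
    "- mupirocin biosynthesis (1/26 steps found)",
    "- corallopyronin A biosynthesis (2/30 steps found)",
]

# The pathway name may itself contain parentheses, so the pattern anchors
# on the trailing "(found/total steps found)" group.
STEP_PATTERN = re.compile(
    r"^(?P<name>.+) \((?P<found>\d+)/(?P<total>\d+) steps found\)$"
)


def parse_pathway(line):
    """Return (name, found, total, fraction) for one pathway line."""
    text = line.strip().lstrip("- ").strip()
    match = STEP_PATTERN.match(text)
    if match is None:
        raise ValueError(f"unrecognized pathway line: {line!r}")
    found = int(match.group("found"))
    total = int(match.group("total"))
    return match.group("name"), found, total, found / total


def rank_by_coverage(lines):
    """Sort parsed pathways from highest to lowest fraction of steps found."""
    return sorted(
        (parse_pathway(line) for line in lines),
        key=lambda record: record[3],
        reverse=True,
    )


if __name__ == "__main__":
    for name, found, total, fraction in rank_by_coverage(PATHWAY_LINES):
        print(f"{fraction:6.1%}  {found:>2}/{total:<2}  {name}")
```

Ranking by fraction separates the fatty acid biosynthesis pathways, which are mostly complete, from the polyketide pathways (pederin, bryostatin, mupirocin, corallopyronin A), where only one or two steps are found.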
Isozymes
Compare fitness of predicted isozymes for: 2.3.1.39
Use Curated BLAST to search for 2.3.1.39
Sequence Analysis Tools
PaperBLAST (search for papers about homologs of this protein)
Search CDD (the Conserved Domains Database, which includes COG and superfam)
Compare to protein structures
Predict protein localization: PSORTb (Gram-negative bacteria)
Predict transmembrane helices and signal peptides: Phobius
Check the current SEED with FIGfam search
Find homologs in fast.genomics or the ENIGMA genome browser
Find the best match in UniProt
Protein Sequence (2409 amino acids)
>GFF52 Polyketide synthase modules and related proteins (Variovorax sp. SCN45)
VSRANPETPSERHAISLALEPELLQRRAQQPATAAQLPDTLFAAAWLLLQSRWLGVLWPV
LHERVGAQALPVSTTVAFDATRPAGEWLAELDAARRAAAHTAAGAEPPSSLWLREATGTA
TPDARLLLWLDEASASLHVDAAAGLMDMASAEHLLAALADTAADLLARPDAALNDIRTVP
GTGRDDQLLRWNTPASRLDTALTVTGLFRRQVNAAPDAIALVEGDARLSYAELDRRCDLI
ARRLQQLGVRSGDSVGLLLDRSLAAVVALIGILKAGGAYVPVPTDFPPERIAYMFGEAQA
RHVVTTQAFRHLVPAEQAVFLLDDTLDNAERGSWNEPAIDGESVAYVMYTSGSTGTPKGI
EICHRSILRLVVGVDYVDLAPGRAMLHAAPLGFDAATLEIWGPLLNGGCCVVHDERVPTG
AGLARTIARHGVHTAWLTAALFNAVVDDDPTHLAGLSQLFTGGEALSVPHVRRALAALPG
LTLSNGYGPTECTTFATTHRIAPTLPADTRSVPLGRPIKDTVLRVLSPAMALLPCGFVGE
LCIGGHGLARGYLRQPELSADRFVPDPFGGPDDRLYRTGDLARWLPDGTIEFIGRRDGQV
KIHGHRIETGEIEAAILAHPEVQSCAVVARPDADGQLRLVAYLVARGRQLSWQALRSHLA
ARLPAALMPAAQVWLAQLPVTPNGKLDRRALPEPAAGRPELSQPYEEPQDAVEQQVCEAF
ARALRIDKVGRNDNFFDLGGDSLLVLQVLAELKRDTALPLSTNLFFRDPTPKAMAARMRP
PVEAPVQPAPSAPRPAAPASEAVALIATAGRFPGAADVEQFWDNLVAGRDTVSFFDDATL
DAGVSEALRRDPAYVRARGVIDGIENFDAAFFGIGPKEAQLMDPQQRVFLEICWECLERA
GYVPDAAPGPVGVYAGMYNATYFQRHVSTRPDLVEPVGEFQVMLANEKDYITTRVANRLN
LTGPAVSVHTACSTSLVAVAHAFHALRTGQCYMALAGGASVTCPPRSGYLYNEGSMLSPD
GHTRSFDAKAQGTVFSDGAAVVLLKRLADAQADGDTIYAVLRSACVNNDGGAKASFTAPS
VDGQAAVIRAALAAAEVDARSISYVEAHGTATPMGDPIEVEALAVAYAEHTDALDYCTLG
SLKSNVGHMVTAAGAAGLIKTALALHHAQIPPTVHFEAPNPAIDFARTPFRVSASLQPWP
RGDLPRRAGVSSFGVGGTNAHVIVEEAPERPASPLATGEQMLPLSARSEAALDIATEQLA
AHLQAHADQPLADVAHTLAVGRKAHAFRRVIVAADTAQAVAALRTADSPWRASGRLASRA
PQPVLMFPGQGAQYAGMGRLLHAADPVFAAAFDDCIDAVVGTTDFDLRERMFSDDPKALS
PTAVTQPALFTIEYALARRLLATGVRPHALMGHSVGEFVAAVLAGVMRLEDAARLVARRG
ALMQALPAGAMLSVRIGAAELMEKLGPSLSLAAENGPTACVAAGPFEAIAALQAALEAEG
IACRTLQTSHAFHSAMMDGAVAPFEALVSQVALHAPQTPIFSTLTGMLLEAEEATSPAYW
ARHLRGAVRFSPAVRAAMAHAAQPLFIEAGPRNTLSTLVRQHGASTAVPLLHGELADEDR
TLRLAVGRLWTCGADVELSRLSSRVGAQRVCLPTSPFERKRFWVDIAAMPAIAAIPETPV
APLSASPAPSATPTPLEPIVTAAATSPLPPSASSPPVDARLRALFEDISGIDMAQAEGHA
PFGELGLDSLTLTQAATQIKKHFKVNLSFRQLMENYRCFDALSAFLRESLPPEAAPVAAA
APAAEAPAVLQPVMPAPLVHAPMQGNVASSPISQLIAQQMELMRQQLALLSGAAPAAPVM
LQAMPAPAVAAPVATPAAEEPVPSKEPLRYDVTKAFGAIARIHTQRTAEPSGRQKARLAA
FMRRYIERTLKSKQFTEANRPHMADPRVVNGFRPVTKEITYQIVIERSKGSKMWDLDGNE
YVDALNGFGMNMFGWQPDFVQEAVRKQLDDGYEIGPQHPLAAEVTALICELTGCDRAGLC
NTGSEAVMAALRIARTVTGRSTVVVFTGSYHGTFDEVLVRAGKGGKGLSAAPGVMSGMFG
DIRVLDYGTPEALAFIRENADDLAAVLAEPVQSRRPDFQPREFLHEVREITARSGCALIF
DEVITGFRTALGGAQELFGVRADLATYGKVIGGGFPVGVIAGKREFMDALDGGPWQYGDD
SIPGVGVTYFAGTFVRHPLALAAAKASLTHLKQAGPALQAGLTSSTGAMASELSAWCKEV
GAPIEIRHFGSLWRVSWLEDHPLQDLLFAMMRSRGVHILDNFPCFLTSAHSAEDIAFIQR
AFKESVAEMQESGFLPRRAAVVTSFDTRRPTEEGSVLARDVDGQPFWYVPEAQSSAQASG
HLVNGKAAA
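
The sequence above is listed in 60-residue chunks, so a pasted copy may carry stray spaces or line breaks. The following is a minimal sketch, not part of the original page, of how such a copy could be cleaned, checked against the stated length (2409 amino acids), and rewrapped as a FASTA record. The function names are hypothetical, and restricting the alphabet to the 20 standard amino acids is an assumption; ambiguity codes such as X, B, or Z would be rejected.

```python
# Assumption: only the 20 standard one-letter amino acid codes are accepted.
AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")


def clean_sequence(text):
    """Remove all whitespace and verify that only amino acid codes remain."""
    sequence = "".join(text.split()).upper()
    unexpected = set(sequence) - AMINO_ACIDS
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return sequence


def wrap_sequence(sequence, width=60):
    """Split a sequence into fixed-width lines; the last line may be shorter."""
    return [sequence[i:i + width] for i in range(0, len(sequence), width)]


def to_fasta(header, text, expected_length=None, width=60):
    """Build a FASTA record, optionally checking the residue count."""
    sequence = clean_sequence(text)
    if expected_length is not None and len(sequence) != expected_length:
        raise ValueError(
            f"expected {expected_length} residues, found {len(sequence)}"
        )
    lines = [">" + header.lstrip(">")] + wrap_sequence(sequence, width)
    return "\n".join(lines)


if __name__ == "__main__":
    # Only the first two 60-residue chunks of GFF52, used as a short fragment.
    fragment = (
        "VSRANPETPSERHAISLALEPELLQRRAQQPATAAQLPDTLFAAAWLLLQSRWLGVLWPV "
        "LHERVGAQALPVSTTVAFDATRPAGEWLAELDAARRAAAHTAAGAEPPSSLWLREATGTA "
    )
    print(to_fasta("GFF52 (first 120 residues)", fragment, expected_length=120))
```

For the full record, passing the complete sequence text with `expected_length=2409` would confirm that no residues were lost or duplicated during copying.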