Protein Info for Rv2048c in Mycobacterium tuberculosis H37Rv
Annotation: Polyketide synthase Pks12
These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.
Protein Families and Features
Best Hits
Predicted SEED Role
"Malonyl CoA-acyl carrier protein transacylase (EC 2.3.1.39)" in subsystem Fatty Acid Biosynthesis FASII or mycolic acid synthesis (EC 2.3.1.39)
MetaCyc Pathways
- superpathway of fatty acid biosynthesis II (plant) (34/43 steps found)
- mycobactin biosynthesis (10/11 steps found)
- superpathway of fatty acid biosynthesis initiation (5/5 steps found)
- superpathway of fatty acids biosynthesis (E. coli) (39/53 steps found)
- fatty acid biosynthesis initiation (type II) (3/3 steps found)
- fatty acid biosynthesis initiation (mitochondria) (3/4 steps found)
- superpathway of fatty acid biosynthesis I (E. coli) (11/16 steps found)
- fatty acid biosynthesis initiation (plant mitochondria) (1/4 steps found)
- pederin biosynthesis (2/14 steps found)
- bryostatin biosynthesis (2/19 steps found)
- mupirocin biosynthesis (1/26 steps found)
- corallopyronin A biosynthesis (2/30 steps found)
KEGG Metabolic Maps
Isozymes
Compare fitness of predicted isozymes for: 2.3.1.39
Use Curated BLAST to search for 2.3.1.39
Sequence Analysis Tools
PaperBLAST (search for papers about homologs of this protein)
Search CDD (the Conserved Domains Database, which includes COG and superfam)
Compare to protein structures
Predict protein localization: PSORTb (Gram-negative bacteria)
Predict transmembrane helices and signal peptides: Phobius
Check the current SEED with FIGfam search
Find homologs in fast.genomics or the ENIGMA genome browser
Find the best match in UniProt
Protein Sequence (4151 amino acids)
>Rv2048c Polyketide synthase Pks12 (Mycobacterium tuberculosis H37Rv) MVDQLQHATEALRKALVQVERLKRTNRALLERSSEPIAIVGMSCRFPGGVDSPEGLWQMV ADARDVMSEFPTDRGWDLAGLFDPDPDVRHKSYARTGGFVDGVADFDPAFFGISPSEALA MDPQHRMLLELSWEALERAGIDPTGLRGSATGVFAGLIVGGYGMLAEEIEGYRLTGMTSS VASGRVAYVLGLEGPAVSVDTACSSSLVALHMAVGSLRSGECDLALAGGVTVNATPTVFV EFSRHRGLAPDGRCKPYAGRADGVGWSEGGGMLVLQRLSDARRLGHPVLAVVVGSAVNQD GASNGLTAPNGPSQQRVVRAALANAGLSAAEVDVVEGHGTGTTLGDPIEAQALLATYGQD RGEPGEPLWLGSVKSNMGHTQAAAGVAGVIKMVLAMRHELLPATLHVDVPSPHVDWSAGA VELLTAPRVWPAGARTRRAGVSSFGISGTNAHVIIEAVPVVPRREAGWAGPVVPWVVSAK SESALRGQAARLAAYVRGDDGLDVADVGWSLAGRSVFEHRAVVVGGDRDRLLAGLDELAG DQLGGSVVRGTATAAGKTVFVFPGQGSQWLGMGIELLDTAPAFAQQIDACAEAFAEFVDW SLVDVLRGAPGAPGLDRVDVVQPVLFAVMVSLAELWKSVAVHPDAVIGHSQGEIAAAYVA GALSLRDAARVVTLRSKLLAGLAGPGGMVSIACGADQARDLLAPFGDRVSIAVVNGPSAV VVSGEVGALEELIAVCSTKELRTRRIEVDYASHSVEVEAIRGPLAEALSGIEPRSTRTVF FSTVTGNRLDTAGLDADYWYRNVRQTVLFDQAVRNACEQGYRTFIESSPHPALITGVEET FAACTDGDSEAIVVPTLGRGDGGLHRFLLSAASAFVAGVAVNWRGTLDGAGYVELPTYAF DKRRFWLSAEGSGADVSGLGLGASEHPLLGAVVDLPASGGVVLTGRLSPNVQPWLADHAV SDVVLFPGTGFVELAIRAGDEVGCSVLDELTLAAPLLLPATGSVAVQVVVDAGRDSNSRG VSIFSRADAQAGWLLHAEGILRPGSVEPGADLSVWPPAGAVTVDVADGYERLATRGYRYG PAFRGLTAMWARGEEIFAEVRLPEAAGGVGGFGVHPALLDAVLHAVVIAGDPDELALPFA WQGVSLHATGASAVRARIAPAGPSAVSVELADGLGLPVLSVASMVARPVTERQLLAAVSG SGPDRLFEVIWSPASAATSPGPTPAYQIFESVAADQDPVAGSYVRSHQALAAVQSWLTDH ESGVLVVATRGAMALPREDVADLAGAAVWGLVRSAQTEHPGRIVLVDSDAATDDAAIAMA LATGEPQVVLRGGQVYTARVRGSRAADAILVPPGDGPWRLGLGSAGTFENLRLEPVPNAD APLGPGQVRVAMRAIAANFRDIMITLGMFTHDALLGGEGAGVVVEVGPGVTEFSVGDSVF GFFPDGSGTLVAGDVRLLLPMPADWSYAEAAAISAVFTTAYYAFIHLADVQPGQRVLIHA GTGGVGMAAVQLARHLGLEVFATASKGKWDTLRAMGFDDDHISDSRSLEFEDKFRAATGG RGFDVVLDSLAGEFVDASLRLVAPGGVFLEMGKTDIRDPGVIAQQYPGVRYRAFDLFEPG RPRMHQYMLELATLFGDGVLRPLPVTTFDVRRAPAALRYLSQARHTGKVVMLMPGSWAAG TVLITGGTGMAGSAVARHVVARHGVRNLVLVSRRGPDAPGAAELVAELAAAGAQVQVVAC DAADRAALAKVIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRVDVVLRSKVDAAWHLHE LTRDLDVSAFVMFSSMAGLVGSSGQANYAAANSFLDALAAHRRAHGLPAISLGWGLWDQA SAMTGGLDAADLARLGREGVLALSTAEALELFDTAMIVDEPFLAPARIDLTALRAHAVAV PPMFSDLASAPTRRQVDDSVAAAKSKSALAHRLHGLPEAEQHAVLLGLVRLHIATVLGNI TPEAIDPDKAFQDLGFDSLTAVEMRNRLKSATGLSLSPTLIFDYPTPNRLASYIRTELAG LPQEIKHTPAVRTTSEDPIAIVGMACRYPGGVNSPDDMWDMLIQGRDVLSEFPADRGWDL AGLYNPDPDAAGACYTRTGGFVDGVGDFDPAFFGVGPSEALAMDPQHRMLLELSWEALER AGIDPTGLRGSATGVFAGVMTQGYGMFAAEPVEGFRLTGQLSSVASGRVAYVLGLEGPAV SVDTACSSSLVALHMAVGSLRSGECDLALAGGVTVNATPDIFVEFSRWRGLSPDGRCKAF AAAADGTGFSEGGGMLVLQRLSDARRLGHPVLAVVVGSAVNQDGASNGLTAPNGPSQQRV VRAALANAGLSAAEVDVVEGHGTGTTLGDPIEAQALLATYGQDRGEPGEPLWLGSVKSNM GHTQAAAGVAGVIKMVLAMRHELLPATLHVDVPSPHVDWSAGAVELLTAPRVWPAGARTR RAGVSSFGISGTNAHVIIEAVPVVPRREAGWAGPVVPWVVSAKSESALRGQAARLAAYVR GDDGLDVADVGWSLAGRSVFEHRAVVVGGDRDRLLAGLDELAGDQLGGSVVRGTATAAGK TVFVFPGQGSQWLGMGMGLHAGYPVFAEAFNTVVGELDRHLLRPLREVMWGHDENLLNST EFAQPALFAVEVALFRLLGSWGVRPDFVMGHSIGELSAAHVAGVLSLENAAVLVAARGRL MQALPAGGAMVAVQAAEEEVRPLLSAEVDIAAVNGPASLVISGAQNAVAAVADQLRADGR RVHQLAVSHAFHSPLMDPMIDEFAAVAAGIAIGRPTIGVISNVTGQLAGDDFGSAAYWRR HIRQAVRFADSVRFAQAAGGSRFLEVGPSGGLVASIEESLPDVAVTTMSALRKDRPEPAT LTNAVAQGFVTGMDLDWRAVVGEAQFVELPTYAFQRRRFWLSGDGVAADAAGLGLAASEH ALLGAVIDLPASGGVVLTGRLSPSVQGWLADHSVAGVTIFPGAGFVELAIRAGDEVGCGV VDELTLAAPLVLPASGSVAVQVVVNGPDESGVRGVSVYSRGDVGTGWVLHAEGALRAGSA EPTADLAMWPPAGAVPVEVADGYQQLAERGYGYGPAFRGLTAMWRRGDEVFAEVALPADA GVSVTGFGVHPVLLDAALHAVVLSAESAERGQGSVLVPFSWQGVSLHAAGASAVRARIAP VGPSAVSIELADGLGLPVLSVASMLARPVTDQQLRAAVSSSGPDRLFEVTWSPQPSAAVE PLPVCAWGTTEDSAAVVFESVPLAGDVVAGVYAATSSVLDVLQSWLTRDGAGVLVVMTRG AVALPGEDVTDLAGAAVWGLVRSAQTEHPGRIVLVDSDAPLDDSALAAVVTTGEPQVLWR RGEVYTARVHGSRAVGGLLVPPSDRPWRLAMSTAGTFENLRLELIPDADAPLGPGQVRVA VSAIAANFRDVMIALGLYPDPDAVMGVEACGVVIETSLNKGSFAVGDRVMGLFPEGTGTV ASTDQRLLVKVPAGWSHTAAATTSVVFATAHYALVDLAAARSGQRVLIHAGTGGVGMAAV QLARHLGLEVFATASKGKWDTLRAMGFDDDHISDSRSLEFEDKFRAATGGRGFDVVLDSL AGEFVDASLRLVAPGGVFLEMGKTDIRDPGVIAQQYPGVRYRAFDLFEPGPDRIAQILAE LATLFGDGVLRPLPVTTFDVRCAPAALRYLSQARHTGKVVMLMPGSWAAGTVLITGGTGM AGSAVARHVVARHGVRNLVLVSRRGPDAPGAAELVAELAAAGAQVQVVACDAADRAALAK VIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRVDVVLRSKVDAAWHLHELTRDLDVSAF VMFSSMAGLVGSSGQANYAAANSFLDALAAHRRAHGLPAISLGWGLWDQASAMTGGLATV DFKRFARDGIVAMSSADALQLFDTAMIVDEPFMLPAHIDFAALKVKFDGGTLPPMFVDLI NAPTRRQVDDSLAAAKSKSALLQRLEGLPEDEQHAVLLDLVRSHIATVLGSASPEAIDPD RAFQELGFDSLTAVEMRNRLKSATGLALSPTLIFDYPNSAALAGYMRRELLGSSPQDTSA VAAGEAELQRIVASIPVKRLRQAGVLDLLLALANETETSGQDPALAPTAEQEIADMDLDD LVNAAFRNDDE