Protein Info for GFF53 in Variovorax sp. SCN45

Annotation: Polyketide synthase modules and related proteins

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

Domain map (protein length: 1678 amino acids)

transmembrane
    amino acids 1339 to 1361 (23 residues), see Phobius details

PF00668: Condensation
    amino acids 24 to 462 (439 residues), 279.7 bits, E = 8.4e-87
    amino acids 1090 to 1523 (434 residues), 252.3 bits, E = 1.7e-78

PF00501: AMP-binding
    amino acids 490 to 839 (350 residues), 290.3 bits, E = 4e-90

TIGR01733: amino acid adenylation domain
    amino acids 508 to 913 (406 residues), 434.5 bits, E = 1.8e-134

PF13193: AMP-binding_C
    amino acids 897 to 975 (79 residues), 25.5 bits, E = 4.2e-09

PF00550: PP-binding
    amino acids 1005 to 1065 (61 residues), 45.7 bits, E = 1.3e-15
    amino acids 1571 to 1632 (62 residues), 36.2 bits, E = 1.2e-12
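
The domain coordinates listed above can be sanity-checked and put in N-to-C-terminal order with a few lines of Python. This is only a minimal sketch: the coordinates are copied from the listing, while the names DOMAINS, span_length, and domain_order are hypothetical helpers introduced here for illustration and are not part of any tool on this page.

```python
# Domain hits copied from the listing above as (name, start, end).
# Coordinates are 1-based and inclusive, as on the page.
DOMAINS = [
    ("PF00668 Condensation", 24, 462),
    ("PF00668 Condensation", 1090, 1523),
    ("PF00501 AMP-binding", 490, 839),
    ("TIGR01733 amino acid adenylation domain", 508, 913),
    ("PF13193 AMP-binding_C", 897, 975),
    ("PF00550 PP-binding", 1005, 1065),
    ("PF00550 PP-binding", 1571, 1632),
    ("transmembrane", 1339, 1361),
]


def span_length(start, end):
    """Number of residues in a 1-based inclusive interval."""
    return end - start + 1


def domain_order(domains):
    """Return the hits sorted by start position (N-terminus first)."""
    return sorted(domains, key=lambda hit: hit[1])


# Print each hit with its residue count, in order along the protein.
for name, start, end in domain_order(DOMAINS):
    print(f"{start:>5}-{end:<5} {span_length(start, end):>4} aa  {name}")
```

Sorting by start position makes it easier to see how the condensation, adenylation, and PP-binding hits are arranged along the protein. Note that some hits overlap (for example, the transmembrane prediction falls inside the second Condensation hit), so the sorted list is an ordering of hits, not a partition of the sequence.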

Best Hits

KEGG orthology group: None (inferred from 78% identity to vpe:Varpa_2887)

Predicted SEED Role

No annotation

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (1678 amino acids)

>GFF53 Polyketide synthase modules and related proteins (Variovorax sp. SCN45)
MNAVLRPATGNGTGPAAGGIIECVIPTTESQREVWLGAMLSPEASLAYNESVLLRLHGPL
NARALGLAMAELVERHQSLRATIAPDGGCMLVGQAPAEPMGMMDLAALDASSREQVLESA
RNAAVCTPYSLEHGPLFRAVLYRLGEDEHELVMSAHHVVCDGWSWAVITEQLGHLYAEQI
GQGLRLKAAPSFADFAAEEAAEAAHPDMQQHVDYWLERFSGGTPPVLELPLDHPRPAVRT
FTSLRTERTLDRRLVTAMRSVSTKAGTSLFAGLLGAFVATLHRLTGQDDIVVGIPASGQL
ARDMPGLVGHCVNLLPLRVNAHAHLRFDELMSECGTAVLDAFEHQSLTYGALLGQLSLQR
DASRLPLVSVMFNVDPDVASGTDSFVGLSVKQDTVPRQYENFELFLNLRPLEGGLVIEAQ
YNTGLFDELSVQRWLDMFECVLRSATRDPSEAIGRLEVLSTEASLVLAALQPAPTELLGE
PLAHAGFVARALLQPDRPALRDGARRSSYAALDAQSNRLAHALRERGIGRGQRVGLCLER
GTDMLVALLAVLKSGAAYVPLDPAFPQARLDHYAEDARLGLLLTTSDIAAAPRQWRADAG
LRIFEIDRDTAWHQAPADALEPGEQDAGAEDPAYVIYTSGSTGKPKGVCVPHRAVANLLQ
SMRVEPGIGAMDRIAAVTTLSFDIAVAELLLPLAAGAEIVMVQRETAMDGNRLRALLEEE
DVTILQATPGMWQLLLDAQWPGASGFRGWIGGEPVRPRLALELLERCEQLWNVYGPTETT
VWSTVWNMQRDVVASRGVSIGHPIDNTQVWILDAELRPCPLGVPGEICIGGDGVTLGYHE
RPELTAERFVTARILGHATALYRTGDRGRWRNDGLLEHMGRLDFQVKVRGYRIEPGEIEA
RCNEVAGVSRSVVVAREDNPGDVRLVAYLALAPNAAGAAFDLDALMRHLQASLPAFMLPQ
HVVTLASLPTLPNGKLDRASLPAPQAVPRENMQRGAGPRSDSERKVLAAMEQVLSLPGLD
MKDDFFTMGGHSLLAARLATLLSREFQITLPLRTLFEAPTAERLAVAVEALQGAGVGERV
PLAHRPDRATAPMTPSQERIRFMEELHPGRSVYNAPSAQRLLGEFDAARFESVLREIIRR
QPALRTSMGTDPVSGQSVQSIAQSVDFSLPVIDLRDLPADQREAELAEQMQELADRPIDI
HRAPMAHAALFRLDERDHAFVFVPHHLVWDGWSFDLMQRELSALYDAAERGRPHGLPAIA
TTHGDYAEWLAQWMQEPAFDEQIGFWRKRFAAAPLPRLPNTDMPRRAGKSGQGGTQWIQI
DLATTQRLREVARGMDVTLSMLTFGLYALTMACTIGSDSIVIANPVRGRQQPETEDVMGV
FNNVLPVSLQVDMRQSLPGFMRYVKEELLTLMNYQQVPFERLMAEPGSGGQAKASGPYQS
MFSFQDARDRSRRLGSLQTRQMHLMQRGATDDIGVWLMDKPGGLEGALIYNADIYLRETG
AQLRERYLELLQRAADRPEGSLAYIASAEGSSSAAYLRKLAADTSRAEAPASAAPGAPKA
KTGLMNPEQAQLAQVWAGVVGIDVNDIRASDNFFDLGGDSLLVLRAVQQTELLLGYRVEP
RRYLFENLGQIAASSTFHPVHTAGPDSIPSELQGLPAAATPPRGGLLGRALGGWLRKG
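
To work with the FASTA record above (for example, to confirm its length or cut out one of the listed domain regions), something like the following sketch may help. It uses only the Python standard library; read_fasta and subsequence are hypothetical helper names, and the toy record holds just the first 20 residues of the sequence so that the example stays short.

```python
def read_fasta(text):
    """Return (header, sequence) for the first record in FASTA-formatted text."""
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                break  # a second record starts here; stop after the first
            header = line[1:]
        elif header is not None:
            chunks.append(line)
    return header, "".join(chunks)


def subsequence(seq, start, end):
    """Extract a 1-based inclusive region, matching the page's coordinates."""
    return seq[start - 1:end]


# Toy record: only the first 20 residues, split over two lines (assumption
# made for brevity; paste the full record to work with all 1678 residues).
toy = ">toy record\nMNAVLRPATG\nNGTGPAAGGI\n"
header, seq = read_fasta(toy)
print(header, len(seq), subsequence(seq, 3, 6))
```

With the full record pasted in place of the toy string, len(seq) should match the 1678 amino acids stated in the section heading, and subsequence(seq, 24, 462) would return the first Condensation region.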