Protein Info for EX28DRAFT_0123 in Enterobacter asburiae PDN3

Annotation: Large extracellular alpha-helical protein

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 1650 signal peptide" amino acids 1 to 23 (23 residues), see Phobius details PF17970: bMG1" amino acids 57 to 161 (105 residues), 157.1 bits, see alignment 7.3e-50 PF21142: A2M_bMG2" amino acids 167 to 285 (119 residues), 168.2 bits, see alignment 3.6e-53 PF11974: bMG3" amino acids 287 to 379 (93 residues), 81.1 bits, see alignment 2.9e-26 PF01835: MG2" amino acids 386 to 477 (92 residues), 73 bits, see alignment 1.2e-23 PF17972: bMG5" amino acids 483 to 608 (126 residues), 134.8 bits, see alignment 1.3e-42 PF17962: bMG6" amino acids 636 to 739 (104 residues), 88.2 bits, see alignment 2.4e-28 PF07703: A2M_BRD" amino acids 757 to 903 (147 residues), 58.6 bits, see alignment E=4.8e-19 PF00207: A2M" amino acids 973 to 1054 (82 residues), 28.5 bits, see alignment 6.2e-10 PF07678: TED_complement" amino acids 1174 to 1290 (117 residues), 35.8 bits, see alignment 2.8e-12 PF21765: CUB_A2MG" amino acids 1440 to 1501 (62 residues), 89.1 bits, see alignment (E = 1.1e-28) PF17973: bMG10" amino acids 1502 to 1635 (134 residues), 88.1 bits, see alignment E=3.1e-28

Best Hits

Swiss-Prot: 78% identical to A2MG_ECOLI: Alpha-2-macroglobulin (yfhM) from Escherichia coli (strain K12)

KEGG orthology group: K06894, (no description) (inferred from 96% identity to enc:ECL_03866)

Predicted SEED Role

"Alpha-2-macroglobulin"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (1650 amino acids)

>EX28DRAFT_0123 Large extracellular alpha-helical protein (Enterobacter asburiae PDN3)
MKPFRLAALSLALLTTFTLVGCDNSDDKSQAAAPAASTASTPKAAEKPNAETLAKLAAQS
QGKALTLLDTSEVQLDGAATLVLTFSVPLNPDQDFAKTVHVVDKKSGKVDGAWELAPNLK
ELRLRHLEPNRNLVVTVERDLQALNKATFGIDYEKALTTRDIEPTVGFASRGSLLPGKVV
EGLPVMALNVNNVDVNFYRVKPESLASLVSQWEYRNSLTNWESDNLLKMAELVYTGRFDL
NPARNTREKLLLPLKDIKPLQQSGVYIAVMNQAGHYNYSNAATLFTVSDIGLSAHRYHNR
LDIFTQSLENGAAQSGVTVTLLNDKGQTLAEANSDADGHAKLETDKEAALILASKDGQTT
LLDLKLPALDLAEFDIAGSPGYSKQFFMFGPRDLYRPGETVILNGLLRDSDGKPLPDQPV
KLDVLRPDGQVARTIVVKPENGLYRFNYPLDSGAQTGMWHIRANTGDNLQRMWDFHVEDF
MPERMALNLSGQKTPVSPQDNVDFNVVGYYLYGAPANGNSLQGQLFLRPLRDAVAALPGF
QFGDIAEENLSRSLDEVQLTLDEQGRGQVSTESQWKEVHSPLQLILQASLLESGGRPVTR
RAEQAIWPAAELPGIRPQFASKAVYDYRTDTTVNQPIVDENGNASFDIVYADASGAKKAV
SGLQVRLIRERRDYFWNWSESEGWQSQFDQKDLVEGEQELTLKADETGKVTFPVEWGSYR
LEVKAPDEMVSSVRFWAGYSWQDNSDGTGAARPDRVTLKLDKPAYQPGDTINLHIAAPSA
GKGYAMIESSEGPLWWKEIDVPANGLDLAIPVDKAWKRHDLYLSTLVVRPGDKSKSATPK
RAVGLLHLPMGDENRRLNIALDNPQKMRPNQTLSVKVKASVKEGAVPQKVNVLVSAVDSG
VLNITDYVTPDPWQAFFGQKRYGADIYDIYGQVIEGQGRLAALRFGGDGDELKRGGKPPV
NHVTIIAQQAQPVTLDANGEGTITLPIGDFNGELRLMAQAWTEDDFGSSESKVVVAAPVI
TELNTPRFLASGDTSRLTLDLTNLTDKPQTLNVALTATGKLSLEGGQPQPVQLSPGARST
LFIPVRALEGYGDGEVTAQVTGLQLPGETFAPQQKSWKIGVRPAFPAQTVNTGAMLNPGE
SWTAPAQHSNGFSPATLQGQLLLSGKPPLNLARYIRELQAYPYGCLEQTTSGLFPSLYTN
AAQLTALGIKGDTDDKRRAAIDIGISRLLQMQREDGGFALWDKNGPEEYWLTAYVTDFLV
RAGEQGYSVPADAVNNANNRLLHYLQDPGMMSIRYSDDTQASKFAVQAYAALVLARQQKA
PLGALREIWDRHEQAASGLPLMQLGMALKLMGDAPRSQQALDLALKTSRNDSKTWMADYG
SQLRDNALMLSLLEEYKLLPDAQNTLLNALSEQAFSQRWLSTQESNALFLAGRSLQTLSG
AWQAKTSLSETASGGDKSLVQNVNGDQLGALQVTNTGTTPLWVRLDSTGYPEYAPQPSAN
VLQVERHILATDGSSKSLSSLKSGELVLVWLEVKASQNVPDALVVDLLPAGLELENQNLA
SSSASLQDSGSEVQNLLSQMQQADIQHMEFRDDRFVAAVPVNEGQPVTLVYLARAVTPGT
YQVPVPMVESMYVPQWRATGAASGPLIVVP