Protein Info for JDDGAC_08085 in Escherichia coli ECRC98

Name: fhaB
Annotation: hemagglutinin

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 100 200 300 400 500 600 700 800 900 1000 1100 1200 1270 signal peptide" amino acids 1 to 34 (34 residues), see Phobius details PF05860: TPS" amino acids 42 to 282 (241 residues), 142.4 bits, see alignment E=2.5e-45 TIGR01901: filamentous hemagglutinin family N-terminal domain" amino acids 64 to 161 (98 residues), 77 bits, see alignment 8.4e-26 PF05594: Fil_haemagg" amino acids 209 to 306 (98 residues), 4 bits, see alignment 0.0093 amino acids 312 to 371 (60 residues), 11.9 bits, see alignment 3.1e-05 amino acids 354 to 434 (81 residues), 13.7 bits, see alignment 8.6e-06 amino acids 646 to 679 (34 residues), 9.9 bits, see alignment (E = 0.00013) amino acids 698 to 765 (68 residues), 24.3 bits, see alignment 4.2e-09 amino acids 789 to 884 (96 residues), 12.7 bits, see alignment 1.7e-05 amino acids 880 to 923 (44 residues), 3.7 bits, see alignment (E = 0.011) amino acids 1076 to 1142 (67 residues), 20.9 bits, see alignment 4.7e-08 TIGR01731: adhesin HecA family 20-residue repeat (two copies)" amino acids 272 to 306 (35 residues), 8.3 bits, see alignment (E = 0.00025) amino acids 289 to 327 (39 residues), 7.5 bits, see alignment (E = 0.00045) amino acids 331 to 363 (33 residues), 10.9 bits, see alignment (E = 3.7e-05) amino acids 384 to 413 (30 residues), 8.8 bits, see alignment (E = 0.00017) amino acids 403 to 434 (32 residues), 10.8 bits, see alignment (E = 4.2e-05) amino acids 428 to 456 (29 residues), 8.6 bits, see alignment (E = 0.0002) amino acids 471 to 506 (36 residues), 19.5 bits, see alignment (E = 7.3e-08) amino acids 510 to 549 (40 residues), 13.7 bits, see alignment (E = 5e-06) amino acids 631 to 671 (41 residues), 6.5 bits, see alignment (E = 0.00093) amino acids 726 to 756 (31 residues), 7.3 bits, see alignment (E = 0.00052) amino acids 852 to 885 (34 residues), 6.3 bits, see alignment (E = 0.0011) amino acids 936 to 968 (33 residues), 7.2 bits, see alignment (E = 0.00053) amino acids 1009 to 1049 (41 residues), 11.3 bits, see alignment (E = 2.8e-05) amino acids 1042 to 1069 (28 residues), 6.9 bits, see alignment (E = 0.00067) amino acids 1102 to 1134 (33 residues), 6.5 bits, see alignment (E = 0.0009)

Best Hits

KEGG orthology group: None (inferred from 100% identity to eok:G2583_1273)

Predicted SEED Role

No annotation

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (1270 amino acids)

>JDDGAC_08085 hemagglutinin (Escherichia coli ECRC98)
VAMINLSKEATVGKALTPIAILMMLSFPVASQAAGLVIKNGTVYNANGVPVVDINKPNGS
GLSHNIWDNLNVDKNGVVFNNSANESSTSLAGNIQGNSNLTSGSAKVILNEVTSKNPSTI
NGMMEVAGDKADLIIANPNGITVNGGGSINTGKLTLTTGTPDIQDDKLAGYSVNGGTITL
GKLDNASPTEILSRNVVVNGKVSADELNVVAGNNYVNAAGQVTGSVSATGSRNGYSVDVA
KLGGMYANKISLVSTEKGVGVRNLGVIAGGVNGVSIDSKGNLLNSNAQIQSASTINLTTN
GTLDNTTGTVTSVGTISLNTNKNTIVNTRAGNISTMGDIYVNSGTIDNTNGKLAAAGMLA
VDTNNATLINSGKGSSVGIEAGLVALKTGTLNNSNGQIRGGYVGLESAALNNNNGDIQTT
GDIAIISNGNVDNNKGLIRSSTGHIVIGAAGSVNNGSTKTADTGSSDSLGIIADTGVEIG
ANNINNNGGQIASNGNVSLSSYSTIDDYAGKILSNSKVIIKGSSLRNDTGGISGKQGIEV
AVGGSLTNNIGVISSEEGDISLLANSVDNHGGFMMGQNITMESMSGVNNNTALIVASKKL
KINARGSIENRDGNNFGNAYGLYFGMPQQTGGMVGKEGIELSGQNIYNNNSRLIAEDGPL
TLQAQNTFDNTRALVTSGADASIQVGGTYYNNYATTWSAGNLDIDATTLQNSSSGTMIDN
NATGFIASDKNLSLEVVNSLTNYGWISGKGDVDVTVNNGNLYNRNTIAAEKGLDIAALNG
IENWKDISAGGDLTMNTNRHVTNNSNSNMVGQNIVINAVNDINNRGNIVSDADLNVTTKG
NLYNYLYMVGYGDIALSANSVANNNATIEATGDLIIDSKGNVGNNRGNLHALNGVLSVKG
NNLNNDNGEIRGYGDVTLALTGNYDSYKGSLTSETGDVTLTANIVDNAYGLIAGENVSVD
AKSTIYNNTALIAANKKLVINAGGNLENRDGNNFLRNNGALFGITDNVGGIVGKEGVTLS
AQNVYNNNSSIIAENGPLNLLSRGTLDNTRALLSSGADAIIRAAGTFYNNYATTYSAGNL
DVYAASLNNASDGRLEDNTATGVIASDKNLDLSVDNSVTNYGWISGKGDVHFNVLKGTLY
NRNAIAADNALTINALNGVENFKDIVAGTALTIDTQKYVTNNSNSNMLGQTIAINAVNDI
NNRGNIVGDYSLGVKTTGNIYNYLNMLSYGVAGVSANKVTNSGKDAVLGGFYGLALEANE
TDNTGTIVGM