Protein Info for RR42_RS32080 in Cupriavidus basilensis FW507-4G11

Annotation: filamentous hemagglutinin adhesin

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 200 400 600 800 1000 1200 1400 1600 1800 2000 2200 2400 2677 transmembrane" amino acids 44 to 70 (27 residues), see Phobius details PF13018: ESPR" amino acids 1 to 46 (46 residues), 46.9 bits, see alignment (E = 4.4e-16) PF05860: TPS" amino acids 84 to 343 (260 residues), 177.3 bits, see alignment E=8.9e-56 TIGR01901: filamentous hemagglutinin family N-terminal domain" amino acids 121 to 216 (96 residues), 50.3 bits, see alignment (E = 1.8e-17) TIGR01731: adhesin HecA family 20-residue repeat (two copies)" amino acids 347 to 386 (40 residues), 14.8 bits, see alignment (E = 2.2e-06) amino acids 435 to 471 (37 residues), 19.1 bits, see alignment (E = 9.8e-08) amino acids 508 to 537 (30 residues), 10.1 bits, see alignment (E = 6.7e-05) amino acids 517 to 557 (41 residues), 13.9 bits, see alignment (E = 4.2e-06) amino acids 538 to 576 (39 residues), 11.1 bits, see alignment (E = 3.3e-05) amino acids 566 to 597 (32 residues), 17.2 bits, see alignment (E = 3.9e-07) amino acids 598 to 637 (40 residues), 30.9 bits, see alignment (E = 1.9e-11) amino acids 653 to 686 (34 residues), 9.1 bits, see alignment (E = 0.00014) amino acids 687 to 723 (37 residues), 16.1 bits, see alignment (E = 9.1e-07) amino acids 774 to 812 (39 residues), 6.7 bits, see alignment (E = 0.00077) PF13332: Fil_haemagg_2" amino acids 1606 to 1747 (142 residues), 35.5 bits, see alignment 1.7e-12 amino acids 1768 to 1873 (106 residues), 25.6 bits, see alignment (E = 1.8e-09) amino acids 1910 to 2083 (174 residues), 155.6 bits, see alignment 2e-49

Best Hits

Predicted SEED Role

"Putative large exoprotein involved in heme utilization or adhesion of ShlA/HecA/FhaA family"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Search structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

See A0A0C4YE13 at UniProt or InterPro

Protein Sequence (2677 amino acids)

>RR42_RS32080 filamentous hemagglutinin adhesin (Cupriavidus basilensis FW507-4G11)
MNKNIHRLVFNAARGMLVAAPEHAAGRGKGRQGNAGRGGQGSVLVPAFVPAFVPVLWLTA
LTASMMGLPVLLDQSAHAQSLPIQVDKQVPGQRPVVGVAANGVPVVNIPPPSRNGGTSVS
HFIQYNVGPSGVVLNNSGANSQTRIAGWVQGNTMLGNGHASNIVVQVTQPNPSRLLGAQE
IAGNRANLILANPAGLHCSGCGAINADRFTLSTGRPVFGADGSLAGFDVRGGNIGIDGQG
LSSPQAQVDLLARSIQVNAQLWATHLNVVAGANQVGYETLNAAAQAGNGPAPQFALDASA
LSAMYSGAVRLIGTERGVGFHLGGGVTARSGDIVVDNNGDVRIVPGGRLQAEGSASLAGA
SIDNAGTITTRGGIAAATPGALSNSGTLAAGTDLLARAERIANSGTIGAGVDANASVTGA
GTAHLAARTAIQSHGTIVSGANTRLSAPTLDLSNGKVVAHDAAYLSAAGDIRHQDARLEA
NAVQVDAGGTVDNRGGAIVVGTHGGAVQGANILNQDGAITSSGTLGVDAAQAVDNTGGTI
AGTRATTVAAAGTVNRGGMIASVQDQLRMTGTLDNTSGKIVAGGPLDMTGDSLANTGGQV
ASNRDATLRIAATLDNTAGFIHAGGTLDAQAGAILNRDTLHGTGAAPKGMEGASVRLSAH
AIDNAQGALRADASLDANAATLGNTGGEVASGGTAQLDVQAITNTGGLLAANQALGIRGN
SLTGDGSLQSHTGDVTAILQTDFHNTGTVTAARDLNLLIEDAQGNATSDITNAGKLLAGR
DADVHGRNVTNAGELSGRASSTVRADGAVLNKGLIDGGAVRILAGTTVTNIDRIYGDSVA
IGAGQQLLNDINAATGQGAVIASRSGGVELGAPEIVNREHALVYSSQDLNVGRALDASGK
ATGQADALTNASATIEVAGSANFNVAALANRNDHFRTERTDTGTVHAFYYRLDGSAEKID
PSTAMLVLPTASGAIYGHIASRGDDLAYLGDDDYKRLLLPSARYPFSVFGPPFDWARQAD
GTPGQSDSRSSGPNWPVGLAMLPFAPDAHGSLEADHFVYKADDPIWDKFGMARPGALPPV
PSPCPDGSPAACQTQYRAALLAYHAAKAASKPLYEALNGKIQAFNSDYQGRLVKSWTIER
RETRVQDEVVVHTDPARILVGGNASFTGAVTNDKSQILVGGTLAAGGASGGVNNLAHDAI
RIEDTTGTSQYTRPCGRDRCYDTMGVPAVHLEIPTQLAKGEALDYQVVNGTGTIVADKTG
TGPIATAPTLTQIPLGTGAAGLLGGDVIRAVSPSLAMPDNALFKIRGEPGTPYLIETDPR
FTNQRQWLSSDVMLTQLGQDPTHVLKRLGDGYYEARLVADAVMLGTGQRFVGDYTDNEAQ
YKALMQSGVAFGKQFHLNVGTALTADQMRALTTDIVWLVEKTVTLKDGTTQRVLVPQVYL
KVRAGDLKGDGTLMAARDTVIQTDGDVKNTGTIASRNVTLMDAANIHNSGALSASVMGLS
ARQDINNLAGHITTGQLAAMAGRDINLTTTTASGAGGSGDLAGSRAVIAGISSVAADNAT
FVAGRDINTKAANIQATGSLGMAAGRDIHLGTVEVSESERLYKDAKNNAAVSRSTEIGTQ
VTAGKNTTLSAGRDVNATAAYVSADGALAVGAGRDINVKAGQASSSVRDEHAYQESGFLS
KKSTHAIDANSQTEALGSTFSGDTVAMAAGRDLAVAGSTVAATHDVNLGAGRDLTIGTTE
TTSSAHTYKEETKSGFGATGSGLSYGKRAQKDTTNDNGTRQVGSLVGSTDGSVRMNAGST
LTVTGSDLIARQDITGVGADVNIEAAQNRQHHDETHEVKQSGFTLGVSGGAIGAAIGAGN
KINAASQSQDGRASALWGIAAARDIADTGSALAQAGGNPLSGAAVTLSWGTSQSKQTMTQ
DATSHNGANVSAGGTVAFVATGVDANGNQTAGNLNITGSNIDANKVALGARHDINIVSAT
DTNESHSTNKSSSASVGVSYGLGPDNAGFGVSASASKAKGNSDSRGVSQTNSHVSGRESV
TLLSGNDTNILGATLSGGKVIADVGGNLNIASRQDTEETRARQESISGGFSISQGGGSAS
VSASKGKASGSYANVSEQSGIHAGKDGFDLTVKGNTDLKGAVIASEAGKDKNNLTTGTLT
WTDIANHSDYSATSAGLSAGGAFGAPSGQSNSGQASGKNTGGINPMIPQHESGSQDGLAR
SAIAEGTITINDQASQKQDLASLKRDTAGTNSTVGKNPDLNQVLNKQSDMMAAAQAAGEA
IAKTVGDLAGAKEKQARANAKAARDAGDTQLESKYLADAQNWEEGGNYRAALHAAGGALV
AGLGGGNALAGAAGAGVSSLAAPKLQALGNAVSESIDTGNAGLNEALGNVAANIAAGGVG
AIVSGGSGAATAANVDRFNRQLHPDEKAKILAQANGNKAEEDRLTKAACYVVQCWAEYSV
GSAEYNKNYVSSVEASGLTTEIQWVKLQKQAGLFDYTLAQRVGDAVKSDPIGVAKDGLKA
VTGAVTALTGGTICGTSLVGCAAGGPMAAFGTSDAVEGVTGLTNAMLGYRSGGFNPIKYG
LNEVMPVWGNTAYDGLNLTFNLLAMRAQVPLKLGLADGLARPSSIFGATTSKFNNPKLVP
IINWSLPNGTAQNALLMGAGAKTIGLVNDVNNARPEK