Protein Info for PS417_11315 in Pseudomonas simiae WCS417

Annotation: hemagglutinin

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

Feature map (scale: amino acids 1 to 4190)

signal peptide: amino acids 1 to 38 (38 residues), see Phobius details

PF05860 (TPS): amino acids 140 to 405 (266 residues), 89.8 bits, see alignment (E = 2.9e-29)

TIGR01901 (filamentous hemagglutinin family N-terminal domain): amino acids 163 to 237 (75 residues), 72.2 bits, see alignment (E = 1.3e-24)

PF12545 (DUF3739): amino acids 3981 to 4091 (111 residues), 137.2 bits, see alignment (E = 3.2e-44)
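The feature coordinates above are 1-based and inclusive, so each reported residue count should equal end - start + 1. The following is a minimal sketch of that check in Python; the function names (`span_length`, `check_features`) and the `FEATURES` table are illustrative and not part of any tool linked from this page. The coordinates and counts are copied from the feature list.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    if start < 1 or end < start:
        raise ValueError(f"invalid range: {start}..{end}")
    return end - start + 1


# (name, start, end, reported residue count), copied from the feature list.
FEATURES = [
    ("signal peptide", 1, 38, 38),
    ("PF05860 (TPS)", 140, 405, 266),
    ("TIGR01901", 163, 237, 75),
    ("PF12545 (DUF3739)", 3981, 4091, 111),
]

PROTEIN_LENGTH = 4190  # amino acids, as stated on this page


def check_features(features, protein_length):
    """Return a list of human-readable problems; empty if all features are consistent."""
    problems = []
    for name, start, end, reported in features:
        if end > protein_length:
            problems.append(f"{name}: end {end} exceeds protein length {protein_length}")
        if span_length(start, end) != reported:
            problems.append(
                f"{name}: {start}..{end} spans {span_length(start, end)} residues, "
                f"but {reported} were reported"
            )
    return problems


print(check_features(FEATURES, PROTEIN_LENGTH))  # prints []
```

An empty list means every feature lies within the 4190-residue protein and its reported length matches its coordinates.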

Best Hits

Predicted SEED Role

"Filamentous haemagglutinin family outer membrane protein associated with VreARI signalling system"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Search structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

See A0A1N7UGN1 at UniProt or InterPro

Protein Sequence (4190 amino acids)

>PS417_11315 hemagglutinin (Pseudomonas simiae WCS417)
MVRCKTPVNLKAPVTVLRLKPLAQAIALLMVAGNAQAATAFSSGWFADKGASQASTAART
GAGQVPGIPSLNQQAQAKQQLARSISTLNTSVAAIAAQQAAQAAGRQAAFGQVSNIPDGL
GKGGLQVDNSLTQGWTNAKGPIQSQSGGKTTVTIEQTADKAILNWETFNVGRNTTVDFQQ
QSTWAVLNRVNDPNARPSEIQGQINGAGTVMIMNRNGVVFSGTSQVNVRNLVAAAATMTD
DQFTQRGLYVDTNGTQPTFTDAAGKVEVQRGALIQTHTAATSTDSGGYALLLGSEVENAG
TIITAKGQATLAAGDSFYIRKGVGTAGNDRSTTRGNEVATSLKAGSTAGNVTNSGLIMAS
TGDITLTGHQVQQNGVALASTSVDTRGTIHLLNAASDTRGSVTLGEGSTTAVLLDASGST
ALNSQKDNGLIKLDGTPANLITGQFNNLSAVVDRTDQSRIEIVSGGTVDFQKGSITLATG
GQVAVSAGQRSLVRDGAMIDVSGAIGVKVAMESNAIKINVQGNEQRDAPVNRDGGQLINN
DVWVDLRELVFVPAGTNGYATDRWYTAGGLLEVGGYLGTQGHSVGEWMAQGGTVTFTGKE
AVTQKGAQINLSGGTVDVQAGYIQQTWLKGPDGRLYEVSKAPGDILYSGFYKGYEDTSVR
WGQTESYYNPLIAPQRRYEAGYTVGRDAGKLVIGTSSAVLEGQVISDVFQGDRQTQAPTI
NLDGYQQSQKARAQRAQLIIGQYIPVYNKTTGTLRYALSGTSDQVLIDSNTQKIADGLDL
GTVLPSDRQGKILLDSDQLNGARLGAVKVAAKQQVNVNGALSVADGGDITLFGPGVQINA
NLTAHGGSVNAGNILSQVDPLKAGEVTDTILAGGSNVNVAAGVKLDTSGRWNNLALDPAA
SAGVAYVNGGKVSLRSASNVNLAAGSVVDVSSGGTLGVDGKLTGGKGGNGTLGALGALAL
EGDVRGYGVNGGGTLALQAHKVQIGESTNAVGSDTLKLAGDFFNKGFSAYDITGNEGLIV
TDGTQVDVSLPVFRLGAQASGMPTGSDPATALERWTPPLYQEDAAKGVLTQRRGAGVSLT
AGHVNSTAAQLATTALTVGQGAVIHVDPGQAINLRSVGQLTMDGTLNAWGGNVSLGGLTT
GASEAVNAIGHGRSIWVGEQAVIDVSARAATAVDNRGRTYGLVRDGGSLNIGGTFDLNTG
LAKASDLFVVVRNGARLEASGTQALINVPGQGAVQAASNGGSITLASNNGLYLEGRFVAR
AGGAGAAGGSLNVALDTPYYQKASVTDRVLKVRELVLGQAHQPTALAGTAEDAAASLAYG
QGRLGVDQVSAGGFDNLALLSNGLLSFDGDVTLNMGQSLRLYSRGYGLSDNAAGNSRINL
SAPYLLLAGFLAPEETGKDLYVRPVPVLPAPSQQATEAQLTATGSLIDVRGNVIFGNRSH
VFRADNSQASLERRGFDQVNLTSQGDLRFLAGAAENDIITGVSTQLLTQGDMTLRAAQLY
PATEVSARVIAGYRGQSTTTSLDLDFDPLRTLTIGRVGSADAPIPYSAFGRLQLGAATVR
QGGVVRAPLGLIDIGSMGSSVVQLLPGSLTSVSGKGLVLPYGGTLDGQVYKYNGKTVKFI
GQGGVVTREGDLNVGVILGGKTISVLPDATLDLSGGGDLLGAGFVSGRGGSTDARYNPLV
QFGANGGFVLPGLSTNPIYAIVPGIQPGAAPVAAEGGAVTPLVGQQITIGAGVPGLPAGT
YTLMPSTYALMPGAFRVEINGLAGQGSGTATTLMRNGSWATSGHLSIAHTGISNSVASQL
TLTSADTLRRYSDYNETRYAQFARADAARLGVPRPMLEADAKTLKLALRPGDGAEAFSFQ
GIGRFGAASGGYGGTVAVIDRSTQGIEVVGAGQSATAGFAGITFNADSLNALGASRLVVG
GMTVVEYGQSGNFIYLANGVNRADSNVILRNGATLAAPEVFLVSNTGSIIVEQGASINTL
GRGNASYDARDGYIYQVANTLAVSNGLLNVISAPKVGSVISQGINIGACSGGVCGGQTGL
YSNGSIVALTDSTFQLGEQVRYGTRHLNLGVSTLNLGSAEALTAAAAANRLPAGLTLSQA
LLDRLLRGDTQFGAPALETLQLSARDGFNFYGNAVLDTYDPLTGKSLLSNLMLSTPALYG
AGGANDVASIHTANLIWLGAITAPGAVVAGGAGTGSGRLDVNAERIEFGYGAFGQPSNVK
SFDRLALGFANVTLNASERITANHKGSLSVYQRQGAYDPVSGFQYSGGNLTIRTPLMTGE
AGSVNVIKAGGAIDIGASAGPRGTASGQGAELSLQGESIRVASAVVLPSGKVSLSARGDV
MLTDDALIDVAGRAIPFNDVTQYGWGGDVLLDSRSGNILQAAGSTIDLSAKHNQAGKLRA
VATDAAGGIVDLQGKILGSSSGHYLAGGTSIPYLAGTVEIQAQRLGSSGALDQQFADLNQ
RLNNGEVFGARSFQLKQGDLTIGNGLKAGTISVSVDNGSLRVKGLVDASGERVGSINLAG
KNGLTLDSTAVLDAHGSKLRVDSYGKIIDAPNRASVVLGSGTGQLTLADGVRIDLRHGTD
GAAGNDGRNRGTLALNAPRLFNPDGSSNDINIDASGRLTVQGARSITLNGMATYDDAPEK
TDPTASGRPYQEITQGYLDGKHALSEAFINAAKNNLNLLNNKLAGLNNATYADAFHLRPG
VEIVSKTPDGDLVVQGDLDLSGYRYASLNPNSQKDNSIYGSGESGSLTLRAGGDLNIYGS
INDGFAKPPVSEDDKGWVLLPGVDFSGGNIVVPGNGVTLADGTAFPAGATLNYDLPIKGL
TVAAGTRLPVVATLAQDLVLPAGSVLSADVRDGSGNLLFAAGTLLSQAQTLVAGTQLGAG
MLLTEGTRLQAMTWPKGVPLPTVHGATLANNSVLLSGNKALPKGGLIPAGTDVKLPNGAT
SVQLRVGVSGAAGQVWAISPMLAEGTQSWSLRLVAGADTEAADSRILQAHPRSGEILLAD
NHYGMFAKEMPAKGVQTWTAQAVIDMGFAGVVVKEGDPIDQAVLDGFGLGSIEDFCAATP
EYCKPKVTYVFTQYAADDLANNGINVVVGTEMTDAFFQSIGYPSVEEVCATAPDYCLLKA
DKEYAAVPSSTRFSVIRTGTGDLELLSAGSLSMDSLYGVYTAGTSSTGTFAGDPYNLPKA
KPAGADPAAKTVLNDTGSFFEKFVNGSPTSIYRAWYPEAGGNLTLTVGGDLTGNITSVTG
LNPARPNPRDAGVDSAAVGNWLWRQGSSDVATGGQAQPTAWWINFGSYTATTAADKLLGF
TGFGALGGGNVDVDVSGNAGVLNTLGYSTQHVEANPHSQGLVLAVGSTGRVASDGSLQLT
GGGDLRVRIGGALNPASVADSGDHRDGVLVNLRGDVQLRSATVGSVDVMYGSVAGVQSPG
ETRAYDPLVATRGVARGGMTLVPGDATFNLQTLGDLVVQNVEDPGRVVLSHLSPFAQGAE
QGTGTSWFTLWTKGTSIDLFSLGGNLTPFSRSDATADLAVVYPSIVRATAASGSLYYGKA
AQAQANGDNNYTYPLLLAPGENGQLQFMAQDSIYAGGMTVSQSGTAQSTTATPWHPAYQG
RIGNAVKASNLSVNGNSGSQPLFVFGPGTAALSPGGSAEPARFYAIDGDLVGVSSGRVIA
FGTTVGSLHEGQTWYEGNGSVWMMAGRDIVSSGRRFGQDSLAVAGMRTQDNLIIHDNPND
ISVVSAGRDILYSNFQVAGPGLLELSAGRNILLSGSGALGESSVTSLGPLLAGDNRPGAS
IVMQAGVGPHGPDYLRFVEAYLNPVNLAQPGESIQGNKVAKTYENELVTWLAERFGFVGD
TAQARAYYVALPAEQQRVFARDLYFAELKAAGREYNTDGSVRQGSYLRGRAAIAALFPDK
DVAGNPIAYTGDITLFGGSGVHTNFGGSIQMLTPGGSQTFGIEGEAPPSTAGVITQGAGD
IQLYSMGSILLGQSRIMTTFGGSIMGWTSEGDINAGRGSKTTVVYTPPKRVYDNWGNVAL
SPSVPSTGAGIATLNPIPEVPAGDIDLIAPLGTIDAGEAGIRVSGNVNIAALRVVNAANI
QSQGKSSGVPVSAAVNTNAMTSASAAGAAASQAAEDAARSQQAAAKQGRPSIMTVEVLSL
GNEPLPQEPAPAQKTSGYNPDSPVQVLGAGPLSEQARARLTEEERKQISL
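
The record above is in FASTA format: a header line starting with ">" followed by the sequence wrapped at 60 residues per line. As a rough sketch, assuming the record has been saved as text, the Python below joins the wrapped lines and extracts a feature by its 1-based inclusive coordinates. The helpers `parse_fasta` and `slice_feature` are hypothetical names, and the embedded example is truncated to the first two sequence lines (120 residues) to keep it short; with the full record the sequence length would be 4190.

```python
def parse_fasta(text: str) -> dict:
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                records[header] = "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records[header] = "".join(chunks)
    return records


def slice_feature(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if start < 1 or end < start or end > len(sequence):
        raise ValueError(f"range {start}..{end} is outside the sequence")
    return sequence[start - 1:end]


# Truncated example: only the first two sequence lines of the record above.
EXAMPLE_FASTA = """>PS417_11315 hemagglutinin (Pseudomonas simiae WCS417)
MVRCKTPVNLKAPVTVLRLKPLAQAIALLMVAGNAQAATAFSSGWFADKGASQASTAART
GAGQVPGIPSLNQQAQAKQQLARSISTLNTSVAAIAAQQAAQAAGRQAAFGQVSNIPDGL
"""

records = parse_fasta(EXAMPLE_FASTA)
header, sequence = next(iter(records.items()))

# Predicted signal peptide: amino acids 1 to 38.
signal_peptide = slice_feature(sequence, 1, 38)
print(header)
print(len(sequence), len(signal_peptide))
```

The same `slice_feature` call with 140 and 405, or 3981 and 4091, would pull out the TPS or DUF3739 regions once the full sequence is loaded.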