Protein Info for GFF786 in Variovorax sp. SCN45

Annotation: Filamentous haemagglutinin family outer membrane protein associated with VreARI signalling system

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 500 1000 1500 2000 2500 3000 3500 4158 signal peptide" amino acids 1 to 31 (31 residues), see Phobius details PF05860: TPS" amino acids 147 to 396 (250 residues), 87 bits, see alignment 2e-28 TIGR01901: filamentous hemagglutinin family N-terminal domain" amino acids 161 to 236 (76 residues), 68.7 bits, see alignment (E = 1.6e-23) PF12545: DUF3739" amino acids 3945 to 4055 (111 residues), 133.4 bits, see alignment (E = 4.9e-43)

Best Hits

Predicted SEED Role

"Filamentous haemagglutinin family outer membrane protein associated with VreARI signalling system"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (4158 amino acids)

>GFF786 Filamentous haemagglutinin family outer membrane protein associated with VreARI signalling system (Variovorax sp. SCN45)
MSKQTFRRAPVAHAVAIALAMAGLAGNAQAQRAFSAAWMAQKNVAQGTAAATGYLPNGTP
ASLLTNPLAQQQKANEQLQRSIGNLNLAAQAIAAQQAAQAAARQAAMNAGTDVPDGLADG
GLKVDTNSLTAGWHNALAPKADNAGGRSTVTVQQTSDKAILNWETFNVGRNTTVEFKQQA
DWAVLNRVNDPAARPSRIQGQVKGDGTVLIANRNGIVFSGTSQVDTRNLVAAAAHISDAQ
FGANGIYSNGTTPTFSNAQGKVEVQAGARIATRAPTISTDGGGYVLLVGQEVHNAGDIAT
PRGQAALAAGDSFLIKKGSGTGGNQASSTRGNEVTPQFNTDSLAGKVTNTGLIVAREGDV
TMAGRDVRQNGVAVATTTVNTRGTVHLNALGADGKVTLGPGAATAVVIEDDGKTTALNSQ
RDGLRGPALPEPNQDVSAVRDRRDQSRIEIASSGTVDFQSDSLTLATGGQIAVTAAARSL
VRDRARIDVSGAVGVNLAMESNNVKVNVQGNEQRDAPLNRDGKSLNNLDVWIDRRSLVLV
PKGTNGYASDRWYTAGGLLEVGGYLGTEGHTASEWMAQGGTVSFGGAEVVTQAGSNINLS
GGTLNVQTGFIHQTWLKGADGRLYEVSKAPGDQRYEGLYRGFEDEHKRWGDKNTGYFYNP
LIGPQKRLENGYTVGRDAGKLVIATGAAVLEGDITSEAFQGARQTQAPQADVDGYNQSHN
AASQRGQLVIGDYVPMFDRRTGALNHVLVAALDRVELKETGERIAAALDLGTALPAERKD
KLVLDTAQLNGMALGAVRIAAREKIEVNADLRVASGGNITLYGNDVAVQANLTAHGGSMA
LGNVLSQVGVSGRVDMLVNAAKPLGNVTLADGVTLDASGLWSNLELDPFAENVAYASGGS
VAIRASGDVALQRGSTVDVTSGAVMRMGGKTQGGKGGSVRLATTTRTGSLRLDPEARLHG
YGVAGGGTLSLEAAKVLISDQAGGTDAGTLVLGGNFFDKGFSAYEVIGGDGLSVAEGTQV
DVTMPVYRFRDEARGVSTASGSAQALELWAQPLYLEDAAKGVLLQRKGASLTLQSGRSLL
GEIDPLTGSLLVGRGARIEVDPGQTIALRGAGQITVDATLVAHGGSIDVRQQQFGLVDPA
QETPQADGKVHTRSIWIGDNAVLDVSGRARTAVDVHGRSYGDVDAGGSIVIGGVLDHDRA
IASSADAYVIVRPGARLDASGAQAALDVPGQGRTNIATDGGRISLSSYLGLYIDGSLRAA
AGGAGAAGGSLDIALESPLYRSTGLNRAQQETIVPRELTLVQSQAPLLAQDLAPGQADAA
LRYGKTRLSADRLSAGGFDNLALLANGQLSFDGDVSLRMGQSLSLFSSSLSLAESSSANA
RVQLAAPYVKLSGTGLYYSAGSGAVRPGIQHGMTPLDSEAAFSVQAGRMLDLGNGISFGT
SGNYAEHTEFTPVRVERRGFENVELASDGDIRFLAPNNADARSRLWTRGSLTLAAAQLYP
ETGAIAQVNAGYRMDPVVLRPVYDPGSVLRIERRGEAPAGVPYSAFGSLEFAAAVIEQGG
VIRAPLGNITLGTDSSQGTKRVALLPGSITSVSGAGLVMPYGGTTDGIAYQYDGRPAALK
GADGGAGIRLGAELVDVQAGALLDLSGGGELTGAGFVAGRGGSTDARFHPLMQIGADGRF
SLPSLSTNAVYAIVPGVQPLAAPAGGPAGASKAGIGRQITIGAGVPGLPAGTYTLMPSTY
ALLPGAFRVELNGAPGSKGAGAQPLAMRNGSWSSSGQLSVAGTGLRDSLSSQVILTPAAV
LRTYAQYNETSYADFVRTDAARLGVPRAVLPADAGSLAFVLSRGNDDVSLNFEGLARFQA
AQGGRGGTAMVLGVSIEVLGAGQRHTPGFDGASVRSDTLNALGAGRLVVGATPLVVYGQQ
GRYVTFSSPASDVVLRAGASLQAPEVFLAGRDLVVEAGAGISTLGRGDAIYDASSGFVYQ
PGTTSLLAVSNGRLDVLAPQAPDIGSTGSIRLGVCALPSCDRNSTFYSEGTITAATNGAF
ELDEAARYGTRHLALAVSGVNAGTTQSLADAQARGVLAGGLTLTQDLLQRLLNGDTSIGA
PALESLTLTVRDAINFYGSVTLDTIDPATGKSRVDSLVLSTPAIYGAGGAGDVATIRTQN
LIWSGATQLPGTVIDGGAGTGGGTLDIQAARIELGGAPHTVSNGQDDDARLALGFATVKL
GASERLSTNRKGSLSVYQSQGAYEEGKGYAYSGGNLEITTPLVTGDAGSVGRITAGGTLR
IGAPEGGRLDADAAKRIQALGADLAFNARSIVLDSAIVLPSGKLAMTAQDDLTLADRALI
DVSGRKLSFNDVDKFSWGGDVGLESQAGGIRQAAGATIDLSAQGNRAGTLKAVALGASAG
VVDLQGRILGASSGHYDAGGTMVPFKAGGVDIRAQRLGDSGTLSDQFAALNQRLSDGAVF
GSRSFQLKQGDLVIGAGLKAGEVNVSVDNGSLAVLGTIDASGERVGSIRLAGRNGLTIGG
GALLDAHGTVLRVDSRGKIIDAPNRAMVELDGGDGVLTLAGGARIDLRHGTGAAVGSQPG
QHDGQKRGTLELYAARTGETSGDIRIDASGSLAIDGARSIAVNAVWRYDDTDPLAVIKDG
LDSVSGRPYREITQAYLDHKHEQSTKFMDAALANGALLHGKLAGLNNATYADALHLRPGV
QIVTDKDLVVQGDLDLSGYRYDSLNPRTQKTGVYGSGEAGSLVLRAAGNMDIYGSINDGF
APPPLTPEENGWALVPGVQPYNGDLVLPIGGVRLAEGTTFPSGATLNYDLPMQALTLGEA
VRLPAAGALAQPYTVPAGTVLSGDVRDASGNLLHARGTLLKEAVVLPTGTQLGAGMALPK
GTALAAMRWPKGVPLPVAFENDVARRNVLLAGTLELQRGALIPSTTRVVLESGVGYIELR
PDSDGTRGRNWALAPMLAAGSESWSVRLVAGADLGAADTRIALAGAGSLRLADTHYRATV
TQGGGTLVWSEFNNWGMPSGQPVTPDEEWLCALLDPNDCVSKSGIAWSPGNGFGFPGGTP
VPEEYLPICTFTPENCIDYGSAMKTLAAHGQGMSVVRTGTGDLDLVASGDVDVRSLYGIY
TAGTQSAGVAPRYNQARGTAPGATSVLGTGGADYESLVAGDARAYQAWFPEAGGNLLVRA
GGDITSDMLTPLNSDVVQKSSVAVGNWLWRQGDGSAQLPTAWWINFGSYAVDPSGTLGAD
VKPSLVGFTGFGTLGGGNVQVQAGGNAGMLERKGTARSQGLVVAVGSTGRVADDGSIVLT
GGGDIDVRIGGGVNSSQLARGNANGSEKPNHDLQGALVNLRGATQLSAASVGGMILGGRD
MKDPRAFDPFTSNRAESTGGLVLMPGDSAMRIDTLGDLVVGGAGDPGRVRLMNSSPFASY
GGGGLSWFSLWTGRTAIDLFSAGGSLTPSTQMSEGTSTSIVTGTNVAPTDFRFVYPAILR
AVAAGGSIYSGSSANDGDGGTNDSTSRFSLLLAPSRSGQLELLAGDSMYMGNYAINQSGA
DPAQMPGPLKPAFVGFNGNQRVADNLGAGGLEVNARTDPFRLFAFVPETYAPGAADAPRT
PARFYALNGDIVGLRSGEILQRRDGATWYEAAAPAWVIAGRDIVSAGTGLGQPTSLSLGT
GTKNDADSSGNLSVHGHAQDVSRMVAGRDILYSSFNVAGPGTLEITAGRNIVMEDRAAVA
SLGPVTPGDKRPGASIAMTAGTGAQGADYAGFLARYLDADRRAESGTPLASQPGKVVKTY
EAELFAWLRDADNGLGFTGTASEAQAFFDALPAEQQRVFARQVYFSELKAGGREYNDADG
ARYGSYLRGRNAISALFPQADAEGKAISYRGDITMYGSAGVNTLFGGGIQMLTPGGMQLF
GVEGNAPVAKAGITPGVITQGKGDISLYSLGSILLGQSRIMTTFGGHILGWSAEGDINAG
RGSKTTVVYTPPKREYDQWGNVTLSSNVPSTGAGIATLAPIPEVPAGDIDLIAPLGTIDA
GEAGIRVSGNVNLAALQVVNAANIAVKGESVGIPTIAAVNVGALTNASAAASQAAVAAQD
VLQRERAATRQSLPSVFTVRVLGFGSEGDGAAAPAEPRAAATRVGYDVNSPVQVLGRGAL
PEAQKAKLTARERQLLGQ