Protein Info for GFF4184 in Variovorax sp. SCN45

Annotation: Filamentous haemagglutinin family outer membrane protein associated with VreARI signalling system

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

Feature map along the protein, amino acids 1 to 4422 (graphic not reproduced):

transmembrane: amino acids 20 to 41 (22 residues), see Phobius details

transmembrane: amino acids 3040 to 3056 (17 residues), see Phobius details

transmembrane: amino acids 3081 to 3097 (17 residues), see Phobius details

PF05860: TPS, amino acids 152 to 420 (269 residues), 93.6 bits, see alignment (E = 1.9e-30)

TIGR01901: filamentous hemagglutinin family N-terminal domain, amino acids 176 to 251 (76 residues), 70.9 bits, see alignment (E = 3.4e-24)

PF12545: DUF3739, amino acids 4202 to 4312 (111 residues), 142.3 bits, see alignment (E = 7.9e-46)
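
The feature coordinates above are 1-based and inclusive, so a span from 20 to 41 covers 41 - 20 + 1 = 22 residues. The following is a minimal sketch of that arithmetic, not part of the original page; the `Feature` class, its `extract` helper, and the short labels are hypothetical names introduced for illustration, and only the coordinates are taken from the listing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    """A hypothetical container for one annotated region of the protein."""
    name: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count as residues.
        return self.end - self.start + 1

    def extract(self, sequence: str) -> str:
        # Convert 1-based inclusive coordinates to a 0-based Python slice.
        return sequence[self.start - 1:self.end]


# Coordinates copied from the feature listing; labels are shortened.
FEATURES = [
    Feature("transmembrane", 20, 41),
    Feature("transmembrane", 3040, 3056),
    Feature("transmembrane", 3081, 3097),
    Feature("PF05860 (TPS)", 152, 420),
    Feature("TIGR01901", 176, 251),
    Feature("PF12545 (DUF3739)", 4202, 4312),
]

for feature in FEATURES:
    print(f"{feature.name}: {feature.start}-{feature.end} "
          f"({feature.length} residues)")
```

Calling `extract` with the full protein sequence as a single string (no line breaks) would return the residues of that region, for example the predicted N-terminal transmembrane segment for the first entry.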

Best Hits

Predicted SEED Role

"Filamentous haemagglutinin family outer membrane protein associated with VreARI signalling system"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (4422 amino acids)

>GFF4184 Filamentous haemagglutinin family outer membrane protein associated with VreARI signalling system (Variovorax sp. SCN45)
MSTRGRSVRDRQTTRTEDRVFRLAPAARAIALTLAAGGVIGHAQAQRAFSPGWFADKGAA
QNMAVQTGRLPNGMPASSLVGPQAQQQQASAQLRRSLENLNLVAQSIAAQQASQAAARLA
AQGDPSVPDGLAEGGLKIDTDSLTAGWHNANAPVQSRQADGRTNVAIQQTNDKAILNWET
FNVGRNTTVEFKQQADWAVLNRVNDPSARPSRIQGQIKADGTVLVANRNGILFTGTSQVD
TRNLVAAAAAITDDQFRNRGIYAGDTTPSFSDALGKVEVSAGAQIATRAPVSATQGGGHV
FLLGSEVRNAGRIDTPSGQTVLAAGDSFMIRKGVGTDANPGSTTAGSEVAASRRAGSASS
TVVNTGLIMAPTGDITLTGHSVTQAGVVIATTSTARRGTIHLSTRASDEGGTVALAQGGT
TAIVLDAGGGTALDSQRDAALQRMGNAPGNNAAGAFDNLGIIADRRDLSRIEIVSGSTVE
FQDASTTLATGGEIAVSAPRRTLVASGARLDVSGAVGVKVAMEGNNVLVNTQGNEQRDAP
VNRDTGNLNNLNLWVDRRRLVLVPAGTGGYATDRWYTQGGLLEVSGYLATSGHTVGEWMA
QGGTITVAGGDLVTRSGSSINLSGGTLDVATGIIRQSWLKGSDGQLHEVSRAPADLMYTG
LYRGFEDAHARWGKNTTGYYYNPLIGPQSRLENGYTVGRDAGKLVVATTSAVLEGSIAGD
TYQGPQQTAAARTELDGYNQPQNSAPRNGQLVVGQYLPIYDAEARLLRHGLTPLIDRVAL
GELTQGAADGLQLSDPMAAERQGTLVLDARGLSASNLAAVRIAARKDIAVRDVLQVADGG
EIVLLAPAVDLSANLVAHSGAIRVGNVLRQPQAGTGNPVADVRIDVPQGTEGGVRVHEGV
VLDASGRIGDLRLQGSDPGSLAHIDGGIVSLRSTESVMLAGGSRIDVSSGAMVRADGSIV
AGKGGSVQLGSGLASAGGGNPAAILTLDGNIRGHGVNGGGTLDIESASGIVIGGKAVARD
GVLAAGETSLVDLILKEDYQVKAGDVLPVDYTADSFHAAAGTRLPGSSLSPSVWYTLADD
WVLPIPSAVYANSVVADDGRSWTAYSSTQPGSIVVPKGTTVRILDRGRDYFAGYVVEKNV
FPDGMELRTPIKVTTPAGQRAPADFTLAAGTSLEAGRLLTRSVRVGGTTSLSTDLFAAGF
SDYRIRGQLGVAVPGGVSLDVTMPVLRADATASAFEPWLPALYQENAAKATASQRRGASL
SLAGGIADAQAKPGDGGSLVIGEGAQLRVDPGQSISLSSRGSMTVQGTLEAKGGAISLLG
PQTLADANKPNETSVLGDGHANARAIVIGEHAVLDVSAARFVATDVLARPYGMLTDGGSI
VVGGTLREPLGQVSASDAFVVVREGALLDASGSSASLYLPGVGNRTLASDGGSIQLASAS
GVYLDGTARARAGGAGVSGGRLVMALEAPSYAAGADDAVLAVRDLLLTQHRVVAPAARTA
PRYGYAALAADQVRDGGFSSLGLYSNGMLSFGDHVDLSLERELRVYAGGIGIGQGVDPAI
SAKLAAPYVLLSGLTPATVPEGSVRPSFRGGVTTLASNAVLTMQGRQVDIKGDVSLGLNA
RAVGTSIYAPEQRAGFGQLNLISEGDLRFLAGPVTAGTASATTLNSTGSILLRAGQVYPA
TGVIATIRAGRMPDADTYDPEQRLVVERFDPRAAAPALPHSVFGRLGLFSAHVEQGGIVR
APLGEIRLGDAGPAIVQTQSLHLTPGSITSTSAAGLVMPYGGTSDGVNYLAADQLVTPVT
AGLIGGIVMAGKNTDVAQGAVLDLSGGGDLRGAGFVSGRGGSSDARTTPLMQVGKDGFTL
PGLGTNPVYAIVPGYASGYAPAGAGGALAPRTGQQITIGANDVPGLAAGTYTLLPSSYAL
LPGAFRVELNGSAATQQSDRAIAMRNGSFVVSGTLGVADTGMSDRMPRQVILTSADVLRH
YSQYNETSYADFVKQRALLNGTVRGMLPADAKTLTLQAATSAGSSFRFNGSADFAPAGDG
YGGTAEIWAIGSNGKNLMEILADGAQPTAGFQGASLHARDLNSLGAARLVIGGNMAAGRD
TQSSLIDIESRAGSVTLRQGAMLRAPEVFLAAASNVDGITVELGAGINTIGTGRVAFDST
AGYVYRPGAIAVVAVSNGLVDMLAPAAVAGGGIGGGTIRVGTCTGPCDGAPARLYGEGTL
AVATRGDFQMGDDARYGARNLVLAMGAVNAGSSEALSAARTRGALTPGLTLNQQVLDRLL
NGDTSSGAPALERMALTAAESVNFFGGVSLNTIDPATGHSSLASLVLSTPALYGYGNAQD
VASIVTGKLTWVGIGDPLVAPVAFGRGTGAGQLRIKADVIEFGHGPRAQPDNIGDPGRTI
LGFADVRMDAVDRITANHQGSLAVYQSQVAGADGPAYQGGNLTLSAPLVTGQGGSINRIT
AGGALQVLAPAGTGPTAAPKDLQTGAELDLSGASVRIDTTVALPSGRLVVKADGDVVLGD
NSRLDLAGRELAMLDISRYSWGGDVLLQSTRGNIVQAAGSAIDVSARNNDAGSLSVVALD
AGAGRIDLGGRLLGSATGHHDAGGTLLPYRAGGLLLRAQDLGDGGLSEEFAALNRRLNEG
GFFGERNFQFKRGDLNIGDELRARSLAVSLDNGRLTVSGRVDASGEQVGSIRLSAANGLT
IAGGATLDAHGTVLRVDSRGQIIDSPNRAAVVLDSGTGTLTIGDGARIDLRHGTASGGGD
GAARGTLELHVARIGHTGSVLDADAATHGDMAIDARGRIDVQGARSIAVYGRQVYTDAPL
GSDPAASGRPYQVVDQAYLDAKHAESAAFVDNALANGQLLGVKLAGLNNAAYREAFHLRP
EVQITSATPDGDIVVRGDLDLSGQRYASLNPNTPRTGIYGSGEVGLLTLRAGGDLSIHGS
INDGFAPPPETPDDHGWLLIPGVQAYGVDLVVPVGGVTLAEGTVYPKGKALNYDISAKDV
TLPAGTLLPAAVTLDRGLTLPAGTVLGADVRAADGTVLLAAGTLVGAGGLSLPAGARLLA
GVRLPVELPVARLIWPKGAALPVAMTQAGSLVLPVGALIASGTDVKLSDGKTYVDLRPVD
AAGRQGRNWAVAQMLPEGSQSWSMRLVAGADLDAADVRLHNASGTGRLSLADTHFGAQVL
PGQVILGLNSGGVDAIVEAAGGLPAGISSKRQMVGMLEAEVLRLYQASWPDFGLPMDFWT
PEAGNLLMGLTRQGVDAVVAMAGGLPAGVSRPEQMIGMTEADLTRLYQADWGDFGMPSNF
WALSQGNGSVINSPASARVRSQGFSVVRTGTGDLSLAAGGDFTMQSAYGVYTAGTATSLG
SAALDARYNQPRGAINGGTVLGSVAGAQASDYERLVSGPGSLYKAWYPDRGGHLTVDVGG
NLTGDSWSASGEIQRGSSNIGNWLWRQGTGDTAGVNGVPTAWWTNFGTYAYGEAIDGNGQ
ALYWPAVVGFTGLGTLGGGDARISVGGDAGIVDMRTTGALVRNSASSTTARSQGLVLAVG
SSGRVLDDGSIRLTGGGDLDLRIGGGWNGHAESRLAAVGATVAQSHELFGAIANLRGAID
VAAGRIGTMELFYASGQDAKDTRASNPYLASISRATGGLLLMPGDAGTSLSSRADMVLGG
TGNPGLVPLPNGTPFASASGGGQNGVSWFSLWTGRTALQLMTAGGQLAFDTRASEAIVDR
NFASWDYASNGGWYLLPGSVRATAATGSIFYGSSLANLILPSPTSSSAAGLLLAPQGERR
IELVAAQSLYGGGYPISVSGADAWGMATPERPGFAGFSAKGEIVLGNAGAKAPQVDVGRL
PLLVFGANTLSGNLAETGAALAPSRFYAASGDIVGLRTGTTAVHQGGLRMGETDYVAAGP
VAVKAGRDIVGFGSLLGDPAVTTPEFSSDGGSSALLSGNLIVNTHGNDVSVIQAGRDILH
ANVDVAGPGVLEVSAGRNIIQNDRAMLRSIGPAVAADARAGAGIVVQAGMGSRLPPDYGD
FLRRYLDPANQAAAGAPLAEQPGKVAKTYGTELVVWLKARFGFDGSPGEAEKFFFALAPE
QQRIFARQILFAELRAGGREYNAVDGPRIGSYLRGRNAIATMFPAGEGGEPGDLLMYGGS
GIHTDIGGDIQVLTPRGAQTYGIEGSAPPSTAGLITRGKGDVQLFSLGSILLGQSRVMTT
FGGDILAWSAQGDINAGRGSKTTVVYTPPRQVYDNVGNMKISPDVPSTGAGIATLNPLPE
VPPGDVDLIAPLGTIDAGEAGIRVSGNVNVAALHVVNADNIQVQGKSTGLPVTAVVNVGA
LANASAAASSAAAAAQDVLQRDRAAARQNLPSVFTVRVLGFGNESEETDRSPASLRSGLQ
PPRGVPYDPANPVQILGVGQDFDAKQLARLTAEQRSQLRQSR
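
The record above is in FASTA format: a header line starting with `>` followed by the sequence wrapped over many lines. As a minimal sketch of how such a record could be read back into a single string, assuming plain text input and using only the Python standard library, the function below joins the wrapped lines per header. The name `parse_fasta` and the toy record are illustrative assumptions, not something provided by the page.

```python
def parse_fasta(text: str) -> dict[str, str]:
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records: dict[str, str] = {}
    header = None
    chunks: list[str] = []
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records[header] = "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records[header] = "".join(chunks)
    return records


# Toy input for illustration only; this is not the real protein.
example = ">toy_record example protein\nMSTRG\nRSVRD\n"
records = parse_fasta(example)
for name, sequence in records.items():
    print(name, len(sequence))
```

Applied to the full record, the length of the joined sequence can be compared with the 4422 amino acids stated in the section heading as a quick integrity check.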