Protein Info for GFF1903 in Hydrogenophaga sp. GW460-11-11-14-LB1

Annotation: hypothetical protein

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 200 400 600 800 1000 1200 1400 1600 1800 2000 2200 2400 2600 2781 signal peptide" amino acids 1 to 21 (21 residues), see Phobius details PF25564: DUF7933" amino acids 24 to 147 (124 residues), 32 bits, see alignment (E = 5.2e-11) amino acids 161 to 275 (115 residues), 35.4 bits, see alignment (E = 4.5e-12) amino acids 292 to 419 (128 residues), 40.2 bits, see alignment 1.6e-13 amino acids 568 to 690 (123 residues), 63.6 bits, see alignment (E = 8.6e-21) amino acids 720 to 844 (125 residues), 55.9 bits, see alignment 2.1e-18 amino acids 852 to 973 (122 residues), 87.8 bits, see alignment (E = 2.8e-28) amino acids 981 to 1093 (113 residues), 81 bits, see alignment (E = 3.6e-26) amino acids 1108 to 1231 (124 residues), 94.9 bits, see alignment (E = 1.8e-30) amino acids 1237 to 1361 (125 residues), 99.6 bits, see alignment 5.9e-32 TIGR01451: conserved repeat domain" amino acids 434 to 478 (45 residues), 26.5 bits, see alignment (E = 4.1e-10) amino acids 988 to 1025 (38 residues), 18.1 bits, see alignment (E = 1.7e-07) amino acids 2507 to 2557 (51 residues), 31.8 bits, see alignment (E = 9.5e-12) PF17210: SdrD_B" amino acids 1366 to 1436 (71 residues), 37.6 bits, see alignment (E = 1.1e-12) amino acids 1523 to 1599 (77 residues), 40 bits, see alignment (E = 2.1e-13) amino acids 1677 to 1763 (87 residues), 30.2 bits, see alignment (E = 2.3e-10) amino acids 1815 to 1891 (77 residues), 41.1 bits, see alignment (E = 9.3e-14) amino acids 1935 to 2010 (76 residues), 43.9 bits, see alignment (E = 1.3e-14) PF13620: CarboxypepD_reg" amino acids 1823 to 1875 (53 residues), 32.9 bits, see alignment (E = 3.4e-11) PF01345: DUF11" amino acids 2504 to 2621 (118 residues), 33.1 bits, see alignment (E = 3.3e-11)

Best Hits

Predicted SEED Role

No annotation

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (2781 amino acids)

>GFF1903 hypothetical protein (Hydrogenophaga sp. GW460-11-11-14-LB1)
LRALGLWAAATVFSAGAAHATLVVSQAVSTSTIAVGDRTRLTITINNSDVQEDNASLINT
LPAQLRFYPVGHPGRLAPTSTCAGATITLGTGPGGVDQLRVTGFTIPPHVGGVDGSCTIS
LDLTSYDGGDNWDNVINAGDLSPAAASAATQTVSVTELKNPTVAKDLGGATLTQGLSRTA
TITLSNPNSAYNIPLTSFTDNLPSNLQVTGVVANTCGSTTSFTASSVTLTGGAIPAGSPG
TCRVQFTVIGVLPPATASQAGTNSLPAGAIGNTRGLTSPADTTHITVNSPITVAKGFSVT
PLTAGADSLMTVRIGNNSAAALTGVGLSDTTVGARWPAALNNSGAIGAAQLSGCGAGASL
VAGAGSQGFVLSGAEIAAGGECRIQFNVTSTTTGTHTNALPGGAVSNAQGFTSPASSAPI
EVRDNALTVAKAFSPTTVAPGDVALFSVSVTSFSLTPQTSVNFTDTLPAGMVYVDATTGG
ITPVISGGCSFAGGVPMPTPLATAPGFSFNFPGASPGGTTCVVSFTARVPANAAPGTTLT
NASFGAGNGTVSGSSGAVSLTAVNPLVVTKTFDGVTARQRFQGAPSVAQIELTNNNFSAL
ANLSFTDTLPAGLRVADPANATTTCGGSVSANPGDASFSLSGGSLPARAGTDPFSPGQCS
VRVNVVGGTIGVHTNSIPAYNGSNAAATVSATGTVANVPGTTVRNLNATSATLEYLPALT
VAKTFLTNPVQVGGTSRVRITLGNTGNGQLTGVRATDPLAGTGLTLATPAKASTTCAGPV
VLNAPDGGSTASISGATIAGGTSCDFLFDVLSTSATPSVNTLAPGAVSADGGVASTGTTT
ATLNKVTSRVNLTKSFAPTAIAGPGSVSRLTIVIDNTTATALTELAVADNMPAGMVVGVV
PNASTTCPGGSVEAVTGTRLVRLSGGTLAGNASCTVSVDITSSVVGTLNNTLPAGAITNT
QGVTNASPFTANLGALAGMGVEKHFEPSAVAAGQPARLVIQVRNSLPQALTNISTTDNLP
AGMVVASPANASTTCAGAVITAVGDKVSFRGAQLPAGPSVCEVRVNVVVPLAGAYVNTMP
GGTVIANDGAVTNPQPGPSATLHVLQPPTIAKAFAQATVAPGVANRLTVTISNPNAAQTL
TGVALRDPLPAGLFVAQVPNASTTCAGGTVSAIPSADSAQIAGATVPAGGSCSFAFDTVS
NVPGLYTNTIPAGALTSHEGVSNPDPASAPVRVLEPPTVVKAFVPPSISPNGISRLTITL
GNANPAAQTLTQALDDNLPAGVLVAPTPNIGGTCSTGSVTAAAGTALVRYASGASIPSGG
CTISVDVTAAAAGTYPNRILAGALQTDGGSNPTPAEAPLVVSPLGSISGRIYLDANNNGA
FNAGEAPLAGQAVSLLRASDGQVLQKTTTDAGGFYVFSGLADSATLGSTYTVRYLRSGSD
TVGSSGFAPSSALNASSRSFQAQVNGATTAEASIVGGTGQVATPSVDSTNAFTLRSGIRL
DAAGGNVASSVGNHFGELRASEIAGQVYRDDNHNGVVNGTEPGLAGVQITLVGVDDRGTP
VSLTATTDASGHYRFPDLRPSCSPLAAPACPGYVITQGAQPAGTVNGIPTAGTVNDLGSG
AATGTAGTASNTAPAISHGAPAYSIPAGTSRIAGIVLPPNASSANNNFGEITSDRSISGR
IFTDNNGDGAFNGSDAGVGSGAGGANNVPQTLTLTGNDLNGNPVSITTTTDASGQYSFTG
VPPGTSYTVTCTTCAPPAGFINSTTPLAWPGSTGGTAGGSQAVPAITGIDLSGVKTTSVD
NHFTKTLPGSLISGVVYFDPDNSGGVFTPDDQPVPGRTLELRDQVSNALIATTTTDASGA
YSFAGLPAGSYRVVMPALPSGTTHGLTSAGTLAGAPNGTPTAPGVAPAAIASIVVAANQS
TANHNFPLVSDIRIAGRVYEDRDFDGQFGGADVGLPGSTIGLVGTDAFGNAVTRYTTTDA
GGQYSFTGLAPGNYTVRQTQPLGYTSVANTPGPVTGGSAGAVGALGGAVETMAFSFNATP
GAALQVNFGETQGEGVRGYKSVRIVSPAGAAGVSPGAQVVWQVIYQNDSAFPVTMDVRDT
LPDYMTRTGAPALSHGGGNGGVFTPNAGFTGTAGGDLLGSANLPAGAWVSVDIPVLVTAG
ATTTRFNQATAGPTGIRTDAVDSTTPTGGPGQPPSGVIPPGSVPQTPYQTPGLDATGVPL
SSLPAALSGFVWRDNNGDRRRDANDTPIADWAVEVVGSDGQAVACRTAPSNSANGCVTMP
DGRSLFRTGADGAYGVIGLVPGDYKIQFRDPANNVIYGTPQNSGGDANSRVASTRDHLAV
SLQPGANLVQQDLPLDPAGVIYDANTRLPVDGSLVRFCGPAGFNPATMLVGGASYTAVGN
CAQMVTGPNGFYQFVLVPGAPSGEYRLEASAPGYTPAPIASIPPAPGVLTPSGATPYLVQ
AQAGPPTGAQPTTYHLRFNLAPGMPDVIHNHIPLEPFAGGGLFLQKQAMRDVVEIGDSVQ
YTLRVLSPNGAATNVTIADTLPAGLRLIPGTVQQDGVTVADPAGSPGPVLSFALGGFAAN
TPVTVTYRVRVGVGAQQGDGINRAQAFGTVGGVTVSSNRAEARIRITGGVFFQEACVMGK
VFVDCNRNRIQDAEEVGIPGVRLYLQDGTHLVSDSEGKYSICGLPARTSVLRADPTTLPD
GSQLVTSSNRNALDAGSLFLDLKFGELHRADFIEGSCSPAVLEQVENRRRLGQIVAPLVD
PPTPTPRVFRSGPPARVGGAK