Protein Info for MPMX19_04858 in Azospirillum sp. SherDot2

Annotation: hypothetical protein

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6222 PF00353: HemolysinCabind" amino acids 3095 to 3125 (31 residues), 35.6 bits, see alignment (E = 2.9e-12) amino acids 3234 to 3265 (32 residues), 32.7 bits, see alignment (E = 2.3e-11) amino acids 3266 to 3293 (28 residues), 24.2 bits, see alignment (E = 1.1e-08) amino acids 3323 to 3351 (29 residues), 19.9 bits, see alignment (E = 2.4e-07) amino acids 3334 to 3367 (34 residues), 28.8 bits, see alignment (E = 4e-10) amino acids 6114 to 6141 (28 residues), 8.4 bits, see alignment (E = 0.00092) PF17892: Cadherin_5" amino acids 3471 to 3572 (102 residues), 47.1 bits, see alignment (E = 7.1e-16) PF17963: Big_9" amino acids 3472 to 3570 (99 residues), 55.2 bits, see alignment (E = 4.3e-18) PF17803: Cadherin_4" amino acids 3826 to 3896 (71 residues), 27.6 bits, see alignment (E = 1.4e-09)

Best Hits

Predicted SEED Role

No annotation

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (6222 amino acids)

>MPMX19_04858 hypothetical protein (Azospirillum sp. SherDot2)
MAEEAVQGKVPTVDDRQILDDLTLLQQVDGTRLGGVIHEASSVQLQRPDDTLGYVQTDYR
GAEDLMVGGAVQTGAVVGESVAVAADQAANALLLNGDVLRTEGTRGSGPTGPEGSGPTDG
EGRGAAGKTAERAELLSTARTTSAYDPLEAAASDIPMPETPDQVIDNGVVPAAEGVTFLQ
AAAVVAPAVAAPSTPAPAPVSDTPSLTVNGVSGDEDTAIALNIAAALTDTGGTEVLSVTL
AGIPDGAVLRDASGAVLTVVNGSIVLTASQIPGLTLTPPANSGDSFSLTVTATSTDGTAA
PNSVTATLPVTVNPVSDTPTLGVSAVTGAEDTAIPLSISPALTDTDGSETLTVTIAGIPT
GAVLTNAAGEVLTVSGGSITLTPGQLAGLAITPPLNSDADFTLTVTATSRDGSAAPASTS
LPLVVTVNPVTDTPTLSVSAATGNEDSAIPLTITPALTDTDGSETLSITISGIPAGASLA
NAAGDTLSISNGAVTLTPGQLAGLRITPPANSDADFTLTVTTTAQDGTAAPVSVSAPLAV
TVNPVTDTPTLSVTAASGNEDTAIPLVISPALTDTDGSESLSITISGIPVGASLKNSAGD
TLTIGNGSITLSPNQLAGLTITPPANSDVDFTLTVTATARDGGADAVSVSAPLAVTVTPV
TDTPTLSVTAATGSEDTAIPLTISPALTDLDGSETLTITISGIPAGAVLTNTAGDPLTIS
GGSITLTPGQLAGLAITPPTNSDADFSLTVTATAKDGVANPVSVTQTLAVTVNPVTDTPT
LSVSAARGDEDTAIPLTINPALTDTDGSEMLTITVAGVPAGATLSAGTSVTESNGTTTWT
LTPQQLAGLKITPPANSDVDFDLTVTAIAKDGSAPVATTSQILHVTVDPVTDTPTLSVTA
ASGNEDTAIALSISPALTDLDGSEALTITISGIPAGASLSNTAGNTLTISNGSITLTPDQ
LAGLKITPPVNSDDDFTLTVTATAKDGVANPVSVTQTLAVTVNPVTDTPTLSVSAARGDE
DTAIPLTIDPALTDTDGSETLTITVAGVPAGATLSAGTSVTESNGTTTWTLTPQQLAGLK
ITPPANSDVDFDLTVTAIAKDGSAPVATTSQILHVTVDPVTDTPTLSVTAASGNEDTAIA
LSISPALTDLDGSEALTITISGIPAGASLSNTAGNTLTISNGSITLTPDQLAGLKITPPV
NSDDDFTLTVTATAKDGVATPASVSAPLVVTVNPVTDTPTLSVTAATGNEDSAIALSISP
ALTDTDGSETLTVTISGIPAGASLSNTAGTTLTISNGSITLTPDQLAGLKITPPHNSDDD
FTLTVTTTAKDGVAAPVSVSAPLLVTVNPVTDTPTLSVSAARGDEDTAIPLTINPALTDT
DGSETLTITVAGVPAGATLSAGTSVTESNGTTTWTLTPQQLAGLKITPPANSDVDFDLTV
TAIAKDGSAPVATTSQTLHVTVDPVTDTPTLSVTAASGSEDTAIPLTISPALTDLDGSET
LTITISGIPTGASLSNTAGNTLTISNGSITLTPDQLAGLKITPALNSDADFTLTVTATAK
DGVANPVSVSQTLPVTVSPVSDTPSLSVTAATGNEDTAIALSITPALTDTDGSEVLTVTI
SDIPAGATLVNTLNGTLTVTNGSITLTPDQLAGLKVTPPVNSDDDFTLTVTATSKDGSAA
PASTSAPLLVTVNPVSDTPTLSVSAARGDEDTAIPLTINPALTDTDGSETLTITVAGVPA
GATLSAGTSVTEQNGTTSWTLTPQQLAGLKITPPANSDVDFDLTVTAIAKDGSAPVATAS
QTLHVTVDPVTDTPTLSVNAASGNEDTAIPLAISPALTDLDGSETLTITISGIPTGAVLT
NTAGDPLTISGGSITLTAGQLAGLAITPPHNSDDDFTLTVTATAKDGVANPVSVTQTLAV
TVNPVTDTPTLSVSAARGDEDTAIPLTISPALTDTDGSETLTITVAGVPAGATLSAGTSV
TEQDGTTTWTLTPQQLAGLKITPPSNSDVDFNLTVTAIAKDGSAPVATASQTLHVTVDPV
TDAPTLSVSAASGNEDTAIALAITPALTDLDGSETLTITISGIPAGATLANTLNGSLTFT
NGSITLTPDQLAGLKITPPLNSDDDFTLTVTATAKDGVATPVSVSAPLVVTVNPVTDTPT
LSVTAATGNEDTAIPLTINPALTDTDGSEVLTITISGIPAGATLVNTLNGMLTVTNGSIT
LNPNQLAGLKITPPANSDADFDLTVTTIAKDGSAPVATVSQTLHVTVDPVTDTPTLSVTA
ASGNEDTAIALAITPALTDLDGSEALTITISGIPAGATLANTLNGTLTVSNGSITLNPDQ
LAGLKITPPLNSDGTFTLSVTATAKDGVADPVSVTQSLPVTVTPVTDTPTLSVSAARGDE
DTAIPLTISPALTDTDGSETLTIRISGVPAGATLSAGTSVTEQNGTTTWTLTPQQLAGLK
ITPPSNSDVDFDLTVTAIAQDRSAPVATATETLHVTVDPVTDTPTLTVTAASGNEDTAIA
LAISPALTDTDGSEALTITISGIPAGATLVNTLNGMLTVSNGSITLNPDQLAGLKITPPL
NSDDDFTLTVTATAKDGIAAPVSVSAPLVVTVNPVSDTPTLSVSAARGDEDTAIPLTISP
ALTDTDGSEALSIRISGVPAGATLSAGTSVTESNGTTTWTLTKDQLAGLKITPPHNSDVD
FDLTVTAIAKDGSAPVATTSQTLHVTVDPVTDTPTLSVTAATGNEDTAIALAISPALTDT
DGSETLTITISGIPTGAILANTLNGTLTVSNGSITLNPNQLAGLKITPPLNSDDDFTLTV
TATAKDGVADAVSVTQTLPVTVNPVSDAPTLSVTAVTGNEDTAIPLTITPALVDTDGSEV
LTIRISGIPAGATLVNTLNGTLTVTNGSITLNPDQLAGLKITPPLNSDDDFTLSVTAVSK
DRGAAEATTTLPLVVTVNPVTDTPTLSVTAASGNEDTAIALDIRSALTDLDGSESLSIKI
TGVPAGATLSAGSYVTESNGTTSWTLTQAQLASLKITPKADSDVDFTLTVTATATDRGLS
PVSVQTTLAVTVIAVADKPTVTVTNVSYDLAVGQNDTLTGTAGNDTLSGGAGNDTILGGA
GDDVIYGDGSGTFTVALSITPAVTDRDGSEYISKVTISGVPAGATLSAGTLNADGTWTLT
QAQLSGLKLTAKEGDTTHPITLTVVSTSTELENGSTADSDPRTLTVSFGNTPKGNDSLDG
GDGNDTIFGGAGNDTLIGGLGDDSLDGGDDNDLLAGGPGNDTIDGGAGNDTVTYAGSATA
VNANLATGVGTGEGTDVLRNLENVTGSAFDDVITGDGGANILLGGGGNDTITGGAGTDTI
HGGSGDDLSIFTVGTDGGTASAPELQDGDAGIDTFRVMLTSAQMSNPAILADLRTLRDRI
EAAAADPNAATKDTDVANALTLTALGIKVADFEKLEIYVDGQPVNVREVLNYTPTATAAA
ATTAAVEDNAVTSRLVATDQDTLNGRTDDSLTFKGPGDNGQPVRLAHGTLVVNADGTFTY
TPDADFSGTETFSFTVTDAYGGTSTATQTIKVAPVADPITLTTLNARGNEDAAGGIAIAP
TITLSDVDAASPEAVETVTLTMSAAQLDVAGAKLYLNGTELTRSGTGTYSWTIPTSALVA
GGSAGTWTVSGLKIVTTRNSDADLTYTLTVGTIDSTSAGTVRGSKSATGTITVDAVADAP
TVTAAAAATDEDTSVALKITPALTDTDGSESIGYVTISGVPDGASLNVGTLVSTQGDGST
SWKVNAADLASLRLTPPHDFSGTITLRVVATSVERENGSTADSAPATLTVTVTGVADAPI
VIPTAAAGNEDTAIPLSIDAHLSDVTGETLDHITITNLPEGATLSAGIHNADGSWTLTPA
QLTGLTFTPPPNYSGTLTLKITATSVEGTTTATSAQQSLTVTVTAVADTPVLSASDAVGR
EDTAIPLSVVAALTDRDGSESLSILVGGVPAGATLNHGHYDATLGKWVLTQADLSGLTIT
PPANSNTGFNLTFTARATEGSNGAFADAAAVSVHVEVQGVADVVTPNGTLTARGDEDTAI
NLKLGDLVMTDNDGSERLSLVVSNLPSGSWLSLAAGHESGLVYLGNGRWSVDAEYRNELT
LTTRKDFSGTISLKVDVITTDSNGAPIVTVNGVTSGGTLKDTRDLTVTVDPKADEPVVSI
GASGLEDAVGGIPLTISALTGDSDGSEHISSIVISGLPAGVTLTSSDAGALTANADGSWT
VDPAKLGNLRLIVPANNADDFTITVKVTAMDGTDTLTRTYTPTVTITAVADAPVSHAADG
SAVPYHVDDQSTTGVPDGFVLNLNAFLADTDGSEKLAVTIGGVPKGTVLSRGIDNGNGTW
TLTAEDYTAALAEGGLKLTVKDANATGITHTVAGSVDTASVTLTVNPYSIESEGDRNTSV
SDSFVLSWTTTASSGGGSTDPGTGGGNTGGGTSPPPPPPPPPPPPPPPPPVLDPSVGSDR
PHAVTTAEDTSVALGLKDGLANWQDVSTVTLSGIPTGASLTNTAGDTLTVTGGSITLTKA
QLDGLKLTPPSNSDADFTLSIDVVSGSSNAHTTLTQAVTVTAVADSPLLTLQTASGNEDT
AIALDITAALADTDGSETLASIIIEGVPAGASLSAGYLDSTTGRWVLTPGQLAGLKFTPP
HDASGTFQLTVTAVSREISNADRATTTKTLTVEVAPVSDAPAISVSAATALEDHAATLSI
DAAVTDLVGTGEVITALRVTVPAGFTLSGGTSLGNGVYDLTGLSASDRAALTLTPPANYA
GTVTLTVEAASQDGMAAPAWSSKSFSVSYTAVADAPNVTVHDTTGAEDTAIALNLSASLV
DTDGSETMAVILSGLPAGAVLNHGSNNGDGSWTLKPSELAGLTVTPPRNYSGGMDLTLTA
TSLESSNKDTASTTATIHIDVTAVADAPTVIVANASGSEDTAIALNLTAALVDTDGSETM
AIVLSGLPAGAVLIDANGTTLTPTAGSVALTAAQLSGLKLMPPANAGTDFNLTLTATTTE
TSTGATATTSQLFRVSVDPVADQPIVTWAPLSGTEDIAVPLNLSVALYDTDGSERLGLLT
IAGVPTGALLSAGTKNADGSWTLTPAQLTGLTLTPPADSDAAITLTVTAVSIESNGSTAT
RTVTVPITLAAVSDTPTLAVSAATGNEDTAIALTINPALTDTDGSETLTITISGIPTGAS
LSNSAGNVLTISNGSITLTPAQLAGLKITPPLNRDDDFTLTVTAIATDGSAAPASTSAPL
LVTVNPVTDTPTLAVSAATGNEDTPIALTITPALTDLDGSETLTITISGIPTGATLSNSA
GNTLTVSNGSITLTPDKLAGLAIKPPSNSDADFNLTVTATAKDGSAAAVSVSSTLAVTVN
PVSDTPTLTVSAATGNEDTAIALTINPALTDTDGSETLTITISGIPTGASLSNSAGNVLT
ISNGSITLTPAQLAGLKITPPLNRDDDFTLTVTAIATDGSAAPASTSAPLLVTVNPVSDT
PTLSVAPATGNEDTPIALTITPALTDTDGSETLTITISGIPTGATLSNSAGSTLTVSNGS
ITLTPDKLAGLAITPPSNSDADFNLTVTATAKDGSAAAVSVSSTLAVTVNPVSDAPTLTV
SAATGNEDAAIPLSITSALTDTSEVLSITIAGIPAGAKLTNSAGNTLTISNGSITLTPAQ
LAGLKITPPLNDDADFTLTVTATSTDGSAAPASTVAQLAVTVLPVSDTPTLLVNPATVAE
DGTVALSITPALTDLSETLSVTIANIPTGAHLSNTAGDTLTVSNGSITLTPAQLAGLSIT
PPHDSGSSFTLSVTATSTDGTATPASLSRSLAVTVTPVADTPTLTTPAVHGPEDTAIPLT
ITAASTDTDGSETVTVRIAGMPAGASLNHGTHNSDGSWTLSAADLTGLTYTPPADANGTV
TLSVTATAQDGASTAAVTQTLAVTVDPVNDKPTLAAAHTTTVIAADATDHPAVVSDATIT
DVDSSVMSGMSIRIAAGSQTGDTLNLDSLTIATDPSTGRKTVAGTGIEVSWDDANHQLSL
SGTASTATYTDILHHLALTPNGSGARTLEVTVSDDQGLSSDPLAVQVLVSNSATMTGSAG
ADILHAGSTTTSMSGGAGDDLFIFAPRGGDLTINGGAGWTDVVEVGAFVANAGTNWMQQL
DGHAYTLDPSGHGLTFTQETTVTLEDQNHHQLVLQNIERLTF