Protein Info for SO4149 in Shewanella oneidensis MR-1

Annotation: RTX toxin, putative (NCBI ptt file)

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 500 1000 1500 2000 2500 3000 3500 4000 4500 5020 TIGR01965: VCBS repeat" amino acids 240 to 319 (80 residues), 47.5 bits, see alignment (E = 1.6e-16) amino acids 331 to 434 (104 residues), 92 bits, see alignment (E = 2.1e-30) amino acids 443 to 539 (97 residues), 116.1 bits, see alignment (E = 6.6e-38) amino acids 549 to 645 (97 residues), 113.1 bits, see alignment (E = 5.5e-37) amino acids 655 to 751 (97 residues), 111.2 bits, see alignment (E = 2.3e-36) amino acids 761 to 859 (99 residues), 116.7 bits, see alignment (E = 4.4e-38) amino acids 863 to 960 (98 residues), 63.6 bits, see alignment (E = 1.6e-21) amino acids 964 to 1062 (99 residues), 117.6 bits, see alignment (E = 2.2e-38) amino acids 1072 to 1169 (98 residues), 118 bits, see alignment (E = 1.7e-38) amino acids 1173 to 1270 (98 residues), 55.5 bits, see alignment (E = 5.3e-19) amino acids 1280 to 1377 (98 residues), 107.7 bits, see alignment (E = 2.7e-35) amino acids 1476 to 1573 (98 residues), 60.1 bits, see alignment (E = 1.9e-20) amino acids 1584 to 1680 (97 residues), 106.8 bits, see alignment (E = 5.2e-35) amino acids 1684 to 1779 (96 residues), 58.1 bits, see alignment (E = 8.3e-20) amino acids 1791 to 1889 (99 residues), 119.8 bits, see alignment (E = 4.8e-39) amino acids 1893 to 1989 (97 residues), 55.1 bits, see alignment (E = 6.8e-19) amino acids 2001 to 2097 (97 residues), 105.1 bits, see alignment (E = 1.7e-34) amino acids 2101 to 2198 (98 residues), 61.5 bits, see alignment (E = 6.8e-21) amino acids 2202 to 2299 (98 residues), 119.3 bits, see alignment (E = 6.9e-39) amino acids 2313 to 2397 (85 residues), 81.8 bits, see alignment (E = 3.3e-27) amino acids 2400 to 2497 (98 residues), 53.5 bits, see alignment (E = 2.2e-18) amino acids 2501 to 2597 (97 residues), 121.5 bits, see alignment (E = 1.4e-39) amino acids 2601 to 2697 (97 residues), 62.2 bits, see alignment (E = 4.2e-21) amino acids 2707 to 2798 (92 residues), 109.8 bits, see alignment (E = 6.2e-36) amino acids 2965 to 3061 (97 residues), 41.1 bits, see alignment (E = 1.6e-14) amino acids 3066 to 3164 (99 residues), 128 bits, see alignment (E = 1.3e-41) amino acids 3173 to 3254 (82 residues), 86.6 bits, see alignment (E = 1.1e-28) amino acids 3421 to 3517 (97 residues), 118.5 bits, see alignment (E = 1.2e-38) amino acids 3521 to 3611 (91 residues), 48.8 bits, see alignment (E = 6.4e-17) amino acids 4006 to 4096 (91 residues), 100.6 bits, see alignment (E = 4.6e-33) amino acids 4271 to 4369 (99 residues), 115.3 bits, see alignment (E = 1.2e-37) amino acids 4379 to 4475 (97 residues), 111.4 bits, see alignment (E = 1.9e-36) amino acids 4480 to 4571 (92 residues), 38 bits, see alignment (E = 1.5e-13) amino acids 4580 to 4660 (81 residues), 69.3 bits, see alignment (E = 2.7e-23) PF17803: Cadherin_4" amino acids 406 to 480 (75 residues), 26 bits, see alignment (E = 4.3e-09) amino acids 833 to 898 (66 residues), 38.7 bits, see alignment (E = 4.6e-13) amino acids 933 to 1002 (70 residues), 33.2 bits, see alignment (E = 2.4e-11) amino acids 1035 to 1111 (77 residues), 32.2 bits, see alignment (E = 4.9e-11) amino acids 1143 to 1208 (66 residues), 42.1 bits, see alignment (E = 4e-14) amino acids 1446 to 1511 (66 residues), 27.9 bits, see alignment (E = 1.1e-09) amino acids 1546 to 1622 (77 residues), 38.5 bits, see alignment (E = 5.6e-13) amino acids 1654 to 1719 (66 residues), 37.2 bits, see alignment (E = 1.4e-12) amino acids 1754 to 1831 (78 residues), 37.7 bits, see alignment (E = 9.6e-13) amino acids 1864 to 1928 (65 residues), 34.2 bits, see alignment (E = 1.2e-11) amino acids 1963 to 2039 (77 residues), 34.3 bits, see alignment (E = 1.1e-11) amino acids 2071 to 2136 (66 residues), 38.9 bits, see alignment (E = 4.2e-13) amino acids 2171 to 2240 (70 residues), 47 bits, see alignment (E = 1.2e-15) amino acids 2370 to 2435 (66 residues), 36 bits, see alignment (E = 3.2e-12) amino acids 2469 to 2539 (71 residues), 36.4 bits, see alignment (E = 2.5e-12) amino acids 2571 to 2636 (66 residues), 38.6 bits, see alignment (E = 5e-13) amino acids 2671 to 2747 (77 residues), 28.6 bits, see alignment (E = 6.9e-10) amino acids 3034 to 3104 (71 residues), 28.8 bits, see alignment (E = 5.7e-10) amino acids 3136 to 3211 (76 residues), 31.1 bits, see alignment (E = 1.2e-10) amino acids 3493 to 3556 (64 residues), 32.2 bits, see alignment (E = 5.1e-11) amino acids 3591 to 3671 (81 residues), 29.5 bits, see alignment (E = 3.6e-10) amino acids 4342 to 4417 (76 residues), 30.8 bits, see alignment (E = 1.4e-10) amino acids 4451 to 4514 (64 residues), 42.1 bits, see alignment (E = 4.2e-14) PF17963: Big_9" amino acids 843 to 936 (94 residues), 32.2 bits, see alignment (E = 6.9e-11) amino acids 1153 to 1246 (94 residues), 35.2 bits, see alignment (E = 7.9e-12) amino acids 1359 to 1449 (91 residues), 28.1 bits, see alignment (E = 1.3e-09) amino acids 1455 to 1550 (96 residues), 33.6 bits, see alignment (E = 2.5e-11) amino acids 1664 to 1758 (95 residues), 33.9 bits, see alignment (E = 1.9e-11) amino acids 1874 to 1967 (94 residues), 30.5 bits, see alignment (E = 2.2e-10) amino acids 2081 to 2174 (94 residues), 38.8 bits, see alignment (E = 5.7e-13) amino acids 2180 to 2278 (99 residues), 28.2 bits, see alignment (E = 1.2e-09) amino acids 2380 to 2474 (95 residues), 33.1 bits, see alignment (E = 3.6e-11) amino acids 2581 to 2675 (95 residues), 37.7 bits, see alignment (E = 1.3e-12) amino acids 2950 to 3038 (89 residues), 23.8 bits, see alignment (E = 2.8e-08) amino acids 3501 to 3595 (95 residues), 29.3 bits, see alignment (E = 5.5e-10) amino acids 3615 to 3700 (86 residues), 33.2 bits, see alignment (E = 3.3e-11) amino acids 4459 to 4552 (94 residues), 35.2 bits, see alignment (E = 7.9e-12) PF05345: He_PIG" amino acids 1361 to 1452 (92 residues), 82.8 bits, see alignment (E = 7.9e-27) PF17892: Cadherin_5" amino acids 3612 to 3704 (93 residues), 51.2 bits, see alignment (E = 3.7e-17)

Best Hits

Predicted SEED Role

"T1SS secreted agglutinin RTX"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

See Q8E9W3 at UniProt or InterPro

Protein Sequence (5020 amino acids)

>SO4149 RTX toxin, putative (NCBI ptt file) (Shewanella oneidensis MR-1)
MSPKKPLTLSTAKRKSRISAKLAKKSEIAAKKLDKDVAKLKVKEFREGEVQSTATPDIVA
SKSWRLPSHAATQLVNTEESTEQKPDSQSATVDIDNRHLTAQSASSSTDSEKTLSELPQQ
ADETQTKNAPKFQDYGSSESTHLGQGPHFDDTQLPLVTLSQLPEIGHVKQQIEPAALTVS
MARTTLTQIVRPQIISTEQHNPLTIDEQAGHQSVKEDAPVASASGKLHAHGGTVTPLQWT
VAVNQGQFGRLDIDPTTGEWHYQLDNNALSTDALAEGEHQTEQFTITVSSPNGEQVNTVI
IIDVEGSNDLPQILGSHLASINAADSLPAVTGQLHSIDPDHNDAVSWAVLDGQGQYGQLS
INPLTGQWQYQLNTHATATLALVSGQQVTETFTVTATDQSGHPVSQQVSIQVNGADNVAI
IQGEIIGLVTEDTQANNGQLTVSGQLSIQDPDRDQTHFSAGSLQGQYGYLTIDVNGHWIY
SADNSLATVQSLKSGEHLTDTFTIHSADGTAQNITVTINGTDDKAEISGTTSASLTEDDN
IHQGMLRANGNLAITDNDHEPHLFTATEQYGQFGTLIIDELGHWTYTADNSQTAIQALKT
GESITDTLVVQTQDGTQQTISISIHGTDDHSVIMGTSVMRVFEDKRLSQGQLHTDGQLWV
SNPDTGRAGFAAEQLQGQFGILSINTHGHWTYTADNNNPSIQALKHGESLTETLFVQALD
GTTQKIAILIKGADDKAIIGGTDTATLVEDQSPQHGQLQAHGTLTITDPDAGQAQFTALT
DVAGSNGYGHFSLDSSGVWTYNADNTQTAVQQLKTGESLTDTLTVTSADRTSHTVTVTIQ
GCNDAPVLVGQTQSVTEDGTQLSGKMIATDVDAGDTQTFSLVNAIDGFTLNADGSYNFDP
SHASYQHLTAGQTQDVVIAITVTDSTGATSYQNLTISVTGANDGAVIGGVNIGAVSEDNT
LTASGKLTISDVDSTQAGFIAQNNLAATHGCFTLSASGDWTYALDNSNAAVQALKTGESL
TDTVTINSIDGTQHTIAVTINGSNDGAIIAGVDTATATEDANVIAGNIEACGQLTISDPD
AGQSHFTAQTDVAGSSGYGQFSLNSTGAWTYSADNTQAAIQQLKAGDSLTDTLTITSADG
TIHTLTVTIQGTNDAPVLAAQSQAVTEDGTQLSGHMVATDADAGDNLSFSIAQPVDGFTL
NADGSYSFDPSHASYQHLAAGQTQTITIPVTVTDVAGATSTRDLVIKLNGSNDGAVITGT
DVAAVTEDQTAGSGHFLQVTGTLTVADVDAGEAVFQATSTKSQLGDFILHSDGSWMYQAD
NTNTAVQALKAGQQTTDSFTVLSTDGTAHTITITITGSNDAPTVAHALTTTTATEDSAFS
FSIPTGTFADIDAGDTLTLSAGSLPAWLHFDAATGTFSGTPTNGDVGTMQVTITATDANG
AQVSTTFALTVANTNDAPTLNPIATVHATEDGAQVTGQFTATDPDSGDTLIYSIAQPVNG
FTLNADGSYSFDPSHTSYQQLAAGQTQDVLIPITVTDSAGATSTQNLTITVTGSNDGAVI
SGTDSGSVIEDQQVSATHILSCGGKLAVTDVDTGEATFAVQHGTGANGYGSFVLGPSGTW
TYSADNTQAAIQQLKAGETLTDTFTVSSTDGTTHTVTVTIQGSNDAPVLAAQTKAVTEDG
ALLTGKMVATDADAGDTQVFSTAQTVDGFTLNADGSYKFDPSHASYQHLAAGQTQDMVIP
VTVTDNAGATSTQNLTITVTGTNDAAVVSGRVANAVNEDGVVSTGGLLRASGLLTVVDPD
AGEAVFVAQTGVAGAGHYGSFSIDASGHYSYTADNSQTAIQQLKAGETLADRFTVTTADG
TQQVVTITIKGAEDRPVLQAQTHAVTEDGARLTGQMLATDVDAGDTQRFSVAHPVAGFVL
NPDGSFSFNPAHASYQHLAAGQTQDIVIPVTVIDSAGGVSTQNLTITVTGVNDRAVISGV
SNGRITEDQGVSATHTLSCGGKLDVTDVDTGEAAFAVQHGAGANGYGNFVLDPSGNWTYS
ADNTQSAIQQLKAGETLTGTFTVSSTDGTTHTVTVTIQGTNDAPVLAAQTQAVTEDGTLL
QGRMVATDVDANDTKTFSSAQSVDGFTLNADGSYSFDPSHASYQHLAAGQTQTLTIPVTV
TDSEGATSTQNLTIRLTGTNDAPHISGADVGRVVEDQTLSVSGKLAISDADDGQAHFIAQ
TATTGSYGSLTIGEDGQWQYQLDNTKPEVQALKSTETATDTFTVHSADGSSHNITITIQG
QRDNVVIGGVDVGVVKEDVSAVTGGHLVATGESAEFVPLQQHTTLGNFAMDASGAWTYTL
DNSHASTQALKAGETATDTVSITHTDGTTHTVTITVMGTNDAPVLTAQSQAVTEDSTLLT
GQITASDVDRGDTQIFSLAHAVDGFTLNTDGRYSFDPAHGSYQHLATGQTQTLTIPIIVT
DSVGASSTANLTITLTGTNDGATIGGVSTGTASEDSTLLTTGQLTIADADDGEAHFVAQP
HIQGLHGSFSLSEDGQWRYSADSSQAAVQQLKAGDSLSEHFTVHSADGTAHTVTVTLQGT
NDAPVLQAQHQSVSEDASLLQGQMVASDVDHGDTQTFSIAQAVDGFTLNADGSYSFDPSN
AAYQHLTAGQTEDLVIPVTVTDSAGASHTQNLTITITGTNDGAVIGGAFTGAVTEDQSVQ
SGQLQASGQLTITDADSGESHFIPQTDVLGSDGLGRFSLDAQGQWSYSADNSQKAIQQLA
AGATLNDSFTVQGADGAQHTVTVTLTGTNDAPQFHLQVAPDYHPEVESSDVLPLVINNIG
SSRIAQYLDIQQILQGQAPIGAYANTCVTVSLGGLALIGPDGQAAQVFAAGQEFKLQTLV
DWQQQGPGHEFRVFGPKGTTELDLSLHDAGDPNHITGPHGQASLYVNQVMIWPPFHAWQA
SAGSGTGTATDSGLLTATEGGPALSGQFHASDIDVGDTLAFSLGHPVDGFTLNADDSYSF
DPAHPSYQHLAAGQRETLSIPITVTDSAGASTTQTLSILLRGANDAATFGGVETGATQED
RSDALSGQLTVADADDGEAHFIAQSGTAGSYGQFSLDENGQWHYQIDNSKPEVQALRDGQ
NVTDSFTVRSADGTAHTVNISIAGKNDSAVITGEDHQTLTEDQNVSGGQLIAHGQLHAST
PDAGGDQFTPLSDLAGKFGHLSLGADGSWVYKADNNSPAVQALHAGQSATESFTVHSVDG
TAHTLSLSIQGADEPRHDIWSSIVSTIKSNWAPYNLIGHAIDAGDVGALGSMRLAFTSGN
PSVVDANGHVLASGSSLGAFGHDISMGQVVDLLKSHPGARLVLDGNVGGSSGFYLHDSGD
VMELHFRGLSHYNQANSPMGQLPIDGLPTPLSAAGTVPDVSDLTISVTDISVDHASQLQT
SGQLNISDADAGENHFNAQKDVAGTYGSFSIDASGHWVYSVDNGQAAVQNLQAGAQLTDS
FMVTSADGSSHVISVAVYGKNDAPVLFAQSQSVSEDGALLSGQMHASDVDAGDTLSFRID
QAVAGFTLNADGSYSFDPSNAAYQHLAAGQTQTLVIPVFVTDHMGASSVQDLTITLTGAN
DGPQVLHATAATAGDLGAVNEDTPRVFTEAELLQAVGATDPDDGGSLHIVAGSLTSSHGT
FTGDAAHGFTFTPTANFSGQDIDLQFSVTDGSATREAHVQLDITPVADAPLRVTAPTITT
DFDSISLPGVGWTQTDPSPTGWHTDESSGVEIGQERLYGGSGNNQILSLTASWGSNIYRE
LPTQAGEAVHLGFDLSARPGFAVTPLEVLWEGKVIDTITPVAGSYSMLHHDYDLLANGSN
SRLELRASGQGEARVLIDNIQIGPPEHVLLGDHDTLLKPYLGFQLADTDGSETLSYGIHG
LPVGFTLTDGTHSAIVSSAGQVIETAGWNQDTLAVRAPQGFQGTVSIQVTAHAEEGSNHQ
TANAADLTLELRLVPVSHAAVISGENAITLTEDQQVDALGFLEHFGQLNVSDPDAGEAKF
VAQNAVNTQYGRYTIGGDGYWLYEADNKLPAIQQLKTGESLTDSFTVHSIDGTPQTITVT
IQGQDDVAAFVPQAQAAQVDSADVYTQISVQLLDSQHREFEFMGEVLLANTDPRNYIDAE
MSFTNTGVALRAPDGSLSKVYHAEGGMTTVPVMDLRAWHNKGQGYEVVLVDATGMPVNVF
PHDAGDPDKLSGFNQYVTGFSGPFNAAVVLYPDMQAASSGGAVQGGAFVPAIAPLLYDAA
HVLEDNGQQISGFIEVQDADKGQSSMQAMTDHHAAHGTFSIDVNGDWVYQLDSRRPDVQA
LKAGETLLETITVHSADGTPHEVNITIHGQNDGAVISGADTGQLVEDQNVSAASTLEAHG
QLTVTDADAGEAAFVAQNAVATQFGHISIDASGAWRYEVDNNKADVQALKAGESLTEHLT
VSSVDGTTHVLTVTVQGSNDAPVLQAQSQSVTEDGSLLRGQMVAQDADHGDVLRFTTHAA
VGGFALNADGSYSFDPSHHDYQGLTAGQVRDIDIPIVVTDSAGASSIQMLSIHITGRNDS
TFIYGVDSGNVQAGVHPQTSGDLHASDDDAGQSGFQAGTVKGSFGSLSIDAQGHWTYQVD
GHDSFVQHLLPNSYITEQLTVHSVDGTAHDIEIEVSGAPAGSQLSAPADSANVQHVALPQ
VWGAVQGQPVAGHDVFAGILAAVRAGGYQYQGLADELDKGSLAAVGNMDLSITGVHAQVV
DKFGQVVQSISPTAENAATLHMSDVLLWHAQGHSIIATSDSGFDSTRMWLHNTGDMHNIQ
VILPNGTGFNGWVGEYSLGELSLSPAIAYPAPPASDEQDDEPIQILNFNEVIRNEDSMQS
MTAAGVKAGDEAVIGNNDYPLPLNAATEYKGNLNELSLSSSHGTSGTTPLTTTDAHFAAQ
ASPVDHYLQMLGLSPTAVNLSPSVPVELLPSLSSSDHFNDIDAPMYAIPEVNHFENPLLD
NDKEQHKDRHFDLFDVTELHTNPNDDDLLHSALNDMHNQM