Protein Info for SO4317 in Shewanella oneidensis MR-1

Annotation: RTX toxin, putative (NCBI ptt file)

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 200 400 600 800 1000 1200 1400 1600 1800 2000 2200 2400 2600 2768 PF17963: Big_9" amino acids 155 to 242 (88 residues), 40.5 bits, see alignment (E = 1.3e-13) amino acids 246 to 343 (98 residues), 56.8 bits, see alignment (E = 1.1e-18) amino acids 347 to 444 (98 residues), 55.9 bits, see alignment (E = 2e-18) amino acids 1954 to 2051 (98 residues), 28.5 bits, see alignment (E = 7.3e-10) amino acids 2055 to 2153 (99 residues), 37.1 bits, see alignment (E = 1.5e-12) PF17803: Cadherin_4" amino acids 156 to 224 (69 residues), 38.1 bits, see alignment (E = 6.5e-13) amino acids 247 to 325 (79 residues), 55.1 bits, see alignment (E = 3.1e-18) amino acids 348 to 426 (79 residues), 57.3 bits, see alignment (E = 6.3e-19) amino acids 1956 to 2033 (78 residues), 33.3 bits, see alignment (E = 2.1e-11) amino acids 2057 to 2135 (79 residues), 41 bits, see alignment (E = 8.2e-14) PF17892: Cadherin_5" amino acids 164 to 252 (89 residues), 32.9 bits, see alignment (E = 1.7e-11) amino acids 259 to 353 (95 residues), 32.4 bits, see alignment (E = 2.3e-11) amino acids 358 to 453 (96 residues), 34.1 bits, see alignment (E = 7e-12) amino acids 2016 to 2058 (43 residues), 27.1 bits, see alignment (E = 1e-09) PF18200: Big_11" amino acids 182 to 254 (73 residues), 69.2 bits, see alignment (E = 9.1e-23) amino acids 283 to 355 (73 residues), 69.8 bits, see alignment (E = 6.2e-23) amino acids 384 to 456 (73 residues), 71.9 bits, see alignment (E = 1.3e-23) amino acids 1990 to 2062 (73 residues), 35.5 bits, see alignment (E = 3.2e-12) amino acids 2093 to 2166 (74 residues), 44.1 bits, see alignment (E = 6.4e-15) PF19077: Big_13" amino acids 575 to 655 (81 residues), 35.6 bits, see alignment (E = 3.9e-12) amino acids 670 to 764 (95 residues), 56.1 bits, see alignment (E = 1.7e-18) amino acids 779 to 873 (95 residues), 56.1 bits, see alignment (E = 1.7e-18) amino acids 888 to 982 (95 residues), 56.3 bits, see alignment (E = 1.4e-18) PF19078: Big_12" amino acids 981 to 1076 (96 residues), 102.8 bits, see alignment (E = 3.8e-33) amino acids 1080 to 1175 (96 residues), 106 bits, see alignment (E = 3.7e-34) amino acids 1179 to 1274 (96 residues), 106 bits, see alignment (E = 3.7e-34) amino acids 1278 to 1374 (97 residues), 105.7 bits, see alignment (E = 4.7e-34) PF00353: HemolysinCabind" amino acids 2580 to 2609 (30 residues), 17.4 bits, see alignment (E = 1.3e-06) amino acids 2610 to 2644 (35 residues), 29.1 bits, see alignment (E = 2.8e-10) TIGR03661: type I secretion C-terminal target domain (VC_A0849 subclass)" amino acids 2657 to 2738 (82 residues), 68.3 bits, see alignment (E = 5.3e-23)

Best Hits

Predicted SEED Role

"T1SS secreted agglutinin RTX"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Search structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

See Q8E9G6 at UniProt or InterPro

Protein Sequence (2768 amino acids)

>SO4317 RTX toxin, putative (NCBI ptt file) (Shewanella oneidensis MR-1)
MGSVITSKKGLLKLVNGQINIEVDGSKQPAKDGEQLPKGAVLHIGENATYEITFDDGTKL
SNEVAPATEAMALPVTGEATPDEIQALQDLIASGEDPTTNLPETAAGNAPASDGSSGYVS
LARDGSESLATSGYSTSGQTLAAFATNSPEQSIATDSPSILANDSNTVAEDSVATGNVLN
NDSDVDNELSVTSFTVSGQTVAAGTTVALEGGSLVINTDGSYTFTPNANWNGQVPVITYT
TNTGSTATLTINVTPVDDPSVVVNDTNTVAEDSVATGNVLSNDSDVDNELSVTSFTVSGQ
TVAAGTTVTLEGGSLVINTDGSYTFTPNANWNGQVPVITYTTNTGSTATLTINVTPVDDA
SVLTNDSNTVAEDSVATGNVLSNDSDVDNELSVISFTVNGQTVAAGTTVALEGGSLVINA
DGSYTFTPNANWNGQVPVITYTTNTGSTATLTINITPVANGGPSVTINTDTNNDGFISNE
ELGGATEVNVTIGLEGTGANTGDTLTVNGVDYILTQEDIDNGFVNLTLPAPAEGETITVV
ATITDAAGNTSPEGSDSAVLDTTGPVITVSAPDDTQDTTPTITGTTDAPPGSTITIVVTD
STGAQQTLTTTVNPDGTYAVDVTNPIAEGGYTAEASVTDPAGNTGKASDNGNVDTKIDQD
GDGNTVAITAITQDTGSSSSDFITNDNTLIFKGTVDLGDNSTLAVTINGVVYTTANGLVI
DAQGNWSVDLTGTVLPDGIYPVSATVTDVAGNSKTVTQDVVIDTKIDQDGDGNTVAITAI
TQDTGSSSSDFITNDNTLIFKGTVDLGDNSTLAVTINGVVYTTANGLVIDAQGNWSVDLT
GTVLPDGIYPVSATVTDVAGNSKTVTQDVVIDTKIDQDGDGNTVAITAITQDTGSSSSDF
ITNDNTLIFKGTVDLGDNSTLAVTINGVVYTTANGLVIDAQGNWSVDLTGTVLPDGTYPV
SATVTDLAGNSKTVTQDVLIDTQGPSVVVNIVDDVLTVGETSEVTFTFSEKVKDFEVGDL
TMVGGTVTDLKTTDGGKTWTGTFTPTPGFTGTASVTVNNGSYTDLNGNMGTGGQDTAPVD
TQAPSVVVNIVDDKLTVGETSEVTFTFSEKVKDFEVGDLTVVGGTVTDLKTTDGGKTWTG
TFTPTPGFTGTASVTVNNGSYTDLNGNVGTGGQDTAPVDTQAPSVVVNIVDDKLTVGETS
EVTFTFSEKVKDFEVGDLTVVGGTVTDLKTTDGGKTWTGTFTPTPGFTGTASVTVNNGSY
TDLNGNVGTGGQDTAPVDTQAPSVVVNIVDDKLTVGETSEVTFTFSEKVKDFEVGDLTVV
GGTVTDLKTTDGGKTWTGTFTPTPGFTGTASVTVNNGSYTDLNGNVGTGGQDTAPVDTQA
PTAPTVLIVDDGTPGDGLLTQGEINSNGAGVQLTVSINAADFSAGGHVNLTIVNGTATSN
VELKLVNGELQFANGTPATGFTYNNGTISWTETAPANGQSITVTATQTDKAGNTSVQSTD
TAIVYTPGNCNIVVNESTLRDGVPNIVSGTINFTAGTQALTAFNFNSSSINAATNLAGHN
ITWAIATNGALIGSINGVQAIILTLSDISAIAPGTSGSITVNVELLDNLLQINGLNGENL
STLINGIVIEGTSVNGSVVTGNVNVEIVDDVPDANDDTGSVNVIVDSFRVSGVEANWTNW
TNGTSVTTFDGNNNPNGGGTDNDSGLDQIRWGNPINTYKSGYGFIDNDSALNGQFALNQD
IILGTFTHYNFPISSGGAITKATMDITFSVTDAYGVVTPVTLKVNFDHNETPNNDNDPEA
SKDIIKVGNTNVTFEHQGQVYTLQVIGFRVPGTNQVVTEIKTAENAASSYELVVRIVAGD
GYALPSTTGNVLFNDVIGADNDLMVVGAAAGNQTGSGTSGNVGTIINGQYGTLIVFANGS
YTYTVTANASAIPTGATETFTYTMQDADGDKSSALLTINVNTVNANALKAVQDQKNGPED
TAVIGNVLENDGNKNTSVTHFTVANSATQHTAGSKITLAEGELTLNADGSYTFTPAADWN
GQVPVITYTTNTGATSTLTIVVTPVDDPTVTKPESKTIAEDALAKGNVLENDTDVDNTLS
VTSFQVNGVTYNAGNTWYQLPEGTLQLKTNGEYSFDPKDHWSGSLPVITYTTNTGATSTL
NITVQAVADAPNLTINGYTSVAAINFEDAKFSGSWDGVKANDIKGLNTIGTWHTSNRGGQ
VEIGYESVYVSGGSNSNKVMEIEFNSGDKTLYTDIQADAGRFYELGFDIAARSGSVSTSG
LTIKLIPLDAQGNPLNAQAITLYDFDPTNANWLRDQKITLPINQSGKYRLQFEGDNGDSY
GALLDNLAFKVVDNMGYRGEFIKLSNISSSLKDTDTSETLSLKLQGLPEGAVLKDALGNL
ATVGKDGTVDITNWDKSSLQIKVANHGNFTITVVATATETSNQDTAQSTADFQVTVLHPN
NVVGGQSVDSFIMTNWTGNSAQFAVNLGGYGTVASSSKTEYIAKDTELLINAGNSNDYVD
LGVSTANNIVNTGSSLPNLNNPVVTQAEILSSKFMAQDAITTNAGTLKSDVLQPLQPKTD
TVNLGSGDDTVNGGQGSQLVYGGSGDDLLIGGEGIDGLRGGDGNDTLIGGLGDDVLRGDS
GADTFVWRYADADKGTDHIMDFKVGEDKLDLSDLLQGETANTLESYLKFSLNNGSTVIDI
DANKDGTFDQHIVLDGVNLYSQYGATNNAGIINGLLGSNGQGPLIIDTQPTPVDLEVKLD
PLKTVDPV