Protein Info for GFF4156 in Salmonella enterica subsp. enterica serovar Typhimurium str. MS1868

Annotation: T1SS secreted agglutinin RTX

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 500 1000 1500 2000 2500 3000 3500 3824 signal peptide" amino acids 1 to 20 (20 residues), see Phobius details PF22783: BapA_N" amino acids 3 to 122 (120 residues), 111.3 bits, see alignment (E = 2e-35) PF17936: Big_6" amino acids 191 to 271 (81 residues), 85.7 bits, see alignment (E = 1.7e-27) amino acids 298 to 371 (74 residues), 34.7 bits, see alignment (E = 1.4e-11) amino acids 410 to 473 (64 residues), 39.9 bits, see alignment (E = 3.3e-13) amino acids 483 to 564 (82 residues), 68.9 bits, see alignment (E = 3e-22) amino acids 673 to 754 (82 residues), 69.6 bits, see alignment (E = 1.8e-22) amino acids 757 to 839 (83 residues), 80.5 bits, see alignment (E = 7.1e-26) amino acids 842 to 924 (83 residues), 92.3 bits, see alignment (E = 1.5e-29) amino acids 951 to 1022 (72 residues), 42.4 bits, see alignment (E = 5.5e-14) amino acids 1158 to 1225 (68 residues), 31.2 bits, see alignment (E = 1.7e-10) amino acids 1269 to 1330 (62 residues), 35.5 bits, see alignment (E = 8e-12) amino acids 1367 to 1433 (67 residues), 37.9 bits, see alignment (E = 1.4e-12) amino acids 1469 to 1535 (67 residues), 29.6 bits, see alignment (E = 5.4e-10) amino acids 1575 to 1639 (65 residues), 29.9 bits, see alignment (E = 4.4e-10) amino acids 1650 to 1732 (83 residues), 76.9 bits, see alignment (E = 9.4e-25) amino acids 1866 to 1931 (66 residues), 40.6 bits, see alignment (E = 2e-13) amino acids 1970 to 2033 (64 residues), 44.5 bits, see alignment (E = 1.2e-14) amino acids 2073 to 2139 (67 residues), 38.4 bits, see alignment (E = 9.9e-13) amino acids 2180 to 2247 (68 residues), 38.1 bits, see alignment (E = 1.2e-12) amino acids 2389 to 2453 (65 residues), 40.1 bits, see alignment (E = 2.8e-13) amino acids 2491 to 2558 (68 residues), 39.4 bits, see alignment (E = 4.6e-13) amino acids 2595 to 2661 (67 residues), 32.5 bits, see alignment (E = 6.7e-11) amino acids 2672 to 2754 (83 residues), 78.4 bits, see alignment (E = 3.3e-25) amino acids 2783 to 2854 (72 residues), 28.1 bits, see alignment (E = 1.6e-09) amino acids 2860 to 2939 (80 residues), 57.7 bits, see alignment (E = 9.2e-19) PF19077: Big_13" amino acids 289 to 376 (88 residues), 58.9 bits, see alignment (E = 4.9e-19) amino acids 395 to 480 (86 residues), 64.8 bits, see alignment (E = 7.2e-21) amino acids 575 to 671 (97 residues), 47.4 bits, see alignment (E = 1.9e-15) amino acids 941 to 1028 (88 residues), 60 bits, see alignment (E = 2.3e-19) amino acids 1054 to 1134 (81 residues), 46.2 bits, see alignment (E = 4.6e-15) amino acids 1144 to 1235 (92 residues), 70.8 bits, see alignment (E = 9.5e-23) amino acids 1261 to 1339 (79 residues), 49.6 bits, see alignment (E = 3.8e-16) amino acids 1359 to 1442 (84 residues), 59.6 bits, see alignment (E = 2.9e-19) amino acids 1461 to 1546 (86 residues), 50.7 bits, see alignment (E = 1.8e-16) amino acids 1566 to 1647 (82 residues), 40.9 bits, see alignment (E = 1.9e-13) amino acids 1749 to 1835 (87 residues), 34.7 bits, see alignment (E = 1.7e-11) amino acids 1854 to 1939 (86 residues), 55.6 bits, see alignment (E = 5.1e-18) amino acids 1967 to 2043 (77 residues), 53.6 bits, see alignment (E = 2.2e-17) amino acids 2062 to 2149 (88 residues), 65.7 bits, see alignment (E = 3.8e-21) amino acids 2168 to 2254 (87 residues), 64.6 bits, see alignment (E = 8.6e-21) amino acids 2272 to 2358 (87 residues), 49.2 bits, see alignment (E = 5.2e-16) amino acids 2378 to 2461 (84 residues), 54.2 bits, see alignment (E = 1.4e-17) amino acids 2472 to 2566 (95 residues), 65.7 bits, see alignment (E = 3.9e-21) amino acids 2580 to 2670 (91 residues), 61.9 bits, see alignment (E = 5.6e-20) amino acids 2774 to 2858 (85 residues), 53.4 bits, see alignment (E = 2.5e-17) PF12245: Big_3_2" amino acids 446 to 475 (30 residues), 10.6 bits, see alignment (E = 0.00043) PF17963: Big_9" amino acids 3101 to 3188 (88 residues), 27.7 bits, see alignment (E = 3.4e-09) amino acids 3348 to 3443 (96 residues), 27.7 bits, see alignment (E = 3.4e-09) TIGR01965: VCBS repeat" amino acids 3642 to 3691 (50 residues), 16.9 bits, see alignment (E = 5.4e-07) TIGR03661: type I secretion C-terminal target domain (VC_A0849 subclass)" amino acids 3741 to 3821 (81 residues), 59.9 bits, see alignment (E = 4.5e-20)

Best Hits

Predicted SEED Role

"T1SS secreted agglutinin RTX"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (3824 amino acids)

>GFF4156 T1SS secreted agglutinin RTX (Salmonella enterica subsp. enterica serovar Typhimurium str. MS1868)
MRLLAVVSKLTGVSTTVESSAVTLNAPSIVKLSVAREEISQLTRINQDLVVRLHSGETIT
IKNFYVTNDLGASQLVLAENDGTLWWVENPQAGLHFEQIADINELLVTSGASHEAGGAVW
PWVLAGAVAAGGIAAIASSGGGDSHHHSDGDNPPPDNTNPDGNPPDNSNPGGSTPNGNTP
GSSNPVDTTPPLAPGELLISADGKTVSGQAEAGSTITIKDPSGNVVGEGKADSDGKFSID
LTAPQISGEQLTVTATDDAGNTGPSATIDAPNIPLPDTPVITAAIDDAAPLTGTLSNNQF
TNDNTPTLEGTGSAGTVIHIYANGQEIGSTTVDTSGNWHFAITSALADGENHFTAIATNV
KGESSESARFTLTIDTLIPDAPRVELIADNTGLLTGPLQNNDRTDEAKPLFSGQGDAGNT
ITIKEGSTVIGSATVDENGRWTFTPTTPLSDGEHTFTVEQSDKAGNTSRVTTTPTIIVDT
TPPDAAIIDNVAKDGTTVSGTAEAGSTVSIYDPAGNYLGSTITGENNHFSITLNPAQTHG
ERLEARIQDAVGNIGPATEFTASDSQYPAQPTILTVTDDAGAVTGLLKNGDATDDNRPTL
SGTAEPGSTISINDNGFPVATFPPIVADADGKWSFTPSLALADGDHVFTATATNDRGTSG
QSVSFTIDIDTQPPVLEGLAVSDVGDRLTGTTEAGSTVVIKDSLGNTLGSGTAGDDGTFS
IGISPAKINGETLSISVTDKAANSGPVETLNAPDKTAPAAPDGLTVATDGLSVSGQAEAG
ATVTIRDSSNTVLGSAVANGNGQFIVPLNTAQTNGQALIATATDVAKNESAAATVIAPDS
TAPEMPKNVVISEDGTSISGTAEPGSAITIATPDGKPLGSGKADGEGHFTLPLVPAQTNG
EQVTVTATDSANNVSPPTTAQAPDITAPDKPIITQVLDDVESFTGPLVNGQTTNDNRPTL
SGTAEAGARVEVFDNGFSLGLATLQPNGAWTFTPSQNLGEGAHRLTVIATDAKGNASQAA
SFDLVVDTQSPQQPVITFITDDAPGILGSVAHLGLTNDSTPTINGTGEPGSTVHLYQNGA
RIADIIVGNSGVWSYAYTTASPLADDTYTFTVTASDSNGNTTPFSTDFTITIDTQAPAAP
GVIGVADGDGNTIDTNQITQESQPRLSGSGTAGDTIILYDNGNAIGQALVGTDGRWQFTP
PAALGDGDHLLTARANDPAGNESPESISFTLRIDTQAPDAPQIVSAAITGGEGEVLLANG
SITNQRMPTLSGTGEPGTIITLYNNGVELATVQVNPQGSWTYPLTRNLSEGLNILTATAT
DAAGNSSPTSGVFSVTLDTQPPAQPDAPLISDNVAPVIGNIGNNGATNDTTPTFSGTGEI
GSTIILYNNGSEIGRTTVGDNGSWNFTPAALTPETYTITVTETDRAGNISPPSASVTFTL
DTTAPANPVITFAEDNVGEVQDTIVSGATTDDNTPVIHGTGDIGSIITLYNGSSVLGVVT
VDETGTWTLPVTSALPDGVYTLTAIAADAAGNSSGVSNSFTFTVDTVPLQPPVVNEILDD
VAPVTGPLTDGAFTNDRTLTINGSGENGSTVTIYDNGVAIGTALVTDGVWTFNTPELSEA
SHALTFSATDDAGNTTAQTQPITITVDITAPPAPTVQTVDDDGTRVAGIADPYATVEIHH
IDGTLVGSAVANGTGEFVVTLSPAQTDGGTLTAIAIDRAGNNGPATNFPASDSGLPAVPA
ITAIEDDVGSVQGNIAAGGATDDTTPTLRGTTDIGSTVEVFIDGDSAGFATVDASGNWIF
EIATPLSESTHYFTVQATNANGPGGLSAPVGITVDLSAPAQPVITSATDDVPGMTGTLDN
GALTNDSRPTLNGTGEAGATIRILDNGVEIGSATVDQSGNWRFTPNAPLESNAHIFTAVA
TDPAGNSGQPSDGFTLNIDAQAPDVPVITSVIDDNNQPTVPVLPGQSTDDRQPILNGTGE
PGATITIFDNGTPLGTAQVGENGSWTFPVPRNLSEGSHNLTVSATDPAGNTSAVSAPWTI
VVDITPPAIPVLTSVVDDQPGITGNLVSGQLTNDATPTLNGRGEAGATINVYLDGNPASI
GTTTVNSDGTWSFTPQTPLANGSHTFTLSATDPAGNSSAVSSGFVLTIDATPPAAPVIAS
VADNTAPVTGIVPNGGSTNETRPTLSGTGEAGTTISIYNGSALVGTAQVQANGSWSFTPS
TSLGAGVWNLTATATDAAGNTSAASEIRSFTIDTTAPAAPVIDTVYDGTGPITGNLSSGQ
ITDEARPVISGTRETNTTIRLYDNGTLLAEIPADNSSSWRYTPDASLATGNHVITVIAVD
AAGNASPVSDSVNFVVDTTPPLTPVITSVSDDQAPGLGTIANGQNTNDPTPTFSGTAEAG
ATITLYENGTVIGTTTAQPDGAWSVSTSTLASGTHVITAVATDAAGNSSPNSTAFTLTVD
TTAPQTPILTSVVDDVAGGVTGNLANGQITNDNRPTLNGTAEAGSVVSIYDGDTLLGVTS
ANASGAWSFTPTTGLNDGTRTLTVTATDPAGNVSPATSGFTIVVDTLAPTVPLITSIVDD
VPNNTGAIGNGQSTNDTQPTLNGTAEANSAVSIFDNGALVATVNANASGNWSWTPTAALG
QGSHAYSVSAADAAGNVSAASPSITIIVDTIAPGAPGNLVINATGNRVTGTAEAGSTVTI
TSETGVVLGTATADGTGSFTATLTPAQTNGQPLLAFAQDKAGNTGIAAGFTAPDTRVPEA
PIITNVVDDVGIYTGAIANGQVTNDAQPTLNGTAQAGATVSIYNNGALLGTTTANASGNW
SFTPTGNLTEGSHAFTATATNANGTGSVSTAATVIVDTLAPGTPSGTLSADGGSLSGLAE
ANSTVTVTLTGGVTLTTTAGSNGAWSLTLPTKQIEGQLINVTATDAAGNASGTLGITAPV
LPLAARDNITSLDLTSTAVTSTQNYSDYGLLLVGALGNVASVLGNDTAQVEFTIAEGGTG
DVTIDAAATGIVLSLLSTQEIVVQRYDTSLGAWTTIVNTAVGDFANLLTLTGSGVTLNLS
GLGEGQYRVLTYNTSLLATGSYTSLDVDVHQTSAGIISGPTISTGNVMADDTAPTGTTVT
AITNANGVSTPVGAGGVDILGQYGTLHINQDGSYTYTLTKPTAGYGHKESFTYTITQNGV
GSSAAQLVINLGPAPVPGSVIATDNNASLVFDTHVSYVNNGPSTQSGVTVLSVGLGNVLN
ANLLDDMTNPIIFNVEEGATRTMTLQGTVGGVSLVSTFDLYVYRFNDAIQQYEQFRVQKG
WINTLLLAGQSQPLTLTLPGGEYLFVLNTASGISVLTGYTLAISQDHTYAVDSITANTTG
NVLTNDVVPTDALLTEVNGVAIAATGTTEVNGLYGSLIIDARGNYTYTLKNGVGADSIKT
PDSFIYTVKAPNGDTDTASLNITPTARALDAINDVSDTLSVATLQDTAAWLDSSVGSASW
GLLGKSGSGSGTFDVATGTVLKGASLVFDVSTLITLGNLNISWAIQENGTVIRNGTVPVA
NITLGSATVTVNLSGLELDAGTYTLNFTGTNTLAGAATITPRVIGTTVDLDNFETSGTHT
VLGNIFDGSDAAGAMDQLNTVNTRLSISGYNGSAATLDAAANTTSATIQGHYGTLQINLD
GAYTYTLNNGVAMSSITSKEVFTYQLDDKMGHTDSATLTIDMAPQIVSTNQNDVLIGSAY
GDTLIYHLLNGADATGGNGVDRWQNFSTAQGDKIDIHELLTGWDHQAATLGNFVQVHTSG
ANTVISVDRDGTGSAFKSTDLVTLENVQLTLNDLLQNNHLITSG