Protein Info for CSW01_18380 in Vibrio cholerae E7946 ATCC 55056

Annotation: type I secretion C-terminal target domain-containing protein

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 500 1000 1500 2000 2500 3000 3263 TIGR03660: T1SS-143 repeat domain" amino acids 371 to 512 (142 residues), 161.1 bits, see alignment (E = 1.5e-51) amino acids 516 to 657 (142 residues), 186 bits, see alignment (E = 3.2e-59) amino acids 659 to 801 (143 residues), 174.8 bits, see alignment (E = 9.7e-56) amino acids 803 to 945 (143 residues), 178.7 bits, see alignment (E = 6e-57) amino acids 947 to 1078 (132 residues), 152.1 bits, see alignment (E = 9.6e-49) PF00353: HemolysinCabind" amino acids 3122 to 3154 (33 residues), 32.1 bits, see alignment (E = 4.4e-12) TIGR03661: type I secretion C-terminal target domain (VC_A0849 subclass)" amino acids 3167 to 3260 (94 residues), 51.9 bits, see alignment (E = 1.4e-17)

Best Hits

Predicted SEED Role

"RTX toxins and related Ca2+-binding proteins"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (3263 amino acids)

>CSW01_18380 type I secretion C-terminal target domain-containing protein (Vibrio cholerae E7946 ATCC 55056)
MQSQVVTQPVTVTSLSGNVVVVNAQGQARIVNAGDVLQPDEIIITVNQSAIELQTTQGGV
QIDENCVACLPEFSADGQPEVQAAPVQGQINLDLAQLDTANFDEQAIAAIQQAILDGVDP
TTALEAAAAGAGAGGSANGGAITIDYNFLEVLASTAFDTQGYNQTFSTTQTLVNPLRFAA
GGESLSTQVTEGSLSLGTYPQTSTVTSLITAGSLALLPASFVPEAAFLTSLLAELNQDIT
SSGQPVVFRYDAATNSIIGEQNGSTVLSIAISAESIGRDVNLTITTTLSQPIDHLPSVGG
GLVSISGDQISIALQLTGTDSNGNVIQAPIDVVVAINDGSAPVMVDEPTLSLNENDLPAG
SDGADPLTVSGQFDTQLGSDQVASYQIDPSTANPIAGLTSQGDAVILGEPTLIDGNRVYQ
ATAGGRDIFQLTLNADGSYQFVLQGTLDHAAGSDALTISLPIVAIDYDNDSSAPGNLNIE
IQDDKPIIIGAEQLTVAEQTLDTGSIGGGASLVADGNFTTTQGSDGVVSYRLDSLTDSVA
GITSGGVAVTLSESVDANGNYTYTATAGGEPVFTLLLNQDGSYRFTLQGSLDHALNSDEL
LVNFTVVATDFDGDTASITLPVTVKDDKPYFTNVTSLNVHENDLPQGSDVTKEPLTASGQ
FELVQGSDRVASFTLDSSVNPVQGLTSNGVAVTLSAPVDDGHGNLTYTAMAGAVTVFTLT
LNTDGTYSFTLAAPVDHALNSNDLTLNFQVIATDFDGDSDSIVLPVKINDDKPYFTNVQG
LYVHENDLPQGSDTDKEPVTVNGQFQLVQGADTVASFALDSSVNPVQGLTSNGVAVTLSA
PVDDGNGNLTYTAMAGSVTVFTLTLNSDGTYSFTLAAPVEHALNSDSLTLNFKVIATDFD
GDTASIVLPVTVLDDQPSVISAQALSVNEDDLATGTDQSKESTTANGQFTTTQGADGIAH
YQIDTSTSSNTGLTSQGQPVVWGAPSITTTSSGQVYTYQGIANGVVIFTLVLRADGSYSF
TLNGAVDHPLNANELTLNIPVLAQDADGDTSPITLPVTIVDDVPILHDKNIALQEGSVAS
SVNLFSRDNNLSPDTQGADRGVITHFSAVDEAGRDIQFREGSVLSNDIELNGAAKTVTVV
EIVNGVSRDLGTLTIQPNGTATFTPVTQLDHTDGNDIKFTVDVTATDYDHDTSTEQLNIT
IYDHKATITQQKFTGYEDQGHDAALNLVPAGEQSNAQDNLGGLPVEALKLALQVNLYDVD
QGESLGEVSIWNPNQIRGDFYYLDSANQLVKLDVDPASGHVVLPAALLQQSINGTIATVE
NLYFVPDRHYSTGNGGMNASVSVEILHNGVRDHFTNGNMRIEIESVADIATWKSSSEFHY
DAVEDGSNVSLNIAAETQDNSNPEAITYQIRFTENGANANLVYSDGSPIPTKTDANGTYY
EVPANKIAQVQVDPADNFAGQIKLDVTAITKESTNYVAGKQTAQSETKEIVIDVAPEADR
GSFTVNRISIFEDNASNQNAVDPSVEHDPLLLSEVISMTGSSDADGSEALFVRLSDFTDT
GATLVWLGSGPSPITVGTYPNGETYYEIPQSALSQVEVLPTKHSNENFSFVVEGIVKDTV
NLSTGQVQDIESLGSKTVNVTVKGVADLPNIDFISGNTQWQSFNDGTHQGVITTVAEDSL
VDLNFSIISGEIADSPTDSSETISVLLSNIPDGVKLFDSDGTSVDLVFAGYDSNHKPIYQ
ANLTVAQVVTGIQVQPVASSTANIDIKATVIVTENDGHVRQVEETIRILVEPKIDVTENY
HNAVSGNEDDRIHVTWVPQNTPGNIQNPDAQEYFSRVEISGFPDGSRVFVNNVEVTLING
VLVLEPAAGQSDLDFSNQVSAAGYIQVIPPHNSSTDFTLSTAITVKEQDHEYVDAGNPGQ
GIAEEVIHGSIGVKVNPIAEPDGQLLVENAGSVTQTVQADANGKIDFTINNVSGGQAGAN
VIRFDNLDSNTAGSYQSDELVDQLVVSFGNVPQEVLNQLLITGAINNGDGTWTITNEADF
SIKAPNGLVYSSNNDPDKNGFNDIKITITAKVYDQGEDSSEVKITKQVSTELTLSFPTEV
TGNNSVAAQLNWVGDADDLVIGKEDNTVNLGQQIQDKLMVNATGFDAVADELSIVINASD
LPAGASIGGQDFNFVDGHYVFKGTLNPDGSISGLEGLVLIPPRDFAGDFKLPITFVTTDT
QSGDEKTLTAQVPVAISPVADVPSSSGDQPLDNHVTPSITLNVQETLGLDANHQPTDLAN
DTPTQDGIAYEDGIVHLNLAIGLADSLNGSTQGQEVLTEVTLTLNDTNSGVFVDANGQSL
GTSITLTQAELPAALGEIYFKPAPNYPSGNDINTVGMTVTGKVTDSTVFDETNASLQGVS
SSDADKTFTSQVSFEVKPVVDDITIGSGSPISVTGDEDSWIALADQGNAFNVSLNDNDGS
EQFVSLVLTGLPTDFLVKSLSSDYVVKNNGGGEWSVQIRNPNLTSLDLSALAIKPTKDFS
GEVQLGIKVFTQESLLGEPVEHTGQFTLNVTPIGDDVDIAPITNVVGNEGQAIDISLGAQ
ILDKAPSLPGGVTYTENSPETLRVEISGVPDGAFLSLADGTLGTSLGGGVWVFEINAQQL
DKVVFNSGDNNQLNWNGNLHFKVQSVDTGLAGDQHLGSAQEFDVHVDVTAVNDRPELINV
QDQVTEEDTPLLLNSFTLADIDAQLDDPNADYTLQIGVNSGVLIIDSSLSSGLTIQGDGT
GALSITGNVAEINAAIGAGLVKFVPSPDFYGQVAVTLNVNDNGNAGSEIAGDASTAHDNS
AQFVIDVTAVNDKPEVDGIHLTAQIDEASGQKLTGITVSDVDYAGSHTNDVMKVTLSISE
GILSVQAPAGSSVAVSYALDGSVILEGSPEAINALLNHSDSAYGLFVDATAIAGTQINLT
VTAQDMGVYFENASGMALEESKTYPIQVNPVANAPSLSMNPAFGYAQQIYANQSLSAQGI
ALLGAIAALTDLHETLSLRVDHLPAGASLSSTAGSVTDLGNGRWEVSPDALESLKVVGLD
EGVHTLSLTALSTESDGSSAPSANSIDYRIEIAADGSLLDHRSATDDSLLVAGNSGMTLL
SGSGDDFVQGGAGDDVLVGGLGADILVGGTGADMFKWTLDGVDDKVDHIRDFNVNEGDSI
DLIDVVQDLGNHLTMEQLLNNLSVSNQLTAQVVDNDVTLQVTTDNQVQQTIVIENLATQI
DFTGMSSLDIIGTLLDQNLLRHD