Protein Info for SO0341 in Shewanella oneidensis MR-1

Annotation: sensory box protein (NCBI ptt file)

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1515 signal peptide" amino acids 12 to 13 (2 residues), see Phobius details amino acids 35 to 36 (2 residues), see Phobius details transmembrane" amino acids 14 to 34 (21 residues), see Phobius details amino acids 798 to 815 (18 residues), see Phobius details PF07494: Reg_prop" amino acids 55 to 77 (23 residues), 19.6 bits, see alignment (E = 3.4e-07) amino acids 101 to 122 (22 residues), 15.8 bits, see alignment (E = 5.9e-06) amino acids 317 to 337 (21 residues), 16.5 bits, see alignment (E = 3.4e-06) amino acids 364 to 386 (23 residues), 16.2 bits, see alignment (E = 4.2e-06) amino acids 599 to 621 (23 residues), 19.4 bits, see alignment (E = 3.8e-07) PF07495: Y_Y_Y" amino acids 726 to 790 (65 residues), 54.2 bits, see alignment (E = 5.1e-18) TIGR00229: PAS domain S-box protein" amino acids 879 to 956 (78 residues), 24.1 bits, see alignment 3.3e-09 amino acids 961 to 1078 (118 residues), 46.9 bits, see alignment 2.8e-16 PF08447: PAS_3" amino acids 879 to 942 (64 residues), 45.3 bits, see alignment (E = 4e-15) PF00989: PAS" amino acids 961 to 1068 (108 residues), 23.8 bits, see alignment 1.8e-08 PF13188: PAS_8" amino acids 961 to 999 (39 residues), 23.8 bits, see alignment (E = 1.5e-08) PF13426: PAS_9" amino acids 968 to 1070 (103 residues), 28.1 bits, see alignment 9.4e-10 TIGR00254: diguanylate cyclase (GGDEF) domain" amino acids 1079 to 1241 (163 residues), 153.4 bits, see alignment E=4.7e-49 PF00990: GGDEF" amino acids 1082 to 1239 (158 residues), 174.1 bits, see alignment E=8.8e-55 PF00563: EAL" amino acids 1261 to 1493 (233 residues), 252.9 bits, see alignment E=1.2e-78

Best Hits

KEGG orthology group: None (inferred from 100% identity to son:SO_0341)

Predicted SEED Role

"diguanylate cyclase/phosphodiesterase (GGDEF & EAL domains) with PAS/PAC sensor(s)" in subsystem Bacterial hemoglobins

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

See Q8EJW5 at UniProt or InterPro

Protein Sequence (1515 amino acids)

>SO0341 sensory box protein (NCBI ptt file) (Shewanella oneidensis MR-1)
MLGFIGVALSQPILRLIFSVLCIISFFTVPSTGLALPLNAANTQLKFEHIRSEDGLNQNT
ITSLFMDSAGMLWIGTQDGLHSYNGYNFNLFIHSPNDPKSISESYVSDIIQDAEGYLWVG
SFSQGINRLDLKTGTFKRFGVEQGLTDPRVTKLNIVGNTLWIGTQSGLFSFSTRTNRLTK
VSLGTSIEPYITSLANVDNTYLLAGTKGSGTFAVSANTITRLNIPQDAIAYQVKANSSRA
ITLALGNQLWHYDLATQQGKALWQADKNIPYIKDFIQTPEGQFWVVGPEAGLIQLEREAD
KFVATYHQYDMKRTNSISENNILSLLEDPFGNLWLGASYSGLNKINTRRQYFQHLFEYTN
ELPLQSNNVRTIYRSQDQALWIGTEGAGLKRLAVNSTTFEHYTSFFAKALGQQTQHLNLI
LRSIVQDKQGILWFASNYGLGRLTPNGEFNLLNVADSRDTSAEANYIRSLELDNQNRLWV
ATSHALYRKSSVNDEFTPVPLTGIENFHPIQNLLLTLKSDKNILWIGSLNGLVKLDMQTG
HGEVFYHDPHDKNSIINNRIRDIFVASNGDTWFATHGGISRLPAQALSPIFSDYTREQGL
PSDTIYALLEDDDNHIWFSSNAGIGKLNPHNGKIINFNEQEGLQALEFNGGVKLKDSDGD
LWFGGINGINRFNPKTLPNQRSEARVALTAYKIAGQKQTILDLSHPPQIVMNYADQLISF
EVTSLDFSYPGKNRFSYFLEGFDNQWHDLPTGNEITFTNIDPGNYLLHVRHSLEYNSKGN
YALLVNLTVKAPFYRTPFAYAFYVLFTLVSMGWALRWRKQKRSQQQEFETSIRASEERLK
LALWASGDGMWDWNIQEQQVYRTNTDIAVPQWNGHQLLHDNAHPEDRERFKYELTEHLQG
RSPFFEVEYRIEHSPGKWVWILERGKVVETNAQERPIRMTGTTRNITSRKLIENELVLSS
QVLNSMNEAVVVAGLDYRIRSVNPAFSAITGYSEGQISDKFLIHLAYSRQQRDLFNSIEQ
QLLRHKHWAGEIWIRNKARRAILVWLEINQVIDVKGETSHFVAVFTDITERKKAEEDLRF
LASFDTLTGLPNRTLFQDRLNHAISQAHRSNNIVALLFLDLDRFKHINDSMGHHIGDLLL
KAVAHRLQSAVREGDTVARLGGDEFTIILEGVAKTKAATLISEKVLKAFQAPFLLDDKSL
NISTSIGISLYPNDAKDVDSLIKFADTAMYHAKALGRNNFQFYTNKLNEMATRHMQLETG
LKQAISHNELSLVYQPKFCLRNGSLTGLEALLRWQHSELGPISPAEFIPLAEETGMINQI
GHWVINQSCQQLAEWNELGFSDISMAVNLSARQLKADIISTIEVALAVSGLPAKALELEL
TESMIMGNPQESVNILSKLKALGLTIAVDDFGTGYSSLSYLKRFPIDTLKIDREFVRDIT
NDPDDAAITSAIIALAHSLELNVVAEGVETQEQLNFLALQGCDQVQGFLLSKPLSAQDCQ
TLLQQRTKDRQDVKK