Protein Info for GFF1232 in Xanthobacter sp. DMC5

Annotation: hypothetical protein

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 100 200 300 400 500 600 700 800 900 1000 1100 1200 1256 signal peptide" amino acids 1 to 30 (30 residues), see Phobius details transmembrane" amino acids 48 to 69 (22 residues), see Phobius details amino acids 106 to 127 (22 residues), see Phobius details amino acids 389 to 411 (23 residues), see Phobius details TIGR00229: PAS domain S-box protein" amino acids 431 to 556 (126 residues), 68.2 bits, see alignment E=7.2e-23 amino acids 557 to 685 (129 residues), 38.9 bits, see alignment E=8.8e-14 amino acids 690 to 806 (117 residues), 65.3 bits, see alignment E=6e-22 PF13188: PAS_8" amino acids 434 to 486 (53 residues), 33.2 bits, see alignment (E = 1.2e-11) amino acids 690 to 736 (47 residues), 20.2 bits, see alignment (E = 1.6e-07) PF00989: PAS" amino acids 435 to 546 (112 residues), 45 bits, see alignment E=3.5e-15 amino acids 567 to 675 (109 residues), 26.1 bits, see alignment E=2.6e-09 amino acids 690 to 797 (108 residues), 49.4 bits, see alignment E=1.5e-16 PF08448: PAS_4" amino acids 440 to 551 (112 residues), 49.1 bits, see alignment E=2.1e-16 amino acids 584 to 679 (96 residues), 29 bits, see alignment 3.7e-10 amino acids 692 to 801 (110 residues), 30.8 bits, see alignment E=1e-10 PF13426: PAS_9" amino acids 446 to 547 (102 residues), 42 bits, see alignment E=3.3e-14 amino acids 584 to 676 (93 residues), 14.6 bits, see alignment 1.2e-05 amino acids 697 to 799 (103 residues), 62.2 bits, see alignment E=1.7e-20 PF08447: PAS_3" amino acids 584 to 670 (87 residues), 52.7 bits, see alignment 1.5e-17 amino acids 710 to 792 (83 residues), 30.7 bits, see alignment 1.1e-10 TIGR00254: diguanylate cyclase (GGDEF) domain" amino acids 807 to 967 (161 residues), 117.5 bits, see alignment E=4.8e-38 PF00990: GGDEF" amino acids 811 to 964 (154 residues), 136.1 bits, see alignment E=3.4e-43 PF00563: EAL" amino acids 986 to 1219 (234 residues), 255.7 bits, see alignment E=1.4e-79

Best Hits

Predicted SEED Role

"diguanylate cyclase/phosphodiesterase (GGDEF & EAL domains) with PAS/PAC sensor(s)" in subsystem Bacterial hemoglobins

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (1256 amino acids)

>GFF1232 hypothetical protein (Xanthobacter sp. DMC5)
MRAQPARRRFRAATVLTAVAALWSRAAEAQPSLAAGGLGGAPTLKTTLIALIAAAVIGTA
AFMATLVVARRRAAAGTPAETIPAPRKAGAERAAPAVPNSRGAMRAVLLLLGLGAILLVV
TVLVYQASVQSALSETRKLLGTLAEVQVRNLAQERQRLETGALALGSSRFLADDVAGWHG
EPSSERRERLRLLLGESARAFGFTGAELWANNGAFIMASGAEPGRPDMAEMVRQVAATGQ
MAEAPPPRDSTRAGGFAFLVNVPVRDAEHDPCCVLFLQSAVRPEAHPMLSAVFRTARSAE
ALLVGPDPRGGVRVTSAVSRDQAAPLPRTFVADPYPFAAASDRPGQSGAGIDPSGRAVLA
VRTPIPGDQSSLVLKVDRDEILGGARTSAFFSAALACALLLLAVAVGRGLLQKSRMRDAA
RELSAARAVSEAEARFRAAFDQAATGILHLDLDGTILRANDKACRMLGRPASALTGERLW
QLRSPDHAAAGADVIRPLLDGRIAEIETEESLVHPDGSAVWLAFTFSLVRNADGAADHLL
LVAQDISARRAAEAALRESEERFGLAVKGSDQGVWDLNVRTGSLYLSQRAREMLGVGPDE
AVDIRRPWHELLHPDDRRKVAPAWKAFLSGGVPTLELDIRLRRPDGSYHDFRWNGLAARD
AAGTISRAVGLLADITSSKQTERHLRLAAAVFANSHDGLVVTNLKSEVSAVNPAFSRITG
YAEEEILGQNMRLLHSGRHDREFYRSLWSELTTAGIWQGEIWNRRKNGEIFLQQLNISTV
YDDSGAAQNFVGAFQDITQAKHSEFELDRIAHYDPLTDLPNRTLLASLLDLAVSRPGSRC
AVLFVDLDRFKTVNDSLGHMAGDAVLQMAAQRIQDELEAGATLGRHGGDEFVIILEGISG
PEDAATLANHVIQQMGRPFSMPGGGEIYLGASVGISLFPEDADTPASLLQHADSALAEAK
SHGRGGYAFYTQALTRGARVRMEMEADLRRALARDEFILNFQPVVNLATGRITGTEALVR
WQSPTHGLVTPNRFIPLAEETGLIEPLGNWVMEAACRQMAEWLDKGADLDFIAVNLSPRQ
FQRGALCEFVDDVLRRTGLPAHKLEVEITESLLFDARAAAERKLRQLKDTGVRIALDDFG
TGYSSLAYLKRFPISKLKIDRSFVRDLPRAADAEIATAIISIARALGFSVVAEGIEDPRQ
HDFLKARGCDFGQGHLFSRAVTGERLLRLALSGPLAPGPAPGTPRGAGPIGSTSLH