Protein Info for ABI39_RS13850 in Phocaeicola dorei CL03T12C01

Annotation: DUF5113 domain-containing protein

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

Feature map along the protein (amino acids 1 to 1465):

Signal peptide (Phobius): amino acids 1 to 23 (23 residues)

Transmembrane helices (Phobius):
amino acids 439 to 460 (22 residues)
amino acids 857 to 874 (18 residues)
amino acids 1031 to 1051 (21 residues)
amino acids 1188 to 1209 (22 residues)

PF17139: DUF5112
amino acids 35 to 301 (267 residues), 363.7 bits, E=9.4e-113

PF17140: DUF5113
amino acids 303 to 463 (161 residues), 211.9 bits, E=7.7e-67
amino acids 898 to 1053 (156 residues), 205.2 bits, E=9.2e-65

PF02518: HATPase_c
amino acids 736 to 856 (121 residues), 54.9 bits, E=1.7e-18
amino acids 1349 to 1464 (116 residues), 30.2 bits, E=8e-11
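The residue counts reported for each feature follow from inclusive coordinates: a feature spanning amino acids a to b covers b - a + 1 residues. The sketch below is one way to sanity-check the listing. The coordinates are copied from this page, while the names `FEATURES`, `span_length`, and `check_features` are illustrative and not part of any tool referenced here.

```python
# Feature coordinates copied from the listing on this page:
# (label, start, end, reported residue count). Coordinates are 1-based, inclusive.
FEATURES = [
    ("signal peptide", 1, 23, 23),
    ("transmembrane", 439, 460, 22),
    ("transmembrane", 857, 874, 18),
    ("transmembrane", 1031, 1051, 21),
    ("transmembrane", 1188, 1209, 22),
    ("PF17139 DUF5112", 35, 301, 267),
    ("PF17140 DUF5113", 303, 463, 161),
    ("PF17140 DUF5113", 898, 1053, 156),
    ("PF02518 HATPase_c", 736, 856, 121),
    ("PF02518 HATPase_c", 1349, 1464, 116),
]

PROTEIN_LENGTH = 1465  # from the "Protein Sequence (1465 amino acids)" heading


def span_length(start, end):
    """Number of residues in an inclusive 1-based interval."""
    return end - start + 1


def check_features(features, protein_length):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    for label, start, end, reported in features:
        if not (1 <= start <= end <= protein_length):
            problems.append(f"{label}: {start}-{end} falls outside 1-{protein_length}")
        elif span_length(start, end) != reported:
            problems.append(
                f"{label}: {start}-{end} spans {span_length(start, end)} residues, "
                f"but {reported} were reported"
            )
    return problems


if __name__ == "__main__":
    issues = check_features(FEATURES, PROTEIN_LENGTH)
    print("all feature lengths consistent" if not issues else "\n".join(issues))
```

A check like this only confirms that the coordinates and counts agree with each other; it says nothing about whether the domain or helix predictions themselves are correct.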

Best Hits

KEGG orthology group: None (inferred from 98% identity to bvu:BVU_2713)

Predicted SEED Role

"putative two-component system sensor histidine kinase"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Search structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (1465 amino acids)

>ABI39_RS13850 DUF5113 domain-containing protein (Phocaeicola dorei CL03T12C01)
MKNMCLLPVWAVLLIIAGTFFSCEKKKDMAIYRQADSLNLLSYHMRYKNLDTACKAAHDA
YKLADGFPSLRAGALNNQGFCAFIHMDFEKAEDLFLRVYEESNNELECLIADIGMMKICQ
RTAMNKEFYDYRNSALRRMKRISDDRSAITDPGELERLNYARSEFSIASAIYYYYLQQEQ
QSLEAINEIKVDEALESDTAQLLYYYYMKGSGGMYEADTPQDVVLGEFNYLIECLGISRE
HGYIYFEANASQAMAELLKERKNFDLIMERRPNVMRAINSEDLPWEELTMRFAWQALDLF
KKYGDLYQISGTYRTLASCSNEQGRYEDALHYLSEALGYVNRHHEKYYHCTDTMDRLRPY
VPMATTSIELEWINDDGIKSVPEWIARFREQLSVTYAALGMKPQSDYNRNIYLDILDYTR
QDKELESRYNALEKESEALNGLLVVVVIGIAVLIILFWILNKRWRVRNTLYIDKLKRTLE
ICRKITASVPIDAGEIEDVTKAVVASVKEDILPLVGATDFRIVAENGEGREVPGQGICTS
FILNIPSREQPLGEVHLYSEHKMKKDDKALMRVITPYISWTLENGLAFISLGDERKRLEK
EQYVHEQHLAENKRQNLVKKACLFIVTGIMPYIDRIMNEVHKLTVHNYIQNEEIKESKYR
YIDELITRINEYNDILALWIKMRQGTLSLNIENFELNSLFDVLVKGRKTFEMKQQTLTIE
PTAAIVKADKALTLFMINTLTENARKYTQPGGNVSVYAQETESYVEISVKDDGPGLSQED
VERILSEKVYDSGKIGLQTSENVSELQKNKGHGFGLMNCKGIIDKYKKTNEIFRVCLFRI
ESELGKGSRFYFRLPKGVRKTLMLVLVAFLSVLTGCNEGREKDGKAEQLTLNDSIQRYDK
LLAVANEYAYDVYNCNIDGLYQQALCYADSALHCLNKHYIMYSGSKGPLLELEGEGAAAD
LDWFNRHFDTDYYALLDVRNEAAVAFLALGNLEAYRYNNNAYTALYKQISEDTSLEQYCR
QMQLSANNKTVAIILCVVILLVLLVGYYILYFRHRLIYRYNLEQVLEINKQVFSASLLDG
RTDRDIAASLVNDMFETVNELLPIDVLGVAVYSEDNHSLNYAFSPVEDENEEMREMMARS
FDAKTSYWREKDRLKCLPLWVEAGNENRCTGVLALKIALPVEREDDRLMLELVAGYVAII
VYNAVVLMAQKYRDIESAQDDARRAIREENQLHVQNLVLDNCLSTIKHETIYYPNRIKQI
IDRLNTRQAGENEAVQVETIGELISYYKDIFTLLSSCAARQLEEITFRRGVVKAGELADY
AARYIKRAGKRMPHRVELRTEVEHVSVLGDVIQLKFMLENLIDEALSYEVDGLLELCIYK
DKDFVRFDFRDTRREKSQEELNLLFYPHLSRMKQGQEGVLTGTEYLICKQVIRDHDEFAG
RRGCRINAQPAAEGGFTVWFTLPAR
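
The record above is in FASTA format: a header line starting with ">" followed by the sequence wrapped across several lines. As a minimal sketch, assuming the record has been saved as plain text, the following shows one way to rejoin the wrapped lines and count residues. The function name `read_fasta` and the toy record in the usage example are hypothetical and only serve to illustrate the format.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a dict mapping header (without '>') to sequence."""
    records = {}
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records[header] = "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)  # sequence lines are joined without separators
    if header is not None:
        records[header] = "".join(chunks)
    return records


if __name__ == "__main__":
    # Toy record for illustration only; not the protein on this page.
    toy = ">toy_protein hypothetical example\nMKNM\nCLLP\n"
    for name, seq in read_fasta(toy).items():
        print(name, len(seq))
```

Applied to the full record above, the joined sequence length should match the 1465 amino acids given in the section heading.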