Annotation: glycosyltransferase family 41 protein
These analyses and tools can help you predict a protein's function, but be skeptical.
For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect.
For other types of proteins, the error rates may be much higher.
MetaCyc and Swiss-Prot have low error rates,
but the best hits in these databases are often quite distant,
so this protein's function may not be the same.
TIGRFam has low error rates.
Finally, many experimentally-characterized proteins are not in any of these databases.
To find relevant papers, use PaperBLAST.
Protein Families and Features
1
50
100
150
200
250
300
350
400
450
500
550
600
650
700
750
813
PF07721: TPR_4 "
amino acids 7 to 27 (21 residues), 12.4 bits, see alignment (E = 0.0002)
amino acids 211 to 231 (21 residues), 13.6 bits, see alignment (E = 8.2e-05)
amino acids 241 to 263 (23 residues), 12.4 bits, see alignment (E = 0.0002)
PF13432: TPR_16 "
amino acids 9 to 70 (62 residues), 23.3 bits, see alignment
6.5e-08
amino acids 75 to 137 (63 residues), 27.4 bits, see alignment
3.5e-09
amino acids 143 to 205 (63 residues), 34.1 bits, see alignment
2.6e-11
amino acids 177 to 240 (64 residues), 43.1 bits, see alignment
4e-14
amino acids 212 to 270 (59 residues), 25.9 bits, see alignment
9.9e-09
amino acids 339 to 400 (62 residues), 27.6 bits, see alignment
2.9e-09
PF07719: TPR_2 "
amino acids 71 to 102 (32 residues), 24.6 bits, see alignment (E = 1.7e-08)
amino acids 172 to 204 (33 residues), 33.6 bits, see alignment (E = 2.2e-11)
amino acids 242 to 270 (29 residues), 23.3 bits, see alignment (E = 4.4e-08)
PF13181: TPR_8 "
amino acids 72 to 102 (31 residues), 16.8 bits, see alignment (E = 5.4e-06)
amino acids 104 to 135 (32 residues), 13.6 bits, see alignment (E = 5.9e-05)
amino acids 138 to 168 (31 residues), 25 bits, see alignment (E = 1.3e-08)
amino acids 172 to 204 (33 residues), 24.1 bits, see alignment (E = 2.4e-08)
amino acids 209 to 238 (30 residues), 14.9 bits, see alignment (E = 2.1e-05)
amino acids 242 to 271 (30 residues), 20.4 bits, see alignment (E = 3.9e-07)
amino acids 333 to 363 (31 residues), 17.9 bits, see alignment (E = 2.4e-06)
amino acids 368 to 399 (32 residues), 22.1 bits, see alignment (E = 1.1e-07)
PF13414: TPR_11 "
amino acids 77 to 114 (38 residues), 40.2 bits, see alignment
2e-13
amino acids 145 to 186 (42 residues), 38.7 bits, see alignment
5.8e-13
amino acids 179 to 217 (39 residues), 46 bits, see alignment
3e-15
amino acids 314 to 345 (32 residues), 35.5 bits, see alignment (E = 5.9e-12)
PF13431: TPR_17 "
amino acids 128 to 157 (30 residues), 22.6 bits, see alignment (E = 8e-08)
amino acids 160 to 193 (34 residues), 24 bits, see alignment (E = 3e-08)
PF00515: TPR_1 "
amino acids 138 to 170 (33 residues), 27.9 bits, see alignment (E = 1.3e-09)
amino acids 172 to 204 (33 residues), 41.7 bits, see alignment (E = 6e-14)
PF13374: TPR_10 "
amino acids 138 to 169 (32 residues), 21.3 bits, see alignment (E = 1.9e-07)
amino acids 176 to 200 (25 residues), 19.4 bits, see alignment (E = 8e-07)
amino acids 244 to 274 (31 residues), 25.6 bits, see alignment (E = 8.8e-09)
amino acids 337 to 362 (26 residues), 17.4 bits, see alignment (E = 3.4e-06)
PF14559: TPR_19 "
amino acids 152 to 210 (59 residues), 25.3 bits, see alignment
1.4e-08
amino acids 182 to 230 (49 residues), 30.1 bits, see alignment
4.6e-10
PF13424: TPR_12 "
amino acids 172 to 237 (66 residues), 35.1 bits, see alignment
E=1.1e-11
PF13176: TPR_7 "
amino acids 174 to 205 (32 residues), 24.1 bits, see alignment (E = 2.3e-08)
PF13174: TPR_6 "
amino acids 178 to 204 (27 residues), 12.8 bits, see alignment (E = 0.00014)
PF13844: Glyco_transf_41 "
amino acids 438 to 598 (161 residues), 90.9 bits, see alignment
E=5.4e-29
amino acids 620 to 796 (177 residues), 56.1 bits, see alignment
E=2e-18
Best Hits
KEGG orthology group: None (inferred from 87% identity to
bph:Bphy_2992 )
Predicted SEED Role "TPR domain protein, putative component of TonB system" in subsystem Ton and Tol transport systems
Sequence Analysis Tools PaperBLAST (search for papers about homologs of this protein)
Search CDD (the Conserved Domains Database, which includes COG and superfam)
Compare to protein structures
Predict protein localization: PSORTb (Gram-negative bacteria)
Predict transmembrane helices and signal peptides: Phobius
Check the current SEED with FIGfam search
Find homologs in fast.genomics or the ENIGMA genome browser
Find the best match in UniProt
Protein Sequence (813 amino acids) >QEN71_RS29105 glycosyltransferase family 41 protein (Paraburkholderia sabiae LMG 24235)
MDHANLLNTALAHHQAGRLAEAKAIYDEILRANPRHSDALHFLGLLACQIQQYDAGITLM
RQSIAILPNAIYHNNLGNALREHGQLKQSIDSYREAVTLNPGYAEAHNNLGNALREDRQP
DAAMHSCAHAIELRPGYAEAYNNLGNALKDLGETDSAVLAYRKAISFRQHYADAHNNLGN
ALTEQGKYDEAIASYHSAIALDPNRALIHNSLGTLYLARGELAEAAASLRRSVELDPERP
GVHNNLANTLRDMGDLEAARAHYAKALQLAQAVVDAYQTGGGATSLAHLRFPKPQLTLAE
AFATLGNAWYGLYRYEEAIESYLRAVALTDDDAEVHHNLAVAYLKTERANEALRYARKAL
ELKDGSSRMHINLGDVLRSLGELEAAVNSYRSAIERSPDADVAHTALLFCEASMSQRPVE
EFLADAIYFGKRMAANVTPFAHTRAPRGTRPLRVGFVSGDLRTHPVGIFTESVLRHVDPS
RIELIAYQTNDVEDETTQRLKPLFGEWTQLWKLSREAAAQRIFDDRIDILLDMSGHTAFN
RLSVFAMKPAPVQVTWLGFFASTGIAQMDYVLGDRYVLPPEEAHHFIEKPWHLPDGYLCM
TPPEHDVSVGPLPMQANGFVTFGYLGKLAKMTDEVLDLWVRVLREVPDSRLLIKAHELDR
KHAVDATLQRFAERGIDASQLILEGGSKRGEYFKTYHRVDIVLSPFPYPGGTTTAESLWM
GVPVVAMKGDRFLGHICESVLHSAGFGEWICDDQASYVQKVRELASSVESLAKLRAGLRE
HVLASPMCDAHRFARNFEDALEGMWSAYEESGN