Protein Info for Rv0101 in Mycobacterium tuberculosis H37Rv

Annotation: Probable peptide synthetase Nrp (peptide synthase)

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 200 400 600 800 1000 1200 1400 1600 1800 2000 2200 2400 2512 transmembrane" amino acids 882 to 902 (21 residues), see Phobius details amino acids 1270 to 1288 (19 residues), see Phobius details PF00501: AMP-binding" amino acids 473 to 760 (288 residues), 102.7 bits, see alignment E=6.2e-33 amino acids 1499 to 1832 (334 residues), 241.8 bits, see alignment E=3.4e-75 PF00550: PP-binding" amino acids 938 to 998 (61 residues), 31.8 bits, see alignment (E = 4.1e-11) amino acids 1992 to 2055 (64 residues), 51.9 bits, see alignment (E = 2.2e-17) PF00668: Condensation" amino acids 1023 to 1475 (453 residues), 291.1 bits, see alignment E=4.3e-90 TIGR01733: amino acid adenylation domain" amino acids 1519 to 1906 (388 residues), 418.5 bits, see alignment E=2.7e-129 PF13193: AMP-binding_C" amino acids 1890 to 1965 (76 residues), 32.8 bits, see alignment (E = 3.4e-11) TIGR01746: thioester reductase domain" amino acids 2109 to 2511 (403 residues), 409.8 bits, see alignment E=1.2e-126 PF01370: Epimerase" amino acids 2110 to 2317 (208 residues), 31.8 bits, see alignment E=3e-11 PF07993: NAD_binding_4" amino acids 2112 to 2372 (261 residues), 214.2 bits, see alignment E=5.8e-67

Best Hits

Predicted SEED Role

"probable peptide synthase"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (2512 amino acids)

>Rv0101 Probable peptide synthetase Nrp (peptide synthase) (Mycobacterium tuberculosis H37Rv)
VHRVRLSRSQRNLYNGVRQDNNPALYLIGKSYRFRRLELARFLAALHATVLDNPVQLCVL
ENSGADYPDLVPRLRFGDIVRVGSADEHLQSTWCSGILGKPLVRHTVHTDPNGYVTGLDV
HTHHILLDGGATGTIEADLARYLTTDPAGETPSVGAGLAKLREAHRRETAKVEESRGRLS
AVVQRELADEAYHGGHGHSVSDAPGTAAKGVLHESATICGNAFDAILTLSEAQRVPLNVL
VAAAAVAVDASLRQNTETLLVHTVDNRFGDSDLNVATCLVNSVAQTVRFPPFASVSDVVR
TLDRGYVKAVRRRWLREEHYRRMYLAINRTSHVEALTLNFIREPCAPGLRPFLSEVPIAT
DIGPVEGMTVASVLDEEQRTLNLAIWNRADLPACKTHPKVAERIAAALESMAAMWDRPIA
MIVNDWFGIGPDGTRCQGDWPARQPSTPAWFLDSARGVHQFLGRRRFVYPWVAWLVQRGA
APGDVLVFTDDDTDKTIDLLIACHLAGCGYSVCDTADEISVRTNAITEHGDGILVTVVDV
AATQLAVVGHDELRKVVDERVTQVTHDALLATKTAYIMPTSGTTGQPKLVRISHGSLAVF
CDAISRAYGWGAHDTVLQCAPLTSDISVEEIFGGAACGARLVRSAAMKTGDLAALVDDLV
ARETTIVDLPTAVWQLLCADGDAIDAIGRSRLRQIVIGGEAIRCSAVDKWLESAASQGIS
LLSSYGPTEATVVATFLPIVCDQTTMDGALLRLGRPILPNTVFLAFGEVVIVGDLVADGY
LGIDGDGFGTVTAADGSRRRAFATGDRVTVDAEGFPVFSGRKDAVVKISGKRVDIAEVTR
RIAEDPAVSDVAVELHSGSLGVWFKSQRTREGEQDAAAATRIRLVLVSLGVSSFFVVGVP
NIPRKPNGKIDSDNLPRLPQWSAAGLNTAETGQRAAGLSQIWSRQLGRAIGPDSSLLGEG
IGSLDLIRILPETRRYLGWRLSLLDLIGADTAANLADYAPTPDAPTGEDRFRPLVAAQRP
AAIPLSFAQRRLWFLDQLQRPAPVYNMAVALRLRGYLDTEALGAAVADVVGRHESLRTVF
PAVDGVPRQLVIEARRADLGCDIVDATAWPADRLQRAIEEAARHSFDLATEIPLRTWLFR
IADDEHVLVAVAHHIAADGWSVAPLTADLSAAYASRCAGRAPDWAPLPVQYVDYTLWQRE
ILGDLDDSDSPIAAQLAYWENALAGMPERLRLPTARPYPPVADQRGASLVVDWPASVQQQ
VRRIARQHNATSFMVVAAGLAVLLSKLSGSPDVAVGFPIAGRSDPALDNLVGFFVNTLVL
RVNLAGDPSFAELLGQVRARSLAAYENQDVPFEVLVDRLKPTRALTHHPLIQVMLAWQDN
PVGQLNLGDLQATPMPIDTRTARMDLVFSLAERFSEGSEPAGIGGAVEYRTDVFEAQAID
VLIERLRKVLVAVAAAPERTVSSIDALDGTERARLDEWGNRAVLTAPAPTPVSIPQMLAA
QVARIPEAEAVCCGDASMTYRELDEASNRLAHRLAGCGAGPGECVALLFERCAPAVVAMV
AVLKTGAAYLPIDPANPPPRVAFMLGDAVPVAAVTTAGLRSRLAGHDLPIIDVVDALAAY
PGTPPPMPAAVNLAYILYTSGTTGEPKGVGITHRNVTRLFASLPARLSAAQVWSQCHSYG
FDASAWEIWGALLGGGRLVIVPESVAASPNDFHGLLVAEHVSVLTQTPAAVAMLPTQGLE
SVALVVAGEACPAALVDRWAPGRVMLNAYGPTETTICAAISAPLRPGSGMPPIGVPVSGA
ALFVLDSWLRPVPAGVAGELYIAGAGVGVGYWRRAGLTASRFVACPFGGSGARMYRTGDL
VCWRADGQLEFLGRTDDQVKIRGYRIELGEVATALAELAGVGQAVVIAREDRPGDKRLVG
YATEIAPGAVDPAGLRAQLAQRLPGYLVPAAVVVIDALPLTVNGKLDHRALPAPEYGDTN
GYRAPAGPVEKTVAGIFARVLGLERVGVDDSFFELGGDSLAAMRVIAAINTTLNADLPVR
ALLHASSTRGLSQLLGRDARPTSDPRLVSVHGDNPTEVHASDLTLDRFIDADTLATAVNL
PGPSPELRTVLLTGATGFLGRYLVLELLRRLDVDGRLICLVRAESDEDARRRLEKTFDSG
DPELLRHFKELAADRLEVVAGDKSEPDLGLDQPMWRRLAETVDLIVDSAAMVNAFPYHEL
FGPNVAGTAELIRIALTTKLKPFTYVSTADVGAAIEPSAFTEDADIRVISPTRTVDGGWA
GGYGTSKWAGEVLLREANDLCALPVAVFRCGMILADTSYAGQLNMSDWVTRMVLSLMATG
IAPRSFYEPDSEGNRQRAHFDGLPVTFVAEAIAVLGARVAGSSLAGFATYHVMNPHDDGI
GLDEYVDWLIEAGYPIRRIDDFAEWLQRFEASLGALPDRQRRHSVLPMLLASNSQRLQPL
KPTRGCSAPTDRFRAAVRAAKVGSDKDNPDIPHVSAPTIINYVTNLQLLGLL