Protein Info for NIAGMN_05780 in Escherichia coli ECRC102

Name: pAAR
Annotation: type IV secretion protein Rhs

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1616 transmembrane" amino acids 66 to 99 (34 residues), see Phobius details PF05488: PAAR_motif" amino acids 267 to 310 (44 residues), 38.5 bits, see alignment (E = 1.7e-13) PF20148: DUF6531" amino acids 393 to 454 (62 residues), 45.9 bits, see alignment (E = 8.6e-16) TIGR01643: YD repeat (two copies)" amino acids 553 to 592 (40 residues), 30.9 bits, see alignment (E = 2.7e-11) amino acids 609 to 640 (32 residues), 18 bits, see alignment (E = 3.2e-07) amino acids 657 to 690 (34 residues), 22.9 bits, see alignment (E = 9.4e-09) amino acids 670 to 711 (42 residues), 36.5 bits, see alignment (E = 4.5e-13) amino acids 716 to 752 (37 residues), 22.2 bits, see alignment (E = 1.5e-08) amino acids 755 to 793 (39 residues), 24 bits, see alignment (E = 4.1e-09) amino acids 860 to 900 (41 residues), 23.4 bits, see alignment (E = 6.5e-09) amino acids 967 to 1005 (39 residues), 34.4 bits, see alignment (E = 2.1e-12) amino acids 986 to 1013 (28 residues), 20.9 bits, see alignment (E = 3.8e-08) amino acids 1250 to 1281 (32 residues), 23.8 bits, see alignment (E = 4.6e-09) PF05593: RHS_repeat" amino acids 609 to 644 (36 residues), 28.3 bits, see alignment (E = 2.8e-10) amino acids 670 to 707 (38 residues), 34.4 bits, see alignment (E = 3.5e-12) amino acids 734 to 770 (37 residues), 27.9 bits, see alignment (E = 3.8e-10) amino acids 860 to 896 (37 residues), 30.8 bits, see alignment (E = 4.5e-11) amino acids 945 to 982 (38 residues), 24.8 bits, see alignment (E = 3.4e-09) amino acids 986 to 1017 (32 residues), 24.9 bits, see alignment (E = 3.3e-09) amino acids 1250 to 1279 (30 residues), 23.3 bits, see alignment (E = 1e-08) TIGR03696: RHS repeat-associated core domain" amino acids 1399 to 1461 (63 residues), 65.6 bits, see alignment (E = 4.4e-22)

Best Hits

KEGG orthology group: None (inferred from 100% identity to ecf:ECH74115_0647)

Predicted SEED Role

"Rhs core protein with extension"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

Find the best match in UniProt

Protein Sequence (1616 amino acids)

>NIAGMN_05780 type IV secretion protein Rhs (Escherichia coli ECRC102)
MSEGPGGPQGATAGGTLAMRMLSQRAMAASQMKRAANDKAIAQMLASKKSGPPAARLGDE
IQHKSFLGALAGAVLGAIVTIAEGCLIMAACATGPYALVLVPALMYASYKASDYVEEKQN
QLESWINSFCDTDGAINTGSKNVKINGKPAARAAVTLPPPPPPGAIPEVPQGEPSWGDIA
TDLLESAAEKAVPLAKAWGNAVITLTDSNAGFMDRVSAGASLLFPAGPVLMEFATMVGGR
GEIKKDVDFPEAGEDTALCDKENKPPRIAQGSSNVFINNQPAARKGDKLECSAAIVEGSP
DVFIGGEQVTYLDIQPEFPPWQRMILGGITIASYLLPPAGLLGKLGNLAKLGKLGNLLGK
SGKLLGAKLGALLGKTGNSLKSIANKVIRWVTDPVDPVTGAYCDERTDFTLGQTLPLSFT
RFHSSVLPLHGLTGVGWSDSWSEYAWVREQGNRVDVISLGATLNFAFDGESDTAVNPYHA
QYILRRRDDYLELFDRDALSSRFFYDAFPGMRLRHPVTDDTSDDRLAHSPADRMYMLGGM
SDTASNRITFERDTQYRITGVSHTDGIRLKLTYHASGYLKAIHRTDNGIQTLATYEQDAR
LDYHLFYEYDAADRIIRWSDNDQTWSRFTYDAQGRCVTVTGAEGYYNATLDYGDGCTTVT
DGKGIHRYYYDPDGNILREEAPDGSTTTYEWDEFHHLLARHSPAGRVEKFEYNAAHGQLS
RYTAADGADWQYCYDERGLLSNITAPAGQTWTQQCDERGLPVSLVSPQGEETRLAYTPQG
LLSGIFRQDERRLGIEYDHHNWPETLTDVMGREHHTEYSGHDLPVKMRGPGGQSVRLQWQ
QHHKLSGLERAGTGAEGFRYDRHGNLLAYTDGNGVVWTMEYGPFDLPVARTDGEGHRWQY
RYDKDTLQLTEVINPQGESYLYILDNCGRVTEERDWGGVVCRYRYDADGLCTARVNGLEE
TILYSRDAAGRLAEVITPEGKTQYAYDKSGRLTGIFSPDGTSQRTGYDERGRVNVTTQGR
RAIEYHYPDEHTVIRCILPPEDERDRHPDGSLLKTTYRYNAAGELTEVILPGDETLTFSR
DEAGREVLRHSNRGFACEQGWNAAGQPVSQRAGLFPAEATWGGLLPSLLREYRYDSAGNV
SGVTSREDYGRETHREYRLDRNGQVTAVTASGTGLGYGEGDETYGYDSCGYLKAQSAGRH
RISGETDQYAAGHRLKQAGNTQYDYDAAGRMVSRTKHRDGYRPETERFRWDSRDQLTGYR
SAQGEQWEYRHDASGRRTEKRCDRKKIRFTYLWDGDSIAEIREYRDDKLYSVRHLVFNGF
ELISQQFSRVRQPHPSVAPQWVTRTNHAVSDLTGRPLMLFNSEGKTVWRPGQTSLWGLAL
SLPADTDYPDPRGERDPEADPGLLYAGQWQDAESGLCYNRFRYYEPETGMYLVSDPLGLQ
GGEQTYRYVPNPCGYIDPLGLAICQLARWTKWGSEQSNISDVLNSLGNRALKYANGDWIK
SEAAFNKYINMINKRLELTGSKFRVEIQPAIKNGERVPATTNGPFKVNGKWTSGTHYTGG
SKRLDAGIIDITSPTNQYGLHPVIEGFDITLNKTKPSAVDIYSDVFGGIDINDFRL