Protein Info for Echvi_4083 in Echinicola vietnamensis KMM 6221, DSM 17526

Annotation: hypothetical protein

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 200 400 600 800 1000 1200 1400 1600 1800 2000 2200 2378 signal peptide" amino acids 1 to 40 (40 residues), see Phobius details TIGR04189: cell surface protein SprA" amino acids 91 to 2376 (2286 residues), 2483.9 bits, see alignment E=0 PF14349: SprA_N" amino acids 109 to 380 (272 residues), 99.1 bits, see alignment E=1.6e-32 amino acids 1122 to 1618 (497 residues), 527.3 bits, see alignment E=2.6e-162

Best Hits

Predicted SEED Role

"FIG00648977: hypothetical protein"

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

See L0G617 at UniProt or InterPro

Protein Sequence (2378 amino acids)

>Echvi_4083 hypothetical protein (Echinicola vietnamensis KMM 6221, DSM 17526)
MTIVGSYFLKSSNYPKNFSATLSLLLAVFLLFGWQAAQAQGVSSDSIPSKLDSLEERRAL
PSFLLWDNYRTNPLYPNQSPLDYRSSPFYNQSPQEQQVEIELDSTLQYRVYDRLDSTDIN
PGYTFDFEEFSKIQELRMRQQYWRDRSKGMYGESAVSGRGLIPPITMSPTFDRLFGGSDI
NIVPTGYINLDFGGIFRRIDNPSIPIRQQQNGGFNFDQQIQMSVNGNLGEKMSINANFDS
NNSFDFQNQLKVEYRGFEEDIIKSIEIGNVSMPVQNSLIQGSQNLFGVKTQLQFGKLFMT
AVASTQRGQRDEIVIEGNGQGRPFELRASEYDDNRHFFLAHFFRDNYERWLRGLPQVLSG
VNVTRVEVYIMNRANNTETLRNFTAFMDLGEGERIFRPGNPNIGSGTPGSPASNDANDLF
DNLTANPNFRPYDQASNQIESGLELVKGTDFVQVNGARKLAENEYTFHRELGYLSLNRKI
LNDEVVAVSFEYTYNGQAYKVGEMSEDYQNRPESDVLFLKLLRPARINTNVPTWDLMMKN
VYSFNANQIQKEGFQLQVIYRDDRTGLDNPSLLEGQQVKDVPLIRLMGLDNLNPQNDPAP
DGNFDFVPGLTILPEKGMLIFPKLEPFGQTLADNFLPNEESLKDKFVYDTLYRTTKADAE
LVTRLNKYYIKGSLTAGSSSEIMLPGLNISEGSVIVNAGNIPLTEGVDYTVDYNIGRVVI
INEGILASGKRISISFEKADLVSFQTRSLLGTRFDYIFNDNLTLGGTFLYLNERPNITRI
STGSETIKNSLWGLDVNYSEKSRWLTKLADALPFTDTKDESLVTFSGEFAHLIPGTSNQV
DGEGASYIDDFETAVTPFSLGGSPQSWKLSSTPKTEDNRFDMSMMTEDQLGSAYRRARLA
WYNIDNVFYRRSGQGVPENITDEDRQNYYVKSFSPQEIFEGRDQDAIVLPEPLFDLAYYP
SERGMYNYNPNLNNEGLLPNPEDNYGGISRVITSDVDFDRTNIEYIEFWMLDPFISGPNG
RVLDGVFNENNTTGGKLVFNLGEISEDVMKDGRQAFENGLPKDGDPSETTENEWGRVTNQ
QYLTPGFDNSPESRAFQDVGLDGLSSEQEVDYFQDRFLSRLNVNADAYARIEADPSGDTF
QYYLGDELDQQNIKIQERYKYFNGMENNTPISANSNQAYTASGSNKPDNEDLNGDNTINE
VENYYEYEIDLEPGGLEVGRNHVVGKSTRNVNGQDITWYQFRIPVRQPDKVQGDISGFKS
LRFIRTYLTDFEQPVVLRMAQFRMVGSQWRTFRESLYERGLFEVPEPDVSNMTVGVVNIE
ENGQGNSRQSPYVLPPGINRDRDNTSTVERQLNEQSLSLCIDNLQSRDARAVFKNSNLDL
VQYGRLKMFFHADSEDALDGELTAFLRLGTDYTDNYYEIEVPLNITPKGTRDPNQIWPSA
NEIDIAIEEIVGVKSARDNSQHPQNLPYTEQIRQYNVTVVGRPELNYVQGLMIGVRNPAS
TGSASKSICVWANELRVVDFNKSNGWAANARLNAKIADVATISSSIRHNTFGFGGLETRL
SERSRESTTRYDVSANVQVDRILPEGLNVSIPMYVSMENSTTKPQFDPLNPDVPFELALR
KFRTNSEREAYRDLVLDQVKRRNISFSNVRKLPNEKKQKNHLYDLSNFSFSYAYGLVKQS
NINTEDYTYESYRGNISYNYAPNPVVVEPFKNWGIFSSPYLQLMRDFNLNLAPSQITASL
DVDRKFMRTQYRNDQLTTEGVDPLFQKSFYINRFFGLNWDLTKNLSFDYRASVNAIVDEP
EGDLDTETKQDSVKSNVKKLGRPTNYNHSFIANYTLPLDKVPALDWISADFRYEATYSWL
TGSIGQRDTLGNVIQNTRNRTLNGKLDFVSLYNKSAKLRALNAPKRPSIPGQRGNQQDSI
ITDGFGNGLLKFAMMLKEISGQYSITEGTFLPGYMENTGLFGLDRSFMNPGLGFLFGSQN
ASIRHDLADRGVMAPSSELTQAFRQNQVKNLQIQAMLEPTRDFRITLDMRKRETGQYSEI
FRKESGTDSDYLSVNPNRLGAYNITYNMIKTTFAKDDADNNSPLFDDFESYRSVIKGRLD
AINPGGEYNINGQDVLVPAFLAAYSGKSPQEISLNPFPKFPLPNWRVEYRGLSRLSWFEE
NFSSINLTHNYTSTYDVSNFSSSLLYQNGLELYNKLENYPAASMTDEYGSYIPVFILNQV
VLSERFGPFLGVDILTKNRMNISFEFNKERAIGLNFSNAQVTEQKSKDFRFELGYTKSGV
KIPFKIQGQQEILDNDLEIRVSTSIVDTQTLQRKLEEGSTITNGNVNLQIRPSLGYIINQ
NLKITFYFDRTVNDPRITTAYRRSSTAFGGQLRFNLGQ