Annotation: conserved repeat domain
These analyses and tools can help you predict a protein's function, but be skeptical.
For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect.
For other types of proteins, the error rates may be much higher.
MetaCyc and Swiss-Prot have low error rates,
but the best hits in these databases are often quite distant,
so this protein's function may not be the same.
TIGRFam has low error rates.
Finally, many experimentally-characterized proteins are not in any of these databases.
To find relevant papers, use PaperBLAST.
Protein Families and Features
1
500
1000
1500
2000
2500
3000
3500
4000
4500
5000
5500
6228
signal peptide "
amino acids 1 to 26 (26 residues), see Phobius details
PF01345: DUF11 "
amino acids 31 to 143 (113 residues), 103.2 bits, see alignment (E = 5.5e-33)
amino acids 153 to 268 (116 residues), 89.4 bits, see alignment (E = 1.1e-28)
amino acids 278 to 367 (90 residues), 25.3 bits, see alignment (E = 8.2e-09)
amino acids 1630 to 1720 (91 residues), 27.1 bits, see alignment (E = 2.2e-09)
amino acids 1951 to 2049 (99 residues), 29.5 bits, see alignment (E = 4.1e-10)
amino acids 2058 to 2165 (108 residues), 25.1 bits, see alignment (E = 9.7e-09)
amino acids 6017 to 6135 (119 residues), 67.8 bits, see alignment (E = 5.6e-22)
TIGR01451: conserved repeat domain "
amino acids 37 to 81 (45 residues), 51.7 bits, see alignment (E = 5.6e-18)
amino acids 156 to 205 (50 residues), 41.2 bits, see alignment (E = 1.1e-14)
amino acids 284 to 316 (33 residues), 26.1 bits, see alignment (E = 5.8e-10)
amino acids 403 to 433 (31 residues), 20.4 bits, see alignment (E = 3.2e-08)
amino acids 519 to 554 (36 residues), 21.4 bits, see alignment (E = 1.6e-08)
amino acids 637 to 664 (28 residues), 22.1 bits, see alignment (E = 1e-08)
amino acids 752 to 780 (29 residues), 20.8 bits, see alignment (E = 2.6e-08)
amino acids 868 to 896 (29 residues), 25.6 bits, see alignment (E = 7.8e-10)
amino acids 975 to 1004 (30 residues), 24.5 bits, see alignment (E = 1.8e-09)
amino acids 1090 to 1118 (29 residues), 22.7 bits, see alignment (E = 6.3e-09)
amino acids 1205 to 1234 (30 residues), 25.7 bits, see alignment (E = 7.6e-10)
amino acids 1322 to 1350 (29 residues), 26.5 bits, see alignment (E = 4.1e-10)
amino acids 1430 to 1457 (28 residues), 24.9 bits, see alignment (E = 1.3e-09)
amino acids 1527 to 1564 (38 residues), 27.1 bits, see alignment (E = 2.8e-10)
amino acids 1642 to 1671 (30 residues), 33.5 bits, see alignment (E = 2.7e-12)
amino acids 1752 to 1780 (29 residues), 26.8 bits, see alignment (E = 3.4e-10)
amino acids 1857 to 1884 (28 residues), 29.3 bits, see alignment (E = 5.4e-11)
amino acids 1961 to 1992 (32 residues), 24.5 bits, see alignment (E = 1.7e-09)
amino acids 2071 to 2098 (28 residues), 23.5 bits, see alignment (E = 3.5e-09)
amino acids 2185 to 2214 (30 residues), 16.7 bits, see alignment (E = 4.8e-07)
amino acids 2418 to 2448 (31 residues), 28.6 bits, see alignment (E = 9e-11)
amino acids 2626 to 2661 (36 residues), 26.2 bits, see alignment (E = 5.2e-10)
amino acids 2744 to 2770 (27 residues), 24.7 bits, see alignment (E = 1.5e-09)
amino acids 2851 to 2881 (31 residues), 18.9 bits, see alignment (E = 1e-07)
amino acids 2966 to 2991 (26 residues), 17.5 bits, see alignment (E = 2.7e-07)
amino acids 3074 to 3102 (29 residues), 18.4 bits, see alignment (E = 1.4e-07)
amino acids 3178 to 3209 (32 residues), 33.7 bits, see alignment (E = 2.3e-12)
amino acids 3288 to 3314 (27 residues), 17.5 bits, see alignment (E = 2.6e-07)
amino acids 3398 to 3424 (27 residues), 22.5 bits, see alignment (E = 7.5e-09)
amino acids 3499 to 3531 (33 residues), 24.1 bits, see alignment (E = 2.3e-09)
amino acids 3610 to 3638 (29 residues), 24.8 bits, see alignment (E = 1.4e-09)
amino acids 3714 to 3745 (32 residues), 25.2 bits, see alignment (E = 1.1e-09)
amino acids 3930 to 3959 (30 residues), 23.1 bits, see alignment (E = 4.7e-09)
amino acids 4036 to 4066 (31 residues), 24.5 bits, see alignment (E = 1.8e-09)
amino acids 4358 to 4387 (30 residues), 16.8 bits, see alignment (E = 4.6e-07)
amino acids 4466 to 4494 (29 residues), 22 bits, see alignment (E = 1e-08)
amino acids 4564 to 4599 (36 residues), 23.8 bits, see alignment (E = 3e-09)
amino acids 4681 to 4708 (28 residues), 27.9 bits, see alignment (E = 1.5e-10)
amino acids 4897 to 4925 (29 residues), 30.7 bits, see alignment (E = 2.1e-11)
amino acids 5003 to 5032 (30 residues), 23.5 bits, see alignment (E = 3.7e-09)
amino acids 5211 to 5245 (35 residues), 21.6 bits, see alignment (E = 1.4e-08)
amino acids 5321 to 5353 (33 residues), 23.3 bits, see alignment (E = 4.1e-09)
amino acids 5531 to 5566 (36 residues), 24.1 bits, see alignment (E = 2.3e-09)
amino acids 6032 to 6069 (38 residues), 28 bits, see alignment (E = 1.4e-10)
PF24346: DUF7507 "
amino acids 273 to 373 (101 residues), 79.8 bits, see alignment (E = 7.3e-26)
amino acids 389 to 489 (101 residues), 77.7 bits, see alignment (E = 3.3e-25)
amino acids 505 to 607 (103 residues), 75.1 bits, see alignment (E = 2.1e-24)
amino acids 623 to 722 (100 residues), 77.3 bits, see alignment (E = 4.4e-25)
amino acids 738 to 837 (100 residues), 80.9 bits, see alignment (E = 3.2e-26)
amino acids 853 to 948 (96 residues), 83.7 bits, see alignment (E = 4.4e-27)
amino acids 959 to 1059 (101 residues), 81.5 bits, see alignment (E = 2.1e-26)
amino acids 1075 to 1175 (101 residues), 76 bits, see alignment (E = 1.1e-24)
amino acids 1192 to 1291 (100 residues), 76.7 bits, see alignment (E = 6.6e-25)
amino acids 1308 to 1402 (95 residues), 75.4 bits, see alignment (E = 1.7e-24)
amino acids 1413 to 1509 (97 residues), 92.8 bits, see alignment (E = 6.6e-30)
amino acids 1522 to 1616 (95 residues), 70.1 bits, see alignment (E = 7.5e-23)
amino acids 1627 to 1723 (97 residues), 93 bits, see alignment (E = 5.4e-30)
amino acids 1738 to 1825 (88 residues), 75.3 bits, see alignment (E = 1.7e-24)
amino acids 1841 to 1937 (97 residues), 75.7 bits, see alignment (E = 1.4e-24)
amino acids 1948 to 2044 (97 residues), 93 bits, see alignment (E = 5.7e-30)
amino acids 2055 to 2155 (101 residues), 80.4 bits, see alignment (E = 4.7e-26)
amino acids 2170 to 2272 (103 residues), 66.5 bits, see alignment (E = 9.5e-22)
amino acids 2288 to 2389 (102 residues), 65.4 bits, see alignment (E = 2.1e-21)
amino acids 2404 to 2500 (97 residues), 88.6 bits, see alignment (E = 1.3e-28)
amino acids 2511 to 2605 (95 residues), 69 bits, see alignment (E = 1.6e-22)
amino acids 2619 to 2715 (97 residues), 84.7 bits, see alignment (E = 2.2e-27)
amino acids 2728 to 2826 (99 residues), 88.5 bits, see alignment (E = 1.4e-28)
amino acids 2838 to 2934 (97 residues), 86.1 bits, see alignment (E = 7.7e-28)
amino acids 2950 to 3046 (97 residues), 73.9 bits, see alignment (E = 4.9e-24)
amino acids 3058 to 3154 (97 residues), 83.1 bits, see alignment (E = 6.6e-27)
amino acids 3165 to 3261 (97 residues), 92.7 bits, see alignment (E = 6.9e-30)
amino acids 3274 to 3369 (96 residues), 72.4 bits, see alignment (E = 1.4e-23)
amino acids 3383 to 3475 (93 residues), 71.1 bits, see alignment (E = 3.6e-23)
amino acids 3488 to 3584 (97 residues), 86 bits, see alignment (E = 8.3e-28)
amino acids 3595 to 3687 (93 residues), 84.5 bits, see alignment (E = 2.4e-27)
amino acids 3703 to 3797 (95 residues), 84.2 bits, see alignment (E = 3e-27)
amino acids 3810 to 3897 (88 residues), 74.8 bits, see alignment (E = 2.5e-24)
amino acids 3917 to 4011 (95 residues), 86 bits, see alignment (E = 8.6e-28)
amino acids 4024 to 4118 (95 residues), 80.6 bits, see alignment (E = 4.1e-26)
amino acids 4129 to 4224 (96 residues), 84.2 bits, see alignment (E = 3e-27)
amino acids 4236 to 4332 (97 residues), 87.2 bits, see alignment (E = 3.5e-28)
amino acids 4345 to 4438 (94 residues), 74.5 bits, see alignment (E = 3.2e-24)
amino acids 4450 to 4546 (97 residues), 87.9 bits, see alignment (E = 2.1e-28)
amino acids 4560 to 4653 (94 residues), 78.9 bits, see alignment (E = 1.3e-25)
amino acids 4666 to 4760 (95 residues), 88.3 bits, see alignment (E = 1.6e-28)
amino acids 4776 to 4870 (95 residues), 72.2 bits, see alignment (E = 1.6e-23)
amino acids 4882 to 4976 (95 residues), 83.8 bits, see alignment (E = 4.1e-27)
amino acids 4991 to 5083 (93 residues), 81.3 bits, see alignment (E = 2.4e-26)
amino acids 5097 to 5191 (95 residues), 81.9 bits, see alignment (E = 1.5e-26)
amino acids 5202 to 5297 (96 residues), 83.6 bits, see alignment (E = 4.8e-27)
amino acids 5309 to 5405 (97 residues), 88 bits, see alignment (E = 1.9e-28)
amino acids 5418 to 5511 (94 residues), 83.8 bits, see alignment (E = 4e-27)
amino acids 5525 to 5619 (95 residues), 67.7 bits, see alignment (E = 4.1e-22)
PF13585: CHU_C "
amino acids 6144 to 6225 (82 residues), 75.3 bits, see alignment (E = 1.7e-24)
TIGR04131: gliding motility-associated C-terminal domain "
amino acids 6144 to 6225 (82 residues), 38.7 bits, see alignment (E = 8.8e-14)
Best Hits
Predicted SEED Role "internalin, putative" in subsystem Listeria surface proteins: Internalin-like proteins
Sequence Analysis Tools PaperBLAST (search for papers about homologs of this protein)
Search CDD (the Conserved Domains Database, which includes COG and superfam)
Compare to protein structures
Predict protein localization: PSORTb (Gram-negative bacteria)
Predict transmembrane helices and signal peptides: Phobius
Check the current SEED with FIGfam search
Find homologs in fast.genomics or the ENIGMA genome browser
See L0G4D9 at UniProt or InterPro
Protein Sequence (6228 amino acids) >Echvi_3666 conserved repeat domain (Echinicola vietnamensis KMM 6221, DSM 17526)
MIVVRNALLLFMVLFGWVRIPSAYAQILTTDMQLSMTVDNSTPEVGETVTFTLAVQNAGN
IPASGVVVSNPLPSGLNYVSDNGGGNFDPSTGEWGIGIMTAGQGASLEVVATVEPTGSYT
ASATVSANQNDNFPGNNSSNVTLSPVPVASTSLSQTVSNETPSIGEEVTFTLTLTNDGPS
TATGVLVEGLLSNGLQYISSNASSGSFDREAASWTVNDLAAGNQVVLNVVTEVLQPISQN
ISYQLDASISSQNEIDPNEDDNSTAVSIFPVANPSWEITKSSPDASYAAPGDQVGFTITL
QNTGNVNVSGVQLTDELVDQAPVLVSGDSNNDGILNPEESWTYSATYTVTQADIDNGSIT
NTVSANGNPLKGELPDTVDEITIDAAQTPSWTLAKTSNTTPNEYTNPGDELTYEITVENT
GNVSISDVNVNDPTAGSGPTLISGDLDSDGVLDVDEIWTYEVRYTVTQADIDAGTYTNTA
TASGSPAGGVLDPASDQEIIPAVQNPSWSLTKTSNTSPNEFTVPGDQLTYEIILENTGNV
SISEVDVRDPMASGGPTLISGDDNGDGVLDVGESWSYEVDYTVTQEDIDAGDYTNTATAS
GAPAGGTLEDVSDNETVPAVQTPSWTISKDATTASYANPGDVIDYDITVTNTGNVSIRNV
DVQDEQASNGPNYSSGDDDGDQELDVGEVWVYTASYTVTQADIDNGSFTNSVTGTADPAG
GTLEDVTDQETVPAIQEPEWAIVKSTDEINYTKVGQVLNYTIEVTNTGNVSISSIAVTDP
QATSGPSYLSGDDGADEVMAPGETWVFSATHEVTQDDLNNGSFTNTVTANGDPAGGELDD
VTDEVTIPAVKDPAIRVLKVANKSGYSQVGEEITYTIFVNNSGNVSLSDVAVTDPLTGFA
DGIDVINSGDTIRFTTTYTVTQEDIDNGSITNTAVIDATDPDGNPLPSEDDEIINASQTP
KITLSKNTDQTSYDAVGQIITYEIVVSNTGNITIENPVVDDPLATSGPTYSSGDTDGDGV
LDVGESWVYTATHAVTQEDIDAGSFTNTVTANGDPIAGDLREETASETVDADQLPSWTIE
KSNTNPTNSYAQPGDELTYDIVIVNTGNVSISNVSVFDPKATTGPTYVAGDDANPDVLDV
GESWTYSATYTVTQKDIDNGSYTNTASANGNGIGGDLPEVSDDETVPAEALPSWTISKTA
GQDTYQTVGENVTYTIEVENTGNTSITNPVVSDPQATSGPTYQSGDTDGDGEIDPGETWS
YQATYQITQADITNGSYTNTATASGDPTNGTLDDVTDEETIDAVQLTELSLDKSVSPATY
DAAGQEVVFTIVVTNSGNMPINNVIASDPLLGVNEYIGTLQPGESRTITVPYTITQADVD
NGSITNTATTSGEDVEGGTVDDTDDATTNADQQPEISIEKSLDGLGYTGAGDELNYTLTV
TNTGNVTLTDVVVTDPLTGLSQTIPELAPGQTVEIPVTYTVTQEDINNGSVTNAAEVTAD
DPNGDPVIDDDDITTNGGKTGEVNIIKTVDESAFYYAGDTLHYNITVVNTGNVDITDVTV
TDPLTGLTETVPVVAPGDTLRYTTEHIVTQEEVDAGTITNTVEFEGTDPDGNEVVDNDET
NTNAAKNPELTVNKTSNPDEYTSAGDEITYTITVTNTGNVTLTDVVVTDPLTGYNNTIPT
LAPGETVTIEEEYIVTADDVSRGFITNTATAEGQDPDGNDVIDEDSNIINPSGEDRTELT
ITKTLEEFGYSDNGQVLHYTITVRNTGNVTLNDITVTDPLTGLDENIATLNPGETAVFEQ
EYTVTQADVDDAVITNTATAETSTPVVLEDSDTKIIYGTQTPDIELRKRVVENGYKSVGD
VLTYTIFVRNEGNVTLSNVQVTDPRMGIDQSYPSLAPDELQRIDFTYTVEQEDLDRGYIE
NIADVTSTAPKGEVLTDEDTVRIFAAQNPHLEMVKTASEASFGAVGDEINYTITVNNTGN
VSFFDLVVTDPLTGLNSSIGTLAPGEDAQITATYTVTQDDLDNGNVTNTAAVDGTDFDGN
PVNETSTVVVPAEQNPELTLTKTPDINEFDQPGETINYAITVTNSGNITITDVNVTDPQA
VNGPDYESGDTNDDGILEVGETWNFTADYTTTQADVDNGGFTNTVTASGTPAGGNLNDGT
ASATVPSVQSPSWTIEKTSTTAPNTYSSPGDQLTYEISLENTGNVSIRTISLADAGADNS
PLLVAGDTDSDNELDVDETWVFEASHTVSQADLDASSYTNTATASGDPSGGTLADISDDE
TIPATLNPAWTLTKVSTTQPNSFNSADDQLNFTLQLENTGNVTISNVQLMDEKVNEGPTY
RGGDTDQDGQLDVGETWNYSAKYLTVQADVDNGEVVNTAIASGTVEAGTLEDAIGETTVP
AIVNPQLTLRKSVEESGFTAPGEVLNYTIIVTNTGNQTISAVQVTDPLTGLSETIPTLAP
GQSITYNEPYTVTQNDLDVGTIENTATANGTDPDGNDITSEDTEVINGSQSPALDLIKGV
SENGYIHAGEVVNYTLVTQNTGNVTLFDVVIVDSLINLNIEIGQLDPGERRRYDEPYTIT
QADVDRGEIANTAYIIATDINGNELIDSDSELLIGTQLPSLLASKTSSTSNYDAVGDTIQ
YTFTLTNNGNVTLTDVLVTDPNAEVTSVNPVASLAPDESVVLQGRHVVTQADLDAGKYTN
RATAVGNDPNDVPISATTNQVTVPAIQTPQLTVTKATSTENYDAVGDVIAYTIAVENTGN
VTITNINVSDPKAEILDDPAIAQLAPGESVTLQASHTVTQADLDAGEYANTSEATGKDPN
NKNITDRSNEVVVPAVQEPALELTKSSTKTTYSAVGEVIPYSMVVTNTGNVTMHDIEVTD
PKAEILSGSPISSLAPGASAEITAQHVVVQADIDAGIYINQSTATGKDPKDKTLSVVSNE
VVLTAVQTAAFEIRKATTTPTYDAEGDVILYELEVENTGNVTLHNIVVSDPKAEITSGSP
IATLSPGKIAKLTAEHTITQGNVDAGSYTNQATATGLDVNDEQITNLSNEVTVNAVQSPE
IQITKTASPLNYETVGAEIDYTIVVTNTGNVTLSSVNVVDPLTGLEEFIGLLSPGSSRSF
TTDIVITQRDLDNGSITNTASTTAMDTHGNEVNDEDDETVTAIQEPSITLTKEGDKATVF
EVGEQINYTFTVTNTGNLTLTNVSLSDPLTGLSEGVVILTPGQTLTFNDSYVVTQADLDN
GQIPNTATVSGTAANGTLVNDADDFLVTADQIAAIEVNKEGDRFTYYNTGQVITYDLEVT
NTGNVTLSDVTLADAQLDFEQNLGEIAPGQTKRYTGLTYTITQPDLDNGSFLNTTIATGQ
APNGDTVSDEDFFEADALQAGVIGISKTATPRFFNNEGDEVTYTVVVTNEGNVTLTLVQV
NDPITGLNETIPTLAPGESQTYNTIYSVTQENVDNRTLLNTATASGITAGGEEVSVYDQY
RIYSYGIPAIEITKNANVNTFSQAGDVIEYTLRVENIGNQTLNDVVVEDPLTGFSETVST
LSPGEVALFSFTYTTTQEDVDRRLLENTATATGTGPNGNEVTDEDRVEIIPLWSASLDLQ
KTADPLLINQAGDQITYNFVLTNTGNITLTNVRISDSLIGAEQEVGTMAPGASQTYSVTY
TATQNQIDEGRIVNVATATAEAPNGQSGTVEDRAIVLVRRSGQIQVTKTADVSSVQVAGD
IINYDIEVTNNGNVTLTDVVITDPLTEFEQQLDSLIPGESQSFTTSYTVTQADIDAGEIP
NTVTARGVTPANREVVDEDEALVIAERNGSIEVVKTPEISSFDAASVTIPYRITVTNTGN
VSLSSVRVNDPLTELNQVVGTLEPGESQSYETTYQTTQQDVDNGMVTNVATARGRAPYSV
LLIDADTANVPADQRATVDLVKTADVDEYSTVGELITYTLTVTNTGNVSLSNGSLTDPLT
GVNLSGGTLATGETKAFTASYTVTQEDLDRGYIRNIATVRGNAPSGERVSDRDTLQISAV
QNTTIELTKSANISTYTQAGEVIRYSLNVSNAGNVTLSNVTVRDPLTGMDQMVGTMAPDA
SQGFVTRYTVTQEDVDAGAITNDAVAQGFTPSDQSVTDSAAVTVVGLQSPSISLTKLPDV
DSFDAAGYVINYSLEVENTGNTTLTAVQLVDSLTQTDRGLEDLAPGEVVTINASYTTTQE
DLDAGEVLNTATVTGESSLGEIVSGEDQANVPALVEPSIQLQKAANVTNYDALGQVITYT
LTAINTGNVTLDSVTVRDPHNDFEELIGTLSPGQDTTLTTTYVVEQGDLDAGNITNIATA
VGLDPFGEWVSDDATEVITADQLVALDLDKAVTPKTYSAVDQEITYTLTVINEGNVTLRN
VNTDDPLIGFNQTIPTLAPGDTVVFTQPYLIIQDDLDQGRIVNEATAVGDAPDGSTAEAE
DRATIIAKQAPAIEITKTATPRNYRSPGEVISYTLEVTNVGNVTLQSVNVTDPLTGLNRF
ILALAPGESRTFSTTYTVDQVDIDTGIVVNSATANGKAPDGERVNDVDTARVYALRAGVI
EIDKTASPKVFDAAGDVVTYTITVTNAGNLTLSSVQIADDKIGFSQNLPVLAPGASQSYT
VDYSLTQADVDGLVVVNTATATAMTPDDEEVTDSDKATVVARGEGAIELTKSSSVKIYEA
PGQVITYTIQVENIGNVSLSDVTVTDPLTDFEQQVGTMAPGEILTFTEEYTVTQQDIDNG
RIINTASTQGTTPIGRVVESEDRVIIQSIRSLDSGQILVSKEAQQTTYDEAGEVVEFEII
VTNSGILTLSNIHVLDSLTGMLEEVPTMAPGDSITYTTSYTILQEDLDRGQVTNTVEVDG
TTPKGRVVEATDDARIRGLQASAIAFGKTASPKVYHVPGDVITYTLIATNTGNTTLTNVQ
VSDPLTGFAATEDVLAPGDTVIFNTTYTISQADIDAGRVVNTAEVTALDPDGINISAEDD
ARVLALRLGRLGLEKTNVTPDYAEVGDVIDYTISVTNTGNATLVSVNVSDPLTGLNETIP
QLDPGQSVTYNTAYTVTQADLDRGEVANTATAQGLTPIGRNVRATDDALVPALVSGGISL
TKSPDVNDYDAVDDVINYSLEVANVGNVTLTEVQVRDPLTGLTQQVGKLVPGQSTTIQTD
YIIQQDDLDKGTVDNTATTSGRTPMGQAVRDTASAQVVAIQTPAISLTKSANKDTVSQVG
ERITYTLTVTNSGNVSLHDLSLVDSMTNFTETGALMAPGLTVTLETSYEVTQADLDAGMI
ENIAMVEGYSPSDELVSAQDTAVVHVSENPSILITKSANKDTVAQAGESIQYSLSITNTG
NVTLGEVNVTDPLTGLNEAVGTLIPGESTLVESQYEVTQADMDAGTIDNTAQVSGSSPAS
TTVQDEDTYRVYTQQEGAISLTKTASPSSYDAVGTVITYSLTASNTGTVTLSDVTVNDPR
TAFTENLGSLAPGASATTETTYTVTQEDLNNGSITNLADVSGISPNGQNVMAEDSALVMA
QQMAQITLEKSADKQEVFEAGEVITYDLVVTNTGNVDLVDISLTDPMTGLIEDVPALETG
ESITFSAQYTTTINDLAAGTSLINTATVQGFGVNGQEVEAVDQATVTIGCVGNTLITGRL
FNAATDEGLANVPVILVPENNQPGEGLMVLTDESGQYTFEGQSTGKYRLMVFDRNINKTQ
DLYPVNSNTVNLNLQNCTYLTYDFPYDNASTPVINGYVWYDLNSDETQNEWFDANGDGEV
TQNTIMQGEPVDISEWEWFDFNGDGSYEGPENDGELNKAGFGNPQGANIEISGPNGYFRK
ATINAYGFWQHALENADPFGEYTITLVPDSTFARNGIGLAATGLVSMLPNSGARLSAVQE
NLICEFTTEQVLVKNVTIGDVPDFDYGLSCRLEEEPIIANDDDFGEFFLSYDGVLGNVLD
NDLFNGEPAQPEDVTIVVTDADGLLGLNVAANGNLTVVPGINEPREYTLSYDLLETGDED
NFDSAVIVFSLVNDEVDLAISKTSFGAEIYEGDEFEYELVVTNVHDTDATNVIISDVLPD
ELTYISSEVVTNRDDIIVNTNVNGQELAWAMPSLPVGGEVTITIRVEAGNPASIINTAEV
EAFEEDINPEDNTATDINEINPFRIPNVITPNNDGDNDTFEVLGLGKFESNRITIFNRFG
DHVLEQEDYQNDWDAPGQVAGTYYYILLCYDEDGTEHEFKGWIQVIKD