Protein Info for Psyr_2616 in Pseudomonas syringae pv. syringae B728a

Annotation: Amino acid adenylation

These analyses and tools can help you predict a protein's function, but be skeptical. For enzymes, over 10% of annotations from KEGG or SEED are probably incorrect. For other types of proteins, the error rates may be much higher. MetaCyc and Swiss-Prot have low error rates, but the best hits in these databases are often quite distant, so this protein's function may not be the same. TIGRFam has low error rates. Finally, many experimentally-characterized proteins are not in any of these databases. To find relevant papers, use PaperBLAST.

Protein Families and Features

1 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000 6500 7000 7500 8000 8500 9000 9500 10000 10500 11000 11500 12000 12500 13000 13537 transmembrane" amino acids 773 to 789 (17 residues), see Phobius details amino acids 9661 to 9673 (13 residues), see Phobius details amino acids 10035 to 10053 (19 residues), see Phobius details amino acids 10729 to 10741 (13 residues), see Phobius details amino acids 11104 to 11122 (19 residues), see Phobius details amino acids 12174 to 12187 (14 residues), see Phobius details PF00668: Condensation" amino acids 95 to 535 (441 residues), 152.7 bits, see alignment (E = 4.9e-48) amino acids 1170 to 1610 (441 residues), 137.6 bits, see alignment (E = 1.9e-43) amino acids 2243 to 2683 (441 residues), 178.1 bits, see alignment (E = 9.4e-56) amino acids 3320 to 3760 (441 residues), 155.7 bits, see alignment (E = 6.2e-49) amino acids 4408 to 4848 (441 residues), 151.6 bits, see alignment (E = 1e-47) amino acids 5496 to 5935 (440 residues), 158.4 bits, see alignment (E = 8.9e-50) amino acids 6585 to 7027 (443 residues), 150.1 bits, see alignment (E = 2.9e-47) amino acids 7661 to 8102 (442 residues), 155.9 bits, see alignment (E = 5.2e-49) amino acids 8748 to 9186 (439 residues), 151.2 bits, see alignment (E = 1.4e-47) amino acids 9791 to 10236 (446 residues), 365.6 bits, see alignment (E = 1.3e-112) amino acids 10858 to 11304 (447 residues), 379.4 bits, see alignment (E = 8.4e-117) amino acids 11926 to 12372 (447 residues), 366.2 bits, see alignment (E = 8.4e-113) PF00501: AMP-binding" amino acids 556 to 896 (341 residues), 259.3 bits, see alignment (E = 1.9e-80) amino acids 1630 to 1968 (339 residues), 273.7 bits, see alignment (E = 7.8e-85) amino acids 2703 to 3045 (343 residues), 277.7 bits, see alignment (E = 4.9e-86) amino acids 3781 to 4134 (354 residues), 317.3 bits, see alignment (E = 4.5e-98) amino acids 4868 to 5221 (354 residues), 318.9 bits, see alignment (E = 1.5e-98) amino acids 5956 to 6310 (355 residues), 312 bits, see alignment (E = 1.8e-96) amino acids 7047 to 7385 (339 residues), 273 bits, see alignment (E = 1.3e-84) amino acids 8122 to 8473 (352 residues), 322.2 bits, see alignment (E = 1.5e-99) amino acids 9207 to 9545 (339 residues), 269.7 bits, see alignment (E = 1.3e-83) amino acids 10256 to 10614 (359 residues), 276.1 bits, see alignment (E = 1.5e-85) amino acids 11324 to 11682 (359 residues), 278.8 bits, see alignment (E = 2.2e-86) amino acids 12392 to 12735 (344 residues), 267.4 bits, see alignment (E = 6.4e-83) TIGR01733: amino acid adenylation domain" amino acids 577 to 971 (395 residues), 428.1 bits, see alignment (E = 1.6e-131) amino acids 1651 to 2043 (393 residues), 448 bits, see alignment (E = 1.5e-137) amino acids 2724 to 3120 (397 residues), 450.5 bits, see alignment (E = 2.6e-138) amino acids 3802 to 4209 (408 residues), 472.6 bits, see alignment (E = 5e-145) amino acids 4889 to 5296 (408 residues), 469.7 bits, see alignment (E = 3.8e-144) amino acids 5977 to 6385 (409 residues), 468.7 bits, see alignment (E = 7.7e-144) amino acids 7068 to 7460 (393 residues), 448 bits, see alignment (E = 1.5e-137) amino acids 8143 to 8548 (406 residues), 478.4 bits, see alignment (E = 8.6e-147) amino acids 9229 to 9620 (392 residues), 449.3 bits, see alignment (E = 5.9e-138) amino acids 10277 to 10689 (413 residues), 459.8 bits, see alignment (E = 3.8e-141) amino acids 11345 to 11757 (413 residues), 466.6 bits, see alignment (E = 3.4e-143) amino acids 12413 to 12810 (398 residues), 414.5 bits, see alignment (E = 2.1e-127) PF13193: AMP-binding_C" amino acids 955 to 1030 (76 residues), 50.4 bits, see alignment (E = 1.3e-16) amino acids 2027 to 2103 (77 residues), 43.9 bits, see alignment (E = 1.3e-14) amino acids 3104 to 3180 (77 residues), 43.6 bits, see alignment (E = 1.6e-14) amino acids 4193 to 4268 (76 residues), 37.5 bits, see alignment (E = 1.3e-12) amino acids 5280 to 5355 (76 residues), 32.2 bits, see alignment (E = 5.9e-11) amino acids 6369 to 6445 (77 residues), 46.7 bits, see alignment (E = 1.8e-15) amino acids 7444 to 7520 (77 residues), 45.5 bits, see alignment (E = 4.3e-15) amino acids 8532 to 8608 (77 residues), 40.6 bits, see alignment (E = 1.4e-13) amino acids 9604 to 9680 (77 residues), 43.1 bits, see alignment (E = 2.3e-14) amino acids 10673 to 10748 (76 residues), 41.4 bits, see alignment (E = 8.1e-14) amino acids 11741 to 11816 (76 residues), 39.6 bits, see alignment (E = 2.9e-13) amino acids 12794 to 12870 (77 residues), 35.4 bits, see alignment (E = 6.1e-12) PF00550: PP-binding" amino acids 1060 to 1121 (62 residues), 40.3 bits, see alignment (E = 1.1e-13) amino acids 2132 to 2194 (63 residues), 48.9 bits, see alignment (E = 2.2e-16) amino acids 3210 to 3270 (61 residues), 42.7 bits, see alignment (E = 1.9e-14) amino acids 4298 to 4359 (62 residues), 38.9 bits, see alignment (E = 2.9e-13) amino acids 5385 to 5446 (62 residues), 42.7 bits, see alignment (E = 1.9e-14) amino acids 6474 to 6536 (63 residues), 45.6 bits, see alignment (E = 2.4e-15) amino acids 7549 to 7611 (63 residues), 45 bits, see alignment (E = 3.7e-15) amino acids 8638 to 8699 (62 residues), 40.4 bits, see alignment (E = 1e-13) amino acids 9710 to 9770 (61 residues), 53.7 bits, see alignment (E = 7.1e-18) amino acids 10777 to 10839 (63 residues), 49.3 bits, see alignment (E = 1.6e-16) amino acids 11846 to 11907 (62 residues), 47.2 bits, see alignment (E = 7.5e-16) amino acids 12899 to 12961 (63 residues), 45.4 bits, see alignment (E = 2.8e-15) PF00975: Thioesterase" amino acids 12986 to 13129 (144 residues), 75.8 bits, see alignment (E = 2.1e-24) amino acids 13279 to 13529 (251 residues), 84.5 bits, see alignment (E = 4.7e-27) PF00561: Abhydrolase_1" amino acids 13279 to 13378 (100 residues), 29.6 bits, see alignment (E = 1.9e-10) PF12697: Abhydrolase_6" amino acids 13280 to 13493 (214 residues), 33.4 bits, see alignment (E = 2.6e-11)

Best Hits

Predicted SEED Role

"Non-ribosomal peptide synthetase modules, pyoverdine??" in subsystem Siderophore Pyoverdine

Sequence Analysis Tools

PaperBLAST (search for papers about homologs of this protein)

Search CDD (the Conserved Domains Database, which includes COG and superfam)

Compare to protein structures

Predict protein localization: PSORTb (Gram-negative bacteria)

Predict transmembrane helices and signal peptides: Phobius

Check the current SEED with FIGfam search

Find homologs in fast.genomics or the ENIGMA genome browser

See Q4ZT67 at UniProt or InterPro

Protein Sequence (13537 amino acids)

>Psyr_2616 Amino acid adenylation (Pseudomonas syringae pv. syringae B728a)
MHFSELMAVLSTHAIRLQQEEDDLVILGDDEGLDSATLESLSVHKAELLTLVARMGGDWL
SPAFRITADMLPLANLSQEAIDRIVDAIPGGAGNVQDIYPLAPLQEGILYHHLTAGQGDP
YVLRALFGAESRERLDDFAQALQAVIERHDILRTAMVWEGLEEPVQVVLREATLAVDELT
LDAAGGDIETQLHERYDPRHFRLDLRQAPLMRMVCAEDPANARWVAILLFHHIAIDHAAL
DLVKHEMQAFLLGEGHALPEAVPYRNYVAQVRLGVGADAHEGFFREMLADIDEPTLPFGV
LETPGSDSLIEDVHLPVDDALSARLRTQARQLGVSAASLHHLAWARVVGALAGKSDVVFG
TVLMGRMQGGDGADRALGMFINTLPLRVRLEGHAVREGVRTTHARLTALLGHEHASLAQA
QRCSGVAAPAPLFNSLLNYRHSASESVASAEALQAWNGLQSLGSEEQSTYPITLSVDDLG
QGFSLTVQALAQIGAQRIGEYMLTALGALVEALEQQPQTPLQRLQVLSAAERQQVLHDFN
DTAREYPRNSSLQELFEQQVATQPDALAAVQNDQCLTYRELNGRANALARHLVDLGMQPG
ERVALLLERSLDLLAGQLAIIKCGAAYVPLDINAPAERQAFMLQDCGARQVLTLSRHDLP
DGIQRIDLDLLELQSDAPNPVHSASAESVAYIMYTSGSTGMPKGVLVPHRAVSRLVLNNG
YADFNAGDRVAFASNPAFDASTLDVWAPLLNGGCVVVVEQSVLLSLDEFRALLLSQSVSV
LWMTAGLFHQYASGLMEALARLRYLIVGGDVLDPAVIARVLAEGAPQHLLNGYGPTEATT
FSTTHEITSVGSGGIPIGRPIGNSQVYVLDTLRQPVAVGVAGELYIGGQGVAKGYLNRPE
LNATQFVANPFSDDAGALLYRTGDLGRWNADGIVEYLGRNDDQVKIRGFRIELGEIEARL
VECPGVREAVVLARQDESAHKRLVAYVVGEENSALSAVELRRELAASLAEYMVPSAFMVL
DSFPLTANGKLDRRALPVPDADAYASREFEAPEGEVEITLARLWSELLNVERVGRHDHFF
ELGGHSLLAVSLIERMRLAGLSADVRVLFSQPTLAALAAAVGAGREINVPANLIGQGCER
ITPELLPLAGLTQVQIDQVVASVPGGVANVQDIYALAPLQEGILYHHMAAEVGDPYVLQS
QFAFDNRERLDAFVQALQMVIDRHDILRTGVVWDGLDSPVQVVWREARLHLEGLELDPAD
GEIGAQLHSRFDPRHYCLDMTQAPLMRLVYAEDPLNQRITAMLLFHHMALDHMAMDVVQH
EMQAWLLGESETLSAPVPYRNYVAQARLGVSQADHEVFFRDMLGDIDEPTLPFGLQDVQG
DGRDIEEAVLAVDSQLNLRLRAQARQQGVSAASLVHLAWAQVLGKVSDRRDVVFGTVLLG
RMQAGEGADRALGMFINTLPLRVAVGGQGVRAGVKATHAQLTALLAHEHASLALAQRCSG
VAAPTPLFSALLNYRHSAVGSVSERAVQAWQGIHALSSEERTNYPLTLNVDDLGDGFKLA
ALATNAIGAQRVCGYMHTALEHLVDALELMPQASLQGLSILPAVEREQLLVGFNDTALDY
PQQQTIHGMFEAQVERTPQALAVVHGEQRLTYRELNEQANRLAHALIKQGVQPDSRVGIC
VERGADMVVGLLAILKAGGGYVPLDPAYPAERIAYMLQDSAPAAVLAQTATQGLLADVSV
PVINLDLSDWQDQSVQNPQVPGLTSAHLAYLIYTSGSTGLPKGVMIEHRNTVNFLTWAHS
AFDASALEKTLFSTSLNFDLAVYECFAPLTSGGSIEIVKNVLELQHGEHDIGLINTVPSA
LKALLEVNGLPESVHTVNVAGEALKRSLVEDLFESTGVQRLCNLYGPSETTTYSSWVAMD
REDGFAAHIGKPVGNTRFYLLDEQQQPVPLGVAGEIYIGGAGVARGYLNRDDLTAERFLL
DPFSQQPAARMYRTGDLGRYLPDGNIEYLGRNDDQVKIRGFRIELGEIDARLAKHPAVHE
AVVTAREDIPGDKRLVAYYTLVSGHTSVDIDNLRSHLQEKLPEYMVPAIYVLLEAMPLTP
NGKLDRKALPAPDVEALSSRGYEAPQGETETQLAAIWQDLLGVARIGRHDNFFELGGHSL
LAVSLIGRMRQLGLSADVRVLFSQPTLAALAAAVGGSTDIVVPANLITEDCQRITPDLLP
LASLTQTQIDQVVATVPGGVANVQDIYPLAPLQEGILYHHIAAEQGDPYVLQAHFAFDDR
ACLDAFVQALQTVIDRHDILRTGVVWKGLDSPVQVVWRHAQLPLEMLQLDPAAGDVSAQL
HDRFDPRHHRLDVTQAPLMRLAYAEDPLNQRICAMLLFHHMVLDNMAMAVVQHEMQAWLL
GEAQNLSTPVPYRNYVAQARLGVSQADHEAFFRDMLGDVDEPTLPFGLINVQGDGLDIEE
ANLALAPRLNLRLRAQARQQGVSTASLLHLAWAQVLSSVANRQDVVFGTVLMGRMQAGEG
AERALGMFINTLPLRVSVGGKSVRDGVKATHERLTALLGHEHASLALAQRCSGIPAPTPL
FSSLLNYRHSAPGSVSERATQAWQGIHTLNSEERTNYPLTLNVDDSGEGFNLNVLVTGAV
GAKRVCAYMQTVLEHLVDALEQSPSAALDSLSILPAGERAQLLVAFNDTAMAYPQQQTIH
GLFEAQVERTPQALAVVHGEQRLTYRELNQQANRLAHALRKQGVQPDSRVGICVERSAEM
VVGLLAILKAGGGYVPLDPAYPAERIAYMLQDSAPAAVLAQSATEALLADVSVPVINLDL
GNWQDQSVQNPQVPGLTSAHLAYLIYTSGSTGLPKGVMIEHRNTVNFLTWAHSAFDAGTA
TSALEKTLFSTSLNFDLAVYECFAPLTSGGSIEVVKNVLELQNGEHDIGLINTVPSALKA
LLEVDGLPTSVHTVNVAGEALKRSLVESLFEKTGVRRLCNLYGPSETTTYSSWVAMDRED
GFAAHIGKPVGNTQFYLLDEQQQPVPLGVAGEIYIGGAGVARGYLNRDELTAERFLTDPF
SQQPAARMYRTGDLGRYLPDGNIEYLGRNDDQVKIRGFRIELGEIDARLAKHPAVHEAVV
TAREDIPGDKRLVAYYTLSAGHASVDSDSLRGWLQEQLPAHMIPAAYVLLDAMPLTPNGK
LDRKALPAPDGDALIRGGYEAPQGTAEITLAKIWSDLLNIERVGRHDHFFELGGHSLLAV
SLIERMRQAGLSANVRTLFSQPTLATLAAAAGAVDELVIPANRIPADCQKITPDMLPLIN
LTQVQIDQVVAGVPGGAANVQDIYPLAPLQAGILYHHISAEQGDPYVLQAQFAFDSRTHL
DTFAQTLQTVINRHDILRTSMHWESLDEPLQVVWRHVEISVEEVQLNPGLGDISRQLQER
LDPRQIRLDIRQAPLMRLVCALDTVNQRWLATLMFHHMILDHTALDQVRYEMQVCLLGQA
DQLGDSIPYRNYVAQARLGLNERDHELFFKDMLGDIDEPTLPFDLHDVQGDGNAIDQARL
MLDSGLSQRLRVQARQLGVSAASLLHLAFAQMLGRLSGREHVVFGTVLMGRMQSGEGAER
ALGMFINTLPLRVDLGEQAVRDGVKATHRRLTGLLGHEHASLALAQRCSGVVAPAPLFSA
LLNYRHSSVAVTDEALTAWNGMQSLALDDEERTNYPLTVNVDDMGEGFLFTALVAASIGA
QRLCDYLQLAAEGLVDALEQAPQTPLHAISILPLNERAQLLKQWNEPGQHYANGTPIHQK
FEARAAERPDAVALMFEAQTLSYGQLNARANQVAHRLLALGVRPDDRVAICVERGPAMII
GLLGILKSGAGYVPLDPAYPLERLAYTLGDSTPVALLSQQSVQQALPVTEVPIISLDDAD
LQDESVCNPEVSGLTAASLAYVIYTSGSTGLPKGVMVEHRNVARLFSATQDWFGFGEQDV
WALFHSFAFDFSVWEIWGALLHGGRLLIVPQLVSRSPRDFHDLLCSAGVTVLNQTPSAFR
QLIAAQGENEQAHSLRQVIFGGEALETAMLKPWYARQANAGTQLVNMYGITETTVHVTYY
PLQPEDAQRLGASPIGRRIPDLQLYVLDARGEPVPVGVVGELYVGGAGVARGYLNREALT
AERFIDNPFNTAPGARLYRTGDLGRWLADGSLEYLGRNDEQVKIRGFRIELGEIEAQLVA
CDGVQDAMVLVREDEPGDKRLVAYVIGTAGAELDATHLREQLRLSLAEYMLPSAFVSLEA
FPLTANGKLDRKALPAPDRSAVASRGYEAPEGDTEMAIARIWQDLLQLEQVGRHDHFFEL
GGHSLLAVKLIERMRQIDLVADVRVLFSQPTLSALAAAVGGKSAVEVPGNLIPANCERIT
PDMLPLVSLSQDDIDRVVASVPGGLANVQDIYALAPLQEGILYHHLAAAEGDPYLQYALF
AFDNIERLYSFAEALQGVIARHDILRTAVLWERLDSPVQVVWREATLGLDEQVLDPADGD
IAEQLLKRLDPRNTRLDIRQAPMLRIGYAQDTVNNRWLGMLLFHHLVDDATSLRILSSEI
EAHMLGQQASLPPSVPYRNYVAQAMLGVSREEHEAFFRDMLGDIDEPTLPFGLQDVQGDG
RGIEEVRQLVDVDLSRRLRVQARQLGVSSASLYHLAWARVLGAVSGKEDVVFGTVLLGRL
QGGAGSDRALGMFINTLPLRVTLGEQGVRSGLKATHARLSGLLAHEHASLVLAQRCSGVP
ASTPLFSSLLNFRHTGDLDASDQALAAWEGIQALHGEERTNYPLTLCVDDLGEGFNLTVM
AEGQIGAKRVCTYMHCVLENLVQALEQTPDAALGALNILPASERQQLLESWNTPHALQAD
DALIHRTFEAWVVAQPNAVALDYEERTLTYAELNTRANQVAHYLLGLGVQPDDRVAICVE
RSLEMIVGLLGVLKAGAGYVPIDPAYPAERIAYQLQDSAPMAVLADAAGLALLGSFDGPR
VDLHSPALQAQPGHNPQLTGLSPRHLAYVIYTSGSTGLPKGVMVEHRNVARLFSATQSWF
GFNEQDVWALFHSFAFDFSVWEIWGALLHGGRLLIVPQLVSRSPQDCYALLCNAGVTVLN
QTPSAFRQLLNAQGESDQRHSLRQVIFGGEALDTGMLKPWYARVINAGTQLVNMYGITET
TVHVTYHPLVAADAQRSGVSPIGVRIPDLQLYVLDARREPVPVGVVGELYVGGAGVARGY
LNREALTAERFLANPFSKEPQARLYRTGDLGRWMADGSLDYLGRNDDQVKIRGFRIELGE
IQAVLAACNTVREALVLVREDQPGDKRLVAYVIAAPGHEIVAADLRAQLLLSLADHMVPS
AFVALDSFPLTANGKLDQKALPAPDAQALAMREYAPPEGDVEIAIAQIWQSLLQVPQVGR
HDHFFELGGHSLLAVKLIERMRQIDLSADVRVLFSQPTLAALAAAVGGYTEVQVPDNLIV
QGCERITPDMLPLADLDQAGIDRVVASVPGGLANIQDIYALAPLQEGILYHHLAADEGDP
YVLQMLFAFDDRECLAGFAAALQSVVERHDILRTSVVWEGLEQPVQVVWRNAKLSLEEVT
IDPLDGDVLTRLRERFDPRHYRLDIGQAPLMRIAYAEDNTHQRLVGMLLFHHLALDHTSL
EVVVEEMQASLQGQIEQLPTPVPYRNHVAQARLGISQAEHEAFFRDMLGDIDEPTLAYGI
QDVQGDGSGIEEVHQVLDSQLSSRIRSIARQLGVSAASLAHLAWAQVAGRVSGREEVVFG
TVLMGRMQGGNGADRALGMFINTLPLRISVGSQSALAAVKVTHQRLSALLGHEHASLSLA
QRCSGVPGSLPLFSTLLNYRHSNGSAASSETLSAWQGIQTLSMEERTNYPLCLNVDDLGD
AFMLTIQAVQQINAQRIGEYMQVALRNLVDALEHTPQAALNSLSILPDDERELLLTGFND
TAHPYPRDVLIHQLIEQQAAQRPDACAVRGDSGPLLTYAELNQQANQLAHRLIELGVEPD
TRVAVSLRRGAEMVVALLGILKAGGAYVPIDPDLPSARQTYMLEDSSPQAVLTTRDLSDN
LPASDLPVLVLDGRDDRAQLARQPSVNPDAKALGLQPNHLAYVLYTSGSTGTPKGVMNEH
LGVVNRLLWARDAYQVNSQDRVLQKTPFGFDVSVWEFFLPLLTGAELVMARPGGHQDPDY
LAQVMSDAGITLLHFVPSMLDVFLEHRSTRDFPQLRRVLCSGEALPRALQRRFEQHLKGV
ELHNLYGPTEAAIDVTAWECRPTDPGDSVPIGRPIANIQMHVLDALGQLQPLGVAGELHI
GGIGVARGYLNQPDLSAERFIADPFSSDPQARLYKTGDVGRWLANGALEYLGRNDFQVKI
RGLRIEIGEIEAALAKHPAVHEAVVTAREDIPGDKRLVAYYTQSAEHTAVDIETLRGHLQ
QQLPEYMVPAIYVLLEAMPLTSNGKLDRKALPAPDGDALISRGYEAPQGEIEEQIAIIWQ
DLLGVEQVGRHDNFFELGGHSLLAVSLTGRMRQLGLSADVRVLFSQPTLAALAAAVGGGT
EVVVPANLITEDCQRITPELLPLADLTQAQIDQVVATVPGGVTNVQDIYPLAPLQAGILY
HHLAAEIGDPYVLQTQFAFADSERLNAFVQALQMVIDRHDILRTSVVWDSLDSPVQVVWR
QAQLHLDALELDPADGDIGAQLHSRFDPRHYRLDIGQAPLMRVAYAEDSLNQRICAMLLF
HHMALDHVALEVVKHEMQAWLAGEADTLAASVPVPYRNYVAQARLGVSQADHETFFRDML
GDIDEPTLPFGLQDVQGEGRDIEEASLALAPQMSLRLRAQARQQGVSAASLVHLAWAQVL
GKVSNRQDVVFGTVLMGRMQGGEGAERALGMFINTLPLRVSVGEQGVRDGVKATHKRLTA
LLGHEHASLALAQRCSGVAAPAPLFSALLNYRHSGVGSVSDQAMQAWQGIAVLSGEERTN
YPLTLNVDDLGEGFSLTALVVSSIGAQRVCGYMHTALENLLTALEQTPETSLQGLSILPV
VEREQLLVAFNDTALDYPQQQTIHGMFEAQVERTPEGVAVVHGEQRLTYRELNQQANRLA
HALIKQGVQPDSRVGICVERGAAMVVGLLAILKAGGGYVPLDPAYPAERIAYMLQDSAPA
AVLAQTATQGLLADVSVPVINLDLSNWQDESVQNPQVPGLTSAHLAYLIYTSGSTGLPKG
VMIEHRNTVNFLTWAHTAFDGSALEKTLFSTSLNFDLAVYECFAPLTSGGSIEVVKNVLE
LQHGEHDIGLINTVPSALKALLDVDGLPRSVHTVNVAGEALKRSLVEDLFERTGVQRLCN
LYGPSETTTYSSWVAMDREQGFAAHIGKPVGNTQFYLLDEQQQPVPLGVSGEIYIGGAGV
ARGYLNRDDLTAERFLKDPFSQNPAARMYRTGDLGRYLPDGNIEYLGRNDDQVKIRGFRI
ELGEIDARLAKHPAVHEAVVTAREDVPGDKRLVAYYTLSAGHASVDIDSLRGWLQEQLPA
YMIPVAYVLLDALPLTPNGKLDRKALPAPEADALISRGYEAPQGETETQIAAIWQDLLGV
AQVGRHDNFFELGGHSLLAVSLIGRMRQSGLSSDVRVLFGQPTLAALAAAVGGGGTEIVV
PANLITEHCEHITPDLLPLADLTQAQIDQVVATVPGGVANVQDIYPLSPLQEGILYHHLS
AERGDPYLLQSLFAFDSQEHVDDFARALQFVIKRHDILRTALVWEGLDEPMQVVWRQALL
MRESFDPDPQGGDVATQLHQRFDSRHQRLDIRRAPMMRLVHAWDEPNQRWLGWLQFHHLV
MDHTTLEVVRHEMQACLLGQGAQLGLTVPYRNYIAQVRLGMSQEEHEAFFRDMLSDVDEA
TLPFGLQEVQGDGHDIEEGCLAVDLELSRRLRAQARQLGVSAASLVHLAWAQLLSKASGR
DDVVFGTVLLGRMQGGEGSERALGVFINTLPLRVSLGDQGVRDGVKATHRRLTALLGHEH
ASLALAQRCSGVVAPAPLFSALLNYRHTSVAVTDETLTAWNGMHALALGDEERTNYPLTL
NVDDRGEDFLLTVQTVPLIDGARICAYMQQTLSNLVNALEQAPQTPLHAIAILPDSEREQ
LLEQWKQPGTAYTSETPIHLQFEARAAQQPDAVALVFEEQTLSYGELNARANQVAHRLLA
HGVRPDDRVAICVERGPAMIIGLLGILKSGAGYVPLDPAYPLERLAYTLGDSAPVALLSQ
RSVQGTLPVSQVPVICLDDDLQDESVCNPQVPVTPGNLAYVIYTSGSTGKPKGVMIEHRN
VARLFSATEEWFGFNQQDVWALFHSFAFDFSVWEIWGALLHGGRLLIVPQLVSRSPEDFY
NLLCSAGVTVLNQTPSAFRQLIAAQAENTQAHSLRQVIFGGEALETAMLKPWYARQANAG
TQLVNMYGITETTVHVTYYPLQPEDAQRLGASPIGRRIPDLQLYVLDARGEPVPVGVVGE
LYVGGAGVARGYLNREALTAERFLDNPFSHTADARMYRTGDLGRWLADGSLEYLGRNDEQ
VKIRGFRIELGEIEARLAEYPDVRDAVVLCREDVPGDKRLVAYVTAQQPESLLDIESLRE
HLQGALPEHMVPAAYVQLDELPLTANGKLDRKALPVPDRSALASRGYAAPENDTEMAIAR
IWQDLLQLEQVGRHDNFFELGGHSLLAVKLIERMRQIDLVADVRVLFSQPTLSALAAAVG
GKGAVEVPANLIPAGCERITPDMLPLAALSQDDIDRVVASVPGGLANVQDIYALAPLQEG
ILYHHLAATEGDPYLQYALFAFDSLERLHSFAQALQGVIDRHDILRTAVLWERLDAAVQV
IWREAPLGMDEWVLNPADGDIAEQLLKRLDPRHTRLDIRQAPMLRIGYAHDAENDRWLGM
LLFHHLVDDATSLRTLTSEIESYMLGQQASLPPSVPYRNYVAQAMLGVSREEHEAFFRDM
LGDIDEPTLPFGLLDVQGDGRGIEEVHQPVDMDLSRRLRLQARQFGVSSASLYHLAWARV
LGAVSGKEEVVFGTVLLGRLQGGAGSDRALGMFINTLPLRVTLGEQGVRSGLKATHTRLS
GLLAHEHASLVLAQRCSGVAASTPLFSALLNYRHVAGQANQQTLDAWQGIESLRGEERTN
YPLTLSVNDEGTGFSLTIQASACIDAQRICAYVQTTLENVVSALEQSGEVPLAGLSVLSA
AEREHLVYGLNATALDYPQQQTIHGMFEAQVERTPEGVAVVHGEQRLRYRELNEQANRLA
HALIKQGVQPDSRVGICVERGVEMVVGLLAVLKAGGGYVPLDPAYPAERIAYMLQDSAPA
AVLAQTATQGLLADVSAPVINLDLSDWQDQSVQNPQVPGLTSAHLAYLIYTSGSTGLPKG
VMIEHRNTVNFLTWAHTAFDGSALEKTLFSTSLNFDLAVYECFAPLTSGGSIEVVKNVLE
LQHGEHDIGLINTVPSALKALLEVNGLPESVHTVNVAGEALKRSLVENLFEKTGVQRLCN
LYGPSETTTYSSWVAMDREHGFAAHIGKPVGNTQFYLLDEQQQPVPLGVAGEIYIGGAGV
ARGYLNRDDLTAERFVKDPFSPQPAARMYRTGDLGRYLPDGNIEYLGRNDDQVKIRGFRI
ELGEIDARLAKHPAIHEAVVAAREDVPGDKRLVAYYTLSAGHASVDIDSLRGWLQEQLPA
YMIPVAYVLLDALPLTPNGKLDRKALPAPDADALIRRGYEAPQGETETLLAQIWSDLLQL
ERVSRHDQFFELGGHSLLAMRLISQIRQQLGVELSLAALFAHPELSALALAIAQAGRSIL
PDIVPVARDQAWPLSFGQQRLWFLAKMEGASAAYHMPVGLGLRGELDRAAMQRALERIVA
RHEGLRTTFIQGDDEQPLQRISPADAGFNLQLHDLQGLVDAKEKLQALASEESLQGFDLE
QGPLIRGRLIRMAEDHHVLLLTMHHIVSDGWSIGVLTRELAALYAAFSQGQDDPLAPLTL
QYLDYAVWQRRWLSGDLLQQQSDYWQQTLADAPALLMLPTDRVRPAQQDYAGAVLPIVFD
EDLTRGLKALSQRHGSTLFMTIMAAWAALLGRLSGQDDVVIGTPVANRTRSEAEGLVGLF
VNTLAIRVDLSDKPTAETLLAQVKTNTLGAQDHQDLPFEQVVEVIKPVRSLSHSPVFQAM
LSWQDMSGGDLVLGDLQLESLSAGHTLSKFDLSLDIGEAQGQLLGSLEYATALFDESTIA
RYLGYLKRLLHAMVADDRQALEHVPLLDAVERKHLLIDLNATNVPYPQDATIHQLFEEKV
RAQPEAIAVAFQAQRLSYADLNRQANRLAHHLISLGIGPDDRVAICVERGVKMIVGLLGV
LKAGAAYVPLDPAYPAERLAYMINDSQPAALLTQRGLQERLPALSMPLVLLDDEHCQGFT
ECDDNPVVPTLGVRNLAYVIYTSGSTGNPKGVMIEHRGLVNYSVDAARLFGLSQSDTVLQ
QNTLNFDLSVEEIFPALLAGATLAPSREIFGSEGTENHGIYPTVLHLTAAHWHTLVAEWH
NQPQAAEQRLAEVRLINVTGDALSAQKLKLWDEVRPAHTRLINTYGPTEATVSCTAAYVS
HDAAAGSEGSGNATIGKPMANTRIYLLDEHQQPVPYGVAGEIFIGGDGVARGYLNLEEVN
AERFLADPFSNSPDARMYKTGDLARYMADGRIEYLGRNDFQVKVRGFRIELGEIEARLGN
CKGVKEAVVIAREDNPGEKRLVAYVIAQPQANLDAASLRAELAPQLAEYMLPSAFVLLDA
LPLTPNRKLDRKALPAPADDAFASREHVAPQGTSEKALAQIWQNLLNLEAVGRHDHFFEL
GGHSLLAMRLISQVRQRMGVELSLADIFAQPELAALAQVVAHAAGSSQQPIVPVSRDQSL
PLSFAQQRLWFLAQLDGGSAAYHIPAGLRLRGSLDQVALKRALDRIVARHEALRTTFVQE
QDQDPLQCIAPADIGFSLQLQVLNGQADAEQQLLAIAAEEAEEGFDLVNGPLVRGRLVRM
ADDDHVLLVTMHHIVSDGWSAGVLTRELGMLYAAFSEGAEDPLPALPVQYADYALWQRNW
LSGDVLQQQRQYWQQTLAGAPALLTLPTDRPRPAQQDYSGQLLGLVLDADLTRGLKALSQ
RHGSSLFMTVMAAWAALLGRLAGQDDVVIGTPVANRMRAEVEDLIGFFVNTLAVRVDLSG
TPSVQSLLQQVKQQTLAAQANQDLPFEQVVEVVRPQRSLSHSPIFQAMLSWQNNETSDLA
LGDMSLQGVAVAGHTAKFDLTLDMTEVGDQLIGTLEYATALFDESTLQRYMGYFQRLLEA
MVADDRQLLEQVPLLDAVERQHLLVDLNATDVPYPHDCTIHQLFEAKVQAQPDAIAVAFQ
AQRLSYAELNRQANRLAHHLIGLGIGPDDRVAICVERGVEMMIGLLGVLKAGAAYVPLDP
AYPAERLAYMITDSQPAALLTQRDLQKRLPTLTLPLVLLDDDQRNTFTERDDNPVVEALG
VRNLAYVIYTSGSTGNPKGVMIEHRGLVNYSVDAARLFALSQSDTVLQQNTLNFDLSVEE
IFPALLAGATLAPSREIFGSEGTENHGIYPTVLHLTAAHWHTLVAEWHNQPQAAAQRLAE
VRLINVTGDALSAQKLKLWDEVRPAHTRLINTYGPTEATVSCTAAYVSHDAAAGSEGSGN
ATIGKPMANTRIYLLDAHQQPVPYGVAGEIFIGGDGVARGYLNLEEVNAERFLADPFSNS
PEARMYKTGDLARYMADGRIEYLGRNDFQVKVRGFRIELGEIEARLGNCTGVKEAVVIAR
EDTPGDKRLVAYVVGQPQASLDAASLHAELAPQLAEYMLPSAFVILDALPLTPNRKLDRK
ALPAPQAEAFASREHVEPQGDTEMALAQIWQNLLNLEQVGRHDQFFELGGHSLLAMRLIS
QVRQHLGVELGLADIFAQPELAAMARILADAKGSSQPPIVPVSRDQTLPLSFAQQRLWFL
AQLEGGSAAYHIPAGLRLRGTLDNPALKRALDRIVARHEVLRTTFVQVHGEDVAQHIASA
DIGFSLQLQVLTGQADAEEQLLAITAEEANEGFDLLNGPLVRGRLVRMAEDDHVLLVTMH
HIVSDGWSADVLTRELSALYAAFSAGAEDPLPTLPVQYADYAVWQRNWLSGDVLQQQRQY
WQQTLGGAPALLTLPTDRPRPAQQDYSGNIRGLVLDAELTSGLKALSQRHGSTLFMTVMV
AWASLLGRLAGQDDVVIGTPVANRLRAEVEDLIGFFVNTLAIRVDLSGAPSVEALMQQVK
RQTLAAQSHQDLPFEQVVEVVRPQRSMSHSPIFQAMLSWQNNETSDLSLGAMNLQGVAVN
DHTAKFDLVLDMTEVGDQLIGTLEYATALFDESTMDRYLGYFQRLLEAMVANEHRILEHV
PLVDAVEREHLLSGLNATDRAYPQGPLMHRLFEAHAASRPQAIAARQGEQTLTYAELDSR
ANALAQHLRQHGVKAGARVAILLDRSVELLTSMLATLKCGAAYLALDRLAPEERVRFMLE
DSEALMLLTRSELTAPETTPRLDLDTLDLSVAGQGPVALAAEVAGETPACIIYTSGSTGI
PKGVIVTHNGIVRLVRDNYDFRADDRVAFSSNPAFDASSPEIWGALLNGGQSIIVEPSVL
LEPAAFAALLKRHAVTVMFSSTALFNLYAGLIPEALAGLRMLECGGERADPASFRRVREH
SSQVRLFNGYGPTEATTCATRYEVFDVQPGTLSLPIGKPNANVRIYLLDVCGEPVPMGSV
GEIYIGGVGVALGYLNRPELSAERFSDDPFSQQAGARLYRTGDLARWLPDGNLEYLARND
GQVKVRGFRVELGEVESSLHGCDGVRNSVVVAREDSPGDTRLVAYYTVHADVEAPEPEAL
RAQLSAGLPEYMVPSVFICLPDLPLTLNGKVDVQALPAPTADQFANRAFEAPAGELEIRL
AKAWAEVLELQQVGRHDSFFDLGGHSLMAVRLVNQLVQSGLAVSLADVFQHASVAALASL
LGSRSEEVAAQLHEQLVPVRTSGNQRPLFLVHEFTGQDVHFAALAVHIDSDIPVYGLSGI
PLGQEQLLTMECLATRLLGAMRSVQPHGPYRLAGWSFGGLLAYEIAIQLEGMDEQVEFIG
MLDTYMPRLVDQGRERWDLKTAHRQHLLEHCEQFWKARGECAQALEALERVRANRQDFDF
AGLLQHCREQGALPPELDIYTNEGLCQYLDREVAHGHAQANYTVYPNSVALHLFTASEWQ
PNSQKHSGYLGWDSVLPVSQLRSIKVAGDHMSMMKPPHIEGLGQAISQALADLPDRPREA
DSAHRPLLTIQGGRRDHAPVFCVPGAGDSVTGFIGLAEALGPHWPIHGLQPRGLDGRTVP
YSLVETAAEAYLQALDSTHPEGPVHLLGHSFGGWIAFEMAQRLTARGREVASLTLIDSES
PGGNGVVGKPYTAIGVLERLIETMELAAGKSLGIDRAAFRAQDDVGQMRLLHAGMVRAGM
LPQRSSPDAMRGPARAFGTALRTRYQPSQIYTGPVRLVLADDPVLDAAGNQREQQAMVSG
WRQCAPDLTVWRGPGNHFTILKAQHVRHLANWWLPHE