Pairwise Alignments

Query, 1024 a.a., beta-galactosidase from Vibrio cholerae E7946 ATCC 55056

Subject, 1421 a.a., beta-galactosidase (NCBI ptt file) from Bacteroides thetaiotaomicron VPI-5482

 Score =  555 bits (1429), Expect = e-161
 Identities = 343/1004 (34%), Positives = 519/1004 (51%), Gaps = 77/1004 (7%)

Query: 48   QSLNGQWRFALFEKPEAVEPAVIDPDFDDSAWAHIPVPSNWQMQG------FDKPIYTNI 101
            ++LNG W+F     P+        P++D S W +I VP+ WQ++       +DKP+Y N+
Sbjct: 68   KTLNGIWKFHWVADPKDRPQDFCKPEYDVSQWDNIKVPATWQIEAVRHNKNWDKPLYCNV 127

Query: 102  QYPFAD--------------RPP---YVPQDNPTGCYRHRFTLEKQALTESIRIVFDGVN 144
             YPF +              RP    +    NP G YR  F L        I I F+GV 
Sbjct: 128  IYPFCEWDWKKIQWPNVIQPRPSNYTFATMPNPVGSYRREFILPDSWKGRDIFIRFNGVE 187

Query: 145  SAFHLWCNGHWVGYSQDSRLPAEFELTPYLQEGENLLVAMVLRWSDGSYLEDQDMWWLSG 204
            + F++W NG  VGYS+DS LPAEF LTPYL+ G+N+L   V R++DGS+LE QD W  SG
Sbjct: 188  AGFYIWVNGKKVGYSEDSYLPAEFNLTPYLKAGKNVLAVEVYRFTDGSFLECQDFWRFSG 247

Query: 205  IFRDVYLYRKPILAIEDFFIRTELDALYQHAELRVETRLSQVTRHHQVQVALFDAQGECV 264
            IFRDV+L+  P   I DFF RT+LD  Y++A + ++  ++    ++++QV + D  G+ +
Sbjct: 248  IFRDVFLWSAPKTQIRDFFFRTDLDKEYKNASVSLDIDITGKRSNNEIQVKVTDQNGKEI 307

Query: 265  ARSQALHTGQRVVDEKGAWHDKTEHSLAICSPTLWSDEAPYLYRCVICLLDEDGAPIEFE 324
            A   A            A     +    + +P  W+ E P LY   I LL + G  ++  
Sbjct: 308  ATQNA-----------RAVTGTNKLQFEVVNPLKWTAETPNLYNLTI-LLKQKGKTVDIR 355

Query: 325  SAAVGFRKVEITQ-GLLKLNGQPLLIRGVNRHEHHPELGHVMDEASMRRDIELMKQHNFN 383
            S  VGFRK+E+ Q G L +NG+  L +GV+RH+H  E G  + +  M +D++LMK  N N
Sbjct: 356  SVKVGFRKIELAQDGRLLINGKSTLFKGVDRHDHSSENGRTVSKEEMEKDVQLMKSLNIN 415

Query: 384  AVRTAHYPNHPRWYELCDEYGLYVVDEANLETHGQFPMSRLSNDPQWVNAYLQRMIGMVE 443
            AVRT+HYPN+P +Y+LCD YG+YV+ EAN+E HG   +  LS++P WV A+ +R   MV 
Sbjct: 416  AVRTSHYPNNPYFYDLCDRYGIYVLSEANVECHG---LMALSSEPSWVKAFTERSENMVR 472

Query: 444  RDKNHPCVIIWSLGNESGIGTNHHAMYQWTKQRDPSRPVQYEGGGANTAATDIVCPMYAR 503
            R KNH  +++WSLGNESG G N  +  +  K+ D +RP  YEG   N++  D+   MY  
Sbjct: 473  RYKNHASIVMWSLGNESGNGINFKSAAEAVKKLDDTRPTHYEG---NSSYCDVTSSMYPD 529

Query: 504  VDQHQPHPAVPKYALKNWISLPQENRPLILCEYAHAMGNSLGAFYKYWQAFREFPRLQGG 563
            V   Q   +V K  L+ + +  +  +P ++CEYAHAMGNS+G F +YW+ +  +P L GG
Sbjct: 530  V---QWLESVGKERLQKFQN-GETVKPHVVCEYAHAMGNSIGNFKEYWETYERYPALVGG 585

Query: 564  FIWDWVDQGISKWDSEGR-HYWGYGGDFGDTINDRQFCINGLLFPDRTPHPALHEVKKVQ 622
            FIWDWVDQ I     +G  +Y  +GGDFGDT ND  FC NG++F DRT     +EVKK+ 
Sbjct: 586  FIWDWVDQSIKMPAPDGSGYYMAFGGDFGDTPNDGNFCTNGVIFSDRTYSAKAYEVKKIH 645

Query: 623  QP-YQFSLSYPKLTIHNERLFAAL-PLELVVSVLCDGQEI---KQERLPLDIAPRGTITL 677
            QP +  ++      + N+R  A L  L     +  DG+ +     E L L+      IT+
Sbjct: 646  QPVWVEAMGNGTYKLTNKRFHAGLDDLYGRYEIEEDGKVVFSANLEELSLNAQDSKVITI 705

Query: 678  DLASLPMLPEHEYHLNAVLLCREDQPWSNAGHCIASEQWCLQPRRSMLPKITHAPLPQWQ 737
                +  +P  EY +      ++D  W  AG+ +ASEQ+ L      + K     +   +
Sbjct: 706  ADNQINKIPGAEYFIKFRFCQKQDTEWEKAGYEVASEQFKLSDSAKPVFKAGEGSIDLIE 765

Query: 738  QDGDKVRIEAANQQWQFNRQTGLLEQWWQNGQPVLSEPLRDNFYRAVLDNDIGTSEAQHL 797
             D D   ++ +  +  F++Q G +  +  N  P++S+ L  N +RA  DND      + +
Sbjct: 766  TD-DAYLVKGSQFEASFSKQQGTISSYTLNELPMISKGLELNAFRAPTDND------KQV 818

Query: 798  DPNSWIARWHAAGLDKLRVECDDLRVTTLNESVEVVIDVAHYHQQALALRTRWRYQIFGD 857
            D +     W+  GL ++ +E     V   +  V + I+  +  +     RT   Y +  D
Sbjct: 819  DGD-----WYQKGLYQMTLEPGHWNVRKEDNKVTLQIENLYRGKTGFDYRTNIEYTVAAD 873

Query: 858  ARVELNVEVMLCSDLPPLPRVGLTLALPVAENPVSWFGRGPHENYPDRLQSAHVGRYTAT 917
              + +N  ++  +    +PR+G  + LP     + W+GRGP ENY DR  + +VG Y   
Sbjct: 874  GSILVNSTIIPSTKGVIIPRIGYRMELPEGFERMRWYGRGPLENYVDRKDATYVGVYDEL 933

Query: 918  VDELHTPYIFPSENGLRCDTRQLQ------VGALVVEGH--FHFSLSRYSQTMLDKAKHS 969
            V +    Y+   E G R D R +       +G + + G      +L   +Q M+D A H 
Sbjct: 934  VSDQWVNYVRAQEMGNREDLRWISITNPDGIGFVFIAGDKMSASALHATAQDMVDPANHR 993

Query: 970  N-----ELVAGDKWYLNLDAQHMGVGGDDSWSQSVHPEFLLTQP 1008
                  E+    +  L LDA    +G        +    L +QP
Sbjct: 994  RLLHKYEVPMRKETVLCLDANQRPLGNASCGPGPMQKYELRSQP 1037