KCC000054A_c01_2

Fasta Sequence
>KCC000054A_C01_2 KCC000054A_c01_2
GGTGCCCTGGGCACTCGGGCCGGAGGGAGCGGAGCAGCTGCGGTTGCACGACGGCAGCCA
ATGGCTTGTGGTCACCAGCTGGGCCTCGCACGCGCTCTGGCGCGACTCGCTCGCCCTCCA
GCGCCTCGGCGGCCTGCAGCAGCAGCTAGCCAGCCGCCGCCTTCCCCAGCTCGCCTTCCT
CGCGGAGGGTCCCTGGTGTGCGGAGTCACCAGACGCCTCGCCGCCCTTCGGGCCGCCCGC
CGGCATTGATGCTGTGCAGGCCGGCAGCCCCGCGCATCGCGTCACCTCTCTCTACACCAA
CCTGGCCGACTCATCCCAAGTCGACTTTGCCTTCTCTCACCTGCGCCCTCTTCCCAGCCG
CCTGCTTGCTGCCGTGCTGCCGCACGGCCGCACGCCTGGTGCTGCGCAACGGCCCCTCCC
TCCTCCTTACGTGGCGCTGCAGGCCCCCGGCCAACCCCTCGCCGCGCTGCCCCCGCCTTC
CCCGCCGCCCGCCGCCCCGGAGCCGCCACCGCTGCGTGCCGGCCCTGATGGCCTGCCCTG
CGCCCCGGACGCGGCGGACTGGCTCGCCGTCTCCACCCACCACTCCTCCCTCTCCTTGCC
CGCCGCCTTCGCCGTCCAGCTGCTCCGCCGCCACACGCCGCCGGCTGTCTTCACGGCCGC
CTTGGCTGCCGCCACGGCTCTCCACCGCCTCTTCTTCACGCGCCACGACATGCAGCCGCT
TCCCCCTCGTCAGATCGACGCCCCGGCCTCCACGCTGGGGGGGGGAGCTGCCGCCTCGGG
CCCAGCAGACCACCTGGAGACCGCCTTTGCAGCGCTGGCGCTGTCAGACGAGCCGTCACC
ACTGCTGGACGCGACCGAGACCACTCTCGCGGTTGCCCTGCTTCACCAGCGTCTCCTCGG
GGCGCGCATGGGTGCGCAGCTGCATGTCTACGCCGCAGTGGCGGAAGCTGCGGAGCAAGC
GGACTGGGCTGACGCCGCACTGTCGGCGTCGGCGGCCCCGGCCGCCGCCAGCACCACCAT
CGCCTCCGCAGCCCTGGGCCCGTCAACCCCTGACACCGCAGACGTCTGGAACGACGCCCC
CGTCATGGACCTGCTGCGCCTGGGCGCCCAGCCTGACAGCCTCAGCCGCGAGGAACGGCA
ACGTGTACAGCGCCGCGCCGCCAGCCACCGCTGGACC


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000054A_C01_2 KCC000054A_c01_2
         (1177 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_307770.1| ENSANGP00000004103 [Anopheles gambiae] gi|21291...    84  4e-15
ref|NP_730262.2| CG13731-PA [Drosophila melanogaster] gi|2838049...    83  1e-14
ref|ZP_00091493.1| COG1721: Uncharacterized conserved protein (s...    82  1e-14
ref|ZP_00057775.1| COG1205: Distinct helicase family with a uniq...    82  2e-14
sp|Q9FPQ6|GP1_CHLRE Vegetative cell wall protein gp1 precursor (...    79  2e-13

>ref|XP_307770.1| ENSANGP00000004103 [Anopheles gambiae] gi|21291398|gb|EAA03543.1|
            ENSANGP00000004103 [Anopheles gambiae str. PEST]
          Length = 972

 Score = 84.3 bits (207), Expect = 4e-15
 Identities = 97/337 (28%), Positives = 127/337 (36%), Gaps = 13/337 (3%)
 Frame = +3

Query: 114  PSSASAACSSS*PAAAFPSSPSSRRVPGVRSHQTPRRPSGRPPALMLCRPAAPRIASPLS 293
            P S    C  + P     ++P+ R  PG      PR P   P AL    PA   +  P S
Sbjct: 672  PGSTDPRCPQTTPRPVPTTTPAPRCYPGSND---PRCPQTTPRALPTTTPAP--VCYPGS 726

Query: 294  T-PTWPTHPKSTLPSLTCALFPAACLLPCCRTAARLVLRNGPSLLLTWRCRPPANPSPRC 470
            T P  P    +T P L C  +P +    C +T  R V    P+     RC P +N  PRC
Sbjct: 727  TDPRCPKPTTTTQPPLRC--YPGSTDPRCPQTTPRPVPTTTPAP----RCYPGSN-DPRC 779

Query: 471  PRLPRRPPPRSRHRCVPALMACPAPRTRRTGSPSPPTTPPSPCPPPSPSS-CSAATRRRL 647
            P+   RP P +     PA +  P     R   P+  T PP  C P S    C   T R +
Sbjct: 780  PQTTPRPVPTT----TPAPVCYPGSTDPRCPKPTTTTQPPLRCYPGSTDPRCPQTTPRPV 835

Query: 648  SSRPPWLPPRLSTASSSRATTCSRFPLVRSTPRPPR--WGGELPPRAQQTTWRP-----P 806
             +  P   PR    S+     C   P    T +PP   + G   PR  QTT RP     P
Sbjct: 836  PTTTP--APRCYPGSND--PRCPTTPRPTPTTQPPLRCYPGSTDPRCPQTTPRPVPTTTP 891

Query: 807  LQRWRCQTSRHHC-WTRPRPLSRLPCFTSVSSGRAWVRSCMSTPQWRKLRSKRTGLTPHC 983
              R    ++   C  T PRP+      T+  + R +  S                  P C
Sbjct: 892  APRCYPGSNDPRCPQTTPRPVP-----TTTPAPRCYPGSN----------------DPRC 930

Query: 984  RRRRPRPPPAPPSPP---QPWARQPLTPQTSGTTPPS 1085
             +  PRP   P  PP    P +  P  PQT+    P+
Sbjct: 931  PQTTPRPTQPPTQPPLRCYPGSADPRCPQTTPRPVPT 967

 Score = 81.3 bits (199), Expect = 3e-14
 Identities = 101/377 (26%), Positives = 138/377 (35%), Gaps = 26/377 (6%)
 Frame = +3

Query: 39   CGCTTAANGLWSPAGPRTRSGATRSPSSASAACSSS*PAAAFPSSPSSRRVPGVRSHQTP 218
            C  TTAA   + P     R   T  P++A    ++  P   +P S   R        QT 
Sbjct: 6    CSQTTAAPRCY-PGSNDPRCPTTPRPTTARPTPTTQPPLRCYPGSTDPR------CPQTT 58

Query: 219  RRPSGRPPALMLCRPAA--PRIASPLSTPTWPTH----------PKST----LPSLTCA- 347
             RP        +C P +  PR   P +T   P            P++T    +P+ T A 
Sbjct: 59   ARPVPTTTPAPVCYPGSTDPRCPKPTTTTQPPLRCYPGSNDPRCPQTTTTRPVPTTTAAP 118

Query: 348  -LFPAACLLPCCRTAARLVLRNGPSLLLTWRCRPPANPSPRCPRLPRRPPPRSRH--RCV 518
              +P +    C +T  R V    P+     RC P +N  PRCP+   RP P ++   RC 
Sbjct: 119  RCYPGSTDPRCPQTTTRPV----PTTTAAPRCYPGSN-DPRCPQTTPRPTPTTQPPLRCY 173

Query: 519  PALMACPAPRTRRTGSPSPPTTPPSPCPP----PSPSSCSAATRRRLSSRPPWLPPRLST 686
            P       P+T  T  P P TTP   C P    P     S  T+  L   P    PR   
Sbjct: 174  PGSNDPRCPQT--TPRPVPTTTPAPVCYPGSTDPRCPKPSTTTQPPLRCYPGSTDPRCPQ 231

Query: 687  ASSSRATTCSRFPLVRSTPRPPRWGGELPPRAQQTTWRPPLQRWRCQTSRHHCWTRPRPL 866
             +          P+  +TP P  + G   PR  QTT RP               T P P+
Sbjct: 232  TTPR--------PVPTTTPAPRCYPGSNDPRCPQTTPRPVPT------------TTPAPV 271

Query: 867  SRLPCFTSVSSGRAWVRSCMSTPQWRKLRSKRTGLTPHCRRRRPRPPPAPPSPPQ--PWA 1040
                C+   +  R    +  + P    LR       P C +  PRP P     P+  P +
Sbjct: 272  ----CYPGSTDPRCPKPTTTTQP---PLRCYPGSTDPRCPQTTPRPVPTTTPAPRCYPGS 324

Query: 1041 RQPLTPQTSGTTPPSWT 1091
              P  PQT+    P+ T
Sbjct: 325  NDPRCPQTTARPVPTTT 341

 Score = 80.5 bits (197), Expect = 5e-14
 Identities = 95/344 (27%), Positives = 130/344 (37%), Gaps = 5/344 (1%)
 Frame = +3

Query: 75   PAGPRTRSGATRSPSSASAACSSS*PAAAFPSSPSSRRVPGVRSHQTPRRPSGRPPALML 254
            P  P T+      P S    C  + P     ++ + R  PG      PR P   P  +  
Sbjct: 585  PTLPPTQPPLRCYPGSTDPRCPQTTPRPVPTTTAARRCYPGSND---PRCPQTTPRPVPT 641

Query: 255  CRPAAPRIASPLST-PTWPTHPKSTLPSLTCALFPAACLLPCCRTAARLVLRNGPSLLLT 431
              PA   +  P ST P  P    +T P L C  +P +    C +T  R V    P+    
Sbjct: 642  TTPAP--VCYPGSTDPRCPKPTTTTQPPLRC--YPGSTDPRCPQTTPRPVPTTTPAP--- 694

Query: 432  WRCRPPANPSPRCPRLPRRPPPRSRHRCVPALMACPAPRTRRTGSPSPPTTPPSPCPPPS 611
             RC P +N  PRCP+      PR+     PA +  P     R   P+  T PP  C P S
Sbjct: 695  -RCYPGSN-DPRCPQTT----PRALPTTTPAPVCYPGSTDPRCPKPTTTTQPPLRCYPGS 748

Query: 612  PSS-CSAATRRRLSSRPPWLPPRLSTASSS-RATTCSRFPLVRSTPRPPRWGGELPPRAQ 785
                C   T R + +  P   PR    S+  R    +  P+  +TP P  + G   PR  
Sbjct: 749  TDPRCPQTTPRPVPTTTP--APRCYPGSNDPRCPQTTPRPVPTTTPAPVCYPGSTDPRCP 806

Query: 786  Q--TTWRPPLQRWRCQTSRHHCWTRPRPLSRLPCFTSVSSGRAWVRSCMSTPQWRKLRSK 959
            +  TT +PPL+ +   T      T PRP+      T+  + R +  S             
Sbjct: 807  KPTTTTQPPLRCYPGSTDPRCPQTTPRPVP-----TTTPAPRCYPGS----------NDP 851

Query: 960  RTGLTPHCRRRRPRPPPAPPSPPQPWARQPLTPQTSGTTPPSWT 1091
            R   TP     RP P   PP    P +  P  PQT+    P+ T
Sbjct: 852  RCPTTP-----RPTPTTQPPLRCYPGSTDPRCPQTTPRPVPTTT 890

 Score = 76.6 bits (187), Expect = 8e-13
 Identities = 90/347 (25%), Positives = 118/347 (33%), Gaps = 21/347 (6%)
 Frame = +3

Query: 114  PSSASAACSSS*PAAAFPSSPSSRRVPGV---RSHQTPRRPSGRPPALMLCRPAAPRIAS 284
            P S    C  + P     ++P+ R  PG    R  QT  RP        +C P       
Sbjct: 296  PGSTDPRCPQTTPRPVPTTTPAPRCYPGSNDPRCPQTTARPVPTTTPAPVCYPG------ 349

Query: 285  PLSTPTWPTHPKSTLPSLTCALFPAACLLPCCRTAARLVLRNGPSLLL-----TWRCRPP 449
             L+ P  P    +T P L C  +P +    C +T  R V    P+          RC   
Sbjct: 350  -LTDPRCPKPATTTQPPLRC--YPGSTDPRCPQTTTRPVPTTTPAPRCYPGSNDPRCPTT 406

Query: 450  ANPSPRCPRLPRRPPPRSRHRCVPALMACPAPRTRRTGSPSPPTTPPSPCPPPSPSSCSA 629
              P+P     PR  P  S  RC       P P T R     P +T P  CP         
Sbjct: 407  PRPTPTTTSAPRCYPGSSDPRCPQTTTPRPVPTTTRGPVCYPGSTDPR-CP--------- 456

Query: 630  ATRRRLSSRPPWLPPRLSTASSSRATTCSRFPLVRSTPRPPRWGGELPPRAQQTTWRP-- 803
                + ++RP  +P   +       +T  R P     P P  + G   PR  QTT RP  
Sbjct: 457  ----QTTTRP--VPTTTAAPRCYPGSTDPRCPQTTPRPEPKCYPGSTDPRCPQTTTRPVP 510

Query: 804  ---PLQRWRCQTSRHHCWTRPRPLSRLPCFTSVSSGRAWVRSCMSTPQ-----WRKLRSK 959
               P  R    ++   C T PRP            G    R   +TP+         R  
Sbjct: 511  TTTPALRCYPGSNDPRCPTTPRPTPTTTAAPRCYPGSTDPRCPQTTPRPVPTTTPAPRCY 570

Query: 960  RTGLTPHCRRRRPRPPPAPPSPP---QPWARQPLTPQTSGTTPPSWT 1091
                 P C +  PRP   P  PP    P +  P  PQT+    P+ T
Sbjct: 571  PGSTDPRCPQTTPRPTLPPTQPPLRCYPGSTDPRCPQTTPRPVPTTT 617

 Score = 54.3 bits (129), Expect = 4e-06
 Identities = 62/233 (26%), Positives = 81/233 (34%), Gaps = 22/233 (9%)
 Frame = +3

Query: 462  PRCPRLPRRPPPRSRHRCVPALM--ACPAPRTRRTGSPSPPTTPPSPCPPPSPS-SCSAA 632
            PRC +    P      RC P      CP      T  P+P T PP  C P S    C   
Sbjct: 4    PRCSQTTAAP------RCYPGSNDPRCPTTPRPTTARPTPTTQPPLRCYPGSTDPRCPQT 57

Query: 633  TRRRLSSRPPWLPPRLSTASSSRATTCSRFPLVRSTPRPPR--WGGELPPRAQQTTWRPP 806
            T R + +  P              +T  R P   +T +PP   + G   PR  QTT   P
Sbjct: 58   TARPVPTTTP-------APVCYPGSTDPRCPKPTTTTQPPLRCYPGSNDPRCPQTTTTRP 110

Query: 807  LQRWRCQTSRHHCW------------TRPRPLSRLP--CFTSVSSGRAWVRSCMSTPQWR 944
            +      T+   C+            TRP P +     C+   +  R    +   TP  +
Sbjct: 111  VP---TTTAAPRCYPGSTDPRCPQTTTRPVPTTTAAPRCYPGSNDPRCPQTTPRPTPTTQ 167

Query: 945  -KLRSKRTGLTPHCRRRRPRPPPAPPSPP--QPWARQPLTPQTSGTTPPSWTC 1094
              LR       P C +  PRP P     P   P +  P  P+ S TT P   C
Sbjct: 168  PPLRCYPGSNDPRCPQTTPRPVPTTTPAPVCYPGSTDPRCPKPSTTTQPPLRC 220

 Score = 38.9 bits (89), Expect = 0.18
 Identities = 42/161 (26%), Positives = 59/161 (36%), Gaps = 27/161 (16%)
 Frame = +3

Query: 114  PSSASAACSSS*PAAAFPSSPSSRRVPGV---RSHQTPR-RPSGRPPALMLCRPAAPRIA 281
            P S    C  + P     ++P+ R  PG    R   TPR  P+ +PP  + C P +    
Sbjct: 820  PGSTDPRCPQTTPRPVPTTTPAPRCYPGSNDPRCPTTPRPTPTTQPP--LRCYPGSTDPR 877

Query: 282  SPLSTP-----------------------TWPTHPKSTLPSLTCALFPAACLLPCCRTAA 392
             P +TP                       T P    +T P+  C  +P +    C +T  
Sbjct: 878  CPQTTPRPVPTTTPAPRCYPGSNDPRCPQTTPRPVPTTTPAPRC--YPGSNDPRCPQTTP 935

Query: 393  RLVLRNGPSLLLTWRCRPPANPSPRCPRLPRRPPPRSRHRC 515
            R      P      RC P  +  PRCP+   RP P +R  C
Sbjct: 936  R---PTQPPTQPPLRCYP-GSADPRCPQTTPRPVPTTRQPC 972

>ref|NP_730262.2| CG13731-PA [Drosophila melanogaster] gi|28380492|gb|AAF49349.4|
            CG13731-PA [Drosophila melanogaster]
          Length = 926

 Score = 82.8 bits (203), Expect = 1e-14
 Identities = 108/406 (26%), Positives = 141/406 (34%), Gaps = 41/406 (10%)
 Frame = +3

Query: 75   PAGPRTRSGATRSPSSASAACSSS*PAAAFPSSPSSRRVPGV-----RSHQTPRRPSGRP 239
            P  P TR   T  P           P    P+ P +  +P V     R+   P RP  RP
Sbjct: 405  PTKPPTRPPTTYLPPPTVRTTRPPPPPTRPPTKPPTTYLPPVTVRTTRATPPPTRPPTRP 464

Query: 240  PALMLCRPAAPRIASPLSTPTWPTHPKSTLPSLTCALFPAACLLPCCRTAARLVLRNGPS 419
            P      P   R+ +P  T   PT+    LP +T             RT  R   R  P+
Sbjct: 465  PTYP---PTTRRLTTPAPTYLPPTN--KPLPPVTV------------RTTVRTTPR--PT 505

Query: 420  LLLTWRCRPPANPS-----PRCPRLPRRPPPRSRHRCVPALMACPAPRTRRTGSPSPPTT 584
            L  T   +PP  P      P   R  R PPP +R    P     P    R T +  PPT 
Sbjct: 506  LPPT---KPPTRPPTTYLPPPTVRTTRPPPPPTRPPTKPPTTYLPPVTVRTTRATPPPTR 562

Query: 585  PPSPCPPPSPSSCSAATRRRLSSRPPWLPPR------LSTASSSRATTCSRFPLVRSTPR 746
            PP+  P P P+       R  +  P +LPP       ++  ++ R T     P  +   R
Sbjct: 563  PPTRPPTPPPTRPPPPPTRASTPAPTYLPPTNKPLPPVTVRTTVRTTPRPTLPPTKPPTR 622

Query: 747  PPRWGGELPPRAQQTTWRPPLQRWRCQTSRHHCWTRP-------------RPLSRLPCFT 887
            PP     LPP + +TT RPP    R  T     +  P             RP +R P + 
Sbjct: 623  PPT--TYLPPPSVRTT-RPPPPPTRPPTKPPTTYLPPVTVRTTRATPPPTRPPTRPPTYP 679

Query: 888  SVSSGRAWVRSCMSTPQWRKLRSKRTGLTPHCRRRRPRPPPAPPSPP-QPWARQPLT--- 1055
              +         ++TP    L      L P   R   R  P P  PP +P  R P T   
Sbjct: 680  PTTRR-------LTTPAPTYLPPTNKPLPPVTVRTTVRTTPRPTLPPTKPPTRPPTTYLP 732

Query: 1056 --------PQTSGTTPPSWTCCAWAPSLTASAARNGNVYSAAPPAT 1169
                    P    T PP+     + P +T    R      A PP T
Sbjct: 733  PPTVRTTRPPPPPTRPPTKPPTTYLPPVTVRTTR------ATPPPT 772

 Score = 79.7 bits (195), Expect = 9e-14
 Identities = 100/377 (26%), Positives = 130/377 (33%), Gaps = 24/377 (6%)
 Frame = +3

Query: 75   PAGPRTRSGATRSPSSASAACSSS*PAAAFPSSPSSRRVPGV-----RSHQTPRRPSGRP 239
            P  P TR   T  P           P    P+ P +  +P V     R+   P RP  RP
Sbjct: 508  PTKPPTRPPTTYLPPPTVRTTRPPPPPTRPPTKPPTTYLPPVTVRTTRATPPPTRPPTRP 567

Query: 240  PALMLCRPAAPRIASPLSTPTWPTHPKSTLPSLTCALFPAACLLPCCRTAARLVLRNGPS 419
            P     RP  P   +    PT+       LP +T             RT  R   R  P+
Sbjct: 568  PTPPPTRPPPPPTRASTPAPTYLPPTNKPLPPVTV------------RTTVRTTPR--PT 613

Query: 420  LLLTWRCRPPANPS-----PRCPRLPRRPPPRSRHRCVPALMACPAPRTRRTGSPSPPTT 584
            L  T   +PP  P      P   R  R PPP +R    P     P    R T +  PPT 
Sbjct: 614  LPPT---KPPTRPPTTYLPPPSVRTTRPPPPPTRPPTKPPTTYLPPVTVRTTRATPPPTR 670

Query: 585  PPSPCPPPSPSSCSAATRRRLSSRPPWLPPR------LSTASSSRATTCSRFPLVRSTPR 746
            PP+  P   P+     TRR  +  P +LPP       ++  ++ R T     P  +   R
Sbjct: 671  PPTRPPTYPPT-----TRRLTTPAPTYLPPTNKPLPPVTVRTTVRTTPRPTLPPTKPPTR 725

Query: 747  PPRWGGELPPRAQQTTWRPPLQRWRCQTSRHHCWTRPRPLSRLPCFTSVSSGRAWVRSCM 926
            PP     LPP   +TT RPP    R  T          P + LP  T        VR+  
Sbjct: 726  PPTT--YLPPPTVRTT-RPPPPPTRPPTKP--------PTTYLPPVT--------VRTTR 766

Query: 927  STPQWRKLRSKRTGLTPHCRRRRPRPP--------PAPPSPPQPWARQPLTPQTSGTTPP 1082
            +TP   +  ++     P  RR     P        P PP   +   R    P    T PP
Sbjct: 767  ATPPPTRPPTRPPTYPPTTRRLTTPAPTYLPPTNKPLPPVTVRTTVRTTPRPTLPPTRPP 826

Query: 1083 SWTCCAWAPSLTASAAR 1133
            +     + P  T    R
Sbjct: 827  TKPPTTYLPPPTVRTTR 843

 Score = 72.8 bits (177), Expect = 1e-11
 Identities = 96/355 (27%), Positives = 119/355 (33%), Gaps = 10/355 (2%)
 Frame = +3

Query: 75   PAGPRTRSGATRSPSSASAACSSS*PAAAFPSSPSSRRVPGV-----RSHQTPRRPSGRP 239
            P  P TR   T  P  +        P    P+ P +  +P V     R+   P RP  RP
Sbjct: 616  PTKPPTRPPTTYLPPPSVRTTRPPPPPTRPPTKPPTTYLPPVTVRTTRATPPPTRPPTRP 675

Query: 240  PALMLCRPAAPRIASPLSTPTWPTHPKSTLPSLTCALFPAACLLPCCRTAARLVLRNGPS 419
            P      P   R+ +P  T   PT+    LP +T             RT  R   R  P+
Sbjct: 676  PTYP---PTTRRLTTPAPTYLPPTN--KPLPPVTV------------RTTVRTTPR--PT 716

Query: 420  LLLTWRCRPPANPS-----PRCPRLPRRPPPRSRHRCVPALMACPAPRTRRTGSPSPPTT 584
            L  T   +PP  P      P   R  R PPP +R    P     P    R T +  PPT 
Sbjct: 717  LPPT---KPPTRPPTTYLPPPTVRTTRPPPPPTRPPTKPPTTYLPPVTVRTTRATPPPTR 773

Query: 585  PPSPCPPPSPSSCSAATRRRLSSRPPWLPPRLSTASSSRATTCSRFPLVRSTPRPPRWGG 764
            PP+  P   P+     TRR  +  P +LPP           T      VR+TPRP     
Sbjct: 774  PPTRPPTYPPT-----TRRLTTPAPTYLPPTNKPLPPVTVRTT-----VRTTPRPTL-PP 822

Query: 765  ELPPRAQQTTWRPPLQRWRCQTSRHHCWTRPRPLSRLPCFTSVSSGRAWVRSCMSTPQWR 944
              PP    TT+ PP       T R    TRP P    P                      
Sbjct: 823  TRPPTKPPTTYLPP------PTVRT---TRPPPPPTRP---------------------- 851

Query: 945  KLRSKRTGLTPHCRRRRPRPPPAPPSPPQPWARQPLTPQTSGTTPPSWTCCAWAP 1109
              +   T L P    R  RPPP P      +   P    T    PP+     + P
Sbjct: 852  PTKPPTTYLPPVTVVRTTRPPPPPTRRTTVYVPPPTVRTTVYVPPPTQRTTVYVP 906

 Score = 70.5 bits (171), Expect = 5e-11
 Identities = 91/334 (27%), Positives = 105/334 (31%), Gaps = 10/334 (2%)
 Frame = +3

Query: 198  VRSHQTPRRPSGRPPALMLCRPAAPRIASPLSTPTWPTHPKSTLPSLTCALFPAACLLPC 377
            VR+ Q P RP  RPP      P  P    P   PT+       LP +T  L         
Sbjct: 180  VRTTQPPTRPPTRPPTRP---PTRPPTRPPTPPPTYLPPTNKPLPPVTTRL--------- 227

Query: 378  CRTAARLVLRNGPSLLLTWRCRPPANPSPRCPRLPRRPPPRSRHRCVPALMACPAPRTRR 557
                                  PP  P PR P  P RPP R      P     PA     
Sbjct: 228  ----------------------PPPPPPPRTPP-PTRPPTR------PPTTRPPATYLPP 258

Query: 558  TGSPSPPTTPPSPCPPPSPSSCSAATRRRLSSRPPWLPPRLSTASSSRATTCSRFPLVRS 737
            T  P PP T   P PPPSP       R    +RPP  PP     ++    T    P V +
Sbjct: 259  TNKPLPPVTTRLPPPPPSP-------RTPPPTRPPTRPPTTRPPATYLPPTNKPLPPVTT 311

Query: 738  ----TPRPPRWGGELPPRAQQTTWRPPLQRWRCQTSRHHCWTRPRPLSRLPCFTSVSSGR 905
                 P PPR      P  +  T RPP              T  RP +  P        R
Sbjct: 312  RLPPPPPPPRTPPPTRPPTKPPTTRPPATYLPPTNKPPPPVTTRRP-TPPPTRPPPPPTR 370

Query: 906  AWVRSCMSTPQWRKLRSKRTGLTPHCRRRRPRPPPA--PPSPPQPWARQPLT----PQTS 1067
            A   +    P   K     T  T      RP PPP   P  PP  +   P      P   
Sbjct: 371  ASTPAPTYLPPTNKPLPPVTVRTTVRTTPRPTPPPTKPPTRPPTTYLPPPTVRTTRPPPP 430

Query: 1068 GTTPPSWTCCAWAPSLTASAARNGNVYSAAPPAT 1169
             T PP+     + P +T    R      A PP T
Sbjct: 431  PTRPPTKPPTTYLPPVTVRTTR------ATPPPT 458

 Score = 54.7 bits (130), Expect = 3e-06
 Identities = 65/225 (28%), Positives = 80/225 (34%), Gaps = 6/225 (2%)
 Frame = +3

Query: 150  PAAAFPSSPSSRRVPG-----VRSHQTPRRPSGRPPALMLCRPAAPRIASPLSTPTWP-T 311
            P    P+ P +  +P       R    P RP  +PP   L  P   R       PT P T
Sbjct: 718  PPTKPPTRPPTTYLPPPTVRTTRPPPPPTRPPTKPPTTYL-PPVTVRTTRATPPPTRPPT 776

Query: 312  HPKSTLPSLTCALFPAACLLPCCRTAARLVLRNGPSLLLTWRCRPPANPSPRCPRLPRRP 491
             P +  P+      PA   LP           N P   +T R      P P  P  P RP
Sbjct: 777  RPPTYPPTTRRLTTPAPTYLPPT---------NKPLPPVTVRTTVRTTPRPTLP--PTRP 825

Query: 492  PPRSRHRCVPALMACPAPRTRRTGSPSPPTTPPSPCPPPSPSSCSAATRRRLSSRPPWLP 671
            P +      P     P P  R T  P PPT PP+  PP +        R   ++RPP  P
Sbjct: 826  PTK------PPTTYLPPPTVRTTRPPPPPTRPPTK-PPTTYLPPVTVVR---TTRPPPPP 875

Query: 672  PRLSTASSSRATTCSRFPLVRSTPRPPRWGGELPPRAQQTTWRPP 806
             R       R T     P VR+T   P      PP  + T + PP
Sbjct: 876  TR-------RTTVYVPPPTVRTTVYVP------PPTQRTTVYVPP 907

 Score = 47.4 bits (111), Expect = 5e-04
 Identities = 54/209 (25%), Positives = 64/209 (29%), Gaps = 28/209 (13%)
 Frame = +3

Query: 75   PAGPRTRSGATRSPSSASAACSSS*PAAAFPSSPSSRRVPGVRSHQT---------PRRP 227
            P  P TR   T  P           P    P+ P +  +P V    T         P RP
Sbjct: 719  PTKPPTRPPTTYLPPPTVRTTRPPPPPTRPPTKPPTTYLPPVTVRTTRATPPPTRPPTRP 778

Query: 228  SGRPPALMLCRPAAPRIASPLSTPTWP--------THPKSTLPSLTCALFPAACLLPCCR 383
               PP        AP    P + P  P        T P+ TLP       P    LP   
Sbjct: 779  PTYPPTTRRLTTPAPTYLPPTNKPLPPVTVRTTVRTTPRPTLPPTRPPTKPPTTYLP--- 835

Query: 384  TAARLVLRNGPSLLLTWRCRPPANPSPRCP-----------RLPRRPPPRSRHRCVPALM 530
                      P  + T R  PP    P  P           R  R PPP +R   V    
Sbjct: 836  ----------PPTVRTTRPPPPPTRPPTKPPTTYLPPVTVVRTTRPPPPPTRRTTVYV-- 883

Query: 531  ACPAPRTRRTGSPSPPTTPPSPCPPPSPS 617
              P P  R T    PPT   +   PP+P+
Sbjct: 884  --PPPTVRTTVYVPPPTQRTTVYVPPAPT 910

 Score = 40.0 bits (92), Expect = 0.079
 Identities = 56/202 (27%), Positives = 73/202 (35%), Gaps = 4/202 (1%)
 Frame = +3

Query: 492  PPRSRHRCVPALMACPAPRTRRTGSPSPPTTPPS--PCPPPSPSSCSAATRRRLSSRPPW 665
            PP      +P     P  R        PPT PP+  P PPP+    +      +++R P 
Sbjct: 170  PPPDVPFDLPVRTTQPPTRPPTRPPTRPPTRPPTRPPTPPPTYLPPTNKPLPPVTTRLPP 229

Query: 666  LPPRLSTASSSRATTCSRFPLVR--STPRPPRWGGELPPRAQQTTWRPPLQRWRCQTSRH 839
             PP   T   +R  T  R P  R  +T  PP     LPP   +    PP  R    T   
Sbjct: 230  PPPPPRTPPPTRPPT--RPPTTRPPATYLPPT-NKPLPPVTTRLPPPPPSPRTPPPTRPP 286

Query: 840  HCWTRPRPLSRLPCFTSVSSGRAWVRSCMSTPQWRKLRSKRTGLTPHCRRRRPRPPPAPP 1019
               TRP P +R P             +    P  + L    T L P      P PPP  P
Sbjct: 287  ---TRP-PTTRPP-------------ATYLPPTNKPLPPVTTRLPP------PPPPPRTP 323

Query: 1020 SPPQPWARQPLTPQTSGTTPPS 1085
             P +P  + P T   +   PP+
Sbjct: 324  PPTRPPTKPPTTRPPATYLPPT 345

 Score = 33.1 bits (74), Expect = 9.7
 Identities = 51/182 (28%), Positives = 63/182 (34%), Gaps = 8/182 (4%)
 Frame = +1

Query: 112 RPPAPRRPAAAASQPP-PSPARLPRGGSLVCGVTRRLAALRAARR--H*CCAGRQPRASR 282
           RPP  R P    ++PP P P  LP     +  VT RL       R         +P  +R
Sbjct: 192 RPPT-RPPTRPPTRPPTPPPTYLPPTNKPLPPVTTRLPPPPPPPRTPPPTRPPTRPPTTR 250

Query: 283 HLSLHQPGR---LIPSRLCLLSPAPSSQPPACCRAAARPHAWCCATAPPSSLRGAAGP-- 447
             + + P     L P    L  P PS + P   R   RP      T PP++         
Sbjct: 251 PPATYLPPTNKPLPPVTTRLPPPPPSPRTPPPTRPPTRPP----TTRPPATYLPPTNKPL 306

Query: 448 RPTPRRAAPAFPAARRPGAATAACRP*WPALRPGRGGLARRLHPPLLPLLARRLRRPAAP 627
            P   R  P  P  R P       +P  P  RP    L     PP  P+  RR   P   
Sbjct: 307 PPVTTRLPPPPPPPRTPPPTRPPTKP--PTTRPPATYLPPTNKPP-PPVTTRRPTPPPTR 363

Query: 628 PP 633
           PP
Sbjct: 364 PP 365

>ref|ZP_00091493.1| COG1721: Uncharacterized conserved protein (some members contain a
           von Willebrand factor type A (vWA) domain) [Azotobacter
           vinelandii]
          Length = 746

 Score = 82.4 bits (202), Expect = 1e-14
 Identities = 101/302 (33%), Positives = 118/302 (38%), Gaps = 18/302 (5%)
 Frame = -3

Query: 941 PLRRRHAAAHPCAPRGDAGEAGQPREWSRSRPAVVTARLTA--PALQRR--------SPG 792
           PL RR    HP  PR DA    + R     + AV  A+ T    +L RR         PG
Sbjct: 43  PLFRRRFRTHPVHPRPDAQRYHRARGVRPGQRAVQAAQGTGVHSSLARRRDQPRPGQDPG 102

Query: 791 GLLGPRRQLPPPAWRPGRRSDEGEAAACRGA*RRGGGEPWRQPRRP*RQPAACGGGAAGR 612
           G  G     P     PGR    G AA          G    +P R  R  +A GG  AG 
Sbjct: 103 GPAGGHAGTPGD---PGRPRPAGSAAV--------PGARHAEPDRAGRHLSATGG-RAGP 150

Query: 611 RRRRARRGRSGGWRRRASPPRPGR----RAGHQGRHAAVAAPGRRAAGKAGAARRGVGRG 444
              +A  G  GG RRR  P  PGR    R   +GRHAA  A GRR AG AGA +R   R 
Sbjct: 151 FHAQAAHGLPGG-RRRVGPGSPGRPFAPRRHARGRHAASPAAGRRRAGAAGAGQRTAARR 209

Query: 443 PAAPRKEEGGA----VAQHQACGRAAARQQAGGWEEGAGERRQSRLGMSRPGWCRER*RD 276
             A      GA    +A+  + GRAA   ++G    G G   + RL            RD
Sbjct: 210 AGARLCRAPGAPYPRLARPDSGGRAARLDRSGALRPGPGPAARRRL------------RD 257

Query: 275 ARGCRPAQHQCRRAARRAARRLVTPHTRDPPRGRRAGEGGGWLAAAAGRRGAGGRASRAR 96
                P +HQ  R  R A           PPR   AG        A  RR  GG  +RA 
Sbjct: 258 -----PRRHQGLRPGRAA-----------PPRVPGAG--------ARHRRLVGGPGARAA 293

Query: 95  AR 90
           AR
Sbjct: 294 AR 295

 Score = 41.6 bits (96), Expect = 0.027
 Identities = 55/176 (31%), Positives = 64/176 (36%), Gaps = 6/176 (3%)
 Frame = +1

Query: 16  RAGGSGAAAVARRQPMACGHQLGLARALARLA-----RPPAPRRPAAAASQPPPSPARLP 180
           RAG   +A   R  P       GL     R+      RP APRR A       P+  R  
Sbjct: 136 RAGRHLSATGGRAGPFHAQAAHGLPGGRRRVGPGSPGRPFAPRRHARGRHAASPAAGRRR 195

Query: 181 RGGSLVCGVTRRLAALRAARRH*CCAGRQ-PRASRHLSLHQPGRLIPSRLCLLSPAPSSQ 357
            G +   G  +R AA RA  R     G   PR +R  S  +  RL   R   L P P   
Sbjct: 196 AGAA---GAGQRTAARRAGARLCRAPGAPYPRLARPDSGGRAARL--DRSGALRPGPGPA 250

Query: 358 PPACCRAAARPHAWCCATAPPSSLRGAAGPRPTPRRAAPAFPAARRPGAATAACRP 525
                R   R        A P  + GA G R       P   AA RPG  TAA +P
Sbjct: 251 ARRRLRDPRRHQGLRPGRAAPPRVPGA-GARHRRLVGGPGARAAARPGRGTAAMKP 305

 Score = 40.4 bits (93), Expect = 0.061
 Identities = 73/258 (28%), Positives = 88/258 (33%), Gaps = 22/258 (8%)
 Frame = +1

Query: 82  GLARALARLARPPAPRR----PAAAASQPPPSPARLPRGGSLVCGVTRRLAALRAARRH* 249
           G+  +LAR    P P +    PA   +  P  P R    GS         AA+  AR   
Sbjct: 83  GVHSSLARRRDQPRPGQDPGGPAGGHAGTPGDPGRPRPAGS---------AAVPGARH-- 131

Query: 250 CCAGRQPRASRHLSLH---------QPGRLIPSRLCLLSPAPSSQPPACCRAAARPHAWC 402
                  RA RHLS           Q    +P     + P    +P A  R A   HA  
Sbjct: 132 ---AEPDRAGRHLSATGGRAGPFHAQAAHGLPGGRRRVGPGSPGRPFAPRRHARGRHAAS 188

Query: 403 CATAPPSSLRGAAGPRPTPRRA------APAFPAAR--RPGAATAACR-P*WPALRPGRG 555
            A     +    AG R   RRA      AP  P  R  RP +   A R     ALRPG G
Sbjct: 189 PAAGRRRAGAAGAGQRTAARRAGARLCRAPGAPYPRLARPDSGGRAARLDRSGALRPGPG 248

Query: 556 GLARRLHPPLLPLLARRLRRPAAPPPHAAGCLHGRLGCRHGSPPPLLHAPRHAAASPSSD 735
             ARR      P   + LR   A PP   G      G RH          R     P + 
Sbjct: 249 PAARRRLRD--PRRHQGLRPGRAAPPRVPGA-----GARH----------RRLVGGPGA- 290

Query: 736 RRPGLHAGGGSCRLGPSR 789
            R     G G+  + P+R
Sbjct: 291 -RAAARPGRGTAAMKPAR 307

 Score = 37.7 bits (86), Expect = 0.39
 Identities = 77/249 (30%), Positives = 83/249 (32%), Gaps = 10/249 (4%)
 Frame = +1

Query: 445  PRPTPRRAAPAFPAARRPGAATAACRP*WPALRPGRGGLARRLHPPLLPLLARRLRRPA- 621
            PRP      PA   A  PG      RP   A  PG    AR   P       R L     
Sbjct: 95   PRPGQDPGGPAGGHAGTPGDPGRP-RPAGSAAVPG----ARHAEPDRA---GRHLSATGG 146

Query: 622  -APPPHAAGCLHGRLGCRH----GSP----PPLLHAPRHAAASPSSDRRPGLHAGGGSCR 774
             A P HA    HG  G R     GSP     P  HA    AASP++ RR    AG G  +
Sbjct: 147  RAGPFHAQAA-HGLPGGRRRVGPGSPGRPFAPRRHARGRHAASPAAGRRRAGAAGAG--Q 203

Query: 775  LGPSRPPGDRLCSAGAVRRAVTTAGRDRDHSRGCPASPASPRGAHGCAAACLRRSGGSCG 954
               +R  G RLC A                  G P  P   R   G  AA L RSG    
Sbjct: 204  RTAARRAGARLCRA-----------------PGAP-YPRLARPDSGGRAARLDRSGALRP 245

Query: 955  ASGLG*RRTVGVGGPGRRQHHHRLRSPGPVNP*HRRRLERRPRHGPAAPGRPA*QPQPRG 1134
              G   RR +      R    H+   PG   P        R R     PG  A     RG
Sbjct: 246  GPGPAARRRL------RDPRRHQGLRPGRAAPPRVPGAGARHRRLVGGPGARAAARPGRG 299

Query: 1135 TATCTAPRR 1161
            TA     RR
Sbjct: 300  TAAMKPARR 308

 Score = 36.6 bits (83), Expect = 0.87
 Identities = 48/148 (32%), Positives = 54/148 (36%), Gaps = 5/148 (3%)
 Frame = +1

Query: 16  RAGGSGAAAVARRQPMACGHQLGLARALARLARPPAPRRPAAAASQPPPSPARLPRGGSL 195
           R   S AA   R      G +    RA ARL R P    P  A        ARL R G+L
Sbjct: 184 RHAASPAAGRRRAGAAGAGQRTAARRAGARLCRAPGAPYPRLARPDSGGRAARLDRSGAL 243

Query: 196 VCGVTRRLAALRAARRH*CCAGRQPRASRHLSLHQPGRLIPSRLCLLSPAPSSQPPAC-- 369
                 R     AARR      R PR  RH  L +PGR  P R+    P   ++      
Sbjct: 244 ------RPGPGPAARRR----LRDPR--RHQGL-RPGRAAPPRV----PGAGARHRRLVG 286

Query: 370 ---CRAAARPHAWCCATAPPSSLRGAAG 444
               RAAARP     A  P   L    G
Sbjct: 287 GPGARAAARPGRGTAAMKPARRLLALFG 314

 Score = 35.4 bits (80), Expect = 1.9
 Identities = 34/107 (31%), Positives = 42/107 (38%)
 Frame = -3

Query: 401 HQACGRAAARQQAGGWEEGAGERRQSRLGMSRPGWCRER*RDARGCRPAQHQCRRAARRA 222
           H   GR   RQ   G   G   RR+ R     P    +R   ARG RP Q +  +AA+  
Sbjct: 24  HSRRGRPRTRQDLAGARSGPLFRRRFRTHPVHPRPDAQRYHRARGVRPGQ-RAVQAAQGT 82

Query: 221 ARRLVTPHTRDPPRGRRAGEGGGWLAAAAGRRGAGGRASRARARARP 81
                    RD PR  +  + GG     AG  G  GR   A + A P
Sbjct: 83  GVHSSLARRRDQPRPGQ--DPGGPAGGHAGTPGDPGRPRPAGSAAVP 127

>ref|ZP_00057775.1| COG1205: Distinct helicase family with a unique C-terminal domain
            including a metal-binding cysteine cluster [Thermobifida
            fusca]
          Length = 1958

 Score = 82.0 bits (201), Expect = 2e-14
 Identities = 136/416 (32%), Positives = 150/416 (35%), Gaps = 64/416 (15%)
 Frame = -3

Query: 1139 AVPRG*GCQAGRPG--AAGP*RGRRSRRLRCQG--LTGPGLRRRWWCWRRPGPPTPTVRR 972
            A P G G Q GRP   A GP R R +     +G  + G G+ R      RP       RR
Sbjct: 3    AAPGGAGVQ-GRPHRPARGPARRRPAGGAAGRGRPVVGGGVARL----ARPA------RR 51

Query: 971  QPSPLAPQLPPLRRRHAAAHPCAPRGDAGEAGQPREWSRSRPAVVTARLTAPALQRRSPG 792
            QP+ L     P  RR        PRG A   G PR   R RPA    R   PA+  R PG
Sbjct: 52   QPAHLPRARRPRLRR--------PRGGAALPG-PRRGGRRRPAARHRRDRRPAVAGRGPG 102

Query: 791  GLL-----GPRRQ---LPPPAWRPGRRSDEGEAAACRGA*RRGGGEPWRQPRRP*RQPAA 636
                    GPRR+    P P   P  R       A R   RR G  P RQ   P RQ   
Sbjct: 103  RPARLRRRGPRRRRRLCPRPRRTPRHR-------AARPRLRRTG--PRRQRPHPHRQ--- 150

Query: 635  CGGGAAGRRR--RRARRGRSG----GWRRRASPPRPGRRAGHQGRHAAVAAPGRRAAGKA 474
                  GRR   R   RGR G       RR  P      A H  R    AA  RR AG A
Sbjct: 151  ----RPGRRTPARPGHRGRGGPAVAAAARRGRPGHQRPAAPHPHRPRPAAARARRTAGGA 206

Query: 473  GA---ARR----GVGRGPAAPRK------EEGGAVAQHQACGRAAARQQAGGWEEGAGER 333
            GA   ARR    G  R PA PR+            A+    GR A R+       GA  R
Sbjct: 207  GAQLRARRRATAGRRRRPARPRRPLRRLRRTPRPHARGSGQGRVAGRRSGA---PGAAGR 263

Query: 332  RQSRLGMSRPGWCRER*RDARGCRPAQHQC---RRAARRAARRLVTPHTRD--------- 189
            RQ   G  RP    ++ R A+G  P  H+    RR  R A RRL    TR          
Sbjct: 264  RQPLPGRPRPVRGAQQHRAAQGALPGGHRPEADRRGVRPAPRRLPDSGTRPDRQPPPRPA 323

Query: 188  ---------------------PPRGRRAGEGGGWLAAAAGRRGAGGRASRARARAR 84
                                 P RGR    GG   A   G    G R  R R R R
Sbjct: 324  RVLARPARPVRRAGHRRPPGRPGRGRLPTAGGALPAGVGGTGRRGPRPRRRRPRPR 379

 Score = 81.6 bits (200), Expect = 2e-14
 Identities = 133/394 (33%), Positives = 142/394 (35%), Gaps = 34/394 (8%)
 Frame = +1

Query: 25   GSGAAAVAR---RQPMACGHQLGLARALARLARP------PAPRR-----PAA------- 141
            G G A +AR   RQP      L  AR   RL RP      P PRR     PAA       
Sbjct: 39   GGGVARLARPARRQPA----HLPRARR-PRLRRPRGGAALPGPRRGGRRRPAARHRRDRR 93

Query: 142  --AASQPPPSPARL----PRGGSLVCGVTRRLAALRAARRH*CCAGRQPRASR-HLSLHQ 300
               A + P  PARL    PR    +C   RR    RAAR      G  PR  R H    +
Sbjct: 94   PAVAGRGPGRPARLRRRGPRRRRRLCPRPRRTPRHRAARPRLRRTG--PRRQRPHPHRQR 151

Query: 301  PGRLIPSRLCLLSPAPSSQPPACCRAAARPHAWCCATAPPSSLRGAAGPRPTPRRAAPAF 480
            PGR  P+R     P    +      AAAR          P   R AA   P P R  PA 
Sbjct: 152  PGRRTPAR-----PGHRGRGGPAVAAAAR-------RGRPGHQRPAA---PHPHRPRPAA 196

Query: 481  PAARRP-GAATAACRP*WPALRPGRGGLARRLHPPLLPLLARRLRRPAAPPPHAAGCLHG 657
              ARR  G A A  R    A R    G  RR   P  PL  RRLRR   P PHA G   G
Sbjct: 197  ARARRTAGGAGAQLR----ARRRATAGRRRRPARPRRPL--RRLRR--TPRPHARGSGQG 248

Query: 658  RLGCRHGSPPPLLHAPRHAAASPSSDRRPGLHAGGGSCRLGPSRPPGDRLCSAGAVRRAV 837
            R+  R    P      +     P   R    H        G  RP  DR     A RR  
Sbjct: 249  RVAGRRSGAPGAAGRRQPLPGRPRPVRGAQQHRAAQGALPGGHRPEADRRGVRPAPRRLP 308

Query: 838  TTAGRDRDHSRGCPASPAS--PRGAHGCAAACLRRSGGSCGASGL---G*RRTVGVGGPG 1002
             +  R     R  P  PA    R A     A  RR  G  G   L   G     GVGG G
Sbjct: 309  DSGTRP---DRQPPPRPARVLARPARPVRRAGHRRPPGRPGRGRLPTAGGALPAGVGGTG 365

Query: 1003 RRQHHHRLRSPGPVNP*HRRRLERRPRHGPAAPG 1104
            RR    R R P P     RR   RR R  P   G
Sbjct: 366  RRGPRPRRRRPRP-----RRTHPRRNRTRPTVQG 394

 Score = 65.1 bits (157), Expect = 2e-09
 Identities = 126/399 (31%), Positives = 144/399 (35%), Gaps = 23/399 (5%)
 Frame = +1

Query: 34   AAAVARRQPMACGHQLGLAR-----ALARLARPPAPRRPAAAASQPPPSPARLPRGGSLV 198
            A   ARR+P   G   G  R      +ARLARP A R+PA       P   R PRGG+ +
Sbjct: 18   ARGPARRRP--AGGAAGRGRPVVGGGVARLARP-ARRQPAHLPRARRPR-LRRPRGGAAL 73

Query: 199  CGVTR----RLAALRAARRH*CCAGRQPRASRHLSLHQPGRLIPSRLCLLSPAPSSQPPA 366
             G  R    R AA     R    AGR P     L    P R    RLC   P P   P  
Sbjct: 74   PGPRRGGRRRPAARHRRDRRPAVAGRGPGRPARLRRRGPRRR--RRLC---PRPRRTPR- 127

Query: 367  CCRAAARPHAWCCATAPPSSLRGAAGPRPTPRRAAPAFPAARRPGAATAACRP*WPALRP 546
                AARP            LR     R  PRR  P  P  +RPG  T A     P  R 
Sbjct: 128  --HRAARPR-----------LR-----RTGPRRQRP-HPHRQRPGRRTPAR----PGHR- 163

Query: 547  GRGGLARRLHPPLLPLLARRLR----RPAAPPPHAAGCLHGRLGCRHGSPPPLLHAPRHA 714
            GRGG       P +   ARR R    RPAAP PH               P P     R  
Sbjct: 164  GRGG-------PAVAAAARRGRPGHQRPAAPHPHR--------------PRPAAARARRT 202

Query: 715  AASPSSDRRPGLHAGGGSCRLGPSRPPGDRLCSAGAVRRAVTTAGRD-RDHSRGCPASPA 891
            A    +  R    A  G  R  P+RP           RR +    R  R H+RG      
Sbjct: 203  AGGAGAQLRARRRATAGR-RRRPARP-----------RRPLRRLRRTPRPHARG------ 244

Query: 892  SPRGAHGCAAACLRRSGGSCGASGLG*RRTVGVGGP----GRRQHHHRLRSPGPVNP*HR 1059
            S +G         R +G   GA G   RR    G P    G +QH     + G +   HR
Sbjct: 245  SGQG---------RVAGRRSGAPGAAGRRQPLPGRPRPVRGAQQHR---AAQGALPGGHR 292

Query: 1060 RRLERR-----PRHGPAAPGRPA*QPQPRGTATCTAPRR 1161
               +RR     PR  P +  RP  QP PR       P R
Sbjct: 293  PEADRRGVRPAPRRLPDSGTRPDRQPPPRPARVLARPAR 331

 Score = 52.8 bits (125), Expect = 1e-05
 Identities = 83/247 (33%), Positives = 95/247 (37%), Gaps = 19/247 (7%)
 Frame = -3

Query: 689 GGGEPWRQPRRP*RQPAAC--GGGAAGRRRRRARRGRSGGWRRRASPPRPGRRAGHQGRH 516
           GG     +P RP R PA     GGAAGR R        GG  R A   RP RR   Q  H
Sbjct: 6   GGAGVQGRPHRPARGPARRRPAGGAAGRGRPVV----GGGVARLA---RPARR---QPAH 55

Query: 515 AAVAAPGRRAAGKAGAA----RRGVGRGPAAPRKEEGGAVAQHQACGRAAARQQAGGWEE 348
              A   R    + GAA    RRG  R PAA  + +       +  GR A  ++ G    
Sbjct: 56  LPRARRPRLRRPRGGAALPGPRRGGRRRPAARHRRDRRPAVAGRGPGRPARLRRRG---- 111

Query: 347 GAGERRQSRLGMSRPGWCRER*RDARGCRPAQHQCRRAARRAARRLVTPHTRDPPRGRRA 168
               RR+ RL   RP     R    R  RP   +     +R       P  R P R    
Sbjct: 112 ---PRRRRRL-CPRP----RRTPRHRAARPRLRRTGPRRQRPHPHRQRPGRRTPARPGHR 163

Query: 167 GEGGGWLAAAAGRRGAGG----------RASRARARARPS---W*PQAIGCRRATAAAPL 27
           G GG  +AAAA RRG  G          R   A ARAR +      Q    RRATA    
Sbjct: 164 GRGGPAVAAAA-RRGRPGHQRPAAPHPHRPRPAAARARRTAGGAGAQLRARRRATAGRRR 222

Query: 26  PPARVPR 6
            PAR  R
Sbjct: 223 RPARPRR 229

 Score = 45.4 bits (106), Expect = 0.002
 Identities = 61/181 (33%), Positives = 66/181 (35%), Gaps = 2/181 (1%)
 Frame = -3

Query: 1076 RRSRRLRCQGLTGPGLRRRWWCWRRPGPPTPTVRRQPSPLAPQLPPLRRRHAAAHPCAPR 897
            RR RR       G G  R     RR G P    RRQP P  P+     ++H AA    P 
Sbjct: 232  RRLRRTPRPHARGSGQGRV--AGRRSGAPGAAGRRQPLPGRPRPVRGAQQHRAAQGALPG 289

Query: 896  GDAGEAGQPREWSRSRPAVVTARLTAPALQRRSPGGLLGPRRQLPPPAWRPGRRSDEGEA 717
            G   EA +       RPA            RR P     P RQ PP   RP R       
Sbjct: 290  GHRPEADR----RGVRPA-----------PRRLPDSGTRPDRQPPP---RPARVLARPAR 331

Query: 716  AACRGA*RRGGGEP--WRQPRRP*RQPAACGGGAAGRRRRRARRGRSGGWRRRASPPRPG 543
               R   RR  G P   R P      PA  GG   GRR  R RR R    R R + PR  
Sbjct: 332  PVRRAGHRRPPGRPGRGRLPTAGGALPAGVGG--TGRRGPRPRRRRP---RPRRTHPRRN 386

Query: 542  R 540
            R
Sbjct: 387  R 387

 Score = 35.0 bits (79), Expect = 2.5
 Identities = 26/70 (37%), Positives = 30/70 (42%)
 Frame = -2

Query: 567 ASQSAASGAQGRPSGPARSGGGSGAAGGGEGGGSAARGWPGACSAT*GGGRGRCAAPGVR 388
           A+    +G QGRP  PAR     G A     GG+A RG P       GGG  R A P  R
Sbjct: 2   AAAPGGAGVQGRPHRPAR-----GPARRRPAGGAAGRGRPVV-----GGGVARLARPARR 51

Query: 387 PCGSTAASRR 358
                  +RR
Sbjct: 52  QPAHLPRARR 61

 Score = 33.5 bits (75), Expect = 7.4
 Identities = 55/165 (33%), Positives = 59/165 (35%), Gaps = 11/165 (6%)
 Frame = +1

Query: 709  HAAASPSSDRRPGLHAGGGSCRLGPSRPPGDRLCSAGAVRRAVTTAGRDRDH-------- 864
            H  A   + RRP   AGG +   G  RP        G V R    A R   H        
Sbjct: 15   HRPARGPARRRP---AGGAA---GRGRP-----VVGGGVARLARPARRQPAHLPRARRPR 63

Query: 865  ---SRGCPASPASPRGAHGCAAACLRRSGGSCGASGLG*RRTVGVGGPGRRQHHHRLRSP 1035
                RG  A P   RG     AA  RR            R  V   GPGR     RLR  
Sbjct: 64   LRRPRGGAALPGPRRGGRRRPAARHRRDR----------RPAVAGRGPGRPA---RLRRR 110

Query: 1036 GPVNP*HRRRLERRPRHGPAAPGRPA*QPQPRGTATCTAPRRQPP 1170
            GP     RRRL  RPR     P   A +P+ R T     PRRQ P
Sbjct: 111  GPRR---RRRLCPRPRR---TPRHRAARPRLRRTG----PRRQRP 145

>sp|Q9FPQ6|GP1_CHLRE Vegetative cell wall protein gp1 precursor (Hydroxyproline-rich
            glycoprotein 1) gi|12018147|gb|AAG45420.1|AF309494_1
            vegetative cell wall protein gp1 [Chlamydomonas
            reinhardtii]
          Length = 555

 Score = 79.0 bits (193), Expect = 2e-13
 Identities = 97/343 (28%), Positives = 118/343 (34%), Gaps = 5/343 (1%)
 Frame = +3

Query: 72   SPAGPRTRSGATRSPSSASAACSSS*PAAAFPSSPSSRRVPGVRSHQTPRRPSGRPPALM 251
            +P  P   S A  SP+  S A  S  P +  P SP S   P   +  +P  PS  PP+  
Sbjct: 44   APPSPAPPSPAPPSPAPPSPAPPSPGPPSPAPPSPPSPAPPSP-APPSPAPPSPAPPSPA 102

Query: 252  LCRPAAPRIASPLSTPTWPTHPKSTLPSLTCALFPAACLLPCCRTAARLVLRNGPSLLLT 431
               PA P  A P   P  P  P    PS      P A   P   + A             
Sbjct: 103  PPSPAPPSPAPPSPAPPSPPSPAPPSPS------PPAPPSPSPPSPA------------- 143

Query: 432  WRCRPPANPSPRCPRLPRRPPPRSRHRCVPALMACPAPRTRRTGSPSPPT-TPPSPCPP- 605
                PP  PSP  P  P  P P S    VP   A P+P       P PP+  PPSP PP 
Sbjct: 144  ----PPLPPSP-APPSPSPPVPPSPSPPVPPSPAPPSPTPPSPSPPVPPSPAPPSPAPPV 198

Query: 606  -PSPSSCSAATRRRLSSRPPWLPPRLSTASSSRATTCSRFPLVRSTPRPPRWGGELPPRA 782
             PSP+  S A     S  PP  P     +  S A           +P PP     +PP  
Sbjct: 199  PPSPAPPSPAPPVPPSPAPPSPPSPAPPSPPSPAPP---------SPSPPAPPSPVPPSP 249

Query: 783  QQTTWRPPLQRWRCQTSRHHCWTRPRPLSRLPCFTSV--SSGRAWVRSCMSTPQWRKLRS 956
               +  PP  +             P P    P  T +  S           TP      S
Sbjct: 250  APPSPAPPSPKPPAPPPPPSPPPPPPPRPPFPANTPMPPSPPSPPPSPAPPTPPTPPSPS 309

Query: 957  KRTGLTPHCRRRRPRPPPAPPSPPQPWARQPLTPQTSGTTPPS 1085
              + + P      P P P  P+P  P +  P TP  S +  PS
Sbjct: 310  PPSPVPPSPAPVPPSPAPPSPAPSPPPSPAPPTPSPSPSPSPS 352

 Score = 77.4 bits (189), Expect = 4e-13
 Identities = 105/364 (28%), Positives = 124/364 (33%), Gaps = 5/364 (1%)
 Frame = +3

Query: 54   AANGLWSPAGPRTRSGATRSPSSASAACSSS*PAAAFPSSPSSRRVPGVRSHQTPRRPSG 233
            +AN    P G      +   PS A  + +   PA   P+ PS    PG  S   P  PS 
Sbjct: 26   SANAQCVPGGIFNCPPSPAPPSPAPPSPAPPSPAPPSPAPPS----PGPPSPAPPSPPSP 81

Query: 234  RPPALMLCRPAAPRIASPLSTPTWPTHPKSTLPSLTCALFPAACLLPCCRTAARLVLRNG 413
             PP+     PA P  A P   P  P  P    PS      PA    P             
Sbjct: 82   APPSPAPPSPAPPSPAPPSPAPPSPAPPSPAPPS------PAPPSPPS------------ 123

Query: 414  PSLLLTWRCRPPANPSPRCPRLPRRPPPRSRHRCVPALMACPAPRTRRTGSPSPPTTPPS 593
                       PA PSP  P  P  P P S    +P   A P+P      SPSPP  PPS
Sbjct: 124  -----------PAPPSPS-PPAPPSPSPPSPAPPLPPSPAPPSPSPPVPPSPSPPV-PPS 170

Query: 594  PCP----PPSPSSCSAATRRRLSSRPPWLPPRLSTASSSRATTCSRFPLVRSTPRPPRWG 761
            P P    PPSPS     +    S  PP +PP  +  S +     S  P    +P PP   
Sbjct: 171  PAPPSPTPPSPSPPVPPSPAPPSPAPP-VPPSPAPPSPAPPVPPSPAPPSPPSPAPPSPP 229

Query: 762  GELPPRAQQTTWRPPLQRWRCQTSRHHCWTRPRPLSRLPCFTSVSSGRAWVRSCMSTPQW 941
               PP         P+       S       P P S  P             S    P  
Sbjct: 230  SPAPPSPSPPAPPSPVPPSPAPPS-------PAPPSPKPPAPPPPP------SPPPPPPP 276

Query: 942  RKLRSKRTGLTPHCRRRRPRP-PPAPPSPPQPWARQPLTPQTSGTTPPSWTCCAWAPSLT 1118
            R      T + P      P P PP PP+PP P    P+ P +    PPS    + APS  
Sbjct: 277  RPPFPANTPMPPSPPSPPPSPAPPTPPTPPSPSPPSPV-PPSPAPVPPSPAPPSPAPSPP 335

Query: 1119 ASAA 1130
             S A
Sbjct: 336  PSPA 339

 Score = 64.3 bits (155), Expect = 4e-09
 Identities = 65/189 (34%), Positives = 75/189 (39%), Gaps = 7/189 (3%)
 Frame = +3

Query: 72  SPAGPRTRSGATRSPSSA--SAACSSS*PAAAFPSSPSSRRVPGVRSHQTPRRPSGRPPA 245
           SPA P   S A  SP+     +    S P+ A PS PS    P   S   P  PS  PP+
Sbjct: 193 SPAPPVPPSPAPPSPAPPVPPSPAPPSPPSPAPPSPPS----PAPPSPSPPAPPSPVPPS 248

Query: 246 LMLCRPAAPRIASPLSTPTWPTHPKSTLPSLTCAL-FPAACLLPCCRTAARLVLRNGPSL 422
                PA P  A P   P  P  P S  P       FPA   +P           + PS 
Sbjct: 249 -----PAPPSPAPPSPKPPAPPPPPSPPPPPPPRPPFPANTPMP----------PSPPS- 292

Query: 423 LLTWRCRPPANPSPRCPRLPRRPPPRS----RHRCVPALMACPAPRTRRTGSPSPPTTPP 590
                  PP +P+P  P  P  P P S        VP   A P+P      SP+PPT  P
Sbjct: 293 -------PPPSPAPPTPPTPPSPSPPSPVPPSPAPVPPSPAPPSPAPSPPPSPAPPTPSP 345

Query: 591 SPCPPPSPS 617
           SP P PSPS
Sbjct: 346 SPSPSPSPS 354

 Score = 63.2 bits (152), Expect = 9e-09
 Identities = 72/249 (28%), Positives = 89/249 (34%), Gaps = 7/249 (2%)
 Frame = +3

Query: 447  PANPSPRCPRLPRRPPPRSRHRCVPALMACPAPRTRRTGSPSPPT-TPPSPCPP-PSPSS 620
            P +P+P  P  P   PP       P   A P+P       PSPP+  PPSP PP P+P S
Sbjct: 40   PPSPAPPSPAPPSPAPPSP----APPSPAPPSPGPPSPAPPSPPSPAPPSPAPPSPAPPS 95

Query: 621  CSAATRRRLSSRPPW-LPPRLSTASSSRATTCSRFPLVRSTPRPPRWGGELPPRAQQTTW 797
             +  +    S  PP   PP  +  S       S  P    +P PP     LPP     + 
Sbjct: 96   PAPPSPAPPSPAPPSPAPPSPAPPSPPSPAPPSPSPPAPPSPSPPSPAPPLPPSPAPPSP 155

Query: 798  RPPLQRWRCQTSRHHCWTRPRPLSRLPCFTSVSSGRAWVRSCMSTPQWRKLRSKRTGLTP 977
             PP+               P P   +P   +  S          TP      S    + P
Sbjct: 156  SPPVP--------------PSPSPPVPPSPAPPS---------PTPP-----SPSPPVPP 187

Query: 978  HCRRRRPRPP----PAPPSPPQPWARQPLTPQTSGTTPPSWTCCAWAPSLTASAARNGNV 1145
                  P PP    PAPPSP  P    P  P      PPS    A  PS +  A  +   
Sbjct: 188  SPAPPSPAPPVPPSPAPPSPAPPVPPSPAPPSPPSPAPPSPPSPA-PPSPSPPAPPSPVP 246

Query: 1146 YSAAPPATA 1172
             S APP+ A
Sbjct: 247  PSPAPPSPA 255

 Score = 60.1 bits (144), Expect = 7e-08
 Identities = 77/242 (31%), Positives = 95/242 (38%), Gaps = 16/242 (6%)
 Frame = +3

Query: 72  SPAGPRTRSGATRSPSSASAAC----SSS*PAAAFPSSPSSRRVPGVRSHQTPRRPSGRP 239
           SP+ P   S A  SP+  S +     S + P+ A P  PS    P   S   P  PS  P
Sbjct: 162 SPSPPVPPSPAPPSPTPPSPSPPVPPSPAPPSPAPPVPPS----PAPPSPAPPVPPSPAP 217

Query: 240 PALMLCRPAAPRIASPLSTPTWPTHPKSTLPSLTCALFPAACLLPCCRTAARLVLRNGPS 419
           P+     PA P   SP      P  P S +P       PA    P  +  A         
Sbjct: 218 PSPP--SPAPPSPPSPAPPSPSPPAPPSPVPPSPAPPSPAP---PSPKPPA--------- 263

Query: 420 LLLTWRCRPPANPSPRCPRLPRRPPPRSRHR--CVPALMACPAPRTRRTG-SPSPPT-TP 587
                   PP  PSP  P  PR P P +       P+    PAP T  T  SPSPP+  P
Sbjct: 264 --------PPPPPSPPPPPPPRPPFPANTPMPPSPPSPPPSPAPPTPPTPPSPSPPSPVP 315

Query: 588 PSPCP-PPSPSSCSAATRRRLSSRPPWLPPRLS-------TASSSRATTCSRFPLVRSTP 743
           PSP P PPSP+  S A     S  PP   P  S       + S S + + S  P+   +P
Sbjct: 316 PSPAPVPPSPAPPSPAPSPPPSPAPPTPSPSPSPSPSPSPSPSPSPSPSPSPSPIPSPSP 375

Query: 744 RP 749
           +P
Sbjct: 376 KP 377

 Score = 54.3 bits (129), Expect = 4e-06
 Identities = 57/191 (29%), Positives = 72/191 (36%), Gaps = 2/191 (1%)
 Frame = +3

Query: 72  SPAGPRTRSGATRSPSSASAACSSS*PAAAFPS-SPSSRRVPGVRSHQTPRRPSGRPPAL 248
           SPA P   S A  SPS   A  S   P+ A PS +P S + P      +P  P    P  
Sbjct: 222 SPAPPSPPSPAPPSPSPP-APPSPVPPSPAPPSPAPPSPKPPAPPPPPSPPPPPPPRPPF 280

Query: 249 MLCRPAAPRIASPLSTPTWPTHPKSTLPSLTCALFPAACLLPCCRTAARLVLRNGPSLLL 428
               P  P   SP  +P  PT P    PS    + P+   +P             PS   
Sbjct: 281 PANTPMPPSPPSPPPSPAPPTPPTPPSPSPPSPVPPSPAPVP-------------PS--- 324

Query: 429 TWRCRPPANPSPRCPRLPRRPPPRSRHRCVPALMACPAPRTRRTGSPSPPTTP-PSPCPP 605
                 P +P+P  P  P  P P       P+    P+P    + SPSP  +P P P P 
Sbjct: 325 ----PAPPSPAPSPPPSPAPPTPS------PSPSPSPSPSPSPSPSPSPSPSPSPIPSPS 374

Query: 606 PSPSSCSAATR 638
           P PS    A +
Sbjct: 375 PKPSPSPVAVK 385

 Score = 40.0 bits (92), Expect = 0.079
 Identities = 23/57 (40%), Positives = 26/57 (45%), Gaps = 1/57 (1%)
 Frame = +2

Query: 380 PHGRTPGAAQRPLP-PPYVALQAPGQPLAALPPPSPPPAAPEPPPLRAGPDGLPCAP 547
           P    P +   P P PP  A  +P  P  A P P+PP  AP  PP  A P   P AP
Sbjct: 78  PPSPAPPSPAPPSPAPPSPAPPSPAPPSPAPPSPAPPSPAPPSPPSPAPPSPSPPAP 134



EST assemble image


no.  clone        accession  start   end
1    HCL072c04_r  AV643583       1   499
2    MXL047a10_r  BP095766     229   505
3    MXL079e01_r  BP097676     277   491
4    MXL013f02_r  BP093775     354   627
5    LCL098g04_r  AV631724     367   813
6    HCL100d09_r  AV645268     396   723
7    LCL048h10_r  AV628915     471   726
8    CL02e03_r    AV397619     587  1162
9    LCL009e05_r  AV626462     763  1245




Chlamydomonas reinhardtii
Kazusa DNA Research Institute