Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC149547.3 + phase: 0 
         (492 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_195313.2| aspartyl protease family protein [Arabidopsis t...   650  0.0
emb|CAB81497.1| putative protein [Arabidopsis thaliana] gi|38058...   516  e-145
gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana] gi|179...   483  e-135
ref|XP_474980.1| OSJNBb0071D01.4 [Oryza sativa (japonica cultiva...   469  e-130
pir||B84556 hypothetical protein At2g17760 [imported] - Arabidop...   444  e-123
pir||T08860 hypothetical protein A_TM017A05.8 - Arabidopsis thal...   441  e-122
dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like p...   429  e-119
emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]    405  e-111
gb|AAO22575.1| unknown protein [Arabidopsis thaliana] gi|1840932...   397  e-109
dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza ...   390  e-107
gb|AAL79734.1| putative chloroplast nucleoid DNA-binding protein...   389  e-106
emb|CAB62655.1| putative protein [Arabidopsis thaliana] gi|11281...   384  e-105
ref|NP_190704.2| aspartyl protease family protein [Arabidopsis t...   367  e-100
ref|NP_190703.2| aspartyl protease family protein [Arabidopsis t...   365  2e-99
emb|CAB62656.1| putative protein [Arabidopsis thaliana] gi|15230...   338  2e-91
ref|XP_467818.1| aspartyl protease-like [Oryza sativa (japonica ...   308  3e-82
emb|CAB92049.1| putative protein [Arabidopsis thaliana] gi|15238...   302  1e-80
pir||T45766 hypothetical protein F24M12.390 - Arabidopsis thaliana    280  6e-74
emb|CAB62657.2| putative protein [Arabidopsis thaliana]               280  1e-73
emb|CAB62658.1| putative protein [Arabidopsis thaliana] gi|11357...   209  2e-52

>ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 524

 Score =  650 bits (1678), Expect = 0.0
 Identities = 317/451 (70%), Positives = 372/451 (82%), Gaps = 10/451 (2%)

Query: 4   FTKIIVIILIILHLSM-CCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYA 62
           F   + +I I++ LS   CN  IFTF MHHR+S+ VK+WS S    + ++P KGS EY+ 
Sbjct: 6   FKTTLFLIPILMLLSFGSCNGRIFTFEMHHRFSDEVKQWSDSTGRFA-KFPPKGSFEYFN 64

Query: 63  ELADRDRFLRGRRLSQFDA----GLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVAL 118
            L  RD  +RGRRLS+ ++     L FSDGNST RISSLGFLHYTT++LGTPG++FMVAL
Sbjct: 65  ALVLRDWLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVAL 124

Query: 119 DTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQ 178
           DTGSDLFWVPCDC +C+ T  + +AS    +F+LS+YNP  S+T+KKVTCNNSLC  RNQ
Sbjct: 125 DTGSDLFWVPCDCGKCAPTEGATYAS----EFELSIYNPKVSTTNKKVTCNNSLCAQRNQ 180

Query: 179 CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLD 238
           CLGTFS CPYMVSYVSA+TSTSGIL+EDV+HLT  D N + VEA V FGCGQVQSGSFLD
Sbjct: 181 CLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLD 240

Query: 239 VAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNV 298
           +AAPNGLFGLGMEKISVPS+L+REG  ADSFSMCFG DG+GRISFGDKGS DQ+ETPFN+
Sbjct: 241 IAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNL 300

Query: 299 NPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRP 358
           NPSHP YNIT+ +VRVGTTLID EFTALFD+GTSFTYLVDP Y+ +SESFHSQ +D+R  
Sbjct: 301 NPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHS 360

Query: 359 PDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSA 418
           PDSRIPF+YCYDMS D+N SLIPS+SLTM G S F + DPII+IST+ ELVYCLA+VKS+
Sbjct: 361 PDSRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDPIIVISTEGELVYCLAIVKSS 420

Query: 419 ELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           ELNIIGQN+MTGYRVVFDR KL+L WKK DC
Sbjct: 421 ELNIIGQNYMTGYRVVFDREKLVLAWKKFDC 451


>emb|CAB81497.1| putative protein [Arabidopsis thaliana] gi|3805854|emb|CAA21474.1|
           putative protein [Arabidopsis thaliana]
           gi|7486389|pir||T04698 hypothetical protein F4B14.150 -
           Arabidopsis thaliana
          Length = 455

 Score =  516 bits (1330), Expect = e-145
 Identities = 255/373 (68%), Positives = 301/373 (80%), Gaps = 14/373 (3%)

Query: 4   FTKIIVIILIILHLSM-CCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYA 62
           F   + +I I++ LS   CN  IFTF MHHR+S+ VK+WS S    + ++P KGS EY+ 
Sbjct: 6   FKTTLFLIPILMLLSFGSCNGRIFTFEMHHRFSDEVKQWSDSTGRFA-KFPPKGSFEYFN 64

Query: 63  ELADRDRFLRGRRLSQFDA----GLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVAL 118
            L  RD  +RGRRLS+ ++     L FSDGNST RISSLGFLHYTT++LGTPG++FMVAL
Sbjct: 65  ALVLRDWLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVAL 124

Query: 119 DTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQ 178
           DTGSDLFWVPCDC +C+ T  + +AS    +F+LS+YNP  S+T+KKVTCNNSLC  RNQ
Sbjct: 125 DTGSDLFWVPCDCGKCAPTEGATYAS----EFELSIYNPKVSTTNKKVTCNNSLCAQRNQ 180

Query: 179 CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLD 238
           CLGTFS CPYMVSYVSA+TSTSGIL+EDV+HLT  D N + VEA V FGCGQVQSGSFLD
Sbjct: 181 CLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLD 240

Query: 239 VAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNV 298
           +AAPNGLFGLGMEKISVPS+L+REG  ADSFSMCFG DG+GRISFGDKGS DQ+ETPFN+
Sbjct: 241 IAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNL 300

Query: 299 NPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRP 358
           NPSHP YNIT+ +VRVGTTLID EFTALFD+GTSFTYLVDP Y+ +SES     +D+R  
Sbjct: 301 NPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQDKRHS 356

Query: 359 PDSRIPFDYCYDM 371
           PDSRIPF+YCYDM
Sbjct: 357 PDSRIPFEYCYDM 369


>gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
           gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid
           DNA-binding protein [Arabidopsis thaliana]
           gi|30680102|ref|NP_849967.1| aspartyl protease family
           protein [Arabidopsis thaliana]
          Length = 513

 Score =  483 bits (1243), Expect = e-135
 Identities = 239/448 (53%), Positives = 311/448 (69%), Gaps = 11/448 (2%)

Query: 4   FTKIIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAE 63
           F  +++++     L  C     F F  HHR+S+ V         P    P + S +YY  
Sbjct: 11  FLGLLILLASSWVLDRCEGFGEFGFEFHHRFSDQVV-----GVLPGDGLPNRDSSKYYRV 65

Query: 64  LADRDRFLRGRRLSQFDAGLA-FSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGS 122
           +A RDR +RGRRL+  D  L  FSDGN T R+ +LGFLHY  + +GTP   FMVALDTGS
Sbjct: 66  MAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGS 125

Query: 123 DLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGT 182
           DLFW+PCDCT C     +   S+L    DL++Y+PN SSTS KV CN++LCT  ++C   
Sbjct: 126 DLFWLPCDCTNCVRELKAPGGSSL----DLNIYSPNASSTSTKVPCNSTLCTRGDRCASP 181

Query: 183 FSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAP 242
            S+CPY + Y+S  TS++G+LVEDVLHL   D +   + A V FGCGQVQ+G F D AAP
Sbjct: 182 ESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAP 241

Query: 243 NGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSH 302
           NGLFGLG+E ISVPS+L++EG  A+SFSMCFG DG GRISFGDKGS+DQ ETP N+   H
Sbjct: 242 NGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPH 301

Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRR-RPPDS 361
           PTYNIT+ ++ VG    D+EF A+FDSGTSFTYL D  Y+ +SESF+S   D+R +  DS
Sbjct: 302 PTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDS 361

Query: 362 RIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAELN 421
            +PF+YCY +SP+ ++   P+++LTM GGS + VY P+++I  +   VYCLA++K  +++
Sbjct: 362 ELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKIEDIS 421

Query: 422 IIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           IIGQNFMTGYRVVFDR KLILGWK+SDC
Sbjct: 422 IIGQNFMTGYRVVFDREKLILGWKESDC 449


>ref|XP_474980.1| OSJNBb0071D01.4 [Oryza sativa (japonica cultivar-group)]
           gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza
           sativa (japonica cultivar-group)]
          Length = 530

 Score =  469 bits (1206), Expect = e-130
 Identities = 238/432 (55%), Positives = 305/432 (70%), Gaps = 25/432 (5%)

Query: 31  HHRYSEPVKKWS----HSAPSPSHRWPEKGSVEYYAELADRDRFLRGRRLSQFDAG---- 82
           HHR+S PV++W+    H  P     WPE GS +Y A L   DR  R    +  D G    
Sbjct: 38  HHRFSSPVQRWAEARGHVLPGG---WPEHGSADYVAALNGHDR-RRALSAAGGDGGGGGD 93

Query: 83  ----LAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATR 138
               L FS+GN+T ++S+LGFLHY  + +GTPG  FMVALDTGSDLFW+PC C  C+   
Sbjct: 94  KPPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 153

Query: 139 SSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETS 198
           S+A  SA       S Y P+ SSTS+ V CN+  C  R +C  T S CPY + YVSA+TS
Sbjct: 154 SAASGSA-------SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTS 205

Query: 199 TSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSM 258
           +SG LVEDVL+L+  D    +++A ++FGCGQVQ+GSFLD AAPNGLFGLG++ IS+PS+
Sbjct: 206 SSGFLVEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSI 265

Query: 259 LSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTL 318
           L+++G T++SF+MCF RDGIGRISFGD+GS DQ+ETP +VNP HPTY I+I+++ VG +L
Sbjct: 266 LAQKGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSL 325

Query: 319 IDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTS 378
            D+EF+ +FD+GTSFTYL DP Y+ +++SFH+QV   R   DSRIPF+YCYD+S   +  
Sbjct: 326 TDLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRI 385

Query: 379 LIPSMSLTMGGGSRFVVYDPIIIISTQS-ELVYCLAVVKSAELNIIGQNFMTGYRVVFDR 437
             PS+SL   GGS F V D   +IS Q  E VYCLA+VKSA+LNIIGQNFMTG RVVFDR
Sbjct: 386 QTPSISLRTVGGSVFPVIDEGQVISIQQHEYVYCLAIVKSAKLNIIGQNFMTGLRVVFDR 445

Query: 438 GKLILGWKKSDC 449
            + ILGWKK +C
Sbjct: 446 ERKILGWKKFNC 457


>pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
          Length = 473

 Score =  444 bits (1142), Expect = e-123
 Identities = 223/413 (53%), Positives = 286/413 (68%), Gaps = 31/413 (7%)

Query: 64  LADRDRFLRGRRLSQFDAGLA-FSDGNSTFRISSLGF----------------LHYTTIE 106
           +A RDR +RGRRL+  D  L  FSDGN T R+ +LGF                LHY  + 
Sbjct: 1   MAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFFKVNVFMETCELFMRDLHYANVT 60

Query: 107 LGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKV 166
           +GTP   FMVALDTGSDLFW+PCDCT C     +   S+L    DL++Y+PN SSTS KV
Sbjct: 61  VGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSL----DLNIYSPNASSTSTKV 116

Query: 167 TCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIF 226
            CN++LCT  ++C    S+CPY + Y+S  TS++G+LVEDVLHL   D +   + A V F
Sbjct: 117 PCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTF 176

Query: 227 GCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDK 286
           GCGQVQ+G F D AAPNGLFGLG+E ISVPS+L++EG  A+SFSMCFG DG GRISFGDK
Sbjct: 177 GCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDK 236

Query: 287 GSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSE 346
           GS+DQ ETP N+   HPTYNIT+ ++ VG    D+EF A+FDSGTSFTYL D  Y+ +SE
Sbjct: 237 GSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISE 296

Query: 347 SFHSQVEDRR-RPPDSRIPFDYCYDM---------SPDSNTSLIPSMSLTMGGGSRFVVY 396
           SF+S   D+R +  DS +PF+YCY +          P+ ++   P+++LTM GGS + VY
Sbjct: 297 SFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDSFQYPAVNLTMKGGSSYPVY 356

Query: 397 DPIIIISTQSELVYCLAVVKSAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
            P+++I  +   VYCLA++K  +++IIGQNFMTGYRVVFDR KLILGWK+SDC
Sbjct: 357 HPLVVIPMKDTDVYCLAIMKIEDISIIGQNFMTGYRVVFDREKLILGWKESDC 409


>pir||T08860 hypothetical protein A_TM017A05.8 - Arabidopsis thaliana
          Length = 472

 Score =  441 bits (1133), Expect = e-122
 Identities = 223/413 (53%), Positives = 286/413 (68%), Gaps = 32/413 (7%)

Query: 64  LADRDRFLRGRRLSQFDAGLA-FSDGNSTFRISSLGF----------------LHYTTIE 106
           +A RDR +RGRRL+  D  L  FSDGN T R+ +LGF                LHY  + 
Sbjct: 1   MAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFFKVNVFMETCELFMRDLHYANVT 60

Query: 107 LGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKV 166
           +GTP   FMVALDTGSDLFW+PCDCT C     +   S+L    DL++Y+PN SSTS KV
Sbjct: 61  VGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSL----DLNIYSPNASSTSTKV 116

Query: 167 TCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIF 226
            CN++LCT  ++C    S+CPY + Y+S  TS++G+LVEDVLHL   D +   + A V F
Sbjct: 117 PCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTF 176

Query: 227 GCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDK 286
           GCGQVQ+G F D AAPNGLFGLG+E ISVPS+L++EG  A+SFSMCFG DG GRISFGDK
Sbjct: 177 GCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDK 236

Query: 287 GSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSE 346
           GS+DQ ETP N+   HPTYNIT+ ++ VG    D+EF A+FDSGTSFTYL D  Y+ +SE
Sbjct: 237 GSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISE 296

Query: 347 SFHSQVEDRR-RPPDSRIPFDYCYDM---------SPDSNTSLIPSMSLTMGGGSRFVVY 396
           SF+S   D+R +  DS +PF+YCY +          P+ ++   P+++LTM GGS + VY
Sbjct: 297 SFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDSFQYPAVNLTMKGGSSYPVY 356

Query: 397 DPIIIISTQSELVYCLAVVKSAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
            P+++I  +   VYCLA++K  +++IIGQNFMTGYRVVFDR KLILGWK+SDC
Sbjct: 357 HPLVVIPMKVN-VYCLAIMKIEDISIIGQNFMTGYRVVFDREKLILGWKESDC 408


>dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
           sativa (japonica cultivar-group)]
          Length = 732

 Score =  429 bits (1104), Expect = e-119
 Identities = 222/449 (49%), Positives = 303/449 (67%), Gaps = 20/449 (4%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++V+  ++L L+    A   +  +HHRYS  V++W+ +A +P H     G+ EYYA LA 
Sbjct: 8   VVVVRAVLLLLAAVAAAEALSLDVHHRYSAAVRRWA-AAAAPPH-----GTAEYYAALAG 61

Query: 67  RDRFLRGRRLSQFDAG----LAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGS 122
            D  LR R L     G     AF+DGN T+R++  GFLHY  + LGTP V F+VALDTGS
Sbjct: 62  HDG-LRRRSLGVGGGGGGAEFAFADGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGS 120

Query: 123 DLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGT 182
           DLFWVPCDC +C+  +S  + S     FD  VY+P  S+TS+KV C+++LC  +N C   
Sbjct: 121 DLFWVPCDCLKCAPFQSPNYGSL---KFD--VYSPAQSTTSRKVPCSSNLCDLQNACRSK 175

Query: 183 FSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAP 242
            ++CPY + Y+S  TS+SG+LVEDVL+LT       +V A ++FGCGQVQ+GSFL  AAP
Sbjct: 176 SNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAP 235

Query: 243 NGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSH 302
           NGL GLGM+  SVPS+L+ +G  A+SFSMCFG DG GRI+FGD GS DQ ETP NV   +
Sbjct: 236 NGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQN 295

Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSR 362
           P YNITI  + VG+  I  EF+A+ DSGTSFT L DP Y++++ SF +Q+   R   DS 
Sbjct: 296 PYYNITITGITVGSKSISTEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSS 355

Query: 363 IPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQ--SELVYCLAVVKSAEL 420
           +PF++CY +S  +N  + P++SLT  GGS F V DPII I+    + + YCLA++KS  +
Sbjct: 356 MPFEFCYSVS--ANGIVHPNVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGV 413

Query: 421 NIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           N+IG+NFM+G +VVFDR +++LGWK  +C
Sbjct: 414 NLIGENFMSGLKVVFDRERMVLGWKNFNC 442


>emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
          Length = 335

 Score =  405 bits (1040), Expect = e-111
 Identities = 193/257 (75%), Positives = 220/257 (85%), Gaps = 8/257 (3%)

Query: 115 MVALDTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCT 174
           MVALDTGSDLFWVPCDC +C+ T  + +AS    +F+LS+YNP  S+T+KKVTCNNSLC 
Sbjct: 1   MVALDTGSDLFWVPCDCGKCAPTEGATYAS----EFELSIYNPKVSTTNKKVTCNNSLCA 56

Query: 175 HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSG 234
            RNQCLGTFS CPYMVSYVSA+TSTSGIL+EDV+HLT  D N + VEA V FGCGQVQSG
Sbjct: 57  QRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSG 116

Query: 235 SFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDET 294
           SFLD+AAPNGLFGLGMEKISVPS+L+REG  ADSFSMCFG DG+GRISFGDKGS DQ+ET
Sbjct: 117 SFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEET 176

Query: 295 PFNVNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVED 354
           PFN+NPSHP YNIT+ +VRVGTTLID EFTALFD+GTSFTYLVDP Y+ +SES     +D
Sbjct: 177 PFNLNPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQD 232

Query: 355 RRRPPDSRIPFDYCYDM 371
           +R  PDSRIPF+YCYDM
Sbjct: 233 KRHSPDSRIPFEYCYDM 249


>gb|AAO22575.1| unknown protein [Arabidopsis thaliana] gi|18409320|ref|NP_566948.1|
           aspartyl protease family protein [Arabidopsis thaliana]
          Length = 529

 Score =  397 bits (1019), Expect = e-109
 Identities = 199/450 (44%), Positives = 290/450 (64%), Gaps = 13/450 (2%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++ ++++   L  C  +  F+F +HH +S+ VK+    +       PEKGS+EY+  LA 
Sbjct: 10  LLSLLVVCWGLERCEASGKFSFEVHHMFSDRVKQ----SLGLDDLVPEKGSLEYFKVLAQ 65

Query: 67  RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
           RDR +RGR L+    +  + F  GN T  I  LGFLHY  + +GTP   F+VALDTGSDL
Sbjct: 66  RDRLIRGRGLASNNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDL 125

Query: 125 FWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFS 184
           FW+PC+C   S          L+    L++Y+PN SSTS  + C++  C   ++C    S
Sbjct: 126 FWLPCNCG--STCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPAS 183

Query: 185 NCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNG 244
           +CPY + Y+S +T T+G L EDVLHL   D+  + V+AN+  GCG+ Q+G     AA NG
Sbjct: 184 SCPYQIQYLSKDTFTTGTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNG 243

Query: 245 LFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPSH 302
           L GLG++  SVPS+L++   TA+SFSMCFG   D +GRISFGDKG  DQ ETP       
Sbjct: 244 LLGLGLKDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPS 303

Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSR 362
           PTY +++ +V VG   + V+  ALFD+GTSFT+L++P Y  ++++F   V D+RRP D  
Sbjct: 304 PTYAVSVTEVSVGGDAVGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPE 363

Query: 363 IPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIII-STQSELVYCLAVVKSAE-- 419
           +PF++CYD+SP+  T L P +++T  GGS+  + +P+ I+ +  +  +YCL ++KS +  
Sbjct: 364 LPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFK 423

Query: 420 LNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           +NIIGQNFM+GYR+VFDR ++ILGWK+SDC
Sbjct: 424 INIIGQNFMSGYRIVFDRERMILGWKRSDC 453


>dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa (japonica
           cultivar-group)]
          Length = 476

 Score =  390 bits (1002), Expect = e-107
 Identities = 202/445 (45%), Positives = 280/445 (62%), Gaps = 49/445 (11%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++V+  ++L L+    A   +  +HHRYS  V++W+ +A  P                  
Sbjct: 8   VVVVRAVLLLLAAVAAAEALSLDVHHRYSAAVRRWAAAAAPP------------------ 49

Query: 67  RDRFLRGRRLSQFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFW 126
                                 + T  ++  GFLHY  + LGTP V F+VALDTGSDLFW
Sbjct: 50  ----------------------HGTADLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFW 87

Query: 127 VPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNC 186
           VPCDC +C+  +S  + S     FD  VY+P  S+TS+KV C+++LC  +N C    ++C
Sbjct: 88  VPCDCLKCAPFQSPNYGSL---KFD--VYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSC 142

Query: 187 PYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLF 246
           PY + Y+S  TS+SG+LVEDVL+LT       +V A ++FGCGQVQ+GSFL  AAPNGL 
Sbjct: 143 PYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLL 202

Query: 247 GLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYN 306
           GLGM+  SVPS+L+ +G  A+SFSMCFG DG GRI+FGD GS DQ ETP NV   +P YN
Sbjct: 203 GLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYN 262

Query: 307 ITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFD 366
           ITI  + VG+  I  EF+A+ DSGTSFT L DP Y++++ SF +Q+   R   DS +PF+
Sbjct: 263 ITITGITVGSKSISTEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFE 322

Query: 367 YCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQ--SELVYCLAVVKSAELNIIG 424
           +CY +S  +N  + P++SLT  GGS F V DPII I+    + + YCLA++KS  +N+IG
Sbjct: 323 FCYSVS--ANGIVHPNVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIG 380

Query: 425 QNFMTGYRVVFDRGKLILGWKKSDC 449
           +NFM+G +VVFDR +++LGWK  +C
Sbjct: 381 ENFMSGLKVVFDRERMVLGWKNFNC 405


>gb|AAL79734.1| putative chloroplast nucleoid DNA-binding protein [Oryza sativa]
           gi|54291046|dbj|BAD61723.1| aspartic proteinase
           nepenthesin II-like [Oryza sativa (japonica
           cultivar-group)]
          Length = 551

 Score =  389 bits (998), Expect = e-106
 Identities = 215/445 (48%), Positives = 278/445 (62%), Gaps = 26/445 (5%)

Query: 28  FTMHHRYSEPVKKWSHSAPSPSHRWPEK----GSVEYYAELADRDRFLRGRR-LSQFDAG 82
           F +HHRYS  V++W+         WP      GS EYY+ L+  D  L  RR L+Q D  
Sbjct: 29  FDLHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDGL 88

Query: 83  LAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAF 142
           + F+DGN T R+   G LHY  + +GTP   F+VALDTGSDLFWVPCDC +C A   +  
Sbjct: 89  VTFADGNITLRLD--GSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQC-APLGNLT 145

Query: 143 ASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGI 202
           A       +L  Y+P+ SSTSK VTC ++LC   N C    S+CPY V Y  A TS+SG 
Sbjct: 146 AVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGE 205

Query: 203 LVEDVLHLTQPDDNHDL-----VEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPS 257
           LVEDVL+LT+            V   V+FGCGQVQ+GSFLD AA +GL GLGMEK+SVPS
Sbjct: 206 LVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPS 265

Query: 258 MLSREGFT-ADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGT 316
           +L+  G   ++SFSMCF +DG+GRI+FGD GS DQ ETPF V  +H  YNI+I  + VG 
Sbjct: 266 ILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHSYYNISITSMSVGD 325

Query: 317 TLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRR-----RPPDSRIPFDYCYDM 371
             + + F A+ DSGTSFTYL DP Y+  + +F++Q+ +RR            PF+YCY +
Sbjct: 326 KNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSL 385

Query: 372 SPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSE------LVYCLAVVKS-AELNIIG 424
           SPD  T  +P +SLT  GG+ F V  P+  I+ Q        + YCLAV+KS   ++IIG
Sbjct: 386 SPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPIDIIG 445

Query: 425 QNFMTGYRVVFDRGKLILGWKKSDC 449
           QNFMTG +VVF+R K +LGW+K DC
Sbjct: 446 QNFMTGLKVVFNREKSVLGWQKFDC 470


>emb|CAB62655.1| putative protein [Arabidopsis thaliana] gi|11281623|pir||T45764
           hypothetical protein F24M12.370 - Arabidopsis thaliana
          Length = 519

 Score =  384 bits (987), Expect = e-105
 Identities = 198/449 (44%), Positives = 288/449 (64%), Gaps = 21/449 (4%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++ ++++   L  C  +  F+F +HH +S+ VK+    +       PEKGS+EY+  LA 
Sbjct: 10  LLSLLVVCWGLERCEASGKFSFEVHHMFSDRVKQ----SLGLDDLVPEKGSLEYFKVLAQ 65

Query: 67  RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
           RDR +RGR L+    +  + F  GN T  I  LGFLHY  + +GTP   F+VALDTGSDL
Sbjct: 66  RDRLIRGRGLASNNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDL 125

Query: 125 FWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFS 184
           FW+PC+C   S          L+    L++Y+PN SSTS  + C++  C   ++C    S
Sbjct: 126 FWLPCNCG--STCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPAS 183

Query: 185 NCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNG 244
           +CPY + Y+S +T T+G L EDVLHL   D+  + V+AN+  GCG+ Q+G     AA NG
Sbjct: 184 SCPYQIQYLSKDTFTTGTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNG 243

Query: 245 LFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPSH 302
           L GLG++  SVPS+L++   TA+SFSMCFG   D +GRISFGDKG  DQ ETP  + P+ 
Sbjct: 244 LLGLGLKDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETP--LLPTE 301

Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSR 362
           P    ++ +V VG   + V+  ALFD+GTSFT+L++P Y  ++++F   V D+RRP D  
Sbjct: 302 P----SVTEVSVGGDAVGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPE 357

Query: 363 IPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAE--L 420
           +PF++CYD+SP+  T L P +++T  GGS+  + +P+ I    +  +YCL ++KS +  +
Sbjct: 358 LPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFI---DNSAMYCLGILKSVDFKI 414

Query: 421 NIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           NIIGQNFM+GYR+VFDR ++ILGWK+SDC
Sbjct: 415 NIIGQNFMSGYRIVFDRERMILGWKRSDC 443


>ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 488

 Score =  367 bits (943), Expect = e-100
 Identities = 194/430 (45%), Positives = 278/430 (64%), Gaps = 24/430 (5%)

Query: 27  TFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRDRFLRGRRLSQFDAG---L 83
           +F +HHR+SE VK          H  PE GS++YY  L  RDR   GR+L+  +     +
Sbjct: 23  SFEIHHRFSEQVK-----TVLGGHGLPEMGSLDYYKALVHRDR---GRQLTSNNNNQTTI 74

Query: 84  AFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAFA 143
           +F+ GNST  IS   FLHY  + +GTP   F+VALDTGSDLFW+PC+C   ++T   +  
Sbjct: 75  SFAQGNSTEEIS---FLHYANVTIGTPAQWFLVALDTGSDLFWLPCNC---NSTCVRSME 128

Query: 144 SALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGIL 203
           +       L++YNP+ S +S KVTCN++LC  RN+C+   S+CPY + Y+S  + ++G+L
Sbjct: 129 TDQGERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKSTGVL 188

Query: 204 VEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREG 263
           VEDV+H++  +   +  +A + FGC + Q G F +VA  NG+ GL +  I+VP+ML + G
Sbjct: 189 VEDVIHMSTEEG--EARDARITFGCSESQLGLFKEVAV-NGIMGLAIADIAVPNMLVKAG 245

Query: 264 FTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVEF 323
             +DSFSMCFG +G G ISFGDKGS DQ ETP +   S   Y+++I + +VG   +D EF
Sbjct: 246 VASDSFSMCFGPNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEF 305

Query: 324 TALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSM 383
           TA FDSGT+ T+L++P Y+ L+ +FH  V DRR       PF++CY ++  S+   +PS+
Sbjct: 306 TATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSV 365

Query: 384 SLTMGGGSRFVVYDPIIIISTQ--SELVYCLAVVK--SAELNIIGQNFMTGYRVVFDRGK 439
           S  M GG+ + V+ PI++  T   S  VYCLAV+K  +A+ +IIGQNFMT YR+V DR +
Sbjct: 366 SFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQNFMTNYRIVHDRER 425

Query: 440 LILGWKKSDC 449
            ILGWKKS+C
Sbjct: 426 RILGWKKSNC 435


>ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 528

 Score =  365 bits (937), Expect = 2e-99
 Identities = 195/452 (43%), Positives = 279/452 (61%), Gaps = 18/452 (3%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++ ++++      C     F F +HH +S+ VK+    +       PE+GS+EY+  LA 
Sbjct: 10  LLSVLVVCWGFERCEATGKFGFEVHHIFSDSVKQ----SLGLGDLVPEQGSLEYFKVLAH 65

Query: 67  RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
           RDR +RGR L+    +  + F  GN T  +  LG L+Y  + +GTP   F+VALDTGSDL
Sbjct: 66  RDRLIRGRGLASNNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDL 125

Query: 125 FWVPCDC-TRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTF 183
           FW+PC+C T C           +     L++Y PN S+TS  + C++  C    +C    
Sbjct: 126 FWLPCNCGTTCIRDLEDI---GVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPS 182

Query: 184 SNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPN 243
           S CPY +SY S  T T G L++DVLHL   D+N   V+ANV  GCGQ Q+G F    + N
Sbjct: 183 SICPYQISY-SNSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVN 241

Query: 244 GLFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPF-NVNP 300
           G+ GLG++  SVPS+L++   TA+SFSMCFGR    +GRISFGD+G  DQ+ETPF +V P
Sbjct: 242 GVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAP 301

Query: 301 SHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPD 360
           S   Y + I+ V V    +D+   A FD+G+SFT+L +P Y  L++SF   VEDRRRP D
Sbjct: 302 S-TAYGVNISGVSVAGDPVDIRLFAKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVD 360

Query: 361 SRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQ-SELVYCLAVVKSA- 418
             +PF++CYD+SP++ T   P + +T  GGS+ ++ +P     TQ   ++YCL V+KS  
Sbjct: 361 PELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSVG 420

Query: 419 -ELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
            ++N+IGQNF+ GYR+VFDR ++ILGWK+S C
Sbjct: 421 LKINVIGQNFVAGYRIVFDRERMILGWKQSLC 452


>emb|CAB62656.1| putative protein [Arabidopsis thaliana]
           gi|15230458|ref|NP_190702.1| aspartyl protease family
           protein [Arabidopsis thaliana] gi|11281622|pir||T45765
           hypothetical protein F24M12.380 - Arabidopsis thaliana
          Length = 518

 Score =  338 bits (867), Expect = 2e-91
 Identities = 188/462 (40%), Positives = 268/462 (57%), Gaps = 36/462 (7%)

Query: 9   VIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRD 68
           +++LI   L  C  +  F+F +HH +S+ VK+            PE GS+EY+  LA RD
Sbjct: 1   MLVLIFWGLERCEASGKFSFEVHHMFSDVVKQ----TLGFDDLVPENGSLEYFKVLAHRD 56

Query: 69  RFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFW 126
           RF+RGR L+    +  L     N T  ++ LGFLHY  + LGTP   F+VALDTGSDLFW
Sbjct: 57  RFIRGRGLASNNEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFW 116

Query: 127 VPCDC-TRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSN 185
           +PC+C T C      A     +    L++Y PN S+TS  + C++  C    +C    S 
Sbjct: 117 LPCNCGTTCIHDLKDA---RFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESI 173

Query: 186 CPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGL 245
           CPY ++ +S+ T T+G L++DVLHL   D++   V ANV  GCGQ Q+G+F    A NG+
Sbjct: 174 CPYQIA-LSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGV 232

Query: 246 FGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPSHP 303
            GL M++ SVPS+L++   TA+SFSMCFGR    +GRISFGDKG  DQ+ETP     +  
Sbjct: 233 LGLSMKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETST 292

Query: 304 TYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRI 363
            Y + +  V VG   +DV   ALFD+G+SFT L++  Y   +++F   +ED+RRP D   
Sbjct: 293 AYGVNVTGVSVGGVPVDVPLFALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDF 352

Query: 364 PFDYCYDMSPDS-NTSLIPSMSLTMGGGSRFVVYDPI---------------IIISTQSE 407
           PF++CYD+  +  N+   P    +         Y+P                +  S +  
Sbjct: 353 PFEFCYDLREEHLNSDARPRHMQSK-------CYNPCRDDFRWRIQNDSQESVSYSNEGT 405

Query: 408 LVYCLAVVKSAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
            +YCL ++KS  LNIIGQN M+G+R+VFDR ++ILGWK+S+C
Sbjct: 406 KMYCLGILKSINLNIIGQNLMSGHRIVFDRERMILGWKQSNC 447


>ref|XP_467818.1| aspartyl protease-like [Oryza sativa (japonica cultivar-group)]
           gi|46390211|dbj|BAD15642.1| aspartyl protease-like
           [Oryza sativa (japonica cultivar-group)]
          Length = 520

 Score =  308 bits (788), Expect = 3e-82
 Identities = 180/449 (40%), Positives = 257/449 (57%), Gaps = 15/449 (3%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           +++++L          A   +  M HR S+  +    +      RWP +GS +Y+  L  
Sbjct: 9   LLLLLLAAAAAVAAAEAATLSARMVHRLSDEARL--AAGARGGRRWPRRGSGDYFRALVR 66

Query: 67  RDRFLRGRRLSQFDAGLAFSDGNSTFRISS-LGFLHYTTIELGTPGVKFMVALDTGSDLF 125
            D   + RR+      L+ S G S F   + LG+L+YT +++GTP   F+VALDTGSDLF
Sbjct: 67  SDLQRQKRRVGGKYQLLSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLF 126

Query: 126 WVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSN 185
           WVPCDC +C+    S++  +L  D DL +Y P+ S+TS+ + C++ LC+  + C      
Sbjct: 127 WVPCDCIQCAPL--SSYHGSL--DRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQP 182

Query: 186 CPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGL 245
           CPY + Y S  T++SG+L+ED+LHL   +  H  V A+VI GCG+ QSGS+L+  AP+GL
Sbjct: 183 CPYNIDYFSENTTSSGLLIEDMLHLDSRE-GHAPVNASVIIGCGKKQSGSYLEGIAPDGL 241

Query: 246 FGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPF-NVNPSHPT 304
            GLGM  ISVPS L+R G   +SFSMCF +D  GRI FGD+G   Q  TPF  +N    T
Sbjct: 242 LGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQT 301

Query: 305 YNITINQVRVGTTLID-VEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRI 363
           Y + +++  +G    +   F AL D+GTSFT L    Y  ++  F  Q+   R   D   
Sbjct: 302 YAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDD-Y 360

Query: 364 PFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIIS-TQSEL-VYCLAVVKSAE-L 420
            F+YCY   P      +P+++LT      F   +PI+  +  Q E  V+CLAV+ S E +
Sbjct: 361 SFEYCYSTGPLEMPD-VPTITLTFAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPV 419

Query: 421 NIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
            IIGQNFM GY VVFDR  + LGW +S+C
Sbjct: 420 GIIGQNFMVGYHVVFDRENMKLGWYRSEC 448


>emb|CAB92049.1| putative protein [Arabidopsis thaliana]
           gi|15238055|ref|NP_196570.1| aspartyl protease family
           protein [Arabidopsis thaliana] gi|11281625|pir||T50012
           hypothetical protein T31P16.70 - Arabidopsis thaliana
          Length = 528

 Score =  302 bits (774), Expect = 1e-80
 Identities = 174/440 (39%), Positives = 252/440 (56%), Gaps = 19/440 (4%)

Query: 23  AHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRDRFLRGRRLSQFDAG 82
           A +F+  + HR+S+  +  S   PS S   P K S+EYY  LA+ D   +   L      
Sbjct: 22  ASLFSSRLIHRFSDEGRA-SIKTPSSSDSLPNKQSLEYYRLLAESDFRRQRMNLGAKVQS 80

Query: 83  LAFSDGNSTFRISS-LGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSA 141
           L  S+G+ T    +  G+LHYT I++GTP V F+VALDTGS+L W+PC+C +C A  +S 
Sbjct: 81  LVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQC-APLTST 139

Query: 142 FASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSG 201
           + S+LA+  DL+ YNP+ SSTSK   C++ LC   + C      CPY V+Y+S  TS+SG
Sbjct: 140 YYSSLATK-DLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSG 198

Query: 202 ILVEDVLHLTQPDDNHDL-----VEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVP 256
           +LVED+LHLT   +N  +     V+A V+ GCG+ QSG +LD  AP+GL GLG  +ISVP
Sbjct: 199 LLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVP 258

Query: 257 SMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNV--NPSHPTYNITINQVRV 314
           S LS+ G   +SFS+CF  +  GRI FGD G   Q  TPF    N  +  Y + +    +
Sbjct: 259 SFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCI 318

Query: 315 GTT-LIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSP 373
           G + L    FT   DSG SFTYL +  Y +++      +    +  +  + ++YCY+ S 
Sbjct: 319 GNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKNFEG-VSWEYCYESSA 377

Query: 374 DSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELV-YCLAVVKSAELNI--IGQNFMTG 430
           +     +P++ L     + FV++ P+ +      LV +CL +  S +  I  IGQN+M G
Sbjct: 378 EPK---VPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSGQEGIGSIGQNYMRG 434

Query: 431 YRVVFDRGKLILGWKKSDCK 450
           YR+VFDR  + LGW  S C+
Sbjct: 435 YRMVFDRENMKLGWSPSKCQ 454


>pir||T45766 hypothetical protein F24M12.390 - Arabidopsis thaliana
          Length = 475

 Score =  280 bits (717), Expect = 6e-74
 Identities = 165/453 (36%), Positives = 236/453 (51%), Gaps = 73/453 (16%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++ ++++      C     F F +HH +S+ VK+    +       PE+GS+EY+  LA 
Sbjct: 10  LLSVLVVCWGFERCEATGKFGFEVHHIFSDSVKQ----SLGLGDLVPEQGSLEYFKVLAH 65

Query: 67  RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
           RDR +RGR L+    +  + F  GN T  +  LG L+Y  + +GTP   F+VALDTGSDL
Sbjct: 66  RDRLIRGRGLASNNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDL 125

Query: 125 FWVPCDC-TRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTF 183
           FW+PC+C T C           +     L++Y PN S+TS  + C++  C    +C    
Sbjct: 126 FWLPCNCGTTCIRDLEDI---GVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPS 182

Query: 184 SNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPN 243
           S CPY +SY S  T T G L++DVLHL   D+N   V+ANV  GCGQ Q+G F    + N
Sbjct: 183 SICPYQISY-SNSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVN 241

Query: 244 GLFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPS 301
           G+ GLG++  SVPS+L++   TA+SFSMCFGR    +GRISFG                 
Sbjct: 242 GVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVGRISFG----------------- 284

Query: 302 HPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDS 361
                                               D  Y+   E+    V  RRRP D 
Sbjct: 285 ------------------------------------DRGYTDQEETPFISVAPRRRPVDP 308

Query: 362 RIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQS-----ELVYCLAVVK 416
            +PF++CYD+SP++ T   P + +T  GGS+ ++ +P     TQ+      ++YCL V+K
Sbjct: 309 ELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQARHGEGNVMYCLGVLK 368

Query: 417 SAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           S  L I   NF+ GYR+VFDR ++ILGWK+S C
Sbjct: 369 SVRLKI--NNFVAGYRIVFDRERMILGWKQSLC 399


>emb|CAB62657.2| putative protein [Arabidopsis thaliana]
          Length = 475

 Score =  280 bits (715), Expect = 1e-73
 Identities = 165/453 (36%), Positives = 236/453 (51%), Gaps = 73/453 (16%)

Query: 7   IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
           ++ ++++      C     F F +HH +S+ VK+    +       PE+GS+EY+  LA 
Sbjct: 10  LLSVLVVCWGFERCEATGKFGFEVHHIFSDSVKQ----SLGLGDLVPEQGSLEYFKVLAH 65

Query: 67  RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
           RDR +RGR L+    +  + F  GN T  +  LG L+Y  + +GTP   F+VALDTGSDL
Sbjct: 66  RDRLIRGRGLASNNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDL 125

Query: 125 FWVPCDC-TRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTF 183
           FW+PC+C T C           +     L++Y PN S+TS  + C++  C    +C    
Sbjct: 126 FWLPCNCGTTCIRDLEDI---GVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPS 182

Query: 184 SNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPN 243
           S CPY +SY S  T T G L++DVLHL   D+N   V+ANV  GCGQ Q+G F    + N
Sbjct: 183 SICPYQISY-SNSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVN 241

Query: 244 GLFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPS 301
           G+ GLG++  SVPS+L++   TA+SFSMCFGR    +GRISFG                 
Sbjct: 242 GVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVGRISFG----------------- 284

Query: 302 HPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDS 361
                                               D  Y+   E+    V  RRRP D 
Sbjct: 285 ------------------------------------DRGYTDQEETPFISVAPRRRPVDP 308

Query: 362 RIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQS-----ELVYCLAVVK 416
            +PF++CYD+SP++ T   P + +T  GGS+ ++ +P     TQ+      ++YCL V+K
Sbjct: 309 ELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQARHGEGNVMYCLGVLK 368

Query: 417 SAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
           S  L I   NF+ GYR+VFDR ++ILGWK+S C
Sbjct: 369 SVGLKI--NNFVAGYRIVFDRERMILGWKQSLC 399


>emb|CAB62658.1| putative protein [Arabidopsis thaliana] gi|11357633|pir||T45767
           hypothetical protein F24M12.400 - Arabidopsis thaliana
          Length = 426

 Score =  209 bits (532), Expect = 2e-52
 Identities = 110/258 (42%), Positives = 167/258 (64%), Gaps = 9/258 (3%)

Query: 178 QCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFL 237
           +C+   S+CPY + Y+S  + ++G+LVEDV+H++  +   +  +A + FG  + Q G F 
Sbjct: 127 RCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEG--EARDARITFG--ESQLGLFK 182

Query: 238 DVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFN 297
           +VA  NG+ GL +  I+VP+ML + G  +DSFSMCFG +G G ISFGDKGS DQ ETP +
Sbjct: 183 EVAV-NGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQLETPLS 241

Query: 298 VNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRR 357
              S   Y+++I + +VG   +D EFTA FDSGT+ T+L++P Y+ L+ +FH  V DRR 
Sbjct: 242 GTISPMFYDVSITKFKVGKVTVDTEFTATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRL 301

Query: 358 PPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQ--SELVYCLAVV 415
                 PF++CY ++  S+   +PS+S  M GG+ + V+ PI++  T   S  VYCLAV+
Sbjct: 302 SKSVDSPFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVL 361

Query: 416 K--SAELNIIGQNFMTGY 431
           K  +A+ +IIG+N   G+
Sbjct: 362 KQVNADFSIIGRNDTNGF 379


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.323    0.137    0.425 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 843,440,475
Number of Sequences: 2540612
Number of extensions: 35671134
Number of successful extensions: 80934
Number of sequences better than 10.0: 747
Number of HSP's better than 10.0 without gapping: 243
Number of HSP's successfully gapped in prelim test: 504
Number of HSP's that attempted gapping in prelim test: 79597
Number of HSP's gapped (non-prelim): 1085
length of query: 492
length of database: 863,360,394
effective HSP length: 132
effective length of query: 360
effective length of database: 527,999,610
effective search space: 190079859600
effective search space used: 190079859600
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 77 (34.3 bits)
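
The search-space figures in the footer above follow from the reported lengths by simple arithmetic, and the bit scores and E-values in the alignment headers can be approximately recovered from the gapped Lambda and K. The following is a minimal sketch in Python, assuming the standard Karlin-Altschul relations (bit score = (Lambda * S - ln K) / ln 2, and E = search space * 2^-bits). The function names are illustrative and are not part of BLAST. Because Lambda and K are printed rounded, the recomputed values only approximate the reported ones.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 492            # "length of query"
DB_LEN = 863_360_394       # "length of database"
DB_SEQS = 2_540_612        # "Number of Sequences"
HSP_LEN = 132              # "effective HSP length"
GAPPED_LAMBDA = 0.267      # gapped Lambda (rounded in the report)
GAPPED_K = 0.0410          # gapped K (rounded in the report)


def effective_search_space(query_len, db_len, db_seqs, hsp_len):
    """Return (effective query length, effective db length, their product)."""
    eff_query = query_len - hsp_len
    # One HSP length is subtracted for every database sequence.
    eff_db = db_len - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


eff_q, eff_db, space = effective_search_space(QUERY_LEN, DB_LEN, DB_SEQS, HSP_LEN)
print(eff_q, eff_db, space)   # 360 527999610 190079859600

# Raw score 788 is the XP_467818.1 hit, reported as 308 bits, Expect = 3e-82.
print(bit_score(788))         # close to 308
print(expect(308, space))     # same order of magnitude as 3e-82
```

The first line of output reproduces the "effective length of query", "effective length of database", and "effective search space" entries exactly. The bit score and E-value agree with the report only to within rounding.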

