
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149547.3 + phase: 0
(492 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_195313.2| aspartyl protease family protein [Arabidopsis t... 650 0.0
emb|CAB81497.1| putative protein [Arabidopsis thaliana] gi|38058... 516 e-145
gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana] gi|179... 483 e-135
ref|XP_474980.1| OSJNBb0071D01.4 [Oryza sativa (japonica cultiva... 469 e-130
pir||B84556 hypothetical protein At2g17760 [imported] - Arabidop... 444 e-123
pir||T08860 hypothetical protein A_TM017A05.8 - Arabidopsis thal... 441 e-122
dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like p... 429 e-119
emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana] 405 e-111
gb|AAO22575.1| unknown protein [Arabidopsis thaliana] gi|1840932... 397 e-109
dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza ... 390 e-107
gb|AAL79734.1| putative chloroplast nucleoid DNA-binding protein... 389 e-106
emb|CAB62655.1| putative protein [Arabidopsis thaliana] gi|11281... 384 e-105
ref|NP_190704.2| aspartyl protease family protein [Arabidopsis t... 367 e-100
ref|NP_190703.2| aspartyl protease family protein [Arabidopsis t... 365 2e-99
emb|CAB62656.1| putative protein [Arabidopsis thaliana] gi|15230... 338 2e-91
ref|XP_467818.1| aspartyl protease-like [Oryza sativa (japonica ... 308 3e-82
emb|CAB92049.1| putative protein [Arabidopsis thaliana] gi|15238... 302 1e-80
pir||T45766 hypothetical protein F24M12.390 - Arabidopsis thaliana 280 6e-74
emb|CAB62657.2| putative protein [Arabidopsis thaliana] 280 1e-73
emb|CAB62658.1| putative protein [Arabidopsis thaliana] gi|11357... 209 2e-52
>ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 650 bits (1678), Expect = 0.0
Identities = 317/451 (70%), Positives = 372/451 (82%), Gaps = 10/451 (2%)
Query: 4 FTKIIVIILIILHLSM-CCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYA 62
F + +I I++ LS CN IFTF MHHR+S+ VK+WS S + ++P KGS EY+
Sbjct: 6 FKTTLFLIPILMLLSFGSCNGRIFTFEMHHRFSDEVKQWSDSTGRFA-KFPPKGSFEYFN 64
Query: 63 ELADRDRFLRGRRLSQFDA----GLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVAL 118
L RD +RGRRLS+ ++ L FSDGNST RISSLGFLHYTT++LGTPG++FMVAL
Sbjct: 65 ALVLRDWLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVAL 124
Query: 119 DTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQ 178
DTGSDLFWVPCDC +C+ T + +AS +F+LS+YNP S+T+KKVTCNNSLC RNQ
Sbjct: 125 DTGSDLFWVPCDCGKCAPTEGATYAS----EFELSIYNPKVSTTNKKVTCNNSLCAQRNQ 180
Query: 179 CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLD 238
CLGTFS CPYMVSYVSA+TSTSGIL+EDV+HLT D N + VEA V FGCGQVQSGSFLD
Sbjct: 181 CLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLD 240
Query: 239 VAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNV 298
+AAPNGLFGLGMEKISVPS+L+REG ADSFSMCFG DG+GRISFGDKGS DQ+ETPFN+
Sbjct: 241 IAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNL 300
Query: 299 NPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRP 358
NPSHP YNIT+ +VRVGTTLID EFTALFD+GTSFTYLVDP Y+ +SESFHSQ +D+R
Sbjct: 301 NPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHS 360
Query: 359 PDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSA 418
PDSRIPF+YCYDMS D+N SLIPS+SLTM G S F + DPII+IST+ ELVYCLA+VKS+
Sbjct: 361 PDSRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDPIIVISTEGELVYCLAIVKSS 420
Query: 419 ELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
ELNIIGQN+MTGYRVVFDR KL+L WKK DC
Sbjct: 421 ELNIIGQNYMTGYRVVFDREKLVLAWKKFDC 451
>emb|CAB81497.1| putative protein [Arabidopsis thaliana] gi|3805854|emb|CAA21474.1|
putative protein [Arabidopsis thaliana]
gi|7486389|pir||T04698 hypothetical protein F4B14.150 -
Arabidopsis thaliana
Length = 455
Score = 516 bits (1330), Expect = e-145
Identities = 255/373 (68%), Positives = 301/373 (80%), Gaps = 14/373 (3%)
Query: 4 FTKIIVIILIILHLSM-CCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYA 62
F + +I I++ LS CN IFTF MHHR+S+ VK+WS S + ++P KGS EY+
Sbjct: 6 FKTTLFLIPILMLLSFGSCNGRIFTFEMHHRFSDEVKQWSDSTGRFA-KFPPKGSFEYFN 64
Query: 63 ELADRDRFLRGRRLSQFDA----GLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVAL 118
L RD +RGRRLS+ ++ L FSDGNST RISSLGFLHYTT++LGTPG++FMVAL
Sbjct: 65 ALVLRDWLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVAL 124
Query: 119 DTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQ 178
DTGSDLFWVPCDC +C+ T + +AS +F+LS+YNP S+T+KKVTCNNSLC RNQ
Sbjct: 125 DTGSDLFWVPCDCGKCAPTEGATYAS----EFELSIYNPKVSTTNKKVTCNNSLCAQRNQ 180
Query: 179 CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLD 238
CLGTFS CPYMVSYVSA+TSTSGIL+EDV+HLT D N + VEA V FGCGQVQSGSFLD
Sbjct: 181 CLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLD 240
Query: 239 VAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNV 298
+AAPNGLFGLGMEKISVPS+L+REG ADSFSMCFG DG+GRISFGDKGS DQ+ETPFN+
Sbjct: 241 IAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNL 300
Query: 299 NPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRP 358
NPSHP YNIT+ +VRVGTTLID EFTALFD+GTSFTYLVDP Y+ +SES +D+R
Sbjct: 301 NPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQDKRHS 356
Query: 359 PDSRIPFDYCYDM 371
PDSRIPF+YCYDM
Sbjct: 357 PDSRIPFEYCYDM 369
>gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid
DNA-binding protein [Arabidopsis thaliana]
gi|30680102|ref|NP_849967.1| aspartyl protease family
protein [Arabidopsis thaliana]
Length = 513
Score = 483 bits (1243), Expect = e-135
Identities = 239/448 (53%), Positives = 311/448 (69%), Gaps = 11/448 (2%)
Query: 4 FTKIIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAE 63
F +++++ L C F F HHR+S+ V P P + S +YY
Sbjct: 11 FLGLLILLASSWVLDRCEGFGEFGFEFHHRFSDQVV-----GVLPGDGLPNRDSSKYYRV 65
Query: 64 LADRDRFLRGRRLSQFDAGLA-FSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGS 122
+A RDR +RGRRL+ D L FSDGN T R+ +LGFLHY + +GTP FMVALDTGS
Sbjct: 66 MAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGS 125
Query: 123 DLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGT 182
DLFW+PCDCT C + S+L DL++Y+PN SSTS KV CN++LCT ++C
Sbjct: 126 DLFWLPCDCTNCVRELKAPGGSSL----DLNIYSPNASSTSTKVPCNSTLCTRGDRCASP 181
Query: 183 FSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAP 242
S+CPY + Y+S TS++G+LVEDVLHL D + + A V FGCGQVQ+G F D AAP
Sbjct: 182 ESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTGVFHDGAAP 241
Query: 243 NGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSH 302
NGLFGLG+E ISVPS+L++EG A+SFSMCFG DG GRISFGDKGS+DQ ETP N+ H
Sbjct: 242 NGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPH 301
Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRR-RPPDS 361
PTYNIT+ ++ VG D+EF A+FDSGTSFTYL D Y+ +SESF+S D+R + DS
Sbjct: 302 PTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDS 361
Query: 362 RIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAELN 421
+PF+YCY +SP+ ++ P+++LTM GGS + VY P+++I + VYCLA++K +++
Sbjct: 362 ELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKIEDIS 421
Query: 422 IIGQNFMTGYRVVFDRGKLILGWKKSDC 449
IIGQNFMTGYRVVFDR KLILGWK+SDC
Sbjct: 422 IIGQNFMTGYRVVFDREKLILGWKESDC 449
>ref|XP_474980.1| OSJNBb0071D01.4 [Oryza sativa (japonica cultivar-group)]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza
sativa (japonica cultivar-group)]
Length = 530
Score = 469 bits (1206), Expect = e-130
Identities = 238/432 (55%), Positives = 305/432 (70%), Gaps = 25/432 (5%)
Query: 31 HHRYSEPVKKWS----HSAPSPSHRWPEKGSVEYYAELADRDRFLRGRRLSQFDAG---- 82
HHR+S PV++W+ H P WPE GS +Y A L DR R + D G
Sbjct: 38 HHRFSSPVQRWAEARGHVLPGG---WPEHGSADYVAALNGHDR-RRALSAAGGDGGGGGD 93
Query: 83 ----LAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATR 138
L FS+GN+T ++S+LGFLHY + +GTPG FMVALDTGSDLFW+PC C C+
Sbjct: 94 KPPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 153
Query: 139 SSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETS 198
S+A SA S Y P+ SSTS+ V CN+ C R +C T S CPY + YVSA+TS
Sbjct: 154 SAASGSA-------SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTS 205
Query: 199 TSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSM 258
+SG LVEDVL+L+ D +++A ++FGCGQVQ+GSFLD AAPNGLFGLG++ IS+PS+
Sbjct: 206 SSGFLVEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSI 265
Query: 259 LSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTL 318
L+++G T++SF+MCF RDGIGRISFGD+GS DQ+ETP +VNP HPTY I+I+++ VG +L
Sbjct: 266 LAQKGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSL 325
Query: 319 IDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTS 378
D+EF+ +FD+GTSFTYL DP Y+ +++SFH+QV R DSRIPF+YCYD+S +
Sbjct: 326 TDLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRI 385
Query: 379 LIPSMSLTMGGGSRFVVYDPIIIISTQS-ELVYCLAVVKSAELNIIGQNFMTGYRVVFDR 437
PS+SL GGS F V D +IS Q E VYCLA+VKSA+LNIIGQNFMTG RVVFDR
Sbjct: 386 QTPSISLRTVGGSVFPVIDEGQVISIQQHEYVYCLAIVKSAKLNIIGQNFMTGLRVVFDR 445
Query: 438 GKLILGWKKSDC 449
+ ILGWKK +C
Sbjct: 446 ERKILGWKKFNC 457
>pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 444 bits (1142), Expect = e-123
Identities = 223/413 (53%), Positives = 286/413 (68%), Gaps = 31/413 (7%)
Query: 64 LADRDRFLRGRRLSQFDAGLA-FSDGNSTFRISSLGF----------------LHYTTIE 106
+A RDR +RGRRL+ D L FSDGN T R+ +LGF LHY +
Sbjct: 1 MAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFFKVNVFMETCELFMRDLHYANVT 60
Query: 107 LGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKV 166
+GTP FMVALDTGSDLFW+PCDCT C + S+L DL++Y+PN SSTS KV
Sbjct: 61 VGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSL----DLNIYSPNASSTSTKV 116
Query: 167 TCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIF 226
CN++LCT ++C S+CPY + Y+S TS++G+LVEDVLHL D + + A V F
Sbjct: 117 PCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTF 176
Query: 227 GCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDK 286
GCGQVQ+G F D AAPNGLFGLG+E ISVPS+L++EG A+SFSMCFG DG GRISFGDK
Sbjct: 177 GCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDK 236
Query: 287 GSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSE 346
GS+DQ ETP N+ HPTYNIT+ ++ VG D+EF A+FDSGTSFTYL D Y+ +SE
Sbjct: 237 GSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISE 296
Query: 347 SFHSQVEDRR-RPPDSRIPFDYCYDM---------SPDSNTSLIPSMSLTMGGGSRFVVY 396
SF+S D+R + DS +PF+YCY + P+ ++ P+++LTM GGS + VY
Sbjct: 297 SFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDSFQYPAVNLTMKGGSSYPVY 356
Query: 397 DPIIIISTQSELVYCLAVVKSAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
P+++I + VYCLA++K +++IIGQNFMTGYRVVFDR KLILGWK+SDC
Sbjct: 357 HPLVVIPMKDTDVYCLAIMKIEDISIIGQNFMTGYRVVFDREKLILGWKESDC 409
>pir||T08860 hypothetical protein A_TM017A05.8 - Arabidopsis thaliana
Length = 472
Score = 441 bits (1133), Expect = e-122
Identities = 223/413 (53%), Positives = 286/413 (68%), Gaps = 32/413 (7%)
Query: 64 LADRDRFLRGRRLSQFDAGLA-FSDGNSTFRISSLGF----------------LHYTTIE 106
+A RDR +RGRRL+ D L FSDGN T R+ +LGF LHY +
Sbjct: 1 MAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFFKVNVFMETCELFMRDLHYANVT 60
Query: 107 LGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKV 166
+GTP FMVALDTGSDLFW+PCDCT C + S+L DL++Y+PN SSTS KV
Sbjct: 61 VGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSL----DLNIYSPNASSTSTKV 116
Query: 167 TCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIF 226
CN++LCT ++C S+CPY + Y+S TS++G+LVEDVLHL D + + A V F
Sbjct: 117 PCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTF 176
Query: 227 GCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDK 286
GCGQVQ+G F D AAPNGLFGLG+E ISVPS+L++EG A+SFSMCFG DG GRISFGDK
Sbjct: 177 GCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDK 236
Query: 287 GSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSE 346
GS+DQ ETP N+ HPTYNIT+ ++ VG D+EF A+FDSGTSFTYL D Y+ +SE
Sbjct: 237 GSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISE 296
Query: 347 SFHSQVEDRR-RPPDSRIPFDYCYDM---------SPDSNTSLIPSMSLTMGGGSRFVVY 396
SF+S D+R + DS +PF+YCY + P+ ++ P+++LTM GGS + VY
Sbjct: 297 SFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDSFQYPAVNLTMKGGSSYPVY 356
Query: 397 DPIIIISTQSELVYCLAVVKSAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
P+++I + VYCLA++K +++IIGQNFMTGYRVVFDR KLILGWK+SDC
Sbjct: 357 HPLVVIPMKVN-VYCLAIMKIEDISIIGQNFMTGYRVVFDREKLILGWKESDC 408
>dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa (japonica cultivar-group)]
Length = 732
Score = 429 bits (1104), Expect = e-119
Identities = 222/449 (49%), Positives = 303/449 (67%), Gaps = 20/449 (4%)
Query: 7 IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
++V+ ++L L+ A + +HHRYS V++W+ +A +P H G+ EYYA LA
Sbjct: 8 VVVVRAVLLLLAAVAAAEALSLDVHHRYSAAVRRWA-AAAAPPH-----GTAEYYAALAG 61
Query: 67 RDRFLRGRRLSQFDAG----LAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGS 122
D LR R L G AF+DGN T+R++ GFLHY + LGTP V F+VALDTGS
Sbjct: 62 HDG-LRRRSLGVGGGGGGAEFAFADGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGS 120
Query: 123 DLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGT 182
DLFWVPCDC +C+ +S + S FD VY+P S+TS+KV C+++LC +N C
Sbjct: 121 DLFWVPCDCLKCAPFQSPNYGSL---KFD--VYSPAQSTTSRKVPCSSNLCDLQNACRSK 175
Query: 183 FSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAP 242
++CPY + Y+S TS+SG+LVEDVL+LT +V A ++FGCGQVQ+GSFL AAP
Sbjct: 176 SNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAP 235
Query: 243 NGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSH 302
NGL GLGM+ SVPS+L+ +G A+SFSMCFG DG GRI+FGD GS DQ ETP NV +
Sbjct: 236 NGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQN 295
Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSR 362
P YNITI + VG+ I EF+A+ DSGTSFT L DP Y++++ SF +Q+ R DS
Sbjct: 296 PYYNITITGITVGSKSISTEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSS 355
Query: 363 IPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQ--SELVYCLAVVKSAEL 420
+PF++CY +S +N + P++SLT GGS F V DPII I+ + + YCLA++KS +
Sbjct: 356 MPFEFCYSVS--ANGIVHPNVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGV 413
Query: 421 NIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
N+IG+NFM+G +VVFDR +++LGWK +C
Sbjct: 414 NLIGENFMSGLKVVFDRERMVLGWKNFNC 442
>emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 405 bits (1040), Expect = e-111
Identities = 193/257 (75%), Positives = 220/257 (85%), Gaps = 8/257 (3%)
Query: 115 MVALDTGSDLFWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCT 174
MVALDTGSDLFWVPCDC +C+ T + +AS +F+LS+YNP S+T+KKVTCNNSLC
Sbjct: 1 MVALDTGSDLFWVPCDCGKCAPTEGATYAS----EFELSIYNPKVSTTNKKVTCNNSLCA 56
Query: 175 HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSG 234
RNQCLGTFS CPYMVSYVSA+TSTSGIL+EDV+HLT D N + VEA V FGCGQVQSG
Sbjct: 57 QRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSG 116
Query: 235 SFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDET 294
SFLD+AAPNGLFGLGMEKISVPS+L+REG ADSFSMCFG DG+GRISFGDKGS DQ+ET
Sbjct: 117 SFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEET 176
Query: 295 PFNVNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVED 354
PFN+NPSHP YNIT+ +VRVGTTLID EFTALFD+GTSFTYLVDP Y+ +SES +D
Sbjct: 177 PFNLNPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQD 232
Query: 355 RRRPPDSRIPFDYCYDM 371
+R PDSRIPF+YCYDM
Sbjct: 233 KRHSPDSRIPFEYCYDM 249
>gb|AAO22575.1| unknown protein [Arabidopsis thaliana] gi|18409320|ref|NP_566948.1|
aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 397 bits (1019), Expect = e-109
Identities = 199/450 (44%), Positives = 290/450 (64%), Gaps = 13/450 (2%)
Query: 7 IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
++ ++++ L C + F+F +HH +S+ VK+ + PEKGS+EY+ LA
Sbjct: 10 LLSLLVVCWGLERCEASGKFSFEVHHMFSDRVKQ----SLGLDDLVPEKGSLEYFKVLAQ 65
Query: 67 RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
RDR +RGR L+ + + F GN T I LGFLHY + +GTP F+VALDTGSDL
Sbjct: 66 RDRLIRGRGLASNNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDL 125
Query: 125 FWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFS 184
FW+PC+C S L+ L++Y+PN SSTS + C++ C ++C S
Sbjct: 126 FWLPCNCG--STCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPAS 183
Query: 185 NCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNG 244
+CPY + Y+S +T T+G L EDVLHL D+ + V+AN+ GCG+ Q+G AA NG
Sbjct: 184 SCPYQIQYLSKDTFTTGTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNG 243
Query: 245 LFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPSH 302
L GLG++ SVPS+L++ TA+SFSMCFG D +GRISFGDKG DQ ETP
Sbjct: 244 LLGLGLKDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPS 303
Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSR 362
PTY +++ +V VG + V+ ALFD+GTSFT+L++P Y ++++F V D+RRP D
Sbjct: 304 PTYAVSVTEVSVGGDAVGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPE 363
Query: 363 IPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIII-STQSELVYCLAVVKSAE-- 419
+PF++CYD+SP+ T L P +++T GGS+ + +P+ I+ + + +YCL ++KS +
Sbjct: 364 LPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFK 423
Query: 420 LNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
+NIIGQNFM+GYR+VFDR ++ILGWK+SDC
Sbjct: 424 INIIGQNFMSGYRIVFDRERMILGWKRSDC 453
>dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa (japonica
cultivar-group)]
Length = 476
Score = 390 bits (1002), Expect = e-107
Identities = 202/445 (45%), Positives = 280/445 (62%), Gaps = 49/445 (11%)
Query: 7 IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
++V+ ++L L+ A + +HHRYS V++W+ +A P
Sbjct: 8 VVVVRAVLLLLAAVAAAEALSLDVHHRYSAAVRRWAAAAAPP------------------ 49
Query: 67 RDRFLRGRRLSQFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFW 126
+ T ++ GFLHY + LGTP V F+VALDTGSDLFW
Sbjct: 50 ----------------------HGTADLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFW 87
Query: 127 VPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNC 186
VPCDC +C+ +S + S FD VY+P S+TS+KV C+++LC +N C ++C
Sbjct: 88 VPCDCLKCAPFQSPNYGSL---KFD--VYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSC 142
Query: 187 PYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLF 246
PY + Y+S TS+SG+LVEDVL+LT +V A ++FGCGQVQ+GSFL AAPNGL
Sbjct: 143 PYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLL 202
Query: 247 GLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYN 306
GLGM+ SVPS+L+ +G A+SFSMCFG DG GRI+FGD GS DQ ETP NV +P YN
Sbjct: 203 GLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYN 262
Query: 307 ITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFD 366
ITI + VG+ I EF+A+ DSGTSFT L DP Y++++ SF +Q+ R DS +PF+
Sbjct: 263 ITITGITVGSKSISTEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFE 322
Query: 367 YCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQ--SELVYCLAVVKSAELNIIG 424
+CY +S +N + P++SLT GGS F V DPII I+ + + YCLA++KS +N+IG
Sbjct: 323 FCYSVS--ANGIVHPNVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIG 380
Query: 425 QNFMTGYRVVFDRGKLILGWKKSDC 449
+NFM+G +VVFDR +++LGWK +C
Sbjct: 381 ENFMSGLKVVFDRERMVLGWKNFNC 405
>gb|AAL79734.1| putative chloroplast nucleoid DNA-binding protein [Oryza sativa]
gi|54291046|dbj|BAD61723.1| aspartic proteinase
nepenthesin II-like [Oryza sativa (japonica
cultivar-group)]
Length = 551
Score = 389 bits (998), Expect = e-106
Identities = 215/445 (48%), Positives = 278/445 (62%), Gaps = 26/445 (5%)
Query: 28 FTMHHRYSEPVKKWSHSAPSPSHRWPEK----GSVEYYAELADRDRFLRGRR-LSQFDAG 82
F +HHRYS V++W+ WP GS EYY+ L+ D L RR L+Q D
Sbjct: 29 FDLHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDGL 88
Query: 83 LAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAF 142
+ F+DGN T R+ G LHY + +GTP F+VALDTGSDLFWVPCDC +C A +
Sbjct: 89 VTFADGNITLRLD--GSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQC-APLGNLT 145
Query: 143 ASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGI 202
A +L Y+P+ SSTSK VTC ++LC N C S+CPY V Y A TS+SG
Sbjct: 146 AVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGE 205
Query: 203 LVEDVLHLTQPDDNHDL-----VEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPS 257
LVEDVL+LT+ V V+FGCGQVQ+GSFLD AA +GL GLGMEK+SVPS
Sbjct: 206 LVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPS 265
Query: 258 MLSREGFT-ADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGT 316
+L+ G ++SFSMCF +DG+GRI+FGD GS DQ ETPF V +H YNI+I + VG
Sbjct: 266 ILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHSYYNISITSMSVGD 325
Query: 317 TLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRR-----RPPDSRIPFDYCYDM 371
+ + F A+ DSGTSFTYL DP Y+ + +F++Q+ +RR PF+YCY +
Sbjct: 326 KNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSL 385
Query: 372 SPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSE------LVYCLAVVKS-AELNIIG 424
SPD T +P +SLT GG+ F V P+ I+ Q + YCLAV+KS ++IIG
Sbjct: 386 SPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPIDIIG 445
Query: 425 QNFMTGYRVVFDRGKLILGWKKSDC 449
QNFMTG +VVF+R K +LGW+K DC
Sbjct: 446 QNFMTGLKVVFNREKSVLGWQKFDC 470
>emb|CAB62655.1| putative protein [Arabidopsis thaliana] gi|11281623|pir||T45764
hypothetical protein F24M12.370 - Arabidopsis thaliana
Length = 519
Score = 384 bits (987), Expect = e-105
Identities = 198/449 (44%), Positives = 288/449 (64%), Gaps = 21/449 (4%)
Query: 7 IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
++ ++++ L C + F+F +HH +S+ VK+ + PEKGS+EY+ LA
Sbjct: 10 LLSLLVVCWGLERCEASGKFSFEVHHMFSDRVKQ----SLGLDDLVPEKGSLEYFKVLAQ 65
Query: 67 RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
RDR +RGR L+ + + F GN T I LGFLHY + +GTP F+VALDTGSDL
Sbjct: 66 RDRLIRGRGLASNNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDL 125
Query: 125 FWVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFS 184
FW+PC+C S L+ L++Y+PN SSTS + C++ C ++C S
Sbjct: 126 FWLPCNCG--STCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPAS 183
Query: 185 NCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNG 244
+CPY + Y+S +T T+G L EDVLHL D+ + V+AN+ GCG+ Q+G AA NG
Sbjct: 184 SCPYQIQYLSKDTFTTGTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNG 243
Query: 245 LFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPSH 302
L GLG++ SVPS+L++ TA+SFSMCFG D +GRISFGDKG DQ ETP + P+
Sbjct: 244 LLGLGLKDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETP--LLPTE 301
Query: 303 PTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSR 362
P ++ +V VG + V+ ALFD+GTSFT+L++P Y ++++F V D+RRP D
Sbjct: 302 P----SVTEVSVGGDAVGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPE 357
Query: 363 IPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAE--L 420
+PF++CYD+SP+ T L P +++T GGS+ + +P+ I + +YCL ++KS + +
Sbjct: 358 LPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFI---DNSAMYCLGILKSVDFKI 414
Query: 421 NIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
NIIGQNFM+GYR+VFDR ++ILGWK+SDC
Sbjct: 415 NIIGQNFMSGYRIVFDRERMILGWKRSDC 443
>ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 367 bits (943), Expect = e-100
Identities = 194/430 (45%), Positives = 278/430 (64%), Gaps = 24/430 (5%)
Query: 27 TFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRDRFLRGRRLSQFDAG---L 83
+F +HHR+SE VK H PE GS++YY L RDR GR+L+ + +
Sbjct: 23 SFEIHHRFSEQVK-----TVLGGHGLPEMGSLDYYKALVHRDR---GRQLTSNNNNQTTI 74
Query: 84 AFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAFA 143
+F+ GNST IS FLHY + +GTP F+VALDTGSDLFW+PC+C ++T +
Sbjct: 75 SFAQGNSTEEIS---FLHYANVTIGTPAQWFLVALDTGSDLFWLPCNC---NSTCVRSME 128
Query: 144 SALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGIL 203
+ L++YNP+ S +S KVTCN++LC RN+C+ S+CPY + Y+S + ++G+L
Sbjct: 129 TDQGERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKSTGVL 188
Query: 204 VEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREG 263
VEDV+H++ + + +A + FGC + Q G F +VA NG+ GL + I+VP+ML + G
Sbjct: 189 VEDVIHMSTEEG--EARDARITFGCSESQLGLFKEVAV-NGIMGLAIADIAVPNMLVKAG 245
Query: 264 FTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVEF 323
+DSFSMCFG +G G ISFGDKGS DQ ETP + S Y+++I + +VG +D EF
Sbjct: 246 VASDSFSMCFGPNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEF 305
Query: 324 TALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSM 383
TA FDSGT+ T+L++P Y+ L+ +FH V DRR PF++CY ++ S+ +PS+
Sbjct: 306 TATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSV 365
Query: 384 SLTMGGGSRFVVYDPIIIISTQ--SELVYCLAVVK--SAELNIIGQNFMTGYRVVFDRGK 439
S M GG+ + V+ PI++ T S VYCLAV+K +A+ +IIGQNFMT YR+V DR +
Sbjct: 366 SFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQNFMTNYRIVHDRER 425
Query: 440 LILGWKKSDC 449
ILGWKKS+C
Sbjct: 426 RILGWKKSNC 435
>ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 365 bits (937), Expect = 2e-99
Identities = 195/452 (43%), Positives = 279/452 (61%), Gaps = 18/452 (3%)
Query: 7 IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
++ ++++ C F F +HH +S+ VK+ + PE+GS+EY+ LA
Sbjct: 10 LLSVLVVCWGFERCEATGKFGFEVHHIFSDSVKQ----SLGLGDLVPEQGSLEYFKVLAH 65
Query: 67 RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
RDR +RGR L+ + + F GN T + LG L+Y + +GTP F+VALDTGSDL
Sbjct: 66 RDRLIRGRGLASNNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDL 125
Query: 125 FWVPCDC-TRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTF 183
FW+PC+C T C + L++Y PN S+TS + C++ C +C
Sbjct: 126 FWLPCNCGTTCIRDLEDI---GVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPS 182
Query: 184 SNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPN 243
S CPY +SY S T T G L++DVLHL D+N V+ANV GCGQ Q+G F + N
Sbjct: 183 SICPYQISY-SNSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVN 241
Query: 244 GLFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPF-NVNP 300
G+ GLG++ SVPS+L++ TA+SFSMCFGR +GRISFGD+G DQ+ETPF +V P
Sbjct: 242 GVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAP 301
Query: 301 SHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPD 360
S Y + I+ V V +D+ A FD+G+SFT+L +P Y L++SF VEDRRRP D
Sbjct: 302 S-TAYGVNISGVSVAGDPVDIRLFAKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVD 360
Query: 361 SRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQ-SELVYCLAVVKSA- 418
+PF++CYD+SP++ T P + +T GGS+ ++ +P TQ ++YCL V+KS
Sbjct: 361 PELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSVG 420
Query: 419 -ELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
++N+IGQNF+ GYR+VFDR ++ILGWK+S C
Sbjct: 421 LKINVIGQNFVAGYRIVFDRERMILGWKQSLC 452
>emb|CAB62656.1| putative protein [Arabidopsis thaliana]
gi|15230458|ref|NP_190702.1| aspartyl protease family
protein [Arabidopsis thaliana] gi|11281622|pir||T45765
hypothetical protein F24M12.380 - Arabidopsis thaliana
Length = 518
Score = 338 bits (867), Expect = 2e-91
Identities = 188/462 (40%), Positives = 268/462 (57%), Gaps = 36/462 (7%)
Query: 9 VIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRD 68
+++LI L C + F+F +HH +S+ VK+ PE GS+EY+ LA RD
Sbjct: 1 MLVLIFWGLERCEASGKFSFEVHHMFSDVVKQ----TLGFDDLVPENGSLEYFKVLAHRD 56
Query: 69 RFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFW 126
RF+RGR L+ + L N T ++ LGFLHY + LGTP F+VALDTGSDLFW
Sbjct: 57 RFIRGRGLASNNEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFW 116
Query: 127 VPCDC-TRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSN 185
+PC+C T C A + L++Y PN S+TS + C++ C +C S
Sbjct: 117 LPCNCGTTCIHDLKDA---RFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESI 173
Query: 186 CPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGL 245
CPY ++ +S+ T T+G L++DVLHL D++ V ANV GCGQ Q+G+F A NG+
Sbjct: 174 CPYQIA-LSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGV 232
Query: 246 FGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPSHP 303
GL M++ SVPS+L++ TA+SFSMCFGR +GRISFGDKG DQ+ETP +
Sbjct: 233 LGLSMKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETST 292
Query: 304 TYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRI 363
Y + + V VG +DV ALFD+G+SFT L++ Y +++F +ED+RRP D
Sbjct: 293 AYGVNVTGVSVGGVPVDVPLFALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDF 352
Query: 364 PFDYCYDMSPDS-NTSLIPSMSLTMGGGSRFVVYDPI---------------IIISTQSE 407
PF++CYD+ + N+ P + Y+P + S +
Sbjct: 353 PFEFCYDLREEHLNSDARPRHMQSK-------CYNPCRDDFRWRIQNDSQESVSYSNEGT 405
Query: 408 LVYCLAVVKSAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
+YCL ++KS LNIIGQN M+G+R+VFDR ++ILGWK+S+C
Sbjct: 406 KMYCLGILKSINLNIIGQNLMSGHRIVFDRERMILGWKQSNC 447
>ref|XP_467818.1| aspartyl protease-like [Oryza sativa (japonica cultivar-group)]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like
[Oryza sativa (japonica cultivar-group)]
Length = 520
Score = 308 bits (788), Expect = 3e-82
Identities = 180/449 (40%), Positives = 257/449 (57%), Gaps = 15/449 (3%)
Query: 7 IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
+++++L A + M HR S+ + + RWP +GS +Y+ L
Sbjct: 9 LLLLLLAAAAAVAAAEAATLSARMVHRLSDEARL--AAGARGGRRWPRRGSGDYFRALVR 66
Query: 67 RDRFLRGRRLSQFDAGLAFSDGNSTFRISS-LGFLHYTTIELGTPGVKFMVALDTGSDLF 125
D + RR+ L+ S G S F + LG+L+YT +++GTP F+VALDTGSDLF
Sbjct: 67 SDLQRQKRRVGGKYQLLSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLF 126
Query: 126 WVPCDCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSN 185
WVPCDC +C+ S++ +L D DL +Y P+ S+TS+ + C++ LC+ + C
Sbjct: 127 WVPCDCIQCAPL--SSYHGSL--DRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQP 182
Query: 186 CPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGL 245
CPY + Y S T++SG+L+ED+LHL + H V A+VI GCG+ QSGS+L+ AP+GL
Sbjct: 183 CPYNIDYFSENTTSSGLLIEDMLHLDSRE-GHAPVNASVIIGCGKKQSGSYLEGIAPDGL 241
Query: 246 FGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPF-NVNPSHPT 304
GLGM ISVPS L+R G +SFSMCF +D GRI FGD+G Q TPF +N T
Sbjct: 242 LGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQT 301
Query: 305 YNITINQVRVGTTLID-VEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRI 363
Y + +++ +G + F AL D+GTSFT L Y ++ F Q+ R D
Sbjct: 302 YAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDD-Y 360
Query: 364 PFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIIS-TQSEL-VYCLAVVKSAE-L 420
F+YCY P +P+++LT F +PI+ + Q E V+CLAV+ S E +
Sbjct: 361 SFEYCYSTGPLEMPD-VPTITLTFAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPV 419
Query: 421 NIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
IIGQNFM GY VVFDR + LGW +S+C
Sbjct: 420 GIIGQNFMVGYHVVFDRENMKLGWYRSEC 448
>emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|15238055|ref|NP_196570.1| aspartyl protease family
protein [Arabidopsis thaliana] gi|11281625|pir||T50012
hypothetical protein T31P16.70 - Arabidopsis thaliana
Length = 528
Score = 302 bits (774), Expect = 1e-80
Identities = 174/440 (39%), Positives = 252/440 (56%), Gaps = 19/440 (4%)
Query: 23 AHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRDRFLRGRRLSQFDAG 82
A +F+ + HR+S+ + S PS S P K S+EYY LA+ D + L
Sbjct: 22 ASLFSSRLIHRFSDEGRA-SIKTPSSSDSLPNKQSLEYYRLLAESDFRRQRMNLGAKVQS 80
Query: 83 LAFSDGNSTFRISS-LGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSA 141
L S+G+ T + G+LHYT I++GTP V F+VALDTGS+L W+PC+C +C A +S
Sbjct: 81 LVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQC-APLTST 139
Query: 142 FASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSG 201
+ S+LA+ DL+ YNP+ SSTSK C++ LC + C CPY V+Y+S TS+SG
Sbjct: 140 YYSSLATK-DLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSG 198
Query: 202 ILVEDVLHLTQPDDNHDL-----VEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVP 256
+LVED+LHLT +N + V+A V+ GCG+ QSG +LD AP+GL GLG +ISVP
Sbjct: 199 LLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVP 258
Query: 257 SMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNV--NPSHPTYNITINQVRV 314
S LS+ G +SFS+CF + GRI FGD G Q TPF N + Y + + +
Sbjct: 259 SFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCI 318
Query: 315 GTT-LIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDSRIPFDYCYDMSP 373
G + L FT DSG SFTYL + Y +++ + + + + ++YCY+ S
Sbjct: 319 GNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKNFEG-VSWEYCYESSA 377
Query: 374 DSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELV-YCLAVVKSAELNI--IGQNFMTG 430
+ +P++ L + FV++ P+ + LV +CL + S + I IGQN+M G
Sbjct: 378 EPK---VPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSGQEGIGSIGQNYMRG 434
Query: 431 YRVVFDRGKLILGWKKSDCK 450
YR+VFDR + LGW S C+
Sbjct: 435 YRMVFDRENMKLGWSPSKCQ 454
>pir||T45766 hypothetical protein F24M12.390 - Arabidopsis thaliana
Length = 475
Score = 280 bits (717), Expect = 6e-74
Identities = 165/453 (36%), Positives = 236/453 (51%), Gaps = 73/453 (16%)
Query: 7 IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
++ ++++ C F F +HH +S+ VK+ + PE+GS+EY+ LA
Sbjct: 10 LLSVLVVCWGFERCEATGKFGFEVHHIFSDSVKQ----SLGLGDLVPEQGSLEYFKVLAH 65
Query: 67 RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
RDR +RGR L+ + + F GN T + LG L+Y + +GTP F+VALDTGSDL
Sbjct: 66 RDRLIRGRGLASNNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDL 125
Query: 125 FWVPCDC-TRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTF 183
FW+PC+C T C + L++Y PN S+TS + C++ C +C
Sbjct: 126 FWLPCNCGTTCIRDLEDI---GVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPS 182
Query: 184 SNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPN 243
S CPY +SY S T T G L++DVLHL D+N V+ANV GCGQ Q+G F + N
Sbjct: 183 SICPYQISY-SNSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVN 241
Query: 244 GLFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPS 301
G+ GLG++ SVPS+L++ TA+SFSMCFGR +GRISFG
Sbjct: 242 GVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVGRISFG----------------- 284
Query: 302 HPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDS 361
D Y+ E+ V RRRP D
Sbjct: 285 ------------------------------------DRGYTDQEETPFISVAPRRRPVDP 308
Query: 362 RIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQS-----ELVYCLAVVK 416
+PF++CYD+SP++ T P + +T GGS+ ++ +P TQ+ ++YCL V+K
Sbjct: 309 ELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQARHGEGNVMYCLGVLK 368
Query: 417 SAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
S L I NF+ GYR+VFDR ++ILGWK+S C
Sbjct: 369 SVRLKI--NNFVAGYRIVFDRERMILGWKQSLC 399
>emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 280 bits (715), Expect = 1e-73
Identities = 165/453 (36%), Positives = 236/453 (51%), Gaps = 73/453 (16%)
Query: 7 IIVIILIILHLSMCCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELAD 66
++ ++++ C F F +HH +S+ VK+ + PE+GS+EY+ LA
Sbjct: 10 LLSVLVVCWGFERCEATGKFGFEVHHIFSDSVKQ----SLGLGDLVPEQGSLEYFKVLAH 65
Query: 67 RDRFLRGRRLSQF--DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDL 124
RDR +RGR L+ + + F GN T + LG L+Y + +GTP F+VALDTGSDL
Sbjct: 66 RDRLIRGRGLASNNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDL 125
Query: 125 FWVPCDC-TRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTF 183
FW+PC+C T C + L++Y PN S+TS + C++ C +C
Sbjct: 126 FWLPCNCGTTCIRDLEDI---GVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPS 182
Query: 184 SNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPN 243
S CPY +SY S T T G L++DVLHL D+N V+ANV GCGQ Q+G F + N
Sbjct: 183 SICPYQISY-SNSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVN 241
Query: 244 GLFGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNVNPS 301
G+ GLG++ SVPS+L++ TA+SFSMCFGR +GRISFG
Sbjct: 242 GVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVGRISFG----------------- 284
Query: 302 HPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRRPPDS 361
D Y+ E+ V RRRP D
Sbjct: 285 ------------------------------------DRGYTDQEETPFISVAPRRRPVDP 308
Query: 362 RIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQS-----ELVYCLAVVK 416
+PF++CYD+SP++ T P + +T GGS+ ++ +P TQ+ ++YCL V+K
Sbjct: 309 ELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQARHGEGNVMYCLGVLK 368
Query: 417 SAELNIIGQNFMTGYRVVFDRGKLILGWKKSDC 449
S L I NF+ GYR+VFDR ++ILGWK+S C
Sbjct: 369 SVGLKI--NNFVAGYRIVFDRERMILGWKQSLC 399
>emb|CAB62658.1| putative protein [Arabidopsis thaliana] gi|11357633|pir||T45767
hypothetical protein F24M12.400 - Arabidopsis thaliana
Length = 426
Score = 209 bits (532), Expect = 2e-52
Identities = 110/258 (42%), Positives = 167/258 (64%), Gaps = 9/258 (3%)
Query: 178 QCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFL 237
+C+ S+CPY + Y+S + ++G+LVEDV+H++ + + +A + FG + Q G F
Sbjct: 127 RCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEG--EARDARITFG--ESQLGLFK 182
Query: 238 DVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFN 297
+VA NG+ GL + I+VP+ML + G +DSFSMCFG +G G ISFGDKGS DQ ETP +
Sbjct: 183 EVAV-NGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQLETPLS 241
Query: 298 VNPSHPTYNITINQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESFHSQVEDRRR 357
S Y+++I + +VG +D EFTA FDSGT+ T+L++P Y+ L+ +FH V DRR
Sbjct: 242 GTISPMFYDVSITKFKVGKVTVDTEFTATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRL 301
Query: 358 PPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQ--SELVYCLAVV 415
PF++CY ++ S+ +PS+S M GG+ + V+ PI++ T S VYCLAV+
Sbjct: 302 SKSVDSPFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVL 361
Query: 416 K--SAELNIIGQNFMTGY 431
K +A+ +IIG+N G+
Sbjct: 362 KQVNADFSIIGRNDTNGF 379
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.323 0.137 0.425
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 843,440,475
Number of Sequences: 2540612
Number of extensions: 35671134
Number of successful extensions: 80934
Number of sequences better than 10.0: 747
Number of HSP's better than 10.0 without gapping: 243
Number of HSP's successfully gapped in prelim test: 504
Number of HSP's that attempted gapping in prelim test: 79597
Number of HSP's gapped (non-prelim): 1085
length of query: 492
length of database: 863,360,394
effective HSP length: 132
effective length of query: 360
effective length of database: 527,999,610
effective search space: 190079859600
effective search space used: 190079859600
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 77 (34.3 bits)
Medicago: description of AC149547.3