Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC001544A_C01 KCC001544A_c01
(1260 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAE18153.1| aspartic proteinase [Chlamydomonas reinhardtii] 603 0.0
ref|NP_172655.1| aspartic proteinase -related [Arabidopsis thali... 154 4e-61
gb|AAC49730.1| aspartic proteinase [Arabidopsis thaliana] 154 4e-61
emb|CAC86004.1| aspartic proteinase [Theobroma cacao] 159 4e-52
ref|NP_176419.2| aspartic protease -related [Arabidopsis thalian... 114 9e-50
>emb|CAE18153.1| aspartic proteinase [Chlamydomonas reinhardtii]
Length = 578
Score = 603 bits (1554), Expect(3) = 0.0
Identities = 307/308 (99%), Positives = 307/308 (99%)
Frame = +2
Query: 188 EHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 367
EHTWVPVTRQGYWQF MEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG
Sbjct: 243 EHTWVPVTRQGYWQFTMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 302
Query: 368 ATSALSAQCRQLVRDYLPQIIAQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTAA 547
ATSALSAQCRQLVRDYLPQIIAQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTAA
Sbjct: 303 ATSALSAQCRQLVRDYLPQIIAQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTAA 362
Query: 548 GTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAASD 727
GTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAASD
Sbjct: 363 GTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAASD 422
Query: 728 GGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAVG 907
GGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAVG
Sbjct: 423 GGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAVG 482
Query: 908 QLCDQVSFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGFMG 1087
QLCDQVSFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGFMG
Sbjct: 483 QLCDQVSFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGFMG 542
Query: 1088 LDVPAGPL 1111
LDVPAGPL
Sbjct: 543 LDVPAGPL 550
Score = 132 bits (333), Expect(3) = 0.0
Identities = 61/61 (100%), Positives = 61/61 (100%)
Frame = +1
Query: 1 DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 180
DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF
Sbjct: 181 DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 240
Query: 181 T 183
T
Sbjct: 241 T 241
Score = 67.0 bits (162), Expect(3) = 0.0
Identities = 30/31 (96%), Positives = 30/31 (96%)
Frame = +3
Query: 1104 GPWWILGDIFLGAYHTVFDYGAARLGFANAA 1196
GP WILGDIFLGAYHTVFDYGAARLGFANAA
Sbjct: 548 GPLWILGDIFLGAYHTVFDYGAARLGFANAA 578
>ref|NP_172655.1| aspartic proteinase -related [Arabidopsis thaliana]
gi|25290005|pir||F86253 hypothetical protein [imported] -
Arabidopsis thaliana gi|3157937|gb|AAC17620.1| Identical
to aspartic proteinase cDNA gb|U51036 from A. thaliana.
ESTs gb|N96313, gb|T21893, gb|R30158, gb|T21482,
gb|T43650, gb|R64749, gb|R65157, gb|T88269, gb|T44552,
gb|T22542, gb|T76533, gb|T44350, gb|Z34591, gb|AA728734,
gb|T46003, gb|R65157, gb|N38290, gb|AA395468, gb|T20815
and gb|Z34173 come from this gene. [Arabidopsis thaliana]
gi|15912219|gb|AAL08243.1| At1g11910/F12F1_24
[Arabidopsis thaliana] gi|15912251|gb|AAL08259.1|
At1g11910/F12F1_24 [Arabidopsis thaliana]
gi|17381036|gb|AAL36330.1| putative aspartic proteinase
[Arabidopsis thaliana] gi|21617929|gb|AAM66979.1|
putative aspartic proteinase [Arabidopsis thaliana]
gi|25055040|gb|AAN71979.1| putative aspartic proteinase
[Arabidopsis thaliana]
Length = 506
Score = 154 bits (388), Expect(3) = 4e-61
Identities = 106/312 (33%), Positives = 152/312 (47%), Gaps = 4/312 (1%)
Frame = +2
Query: 188 EHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 367
+HT+VPVT++GYWQF+M + +G C GC+AIAD+GTSL+AGP+ + +NHAIG
Sbjct: 249 KHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIG 308
Query: 368 ATSALSAQCRQLVRDYLPQII-AQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTA 544
A +S QC+ +V Y I+ L + ++C+ IGLC T
Sbjct: 309 AAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIGLC------------------TF 350
Query: 545 AGTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAAS 724
GT + S G +V D+ A+L+N +G AA A
Sbjct: 351 DGTRGV--SMGIESVVDKE---------------NAKLSNGVGDAACSA----------- 382
Query: 725 DGGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAV 904
C+ AV +I+ L+ N T E+I + V
Sbjct: 383 -----------------------------------CEMAVVWIQSQLRQNMTQERILNYV 407
Query: 905 GQLCDQV-SFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGF 1081
+LC+++ S G S VDC ++ST+P +S IGG+VF L PE+YVL++ G QCISGF
Sbjct: 408 NELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKV-GEGPVAQCISGF 466
Query: 1082 MGLDV--PAGPL 1111
+ LDV P GPL
Sbjct: 467 IALDVAPPRGPL 478
Score = 72.8 bits (177), Expect(3) = 4e-61
Identities = 32/60 (53%), Positives = 42/60 (69%)
Frame = +1
Query: 1 DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 180
DGILG+GF ISV P + ++++G + PVFSFWLNR+ + GGELV GG+DP HF
Sbjct: 187 DGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHF 246
Score = 53.1 bits (126), Expect(3) = 4e-61
Identities = 21/31 (67%), Positives = 26/31 (83%)
Frame = +3
Query: 1104 GPWWILGDIFLGAYHTVFDYGAARLGFANAA 1196
GP WILGD+F+G YHTVFD+G ++GFA AA
Sbjct: 476 GPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 506
>gb|AAC49730.1| aspartic proteinase [Arabidopsis thaliana]
Length = 486
Score = 154 bits (388), Expect(3) = 4e-61
Identities = 106/312 (33%), Positives = 152/312 (47%), Gaps = 4/312 (1%)
Frame = +2
Query: 188 EHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 367
+HT+VPVT++GYWQF+M + +G C GC+AIAD+GTSL+AGP+ + +NHAIG
Sbjct: 229 KHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIG 288
Query: 368 ATSALSAQCRQLVRDYLPQII-AQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTA 544
A +S QC+ +V Y I+ L + ++C+ IGLC T
Sbjct: 289 AAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIGLC------------------TF 330
Query: 545 AGTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAAS 724
GT + S G +V D+ A+L+N +G AA A
Sbjct: 331 DGTRGV--SMGIESVVDKE---------------NAKLSNGVGDAACSA----------- 362
Query: 725 DGGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAV 904
C+ AV +I+ L+ N T E+I + V
Sbjct: 363 -----------------------------------CEMAVVWIQSQLRQNMTQERILNYV 387
Query: 905 GQLCDQV-SFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGF 1081
+LC+++ S G S VDC ++ST+P +S IGG+VF L PE+YVL++ G QCISGF
Sbjct: 388 NELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKV-GEGPVAQCISGF 446
Query: 1082 MGLDV--PAGPL 1111
+ LDV P GPL
Sbjct: 447 IALDVAPPRGPL 458
Score = 72.8 bits (177), Expect(3) = 4e-61
Identities = 32/60 (53%), Positives = 42/60 (69%)
Frame = +1
Query: 1 DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 180
DGILG+GF ISV P + ++++G + PVFSFWLNR+ + GGELV GG+DP HF
Sbjct: 167 DGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHF 226
Score = 53.1 bits (126), Expect(3) = 4e-61
Identities = 21/31 (67%), Positives = 26/31 (83%)
Frame = +3
Query: 1104 GPWWILGDIFLGAYHTVFDYGAARLGFANAA 1196
GP WILGD+F+G YHTVFD+G ++GFA AA
Sbjct: 456 GPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 486
>emb|CAC86004.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 159 bits (401), Expect(2) = 4e-52
Identities = 109/322 (33%), Positives = 155/322 (47%), Gaps = 4/322 (1%)
Frame = +2
Query: 188 EHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 367
+HT+VPVT++GYWQF+M + + CA CAAIAD+GTSL+AGPS + +NHAIG
Sbjct: 257 KHTYVPVTQKGYWQFDMGDVLIADKPTGYCAGSCAAIADSGTSLLAGPSTVITMINHAIG 316
Query: 368 ATSALSAQCRQLVRDYLPQIIAQL-HDLPLDQVCASIGLCPMAAASTIKPARRLLATTTA 544
AT +S +C+ +V+ Y II L + ++C+ IGLC T
Sbjct: 317 ATGVVSQECKAVVQQYGRTIIDLLIAEAQPQKICSQIGLC------------------TF 358
Query: 545 AGTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAAS 724
G H + S+G +V DE S
Sbjct: 359 NGAHGV--STGIESVVDE-----------------------------------------S 375
Query: 725 DGGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAV 904
+G SGV+ +C C+ AV +++ ++ N T ++I V
Sbjct: 376 NGKSSGVLR--------------------DAMCPACEMAVVWMQNQVRQNQTQDRILSYV 415
Query: 905 GQLCDQV-SFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGF 1081
+LCD+V + G S VDC +S++P ISF IGG+VF L PE+Y+L++ G E QCISGF
Sbjct: 416 NELCDRVPNPMGESAVDCGSLSSMPTISFTIGGKVFDLTPEEYILKV-GEGSEAQCISGF 474
Query: 1082 MGLDV--PAGPLVDPGRHIPGR 1141
LD+ P GPL G GR
Sbjct: 475 TALDIPPPRGPLWILGDIFMGR 496
Score = 70.1 bits (170), Expect(2) = 4e-52
Identities = 30/60 (50%), Positives = 42/60 (70%)
Frame = +1
Query: 1 DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 180
DGILG+GF ISV P + ++++G + PVFSFWLNR+ + GGE+V GG+DP H+
Sbjct: 195 DGILGLGFKEISVGDAVPVWYNMIKQGLIKEPVFSFWLNRNVDEEAGGEIVFGGVDPNHY 254
Score = 55.5 bits (132), Expect = 2e-06
Identities = 23/31 (74%), Positives = 26/31 (83%)
Frame = +3
Query: 1104 GPWWILGDIFLGAYHTVFDYGAARLGFANAA 1196
GP WILGDIF+G YHTVFD+G R+GFA AA
Sbjct: 484 GPLWILGDIFMGRYHTVFDFGKLRVGFAEAA 514
>ref|NP_176419.2| aspartic protease -related [Arabidopsis thaliana]
gi|17979428|gb|AAL49856.1| putative aspartic protease
[Arabidopsis thaliana] gi|23297031|gb|AAN13225.1|
putative aspartic protease [Arabidopsis thaliana]
Length = 513
Score = 114 bits (285), Expect(3) = 9e-50
Identities = 96/312 (30%), Positives = 134/312 (42%), Gaps = 4/312 (1%)
Frame = +2
Query: 188 EHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 367
EHT+VPVT++GYWQF+M + + S C GC+AIAD+GTSL+AGP+ VA +N AIG
Sbjct: 256 EHTFVPVTQRGYWQFDMGEVLIAGESTGYCGSGCSAIADSGTSLLAGPTAVVAMINKAIG 315
Query: 368 ATSALSAQCRQLVRDYLPQII-AQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTA 544
A+ +S QC+ +V Y I+ L + ++C+ IGLC
Sbjct: 316 ASGVVSQQCKTVVDQYGQTILDLLLAETQPKKICSQIGLC------------------AY 357
Query: 545 AGTHSIRTSSGAAAVADEAAAGDASDV-DAAVAAVKAQLANLLGHAAAGATTTNGRGAAA 721
GTH + S G +V D+ +S + DA A + + + T
Sbjct: 358 DGTHGV--SMGIESVVDKENTRSSSGLRDAGCPACEMAVVWIQSQLRQNMTQER------ 409
Query: 722 SDGGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADA 901
I + E + ESA D
Sbjct: 410 --------IVNYINEICERMPSPNGESAVD------------------------------ 431
Query: 902 VGQLCDQVSFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGF 1081
C Q+S K+ T+ SF IGG+VF L PE+YVL++ G QCISGF
Sbjct: 432 ----CSQLS----------KMPTV---SFTIGGKVFDLAPEEYVLKI-GEGPVAQCISGF 473
Query: 1082 MGLDV--PAGPL 1111
LD+ P GPL
Sbjct: 474 TALDIPPPRGPL 485
Score = 75.9 bits (185), Expect(3) = 9e-50
Identities = 31/60 (51%), Positives = 44/60 (72%)
Frame = +1
Query: 1 DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 180
DG+LG+GF I+V + P + ++++G + PVFSFWLNRDP + GGE+V GG+DP HF
Sbjct: 194 DGLLGLGFQEIAVGNATPVWYNMLKQGLIKRPVFSFWLNRDPKSEEGGEIVFGGVDPKHF 253
Score = 51.6 bits (122), Expect(3) = 9e-50
Identities = 20/30 (66%), Positives = 25/30 (82%)
Frame = +3
Query: 1104 GPWWILGDIFLGAYHTVFDYGAARLGFANA 1193
GP WILGD+F+G YHTVFD+G ++GFA A
Sbjct: 483 GPLWILGDVFMGKYHTVFDFGNEQVGFAEA 512