Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC019043A_C01 KMC019043A_c01
(649 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
ref|NP_172403.1| unknown protein; protein id: At1g09320.1 [Arabi... 134 1e-30
pir||C86226 protein T31J12.4 [imported] - Arabidopsis thaliana g... 100 2e-20
ref|NP_187304.1| hypothetical protein; protein id: At3g06520.1 [... 98 8e-20
ref|NP_182245.1| unknown protein; protein id: At2g47230.1 [Arabi... 83 3e-15
ref|NP_171829.1| unknown protein; protein id: At1g03300.1 [Arabi... 77 2e-13
>ref|NP_172403.1| unknown protein; protein id: At1g09320.1 [Arabidopsis thaliana]
Length = 491
Score = 134 bits (336), Expect = 1e-30
Identities = 84/247 (34%), Positives = 129/247 (52%), Gaps = 45/247 (18%)
Frame = +3
Query: 42 YLEPDAEVEFCSEIAGQR-SWSIGKIVSRPSFSP-DHVLVEYDY------EQDTNPKTQS 197
YL+P + VE S+ G R SW +GK+++ PS S D V + +Y ++ T P +
Sbjct: 13 YLKPGSAVEISSDEIGFRGSWYMGKVITIPSSSDKDSVKCQVEYTTLFFDKEGTKPLKEV 72
Query: 198 VSIDKVRPRPPPETHHDFK----IGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQL 365
V + ++RP PP + + K +G++VDA+ W EG + + L+DGKF+V F K+
Sbjct: 73 VDMSQLRPPAPPMSEIEKKKKIVVGEEVDAFYNDGWWEGDVTEVLDDGKFSVFFRSSKEQ 132
Query: 366 NEFPKENLRTHREWIDDHWEPPIQQQKQE------------------------------- 452
F K+ LR HREW+D W+PP+++ ++E
Sbjct: 133 IRFRKDELRFHREWVDGAWKPPLEETEEEEDESEEDKLDDSEDEEDILARVDLETTRAIA 192
Query: 453 --LFRIGDLVEVSSKVKGYRGTWFLAEVVELKVQGKFLVEHKHRLHDVTGKLLKEKIDDD 626
+F G +VEVSS +G++G WF A+VVE + KFLVE++ + LKE+ D
Sbjct: 193 KQMFSSGTVVEVSSDEEGFQGCWFAAKVVEPVGEDKFLVEYRDLREKDGIEPLKEETDFL 252
Query: 627 HIRPLPP 647
HIRP PP
Sbjct: 253 HIRPPPP 259
Score = 98.6 bits (244), Expect = 6e-20
Identities = 62/218 (28%), Positives = 107/218 (48%), Gaps = 24/218 (11%)
Frame = +3
Query: 63 VEFCSEIAG-QRSWSIGKIVSRPSFSPDHVLVEYD--YEQD-TNPKTQSVSIDKVRPRPP 230
VE S+ G Q W K+V D LVEY E+D P + +RP PP
Sbjct: 202 VEVSSDEEGFQGCWFAAKVVE--PVGEDKFLVEYRDLREKDGIEPLKEETDFLHIRPPPP 259
Query: 231 PETHHDFKIGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLRTHREWI 410
+ DF +GDK++A+ W G ++ ++ G + F ++ F ++ LR H++W+
Sbjct: 260 RDEDIDFAVGDKINAFYNDGWWVGVVIDGMKHGTVGIYFRQSQEKMRFGRQGLRLHKDWV 319
Query: 411 DDHWEPPIQQQK--------------------QELFRIGDLVEVSSKVKGYRGTWFLAEV 530
D W+ P++ K ++ F IG +EVS + +G+ +WFLA++
Sbjct: 320 DGTWQLPLKGGKIKREKTVSCNRNVRPKKATEKQAFSIGTPIEVSPEEEGFEDSWFLAKL 379
Query: 531 VELKVQGKFLVEHKHRLHDVTGKLLKEKIDDDHIRPLP 644
+E + + K LVE+ + + + L+E+++ IRPLP
Sbjct: 380 IEYRGKDKCLVEYDNLKAEDGKEPLREEVNVSRIRPLP 417
Score = 56.2 bits (134), Expect = 4e-07
Identities = 36/113 (31%), Positives = 52/113 (45%), Gaps = 4/113 (3%)
Frame = +3
Query: 96 SWSIGKIVSRPSFSPDHVLVEYDY---EQDTNPKTQSVSIDKVRPRPPPETH-HDFKIGD 263
SW + K++ D LVEYD E P + V++ ++RP P F+ D
Sbjct: 373 SWFLAKLIEYRG--KDKCLVEYDNLKAEDGKEPLREEVNVSRIRPLPLESVMVSPFERHD 430
Query: 264 KVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLRTHREWIDDHW 422
KV+A W G + K L + V F ++L +F LR H+EWID W
Sbjct: 431 KVNALYNDGWWVGVIRKVLAKSSYLVLFKNTQELLKFHHSQLRLHQEWIDGKW 483
>pir||C86226 protein T31J12.4 [imported] - Arabidopsis thaliana
gi|4337176|gb|AAD18097.1| T31J12.4 [Arabidopsis
thaliana]
Length = 514
Score = 100 bits (249), Expect = 2e-20
Identities = 54/149 (36%), Positives = 88/149 (58%), Gaps = 12/149 (8%)
Frame = +3
Query: 42 YLEPDAEVEFCSEIAGQR-SWSIGKIVSRPSFSP-DHVLVEYDY------EQDTNPKTQS 197
YL+P + VE S+ G R SW +GK+++ PS S D V + +Y ++ T P +
Sbjct: 13 YLKPGSAVEISSDEIGFRGSWYMGKVITIPSSSDKDSVKCQVEYTTLFFDKEGTKPLKEV 72
Query: 198 VSIDKVRPRPPPETHHDFK----IGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQL 365
V + ++RP PP + + K +G++VDA+ W EG + + L+DGKF+V F K+
Sbjct: 73 VDMSQLRPPAPPMSEIEKKKKIVVGEEVDAFYNDGWWEGDVTEVLDDGKFSVFFRSSKEQ 132
Query: 366 NEFPKENLRTHREWIDDHWEPPIQQQKQE 452
F K+ LR HREW+D W+PP+++ ++E
Sbjct: 133 IRFRKDELRFHREWVDGAWKPPLEETEEE 161
Score = 98.6 bits (244), Expect = 6e-20
Identities = 62/218 (28%), Positives = 107/218 (48%), Gaps = 24/218 (11%)
Frame = +3
Query: 63 VEFCSEIAG-QRSWSIGKIVSRPSFSPDHVLVEYD--YEQD-TNPKTQSVSIDKVRPRPP 230
VE S+ G Q W K+V D LVEY E+D P + +RP PP
Sbjct: 225 VEVSSDEEGFQGCWFAAKVVE--PVGEDKFLVEYRDLREKDGIEPLKEETDFLHIRPPPP 282
Query: 231 PETHHDFKIGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLRTHREWI 410
+ DF +GDK++A+ W G ++ ++ G + F ++ F ++ LR H++W+
Sbjct: 283 RDEDIDFAVGDKINAFYNDGWWVGVVIDGMKHGTVGIYFRQSQEKMRFGRQGLRLHKDWV 342
Query: 411 DDHWEPPIQQQK--------------------QELFRIGDLVEVSSKVKGYRGTWFLAEV 530
D W+ P++ K ++ F IG +EVS + +G+ +WFLA++
Sbjct: 343 DGTWQLPLKGGKIKREKTVSCNRNVRPKKATEKQAFSIGTPIEVSPEEEGFEDSWFLAKL 402
Query: 531 VELKVQGKFLVEHKHRLHDVTGKLLKEKIDDDHIRPLP 644
+E + + K LVE+ + + + L+E+++ IRPLP
Sbjct: 403 IEYRGKDKCLVEYDNLKAEDGKEPLREEVNVSRIRPLP 440
Score = 56.2 bits (134), Expect = 4e-07
Identities = 36/113 (31%), Positives = 52/113 (45%), Gaps = 4/113 (3%)
Frame = +3
Query: 96 SWSIGKIVSRPSFSPDHVLVEYDY---EQDTNPKTQSVSIDKVRPRPPPETH-HDFKIGD 263
SW + K++ D LVEYD E P + V++ ++RP P F+ D
Sbjct: 396 SWFLAKLIEYRG--KDKCLVEYDNLKAEDGKEPLREEVNVSRIRPLPLESVMVSPFERHD 453
Query: 264 KVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLRTHREWIDDHW 422
KV+A W G + K L + V F ++L +F LR H+EWID W
Sbjct: 454 KVNALYNDGWWVGVIRKVLAKSSYLVLFKNTQELLKFHHSQLRLHQEWIDGKW 506
Score = 38.5 bits (88), Expect = 0.079
Identities = 24/67 (35%), Positives = 35/67 (51%), Gaps = 6/67 (8%)
Frame = +3
Query: 465 GDLVEVSSKVKGYRGTWFLAEVVEL-----KVQGKFLVEHKHRLHDVTG-KLLKEKIDDD 626
G VE+SS G+RG+W++ +V+ + K K VE+ D G K LKE +D
Sbjct: 17 GSAVEISSDEIGFRGSWYMGKVITIPSSSDKDSVKCQVEYTTLFFDKEGTKPLKEVVDMS 76
Query: 627 HIRPLPP 647
+RP P
Sbjct: 77 QLRPPAP 83
>ref|NP_187304.1| hypothetical protein; protein id: At3g06520.1 [Arabidopsis
thaliana] gi|12322681|gb|AAG51333.1|AC020580_13
hypothetical protein; 66083-64412 [Arabidopsis thaliana]
Length = 466
Score = 98.2 bits (243), Expect = 8e-20
Identities = 69/236 (29%), Positives = 117/236 (49%), Gaps = 29/236 (12%)
Frame = +3
Query: 27 MSASPYLEPDAEVEFCSEIAGQRSWSIGKIVSRPSFSPDHVLVEYDYEQDTNPKT----Q 194
M++ + VE ++G ++ +VS PS V VE++ + +
Sbjct: 1 MTSDRFWRGGDRVEVERLVSGATAYFPASVVSAPSVRKKLVWVEHESLTVGGSVSVRMKE 60
Query: 195 SVSIDKVRPRPPPETHHDFKIGDKVDAY--DKGSWREGHLVKELEDGKFAVDF---NLPK 359
V+ ++RP PP E + FK D+VD + +G W G++ LED ++ V+F N P+
Sbjct: 61 YVTPTRLRPSPPRELNRRFKADDEVDVFRDSEGCWVRGNVTTVLEDSRYIVEFKGENRPE 120
Query: 360 -QLNEFPKENLRTHREWIDDHWEPPIQQQ------------------KQELFRIGDLVEV 482
++++F NLR HREW+D W P + QQ +++ + G LVEV
Sbjct: 121 IEVDQF---NLRLHREWLDGGWVPSLLQQSNFSESTAQRIKLKIKIKRRDQYEKGALVEV 177
Query: 483 SSKVKGYRGTWFLAEVVELKVQGKFLVEH-KHRLHDVTGKLLKEKIDDDHIRPLPP 647
S+ K Y+G+W+ A ++ L K++VEH K D L++ ++ IRP+PP
Sbjct: 178 RSEEKAYKGSWYCARILCLLGDDKYIVEHLKFSRDDGESIPLRDVVEAKDIRPVPP 233
Score = 67.4 bits (163), Expect = 2e-10
Identities = 64/229 (27%), Positives = 99/229 (42%), Gaps = 29/229 (12%)
Frame = +3
Query: 48 EPDAEVEFCSEIAGQR-SWSIGKIVSRPSFSPDHVLVEY-DYEQDTN---PKTQSVSIDK 212
E A VE SE + SW +I+ D +VE+ + +D P V
Sbjct: 170 EKGALVEVRSEEKAYKGSWYCARILCL--LGDDKYIVEHLKFSRDDGESIPLRDVVEAKD 227
Query: 213 VRPRPPPETHHD--FKIGDKVDAYDKGSWREGHLVKELEDG--KFAVDFNLPKQLNEFPK 380
+RP PP E ++ G VDA+ W + K L G K++V +
Sbjct: 228 IRPVPPSELSPVVCYEPGVIVDAWFNKRWWTSRVSKVLGGGSNKYSVFIISTGEETTILN 287
Query: 381 ENLRTHREWIDDHW---------------EPPIQQQK-----QELFRIGDLVEVSSKVKG 500
NLR H++WI+ W +PP+++ K +++F G VEV S G
Sbjct: 288 FNLRPHKDWINGQWVIPSKVLTDVPEECYKPPLKKLKSCERAEKVFNNGMEVEVRSDEPG 347
Query: 501 YRGTWFLAEVVELKVQGKFLVEHKHRLHDVTGKLLKEKIDDDHIRPLPP 647
Y +WF A++V + ++ VE++ D +LLKE+ IRP PP
Sbjct: 348 YEASWFSAKIVSYLGENRYTVEYQTLKTDDERELLKEEARGSDIRPPPP 396
>ref|NP_182245.1| unknown protein; protein id: At2g47230.1 [Arabidopsis thaliana]
gi|25364476|pir||F84912 hypothetical protein At2g47230
[imported] - Arabidopsis thaliana
gi|2275201|gb|AAB63823.1| unknown protein [Arabidopsis
thaliana]
Length = 701
Score = 83.2 bits (204), Expect = 3e-15
Identities = 53/164 (32%), Positives = 82/164 (49%), Gaps = 8/164 (4%)
Frame = +3
Query: 180 NPKTQSVSIDKVRPRPPPETHHDFKI--GDKVDAYDKGSWREGHLVKELEDGKFAVDFNL 353
+P +++ +RP PP ++ + G VDA K W G ++K+LE+GKF V ++
Sbjct: 55 SPLIENIEPRFIRPVPPENEYNGIVLEEGTVVDADHKDGWWTGVIIKKLENGKFWVYYDS 114
Query: 354 PKQLNEFPKENLRTHREWIDDHW-EPPIQQQKQELFRIGDLVEVSSKVKGYRGTWFLAEV 530
P + EF + LR H W W P IQ+ + +F G + EVS+ V WF A +
Sbjct: 115 PPDIIEFERNQLRPHLRWSGWKWLRPDIQELDKSMFSSGTMAEVSTIVDKAEVAWFPAMI 174
Query: 531 V-ELKVQG--KFLVE--HKHRLHDVTGKLLKEKIDDDHIRPLPP 647
+ E++V G KF+V+ +KH ID +RP PP
Sbjct: 175 IKEIEVDGEKKFIVKDCNKHLSFSGDEARTNSTIDSSRVRPTPP 218
Score = 41.2 bits (95), Expect = 0.012
Identities = 28/116 (24%), Positives = 51/116 (43%)
Frame = +3
Query: 78 EIAGQRSWSIGKIVSRPSFSPDHVLVEYDYEQDTNPKTQSVSIDKVRPRPPPETHHDFKI 257
E+ G++ + + SFS D E TN ++ +VRP PPP +++
Sbjct: 179 EVDGEKKFIVKDCNKHLSFSGD--------EARTN---STIDSSRVRPTPPPFPVEKYEL 227
Query: 258 GDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLRTHREWIDDHWE 425
D+V+ + WR+G + L+ + V + K+ +LR + W D W+
Sbjct: 228 MDRVEVFRGSVWRQGLVRGVLDHNCYMVCLVVTKEEPVVKHSDLRPCKVWEDGVWQ 283
Score = 35.0 bits (79), Expect = 0.87
Identities = 25/70 (35%), Positives = 33/70 (46%), Gaps = 3/70 (4%)
Frame = +3
Query: 447 QELFRIGDLVEVSSKVKGYRGTWF---LAEVVELKVQGKFLVEHKHRLHDVTGKLLKEKI 617
+E R G VEVSS +G+ WF L E + K V + L+D L E I
Sbjct: 2 EETIRKGSEVEVSSTEEGFADAWFRGILQENPTKSGRKKLRVRYLTLLNDDALSPLIENI 61
Query: 618 DDDHIRPLPP 647
+ IRP+PP
Sbjct: 62 EPRFIRPVPP 71
>ref|NP_171829.1| unknown protein; protein id: At1g03300.1 [Arabidopsis thaliana]
gi|25364480|pir||E86164 F15K9.10 protein - Arabidopsis
thaliana gi|3850574|gb|AAC72114.1| Strong similarity to
T08I13.7 gi|2275201 unknown protein from Arabidopsis
thaliana BAC gb|AC002337. EST gb|Z17450 comes from this
gene
Length = 670
Score = 77.0 bits (188), Expect = 2e-13
Identities = 61/215 (28%), Positives = 101/215 (46%), Gaps = 17/215 (7%)
Frame = +3
Query: 54 DAEVEFCSEIAGQRSWSIGKIVSRPSFSP---------DHVLVEYDYEQDTNPKTQSVSI 206
D EVE SE G R+ I+ +P ++ + E ++P T V
Sbjct: 6 DCEVEIFSEEDGFRNAWYRAILEETPTNPTSESKKLRFSYMTKSLNKEGSSSPPT--VEQ 63
Query: 207 DKVRPRPPPETHHD--FKIGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPK 380
+RP PP ++ F+ G VDA K WR G ++ ++E+ + V F+ P + +F
Sbjct: 64 RFIRPVPPENLYNGVVFEEGTMVDADYKHRWRTGVVINKMENDSYLVLFDCPPDIIQFET 123
Query: 381 ENLRTHREWIDDHW-EPPIQQQKQELFRIGDLVEVSSKVKGYRGTWFLAEVV-ELKVQG- 551
++LR H +W W +P +++ + +F G LVEVS + +W A +V E++ G
Sbjct: 124 KHLRAHLDWTGSEWVQPEVRELSKSMFSPGTLVEVSCVIDKVEVSWVTAMIVKEIEESGE 183
Query: 552 -KFLVE--HKHRLHDVTGKLLKEKIDDDHIRPLPP 647
KF+V+ +KH V +D +RP PP
Sbjct: 184 KKFIVKVCNKHLSCRVDEAKPNMTVDSCCVRPRPP 218
Score = 37.0 bits (84), Expect = 0.23
Identities = 21/77 (27%), Positives = 34/77 (43%)
Frame = +3
Query: 213 VRPRPPPETHHDFKIGDKVDAYDKGSWREGHLVKELEDGKFAVDFNLPKQLNEFPKENLR 392
VRPRPP ++ + D V+ + SWR+G + + ++ V K +LR
Sbjct: 213 VRPRPPLFFVEEYDLRDCVEVFHGSSWRQGVVKGVHIEKQYTVTLEATKDKLVVKHSDLR 272
Query: 393 THREWIDDHWEPPIQQQ 443
+ W D W QQ+
Sbjct: 273 PFKVWEDGVWHNGPQQK 289
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 629,571,714
Number of Sequences: 1393205
Number of extensions: 15513848
Number of successful extensions: 65487
Number of sequences better than 10.0: 109
Number of HSP's better than 10.0 without gapping: 57637
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 64901
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27576232529
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
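Note on the statistics above: the bit scores, Expect values, and effective database length in this report can be cross-checked from the footer parameters. The following is a minimal sketch, assuming the standard Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, and E = search space * 2^(-bit score)). The function and constant names are illustrative and are not part of BLAST. Pairing X1 and S1 with the ungapped lambda/K, and X2 and X3 with the gapped lambda, is an inference from which pairing reproduces the printed values.

```python
import math

# Values copied from the statistics footer of this report.
UNGAPPED = {"lambda": 0.318, "K": 0.135}
GAPPED = {"lambda": 0.267, "K": 0.0410}
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 118
SEARCH_SPACE = 27_576_232_529


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance alignments scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


def dropoff_bits(raw_dropoff, lam):
    """Convert a raw X-dropoff value to bits (no K term is involved)."""
    return lam * raw_dropoff / math.log(2)


# Each sequence contributes one effective HSP length of edge correction.
effective_db_length = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH

# Top HSP against NP_172403.1: raw score 336, reported as 134 bits, 1e-30.
top_bits = bit_score(336, GAPPED["lambda"], GAPPED["K"])
top_expect = expect(top_bits, SEARCH_SPACE)

# Second HSP: raw score 244, reported as 98.6 bits.
second_bits = bit_score(244, GAPPED["lambda"], GAPPED["K"])

# Thresholds from the footer, converted from raw units to bits.
x1_bits = dropoff_bits(16, UNGAPPED["lambda"])
x2_bits = dropoff_bits(38, GAPPED["lambda"])
x3_bits = dropoff_bits(64, GAPPED["lambda"])
s1_bits = bit_score(41, UNGAPPED["lambda"], UNGAPPED["K"])

print(f"effective database length: {effective_db_length:,}")
print(f"top HSP: {top_bits:.1f} bits, E = {top_expect:.1e}")
print(f"second HSP: {second_bits:.1f} bits")
print(f"X1 {x1_bits:.1f}  X2 {x2_bits:.1f}  X3 {x3_bits:.1f}  S1 {s1_bits:.1f}")
```

Because lambda and K are printed to only three significant figures, recomputed values agree with the report only after rounding. Dividing the effective search space by the effective database length gives exactly 97, which is consistent with an effective query length of 97 residues.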