Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC002004A_C01 KMC002004A_c01
(611 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_564108.1| expressed protein; protein id: At1g20220.1, sup... 81 1e-14
pir||H86335 T20H2.2 protein - Arabidopsis thaliana gi|8778978|gb... 81 1e-14
ref|NP_174414.1| hypothetical protein; protein id: At1g31290.1 [... 70 3e-11
ref|NP_565124.1| expressed protein; protein id: At1g76010.1, sup... 66 3e-10
pir||F96788 protein T4O12.22 [imported] - Arabidopsis thaliana g... 66 3e-10
>ref|NP_564108.1| expressed protein; protein id: At1g20220.1, supported by cDNA:
gi_16612273 [Arabidopsis thaliana]
gi|16612274|gb|AAL27504.1|AF439833_1 At1g20220/T20H2_3
[Arabidopsis thaliana]
Length = 315
Score = 80.9 bits (198), Expect = 1e-14
Identities = 54/101 (53%), Positives = 56/101 (54%), Gaps = 18/101 (17%)
Frame = -3
Query: 609 RGRGFYNNGGMEYGGGDGWDGGRGYGGRGRGFGRGRGRGFRGRGRGGYGAQPVGYY---- 442
RG G+ NN EY G G + R YG RGRG GRG GRG GRGRGGY P YY
Sbjct: 207 RGSGYVNN---EYNDG-GMEQDRSYG-RGRGRGRGGGRG--GRGRGGYNGPPPPYYEAQQ 259
Query: 441 ---DYG----------EYDAPPAP-RGRGRGRGGRGRGRGR 361
DYG YD PP RGRGRGRGGRGRG GR
Sbjct: 260 DGGDYGYNNVAPPADHGYDGPPPQGRGRGRGRGGRGRGGGR 300
Score = 58.5 bits (140), Expect = 7e-08
Identities = 42/87 (48%), Positives = 45/87 (51%), Gaps = 4/87 (4%)
Frame = -3
Query: 609 RGRGFYNNG--GMEYGGGDGWDGGRGYGGRGRGFGRGRGRGFRGRGRGGYGAQPVG--YY 442
RGRG NG +EY DGGRG GGRG G+ GRGRGG G+ V Y
Sbjct: 164 RGRGGRGNGPANVEYD-----DGGRGRGGRGNGYVNNEYDD-GGRGRGGRGSGYVNNEYN 217
Query: 441 DYGEYDAPPAPRGRGRGRGGRGRGRGR 361
D G RGRGRGRGG GRGR
Sbjct: 218 DGGMEQDRSYGRGRGRGRGGGRGGRGR 244
Score = 53.9 bits (128), Expect = 2e-06
Identities = 40/85 (47%), Positives = 44/85 (51%), Gaps = 11/85 (12%)
Frame = -3
Query: 579 MEYGGGDGWDGGRGYGGRGRGFGRGRGRGFRGRGRG-----------GYGAQPVGYYDYG 433
++Y G DG GRG G RGRG GRGRGRG GRG G G G + GY +
Sbjct: 139 IDYEGQDGSPRGRG-GRRGRG-GRGRGRGRGGRGNGPANVEYDDGGRGRGGRGNGYVN-N 195
Query: 432 EYDAPPAPRGRGRGRGGRGRGRGRN 358
EYD GRGRGGRG G N
Sbjct: 196 EYD------DGGRGRGGRGSGYVNN 214
Score = 41.6 bits (96), Expect = 0.008
Identities = 19/33 (57%), Positives = 21/33 (63%)
Frame = -3
Query: 459 QPVGYYDYGEYDAPPAPRGRGRGRGGRGRGRGR 361
+P+ DY D P RG RGRGGRGRGRGR
Sbjct: 134 KPLAEIDYEGQDGSPRGRGGRRGRGGRGRGRGR 166
Score = 39.3 bits (90), Expect = 0.041
Identities = 27/62 (43%), Positives = 31/62 (49%), Gaps = 14/62 (22%)
Frame = -3
Query: 609 RGRGFYNNGGMEY------GGGDGWDG-----GRGYGG---RGRGFGRGRGRGFRGRGRG 472
RGRG YN Y GG G++ GY G +GRG GRGRG RG GRG
Sbjct: 242 RGRGGYNGPPPPYYEAQQDGGDYGYNNVAPPADHGYDGPPPQGRGRGRGRGGRGRGGGRG 301
Query: 471 GY 466
G+
Sbjct: 302 GF 303
>pir||H86335 T20H2.2 protein - Arabidopsis thaliana
gi|8778978|gb|AAF79893.1|AC022472_2 Contains similarity
to pigpen protein from Mus musculus gb|AF224264 and
contains protein of unknown function DUF78 PF|01918
domain. ESTs gb|N38077, gb|BE037702, gb|AV442191,
gb|AV441368, gb|Z17998, gb|AV527266, gb|AV520794,
gb|AI997847, gb|AV543000 come from this gene.
[Arabidopsis thaliana]
Length = 538
Score = 80.9 bits (198), Expect = 1e-14
Identities = 54/101 (53%), Positives = 56/101 (54%), Gaps = 18/101 (17%)
Frame = -3
Query: 609 RGRGFYNNGGMEYGGGDGWDGGRGYGGRGRGFGRGRGRGFRGRGRGGYGAQPVGYY---- 442
RG G+ NN EY G G + R YG RGRG GRG GRG GRGRGGY P YY
Sbjct: 430 RGSGYVNN---EYNDG-GMEQDRSYG-RGRGRGRGGGRG--GRGRGGYNGPPPPYYEAQQ 482
Query: 441 ---DYG----------EYDAPPAP-RGRGRGRGGRGRGRGR 361
DYG YD PP RGRGRGRGGRGRG GR
Sbjct: 483 DGGDYGYNNVAPPADHGYDGPPPQGRGRGRGRGGRGRGGGR 523
Score = 58.5 bits (140), Expect = 7e-08
Identities = 42/87 (48%), Positives = 45/87 (51%), Gaps = 4/87 (4%)
Frame = -3
Query: 609 RGRGFYNNG--GMEYGGGDGWDGGRGYGGRGRGFGRGRGRGFRGRGRGGYGAQPVG--YY 442
RGRG NG +EY DGGRG GGRG G+ GRGRGG G+ V Y
Sbjct: 387 RGRGGRGNGPANVEYD-----DGGRGRGGRGNGYVNNEYDD-GGRGRGGRGSGYVNNEYN 440
Query: 441 DYGEYDAPPAPRGRGRGRGGRGRGRGR 361
D G RGRGRGRGG GRGR
Sbjct: 441 DGGMEQDRSYGRGRGRGRGGGRGGRGR 467
Score = 53.9 bits (128), Expect = 2e-06
Identities = 40/85 (47%), Positives = 44/85 (51%), Gaps = 11/85 (12%)
Frame = -3
Query: 579 MEYGGGDGWDGGRGYGGRGRGFGRGRGRGFRGRGRG-----------GYGAQPVGYYDYG 433
++Y G DG GRG G RGRG GRGRGRG GRG G G G + GY +
Sbjct: 362 IDYEGQDGSPRGRG-GRRGRG-GRGRGRGRGGRGNGPANVEYDDGGRGRGGRGNGYVN-N 418
Query: 432 EYDAPPAPRGRGRGRGGRGRGRGRN 358
EYD GRGRGGRG G N
Sbjct: 419 EYD------DGGRGRGGRGSGYVNN 437
Score = 41.6 bits (96), Expect = 0.008
Identities = 19/33 (57%), Positives = 21/33 (63%)
Frame = -3
Query: 459 QPVGYYDYGEYDAPPAPRGRGRGRGGRGRGRGR 361
+P+ DY D P RG RGRGGRGRGRGR
Sbjct: 357 KPLAEIDYEGQDGSPRGRGGRRGRGGRGRGRGR 389
Score = 39.3 bits (90), Expect = 0.041
Identities = 27/62 (43%), Positives = 31/62 (49%), Gaps = 14/62 (22%)
Frame = -3
Query: 609 RGRGFYNNGGMEY------GGGDGWDG-----GRGYGG---RGRGFGRGRGRGFRGRGRG 472
RGRG YN Y GG G++ GY G +GRG GRGRG RG GRG
Sbjct: 465 RGRGGYNGPPPPYYEAQQDGGDYGYNNVAPPADHGYDGPPPQGRGRGRGRGGRGRGGGRG 524
Query: 471 GY 466
G+
Sbjct: 525 GF 526
>ref|NP_174414.1| hypothetical protein; protein id: At1g31290.1 [Arabidopsis
thaliana] gi|6692121|gb|AAF24586.1|AC007654_2 T19E23.8
[Arabidopsis thaliana]
Length = 1194
Score = 69.7 bits (169), Expect = 3e-11
Identities = 46/102 (45%), Positives = 52/102 (50%), Gaps = 15/102 (14%)
Frame = -3
Query: 609 RGRGFYNNGGMEYGG-----GDGWDGGRGYGGRGRGFGRG----RGRGFR----GRGRGG 469
RGRG + G Y G G G DG RGY GRG G GRG RGRG+ GRGRGG
Sbjct: 14 RGRGGGGDRGRGYSGRGDGRGRGGDGDRGYSGRGDGHGRGGGGDRGRGYSGRGDGRGRGG 73
Query: 468 YGAQPVGYYDYGEYDAPPAPRGRGRGRGGRGRG--RGRNASW 349
G + GY G+ RGRG GRGRG + R+ W
Sbjct: 74 GGDRGRGYSGRGDGHGRGGGGDRGRGYSGRGRGFVQDRDGGW 115
Score = 62.8 bits (151), Expect = 3e-09
Identities = 42/88 (47%), Positives = 45/88 (50%), Gaps = 7/88 (7%)
Frame = -3
Query: 603 RGFYNNG-GMEYGGGDGWDGGRGYGGRGRGFGRGRG--RGFRGRG----RGGYGAQPVGY 445
RG Y G G G G G D GRGY GRG G GRG RG+ GRG RGG G + GY
Sbjct: 3 RGGYRGGRGDGRGRGGGGDRGRGYSGRGDGRGRGGDGDRGYSGRGDGHGRGGGGDRGRGY 62
Query: 444 YDYGEYDAPPAPRGRGRGRGGRGRGRGR 361
G+ RGRG GRG G GR
Sbjct: 63 SGRGDGRGRGGGGDRGRGYSGRGDGHGR 90
Score = 58.2 bits (139), Expect = 9e-08
Identities = 37/79 (46%), Positives = 44/79 (54%), Gaps = 4/79 (5%)
Frame = -3
Query: 609 RGRGFYNNGGMEYGGGDGWDGGRGYGGRGRGFGRG----RGRGFRGRGRGGYGAQPVGYY 442
RGRG Y+ G G G G D GRGY GRG G GRG RGRG+ GRGRG + G+
Sbjct: 58 RGRG-YSGRGDGRGRGGGGDRGRGYSGRGDGHGRGGGGDRGRGYSGRGRGFVQDRDGGWV 116
Query: 441 DYGEYDAPPAPRGRGRGRG 385
+ G+ + G RGRG
Sbjct: 117 NPGQ-----SSGGHVRGRG 130
Score = 51.2 bits (121), Expect = 1e-05
Identities = 34/70 (48%), Positives = 36/70 (50%), Gaps = 6/70 (8%)
Frame = -3
Query: 552 DGGRGYGGRGRGFGRG----RGRGFRGR--GRGGYGAQPVGYYDYGEYDAPPAPRGRGRG 391
D G GGRG G GRG RGRG+ GR GRG G GY G+ RGRG
Sbjct: 2 DRGGYRGGRGDGRGRGGGGDRGRGYSGRGDGRGRGGDGDRGYSGRGDGHGRGGGGDRGRG 61
Query: 390 RGGRGRGRGR 361
GRG GRGR
Sbjct: 62 YSGRGDGRGR 71
>ref|NP_565124.1| expressed protein; protein id: At1g76010.1, supported by cDNA:
gi_15724323, supported by cDNA: gi_15809881, supported
by cDNA: gi_16226598 [Arabidopsis thaliana]
gi|15724324|gb|AAL06555.1|AF412102_1 At1g76010/T4O12_22
[Arabidopsis thaliana] gi|15809882|gb|AAL06869.1|
At1g76010/T4O12_22 [Arabidopsis thaliana]
gi|16226599|gb|AAL16210.1|AF428441_1 At1g76010/T4O12_22
[Arabidopsis thaliana] gi|21700865|gb|AAM70556.1|
At1g76010/T4O12_22 [Arabidopsis thaliana]
Length = 350
Score = 66.2 bits (160), Expect = 3e-10
Identities = 47/97 (48%), Positives = 53/97 (54%), Gaps = 9/97 (9%)
Frame = -3
Query: 609 RGRGFYNNGGMEYGGGDGWDGGRGYGGRGRGFG------RGRGRGFRG--RGRGGYGAQP 454
+GRG Y+ G GG DG G RGY G +G G +GRG G+ G +GRGGY
Sbjct: 245 QGRGGYD-GPQGRGGYDGPQGRRGYDGPPQGRGGYDGPSQGRG-GYDGPSQGRGGYDGPS 302
Query: 453 VGYYDYGEYDAPPAP-RGRGRGRGGRGRGRGRNASWG 346
G G YD P RGRGRGRGGRGRG GR G
Sbjct: 303 QGR---GGYDGPQGRGRGRGRGRGGRGRGGGRGGDGG 336
Score = 62.8 bits (151), Expect = 3e-09
Identities = 43/99 (43%), Positives = 47/99 (47%), Gaps = 21/99 (21%)
Frame = -3
Query: 609 RGRGFYNNG--GMEYGGGDGWDGGRGYGGRGRGFGRGRGRGFRGRGRGGYGAQP------ 454
RGRG N +E+ G GW+ + YG RG GRGRGR RGRGRGGY P
Sbjct: 163 RGRGGRGNAYVNVEHEDG-GWEREQSYG---RGRGRGRGRSSRGRGRGGYNGPPNEYDAP 218
Query: 453 -------------VGYYDYGEYDAPPAPRGRGRGRGGRG 376
GY D G YDAPP RG G GRG
Sbjct: 219 QDGGYGYDAPHEHRGYDDRGGYDAPPQGRGGYDGPQGRG 257
Score = 62.4 bits (150), Expect = 5e-09
Identities = 37/74 (50%), Positives = 42/74 (56%)
Frame = -3
Query: 585 GGMEYGGGDGWDGGRGYGGRGRGFGRGRGRGFRGRGRGGYGAQPVGYYDYGEYDAPPAPR 406
G ++Y G +G GGRG G RGRG GRGRGRG RG + G+ Y R
Sbjct: 137 GDIDYEGREGSPGGRGRG-RGRGRGRGRGRGGRGNAYVNVEHEDGGWEREQSYGRG---R 192
Query: 405 GRGRGRGGRGRGRG 364
GRGRGR RGRGRG
Sbjct: 193 GRGRGRSSRGRGRG 206
Score = 48.5 bits (114), Expect = 7e-05
Identities = 37/90 (41%), Positives = 42/90 (46%), Gaps = 6/90 (6%)
Frame = -3
Query: 609 RGRGFYNNGGMEYGGGDGWDGGRG-YGGRGRGFG-----RGRGRGFRGRGRGGYGAQPVG 448
+GRG Y+ GG DG GRG Y G +G G +GRGRG RGRGRGG
Sbjct: 273 QGRGGYDGPSQGRGGYDGPSQGRGGYDGPSQGRGGYDGPQGRGRG-RGRGRGG------- 324
Query: 447 YYDYGEYDAPPAPRGRGRGRGGRGRGRGRN 358
RGRG GRGG G R+
Sbjct: 325 -------------RGRGGGRGGDGGFNNRS 341
Score = 48.5 bits (114), Expect = 7e-05
Identities = 32/67 (47%), Positives = 37/67 (54%), Gaps = 2/67 (2%)
Frame = -3
Query: 609 RGRGFYNNGGMEYGGGDGWDGGRGY--GGRGRGFGRGRGRGFRGRGRGGYGAQPVGYYDY 436
+GRG Y+ GG DG GRG G +GRG GRGRGRG GRGRGG G+ +
Sbjct: 283 QGRGGYDGPSQGRGGYDGPSQGRGGYDGPQGRGRGRGRGRG--GRGRGGGRGGDGGFNN- 339
Query: 435 GEYDAPP 415
D PP
Sbjct: 340 -RSDGPP 345
Score = 41.2 bits (95), Expect = 0.011
Identities = 20/33 (60%), Positives = 23/33 (69%), Gaps = 1/33 (3%)
Frame = -3
Query: 459 QPVGYYDYGEYDAPPAPRGRGRGRG-GRGRGRG 364
+P+G DY + P RGRGRGRG GRGRGRG
Sbjct: 134 KPMGDIDYEGREGSPGGRGRGRGRGRGRGRGRG 166
>pir||F96788 protein T4O12.22 [imported] - Arabidopsis thaliana
gi|8778814|gb|AAF79819.1|AC007396_20 T4O12.22
[Arabidopsis thaliana]
Length = 369
Score = 66.2 bits (160), Expect = 3e-10
Identities = 47/97 (48%), Positives = 53/97 (54%), Gaps = 9/97 (9%)
Frame = -3
Query: 609 RGRGFYNNGGMEYGGGDGWDGGRGYGGRGRGFG------RGRGRGFRG--RGRGGYGAQP 454
+GRG Y+ G GG DG G RGY G +G G +GRG G+ G +GRGGY
Sbjct: 264 QGRGGYD-GPQGRGGYDGPQGRRGYDGPPQGRGGYDGPSQGRG-GYDGPSQGRGGYDGPS 321
Query: 453 VGYYDYGEYDAPPAP-RGRGRGRGGRGRGRGRNASWG 346
G G YD P RGRGRGRGGRGRG GR G
Sbjct: 322 QGR---GGYDGPQGRGRGRGRGRGGRGRGGGRGGDGG 355
Score = 62.8 bits (151), Expect = 3e-09
Identities = 43/99 (43%), Positives = 47/99 (47%), Gaps = 21/99 (21%)
Frame = -3
Query: 609 RGRGFYNNG--GMEYGGGDGWDGGRGYGGRGRGFGRGRGRGFRGRGRGGYGAQP------ 454
RGRG N +E+ G GW+ + YG RG GRGRGR RGRGRGGY P
Sbjct: 182 RGRGGRGNAYVNVEHEDG-GWEREQSYG---RGRGRGRGRSSRGRGRGGYNGPPNEYDAP 237
Query: 453 -------------VGYYDYGEYDAPPAPRGRGRGRGGRG 376
GY D G YDAPP RG G GRG
Sbjct: 238 QDGGYGYDAPHEHRGYDDRGGYDAPPQGRGGYDGPQGRG 276
Score = 62.4 bits (150), Expect = 5e-09
Identities = 37/74 (50%), Positives = 42/74 (56%)
Frame = -3
Query: 585 GGMEYGGGDGWDGGRGYGGRGRGFGRGRGRGFRGRGRGGYGAQPVGYYDYGEYDAPPAPR 406
G ++Y G +G GGRG G RGRG GRGRGRG RG + G+ Y R
Sbjct: 156 GDIDYEGREGSPGGRGRG-RGRGRGRGRGRGGRGNAYVNVEHEDGGWEREQSYGRG---R 211
Query: 405 GRGRGRGGRGRGRG 364
GRGRGR RGRGRG
Sbjct: 212 GRGRGRSSRGRGRG 225
Score = 48.5 bits (114), Expect = 7e-05
Identities = 37/90 (41%), Positives = 42/90 (46%), Gaps = 6/90 (6%)
Frame = -3
Query: 609 RGRGFYNNGGMEYGGGDGWDGGRG-YGGRGRGFG-----RGRGRGFRGRGRGGYGAQPVG 448
+GRG Y+ GG DG GRG Y G +G G +GRGRG RGRGRGG
Sbjct: 292 QGRGGYDGPSQGRGGYDGPSQGRGGYDGPSQGRGGYDGPQGRGRG-RGRGRGG------- 343
Query: 447 YYDYGEYDAPPAPRGRGRGRGGRGRGRGRN 358
RGRG GRGG G R+
Sbjct: 344 -------------RGRGGGRGGDGGFNNRS 360
Score = 48.5 bits (114), Expect = 7e-05
Identities = 32/67 (47%), Positives = 37/67 (54%), Gaps = 2/67 (2%)
Frame = -3
Query: 609 RGRGFYNNGGMEYGGGDGWDGGRGY--GGRGRGFGRGRGRGFRGRGRGGYGAQPVGYYDY 436
+GRG Y+ GG DG GRG G +GRG GRGRGRG GRGRGG G+ +
Sbjct: 302 QGRGGYDGPSQGRGGYDGPSQGRGGYDGPQGRGRGRGRGRG--GRGRGGGRGGDGGFNN- 358
Query: 435 GEYDAPP 415
D PP
Sbjct: 359 -RSDGPP 364
Score = 41.2 bits (95), Expect = 0.011
Identities = 20/33 (60%), Positives = 23/33 (69%), Gaps = 1/33 (3%)
Frame = -3
Query: 459 QPVGYYDYGEYDAPPAPRGRGRGRG-GRGRGRG 364
+P+G DY + P RGRGRGRG GRGRGRG
Sbjct: 153 KPMGDIDYEGREGSPGGRGRGRGRGRGRGRGRG 185
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 594,005,651
Number of Sequences: 1393205
Number of extensions: 17507106
Number of successful extensions: 378685
Number of sequences better than 10.0: 8654
Number of HSP's better than 10.0 without gapping: 106257
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 227860
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 24568846532
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)