Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC004138A_C01 KMC004138A_c01
(582 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAG51026.1|AC069474_25 zinc finger protein, putative, 5' part... 162 3e-39
gb|AAH19429.1| Unknown (protein for MGC:30371) [Mus musculus] 162 3e-39
ref|NP_187874.2| hypothetical protein; protein id: At3g12680.1, ... 162 3e-39
dbj|BAB02411.1| zinc finger protein-like [Arabidopsis thaliana] 134 6e-31
ref|NP_182306.1| unknown protein; protein id: At2g47850.1 [Arabi... 107 1e-22
>gb|AAG51026.1|AC069474_25 zinc finger protein, putative, 5' partial; 146-2518 [Arabidopsis
thaliana]
Length = 328
Score = 162 bits (410), Expect = 3e-39
Identities = 76/128 (59%), Positives = 90/128 (69%), Gaps = 2/128 (1%)
Frame = -3
Query: 580 IQLSLVYQA--YEPRLSNPTSQVGIAGPIYPQRPGQIECDFYMKNGECKFGERCKFHHPI 407
+ L LV A + L+ PT +G+ YPQRPGQ ECD+YMK GECKFGERCKFHHP
Sbjct: 194 LNLGLVTPATSFYQTLTQPT--LGVISATYPQRPGQSECDYYMKTGECKFGERCKFHHPA 251
Query: 406 DRSAPLLSKQASQQTVKLTPAGLPRREGAAMCPYYLKTGTCKFGATCKFDHPPPGEVMEM 227
DR + + + Q VKL+ AG PRREGA CPYY+KTGTCK+GATCKFDHPPPGEVM
Sbjct: 252 DRLSAMTKQAPQQPNVKLSLAGYPRREGALNCPYYMKTGTCKYGATCKFDHPPPGEVMAK 311
Query: 226 AKSQGTSA 203
S+ +A
Sbjct: 312 TTSEADAA 319
Score = 95.1 bits (235), Expect = 6e-19
Identities = 46/101 (45%), Positives = 56/101 (54%), Gaps = 19/101 (18%)
Frame = -3
Query: 496 PQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQASQQT-------------VK 356
P+RP + C FYMK G+CKFG CKFHHP D P S+ V
Sbjct: 70 PERPSEPMCTFYMKTGKCKFGLSCKFHHPKDIQLPSSSQDIGSSVGLTSEPDATNNPHVT 129
Query: 355 LTPA------GLPRREGAAMCPYYLKTGTCKFGATCKFDHP 251
TPA GLP R G CP+YLKTG+CK+GATC+++HP
Sbjct: 130 FTPALYHNSKGLPVRSGEVDCPFYLKTGSCKYGATCRYNHP 170
Score = 89.7 bits (221), Expect = 2e-17
Identities = 39/98 (39%), Positives = 58/98 (58%)
Frame = -3
Query: 499 YPQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQASQQTVKLTPAGLPRREGA 320
YP+RPG+ +C +Y+K CK+G +CKF+HP + +A + Q S LP R
Sbjct: 26 YPERPGEPDCPYYIKTQRCKYGSKCKFNHPREEAAVSVETQDS----------LPERPSE 75
Query: 319 AMCPYYLKTGTCKFGATCKFDHPPPGEVMEMAKSQGTS 206
MC +Y+KTG CKFG +CKF HP ++ ++ G+S
Sbjct: 76 PMCTFYMKTGKCKFGLSCKFHHPKDIQLPSSSQDIGSS 113
>gb|AAH19429.1| Unknown (protein for MGC:30371) [Mus musculus]
Length = 526
Score = 162 bits (410), Expect = 3e-39
Identities = 78/131 (59%), Positives = 91/131 (68%), Gaps = 1/131 (0%)
Frame = -3
Query: 565 VYQAYEPRLSNPTSQVGIAGPIYPQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLL 386
++Q+ +P L P + G IYPQRPGQIECD+YMK GECKFGERCKFHHPIDRSAP
Sbjct: 392 IFQSLQPGL--PQTTFGQVPVIYPQRPGQIECDYYMKRGECKFGERCKFHHPIDRSAP-A 448
Query: 385 SKQASQQTVKLTPAGLPRREGAAMCPYYLKTGTCKFGATCKFDHPPPGEVMEMA-KSQGT 209
Q +Q VK T AGLPRREG CP+Y+KTGTC + CKFDHPPPGEVM A K T
Sbjct: 449 RNQTLEQNVKFTLAGLPRREGVQHCPFYMKTGTCSYAVNCKFDHPPPGEVMAAAGKETLT 508
Query: 208 SANNGGEARKQ 176
A G+ ++
Sbjct: 509 KAEEEGKESEE 519
Score = 90.1 bits (222), Expect = 2e-17
Identities = 39/88 (44%), Positives = 57/88 (64%)
Frame = -3
Query: 514 IAGPIYPQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQASQQTVKLTPAGLP 335
+ IYP+RPG+ +C +++K +CKFG +CKF+HP D+ +L++ S +T + LP
Sbjct: 191 VPSEIYPERPGEPDCPYFLKTQKCKFGLKCKFNHPKDKL--ILAQDTSAET---DASDLP 245
Query: 334 RREGAAMCPYYLKTGTCKFGATCKFDHP 251
R +C +Y KTG CKFGA CKF HP
Sbjct: 246 ERPTEPLCSFYTKTGKCKFGAACKFHHP 273
Score = 87.0 bits (214), Expect = 2e-16
Identities = 44/104 (42%), Positives = 57/104 (54%), Gaps = 1/104 (0%)
Frame = -3
Query: 505 PIYPQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQASQQTVKLTPAGL-PRR 329
P+YPQRPG+ +C YM CKFG+ CKF HPI A + V P+ + P R
Sbjct: 143 PVYPQRPGEKDCTHYMLTRTCKFGDNCKFDHPIWVPAGGIPDWKEPPAV---PSEIYPER 199
Query: 328 EGAAMCPYYLKTGTCKFGATCKFDHPPPGEVMEMAKSQGTSANN 197
G CPY+LKT CKFG CKF+HP ++ S T A++
Sbjct: 200 PGEPDCPYFLKTQKCKFGLKCKFNHPKDKLILAQDTSAETDASD 243
Score = 87.0 bits (214), Expect = 2e-16
Identities = 44/112 (39%), Positives = 58/112 (51%), Gaps = 25/112 (22%)
Frame = -3
Query: 496 PQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQA------------------- 374
P+RP + C FY K G+CKFG CKFHHP D P ++A
Sbjct: 245 PERPTEPLCSFYTKTGKCKFGAACKFHHPKDIQVPSDGQEADDEEQTHSIDHLGTTSDGN 304
Query: 373 SQQTVKLTPA------GLPRREGAAMCPYYLKTGTCKFGATCKFDHPPPGEV 236
S + + PA GLP R G CP+YLKTG+CK+GA C+++HP E+
Sbjct: 305 SLKALTSNPASSYNSKGLPVRMGEVDCPFYLKTGSCKYGANCRYNHPDRNEL 356
>ref|NP_187874.2| hypothetical protein; protein id: At3g12680.1, supported by cDNA:
gi_16797660 [Arabidopsis thaliana]
gi|16797661|gb|AAK01470.1| floral homeotic protein HUA1
[Arabidopsis thaliana]
Length = 524
Score = 162 bits (410), Expect = 3e-39
Identities = 76/128 (59%), Positives = 90/128 (69%), Gaps = 2/128 (1%)
Frame = -3
Query: 580 IQLSLVYQA--YEPRLSNPTSQVGIAGPIYPQRPGQIECDFYMKNGECKFGERCKFHHPI 407
+ L LV A + L+ PT +G+ YPQRPGQ ECD+YMK GECKFGERCKFHHP
Sbjct: 390 LNLGLVTPATSFYQTLTQPT--LGVISATYPQRPGQSECDYYMKTGECKFGERCKFHHPA 447
Query: 406 DRSAPLLSKQASQQTVKLTPAGLPRREGAAMCPYYLKTGTCKFGATCKFDHPPPGEVMEM 227
DR + + + Q VKL+ AG PRREGA CPYY+KTGTCK+GATCKFDHPPPGEVM
Sbjct: 448 DRLSAMTKQAPQQPNVKLSLAGYPRREGALNCPYYMKTGTCKYGATCKFDHPPPGEVMAK 507
Query: 226 AKSQGTSA 203
S+ +A
Sbjct: 508 TTSEADAA 515
Score = 95.1 bits (235), Expect = 6e-19
Identities = 46/101 (45%), Positives = 56/101 (54%), Gaps = 19/101 (18%)
Frame = -3
Query: 496 PQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQASQQT-------------VK 356
P+RP + C FYMK G+CKFG CKFHHP D P S+ V
Sbjct: 266 PERPSEPMCTFYMKTGKCKFGLSCKFHHPKDIQLPSSSQDIGSSVGLTSEPDATNNPHVT 325
Query: 355 LTPA------GLPRREGAAMCPYYLKTGTCKFGATCKFDHP 251
TPA GLP R G CP+YLKTG+CK+GATC+++HP
Sbjct: 326 FTPALYHNSKGLPVRSGEVDCPFYLKTGSCKYGATCRYNHP 366
Score = 89.7 bits (221), Expect = 2e-17
Identities = 39/98 (39%), Positives = 58/98 (58%)
Frame = -3
Query: 499 YPQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQASQQTVKLTPAGLPRREGA 320
YP+RPG+ +C +Y+K CK+G +CKF+HP + +A + Q S LP R
Sbjct: 222 YPERPGEPDCPYYIKTQRCKYGSKCKFNHPREEAAVSVETQDS----------LPERPSE 271
Query: 319 AMCPYYLKTGTCKFGATCKFDHPPPGEVMEMAKSQGTS 206
MC +Y+KTG CKFG +CKF HP ++ ++ G+S
Sbjct: 272 PMCTFYMKTGKCKFGLSCKFHHPKDIQLPSSSQDIGSS 309
Score = 82.0 bits (201), Expect = 5e-15
Identities = 38/85 (44%), Positives = 49/85 (56%)
Frame = -3
Query: 505 PIYPQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQASQQTVKLTPAGLPRRE 326
PIYPQR G+ +C YM+ CKFGE C+F HPI P ++ + P R
Sbjct: 169 PIYPQRAGEKDCTHYMQTRTCKFGESCRFDHPI--WVPEGGIPDWKEAPVVPNEEYPERP 226
Query: 325 GAAMCPYYLKTGTCKFGATCKFDHP 251
G CPYY+KT CK+G+ CKF+HP
Sbjct: 227 GEPDCPYYIKTQRCKYGSKCKFNHP 251
>dbj|BAB02411.1| zinc finger protein-like [Arabidopsis thaliana]
Length = 326
Score = 134 bits (338), Expect = 6e-31
Identities = 58/88 (65%), Positives = 67/88 (75%)
Frame = -3
Query: 466 FYMKNGECKFGERCKFHHPIDRSAPLLSKQASQQTVKLTPAGLPRREGAAMCPYYLKTGT 287
+YMK GECKFGERCKFHHP DR + + + Q VKL+ AG PRREGA CPYY+KTGT
Sbjct: 230 YYMKTGECKFGERCKFHHPADRLSAMTKQAPQQPNVKLSLAGYPRREGALNCPYYMKTGT 289
Query: 286 CKFGATCKFDHPPPGEVMEMAKSQGTSA 203
CK+GATCKFDHPPPGEVM S+ +A
Sbjct: 290 CKYGATCKFDHPPPGEVMAKTTSEADAA 317
Score = 90.5 bits (223), Expect = 1e-17
Identities = 50/142 (35%), Positives = 72/142 (50%), Gaps = 2/142 (1%)
Frame = -3
Query: 499 YPQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQASQQTVKLTPAGLPRREGA 320
YP+RPG+ +C +Y+K CK+G +CKF+HP + +A + Q S LP R
Sbjct: 39 YPERPGEPDCPYYIKTQRCKYGSKCKFNHPREEAAVSVETQDS----------LPERPSE 88
Query: 319 AMCPYYLKTGTCKFGATCKFDHPPPGEVMEMAKSQGTSANNGGEA-RKQ*KGVWFCSGTA 143
MC +Y+KTG CKFG +CKF HP ++ ++ G+S E V F
Sbjct: 89 PMCTFYMKTGKCKFGLSCKFHHPKDIQLPSSSQDIGSSVGLTSEPDATNNPHVTFTPALY 148
Query: 142 VRSY-M*FRSMFYETLSCLFYL 80
S + RS+F + C FYL
Sbjct: 149 HNSKGLPVRSLFQGEVDCPFYL 170
Score = 90.5 bits (223), Expect = 1e-17
Identities = 46/104 (44%), Positives = 57/104 (54%), Gaps = 22/104 (21%)
Frame = -3
Query: 496 PQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQASQQT-------------VK 356
P+RP + C FYMK G+CKFG CKFHHP D P S+ V
Sbjct: 83 PERPSEPMCTFYMKTGKCKFGLSCKFHHPKDIQLPSSSQDIGSSVGLTSEPDATNNPHVT 142
Query: 355 LTPA------GLPRR---EGAAMCPYYLKTGTCKFGATCKFDHP 251
TPA GLP R +G CP+YLKTG+CK+GATC+++HP
Sbjct: 143 FTPALYHNSKGLPVRSLFQGEVDCPFYLKTGSCKYGATCRYNHP 186
Score = 67.4 bits (163), Expect = 1e-10
Identities = 32/95 (33%), Positives = 48/95 (49%), Gaps = 10/95 (10%)
Frame = -3
Query: 505 PIYPQRPGQIECDFYMKNGECKFGERCKFHHPIDRSA----------PLLSKQASQQTVK 356
P+ G+++C FY+K G CK+G C+++HP +R+A L+S + +
Sbjct: 155 PVRSLFQGEVDCPFYLKTGSCKYGATCRYNHP-ERTAFIPQAAGVNYSLVSSNTANLNLG 213
Query: 355 LTPAGLPRREGAAMCPYYLKTGTCKFGATCKFDHP 251
L + YY+KTG CKFG CKF HP
Sbjct: 214 LVTPATSFYQTLTQPTYYMKTGECKFGERCKFHHP 248
Score = 60.8 bits (146), Expect = 1e-08
Identities = 29/70 (41%), Positives = 38/70 (53%)
Frame = -3
Query: 460 MKNGECKFGERCKFHHPIDRSAPLLSKQASQQTVKLTPAGLPRREGAAMCPYYLKTGTCK 281
M+ CKFGE C+F HPI P ++ + P R G CPYY+KT CK
Sbjct: 1 MQTRTCKFGESCRFDHPI--WVPEGGIPDWKEAPVVPNEEYPERPGEPDCPYYIKTQRCK 58
Query: 280 FGATCKFDHP 251
+G+ CKF+HP
Sbjct: 59 YGSKCKFNHP 68
Score = 50.8 bits (120), Expect = 1e-05
Identities = 20/41 (48%), Positives = 27/41 (65%)
Frame = -3
Query: 532 PTSQVGIAGPIYPQRPGQIECDFYMKNGECKFGERCKFHHP 410
P ++ +AG YP+R G + C +YMK G CK+G CKF HP
Sbjct: 263 PNVKLSLAG--YPRREGALNCPYYMKTGTCKYGATCKFDHP 301
>ref|NP_182306.1| unknown protein; protein id: At2g47850.1 [Arabidopsis thaliana]
gi|25409067|pir||C84920 hypothetical protein At2g47850
[imported] - Arabidopsis thaliana
gi|3738297|gb|AAC63639.1| unknown protein [Arabidopsis
thaliana]
Length = 553
Score = 107 bits (266), Expect = 1e-22
Identities = 48/99 (48%), Positives = 62/99 (62%)
Frame = -3
Query: 547 PRLSNPTSQVGIAGPIYPQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQASQ 368
P LS+PT + +P+RPG+ EC +Y+K G+CKFG CKFHHP DR P +
Sbjct: 356 PSLSSPTGVIQ-KEQAFPERPGEPECQYYLKTGDCKFGTSCKFHHPRDRVPP-------R 407
Query: 367 QTVKLTPAGLPRREGAAMCPYYLKTGTCKFGATCKFDHP 251
L+P GLP R G C +Y++ G CKFG+TCKFDHP
Sbjct: 408 ANCVLSPIGLPLRPGVQRCTFYVQNGFCKFGSTCKFDHP 446
Score = 95.5 bits (236), Expect = 4e-19
Identities = 47/105 (44%), Positives = 56/105 (52%), Gaps = 5/105 (4%)
Frame = -3
Query: 544 RLSNPTSQVGIAGPI-----YPQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSK 380
R ++P + + + YP+R G+ C FY+K G CKFG CKFHHP +
Sbjct: 141 RYNHPRDRASVEATVRATGQYPERFGEPPCQFYLKTGTCKFGASCKFHHPKNAGG----- 195
Query: 379 QASQQTVKLTPAGLPRREGAAMCPYYLKTGTCKFGATCKFDHPPP 245
S V L G P REG C YYLKTG CKFG TCKF HP P
Sbjct: 196 --SMSHVPLNIYGYPVREGDNECSYYLKTGQCKFGITCKFHHPQP 238
Score = 90.1 bits (222), Expect = 2e-17
Identities = 39/83 (46%), Positives = 54/83 (64%)
Frame = -3
Query: 499 YPQRPGQIECDFYMKNGECKFGERCKFHHPIDRSAPLLSKQASQQTVKLTPAGLPRREGA 320
YP+RPG +C +YM+ G C +G RC+++HP DR++ + +A+ Q P R G
Sbjct: 116 YPERPGAPDCAYYMRTGVCGYGNRCRYNHPRDRASVEATVRATGQ--------YPERFGE 167
Query: 319 AMCPYYLKTGTCKFGATCKFDHP 251
C +YLKTGTCKFGA+CKF HP
Sbjct: 168 PPCQFYLKTGTCKFGASCKFHHP 190
Score = 48.5 bits (114), Expect = 6e-05
Identities = 22/47 (46%), Positives = 28/47 (58%)
Frame = -3
Query: 547 PRLSNPTSQVGIAGPIYPQRPGQIECDFYMKNGECKFGERCKFHHPI 407
PR + S +G+ P RPG C FY++NG CKFG CKF HP+
Sbjct: 406 PRANCVLSPIGL-----PLRPGVQRCTFYVQNGFCKFGSTCKFDHPM 447
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 522,381,278
Number of Sequences: 1393205
Number of extensions: 11712753
Number of successful extensions: 30443
Number of sequences better than 10.0: 263
Number of HSP's better than 10.0 without gapping: 28173
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 29993
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21712003912
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)