Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC004370A_C01 KMC004370A_c01
(731 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_176441.1| hypothetical protein; protein id: At1g62520.1 [... 243 2e-63
ref|NP_192982.1| putative protein; protein id: At4g12450.1 [Arab... 229 4e-59
ref|NP_193987.1| putative protein; protein id: At4g22560.1 [Arab... 225 4e-58
ref|NP_567769.1| expressed protein; protein id: At4g27240.1, sup... 127 2e-28
dbj|BAB92595.1| B1070A12.19 [Oryza sativa (japonica cultivar-gro... 124 1e-27
>ref|NP_176441.1| hypothetical protein; protein id: At1g62520.1 [Arabidopsis
thaliana] gi|5454194|gb|AAD43609.1|AC005698_8 T3P18.8
[Arabidopsis thaliana] gi|28393500|gb|AAO42171.1|
unknown protein [Arabidopsis thaliana]
gi|28973499|gb|AAO64074.1| unknown protein [Arabidopsis
thaliana]
Length = 280
Score = 243 bits (620), Expect = 2e-63
Identities = 133/192 (69%), Positives = 152/192 (78%), Gaps = 4/192 (2%)
Frame = -2
Query: 730 LTELTEGHPSRNVVEIIFHTSWGPKPFSGRVEMIFKVHNGSRTVSRFEEYREAVKVRA-G 554
LTEL+EGH SRNVVEIIF TSWGPKPFSGRVEMIFKV NGS+T++RFEEYREAVK R+ G
Sbjct: 100 LTELSEGHQSRNVVEIIFQTSWGPKPFSGRVEMIFKVQNGSKTLTRFEEYREAVKARSVG 159
Query: 553 SAKGSEDWEENARCVADGNEVMRFHCLGPTSGGGPYGGACAWSFPGGK--GSAICTFSGS 380
A+ EENAR VADGNE MRF+CLGP+ G GG AW GGK G++I TF+GS
Sbjct: 160 KAR-----EENARSVADGNETMRFYCLGPSYG----GGGSAWGILGGKGGGASIYTFAGS 210
Query: 379 GGAHESSGGGKGRRAMLVCRVVAGRVSKQIGF-MDSLLDGRVGFDSVSGDNGELFVFDSR 203
A+E +GGGKGR+AMLVCRV+AGRV+KQ DS D R FDSVSGD+GEL VFD+R
Sbjct: 211 STANEKAGGGKGRKAMLVCRVIAGRVTKQNELKYDS--DLRSRFDSVSGDDGELLVFDTR 268
Query: 202 AVLPCFLIIYKL 167
AVLPCFLIIY+L
Sbjct: 269 AVLPCFLIIYRL 280
>ref|NP_192982.1| putative protein; protein id: At4g12450.1 [Arabidopsis thaliana]
gi|7487232|pir||T07637 hypothetical protein T1P17.40 -
Arabidopsis thaliana gi|4725944|emb|CAB41715.1| putative
protein [Arabidopsis thaliana]
gi|7267947|emb|CAB78288.1| putative protein [Arabidopsis
thaliana]
Length = 277
Score = 229 bits (583), Expect = 4e-59
Identities = 121/193 (62%), Positives = 141/193 (72%), Gaps = 5/193 (2%)
Frame = -2
Query: 730 LTELTEGHPSRNVVEIIFHTSWGPKPFSGRVEMIFKVHNGSRTVSRFEEYREAVKVRAGS 551
LT+L +GHPSRNVVEIIF +SW F GRVEMIFKV NGS+ V+RFEEYREAVK R+ S
Sbjct: 97 LTDLPDGHPSRNVVEIIFQSSWSSDEFPGRVEMIFKVENGSKAVTRFEEYREAVKSRSCS 156
Query: 550 AKGSE-----DWEENARCVADGNEVMRFHCLGPTSGGGPYGGACAWSFPGGKGSAICTFS 386
S+ +ENARC ADGNE+MRF LGP GG G AW FPGGKG+A+CTFS
Sbjct: 157 KVDSDRVDGSACDENARCSADGNEMMRFFPLGPIPGGINGG---AWGFPGGKGAAVCTFS 213
Query: 385 GSGGAHESSGGGKGRRAMLVCRVVAGRVSKQIGFMDSLLDGRVGFDSVSGDNGELFVFDS 206
GSG AH S+GGG GRRAML+CRV+AGRV+K+ G G DSV+G GEL VFD+
Sbjct: 214 GSGEAHASTGGGGGRRAMLICRVIAGRVAKK---------GEFGSDSVAGRAGELIVFDA 264
Query: 205 RAVLPCFLIIYKL 167
RAVLPCFLI ++L
Sbjct: 265 RAVLPCFLIFFRL 277
>ref|NP_193987.1| putative protein; protein id: At4g22560.1 [Arabidopsis thaliana]
gi|7486606|pir||T05450 hypothetical protein F7K2.140 -
Arabidopsis thaliana gi|3892711|emb|CAA22161.1| putative
protein [Arabidopsis thaliana]
gi|7269102|emb|CAB79211.1| putative protein [Arabidopsis
thaliana]
Length = 264
Score = 225 bits (574), Expect = 4e-58
Identities = 121/188 (64%), Positives = 143/188 (75%)
Frame = -2
Query: 730 LTELTEGHPSRNVVEIIFHTSWGPKPFSGRVEMIFKVHNGSRTVSRFEEYREAVKVRAGS 551
LTEL +GHPSRNVVEIIFH+SW F GR+EMIFKV +GSRTV+RFEEYRE VK RAG
Sbjct: 93 LTELPDGHPSRNVVEIIFHSSWSSDEFPGRIEMIFKVEHGSRTVTRFEEYREVVKSRAGF 152
Query: 550 AKGSEDWEENARCVADGNEVMRFHCLGPTSGGGPYGGACAWSFPGGKGSAICTFSGSGGA 371
G+ + EE+ARC+ADGNE+MRF+ P G GGAC F GGKG A+CTFSGSG A
Sbjct: 153 NGGTCE-EEDARCLADGNEMMRFY---PVLDGF-NGGACV--FAGGKGQAVCTFSGSGEA 205
Query: 370 HESSGGGKGRRAMLVCRVVAGRVSKQIGFMDSLLDGRVGFDSVSGDNGELFVFDSRAVLP 191
+ SSGGG GR+AM++CRV+AGRV IGF G DSV+G +GELFVFD+RAVLP
Sbjct: 206 YVSSGGGGGRKAMMICRVIAGRVDDVIGF---------GSDSVAGRDGELFVFDTRAVLP 256
Query: 190 CFLIIYKL 167
CFLII++L
Sbjct: 257 CFLIIFRL 264
>ref|NP_567769.1| expressed protein; protein id: At4g27240.1, supported by cDNA:
2944. [Arabidopsis thaliana] gi|7486806|pir||T05748
hypothetical protein M4I22.50 - Arabidopsis thaliana
gi|3269285|emb|CAA19718.1| hypothetical protein
[Arabidopsis thaliana] gi|7269577|emb|CAB79579.1|
hypothetical protein [Arabidopsis thaliana]
Length = 431
Score = 127 bits (318), Expect = 2e-28
Identities = 82/215 (38%), Positives = 117/215 (54%), Gaps = 28/215 (13%)
Frame = -2
Query: 730 LTELTEGHPSRNVVEIIFHTSW-GPKPFSGRVEMIFKVHNGSRTVSRFEEYREAVKVRAG 554
+TEL EG SR +VEII TSW + GR++ I KVHN +T++RFEEYR+ VK+RA
Sbjct: 221 VTELMEGDSSRRIVEIICRTSWLKTENQGGRIDRILKVHNMQKTLARFEEYRDTVKIRAS 280
Query: 553 SAKGSEDWEENARCVADGNEVMRFHCLGPTSGGGPYGGACAWSFPG-------------- 416
+ +++ RC+ADGNE++RFH G G S
Sbjct: 281 KLQ-----KKHPRCIADGNELLRFHGTTVACALGINGSTSLCSSEKCCVCRIIRNGFSAK 335
Query: 415 ---GKGSAICTFSGSGGAHES----SGGGKGRRAMLVCRVVAGRVSKQIGFMDSLLDGRV 257
G + T S S A ES GGG R+A++VCRV+AGRV + + ++ +
Sbjct: 336 REMNNGIGVFTASTSERAFESIVIGDGGGGDRKALIVCRVIAGRVHRPVENVEEMGGLLS 395
Query: 256 GFDSVSGDNG------ELFVFDSRAVLPCFLIIYK 170
GFDS++G G EL++ +SRA+LPCF++I K
Sbjct: 396 GFDSLAGKVGLYTNVEELYLLNSRALLPCFVLICK 430
>dbj|BAB92595.1| B1070A12.19 [Oryza sativa (japonica cultivar-group)]
Length = 485
Score = 124 bits (311), Expect = 1e-27
Identities = 88/223 (39%), Positives = 119/223 (52%), Gaps = 36/223 (16%)
Frame = -2
Query: 730 LTELTEGHPSRNVVEIIFHTSWGPKPFSG-RVEMIFKVHNGSRTVSRFEEYREAVKVRAG 554
+TEL EG SR +VEII TS S R+E +FKVHN RT++RFEEYREAVK++A
Sbjct: 268 VTELVEGDSSRKIVEIICRTSLLKSESSCVRIERVFKVHNTQRTLARFEEYREAVKLKAS 327
Query: 553 SAKGSEDWEENARCVADGNEVMRFH------CLGPTSGGGPY--GGACA----------W 428
+++ RC+ADGNE++RFH LG +G CA
Sbjct: 328 KLP-----KKHPRCLADGNELLRFHGATLSCALGGAAGSSSLCASDKCAVCRIIRHGFSA 382
Query: 427 SFPGGKGSAICTFSGSGGAHESSGGGKG-----------RRAMLVCRVVAGRVSKQIGFM 281
G G + T S SG A+ES G R+A+LVCRV+AGRV K + +
Sbjct: 383 KKEGKAGVGVFTTSTSGRAYESIEASAGAVVGGDDPAATRKALLVCRVIAGRVHKPLENL 442
Query: 280 DSLLDGRVGFDSVSGDNG------ELFVFDSRAVLPCFLIIYK 170
G+ GFDS++G G EL++ + RA+LPCF++I K
Sbjct: 443 KEFA-GQTGFDSLAGKVGPYSNIEELYLLNPRALLPCFVVICK 484
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 735,747,431
Number of Sequences: 1393205
Number of extensions: 21250702
Number of successful extensions: 181954
Number of sequences better than 10.0: 1916
Number of HSP's better than 10.0 without gapping: 95603
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 155075
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 34625071581
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)