Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC017641A_C01 KMC017641A_c01
(580 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAD33925.1| proline rich protein 3 [Cicer arietinum] 244 4e-64
gb|AAK32800.1|AF361632_1 At3g21211 [Arabidopsis thaliana] gi|235... 226 2e-58
dbj|BAB01713.1| gene_id:MXL8.7~unknown protein [Arabidopsis thal... 226 2e-58
ref|NP_683582.1| similar to RRM-containing protein; protein id: ... 211 4e-54
gb|AAO37215.1| hypothetical protein [Arabidopsis thaliana] 89 4e-17
>emb|CAD33925.1| proline rich protein 3 [Cicer arietinum]
Length = 284
Score = 244 bits (624), Expect = 4e-64
Identities = 115/124 (92%), Positives = 123/124 (98%)
Frame = +1
Query: 208 DEVRTIFITGLPDDVKERELQNLLRWLPGFEASQLNFKADKPMGFALFSSPHQAIAAKDI 387
+EVRTIFITGLP+DVKERE+QNLLRWLPGFEASQLNFKA+KPMGFALFSSPHQAIAAKDI
Sbjct: 2 EEVRTIFITGLPEDVKEREIQNLLRWLPGFEASQLNFKAEKPMGFALFSSPHQAIAAKDI 61
Query: 388 LQDMLFDPEAKSVLHTEMAKKNLFIKRGIGADAAAFDQSKRLRTAGDYTHTGYVTPSPYH 567
LQDMLFDP++KSVLHTEMAKKNLF+KRGIGADA AFDQSKRLRTAGDYTHTGYVTPSP+H
Sbjct: 62 LQDMLFDPDSKSVLHTEMAKKNLFVKRGIGADAVAFDQSKRLRTAGDYTHTGYVTPSPFH 121
Query: 568 PPPP 579
PPPP
Sbjct: 122 PPPP 125
Score = 37.0 bits (84), Expect = 0.18
Identities = 33/118 (27%), Positives = 48/118 (39%), Gaps = 8/118 (6%)
Frame = +1
Query: 148 PAAAVPPPTPPAAAPP--------TLSPDEVRTIFITGLPDDVKERELQNLLRWLPGFEA 303
P A VP P P + A P T T+FI L +++ E E++ L PGF+
Sbjct: 149 PVAPVPMPAPVSIAAPSSYVPVQNTKDNPPCNTLFIGNLGENINEEEVRGLFSVQPGFKQ 208
Query: 304 SQLNFKADKPMGFALFSSPHQAIAAKDILQDMLFDPEAKSVLHTEMAKKNLFIKRGIG 477
++ + + F F + A LQ + P + SV KN F KR G
Sbjct: 209 MKILRQERHTVCFIEFEDVNSATNVHHNLQGAVI-PSSGSVGMRIQYSKNPFGKRKDG 265
>gb|AAK32800.1|AF361632_1 At3g21211 [Arabidopsis thaliana] gi|23505943|gb|AAN28831.1|
At3g21211/At3g21211 [Arabidopsis thaliana]
gi|26451397|dbj|BAC42798.1| unknown protein [Arabidopsis
thaliana]
Length = 339
Score = 226 bits (575), Expect = 2e-58
Identities = 113/177 (63%), Positives = 130/177 (72%), Gaps = 21/177 (11%)
Frame = +1
Query: 112 GNGIHPYHQQWPPAAAVPPPTPPAAAPPTLSP---------------------DEVRTIF 228
G GIHPYHQQWPPA A PPP ++A P P DE+RTIF
Sbjct: 3 GAGIHPYHQQWPPAGAPPPPAAVSSAAPPHPPPIHHHPPPPPVLVDNHNRPPYDELRTIF 62
Query: 229 ITGLPDDVKERELQNLLRWLPGFEASQLNFKADKPMGFALFSSPHQAIAAKDILQDMLFD 408
I GLPDDVKEREL NLLRWLPG+EASQ+NFK +KPMGFALFS+ A+AAKD LQ M+FD
Sbjct: 63 IAGLPDDVKERELLNLLRWLPGYEASQVNFKGEKPMGFALFSTAQFAMAAKDTLQHMVFD 122
Query: 409 PEAKSVLHTEMAKKNLFIKRGIGADAAAFDQSKRLRTAGDYTHTGYVTPSPYHPPPP 579
E+KSV+HTEMAKKNLF+KRGI D+ A+DQSKRLRT GD TH+ Y +PSP+HPPPP
Sbjct: 123 AESKSVIHTEMAKKNLFVKRGIVGDSNAYDQSKRLRTGGDCTHSVY-SPSPFHPPPP 178
Score = 41.6 bits (96), Expect = 0.007
Identities = 38/144 (26%), Positives = 59/144 (40%), Gaps = 7/144 (4%)
Frame = +1
Query: 127 PYHQQWPPAAAVPPPTPPAAAPPTLSPDE-------VRTIFITGLPDDVKERELQNLLRW 285
PY P +P P PP AAP + P + T+FI L +++ E EL++LL
Sbjct: 197 PYAGYHAPPVPMPTP-PPIAAPSSYVPVQNIKDNPPCNTLFIGNLGENINEEELRSLLSA 255
Query: 286 LPGFEASQLNFKADKPMGFALFSSPHQAIAAKDILQDMLFDPEAKSVLHTEMAKKNLFIK 465
PGF+ ++ + + F F + A LQ + P + S+ KN + K
Sbjct: 256 QPGFKQMKILRQERHTVCFIEFEDVNSATNVHHNLQGAVI-PSSGSIGMRIQYSKNPYGK 314
Query: 466 RGIGADAAAFDQSKRLRTAGDYTH 537
R G + F G T+
Sbjct: 315 RKEGGGYSFFPSPSANGAQGALTY 338
>dbj|BAB01713.1| gene_id:MXL8.7~unknown protein [Arabidopsis thaliana]
Length = 317
Score = 226 bits (575), Expect = 2e-58
Identities = 113/177 (63%), Positives = 130/177 (72%), Gaps = 21/177 (11%)
Frame = +1
Query: 112 GNGIHPYHQQWPPAAAVPPPTPPAAAPPTLSP---------------------DEVRTIF 228
G GIHPYHQQWPPA A PPP ++A P P DE+RTIF
Sbjct: 3 GAGIHPYHQQWPPAGAPPPPAAVSSAAPPHPPPIHHHPPPPPVLVDNHNRPPYDELRTIF 62
Query: 229 ITGLPDDVKERELQNLLRWLPGFEASQLNFKADKPMGFALFSSPHQAIAAKDILQDMLFD 408
I GLPDDVKEREL NLLRWLPG+EASQ+NFK +KPMGFALFS+ A+AAKD LQ M+FD
Sbjct: 63 IAGLPDDVKERELLNLLRWLPGYEASQVNFKGEKPMGFALFSTAQFAMAAKDTLQHMVFD 122
Query: 409 PEAKSVLHTEMAKKNLFIKRGIGADAAAFDQSKRLRTAGDYTHTGYVTPSPYHPPPP 579
E+KSV+HTEMAKKNLF+KRGI D+ A+DQSKRLRT GD TH+ Y +PSP+HPPPP
Sbjct: 123 AESKSVIHTEMAKKNLFVKRGIVGDSNAYDQSKRLRTGGDCTHSVY-SPSPFHPPPP 178
Score = 38.1 bits (87), Expect = 0.080
Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 7/69 (10%)
Frame = +1
Query: 127 PYHQQWPPAAAVPPPTPPAAAPPTLSPDE-------VRTIFITGLPDDVKERELQNLLRW 285
PY P +P P PP AAP + P + T+FI L +++ E EL++LL
Sbjct: 197 PYAGYHAPPVPMPTP-PPIAAPSSYVPVQNIKDNPPCNTLFIGNLGENINEEELRSLLSA 255
Query: 286 LPGFEASQL 312
PGF+ ++
Sbjct: 256 QPGFKQMKI 264
>ref|NP_683582.1| similar to RRM-containing protein; protein id: At3g21215.1
[Arabidopsis thaliana]
Length = 285
Score = 211 bits (538), Expect = 4e-54
Identities = 112/207 (54%), Positives = 129/207 (62%), Gaps = 51/207 (24%)
Frame = +1
Query: 112 GNGIHPYHQQWPPAAAVPPPTPPAAAPPTLSPD--------------------------- 210
G GIHPYHQQWPPA A PPP ++A P P
Sbjct: 3 GAGIHPYHQQWPPAGAPPPPAAVSSAAPPHPPPIHHHPPPPPVLVDNHNRPPYDEVQLFL 62
Query: 211 ------------------------EVRTIFITGLPDDVKERELQNLLRWLPGFEASQLNF 318
E+RTIFI GLPDDVKEREL NLLRWLPG+EASQ+NF
Sbjct: 63 FLSIHIGDSCAVLSRWACLIFVYVELRTIFIAGLPDDVKERELLNLLRWLPGYEASQVNF 122
Query: 319 KADKPMGFALFSSPHQAIAAKDILQDMLFDPEAKSVLHTEMAKKNLFIKRGIGADAAAFD 498
K +KPMGFALFS+ A+AAKD LQ M+FD E+KSV+HTEMAKKNLF+KRGI D+ A+D
Sbjct: 123 KGEKPMGFALFSTAQFAMAAKDTLQHMVFDAESKSVIHTEMAKKNLFVKRGIVGDSNAYD 182
Query: 499 QSKRLRTAGDYTHTGYVTPSPYHPPPP 579
QSKRLRT GD TH+ Y +PSP+HPPPP
Sbjct: 183 QSKRLRTGGDCTHSVY-SPSPFHPPPP 208
Score = 31.6 bits (70), Expect = 7.5
Identities = 20/51 (39%), Positives = 28/51 (54%), Gaps = 8/51 (15%)
Frame = +1
Query: 151 AAAVPPPTPP-AAAPPTLSPDE-------VRTIFITGLPDDVKERELQNLL 279
A VP PTPP AAP + P + T+FI L +++ E EL++LL
Sbjct: 233 APPVPMPTPPPIAAPSSYVPVQNIKDNPPCNTLFIGNLGENINEEELRSLL 283
>gb|AAO37215.1| hypothetical protein [Arabidopsis thaliana]
Length = 277
Score = 89.0 bits (219), Expect = 4e-17
Identities = 55/144 (38%), Positives = 74/144 (51%), Gaps = 15/144 (10%)
Frame = +1
Query: 160 VPPPTPPAAAPPTLSP--------------DEVRTIFITGLPDDVKERELQNLLRWLPGF 297
VPPP P + P S DEVRT+F+ GLP+DVK RE+ NL R PG+
Sbjct: 2 VPPPPPGVSPIPITSAHSVYLPTHVSIGARDEVRTLFVAGLPEDVKPREIYNLFREFPGY 61
Query: 298 EASQL-NFKADKPMGFALFSSPHQAIAAKDILQDMLFDPEAKSVLHTEMAKKNLFIKRGI 474
E S L + KP FA+FS A+A L M+FD E S LH ++AK N KR
Sbjct: 62 ETSHLRSSDGAKPFAFAVFSDLQSAVAVMHALNGMVFDLEKHSTLHIDLAKSNPKSKRSR 121
Query: 475 GADAAAFDQSKRLRTAGDYTHTGY 546
D ++ K+L++ T +G+
Sbjct: 122 TDD--GWESLKKLKSWNTTTESGF 143
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 581,827,914
Number of Sequences: 1393205
Number of extensions: 14663436
Number of successful extensions: 142383
Number of sequences better than 10.0: 429
Number of HSP's better than 10.0 without gapping: 76688
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 126889
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21426319650
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)