Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000472A_C01 KMC000472A_c01
(936 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                     Score  E
Sequences producing significant alignments:                         (bits)  Value
ref|NP_173499.1| hypothetical protein; protein id: At1g20760.1 [...    179  1e-49
ref|NP_173582.1| unknown protein; protein id: At1g21630.1 [Arabi...    165  2e-44
ref|NP_566657.1| expressed protein; protein id: At3g20290.1, sup...     69  1e-10
dbj|BAB02809.1| contains similarity to EH domain containing prot...     69  1e-10
gb|EAA33988.1| hypothetical protein [Neurospora crassa]                 62  2e-08
>ref|NP_173499.1| hypothetical protein; protein id: At1g20760.1 [Arabidopsis
thaliana] gi|8886934|gb|AAF80620.1|AC069251_13 F2D10.25
[Arabidopsis thaliana]
Length = 1019
Score = 179 bits (454), Expect(2) = 1e-49
Identities = 89/121 (73%), Positives = 102/121 (83%)
Frame = +2
Query: 149 MAGGPPNMDQFESFFRRADLDGDGRISGAEAVSFFQGSNLSKQVLAQVWMYADQAKTGFL 328
MAG PNMDQFE++F+RADLDGDGRISGAEAV FFQGS LSKQVLAQ+W +D++ +GFL
Sbjct: 1 MAGQNPNMDQFEAYFKRADLDGDGRISGAEAVGFFQGSGLSKQVLAQIWSLSDRSHSGFL 60
Query: 329 GRTEFYNALRLVTVAQSKRDLTPDIVKAALFGPAAAKIPAPQINLAAIPQXRPNPAPPQM 508
R FYN+LRLVTVAQSKRDLTP+IV AAL PAAAKIP P+INL+AIP RPNPA +
Sbjct: 61 DRQNFYNSLRLVTVAQSKRDLTPEIVNAALNTPAAAKIPPPKINLSAIPAPRPNPAATTV 120
Query: 509 G 511
G
Sbjct: 121 G 121
Score = 47.0 bits (110), Expect = 4e-04
Identities = 39/147 (26%), Positives = 57/147 (38%), Gaps = 17/147 (11%)
Frame = +2
Query: 161 PPNMDQFESFFRRADLDGDGRISGAEAVSFFQGSNLSKQVLAQVWMYADQAKTGFLGRTE 340
P ++ ++ F D D DG+I+G +A + F L ++VL VW +DQ L E
Sbjct: 357 PSDVQKYTKVFMEVDSDKDGKITGEQARNLFLSWRLPREVLKHVWELSDQDNDTMLSLRE 416
Query: 341 FYNALRLVTVAQSKRDLTPDIVKAALFGPAAAKIP-APQINLAAI-----------PQXR 484
F +L L+ + R L + + +F I AP A P
Sbjct: 417 FCISLYLMERYREGRPLPTALPSSIMFDETLLSISGAPSHGYANAGWGSGQGFVQQPGMG 476
Query: 485 PNPAPPQMGVTAP-----PQMGVTTPP 550
P P G+ P PQ G PP
Sbjct: 477 ARPITPTTGMRPPVPAPGPQPGSGIPP 503
Score = 40.4 bits (93), Expect(2) = 1e-49
Identities = 44/147 (29%), Positives = 58/147 (38%), Gaps = 30/147 (20%)
Frame = +3
Query: 570 YRGQGLPGPVAANQQYFPSQQSQTMRPPQS--------------------------MPVG 671
+ G G P + NQ YFP QQ+Q MRP Q +PVG
Sbjct: 126 FGGPGAPNAIV-NQNYFPPQQNQQMRPNQGISGLTSLRPAAGPEYRPSALSGQFQPVPVG 184
Query: 672 TVPRPEQ----GLGGPNVAQGFNMAGHNVPNPGISSDWSSGRTGMPPARPAGITPSVGLQ 839
+V P Q + GP + FN+ G +S +SSG G A PS GL+
Sbjct: 185 SVTHPPQPVPTSVSGPG-SSTFNL-NSLYAGAGNTSGYSSGFGGGSLA-----APSPGLK 237
Query: 840 TSTPLSSVSQSQPGNTNARALAVSGNG 920
Q + + +AL VSGNG
Sbjct: 238 -----------QESHIDPKALVVSGNG 253
>ref|NP_173582.1| unknown protein; protein id: At1g21630.1 [Arabidopsis thaliana]
gi|25518104|pir||C86349 F8K7.4 protein - Arabidopsis
thaliana gi|5263313|gb|AAD41415.1|AC007727_4 Contains
similarity to gb|U07707 epidermal growth factor receptor
substrate (eps15) from Homo sapiens and contains 2
PF|00036 EF hand domains. ESTs gb|T44428 and
gb|AA395440 come from this gene. [Arabidopsis thaliana]
Length = 1181
Score = 165 bits (418), Expect(2) = 2e-44
Identities = 85/130 (65%), Positives = 101/130 (77%), Gaps = 5/130 (3%)
Frame = +2
Query: 173 DQFESFFRRADLDGDGRISGAEAVSFFQGSNLSKQVLAQVWMYADQAKTGFLGRTEFYNA 352
D F+++FRRADLDGDG ISGAEAV+FFQGSNL K VLAQVW YAD K G+LGR EFYNA
Sbjct: 11 DLFDTYFRRADLDGDGHISGAEAVAFFQGSNLPKHVLAQVWSYADSKKAGYLGRAEFYNA 70
Query: 353 LRLVTVAQSKRDLTPDIVKAALFGPAAAKIPAPQINLAAIPQXRPN---PAPPQMGVTAP 523
L+LVTVAQS+R+LT +IVKAA++ PA+A IPAP+INLAA P +P PA GVT+
Sbjct: 71 LKLVTVAQSRRELTAEIVKAAIYSPASANIPAPKINLAATPSPQPRGVLPATQAQGVTSM 130
Query: 524 PQM--GVTTP 547
P + GV P
Sbjct: 131 PSVAAGVRGP 140
Score = 47.4 bits (111), Expect = 3e-04
Identities = 34/130 (26%), Positives = 53/130 (40%)
Frame = +2
Query: 161 PPNMDQFESFFRRADLDGDGRISGAEAVSFFQGSNLSKQVLAQVWMYADQAKTGFLGRTE 340
P ++ ++ F + D D DG+I+G +A + F L + L QVW +DQ L E
Sbjct: 422 PADVQKYTKVFVQVDTDRDGKITGNQARNLFLSWRLPRDALKQVWDLSDQDNDSMLSLRE 481
Query: 341 FYNALRLVTVAQSKRDLTPDIVKAALFGPAAAKIPAPQINLAAIPQXRPNPAPPQMGVTA 520
F A+ L+ + R L P + + + P + P + PQ G
Sbjct: 482 FCIAVYLMERYREGRPLPPVFPSSIIHSESMFTSPGQSV----APHGNASWGHPQ-GFQQ 536
Query: 521 PPQMGVTTPP 550
P G PP
Sbjct: 537 QPHPGGLRPP 546
Score = 36.6 bits (83), Expect(2) = 2e-44
Identities = 46/167 (27%), Positives = 61/167 (35%), Gaps = 46/167 (27%)
Frame = +3
Query: 573 RGQGLPGPVA-ANQQYFPSQQSQTMRPPQSMPVGTVPRPEQGLGGPNVAQGFNMAGHNVP 749
RG + G V+ +NQQ P QQ+Q P S P GG N + N P
Sbjct: 138 RGPHMGGTVSTSNQQVVPGQQNQFTGIPPSQTQQNFQSPGMPAGGTNAPRPANQ-----P 192
Query: 750 NPGISSDWSSGRTGMPPAR-----PAG--------------------ITPSVGLQTST-P 851
P SDW SGR+ P P+ ITP+V T+T P
Sbjct: 193 MP---SDWLSGRSVGPSGNVNSQIPSSQSTYGLTAPNSTANHITKPHITPAVTSSTTTRP 249
Query: 852 LSSVSQSQPGNTNA-------------------RALAVSGNGYSSNS 935
S P ++A + LA SGNG++S+S
Sbjct: 250 QESAPVHNPQESSATFGSRVSNVPSNQLVPKDPKELAASGNGFTSDS 296
>ref|NP_566657.1| expressed protein; protein id: At3g20290.1, supported by cDNA:
gi_14334439 [Arabidopsis thaliana]
gi|14334440|gb|AAK59418.1| unknown protein [Arabidopsis
thaliana] gi|28394001|gb|AAO42408.1| unknown protein
[Arabidopsis thaliana]
Length = 545
Score = 68.6 bits (166), Expect = 1e-10
Identities = 30/76 (39%), Positives = 52/76 (67%)
Frame = +2
Query: 179 FESFFRRADLDGDGRISGAEAVSFFQGSNLSKQVLAQVWMYADQAKTGFLGRTEFYNALR 358
++ +F +D DGDGRI+G +A+ FF SNL + L Q+W AD + G+LG EF A++
Sbjct: 19 YKEWFEFSDSDGDGRITGNDAIKFFTMSNLPRPELKQIWAIADSKRQGYLGFKEFIVAMQ 78
Query: 359 LVTVAQSKRDLTPDIV 406
LV++AQ+ +++ +++
Sbjct: 79 LVSLAQTGHEISHEVL 94
>dbj|BAB02809.1| contains similarity to EH domain containing
proteins~gene_id:MQC12.3 [Arabidopsis thaliana]
Length = 524
Score = 68.6 bits (166), Expect = 1e-10
Identities = 30/76 (39%), Positives = 52/76 (67%)
Frame = +2
Query: 179 FESFFRRADLDGDGRISGAEAVSFFQGSNLSKQVLAQVWMYADQAKTGFLGRTEFYNALR 358
++ +F +D DGDGRI+G +A+ FF SNL + L Q+W AD + G+LG EF A++
Sbjct: 19 YKEWFEFSDSDGDGRITGNDAIKFFTMSNLPRPELKQIWAIADSKRQGYLGFKEFIVAMQ 78
Query: 359 LVTVAQSKRDLTPDIV 406
LV++AQ+ +++ +++
Sbjct: 79 LVSLAQTGHEISHEVL 94
>gb|EAA33988.1| hypothetical protein [Neurospora crassa]
Length = 1285
Score = 61.6 bits (148), Expect = 2e-08
Identities = 42/131 (32%), Positives = 60/131 (45%), Gaps = 9/131 (6%)
Frame = +2
Query: 161 PPNMDQFESFFRRADLDGDGRISGAEAVSFFQGSNLSKQVLAQVWMYADQAKTGFLGRTE 340
P + FR AD D G I+G AV FF+ + L +VL ++W AD+ GFL
Sbjct: 16 PEEKRVYGQLFRAADTDSVGVITGEVAVKFFERTKLDSRVLGEIWQIADKENRGFLTPAG 75
Query: 341 FYNALRLVTVAQSKRDLTPD-------IVKAALFGPAAAKIPAP-QINLAAIPQXRPNPA 496
F LRL+ AQ+ R+ +P+ I + F P A +P P A+P +P
Sbjct: 76 FGVVLRLIGHAQAGREPSPELALSQGPIPRFDGFTPTPAPVPVPGPAQSPAVPAAMVSPQ 135
Query: 497 PPQMG-VTAPP 526
G + PP
Sbjct: 136 ATGSGPIRIPP 146
Score = 58.2 bits (139), Expect = 2e-07
Identities = 38/122 (31%), Positives = 56/122 (45%), Gaps = 7/122 (5%)
Frame = +2
Query: 161 PPNMDQFESFFRRADLDGDGRISGAEAVSFFQGSNLSKQVLAQVWMYADQAKTGFLGRTE 340
P + +F+ + D G I+G EAV FF SNL++ VLAQ+W AD G L R E
Sbjct: 302 PADKARFDLLYEELDKQKKGFITGEEAVPFFSQSNLNEDVLAQIWDLADINSAGRLTRDE 361
Query: 341 FYNALRLVTVAQSK-------RDLTPDIVKAALFGPAAAKIPAPQINLAAIPQXRPNPAP 499
F A+ L+ ++K L P+++ ++ P PQ RP P
Sbjct: 362 FAVAMYLIREQRTKPGQVPLPTTLPPNLIPPSMRAPQG----RPQTAAGGFQPPRPQPPA 417
Query: 500 PQ 505
P+
Sbjct: 418 PK 419
Score = 57.4 bits (137), Expect = 3e-07
Identities = 43/132 (32%), Positives = 63/132 (47%), Gaps = 5/132 (3%)
Frame = +2
Query: 161 PPNMDQFESFFRRADLDGDGRISGAEAVSFFQGSNLSKQVLAQVWMYADQAKTGFLGRTE 340
P + Q+ + F R L + G +A F+ S LS ++L ++WM AD + G L TE
Sbjct: 149 PEKVAQYSALFERQPLLQGNMLPGEQAKQIFEKSGLSNEILGRIWMLADTEQRGALVLTE 208
Query: 341 FYNALRLVTVAQ--SKRDLTPDIVKAALFGPAAAKIPAPQIN--LAAIPQXRPNPAPP-Q 505
F A+ L+T + + R L P I+ AAL+ A + P IN P P PP
Sbjct: 209 FVIAMHLLTSMKTGALRGL-PTILPAALYEAATRRGPVGGINPPPGRSPTTATPPLPPAA 267
Query: 506 MGVTAPPQMGVT 541
+T P Q+ T
Sbjct: 268 RHLTGPAQLTQT 279
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 905,968,117
Number of Sequences: 1393205
Number of extensions: 22834669
Number of successful extensions: 86682
Number of sequences better than 10.0: 404
Number of HSP's better than 10.0 without gapping: 68545
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 84486
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 52137106016
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
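
Note on the statistics above: the bit scores and the bit equivalents of X1, X2, X3 and S1 printed in this report can be approximately reproduced from the Lambda and K values it lists. The following is a minimal sketch, not BLAST's own code. It assumes the standard Karlin-Altschul conversions, bits = (Lambda * S - ln K) / ln 2 for scores and bits = Lambda * X / ln 2 for X-dropoff values. The function names are hypothetical, and the pairing of each parameter with the ungapped or gapped constants is an assumption chosen because it matches the printed values; the HSP bit scores then land within about half a bit of what the report shows.

```python
import math

# Constants copied from the report footer.
UNGAPPED_LAMBDA, UNGAPPED_K = 0.318, 0.135
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410


def bit_score(raw_score, lam, k):
    """Convert a raw score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def x_drop_bits(raw_x, lam):
    """Convert an X-dropoff value to bits: lambda * X / ln 2."""
    return lam * raw_x / math.log(2)


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Database length minus one effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


if __name__ == "__main__":
    # Raw scores of three HSPs reported above (assumed to use gapped constants).
    for raw in (454, 166, 148):
        print(raw, round(bit_score(raw, GAPPED_LAMBDA, GAPPED_K), 1))

    # X1 and S1 are assumed to use the ungapped constants, X2 and X3 the gapped ones.
    print("X1", round(x_drop_bits(16, UNGAPPED_LAMBDA), 1))
    print("X2", round(x_drop_bits(38, GAPPED_LAMBDA), 1))
    print("X3", round(x_drop_bits(64, GAPPED_LAMBDA), 1))
    print("S1", round(bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K), 1))

    # Database size, sequence count and effective HSP length from the footer.
    print(effective_db_length(448_689_247, 1_393_205, 123))
```

Because Lambda and K are printed to only three significant digits, small differences from the report's own bit values are expected.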