
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147178.5 + phase: 0 /pseudo
(676 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At5g49530 unknown protein 265 8e-71
At2g24930 hypothetical protein 39 0.012
At5g01400 putative protein 37 0.047
At1g05240 putative peroxidase ATP12a 36 0.062
At1g05250 putative peroxidase ATP12a 36 0.062
At1g31410 unknown protein 35 0.18
At4g09290 putative protein 34 0.23
At5g25070 unknown protein 33 0.52
At5g52830 unknown protein 33 0.68
At5g52290 unknown protein 33 0.68
At1g17790 unknown protein (At1g17790) 33 0.68
At4g34440 serine/threonine protein kinase - like 32 0.89
At4g30160 putative villin 32 0.89
At1g72150 putative cytosolic factor protein 32 0.89
At5g24710 unknown protein 32 1.2
At4g39110 putative receptor-like protein kinase 32 1.2
At5g53930 unknown protein 32 1.5
At4g11200 putative protein 32 1.5
At5g25590 unknown protein (At5g25590) 31 2.0
At3g60380 putative protein 31 2.0
>At5g49530 unknown protein
Length = 689
Score = 265 bits (676), Expect = 8e-71
Identities = 163/374 (43%), Positives = 224/374 (59%), Gaps = 28/374 (7%)
Query: 1 MDFDDLDAPVK-PRVSRFAP-KSSKFKPK-KEEPSLVPKSEPPPLAS----AKTEPQEID 53
MDFDD D P + + RFAP ++ K KPK K EP+ +PPP + +KTE
Sbjct: 1 MDFDDDDKPKEVTKTRRFAPGRAGKSKPKPKPEPTADKPVQPPPQSQTESVSKTEHDVDA 60
Query: 54 LTAPTPVKHEP-NASVKTDAESKIEGKVDSRNVDVEMTEAVDGSREVNPMDEE------- 105
T V+ E N SVK + +SK++ + ++ TE ++ +++ P+ EE
Sbjct: 61 KFVGTKVETEVCNGSVKMEIDSKVD-----KEPEIMETELMEEDQQL-PLQEEKEEEEEE 114
Query: 106 -DTVVREIDVYFSPSIDNDTKLYVMQYPLRPSWRPYDLDEQCEEVRLKPGTSEVEVDLSI 164
D VVREIDV+F PSID +T+LYV+QYPLRPSWRPY++DE+CEEVR+ P TS+VE+DLS+
Sbjct: 115 DDVVVREIDVFFKPSIDANTQLYVLQYPLRPSWRPYEMDERCEEVRVNPSTSQVEIDLSM 174
Query: 165 DLESYNVEKATADKLKYTKQTLSTTWKPPPANGCAVGLLMGDKLHLHPIHAVVQLRPSRH 224
D+ S N + L TKQTL TTWK PP AVG+L GDKLHL+P+HAV QLRPS
Sbjct: 175 DVHSKNYDSNFG--LNMTKQTLKTTWKQPPTLDYAVGVLSGDKLHLNPVHAVAQLRPS-- 230
Query: 225 YLDSGGSEKKNVATSNNQNKQTTPSMEQKSDGAPSIEQKSDSNQCWVPLEYHGCKSDISS 284
+ S S+KK + + T + K S +QK + + WV L+YHG +S+ S
Sbjct: 231 -MQSLSSDKKKKQEESTEESVGTSKKQNKGVQQASTDQKPINEETWVSLKYHGLQSEYCS 289
Query: 285 RYFQQMVTHESSPINFEMSTYDYIATLCPGVSS-NTLAKGPSKRYLLSLPVEKRLQTLLI 343
RY M+ + +S I+F MS YI LC G SS N+ +K KR LLSLP+++R+Q LL
Sbjct: 290 RYLNGMMANGNSSIDFNMSPGTYINELCRGGSSRNSESKETLKRVLLSLPLKERVQKLLC 349
Query: 344 ESNNRTLYDRALHF 357
E + Y H+
Sbjct: 350 EGSPLIRYSVLKHY 363
Score = 96.7 bits (239), Expect = 4e-20
Identities = 53/114 (46%), Positives = 73/114 (63%), Gaps = 4/114 (3%)
Query: 563 VCSFQLICQELRGLALAQTMLPKGGSKMSVDAANSLDVPQDELKAILGEVACDIHGCYVL 622
VC ++ ICQ LR LA++ + PK S M+V+ A ++D Q EL+ ++ VA +IHG YV
Sbjct: 558 VCRYETICQGLRDLAVSTSNNPKADSGMAVNVALAVDAYQGELEDVINGVATNIHGSYVS 617
Query: 623 KSAQNDP----FRDVVIDMLRGSGPNGKLKKAEILEAARRKLGRDVPNNEYIKV 672
S+ + P R+VVI +L GS P KL KAE+ A R KL R++ NNEYIKV
Sbjct: 618 ISSPDHPEYDSLREVVISLLTGSPPGTKLMKAEVFAAGRTKLEREITNNEYIKV 671
>At2g24930 hypothetical protein
Length = 926
Score = 38.5 bits (88), Expect = 0.012
Identities = 31/111 (27%), Positives = 53/111 (46%), Gaps = 8/111 (7%)
Query: 1 MDFDDLDAPVKPRVSRFAPKSSKFKPKKEEPSLVPKSEPPPLASAK-----TEPQEIDLT 55
+DF + +P+ R+ P S K+E +L + PP + +P+++ T
Sbjct: 506 VDFVETVSPLNRRLRSRKPPSPNVPKHKKEKTLNELIQKPPTRRGRGRKPLEQPKKVPPT 565
Query: 56 APTPVKHEPNASVKTDA---ESKIEGKVDSRNVDVEMTEAVDGSREVNPMD 103
A ++P S +T + E+K + ++DS VDV A DGS +NP D
Sbjct: 566 ALKIRINKPKPSEETKSKAEEAKEKAELDSDVVDVTDKLAPDGSSLINPED 616
>At5g01400 putative protein
Length = 1467
Score = 36.6 bits (83), Expect = 0.047
Identities = 24/89 (26%), Positives = 45/89 (49%), Gaps = 10/89 (11%)
Query: 28 KEEPSLVPKSEPPPLASAKTE----PQEIDLTAPTPVKH----EPNASVKT--DAESKIE 77
+ + S + +S+ P+ + +++ PQ D +AP P H +P AS +T D + KI+
Sbjct: 1378 ESQSSPIGQSQSSPIGTGQSDMSQTPQVSDSSAPEPTSHTRTSDPQASSQTLRDDDEKID 1437
Query: 78 GKVDSRNVDVEMTEAVDGSREVNPMDEED 106
S N E+ ++ + S E +EE+
Sbjct: 1438 DTATSENEVTEIEKSKESSEEEEEEEEEE 1466
Score = 29.3 bits (64), Expect = 7.5
Identities = 22/69 (31%), Positives = 32/69 (45%), Gaps = 6/69 (8%)
Query: 232 EKKNVATSNNQNKQTTPSMEQKSDGAPSIEQKSDSNQCWVPLEYHGCKSDISSRYFQQMV 291
+ + TS Q +T S EQ+ A +Q S S Q VPL + S + + Q+V
Sbjct: 1245 DSQGTQTSQVQANETQTSQEQQQQQASEPQQTSQSQQVSVPLSH----SQVDHQEPSQVV 1300
Query: 292 T--HESSPI 298
+SSPI
Sbjct: 1301 ASQSQSSPI 1309
>At1g05240 putative peroxidase ATP12a
Length = 325
Score = 36.2 bits (82), Expect = 0.062
Identities = 23/56 (41%), Positives = 30/56 (53%), Gaps = 11/56 (19%)
Query: 614 CDIHGC---YVLKSAQNDPFRDVVIDMLRGSGPNGKLKKAEILEAARRKLGRDVPN 666
C + GC +LKSA+ND RD V PN LK E+++AA+ L R PN
Sbjct: 68 CFVRGCDGSVLLKSAKNDAERDAV--------PNLTLKGYEVVDAAKTALERKCPN 115
>At1g05250 putative peroxidase ATP12a
Length = 325
Score = 36.2 bits (82), Expect = 0.062
Identities = 23/56 (41%), Positives = 30/56 (53%), Gaps = 11/56 (19%)
Query: 614 CDIHGC---YVLKSAQNDPFRDVVIDMLRGSGPNGKLKKAEILEAARRKLGRDVPN 666
C + GC +LKSA+ND RD V PN LK E+++AA+ L R PN
Sbjct: 68 CFVRGCDGSVLLKSAKNDAERDAV--------PNLTLKGYEVVDAAKTALERKCPN 115
>At1g31410 unknown protein
Length = 524
Score = 34.7 bits (78), Expect = 0.18
Identities = 27/106 (25%), Positives = 47/106 (43%), Gaps = 6/106 (5%)
Query: 90 TEAVDGSREVNPMDEEDTVVREIDVYFSP--SIDNDTKLYVMQYPLRPSWRPYDLDEQCE 147
TE V G +N +++E+ + + + Y S S+ KL ++ + PSW + Q +
Sbjct: 83 TETVAGEENLNEVEDEEKLEADFEAYKSKVYSLTVPLKLVALRGSVPPSWIKEFMSSQGK 142
Query: 148 EVRLKP---GTSEVEVDLSIDLESYNVEKATADKLKYTKQTLSTTW 190
VRLK G E E+ + S N +K A ++ +W
Sbjct: 143 RVRLKTQFRGNLE-EIFFDLSKPSKNGKKGAASTAAADMISIGDSW 187
>At4g09290 putative protein
Length = 376
Score = 34.3 bits (77), Expect = 0.23
Identities = 24/93 (25%), Positives = 45/93 (47%), Gaps = 3/93 (3%)
Query: 4 DDLDAPVKPRVSRFAPKSSKFKPKKEEPSLVPKSEPPPLASAKTEPQEIDLTAPTPVKHE 63
D ++A + + R F K+++ + P+++ P K + +T+P P K
Sbjct: 147 DQVEARIDAKFERRVVIREMFLTKEQQGNPKPQTQQTPKGPRKPTNNQ-SVTSPPPSKEV 205
Query: 64 PNASVKTDAESKIEGKVDSRNVDVEMTEAVDGS 96
++ V+ DA+ G SRN+D E +A DG+
Sbjct: 206 LDSVVEKDADFAEVGV--SRNLDGEFAKATDGN 236
>At5g25070 unknown protein
Length = 736
Score = 33.1 bits (74), Expect = 0.52
Identities = 23/87 (26%), Positives = 34/87 (38%), Gaps = 5/87 (5%)
Query: 25 KPKKEEPSLVPKSEPPPLASAKTEPQEIDLTAPTPVKHEPNASVKTDAESKIEGKVDSRN 84
+P E +L+ + P + A E DLT TPV+H+P + K R
Sbjct: 51 QPDVTEATLITATSQPDITEALDENLFSDLTIVTPVQHQPEPMEAVITTHQSPAKNYGRQ 110
Query: 85 VDVEMTEAVD-----GSREVNPMDEED 106
V A G E N +DE++
Sbjct: 111 VSRRKKRAAGLRIGYGRHETNNLDEDE 137
>At5g52830 unknown protein
Length = 348
Score = 32.7 bits (73), Expect = 0.68
Identities = 33/129 (25%), Positives = 53/129 (40%), Gaps = 16/129 (12%)
Query: 12 PRVSRFAPKSSKFKPKKEEPSLVPKSEPPPLASAKTEPQEIDLTAPTPVKHEPNASVKTD 71
PR + + + K + + VPK + PL+ T +EI L+ TP+K D
Sbjct: 223 PRPTHRNSLAGSTRNKSQPVNPVPKPDTSPLSD--TVKEEIHLSPTTPLK-------GND 273
Query: 72 AESKIEGKVDSRNVDVEMTEAVDGSREVNPMDEEDTVVREIDVYFSPSI---DNDTKLYV 128
+ G D +V M E + EV DEE+ ++D P++ D D +
Sbjct: 274 DVQETNGDEDMVGQEVNMEEE-EEEEEVEEDDEEEEDDDDVDDLLIPNLAVRDRDDLFFA 332
Query: 129 MQYPLRPSW 137
+ PSW
Sbjct: 333 GSF---PSW 338
>At5g52290 unknown protein
Length = 1569
Score = 32.7 bits (73), Expect = 0.68
Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 5/81 (6%)
Query: 63 EPNASVKTDAESKIEGKVDSRNVDVEMTEAVDGSREVNPMDEEDTVVREIDVYFSPSID- 121
E + SV TD+ S + DS D GS++ + E+D + + V+FSPSI+
Sbjct: 1122 EDSRSVMTDSSSSVSSGPDS---DTHHVSVHSGSKKKQYIAEKDEIDMDDLVHFSPSIEF 1178
Query: 122 NDTKLYVM-QYPLRPSWRPYD 141
DT+L + L SW D
Sbjct: 1179 ADTQLKSSGDFQLDDSWSSKD 1199
>At1g17790 unknown protein (At1g17790)
Length = 487
Score = 32.7 bits (73), Expect = 0.68
Identities = 35/154 (22%), Positives = 63/154 (40%), Gaps = 15/154 (9%)
Query: 1 MDFDDLDAPVKP-RVSRFAPKSSKFKPKKE---------EPSLVPKSEPPPLASAKTEPQ 50
M +D+L KP R F + P E PS P PPP+A+ E +
Sbjct: 235 MQYDNLHRKFKPTRDIEFPAPAPSIAPIVEPLPAIVPSPSPSSPPPPPPPPVAAPVLENR 294
Query: 51 EIDLTAPTPVKHEPNASVKT-DAESKIEGKVDSRNVDVEMTEAVDGSREVNPMDEEDTVV 109
+ + EP A + + + E V++R++ +E + + P D+ +TVV
Sbjct: 295 TWEREESMTIPVEPEAVITAPEKAEEEEAPVNNRDLTLEEKRRLSEELQDLPYDKLETVV 354
Query: 110 REIDVYFSPSI---DNDTKLYVMQYPLRPSWRPY 140
+I +P + D++ +L + + W Y
Sbjct: 355 -QIIKKSNPELSQKDDEIELDIDSLDINTLWELY 387
>At4g34440 serine/threonine protein kinase - like
Length = 670
Score = 32.3 bits (72), Expect = 0.89
Identities = 23/99 (23%), Positives = 35/99 (35%), Gaps = 4/99 (4%)
Query: 6 LDAPVKPRVSRFAPKSSKFKPKKEEPSLVPKSEPPPLASAKTEPQEIDLT---APTPVKH 62
+D+ P S P S+ P E P S PPP +S P +I + P P
Sbjct: 6 VDSSPAPETSNGTPPSNGTSPSNESSPPTPPSSPPP-SSISAPPPDISASFSPPPAPPTQ 64
Query: 63 EPNASVKTDAESKIEGKVDSRNVDVEMTEAVDGSREVNP 101
E + + + + + A +GS V P
Sbjct: 65 ETSPPTSPSSSPPVVANPSPQTPENPSPPAPEGSTPVTP 103
>At4g30160 putative villin
Length = 974
Score = 32.3 bits (72), Expect = 0.89
Identities = 29/114 (25%), Positives = 46/114 (39%), Gaps = 9/114 (7%)
Query: 10 VKPRVSRFAP--KSSKFKPKKEEPSLVPKSEPPPLASAKTEPQEIDLTAPTPVKHEPNAS 67
V P S+FAP KSS + +P EP K P+ + AP E
Sbjct: 820 VTPDSSKFAPAPKSSAIASRSALFEKIPPQEPSIPKPVKASPKTPESPAPESNSKEQEEK 879
Query: 68 VKTD-AESKIEGKVDSRNVDVEMTEAVDGSREV--NPMDEEDTV----VREIDV 114
+ D E + +++S + + E V+ ++ +P D T V +IDV
Sbjct: 880 KENDKEEGSMSSRIESLTIQEDAKEGVEDEEDLPAHPYDRLKTTSTDPVSDIDV 933
>At1g72150 putative cytosolic factor protein
Length = 573
Score = 32.3 bits (72), Expect = 0.89
Identities = 27/112 (24%), Positives = 46/112 (40%), Gaps = 15/112 (13%)
Query: 10 VKPRVSRFAPKSSKFKPKKEEPSLVPKSEPPPLASAKTEPQEID--------------LT 55
V P A ++ K KEE ++V + L + + + + ++ T
Sbjct: 55 VTPEKEAPAAEAEKSVSVKEEETVVVAEKVVVLTAEEVQKKALEEFKELVREALNKREFT 114
Query: 56 AP-TPVKHEPNASVKTDAESKIEGKVDSRNVDVEMTEAVDGSREVNPMDEED 106
AP TPVK E KT+ E+K E K + + + V+ + P EE+
Sbjct: 115 APVTPVKEEKTEEKKTEEETKEEEKTEEKKEETTTEVKVEEEKPAVPAAEEE 166
>At5g24710 unknown protein
Length = 1003
Score = 32.0 bits (71), Expect = 1.2
Identities = 23/86 (26%), Positives = 32/86 (36%), Gaps = 6/86 (6%)
Query: 6 LDAPVKPRVSRFAP--KSSKFKPKKEEPSLVPKSEPPP----LASAKTEPQEIDLTAPTP 59
+D PV VS P K +K +PS P +E + T P+ + P P
Sbjct: 900 VDGPVTETVSEPPPVEKEETSLEEKSDPSSTPNTETATSTENTSQTTTTPESVTTAPPEP 959
Query: 60 VKHEPNASVKTDAESKIEGKVDSRNV 85
+ P +V T A E R V
Sbjct: 960 ITTAPPETVTTTAVKPTENAATERRV 985
>At4g39110 putative receptor-like protein kinase
Length = 878
Score = 32.0 bits (71), Expect = 1.2
Identities = 18/57 (31%), Positives = 25/57 (43%), Gaps = 2/57 (3%)
Query: 18 APKSSKFKPKKEEPSLVPKSEPPPLASAKTEPQEIDLTAPTPVKHEPNASVKTDAES 74
A ++ KP P VP S+P P+ + T + T P P K E N+ D S
Sbjct: 810 AEETENAKPDVVTPGSVPVSDPSPITPSVTTNEA--ATVPVPAKVEENSGTAVDEHS 864
>At5g53930 unknown protein
Length = 529
Score = 31.6 bits (70), Expect = 1.5
Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 4/55 (7%)
Query: 4 DDLDAPVKPR----VSRFAPKSSKFKPKKEEPSLVPKSEPPPLASAKTEPQEIDL 54
DDL+A +K R + RF + K K+E S V + EP + S K E Q+ DL
Sbjct: 275 DDLEAILKKRALENLKRFRGVTQKSGIAKKEVSSVSEGEPMQIESEKVESQDHDL 329
>At4g11200 putative protein
Length = 462
Score = 31.6 bits (70), Expect = 1.5
Identities = 29/89 (32%), Positives = 41/89 (45%), Gaps = 12/89 (13%)
Query: 65 NASVKTDA--ESKIEGKV-DSRNVDVEMTEAVDGSREVNPMDEEDTVVR--EIDVYFSPS 119
NAS K + E+ E +V D +N D ++E+ D E +DEED V+ EI V+
Sbjct: 246 NASKKRKSVIEAAAEEEVSDDQNSDARLSESPDSELEAEIVDEEDVNVKAEEIQVF---- 301
Query: 120 IDNDTKLYVMQYPLRPSWRPYDLDEQCEE 148
D + Y Q P P DE +E
Sbjct: 302 ---DIRNYEEQIPYEDEEYPTTDDESGDE 327
>At5g25590 unknown protein (At5g25590)
Length = 775
Score = 31.2 bits (69), Expect = 2.0
Identities = 19/81 (23%), Positives = 36/81 (43%), Gaps = 4/81 (4%)
Query: 30 EPSLVPKSEPPPLASAKTEPQEIDLTAPTPVKHE---PNASVKTDAESKIEG-KVDSRNV 85
+P+ PPP+ + P + +P+P+K P+ +V+ ++G ++
Sbjct: 84 DPASPQPPPPPPIENLPPPPPPLPKFSPSPIKRAISLPSMAVRGRKVQTLDGMAIEEEEE 143
Query: 86 DVEMTEAVDGSREVNPMDEED 106
D E E V GS +EE+
Sbjct: 144 DEEEEEEVKGSGRDTAQEEEE 164
>At3g60380 putative protein
Length = 743
Score = 31.2 bits (69), Expect = 2.0
Identities = 21/78 (26%), Positives = 38/78 (47%), Gaps = 5/78 (6%)
Query: 25 KPKKEEPSLVPKSEPPPLASAKTEPQEIDLTAPTPVKHEPNASVKTDAESKIEGKV---- 80
+P++EE S+V E AK+EP+E+ + P + +P + + + E+ E +
Sbjct: 640 RPRQEELSIVLHQEKSSETRAKSEPEEVAMEEP-QAEQQPEVTFEEEEEAAWESQSNASH 698
Query: 81 DSRNVDVEMTEAVDGSRE 98
D VD + E + RE
Sbjct: 699 DHNEVDRKAGEFIAKFRE 716
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.327 0.140 0.425
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,829,970
Number of Sequences: 26719
Number of extensions: 627176
Number of successful extensions: 3143
Number of sequences better than 10.0: 47
Number of HSP's better than 10.0 without gapping: 8
Number of HSP's successfully gapped in prelim test: 39
Number of HSP's that attempted gapping in prelim test: 3023
Number of HSP's gapped (non-prelim): 116
length of query: 676
length of database: 11,318,596
effective HSP length: 106
effective length of query: 570
effective length of database: 8,486,382
effective search space: 4837237740
effective search space used: 4837237740
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 63 (28.9 bits)
Medicago: description of AC147178.5