
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144484.9 + phase: 0
(177 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At3g52300 unknown protein 290 4e-79
At3g04340 unknown protein 38 0.003
At5g15020 transcriptional regulatory - like protein 33 0.061
At4g04440 putative transposon protein 33 0.080
At3g53820 putative C2H2-type zinc finger protein 33 0.080
At5g52280 hyaluronan mediated motility receptor-like protein 33 0.10
At4g17000 hypothetical protein 32 0.14
At3g54670 structural maintenance of chromosomes (SMC) - like pro... 32 0.18
At1g44910 splicing factor like protein 32 0.18
At4g31340 unknown protein 32 0.23
At2g22560 hypothetical protein 31 0.30
At5g20720 chloroplast Cpn21 protein 30 0.52
At1g67230 unknown protein 30 0.52
At4g04400 hypothetical protein 30 0.68
At3g45380 putative protein 30 0.68
At1g47300 hypothetical protein 30 0.68
At4g05190 kinesin - like protein 30 0.88
At3g28770 hypothetical protein 30 0.88
At3g20520 Glycerophosphodiesterase-like (GPDL4) 30 0.88
At3g17520 unknown protein 30 0.88
>At3g52300 unknown protein
Length = 168
Score = 290 bits (741), Expect = 4e-79
Identities = 137/168 (81%), Positives = 153/168 (90%)
Query: 1 MSGTVKKVTDVAFKAGKKIDWDGMAKLLVTDEARREFFNLRRAFDEVNTQLETKFSQEPE 60
MSG KK+ DVAFKA + IDWDGMAK+LVTDEARREF NLRRAFDEVNTQL+TKFSQEPE
Sbjct: 1 MSGAGKKIADVAFKASRTIDWDGMAKVLVTDEARREFSNLRRAFDEVNTQLQTKFSQEPE 60
Query: 61 PIDWEYYRKGIGTRLVDMYKQHYESIEIPKFVDTVTPQYKPKFDALLVELKEAEEKSLKE 120
PIDW+YYRKGIG +VD YK+ Y+SIEIPK+VD VTP+YKPKFDALLVELKEAE+KSLKE
Sbjct: 61 PIDWDYYRKGIGAGIVDKYKEAYDSIEIPKYVDKVTPEYKPKFDALLVELKEAEQKSLKE 120
Query: 121 SERLEKEIVNVQSLKKRISTMTADEYFAEHPELKKKFDDEIRNDNWGY 168
SERLEKEI +VQ + K++STMTADEYF +HPELKKKFDDEIRNDNWGY
Sbjct: 121 SERLEKEIADVQEISKKLSTMTADEYFEKHPELKKKFDDEIRNDNWGY 168
>At3g04340 unknown protein
Length = 1320
Score = 37.7 bits (86), Expect = 0.003
Identities = 34/146 (23%), Positives = 69/146 (46%), Gaps = 11/146 (7%)
Query: 5 VKKVTDVAFKAGKKIDWDGMAKLLVTDEARREFFNLRRAFDEVNTQLETK-FSQEPEPID 63
+ ++ D+ K G+ +L+ + RE L ++F N +L K F PE
Sbjct: 285 IDEIDDIILKKETTTLSFGVRELIFIE---RECVELVKSF---NRELNQKSFESVPESSI 338
Query: 64 WEYYRKGIGTRLVDMYKQHYESIEIPKFVDTVTPQYKPKFDALLVELKEAEEKSLKESER 123
+ R I LV+ ++H E + +P ++ + P FD V+ +K L+ES++
Sbjct: 339 TKLSRSEIKQELVNAQRKHLEQMILPNVLE--LEEVDPFFDRDSVDFSLRIKKRLEESKK 396
Query: 124 LEKEIVNVQSLKKRISTMTADEYFAE 149
L++++ N ++KR+ ++ F +
Sbjct: 397 LQRDLQN--RIRKRMKKFGEEKLFVQ 420
>At5g15020 transcriptional regulatory - like protein
Length = 1343
Score = 33.5 bits (75), Expect = 0.061
Identities = 21/71 (29%), Positives = 35/71 (48%), Gaps = 3/71 (4%)
Query: 53 TKFSQEPEPIDWEYYRKGIGTRLVDMY-KQHYESIEIPKFVDTVTPQYKPKFDALLVELK 111
T+ Q+ + +W R+G D+Y K HY+S++ F AL+ E+K
Sbjct: 605 TRLKQKQD--EWTKCREGFNVVWADVYAKNHYKSLDHRSFYFKQQDSKNLSAKALVSEVK 662
Query: 112 EAEEKSLKESE 122
+ +EKS KE +
Sbjct: 663 DLKEKSQKEDD 673
>At4g04440 putative transposon protein
Length = 570
Score = 33.1 bits (74), Expect = 0.080
Identities = 33/161 (20%), Positives = 69/161 (42%), Gaps = 16/161 (9%)
Query: 5 VKKVTDVAFKAGKKIDWDGMAKLLVTD--------EARREFFNLRRAFDEVNTQLETKFS 56
VK +T+ + G+ + G+ +L V+D E F +EV ++ + F
Sbjct: 115 VKSITETEIEKGESSNLGGVIELDVSDSEPEYFEPEGEAPFAFALYTTEEVESEEPSYFH 174
Query: 57 QEPEPIDWEYYRKGIGTRLVDMYKQHYESIEIPKFVDTVTPQYKPKFDALLVELKEAEEK 116
+ E +WE + G+ + ++K I VD P+ + ++ A LV + K
Sbjct: 175 EATEDKNWEKWNGGMNDEMNSLWKNFTWDI-----VD--RPKDQKRYKARLVARGFTQRK 227
Query: 117 SLKESERLEKEIVNVQSLKKRISTMTADEYFAEHPELKKKF 157
+ + + + +V S++ +S + D+ E ++K F
Sbjct: 228 GI-DYDEVFAPVVKHTSIRVLMSIVAQDDLELEQMDVKTAF 267
>At3g53820 putative C2H2-type zinc finger protein
Length = 142
Score = 33.1 bits (74), Expect = 0.080
Identities = 29/123 (23%), Positives = 54/123 (43%), Gaps = 3/123 (2%)
Query: 17 KKIDWDG-MAKLLVTDEARREFFNLRRAFDEVNTQLETKFSQEPEPIDWEYYRKGIGTRL 75
+ W G A+ + + R F N + +N + + + E I T
Sbjct: 9 RSYSWSGSQARPYICEFCERGFSNAQALGGHMNIHRKDRAKLRQANLKEEDSEDAICTTS 68
Query: 76 VDMYKQHYESIEIPKFVDTVTPQYKPKFDALLVELKEAEEKSLKESERLEKEIVNVQSLK 135
+ + Q E IE+P FVDTV P+ K + D L + E+K+++ ++ + +V L+
Sbjct: 69 RNRFGQ--ELIELPFFVDTVGPRRKGEDDKSEKGLGDEEKKNMRILQKALSQSADVIDLE 126
Query: 136 KRI 138
R+
Sbjct: 127 LRL 129
>At5g52280 hyaluronan mediated motility receptor-like protein
Length = 853
Score = 32.7 bits (73), Expect = 0.10
Identities = 31/126 (24%), Positives = 62/126 (48%), Gaps = 9/126 (7%)
Query: 42 RAFDEVNTQLETKFSQEPEPIDWEY--YRKGIGTR--LVDMYKQHYESIEIPKFVDT--- 94
+ D N +++T Q+ E +DWE Y+K + L+D Q YES++ + +
Sbjct: 417 KGMDSGNNEIDT-LKQQIEDLDWELDSYKKKNEEQEILLDELTQEYESLKEENYKNVSSK 475
Query: 95 VTPQYKPKFDALLVELKEAEEKSLKESERLEKEIVNVQSLKKRISTMTADEYFAEHPELK 154
+ Q + ++ K+ ++ + E LE ++ QSL+ +T +E ++ ELK
Sbjct: 476 LEQQECSNAEDEYLDSKDIIDELKSQIEILEGKLKQ-QSLEYSECLITVNELESQVKELK 534
Query: 155 KKFDDE 160
K+ +D+
Sbjct: 535 KELEDQ 540
>At4g17000 hypothetical protein
Length = 674
Score = 32.3 bits (72), Expect = 0.14
Identities = 14/32 (43%), Positives = 21/32 (64%), Gaps = 2/32 (6%)
Query: 75 LVDMYKQH--YESIEIPKFVDTVTPQYKPKFD 104
+VD + + YE+IE PKFVD P ++P+ D
Sbjct: 8 IVDEFNEEDFYETIEAPKFVDLTAPDHRPEGD 39
>At3g54670 structural maintenance of chromosomes (SMC) - like
protein
Length = 1265
Score = 32.0 bits (71), Expect = 0.18
Identities = 15/55 (27%), Positives = 30/55 (54%)
Query: 92 VDTVTPQYKPKFDALLVELKEAEEKSLKESERLEKEIVNVQSLKKRISTMTADEY 146
++T + +YK + +L EL+ +EK + E K + L+ ++S +TA+ Y
Sbjct: 452 IETSSSKYKNETTSLKTELRALQEKHVNAREASAKLKTRIAELEDQLSDLTAERY 506
>At1g44910 splicing factor like protein
Length = 958
Score = 32.0 bits (71), Expect = 0.18
Identities = 26/102 (25%), Positives = 47/102 (45%), Gaps = 11/102 (10%)
Query: 44 FDEVNTQLETKFSQEPEPIDWEYYRKGIGTRLVDMYKQH-YESIEIPKFVDTVTPQY--- 99
F++V +LE ++ + D Y + + +R + M +E + D T Q
Sbjct: 668 FEDVTEELEKQYHE-----DKSYVKDAMKSRKISMVSSWLFEDFKSAISEDLSTQQISDI 722
Query: 100 --KPKFDALLVELKEAEEKSLKESERLEKEIVNVQSLKKRIS 139
K +D L+ +KE EEK ++ +RL +E N+ K I+
Sbjct: 723 NLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHTFKEIT 764
>At4g31340 unknown protein
Length = 437
Score = 31.6 bits (70), Expect = 0.23
Identities = 23/83 (27%), Positives = 36/83 (42%), Gaps = 14/83 (16%)
Query: 82 HYESIE---IPKFVDTVTPQYKPKFDALLVELKEAEEKSLKESERLEKEIVNVQSLKKRI 138
H E+++ IP +TVT +P F L ++ KEA S + IV VQ
Sbjct: 254 HVENVKTKYIPAIKETVTIHVEPHFRTLSIKAKEAYHSS---KSAVSPHIVTVQEF---- 306
Query: 139 STMTADEYFAEHPELKKKFDDEI 161
D Y+ E + K + D++
Sbjct: 307 ----VDPYYQEAKKFSKPYVDQV 325
>At2g22560 hypothetical protein
Length = 891
Score = 31.2 bits (69), Expect = 0.30
Identities = 35/141 (24%), Positives = 64/141 (44%), Gaps = 23/141 (16%)
Query: 31 DEARREFFNLRRAFDEVNTQLETKFSQEPEPIDWEYYRKGIGT---RLVDMYKQHYESIE 87
D+ ++E L+ + V + E S+ WE+ KGI R+ + + ES+
Sbjct: 128 DKLQKEILALQTEKEFVKSSYEIGLSKY-----WEF-EKGIKEKQERICGLQDEFGESVA 181
Query: 88 IPK------FVDTVTPQYKPKFDALLVELKEAEEKSLKESERLEKEIVNVQSLKKRISTM 141
I +T + K LVEL+E +EKS +E+ +E V ++ K+++ +M
Sbjct: 182 IEDEEARRLMTETAIKSCQEK----LVELQEKQEKSYEEA---REEHVKIKESKEKLRSM 234
Query: 142 TADEYFAEHPELKKKFDDEIR 162
A ++ + K DE+R
Sbjct: 235 -ASQFLGDESVFAKDDGDEVR 254
>At5g20720 chloroplast Cpn21 protein
Length = 253
Score = 30.4 bits (67), Expect = 0.52
Identities = 16/34 (47%), Positives = 22/34 (64%), Gaps = 3/34 (8%)
Query: 88 IPKFVDTVTPQY---KPKFDALLVELKEAEEKSL 118
+ K V P+Y KP D +LV++KEAEEK+L
Sbjct: 48 VVKAASVVAPKYTSIKPLGDRVLVKIKEAEEKTL 81
>At1g67230 unknown protein
Length = 1132
Score = 30.4 bits (67), Expect = 0.52
Identities = 19/69 (27%), Positives = 38/69 (54%), Gaps = 5/69 (7%)
Query: 95 VTPQYKPKFDALLVELKEAEEKSLKESERLEKEIVNVQSLKKRISTMTADEYFAEHPELK 154
VT + + ++ L ELKE EK + E L+KE ++++ ++ + ++ + E E K
Sbjct: 478 VTEEERSEYLRLQTELKEQIEKCRSQQELLQKEAEDLKAQRE-----SFEKEWEELDERK 532
Query: 155 KKFDDEIRN 163
K +E++N
Sbjct: 533 AKIGNELKN 541
>At4g04400 hypothetical protein
Length = 735
Score = 30.0 bits (66), Expect = 0.68
Identities = 22/67 (32%), Positives = 37/67 (54%), Gaps = 4/67 (5%)
Query: 100 KPKFDALLVELKEAEEKSLKESERLEKEIVNVQSLKKRISTMTADEYFAEHPE--LKKKF 157
KP DA++ + +EA++ + +E E E+E+VN + + D Y AE E +K+
Sbjct: 123 KPLIDAVIQQFEEAKDGNGEELE--EEEMVNTIPKEGVENVNRQDTYEAEDDEVSVKETE 180
Query: 158 DDEIRND 164
+DE ND
Sbjct: 181 EDEAGND 187
>At3g45380 putative protein
Length = 690
Score = 30.0 bits (66), Expect = 0.68
Identities = 22/67 (32%), Positives = 37/67 (54%), Gaps = 4/67 (5%)
Query: 100 KPKFDALLVELKEAEEKSLKESERLEKEIVNVQSLKKRISTMTADEYFAEHPE--LKKKF 157
KP DA++ + +EA++ + +E E E+E+VN + + D Y AE E +K+
Sbjct: 123 KPLIDAVIQQFEEAKDGNGEELE--EEEMVNTIPKEGVENVNRQDTYEAEDDEVSVKETE 180
Query: 158 DDEIRND 164
+DE ND
Sbjct: 181 EDEAGND 187
>At1g47300 hypothetical protein
Length = 306
Score = 30.0 bits (66), Expect = 0.68
Identities = 20/81 (24%), Positives = 44/81 (53%), Gaps = 11/81 (13%)
Query: 63 DWEYYRKGIGTRLVDMYKQHYESIEI--PKFVDTVTPQYKPKFDALLVELKEAEEKSLKE 120
+++YYR +R+V ++ H E +++ PK +++ + + + E +E EE+ +E
Sbjct: 212 EYDYYR----SRVVYVFADHVEDLDVNDPKLLESKEEEEEEE-----EEEEEEEEEEEEE 262
Query: 121 SERLEKEIVNVQSLKKRISTM 141
E E+E + +K+I T+
Sbjct: 263 EEEEEEEESKEREKEKKIETV 283
>At4g05190 kinesin - like protein
Length = 777
Score = 29.6 bits (65), Expect = 0.88
Identities = 31/156 (19%), Positives = 64/156 (40%), Gaps = 11/156 (7%)
Query: 1 MSGTVKKVTDVAFKAGKKIDWDGMAKLLVTDEARREFFNLRRAFDEVNTQLETKFSQEPE 60
+ T+ ++ + +K+ + ++KL + RRE + R +++ L + + E
Sbjct: 132 LRATITEMKENIESLQEKLSKEKLSKLDAIENHRREK-DCRVVAEKLQVSLREELDKVKE 190
Query: 61 PIDWEYYRKGIGTRLVDMYKQHYESIEIPKFVDTVTPQYKPKFDALLVELKEAEEKSLKE 120
+ K T L DMYK+ E +T QY K L +EA ++ KE
Sbjct: 191 E---KMAAKQKVTSLEDMYKRLQE-------YNTSLQQYNTKLQTDLEVAREAHTRAEKE 240
Query: 121 SERLEKEIVNVQSLKKRISTMTADEYFAEHPELKKK 156
+ + + ++ K + A ++ +K+K
Sbjct: 241 KSSILENLTTLRGHSKSLQDQLASSRVSQDEAVKQK 276
>At3g28770 hypothetical protein
Length = 2081
Score = 29.6 bits (65), Expect = 0.88
Identities = 14/32 (43%), Positives = 22/32 (68%)
Query: 109 ELKEAEEKSLKESERLEKEIVNVQSLKKRIST 140
E+KE+EEK LK++E K+ +V+ KK+ T
Sbjct: 1196 EMKESEEKKLKKNEEDRKKQTSVEENKKQKET 1227
>At3g20520 Glycerophosphodiesterase-like (GPDL4)
Length = 729
Score = 29.6 bits (65), Expect = 0.88
Identities = 24/78 (30%), Positives = 34/78 (42%), Gaps = 4/78 (5%)
Query: 86 IEIPKFVDTVTPQYKPKFDALLVELKEAEEKSLKESERLEKEIVNVQSLKKRISTMTADE 145
IE+ V Q DA+L ELK A + K S R + +QS K + ++
Sbjct: 475 IEVENAAYLVEHQGISVVDAVLDELKRATTQQNKTSART----ILIQSTDKSVLMKFKEK 530
Query: 146 YFAEHPELKKKFDDEIRN 163
H EL + DD IR+
Sbjct: 531 NKMNHDELVYRVDDNIRD 548
>At3g17520 unknown protein
Length = 298
Score = 29.6 bits (65), Expect = 0.88
Identities = 34/161 (21%), Positives = 68/161 (42%), Gaps = 14/161 (8%)
Query: 3 GTVKKVTDVAFKAGKKIDWDGMAKLLVTDEARREFFNLRRAFDEVN---TQLETKFSQEP 59
G K+ A++ K +G+A V D+A + + + + D+V+ Q++ S +
Sbjct: 92 GAAKEKAGSAYETAKSKVEEGLAS--VKDKASQSYDSAGQVKDDVSHKSKQVKDSLSGDE 149
Query: 60 EPIDWEYYRKG-IGTRLVDMYKQHYESIEIPKFVDTVTPQYKPKFDALLVELKEAEEKSL 118
W + K IG + E I P +TV+ + K +A + +A+EK
Sbjct: 150 NDESWTGWAKEKIGIK--------NEDINSPNLGETVSEKAKEAKEAAKRKAGDAKEKLA 201
Query: 119 KESERLEKEIVNVQSLKKRISTMTADEYFAEHPELKKKFDD 159
+ E +++ ++ S K + +E E K+K +
Sbjct: 202 ETVETAKEKASDMTSAAKEKAEKLKEEAERESKSAKEKIKE 242
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.313 0.132 0.375
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,024,727
Number of Sequences: 26719
Number of extensions: 173023
Number of successful extensions: 742
Number of sequences better than 10.0: 81
Number of HSP's better than 10.0 without gapping: 25
Number of HSP's successfully gapped in prelim test: 56
Number of HSP's that attempted gapping in prelim test: 693
Number of HSP's gapped (non-prelim): 102
length of query: 177
length of database: 11,318,596
effective HSP length: 93
effective length of query: 84
effective length of database: 8,833,729
effective search space: 742033236
effective search space used: 742033236
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (26.2 bits)
Medicago: description of AC144484.9