
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC145329.1 - phase: 0
(236 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At1g02660 unknown protein 131 4e-31
At3g62590 unknown protein 116 1e-26
At3g61680 unknown protein 78 4e-15
At3g54670 structural maintenance of chromosomes (SMC) - like pro... 33 0.17
At1g48250 hypothetical protein 33 0.17
At1g13160 unknown protein 33 0.17
At2g43650 unknown protein 32 0.38
At2g24440 unknown protein 31 0.50
At1g55600 putative protein 31 0.50
At5g27630 unknown protein 31 0.65
At3g59020 putative protein 31 0.65
At1g63980 unknown protein 31 0.65
At1g05910 unknown protein 31 0.65
At4g31570 putative protein 30 0.86
At2g42700 unknown protein 30 0.86
At1g80930 unknown protein 30 0.86
At4g05410 U3 snoRNP-associated -like protein 30 1.1
At2g11910 unknown protein 30 1.1
At5g17910 putative protein 30 1.5
At4g29520 unknown protein 30 1.5
>At1g02660 unknown protein
Length = 713
Score = 131 bits (329), Expect = 4e-31
Identities = 104/283 (36%), Positives = 137/283 (47%), Gaps = 62/283 (21%)
Query: 1 MVTLCLLQSGIPGIVPLITSISATTTTSRANDHVHVHQSHVTTVRRSNKSSMFSRFI--- 57
M +LCL SG+ G++P IT++ V + T S K F
Sbjct: 1 MDSLCL-NSGLHGVIPAITAVGNGGCGG-------VVEVRATASAPSQKRGPFGFSFKYP 52
Query: 58 -----------GSSARNNCLAAVNDAFTAENAD-------RTVKEGDGQ--NGNWVFKVF 97
G ++R ++DA ++ D T E D + NG+WV K+
Sbjct: 53 LTPFWSRGGGGGIASRRRSGLCLDDAVLVDSGDSRKPIAEETAVEMDTERRNGSWVLKIL 112
Query: 98 DLNSVWKGEQESGDN-----DGDE-----------------CDVCRVDEEVDDENEDEEI 135
D+ S WK E+E D+ DGDE CDVC V E DD NE +
Sbjct: 113 DVQSTWKHEEEEDDDEVEDEDGDEDEEVELDDAVVSEDDGGCDVCSVLE--DDGNEANKF 170
Query: 136 RFDRESFSRMLRRVTLVEARMYAHMSHLGNLAYSIPNIKQGNLLKRCGLRFVTSSIEKKE 195
+ DRESFS++LRRVTL E+++YA +S+LGNLAYSI IK NL K GLRFVTSS EK E
Sbjct: 171 QLDRESFSKLLRRVTLPESKLYAQLSYLGNLAYSISKIKPANLSKYYGLRFVTSSAEKTE 230
Query: 196 LAASIKKEETNGK-----DAGE--RKVEKNGELKTSASNACEI 231
A + E +G+ +A E + EKN K SAS A EI
Sbjct: 231 SALKAENGEVSGETKPIVEAEEEVEEEEKNKSRKISASAAYEI 273
>At3g62590 unknown protein
Length = 649
Score = 116 bits (290), Expect = 1e-26
Identities = 71/178 (39%), Positives = 96/178 (53%), Gaps = 29/178 (16%)
Query: 57 IGSSARNNCLAAVNDAFTAENADRTVKEGDGQNGNWVFKVFDLNSVWKGE-QESGDNDG- 114
IG +DA E DR E D NGNWV K+ ++ S+WKG+ Q SG G
Sbjct: 58 IGGKREEKGTVRDDDAVLLERRDRNRNEND--NGNWVLKILEVGSIWKGKRQRSGGGGGG 115
Query: 115 ------------------DECDVCRVDEEVDDENEDEEIRFDRESFSRMLRRVTLVEARM 156
+ECD CR+D++ +DE +++ + FS ML ++ + +A+M
Sbjct: 116 EEDEEEEVAEPKKKEDLCEECDFCRIDDDDEDEEKEKTVF----EFSEMLSKIPVEDAQM 171
Query: 157 YAHMSHLGNLAYSIPNIKQGNLLKRCGLRFVTSSIEKKELAASIKKEETNGKDAGERK 214
+A +S LGNLAYSIP IK NLLK LRFVTSSIEK+ S+K EE N + E K
Sbjct: 172 FAKLSFLGNLAYSIPKIKPENLLKYQKLRFVTSSIEKR---MSLKVEENNNGEEDEEK 226
>At3g61680 unknown protein
Length = 658
Score = 78.2 bits (191), Expect = 4e-15
Identities = 49/123 (39%), Positives = 75/123 (60%), Gaps = 7/123 (5%)
Query: 88 QNGNWVFKVFDLNSVWKGEQ--ESGDNDGDECDV---CRVDEEVDDENEDEEIRFD--RE 140
+ NWV ++ ++ WK EQ ESG++D E V C +EE + D RE
Sbjct: 123 KKANWVERLLEIRRQWKREQKTESGNSDVAEESVDVTCGCEEEEGCIANYGSVNGDWGRE 182
Query: 141 SFSRMLRRVTLVEARMYAHMSHLGNLAYSIPNIKQGNLLKRCGLRFVTSSIEKKELAASI 200
SFSR+L +V+ EA+ + +++L NLAY+IP IK +L + GL+FVTSS+EKK AA +
Sbjct: 183 SFSRLLVKVSWSEAKKLSQLAYLCNLAYTIPEIKGEDLRRNYGLKFVTSSLEKKAKAAIL 242
Query: 201 KKE 203
+++
Sbjct: 243 REK 245
>At3g54670 structural maintenance of chromosomes (SMC) - like
protein
Length = 1265
Score = 32.7 bits (73), Expect = 0.17
Identities = 38/120 (31%), Positives = 54/120 (44%), Gaps = 25/120 (20%)
Query: 116 ECDVCRVDEEVDDENED------EEIRFDRESFSRM------LRRVTLVEARMYAHMSHL 163
E D+ + +E+VD E + E +F+RE+ R L+ + E ++ S L
Sbjct: 242 ENDIEKANEDVDSEKSNRKDVMRELEKFEREAGKRKVEQAKYLKEIAQREKKIAEKSSKL 301
Query: 164 GNLAYSIP-NIKQGNLLKRCGLRFVTSSIEKKELAASIKKEETNGKDAGERKVEKNGELK 222
G + SIP Q LL RF K+E+A K ETN KD +RK EK K
Sbjct: 302 GKIV-SIPWKSVQPELL-----RF------KEEIARIKAKIETNRKDVDKRKKEKGKHSK 349
>At1g48250 hypothetical protein
Length = 354
Score = 32.7 bits (73), Expect = 0.17
Identities = 19/56 (33%), Positives = 32/56 (56%), Gaps = 6/56 (10%)
Query: 105 GEQESGDNDG--DECDVCRVDEEVDDENEDEEIRFDRESFSRMLRRVTLVEARMYA 158
GE+E+ DNDG D D ++ + + ++ED+ DR +R RR+TL + + A
Sbjct: 288 GEEEASDNDGGDDIWDEDKIPDPLSSDDEDD----DRVEAARNDRRITLTDVLLIA 339
>At1g13160 unknown protein
Length = 804
Score = 32.7 bits (73), Expect = 0.17
Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 2/64 (3%)
Query: 71 DAFTAENADRTVKEGDGQNGNWVFKVFDLNSVWKGEQESGDNDGDECDVCRVDEEVDDEN 130
D + + A+ +GD N D+++ G+++ ND DE D +EE++ E
Sbjct: 554 DCGSEDKAEEDSNDGDDMNNTEDDS--DIDTSIGGDEDEEVNDSDEADTDSENEEIESEE 611
Query: 131 EDEE 134
ED E
Sbjct: 612 EDGE 615
>At2g43650 unknown protein
Length = 654
Score = 31.6 bits (70), Expect = 0.38
Identities = 22/80 (27%), Positives = 35/80 (43%), Gaps = 18/80 (22%)
Query: 84 EGDGQNGNWVFKVFDLNSVWKGEQESGDNDGDECDVCRVDEEVDDENEDEEIRFDRESFS 143
EGDG+N N F + DNDGD D+ VD++ E E+ + +
Sbjct: 512 EGDGRNKNGAF----------ASDDEDDNDGDNNDM------VDNDGESEDEFYKQVKQK 555
Query: 144 RMLRRVTLVEARMYAHMSHL 163
+ +R +A +Y+ HL
Sbjct: 556 QQAKRA--AKAEIYSRKPHL 573
>At2g24440 unknown protein
Length = 183
Score = 31.2 bits (69), Expect = 0.50
Identities = 25/107 (23%), Positives = 46/107 (42%), Gaps = 4/107 (3%)
Query: 124 EEVDDENEDEEIRFDRESFSRMLRRVTLVEARMYAHMSHLGNLAYSIPNIKQGNLLKRCG 183
++VD E + + I R R + R T + + S L + P K+ K
Sbjct: 4 KKVDGEGKGKAIANTR--MLRSMDRKTRSDTKRDGSSSKL--MKIESPEKKKRKTTKAKN 59
Query: 184 LRFVTSSIEKKELAASIKKEETNGKDAGERKVEKNGELKTSASNACE 230
+ ++K+E+A I+KEE DA E++ + + + K C+
Sbjct: 60 VGAAKKKVKKEEVAVKIEKEEEEDDDAAEKEEDDDSDKKKIVIEHCK 106
>At1g55600 putative protein
Length = 485
Score = 31.2 bits (69), Expect = 0.50
Identities = 14/43 (32%), Positives = 24/43 (55%)
Query: 98 DLNSVWKGEQESGDNDGDECDVCRVDEEVDDENEDEEIRFDRE 140
D+ S+ E E G+ D D+ D DE+ D ++D+++ D E
Sbjct: 213 DIISIEDSESEDGNKDDDDEDFQYEDEDEDQYDQDQDVDEDEE 255
>At5g27630 unknown protein
Length = 648
Score = 30.8 bits (68), Expect = 0.65
Identities = 16/49 (32%), Positives = 29/49 (58%)
Query: 37 HQSHVTTVRRSNKSSMFSRFIGSSARNNCLAAVNDAFTAENADRTVKEG 85
+ + V ++ S+KSS+ S+ +G+SA + +AVN+A T + EG
Sbjct: 475 YNNEVNVLKPSHKSSLKSKIMGASAVPDSFSAVNNATTRDIESEIKVEG 523
>At3g59020 putative protein
Length = 1112
Score = 30.8 bits (68), Expect = 0.65
Identities = 16/35 (45%), Positives = 20/35 (56%), Gaps = 5/35 (14%)
Query: 104 KGEQESGDNDGDECDVCRV-----DEEVDDENEDE 133
K E+E D DGD+ D+ DE+ DDEN DE
Sbjct: 970 KAEEEEEDEDGDDDDMDEFQTDDEDEDGDDENPDE 1004
>At1g63980 unknown protein
Length = 391
Score = 30.8 bits (68), Expect = 0.65
Identities = 34/132 (25%), Positives = 64/132 (47%), Gaps = 25/132 (18%)
Query: 103 WKGEQESGDNDGDECDVCRVDEEVDDENEDEEIRFDRESFSRMLRRVTLVEARMYAHMSH 162
++G++ S DN D+ D D+E D+E +++E D + + +++E+ + A H
Sbjct: 256 YEGKKTSFDNSDDDDDDDDDDDEEDEEEDEDESEADDDD------KDSVIESSLPAKRKH 309
Query: 163 LGNLAYSIPNIKQGNLLKR-----------CGLRFVTSSIEKKELAASIKKEETNGKDA- 210
+ P IK NL K+ L+ + S I+ E A S+ E ++ KDA
Sbjct: 310 DEIIE---PKIKLKNLCKQIVKKDAGKGGFMKLKQLKSLID--EQAPSVLSEFSSRKDAI 364
Query: 211 --GERKVEKNGE 220
+ K+E++G+
Sbjct: 365 AYLKLKLERSGK 376
>At1g05910 unknown protein
Length = 1210
Score = 30.8 bits (68), Expect = 0.65
Identities = 30/106 (28%), Positives = 51/106 (47%), Gaps = 7/106 (6%)
Query: 36 VHQSHVTTVRRSNKSSMFS-RFIGSSARNNCLAAVNDAFTAENADR-TVKEGDGQ----N 89
VH++ T+ R + + + R G R + A T AD+ T +E DGQ N
Sbjct: 121 VHKNFSTSKSRKDMDAELAPRREGLRPRRSTTIANKRLKTESGADQDTSEEKDGQDETEN 180
Query: 90 GNWVFKVFDLNSVWKGEQE-SGDNDGDECDVCRVDEEVDDENEDEE 134
GN + D + + E E +G+++GD D D + D+E ++E+
Sbjct: 181 GNELDDADDGENEVEAEDEGNGEDEGDGEDEGEEDGDDDEEGDEEQ 226
>At4g31570 putative protein
Length = 2712
Score = 30.4 bits (67), Expect = 0.86
Identities = 30/146 (20%), Positives = 56/146 (37%), Gaps = 13/146 (8%)
Query: 84 EGDGQNGNWVFKVFDLNSVWKGEQESGDNDGDECDVCRVDEEVDDENEDEEIRFDRESFS 143
E D Q+ +V ++ L + + E D+ +E C+ ++ S +
Sbjct: 457 EFDHQHNQFVAEISQLRASYSAVTERNDSLAEELSECQ-----------SKLYAATSSNT 505
Query: 144 RMLRRVTLVEARMYAHMSHLGNLAYSIPNIKQGNLLKRCGLRFVTSSIEKKELAASIKKE 203
+ ++ EA++ + + L S+ K L +F+ +E L A I
Sbjct: 506 NLENQLLATEAQVEDFTAKMNELQLSLE--KSLLDLSETKEKFINLQVENDTLVAVISSM 563
Query: 204 ETNGKDAGERKVEKNGELKTSASNAC 229
K+ E K KN E+K +S C
Sbjct: 564 NDEKKELIEEKESKNYEIKHLSSELC 589
>At2g42700 unknown protein
Length = 788
Score = 30.4 bits (67), Expect = 0.86
Identities = 27/101 (26%), Positives = 43/101 (41%), Gaps = 6/101 (5%)
Query: 110 GDNDGDECDVCRVDEEVDDENEDEEIRFDRESFSRMLRRVTLVEARMYAHMSHLGNLA-- 167
GD + +E D + DE DD ++R +S R L +++ + R G+LA
Sbjct: 614 GDEEEEEVDNSKADESYDDMQLKLDLRDRVDSLFRFLHKLSSLRTRNLPLRE--GSLASE 671
Query: 168 YSIPNIKQGNLLKRCGLRFVTSSIEKKELAASIKKEETNGK 208
S P GN K R +T + K+E+ T G+
Sbjct: 672 SSFPGEPSGN--KGLVYRLITKVLSKQEIPGLEYHSSTVGR 710
>At1g80930 unknown protein
Length = 900
Score = 30.4 bits (67), Expect = 0.86
Identities = 25/96 (26%), Positives = 42/96 (43%), Gaps = 5/96 (5%)
Query: 105 GEQESGDNDGDECDVCRVDEEVDDENEDEE----IRFDRESFSRMLRRVTLVEARMYAHM 160
G++ES D DG + DEE D+ +E++E IR + E+ LRR +
Sbjct: 605 GDEESEDEDGSDASSEDNDEEEDESDEEDEEQMRIRDETETNLVNLRRTIYLTIMSSVDF 664
Query: 161 SHLGNLAYSIPNIKQGNLLKRCGLRFVTSSIEKKEL 196
G+ I ++ G ++ C + S E+ L
Sbjct: 665 EEAGHKLLKI-KLEPGQEMELCIMLLECCSQERTYL 699
>At4g05410 U3 snoRNP-associated -like protein
Length = 504
Score = 30.0 bits (66), Expect = 1.1
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 1/34 (2%)
Query: 98 DLNSVWKGEQESGDNDGDECDVCRVDEEVDDENE 131
D+ SV +E+G GDE D RVD EV+DE+E
Sbjct: 44 DIESVDSDAEENGFTGGDE-DGRRVDGEVEDEDE 76
>At2g11910 unknown protein
Length = 168
Score = 30.0 bits (66), Expect = 1.1
Identities = 14/31 (45%), Positives = 21/31 (67%), Gaps = 1/31 (3%)
Query: 106 EQESGDNDGDECDVCRVDEEVDDENEDEEIR 136
+ E GDND DE + +EE DDE +D+++R
Sbjct: 131 DDEEGDND-DEDEDNEDEEEDDDEEDDDDVR 160
>At5g17910 putative protein
Length = 1342
Score = 29.6 bits (65), Expect = 1.5
Identities = 15/39 (38%), Positives = 25/39 (63%), Gaps = 2/39 (5%)
Query: 104 KGEQESGDND--GDECDVCRVDEEVDDENEDEEIRFDRE 140
+G + GD++ G+E D DEE D+E EDEE + +++
Sbjct: 306 EGMESDGDSESHGEEGDNENEDEEEDEEEEDEEEKQEKK 344
Score = 28.5 bits (62), Expect = 3.2
Identities = 22/74 (29%), Positives = 39/74 (51%), Gaps = 7/74 (9%)
Query: 108 ESGDNDGDECDVCRVDEEVDDENEDEEIRFDRESFSRMLRRVTLVEARMYAHMSHLGNLA 167
E GDN+ ++ + DEE +DE E +E + D++ S+ + T + R ++ LG+L
Sbjct: 319 EEGDNENEDEEE---DEEEEDEEEKQEKKEDKDDESKSAIKWTEADQR---NVMDLGSLE 372
Query: 168 YSIPNIKQGNLLKR 181
N + NL+ R
Sbjct: 373 LE-RNQRLENLIAR 385
>At4g29520 unknown protein
Length = 306
Score = 29.6 bits (65), Expect = 1.5
Identities = 33/141 (23%), Positives = 59/141 (41%), Gaps = 41/141 (29%)
Query: 79 DRTVKEGDGQNGNWVFKVFDLNSVWKGEQESGDNDGDECDVCRVDEEVDDENEDEEIRF- 137
D+ ++ G G KV+ + KG + D+DGD DDE+E+E+ +F
Sbjct: 193 DKILRSMQGMPGAPGMKVYSREDIEKGNIGNEDDDGD-----------DDEDEEEDDKFP 241
Query: 138 --------DRESFSRMLRRVTLVEARMYAHMSHLGNLAYSIPNIKQGNLLKRCGLRFVTS 189
++ES + L++ E + K+G LKR + V++
Sbjct: 242 KNLGKVLKEKESKTEELKKTITKEFK------------------KKGEALKRHAQK-VSN 282
Query: 190 SIEK--KELAASIKKEETNGK 208
+ + K L +S K+ +GK
Sbjct: 283 RVRRWWKGLGSSSSKKPKSGK 303
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.313 0.129 0.365
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,434,326
Number of Sequences: 26719
Number of extensions: 248244
Number of successful extensions: 1697
Number of sequences better than 10.0: 78
Number of HSP's better than 10.0 without gapping: 50
Number of HSP's successfully gapped in prelim test: 30
Number of HSP's that attempted gapping in prelim test: 1436
Number of HSP's gapped (non-prelim): 209
length of query: 236
length of database: 11,318,596
effective HSP length: 96
effective length of query: 140
effective length of database: 8,753,572
effective search space: 1225500080
effective search space used: 1225500080
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.9 bits)
Medicago: description of AC145329.1