
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0010.17
(237 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At3g50670 U1 snRNP 70K protein 39 0.002
At5g47430 DNA-binding protein-like 37 0.007
At5g42820 U2 snRNP auxiliary factor, small subunit 36 0.016
At3g54220 SCARECROW1 35 0.046
At1g74720 hypothetical protein 35 0.046
At2g27100 unknown protein 34 0.078
At1g79730 unknown protein 33 0.10
At5g25060 unknown protein 33 0.13
At4g11430 putative proline-rich protein 33 0.13
At4g16790 glycoprotein homolog 33 0.17
At1g60900 putative U2 snRNP auxiliary factor 32 0.23
At1g49490 hypothetical protein 32 0.23
At5g57370 unknown protein 32 0.30
At2g33730 putative U5 small nuclear ribonucleoprotein, an RNA he... 32 0.30
At2g21230 bZIP transcription factor AtbZip30 32 0.39
At5g37370 unknown protein 31 0.50
At1g48620 unknown protein 31 0.50
At3g29060 unknown protein 31 0.66
At1g68690 protein kinase, putative 31 0.66
At1g43860 unknown protein 31 0.66
>At3g50670 U1 snRNP 70K protein
Length = 427
Score = 39.3 bits (90), Expect = 0.002
Identities = 30/84 (35%), Positives = 36/84 (42%), Gaps = 16/84 (19%)
Query: 132 PQPTTFTLSTEPRKTDKRRRRRDPGKEIERDRE------------PPIETSHPNLHRDRD 179
PQ T R ++R + R+ GKE ER RE P E H HRDRD
Sbjct: 248 PQGRTSQSEEPSRPREEREKSREKGKERERSRELSHEQPRERSRDRPREDKH---HRDRD 304
Query: 180 GGGMER-RGARSRETAKRDRETMD 202
GG +R R +R RDR D
Sbjct: 305 QGGRDRDRDSRRDRDRTRDRGDRD 328
>At5g47430 DNA-binding protein-like
Length = 889
Score = 37.4 bits (85), Expect = 0.007
Identities = 30/108 (27%), Positives = 50/108 (45%), Gaps = 13/108 (12%)
Query: 117 PYNLHEHRQQQQP-----PPPQPTTFTLSTEPRKTDKRRRRRDPG-KEIERDREPPIETS 170
P +H++ ++++P P PT +S P + KR+ R P ++ +RDRE +
Sbjct: 667 PPPIHDYDRRRRPEKRLSPEHPPTRKNIS--PSRDSKRKSERYPDERDRQRDRE---RSR 721
Query: 171 HPNLHRDRDGGGMERRGARSRETA--KRDRETMDHHHALHLPPQIPEP 216
H ++ R+ D R RSR+ + + E HHH P EP
Sbjct: 722 HQDVDREHDRTRDRRDEDRSRDHRHHRGETERSQHHHRKRSEPPSSEP 769
>At5g42820 U2 snRNP auxiliary factor, small subunit
Length = 283
Score = 36.2 bits (82), Expect = 0.016
Identities = 24/69 (34%), Positives = 33/69 (47%), Gaps = 2/69 (2%)
Query: 140 STEPRKTDKRRRRRDPGKEIERDREPPIETSHPNLHR-DRDGGGMERRGARSRETAKRD- 197
S PR+ + R R+ G +RDR + S R DRDGGG R G+ R + R+
Sbjct: 202 SISPRRKREHSRERERGDVRDRDRHGNGKRSSDRSERHDRDGGGRRRHGSPKRSRSPRNV 261
Query: 198 RETMDHHHA 206
RE + A
Sbjct: 262 REGSEERRA 270
>At3g54220 SCARECROW1
Length = 653
Score = 34.7 bits (78), Expect = 0.046
Identities = 29/124 (23%), Positives = 47/124 (37%), Gaps = 20/124 (16%)
Query: 98 DPKFTVASPIFTL--NPATQQPYNLHEHRQQQQPPPPQPTTFTLSTEPRKTDKRRRRRDP 155
DP P++ + NP+ Q H+ +QQQ PPP P + ++ + P
Sbjct: 198 DPSPQTFEPLYQISNNPSPPQQQQQHQQQQQQHKPPPPP------IQQQERENSSTDAPP 251
Query: 156 GKEIERDREPPIETSHPNLHRDRDGGGMERRGARSRETAKRDRETMDHHHALHLPPQIPE 215
E P ++T+ R+R +E KR ++ + H L L Q E
Sbjct: 252 QPETVTATVPAVQTNTAEALRER------------KEEIKRQKQDEEGLHLLTLLLQCAE 299
Query: 216 PTFA 219
A
Sbjct: 300 AVSA 303
>At1g74720 hypothetical protein
Length = 1081
Score = 34.7 bits (78), Expect = 0.046
Identities = 20/72 (27%), Positives = 30/72 (40%), Gaps = 1/72 (1%)
Query: 112 PATQQPYNLHEHRQQQQPPPPQPTTFTLSTEPRKTDKRRRRRDPGKE-IERDREPPIETS 170
P P+ H Q+ PPP P+ + P + K + R PG + I + PP
Sbjct: 245 PNDNHPHRNDNHPQRPPSPPPPPSAGEVHYYPPEVRKMQVGRPPGGDRIRVTKRPPNGDY 304
Query: 171 HPNLHRDRDGGG 182
P + + GGG
Sbjct: 305 SPRVINSKTGGG 316
>At2g27100 unknown protein
Length = 720
Score = 33.9 bits (76), Expect = 0.078
Identities = 34/112 (30%), Positives = 46/112 (40%), Gaps = 12/112 (10%)
Query: 110 LNPATQQPYNLHEHRQQQQPPPPQPTTFTLSTEPRKTDKR----RRRRDPGKEIERDREP 165
L P+ L E PPPP P++ +L + ++ D++ RR RD ER E
Sbjct: 6 LPPSDSVDNRLPEKSTSSSPPPPPPSS-SLPQQEQEQDQQQLPLRRERD---SRERRDER 61
Query: 166 PIETSHPN-LHRDRDGGGMERRGARSRETAKRDRETMDHHHALHLPPQIPEP 216
IE PN RDR RR + R + D H+ PPQ P
Sbjct: 62 DIERPPPNRRERDRSPLPPPRRDYKRRPSLSPPPPYRDRRHS---PPQRRSP 110
>At1g79730 unknown protein
Length = 589
Score = 33.5 bits (75), Expect = 0.10
Identities = 26/107 (24%), Positives = 41/107 (38%), Gaps = 14/107 (13%)
Query: 109 TLNPATQQPYNLHEHRQQQQPPPPQPTTFTLSTEPRKTDKRRRRRDPGKEIERDREPPIE 168
+L P P H+ PPPP P + ++ +++ PP
Sbjct: 28 SLPPPVPPPPPSHQPYSYPPPPPPPPHAYY---------QQGPHYPQFNQLQAPPPPPPP 78
Query: 169 TSHPNLHRD--RDGGGMERRGARSRETAKRDRETMD---HHHALHLP 210
++ P L D R G + S++ +R+R D HHH HLP
Sbjct: 79 SAPPPLVPDPPRHQGPNDHEKGASKQVGRRERAKPDPSKHHHRSHLP 125
>At5g25060 unknown protein
Length = 946
Score = 33.1 bits (74), Expect = 0.13
Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 3/57 (5%)
Query: 143 PRKTDKRRRRRDPGKEIERDREPPIETSHPNLHRDRDGGGMERRGARSRETAKRDRE 199
PRK+ R R D G++ +R+R + H +L+RDRD E+ + R+ R +E
Sbjct: 882 PRKSSTRERDHDLGRDRDRERHRDRDRQH-DLNRDRD--RREKSSSHDRDDNDRSKE 935
>At4g11430 putative proline-rich protein
Length = 219
Score = 33.1 bits (74), Expect = 0.13
Identities = 29/94 (30%), Positives = 31/94 (32%), Gaps = 6/94 (6%)
Query: 121 HEHRQQQQPPPPQPTTFTLSTEPRKT---DKRRRRRDPGKEIERDREPPIETSHPNLHRD 177
H HR+ PPPP P T P T RR P PP T P +
Sbjct: 114 HHHRRSPPPPPPPPPPPPTITPPVTTTTAGHHHHRRSPPP--PPPPPPPPPTITPPVTTT 171
Query: 178 RDGGGMERRGARSRETAKRDRETMDHHHALHLPP 211
G R T T DHH LH PP
Sbjct: 172 TTGHHHHRPPPPPPATTTPITNTSDHHQ-LHPPP 204
Score = 30.4 bits (67), Expect = 0.86
Identities = 26/100 (26%), Positives = 32/100 (32%), Gaps = 6/100 (6%)
Query: 127 QQPPPPQPTT---FTLSTEPRKTDKRRRRRDPGKEIERDREPPIETSHPNLHRDRDGGGM 183
++ PPP TT S P T R P R PP+ + HR +
Sbjct: 9 KRQPPPLATTAGHHRRSPPPATTGHHHRSPPPAITACHHRRPPLPATTAGHHRQLRPPSI 68
Query: 184 E---RRGARSRETAKRDRETMDHHHALHLPPQIPEPTFAL 220
G R T HH L PP P P A+
Sbjct: 69 PVTTNTGHRHCRPPSNPATTNSGHHQLRPPPPPPPPLSAI 108
Score = 29.3 bits (64), Expect = 1.9
Identities = 19/61 (31%), Positives = 21/61 (34%), Gaps = 6/61 (9%)
Query: 112 PATQQPYNLHEHRQQQQPPPPQPTTFTLSTEPRKTDKRRRRRDPGKEIERDREPPIETSH 171
P T H HR+ PPPP P T P T G R PP T+
Sbjct: 136 PVTTTTAGHHHHRRSPPPPPPPPPPPPTITPPVTT------TTTGHHHHRPPPPPPATTT 189
Query: 172 P 172
P
Sbjct: 190 P 190
Score = 29.3 bits (64), Expect = 1.9
Identities = 33/116 (28%), Positives = 37/116 (31%), Gaps = 21/116 (18%)
Query: 111 NPATQQPYNLHEHRQQQQPPPPQPTTFTLSTEPRKTDKRRRRRDPGKEIERDREPPIETS 170
NPAT H Q + PPPP P ++T T RR P PP T
Sbjct: 84 NPATTNS----GHHQLRPPPPPPPPLSAITT----TGHHHHRRSPPP--PPPPPPPPPTI 133
Query: 171 HPNLHRDRDGGGMERRGARSRE---------TAKRDRETMDHHHALHLPPQIPEPT 217
P + G RR T T HHH H PP P T
Sbjct: 134 TPPVTTTTAGHHHHRRSPPPPPPPPPPPPTITPPVTTTTTGHHH--HRPPPPPPAT 187
>At4g16790 glycoprotein homolog
Length = 473
Score = 32.7 bits (73), Expect = 0.17
Identities = 20/89 (22%), Positives = 39/89 (43%), Gaps = 8/89 (8%)
Query: 123 HRQQQQPPPPQ-------PTTFTLSTEPRKTDKRRRRRDPGKEIERDREPPIETSHPNLH 175
H PPPP PT F LS E RK+ +++ +R+ K++ +P +E+ +
Sbjct: 328 HPPPPPPPPPPVEYYKSPPTKFRLSNERRKSSEQKMKRNAPKKVWWS-DPIVESKEQDTE 386
Query: 176 RDRDGGGMERRGARSRETAKRDRETMDHH 204
++ + + E ++ R + H
Sbjct: 387 KNDQRSNLGSKAVEESENGEQRRGENEIH 415
>At1g60900 putative U2 snRNP auxiliary factor
Length = 589
Score = 32.3 bits (72), Expect = 0.23
Identities = 21/61 (34%), Positives = 29/61 (47%), Gaps = 3/61 (4%)
Query: 145 KTDKRRRRRDPGKEIERDREPPIETSHPNLHRDRDGGGMERRGARSRETAKRDRETMDHH 204
K R + RD +E RDR+ E S R RD +R RSRE +++ + D H
Sbjct: 70 KDRDREKSRDRDREKSRDRDRDRERSKD---RQRDRHHRDRHRDRSRERSEKRDDLDDDH 126
Query: 205 H 205
H
Sbjct: 127 H 127
Score = 30.0 bits (66), Expect = 1.1
Identities = 16/61 (26%), Positives = 26/61 (42%)
Query: 144 RKTDKRRRRRDPGKEIERDREPPIETSHPNLHRDRDGGGMERRGARSRETAKRDRETMDH 203
R D+ R R + +++ D RDRD RR +RSR ++ +R +
Sbjct: 107 RHRDRSRERSEKRDDLDDDHHRRSRDRDRRRSRDRDREVRHRRRSRSRSRSRSERRSRSE 166
Query: 204 H 204
H
Sbjct: 167 H 167
Score = 28.9 bits (63), Expect = 2.5
Identities = 15/58 (25%), Positives = 27/58 (45%)
Query: 142 EPRKTDKRRRRRDPGKEIERDREPPIETSHPNLHRDRDGGGMERRGARSRETAKRDRE 199
E + R + RD ++ ER ++ + H + HRDR E+R + +R R+
Sbjct: 75 EKSRDRDREKSRDRDRDRERSKDRQRDRHHRDRHRDRSRERSEKRDDLDDDHHRRSRD 132
>At1g49490 hypothetical protein
Length = 847
Score = 32.3 bits (72), Expect = 0.23
Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 3/63 (4%)
Query: 111 NPATQQPYNLHEHRQQQQPPPPQPTTFTLSTEPRKTDKRRRRRDPGKEIERDREPPIETS 170
+P ++P N HE +Q++ P PQP+ S +P ++ + P E + EP S
Sbjct: 442 SPKPEEPENKHELPKQKESPKPQPSKPEDSPKP---EQPKPEESPKPEQPQIPEPTKPVS 498
Query: 171 HPN 173
PN
Sbjct: 499 PPN 501
Score = 27.7 bits (60), Expect = 5.6
Identities = 12/48 (25%), Positives = 22/48 (45%)
Query: 86 EKQKQQRQGFLVDPKFTVASPIFTLNPATQQPYNLHEHRQQQQPPPPQ 133
E+ + Q + +P V+ P P PY+ + ++ PPPP+
Sbjct: 480 EESPKPEQPQIPEPTKPVSPPNEAQGPTPDDPYDASPVKNRRSPPPPK 527
>At5g57370 unknown protein
Length = 219
Score = 32.0 bits (71), Expect = 0.30
Identities = 20/58 (34%), Positives = 26/58 (44%), Gaps = 1/58 (1%)
Query: 147 DKRRRRRDPGKEIERDREPPIETSHPNLHRDRDGGGMERRGARSRETAKRDRETMDHH 204
D+R R RD +E +RDR + + RDR+ ER R K T DHH
Sbjct: 5 DRRERERDKDREPDRDRRRGRDDRDRDRDRDRE-RDRERDRDRGLRNKKSRSRTPDHH 61
>At2g33730 putative U5 small nuclear ribonucleoprotein, an RNA
helicase
Length = 733
Score = 32.0 bits (71), Expect = 0.30
Identities = 27/88 (30%), Positives = 38/88 (42%), Gaps = 25/88 (28%)
Query: 125 QQQQPPPPQPTTFTLSTEPRKTDKRR-------------RRRDPGKEIERDREPPIETSH 171
+++Q P+P T +S DK R R RD G++ +RDR+
Sbjct: 51 RREQLGRPEPETEDVSNGDTNRDKDRDRDRDRDRERDRDRERDRGRDRDRDRD------- 103
Query: 172 PNLHRDRDGGGMERRGARSRETAKRDRE 199
RDRD ER R R+ +RDRE
Sbjct: 104 ----RDRD-RDRERERDRERDRRERDRE 126
Score = 28.9 bits (63), Expect = 2.5
Identities = 17/60 (28%), Positives = 28/60 (46%), Gaps = 1/60 (1%)
Query: 144 RKTDKRRRR-RDPGKEIERDREPPIETSHPNLHRDRDGGGMERRGARSRETAKRDRETMD 202
R+ D+ R R RD ++ +RDRE + RDR+ R R E R++ ++
Sbjct: 90 RERDRGRDRDRDRDRDRDRDRERERDRERDRRERDREPDRRNREKEREEEVKAREKARVE 149
>At2g21230 bZIP transcription factor AtbZip30
Length = 519
Score = 31.6 bits (70), Expect = 0.39
Identities = 27/108 (25%), Positives = 39/108 (36%), Gaps = 5/108 (4%)
Query: 110 LNPATQQPYNLHEHRQQQQPPPPQPTTFTLSTEPRKTDKRRRR--RDPGKEIERDREPPI 167
LNPA + ++ H PPPP P S P R R P D PP+
Sbjct: 34 LNPALIRSHHHFRHPFTGAPPPPIPPISPYSQIPATLQPRHSRSMSQPSSFFSFDSLPPL 93
Query: 168 ETSHPNLH---RDRDGGGMERRGARSRETAKRDRETMDHHHALHLPPQ 212
S P++ ++ G G S T + + +LPP+
Sbjct: 94 NPSAPSVSVSVEEKTGAGFSPSLPPSPFTMCHSSSSRNAGDGENLPPR 141
>At5g37370 unknown protein
Length = 393
Score = 31.2 bits (69), Expect = 0.50
Identities = 28/108 (25%), Positives = 42/108 (37%), Gaps = 2/108 (1%)
Query: 112 PATQQPYNLHEHRQQQQPPPPQPTTFTLSTEPRKTDKRRRRRDPGKEIERDREPPIETSH 171
P T N + QQ+ P Q + + + +R R +D +E RDR E +
Sbjct: 228 PPTGYDRNGGDEVQQRSPRRSQSRDYYSDRDSDRQREREREKDRERERGRDRYRERERDY 287
Query: 172 PNLHRD-RDGGGMERRGARSRETAKRDRETMDHHHALHLPPQIP-EPT 217
N R RD RR + ++ DR + + QI EPT
Sbjct: 288 GNDRRSRRDYDSRSRRNDYEDDRSRHDRRSRSRSRSRSRSVQIEREPT 335
>At1g48620 unknown protein
Length = 479
Score = 31.2 bits (69), Expect = 0.50
Identities = 37/154 (24%), Positives = 57/154 (36%), Gaps = 11/154 (7%)
Query: 82 THGREKQKQQRQGFLVDPKFTVASPIFTLNPATQQPYNLHEHRQQQQPPPPQPTTFTLST 141
T+G+ +Q + + P L P QQP R ++ P +
Sbjct: 206 TNGKLTWEQSELPVSRPEEIQIQPPQLPLQP--QQPVKRPPGRPRKDGTSPTVKPAASVS 263
Query: 142 EPRKTDKRRRRRDPGKEIERDREPPIETSHPNLHRDRDGGGMERRGARSRETAKRDRETM 201
+T KRR R G+ R+R+P + ++ ++ GG+ RRG R +
Sbjct: 264 GGVETVKRRGRPPSGRAAGRERKPIVVSAPASVFPYVANGGVRRRGRPKR---------V 314
Query: 202 DHHHALHLPPQIPEPTFALGFLRLVVVLGDGGGR 235
D A + P P PT V V G GR
Sbjct: 315 DAGGASSVAPPPPPPTNVESGGEEVAVKKRGRGR 348
Score = 26.9 bits (58), Expect = 9.5
Identities = 10/19 (52%), Positives = 13/19 (67%)
Query: 129 PPPPQPTTFTLSTEPRKTD 147
PPPP PT+ S EP ++D
Sbjct: 145 PPPPPPTSVAPSLEPPRSD 163
>At3g29060 unknown protein
Length = 794
Score = 30.8 bits (68), Expect = 0.66
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 124 RQQQQPPPPQPTTFTLSTEPRKTD 147
++QQ+PPPP P T T P KTD
Sbjct: 38 QKQQRPPPPPPPPSTGDTVPLKTD 61
>At1g68690 protein kinase, putative
Length = 708
Score = 30.8 bits (68), Expect = 0.66
Identities = 24/80 (30%), Positives = 30/80 (37%), Gaps = 9/80 (11%)
Query: 99 PKFTVASPIFTLNPATQQPYNLHEHRQQQQPPPPQPTTFTLSTEPRKTDKRRRRRDPGKE 158
P V SP ++ P Q P R Q PPPP P P +R + P
Sbjct: 138 PPILVRSPPPSVRPI-QSPPPPPSDRPTQSPPPPSP--------PSPPSERPTQSPPSPP 188
Query: 159 IERDREPPIETSHPNLHRDR 178
ER + P S P+ DR
Sbjct: 189 SERPTQSPPPPSPPSPPSDR 208
>At1g43860 unknown protein
Length = 370
Score = 30.8 bits (68), Expect = 0.66
Identities = 25/103 (24%), Positives = 44/103 (42%), Gaps = 9/103 (8%)
Query: 18 LSWRLGEEDHSQPMLCGEGKALGQDIDKPLVGASSVLYARSTMEDMVGLLLGLLEPFRIW 77
LSWR G E +L + + ++ K ++ S L +D + + +LE +
Sbjct: 39 LSWRSGVEKDIDEVL--QSHTVYSNVSKGVLAKSKDLMKSFGSDDHTKICIDILEKGELQ 96
Query: 78 VVTATHGREKQKQQRQGFLVDPKFTVASPIFTLNPATQQPYNL 120
V G+E++ Q F + T+NP TQ+PY +
Sbjct: 97 VA----GKERESQFSSQFRDIATIVMQK---TINPETQRPYTI 132
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.318 0.138 0.424
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,327,382
Number of Sequences: 26719
Number of extensions: 297785
Number of successful extensions: 1608
Number of sequences better than 10.0: 95
Number of HSP's better than 10.0 without gapping: 30
Number of HSP's successfully gapped in prelim test: 67
Number of HSP's that attempted gapping in prelim test: 1383
Number of HSP's gapped (non-prelim): 214
length of query: 237
length of database: 11,318,596
effective HSP length: 96
effective length of query: 141
effective length of database: 8,753,572
effective search space: 1234253652
effective search space used: 1234253652
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 58 (26.9 bits)
Lotus: description of TM0010.17