
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144726.5 - phase: 0 /pseudo
(293 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At2g03780 unknown protein 344 3e-95
At5g64860 4-alpha-glucanotransferase 33 0.18
At3g43220 putative protein 32 0.31
At5g28430 putative protein 30 1.2
At1g16650 Hypothetical protein 30 1.2
At5g48690 unknown protein 30 1.5
At3g28360 P-glycoprotein like protein 30 1.5
At2g26890 unknown protein 30 1.5
At5g37190 COP1-interacting protein 4 (CIP4) 30 2.0
At3g26050 unknown protein (At3g26050) 30 2.0
At4g26020 hypothetical protein 29 3.4
At4g00930 29 3.4
At3g14090 unknown protein 29 3.4
At2g39360 putative protein kinase 29 3.4
At1g75150 hypothetical protein 29 3.4
At4g01820 P-glycoprotein-like protein pgp3 28 5.8
At3g28380 P-glycoprotein, putative 28 5.8
At3g09430 hypothetical protein 28 5.8
At3g02260 unknown protein 28 5.8
At4g01830 putative P-glycoprotein-like protein 28 7.6
>At2g03780 unknown protein
Length = 287
Score = 344 bits (883), Expect = 3e-95
Identities = 182/279 (65%), Positives = 215/279 (76%), Gaps = 22/279 (7%)
Query: 1 MLRSFSSSFHRLSLFMASEKT--QRLHQSILLFFDFGFSTVTGTNFQNTSKRPKTMSIAT 58
ML SS+F R++ + + K QRLHQ + + ++ K+ +TMS
Sbjct: 1 MLSCSSSAFQRVAFMLMAPKLKPQRLHQ------------IAESGVEHLVKKARTMS--- 45
Query: 59 DTATVTDSAMKEPFTKYTEYLNNLNDKRERVVKASRDITMNSKKVIFQVHRMSKYNKDEV 118
T+S+MK+ F+ Y +YLNN N+KRERVVK SRDITMNSKKVIFQVHR+SK NK+EV
Sbjct: 46 -----TESSMKDAFSTYADYLNNFNEKRERVVKVSRDITMNSKKVIFQVHRLSKDNKEEV 100
Query: 119 LEKAEKDLAAVTNQHVSRLVKELQGTDFWKLRRAYSPGIQEYVEAATFCSFCKNGTLLKL 178
LEKA KDL AV +QH +RL+KELQGTDFWKLRRAYSPG+QEYVEAATF FC +GTL L
Sbjct: 101 LEKAGKDLEAVRDQHFARLMKELQGTDFWKLRRAYSPGVQEYVEAATFYKFCLSGTLCTL 160
Query: 179 DEINKTLLPLSDPSLQPLQINILDYILGLADLTGELMRLAIGRISDGELEFAEKICSFAR 238
DEIN TL+PLSDPSL+PLQINILDYILGLADLTGELMR+AIGRISDGE+EFA++IC F R
Sbjct: 161 DEINTTLVPLSDPSLEPLQINILDYILGLADLTGELMRMAIGRISDGEIEFAQRICQFVR 220
Query: 239 DIYRELTLVVPHMDDSSDMKTKMETMLQSVMKIENVFMS 277
I+REL LVVP MDDS DMK+KME MLQSV+KIEN S
Sbjct: 221 QIHRELMLVVPKMDDSYDMKSKMEVMLQSVIKIENACFS 259
>At5g64860 4-alpha-glucanotransferase
Length = 576
Score = 33.1 bits (74), Expect = 0.18
Identities = 22/70 (31%), Positives = 32/70 (45%), Gaps = 1/70 (1%)
Query: 170 CKNGTLLKLDEINKTLLPLSDPSLQPLQINILDYILGLADLTGELMRLAIGRISDGELEF 229
C N L+ LDE+ K L + D QP+ + ++Y L L+ A R+ DG E
Sbjct: 143 CGNTLLISLDELVKDGLLIKDELPQPIDADSVNYQTA-NKLKSPLITKAAKRLIDGNGEL 201
Query: 230 AEKICSFARD 239
K+ F D
Sbjct: 202 KSKLLDFRND 211
>At3g43220 putative protein
Length = 794
Score = 32.3 bits (72), Expect = 0.31
Identities = 12/67 (17%), Positives = 39/67 (57%)
Query: 65 DSAMKEPFTKYTEYLNNLNDKRERVVKASRDITMNSKKVIFQVHRMSKYNKDEVLEKAEK 124
+S ++ F+ ++++++ + + S D+ ++ K++ V ++ ++ D+V E+
Sbjct: 641 ESMPQKGFSAPLQHVSHILSESSSDIPVSNDVALSRKQLFGDVQKVHRFGSDQVYFGGEE 700
Query: 125 DLAAVTN 131
D+++V+N
Sbjct: 701 DMSSVSN 707
>At5g28430 putative protein
Length = 486
Score = 30.4 bits (67), Expect = 1.2
Identities = 27/100 (27%), Positives = 48/100 (48%), Gaps = 5/100 (5%)
Query: 106 QVHRMSKYNKDEVLEKAEKDLAAVTN-QHVSRLVKELQGTDFWKLRRAYSPGIQEYVEAA 164
+V RM+ NK+ ++ AE++L A + + R++ E +G ++ RA +E E
Sbjct: 305 EVERMAYVNKENAIKMAEQNLKANSEIVRLKRMLSEERGLRDSEVARAVQTTRREVSE-- 362
Query: 165 TFCSFCKNG--TLLKLDEINKTLLPLSDPSLQPLQINILD 202
TF + K + LDE+N + LS I +L+
Sbjct: 363 TFIAKMKTAEHKVSLLDEVNDRFMYLSQARANAQLIEVLE 402
>At1g16650 Hypothetical protein
Length = 521
Score = 30.4 bits (67), Expect = 1.2
Identities = 26/112 (23%), Positives = 49/112 (43%), Gaps = 9/112 (8%)
Query: 91 KASRDITMNSKKVI-FQVHRMSKYNKDEVLEKAEKDLAAVTNQHVSRLVKELQGTDFWKL 149
K R ++NS F+ +S +++ + + DL A N+ + EL G +W +
Sbjct: 414 KPMRQTSLNSDMCSSFEKFCLSAFSRLNLEHPRDLDLNATWNE--ADAFTELIGP-YWSI 470
Query: 150 RRAYSPGIQEYVEAATFCSFCKNGTLLKLDEINKTLLPLSDPSLQPLQINIL 201
R A P ++ + + G D I +LP+ DP++ P + I+
Sbjct: 471 RAALGPVLETLILLDRLMFLQEQG-----DSIKVVMLPIFDPTISPRNVAII 517
>At5g48690 unknown protein
Length = 301
Score = 30.0 bits (66), Expect = 1.5
Identities = 19/62 (30%), Positives = 28/62 (44%)
Query: 11 RLSLFMASEKTQRLHQSILLFFDFGFSTVTGTNFQNTSKRPKTMSIATDTATVTDSAMKE 70
RL + E+ R + I GF V G+ F + SKR ++ D A + +SA
Sbjct: 229 RLKNRLFHERVGRYKEGIEFMELCGFKRVEGSEFLSLSKRDSDINRLRDAAFLLNSASAN 288
Query: 71 PF 72
PF
Sbjct: 289 PF 290
>At3g28360 P-glycoprotein like protein
Length = 1158
Score = 30.0 bits (66), Expect = 1.5
Identities = 30/128 (23%), Positives = 48/128 (37%), Gaps = 8/128 (6%)
Query: 20 KTQRLHQSILLFFDFGFSTVTGTNFQNTSKRPKTMSIATDTATVTDSAMKEP-FTKYTEY 78
KT H+ I D G+ T G S K + + +K P E
Sbjct: 1026 KTANAHEFITSLSD-GYDTYCGDRGVQLSGGQK------QRIAIARTILKNPSILLLDEA 1078
Query: 79 LNNLNDKRERVVKASRDITMNSKKVIFQVHRMSKYNKDEVLEKAEKDLAAVTNQHVSRLV 138
+ L+ + ERVV+ + + M K + HR+S + + +K + H S L
Sbjct: 1079 TSALDSQSERVVQDALEHVMVGKTSVVIAHRLSTIQNCDTIAVLDKGKVVESGTHASLLA 1138
Query: 139 KELQGTDF 146
K G+ F
Sbjct: 1139 KGPTGSYF 1146
>At2g26890 unknown protein
Length = 2535
Score = 30.0 bits (66), Expect = 1.5
Identities = 20/84 (23%), Positives = 42/84 (49%), Gaps = 6/84 (7%)
Query: 71 PFTKYTEYLNNLNDK-RERVVKASRDITMNSKKVIFQVHRMSKYNKDEVLEKAEKDLAAV 129
P ++ E+L +L RE + + D++ I ++ ++ + D++ A +L
Sbjct: 1448 PIVEHVEFLQSLLVMWREELTRKPMDLSEGEACKILEIS-LNNVSSDDLNRTASVEL--- 1503
Query: 130 TNQHVSRLVKELQGTDFWKLRRAY 153
N+ +S + K++Q D KL+R Y
Sbjct: 1504 -NEEISNISKQIQNLDEEKLKRQY 1526
>At5g37190 COP1-interacting protein 4 (CIP4)
Length = 876
Score = 29.6 bits (65), Expect = 2.0
Identities = 16/47 (34%), Positives = 27/47 (57%), Gaps = 7/47 (14%)
Query: 82 LNDKRERVVKASRDITMNSKKVIFQVHRMSKYNKDEVLEKAEKDLAA 128
+N+K+E V K+S +T+N+ KV + NK EV++K + A
Sbjct: 734 VNNKKEAVKKSSNSVTVNNSKV-------NVNNKKEVVKKISNSVTA 773
>At3g26050 unknown protein (At3g26050)
Length = 533
Score = 29.6 bits (65), Expect = 2.0
Identities = 23/100 (23%), Positives = 43/100 (43%), Gaps = 7/100 (7%)
Query: 32 FDFGFSTVTGTNFQNTS-KRPKTMSIATDTATVTDSAMKEPFTKYTEYLNNLNDKRERVV 90
F GF TN + K P+ T+T + T E N+ +
Sbjct: 431 FSCGFKANQNTNLASEEHKNPQVGGFQVTPMTLTSPRFRRNQTPGKE---NIKKPHQTPH 487
Query: 91 KASRDITMNSKKVIFQVHRMSKY---NKDEVLEKAEKDLA 127
KAS +N+KKV+ + H+ SK +K + ++ +++++
Sbjct: 488 KASSMKIINTKKVVMEKHKSSKIHPSSKQQTTKRTQENMS 527
>At4g26020 hypothetical protein
Length = 273
Score = 28.9 bits (63), Expect = 3.4
Identities = 32/143 (22%), Positives = 59/143 (40%), Gaps = 9/143 (6%)
Query: 48 SKRPKTMSIATDTATVTDSAMKEPFTKYTEYLNNLNDKRERVVKAS--RDITMNSKKVIF 105
S R + + I+ + +K+ YTE LNN D+ R S + +++ +
Sbjct: 93 SSRREVLEISNKNLKEENERLKK---LYTESLNNFADQEHRNALESLRQKHVTKVEELEY 149
Query: 106 QVHRM--SKYNKDEVLEKAEKDLAAVTNQHVSRLVKELQGTDFWKLRRAYSPGIQEYVEA 163
++ + K D V+++ +DL A H+ + K+L ++ Y IQ+ +
Sbjct: 150 KIRSLLVEKATNDMVIDRLRQDLTA-NKSHIQAMSKKLDRV-VTEVECKYELEIQDLKDC 207
Query: 164 ATFCSFCKNGTLLKLDEINKTLL 186
KN KL + K LL
Sbjct: 208 LLMEQAEKNDISNKLQSLQKELL 230
>At4g00930
Length = 976
Score = 28.9 bits (63), Expect = 3.4
Identities = 16/61 (26%), Positives = 33/61 (53%), Gaps = 9/61 (14%)
Query: 82 LNDKRERVVKASRDITMNSKKVIFQVHRMSKYNKDEVLEKAEKDLAAVTNQHVSRLVKEL 141
+N+K+E V K+S+ +T+N K M+ NK++ ++K + A N+ + K+
Sbjct: 884 VNNKKEAVKKSSKSVTVNKSK-------MNVNNKEKAVKKISNSVTA--NKSTTNFFKDA 934
Query: 142 Q 142
+
Sbjct: 935 E 935
>At3g14090 unknown protein
Length = 623
Score = 28.9 bits (63), Expect = 3.4
Identities = 20/97 (20%), Positives = 44/97 (44%), Gaps = 6/97 (6%)
Query: 184 TLLPLSDPSLQPLQINILDYILGLADLTGELMRLAIGRISDG------ELEFAEKICSFA 237
+++P+ ++ PL +++YI+ ++D L L + S G +++F E
Sbjct: 367 SIVPVPGGTIHPLTRYVMNYIVMISDYKQTLDDLIMSNPSTGSDPNTPDMDFTELDSKSP 426
Query: 238 RDIYRELTLVVPHMDDSSDMKTKMETMLQSVMKIENV 274
D++ +VV H + K +T L + + N+
Sbjct: 427 LDLHLIWLIVVLHFNLEEKSKHYRDTSLAHIFIMNNI 463
>At2g39360 putative protein kinase
Length = 815
Score = 28.9 bits (63), Expect = 3.4
Identities = 24/97 (24%), Positives = 44/97 (44%), Gaps = 9/97 (9%)
Query: 187 PLSDPSLQPLQINILDYILGL---ADLTGELMRLAIGRISDGELEFAEKICSFARDIYRE 243
P+ DPSL ++N++++ + L L + +G++ +LE +K C +
Sbjct: 687 PVIDPSLPREKVNLIEWAMKLVKKGKLEDIIDPFLVGKV---KLEEVKKYCEVTEKCLSQ 743
Query: 244 LTLVVPHMDDSSDMKTKMETMLQSVMKIENVFMSEDQ 280
+ P M D+ +E MLQ K E M +D+
Sbjct: 744 NGIERPAM---GDLLWNLEFMLQVQAKDEKAAMVDDK 777
>At1g75150 hypothetical protein
Length = 753
Score = 28.9 bits (63), Expect = 3.4
Identities = 21/88 (23%), Positives = 40/88 (44%), Gaps = 7/88 (7%)
Query: 49 KRPKTMSIATDTATVTDSAMKEPFTKYTEYLNNLNDKRERVVKASRDITMNSKKVIFQVH 108
K+ K S D V+ ++M + EYL+ L + +R+++ +RD F+
Sbjct: 190 KKRKKKSDDFDELPVSTASMNMTKKERREYLDQLRAENQRLLRETRD-------AAFEAA 242
Query: 109 RMSKYNKDEVLEKAEKDLAAVTNQHVSR 136
+ + VLEK + ++ Q +SR
Sbjct: 243 PLVRKPISSVLEKIRRRKEEISKQFLSR 270
>At4g01820 P-glycoprotein-like protein pgp3
Length = 1229
Score = 28.1 bits (61), Expect = 5.8
Identities = 17/67 (25%), Positives = 32/67 (47%), Gaps = 1/67 (1%)
Query: 77 EYLNNLNDKRERVVKASRDITMNSKKVIFQVHRMSKYNKDEVLEKAEKDLAAVTNQHVSR 136
E + L+ + ERVV+ + D M S+ + HR+S +++ + H S
Sbjct: 512 EATSALDAESERVVQEALDRVMMSRTTVIVAHRLSTVRNADMIAVIHRGKIVEEGSH-SE 570
Query: 137 LVKELQG 143
L+K+ +G
Sbjct: 571 LLKDHEG 577
>At3g28380 P-glycoprotein, putative
Length = 1240
Score = 28.1 bits (61), Expect = 5.8
Identities = 20/70 (28%), Positives = 31/70 (43%)
Query: 77 EYLNNLNDKRERVVKASRDITMNSKKVIFQVHRMSKYNKDEVLEKAEKDLAAVTNQHVSR 136
E + L+ K ERVV+ + + M + I HR+S +++ K + H S
Sbjct: 1161 EATSALDSKSERVVQDALERVMVGRTSIMIAHRLSTIQNCDMIVVLGKGKIVESGTHSSL 1220
Query: 137 LVKELQGTDF 146
L K GT F
Sbjct: 1221 LEKGPTGTYF 1230
>At3g09430 hypothetical protein
Length = 247
Score = 28.1 bits (61), Expect = 5.8
Identities = 23/84 (27%), Positives = 36/84 (42%), Gaps = 5/84 (5%)
Query: 108 HRMSKYNKDEVLE--KAEKDLAAVTNQHVSRLVKELQGTDFWKLRRAYSPGIQEYVEAAT 165
+R SK + +L K+E +A VT H + + T F SPGI+ +
Sbjct: 88 NRPSKMARSRILSEVKSETPMALVTTAHRNSTPNIINSTKFSSPESYLSPGIR---QPTP 144
Query: 166 FCSFCKNGTLLKLDEINKTLLPLS 189
F + + +K D IN+ LS
Sbjct: 145 FAEVSPSASCVKNDPINEMRHSLS 168
>At3g02260 unknown protein
Length = 5079
Score = 28.1 bits (61), Expect = 5.8
Identities = 23/80 (28%), Positives = 35/80 (43%), Gaps = 17/80 (21%)
Query: 115 KDEVLEKAEKDLAAVTNQHVSRLVKELQ-----------GTDFWKLRRAYSPGI------ 157
+D VLE L ++T++ S L KE Q TD +L++AY G
Sbjct: 1714 EDRVLELFSSLLPSITSKRDSGLSKEKQVNLGKDKVLSFDTDLLQLKKAYKSGSLDLKIK 1773
Query: 158 QEYVEAATFCSFCKNGTLLK 177
+Y + S NG+L+K
Sbjct: 1774 ADYTNSKDLKSLLANGSLVK 1793
>At4g01830 putative P-glycoprotein-like protein
Length = 1230
Score = 27.7 bits (60), Expect = 7.6
Identities = 24/100 (24%), Positives = 40/100 (40%), Gaps = 7/100 (7%)
Query: 35 GFSTVTGTNFQNTSKRPKTMSIATDTATVTDSAMKEP-FTKYTEYLNNLNDKRERVVKAS 93
G+ TV G S K + + +KEP E + L+ + ERVV+ +
Sbjct: 1114 GYDTVVGERGIQLSGGQK------QRVAIARAIVKEPKILLLDEATSALDAESERVVQDA 1167
Query: 94 RDITMNSKKVIFQVHRMSKYNKDEVLEKAEKDLAAVTNQH 133
D M ++ I HR+S +V+ + + A H
Sbjct: 1168 LDRVMVNRTTIVVAHRLSTIKNADVIAVVKNGVIAEKGTH 1207
Score = 27.7 bits (60), Expect = 7.6
Identities = 30/119 (25%), Positives = 49/119 (40%), Gaps = 30/119 (25%)
Query: 77 EYLNNLNDKRERVVKASRDITMNSKKVIFQVHRMSKYNKDEVLEKAEKDLAAVTNQHVSR 136
E + L+ + ERVV+ + D M ++ + HR+S +++ + H S
Sbjct: 517 EATSALDAESERVVQEALDRIMVNRTTVIVAHRLSTVRNADIIAVIHRGKIVEEGSH-SE 575
Query: 137 LVKELQGTDFWKLRRAYSPGIQEYVEAATFCSFCKNGTLLKLDEINK--TLLPLSDPSL 193
L+K+ +G AYS LL+L EINK L +SD S+
Sbjct: 576 LLKDHEG--------AYS-------------------QLLRLQEINKESKRLEISDGSI 607
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.320 0.133 0.371
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,858,016
Number of Sequences: 26719
Number of extensions: 228476
Number of successful extensions: 750
Number of sequences better than 10.0: 27
Number of HSP's better than 10.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 735
Number of HSP's gapped (non-prelim): 35
length of query: 293
length of database: 11,318,596
effective HSP length: 99
effective length of query: 194
effective length of database: 8,673,415
effective search space: 1682642510
effective search space used: 1682642510
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (27.3 bits)
Medicago: description of AC144726.5