
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144474.2 - phase: 0
(201 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At2g37970 unknown protein (At2g37970) 262 8e-71
At3g10130 unknown protein 102 1e-22
At5g20140 unknown protein 94 6e-20
At1g17100 SOUL-like protein 35 0.035
At4g04280 putative transposon protein 32 0.30
At5g64760 proteasome regulatory subunit-like 31 0.39
At3g42580 putative protein 31 0.50
At5g09900 26S proteasome p55 protein-like 30 0.66
At5g35390 putative receptor-like protein kinase 30 0.86
At5g49910 heat shock protein 70 (Hsc70-7) 29 1.5
At5g10680 putative protein 29 1.9
At5g67620 unknown protein 28 2.5
At2g25560 putative DnaJ protein 28 2.5
At5g67100 DNA polymerase alpha 1 28 3.3
At5g35080 unknown protein 28 4.3
At4g38500 unknown protein 28 4.3
At3g11870 putative protein kinase 28 4.3
At1g13700 unknown protein 28 4.3
At5g23300 dihydroorotate dehydrogenase 27 5.6
At2g05790 putative beta-1,3-glucanase 27 5.6
>At2g37970 unknown protein (At2g37970)
Length = 215
Score = 262 bits (670), Expect = 8e-71
Identities = 132/216 (61%), Positives = 160/216 (73%), Gaps = 16/216 (7%)
Query: 1 MGLVFGRFSAETPKYEILKTTQNYVIRKYAPSLVAEITYDPSTFKGDKDGGFKVLVDYIG 60
MG+VFG+ + ETPKY + K+ Y IR+Y P++ AE+TYD S FKGDKDGGF++L YIG
Sbjct: 1 MGMVFGKIAVETPKYTVTKSGDGYEIREYPPAVAAEVTYDASEFKGDKDGGFQLLAKYIG 60
Query: 61 IFGKPQNTKTEKISMTTPVITKENKS----------SSEKIAMTVPVVTNE-----KNKM 105
+FGKP+N K EKI+MT PVITKE + SEKI MT PVVT E + K+
Sbjct: 61 VFGKPENEKPEKIAMTAPVITKEGEKIAMTAPVITKESEKIEMTSPVVTKEGGGEGRKKL 120
Query: 106 VTMQFTLPSMYLKVEEVPKPIDERVVIREEGGKKYGVVTFGGVASDEVVKEKVEKLRLCL 165
VTMQF LPSMY K EE P+P DERVVI+EEGG+KYGV+ F G+AS+ VV EKV+KL L
Sbjct: 121 VTMQFLLPSMYKKAEEAPRPTDERVVIKEEGGRKYGVIKFSGIASESVVSEKVKKLSSHL 180
Query: 166 EKDGFKVIGDFLLGRYNPPAITIPMFRTNEVLIPVE 201
EKDGFK+ GDF+L RYNPP T+P FRTNEV+IPVE
Sbjct: 181 EKDGFKITGDFVLARYNPP-WTLPPFRTNEVMIPVE 215
>At3g10130 unknown protein
Length = 309
Score = 102 bits (255), Expect = 1e-22
Identities = 75/199 (37%), Positives = 104/199 (51%), Gaps = 16/199 (8%)
Query: 11 ETPKYEILKTTQNYVIRKYAPSLVAEITYDPSTFKGDKDGG---FKVLVDYIGIFGKPQN 67
ET + +L T Y IR+ P VAE T P D G F VL +Y+ FGK N
Sbjct: 115 ETMNFRVLFRTDKYEIRQVEPYFVAE-TIMPGETGFDSYGASKSFNVLAEYL--FGK--N 169
Query: 68 TKTEKISMTTPVITKENKSSSEKIAMTVPVVTN--EKNKMVTMQFTLPSMYLKVEEVPKP 125
T EK+ MTTPV+T++ +S EK+ MT PV+T+ + M F +PS Y +P P
Sbjct: 170 TIKEKMEMTTPVVTRKVQSVGEKMEMTTPVITSKAKDQNQWRMSFVMPSKY--GSNLPLP 227
Query: 126 IDERVVIREEGGKKYGVVTFGGVASDEVVKEKVEKLRLCLEKDGFKVIGD---FLLGRYN 182
D V I++ K VV F G +DE ++ + +LR L+ D + D F + +YN
Sbjct: 228 KDPSVKIQQVPRKIVAVVAFSGYVTDEEIERRERELRRALQNDKKFRVRDGVSFEVAQYN 287
Query: 183 PPAITIPMFRTNEVLIPVE 201
PP T+P R NEV + VE
Sbjct: 288 PP-FTLPFMRRNEVSLEVE 305
>At5g20140 unknown protein
Length = 378
Score = 93.6 bits (231), Expect = 6e-20
Identities = 67/191 (35%), Positives = 95/191 (49%), Gaps = 23/191 (12%)
Query: 11 ETPKYEILKTTQNYVIRKYAPSLVAEITYDPSTFKGDKDGGFKVLVDYIGIFGKPQNTKT 70
ETPKY+ILK T NY +R Y P +V E D K GF + YI FGK N+
Sbjct: 206 ETPKYQILKRTANYEVRNYEPFIVVETIGD----KLSGSSGFNNVAGYI--FGK--NSTM 257
Query: 71 EKISMTTPVITKENKSSSEKIAMTVPVVTNEKNKMVTMQFTLPSMYLKVEEVPKPIDERV 130
EKI MTTPV T+ + + + V++Q +PS + +P P +E+V
Sbjct: 258 EKIPMTTPVFTQTTDT--------------QLSSDVSVQIVIPSGK-DLSSLPMPNEEKV 302
Query: 131 VIREEGGKKYGVVTFGGVASDEVVKEKVEKLRLCLEKDGFKVIGDFLLGRYNPPAITIPM 190
+++ G V F G +++VV+ K +LR L KDG + +L RYN P T
Sbjct: 303 NLKKLEGGFAAAVKFSGKPTEDVVQAKENELRSSLSKDGLRAKKGCMLARYNDPGRTWNF 362
Query: 191 FRTNEVLIPVE 201
NEV+I +E
Sbjct: 363 IMRNEVIIWLE 373
>At1g17100 SOUL-like protein
Length = 232
Score = 34.7 bits (78), Expect = 0.035
Identities = 49/206 (23%), Positives = 76/206 (36%), Gaps = 37/206 (17%)
Query: 11 ETPKYEILKTTQNYVIRKYAPSLVAEITYDPSTFKGDKD-GGFKVLVDYIGIFGKPQNTK 69
E P YE++ + Y IR+Y ++ P D F L YI + +N
Sbjct: 46 ECPSYELVHSGNGYEIRRYNNTVWVSTEPIPDISLVDATRTAFFQLFAYI----QGKNEY 101
Query: 70 TEKISMTTPVITKENKSSSEKIAMTVPVVTNEKNKMVTMQFTLPSMYLKVEEVPKPI-DE 128
+KI MT PVI++ V+ FT+ S Y+ + P P E
Sbjct: 102 HQKIEMTAPVISQ---------------VSPSDGPFCESSFTV-SFYVPKKNQPDPAPSE 145
Query: 129 RVVIREEGGKKYGVVTFGGVASDEVVKEKVEKLRLCL-------------EKDGFKVIGD 175
+ I++ + V F G SD+ + E+ L L E G
Sbjct: 146 NLHIQKWNSRYVAVRQFSGFVSDDSIGEQAAALDSSLKGTAWANAIAKSKEDGGVGSDSA 205
Query: 176 FLLGRYNPPAITIPMFRTNEVLIPVE 201
+ + +YN P R NE+ +P E
Sbjct: 206 YTVAQYNSPFEF--SGRVNEIWLPFE 229
>At4g04280 putative transposon protein
Length = 1104
Score = 31.6 bits (70), Expect = 0.30
Identities = 21/62 (33%), Positives = 31/62 (49%), Gaps = 3/62 (4%)
Query: 43 TFKGDKDGGF-KVLVD-YIGIFGKPQNTKTEKISMTTPVITKENKSSS-EKIAMTVPVVT 99
TF GD+D F K+ ++ +G+ G + K++ T PV KE K S E A + T
Sbjct: 7 TFNGDRDFSFWKIRIEAQLGVLGLKNSLTDFKLTKTVPVAKKEEKESEYEDDASDIKQAT 66
Query: 100 NE 101
E
Sbjct: 67 EE 68
>At5g64760 proteasome regulatory subunit-like
Length = 442
Score = 31.2 bits (69), Expect = 0.39
Identities = 23/72 (31%), Positives = 35/72 (47%), Gaps = 5/72 (6%)
Query: 112 LPSMYLKVEEVPKPIDERVVIREEGGKKYGVVTFGGVASDEVVKEKVEKLRLCLEKDGFK 171
L M K++E I E + +E V TFG +A E + +E++RLCL++ F
Sbjct: 112 LTKMLAKIKEEQGLIAEAADLMQE----VAVETFGAMAKTEKIAFILEQVRLCLDQKDF- 166
Query: 172 VIGDFLLGRYNP 183
V L + NP
Sbjct: 167 VRAQILSRKINP 178
>At3g42580 putative protein
Length = 903
Score = 30.8 bits (68), Expect = 0.50
Identities = 18/42 (42%), Positives = 22/42 (51%)
Query: 119 VEEVPKPIDERVVIREEGGKKYGVVTFGGVASDEVVKEKVEK 160
VEE PI E+ + E GKK VV G ++E V VEK
Sbjct: 616 VEEKESPIPEKDTVPEHPGKKDDVVHMGVSQTEENVAHDVEK 657
>At5g09900 26S proteasome p55 protein-like
Length = 442
Score = 30.4 bits (67), Expect = 0.66
Identities = 21/66 (31%), Positives = 33/66 (49%), Gaps = 5/66 (7%)
Query: 118 KVEEVPKPIDERVVIREEGGKKYGVVTFGGVASDEVVKEKVEKLRLCLEKDGFKVIGDFL 177
K++E I E + +E V TFG +A E + +E++RLCL++ F V L
Sbjct: 118 KIKEEQGQIAEAADLMQE----VAVETFGAMAKTEKIAFILEQVRLCLDRQDF-VRAQIL 172
Query: 178 LGRYNP 183
+ NP
Sbjct: 173 SRKINP 178
>At5g35390 putative receptor-like protein kinase
Length = 657
Score = 30.0 bits (66), Expect = 0.86
Identities = 31/140 (22%), Positives = 57/140 (40%), Gaps = 26/140 (18%)
Query: 7 RFSAETPKYE----ILKTTQNYVIRKYAPSLVAEITYDPSTFKGDKDGGFKVLVDYIGIF 62
+F+ E P++E +L + N + SL DP F+G+K G++
Sbjct: 181 QFTGEIPEFEHQLHLLNLSNNALTGPIPESLSMT---DPKVFEGNK-----------GLY 226
Query: 63 GKPQNTKTEKISMTTPVITKENKSSSEKIAMTVPVVTNEKNKMVTMQ--FTLPSMY---- 116
GKP T+ + + P ++ SS + + + + ++ + F L Y
Sbjct: 227 GKPLETECDSPYIEHPPQSEARPKSSSRGPLVITAIVAALTILIILGVIFLLNRSYKNKK 286
Query: 117 --LKVEEVPKPIDERVVIRE 134
L VE P + ++ IRE
Sbjct: 287 PRLAVETGPSSLQKKTGIRE 306
>At5g49910 heat shock protein 70 (Hsc70-7)
Length = 718
Score = 29.3 bits (64), Expect = 1.5
Identities = 24/102 (23%), Positives = 40/102 (38%), Gaps = 3/102 (2%)
Query: 63 GKPQNTKTEKISMTTPVITKENKSSSEKIAMTVP--VVTNEKNKMVTMQFTLPSMYLKVE 120
GKP + TTP + KS + V N +N +++ + +V
Sbjct: 98 GKPTIVTNAEGQRTTPSVVAYTKSKDRLVGQIAKRQAVVNPENTFFSVKRFIGRRMNEVA 157
Query: 121 EVPKPIDERVVIREEGGKKYGVVTFG-GVASDEVVKEKVEKL 161
E K + RV+ E G K G A++E+ + + KL
Sbjct: 158 EESKQVSYRVIKDENGNVKLDCPAIGKQFAAEEISAQVLRKL 199
>At5g10680 putative protein
Length = 241
Score = 28.9 bits (63), Expect = 1.9
Identities = 31/133 (23%), Positives = 55/133 (41%), Gaps = 18/133 (13%)
Query: 18 LKTTQNYVIRKYAPSLVAEITYDPSTFKGDKDGGFKVLVD--YIGIFGKPQNTK------ 69
LK + Y + P V+ + F+G + +VLVD I I+ + + K
Sbjct: 110 LKIIEEYNFESHQPEYVSVLDETSDLFRGFRQKRREVLVDSKKIEIYEEFDSKKVDDEEN 169
Query: 70 TEKISMTTPVITKENKSSSEKIAMTVPVVTNEKNKMVTMQFTLPSMYLKVEEVPKPIDER 129
+E ++ TT + N S++ V E+++ ++ ++EE+P
Sbjct: 170 SETVATTTDMNEAVNFEKSKQEEQEEAEVKEEESESSRVKEETAETKAQIEELP------ 223
Query: 130 VVIREEGGKKYGV 142
EEG KK GV
Sbjct: 224 ----EEGTKKEGV 232
>At5g67620 unknown protein
Length = 182
Score = 28.5 bits (62), Expect = 2.5
Identities = 30/107 (28%), Positives = 47/107 (43%), Gaps = 18/107 (16%)
Query: 65 PQNTKTEKI--SMTTPVITKENKSSSEKIAMTVPVVTNEKNKMVTMQFTLPSMYLKVEEV 122
P K E+I S+T I K N + +T P + NEK LP LK ++
Sbjct: 17 PAENKVERIYWSVTASDIMKSNPGHYVAVVVTSPTMKNEKG--------LP---LKQLKL 65
Query: 123 PKPIDERVVIREEGGKKYGVVTFGGVASDEVVKEKVEKLRLCLEKDG 169
+P D ++ G Y +V+F V ++ K+ V+ +L E G
Sbjct: 66 LRPDDTLLI-----GHVYRLVSFEEVLNEFATKKCVKLGKLLKEGGG 107
>At2g25560 putative DnaJ protein
Length = 656
Score = 28.5 bits (62), Expect = 2.5
Identities = 13/35 (37%), Positives = 20/35 (57%)
Query: 140 YGVVTFGGVASDEVVKEKVEKLRLCLEKDGFKVIG 174
YGV+ A DE+V+++ KL + L D K +G
Sbjct: 68 YGVLGLNPEADDEIVRKRYRKLAVMLHPDRNKSVG 102
>At5g67100 DNA polymerase alpha 1
Length = 1492
Score = 28.1 bits (61), Expect = 3.3
Identities = 46/179 (25%), Positives = 71/179 (38%), Gaps = 36/179 (20%)
Query: 6 GRFSAETPKYEILKT-TQNYVIRKYAPSLVAEIT----------YDPSTFKGDKDGGFKV 54
GRFS K + K TQ ++K P+L A T + S+FK K+
Sbjct: 113 GRFSGRLKKKKKGKEQTQQPQVKKVNPALKAAATITGEGRLSSMFTSSSFKKVKETD--- 169
Query: 55 LVDYIGIFGKPQNTKTEKISMTTPVITKENKSSSEKIAMTVPVVTNEKNKMVTMQFTLPS 114
Y GI E I+ TP + K + K+ TVPV + K+ F++ S
Sbjct: 170 KAQYEGILD-------EIIAQVTPDESDRKKHTRRKLPGTVPVTIFKNKKL----FSVAS 218
Query: 115 MYLKVEEVPKPID--------ERVVIREEGGKKYGVV---TFGGVASDEVVKEKVEKLR 162
E P P + +++EE K+ V+ T + SD V ++ K+R
Sbjct: 219 SMGMKESEPTPSTYEGDSVSMDNELMKEEDMKESEVIPSETMELLGSDIVKEDGSNKIR 277
>At5g35080 unknown protein
Length = 282
Score = 27.7 bits (60), Expect = 4.3
Identities = 22/88 (25%), Positives = 42/88 (47%), Gaps = 14/88 (15%)
Query: 99 TNEKNKMVTMQFTLPSMYLKV-EEVPKPIDERVVIREEGGKKYGVVTFGGVASDEVVKEK 157
++++N M T + LK +E+ +P+ E+ + R+EG Y E +K
Sbjct: 90 SSQQNISTVMMETQQLVKLKTPDELLQPLSEKCLFRQEGWWSY-----------EFCHQK 138
Query: 158 VEKLRLCLEKDGFKVIGDFLLGRYNPPA 185
+R +D K++ +F LG ++P A
Sbjct: 139 Y--VRQLHVEDENKIVQEFFLGTFDPEA 164
>At4g38500 unknown protein
Length = 499
Score = 27.7 bits (60), Expect = 4.3
Identities = 26/116 (22%), Positives = 48/116 (40%), Gaps = 6/116 (5%)
Query: 47 DKDGGFKVLVDYIGIFGKPQNTKTEKISMTTPVITKENKSSSEKIAMTVPVVTNEKNKMV 106
+K+G V++D IG KPQ+ + + + +++ I + VT +++
Sbjct: 91 NKEGRLYVMLDTIGFVPKPQHRCPIPVDYDPDKVLLPSDKTADTIVRNLTYVTEDESS-- 148
Query: 107 TMQFTLPSMYLKVEEVPKPIDERVVIREEGGKKYGVVTFGGVASDEVVKEKVEKLR 162
QF L + E +E ++ E G + GG + KE V+K R
Sbjct: 149 KSQFPLFGGNISWSE----REESFKLKPEMKVHCGFMPRGGAEMSSLDKEYVKKCR 200
>At3g11870 putative protein kinase
Length = 554
Score = 27.7 bits (60), Expect = 4.3
Identities = 23/80 (28%), Positives = 41/80 (50%), Gaps = 7/80 (8%)
Query: 66 QNTKTEKISMTTPVITKEN-KSSSEKIAMTVPVVTNEKNKMVTMQFTLPSMYLK---VEE 121
+N + EK S + + +EN K++SE ++VP +++N T+ LPS+ L+ E
Sbjct: 54 KNEQVEKKSDPSGGLGEENEKTNSESKVLSVP---SDQNINKTLPVMLPSLELRKYDENE 110
Query: 122 VPKPIDERVVIREEGGKKYG 141
P + R ++ KYG
Sbjct: 111 TPGKVVNRRLLVSTNEMKYG 130
>At1g13700 unknown protein
Length = 268
Score = 27.7 bits (60), Expect = 4.3
Identities = 13/48 (27%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 80 ITKENKSSSEKIAMTVPVVTNEKNKMV--TMQFTLPSMYLKVEEVPKP 125
+T +K E+I T+PV+ + N +V T + +++L ++++P P
Sbjct: 184 LTDSHKPPPERITFTLPVINSAANVVVVATGESKANAIHLAIDDLPLP 231
>At5g23300 dihydroorotate dehydrogenase
Length = 460
Score = 27.3 bits (59), Expect = 5.6
Identities = 15/34 (44%), Positives = 20/34 (58%), Gaps = 3/34 (8%)
Query: 140 YGVVTFGGVASDEVVKEKVEKLRLCLEKDGFKVI 173
Y +GG A +KE++ K CLE+DGFK I
Sbjct: 421 YTGFAYGGPALIPQIKEELVK---CLERDGFKSI 451
>At2g05790 putative beta-1,3-glucanase
Length = 473
Score = 27.3 bits (59), Expect = 5.6
Identities = 30/117 (25%), Positives = 45/117 (37%), Gaps = 9/117 (7%)
Query: 1 MGLVFGRFSAETPKYEILKTTQNYVIRKYAPSLVAEITYDPSTFKGDKDGGFKVLVDYIG 60
+G+ +GR S E P K Q +++ + V DPS K G KV VD
Sbjct: 25 IGVNYGRISDELPS--AFKVVQ--LLKSQGITRVKIFDADPSVLKALSGSGIKVTVDLPN 80
Query: 61 --IFGKPQNTKTEKISMTTPVITKENKSSSEKIAMTVPVVTNEKNKMVTMQFTLPSM 115
+F + T + V + E IA+ V + N T F +P+M
Sbjct: 81 ELLFSAAKRTSFAVSWVKRNVAAYHPSTQIESIAVGNEVFVDTHN---TTSFLIPAM 134
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.316 0.137 0.383
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,728,315
Number of Sequences: 26719
Number of extensions: 208506
Number of successful extensions: 589
Number of sequences better than 10.0: 29
Number of HSP's better than 10.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 561
Number of HSP's gapped (non-prelim): 30
length of query: 201
length of database: 11,318,596
effective HSP length: 94
effective length of query: 107
effective length of database: 8,807,010
effective search space: 942350070
effective search space used: 942350070
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (26.6 bits)
Medicago: description of AC144474.2