
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146329.15 - phase: 0
(204 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
                                                                Score  E
Sequences producing significant alignments:                     (bits) Value
At1g08010 GATA transcription factor 3, putative                   130  5e-31
At3g54810 unknown protein                                         129  1e-30
At1g08000 GATA transcription factor 3, putative                   125  2e-29
At5g66320 GATA-binding transcription factor-like protein          121  2e-28
At5g25830 GATA transcription factor - like                        121  2e-28
At3g51080 transcription factor-like protein                       121  3e-28
At3g24050 GATA transcription factor 1 (AtGATA-1)                  118  2e-27
At3g60530 GATA transcription factor 4                             116  9e-27
At4g34680 GATA transcription factor 3                             115  1e-26
At4g32890 unknown protein                                         115  2e-26
At2g45050 putative GATA-type zinc finger transcription factor     114  5e-26
At4g36240 unknown protein                                         112  1e-25
At2g28340 hypothetical protein                                    109  1e-24
At3g45170 putative protein                                        102  1e-22
At5g56860 unknown protein                                          67  6e-12
At5g26930 unknown protein                                          62  2e-10
At4g36620 transcription factor like protein                        62  3e-10
At4g26150 putative transcription factor                            62  3e-10
At3g50870 transcription factor-like protein                        62  3e-10
At2g18380 putative GATA-type zinc finger transcription factor      62  3e-10
>At1g08010 GATA transcription factor 3, putative
Length = 303
Score = 130 bits (327), Expect = 5e-31
Identities = 63/117 (53%), Positives = 82/117 (69%), Gaps = 3/117 (2%)
Query: 78 STEKFPDSQIAAKKQKLSSGESKKNKKTKAPLLAALDHNALGLVRQCTHCEATKTPQWRT 137
ST P+S+ ++ + + K + T+ N+ G+VR+CTHCE TKTPQWR
Sbjct: 176 STPGKPESECYFSSEQHAKKKRKIHLTTRTVSSTLEASNSDGIVRKCTHCETTKTPQWRE 235
Query: 138 GPEGPKTLCNACGVRYKSGRLCPEYRPAASSTFSPDLHSNSHKKILEMRVMRRKDNK 194
GP GPKTLCNACGVR++SGRL PEYRPA+S TF P +HSNSH+KI+E MRRKD++
Sbjct: 236 GPSGPKTLCNACGVRFRSGRLVPEYRPASSPTFIPAVHSNSHRKIIE---MRRKDDE 289
>At3g54810 unknown protein
Length = 322
Score = 129 bits (323), Expect = 1e-30
Identities = 78/193 (40%), Positives = 98/193 (50%), Gaps = 34/193 (17%)
Query: 18 VLPKSNSSPTCEKTTVR-------RTRSKRPRLATFSSHHSTMQLISSTSSFVGENMQDS 70
VL S+SS TT R R+KRPR + ++D+
Sbjct: 123 VLESSSSSSQTTNTTSLVLPGKHGRPRTKRPRPPVQDK----------------DRVKDN 166
Query: 71 VISNKGASTEKFPDSQIAAKKQKLSSGESKKNKKTKAPLLAALDHNALGL---------- 120
V + P ++ + ++ + KK K T + + +D G
Sbjct: 167 VCGGDSRLIIRIPKQFLSDHNKMINKKKKKKAKITSSSSSSGIDLEVNGNNVDSYSSEQY 226
Query: 121 -VRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSGRLCPEYRPAASSTFSPDLHSNSH 179
+R+C HCE TKTPQWR GP GPKTLCNACGVRYKSGRL PEYRPAAS TF+P LHSNSH
Sbjct: 227 PLRKCMHCEVTKTPQWRLGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFTPALHSNSH 286
Query: 180 KKILEMRVMRRKD 192
KK+ EMR R D
Sbjct: 287 KKVAEMRNKRCSD 299
>At1g08000 GATA transcription factor 3, putative
Length = 308
Score = 125 bits (314), Expect = 2e-29
Identities = 61/105 (58%), Positives = 73/105 (69%), Gaps = 9/105 (8%)
Query: 96 SGESKKNKKTKAPLLAALDHNAL------GLVRQCTHCEATKTPQWRTGPEGPKTLCNAC 149
S E KK K L+ + + L G+VR CTHCE TPQWR GP GPKTLCNAC
Sbjct: 186 SSEQHAKKKRKIHLITHTESSTLESSKSDGIVRICTHCETITTPQWRQGPSGPKTLCNAC 245
Query: 150 GVRYKSGRLCPEYRPAASSTFSPDLHSNSHKKILEMRVMRRKDNK 194
GVR+KSGRL PEYRPA+S TF P +HSNSH+KI+E MR+KD++
Sbjct: 246 GVRFKSGRLVPEYRPASSPTFIPSVHSNSHRKIIE---MRKKDDE 287
>At5g66320 GATA-binding transcription factor-like protein
Length = 339
Score = 121 bits (304), Expect = 2e-28
Identities = 76/195 (38%), Positives = 108/195 (54%), Gaps = 25/195 (12%)
Query: 6 QHPSSSVNKEDFVLPKSNSSPTCEKTTV-RRTRSKRPRLATFSSHHSTMQLISSTSSFVG 64
+HP ++V +E TC K+ V + RSKR R + + + S+SS
Sbjct: 148 KHPVTAVTEE-----------TCFKSPVPAKARSKRNR------NGLKVWSLGSSSSSGP 190
Query: 65 ENMQDSVISNKGASTEKFPDSQIAAKKQKLSSGES----KKNKKTKAPLLAALDHNALGL 120
+ + S+ G S+ F +++ + + + E KK+KK A + + + L
Sbjct: 191 SSSGSTSSSSSGPSSPWFSGAELL---EPVVTSERPPFPKKHKKRSAESVFSGELQQLQP 247
Query: 121 VRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSGRLCPEYRPAASSTFSPDLHSNSHK 180
R+C+HC KTPQWR GP G KTLCNACGVRYKSGRL PEYRPA S TFS +LHSN H+
Sbjct: 248 QRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHR 307
Query: 181 KILEMRVMRRKDNKN 195
K++EMR + + N
Sbjct: 308 KVIEMRRKKEPTSDN 322
>At5g25830 GATA transcription factor - like
Length = 331
Score = 121 bits (304), Expect = 2e-28
Identities = 76/172 (44%), Positives = 99/172 (57%), Gaps = 19/172 (11%)
Query: 22 SNSSP--TCEKTTVRRTRSKRPRLATFSSHHSTMQLISST---SSFVGENMQDSVISNKG 76
++SSP T + + + RSKR R A + + ++ L+ T S F GE + + S +
Sbjct: 124 NSSSPIFTTDVSVPAKARSKRSRAA--ACNWASRGLLKETFYDSPFTGETI---LSSQQH 178
Query: 77 ASTEKFPDSQIA--AKKQKLSSGESKKNKKTKAPLLAALDHNALGLVRQCTHCEATKTPQ 134
S P +A KKQ + G +K K +P + R+C HC KTPQ
Sbjct: 179 LSPPTSPPLLMAPLGKKQAVDGGHRRK-KDVSSPESGGAEE------RRCLHCATDKTPQ 231
Query: 135 WRTGPEGPKTLCNACGVRYKSGRLCPEYRPAASSTFSPDLHSNSHKKILEMR 186
WRTGP GPKTLCNACGVRYKSGRL PEYRPAAS TF HSNSH+K++E+R
Sbjct: 232 WRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELR 283
>At3g51080 transcription factor-like protein
Length = 312
Score = 121 bits (303), Expect = 3e-28
Identities = 76/185 (41%), Positives = 92/185 (49%), Gaps = 15/185 (8%)
Query: 13 NKEDFVLPKSNSSPTCEKTTVRRTRSKRPRLATFSSHHSTMQLISSTSSFVGENMQDSVI 72
N+ V P + + +TR KR R H + L S+SS
Sbjct: 121 NRRHLVQPVKEETCFKSQHPAVKTRPKRARTGVRVWSHGSQSLTDSSSS----------- 169
Query: 73 SNKGASTEKFPDSQI-AAKKQKLSSGESKKNKKTKAPLLAALDHNALGL-VRQCTHCEAT 130
S +S+ P S + A Q L +K KK K A RQC HC
Sbjct: 170 STTSSSSSPRPSSPLWLASGQFLDEPMTKTQKKKKVWKNAGQTQTQTQTQTRQCGHCGVQ 229
Query: 131 KTPQWRTGPEGPKTLCNACGVRYKSGRLCPEYRPAASSTFSPDLHSNSHKKILEMRVMRR 190
KTPQWR GP G KTLCNACGVRYKSGRL PEYRPA S TFS +LHSN H K++EMR R+
Sbjct: 230 KTPQWRAGPLGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHSKVIEMR--RK 287
Query: 191 KDNKN 195
K+ +
Sbjct: 288 KETSD 292
>At3g24050 GATA transcription factor 1 (AtGATA-1)
Length = 274
Score = 118 bits (296), Expect = 2e-27
Identities = 57/84 (67%), Positives = 65/84 (76%), Gaps = 3/84 (3%)
Query: 103 KKTKAPLLAALDHNALGLVRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSGRLCPEY 162
+K K +AA AL + R+C HC A KTPQWR GP GPKTLCNACGVRYKSGRL PEY
Sbjct: 178 QKKKTMTVAAA---ALIMGRKCQHCGAEKTPQWRAGPAGPKTLCNACGVRYKSGRLVPEY 234
Query: 163 RPAASSTFSPDLHSNSHKKILEMR 186
RPA S TF+ +LHSNSH+KI+EMR
Sbjct: 235 RPANSPTFTAELHSNSHRKIVEMR 258
>At3g60530 GATA transcription factor 4
Length = 240
Score = 116 bits (290), Expect = 9e-27
Identities = 52/75 (69%), Positives = 62/75 (82%), Gaps = 2/75 (2%)
Query: 122 RQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSGRLCPEYRPAASSTFSPDLHSNSHKK 181
R+CTHC + KTPQWRTGP GPKTLCNACGVRYKSGRL PEYRPA+S TF HSNSH+K
Sbjct: 158 RRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTQHSNSHRK 217
Query: 182 ILEMRVMRRKDNKNS 196
++E+R R+K+ + S
Sbjct: 218 VMELR--RQKEQQES 230
>At4g34680 GATA transcription factor 3
Length = 269
Score = 115 bits (289), Expect = 1e-26
Identities = 48/65 (73%), Positives = 56/65 (85%)
Query: 122 RQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSGRLCPEYRPAASSTFSPDLHSNSHKK 181
R+C+HC TPQWRTGP GPKTLCNACGVR+KSGRLCPEYRPA S TFS ++HSN H+K
Sbjct: 180 RRCSHCGTNNTPQWRTGPVGPKTLCNACGVRFKSGRLCPEYRPADSPTFSNEIHSNLHRK 239
Query: 182 ILEMR 186
+LE+R
Sbjct: 240 VLELR 244
>At4g32890 unknown protein
Length = 308
Score = 115 bits (288), Expect = 2e-26
Identities = 57/104 (54%), Positives = 72/104 (68%), Gaps = 5/104 (4%)
Query: 100 KKNKKTKAPLLAA---LDHNALGLVRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSG 156
KK ++ K A +D G R+C HC KTPQWRTGP GPKTLCNACGVRYKSG
Sbjct: 172 KKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTLCNACGVRYKSG 231
Query: 157 RLCPEYRPAASSTFSPDLHSNSHKKILEMRVMRRKDNKNSGILA 200
RL PEYRPA+S TF HSNSH+K++E+R R+K+ ++ +L+
Sbjct: 232 RLVPEYRPASSPTFVMARHSNSHRKVMELR--RQKEMRDEHLLS 273
>At2g45050 putative GATA-type zinc finger transcription factor
Length = 264
Score = 114 bits (284), Expect = 5e-26
Identities = 49/68 (72%), Positives = 58/68 (85%)
Query: 119 GLVRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSGRLCPEYRPAASSTFSPDLHSNS 178
G +R+CTHC + KTPQWRTGP GPKTLCNACGVR+KSGRL PEYRPA+S TF HSNS
Sbjct: 176 GGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNS 235
Query: 179 HKKILEMR 186
H+K++E+R
Sbjct: 236 HRKVMELR 243
>At4g36240 unknown protein
Length = 238
Score = 112 bits (281), Expect = 1e-25
Identities = 64/160 (40%), Positives = 92/160 (57%), Gaps = 30/160 (18%)
Query: 60 SSFVGENMQDSVISNKGASTEKFPDSQIAAKKQKLSSGESKKNKKTK---------APLL 110
S+FV ++ +S IS+ P + + ++Q + K ++T +PLL
Sbjct: 78 SNFVEDSFSESYISSDFPVN---PVASVEVRRQCVPVKPRSKRRRTNGRIWSMESPSPLL 134
Query: 111 AA------------LDHNALGLVRQ------CTHCEATKTPQWRTGPEGPKTLCNACGVR 152
+ +D + G+V+Q C+HC KTPQWR GP G KTLCNACGVR
Sbjct: 135 STAVARRKKRGRQKVDASYGGVVQQQQLRRCCSHCGVQKTPQWRMGPLGAKTLCNACGVR 194
Query: 153 YKSGRLCPEYRPAASSTFSPDLHSNSHKKILEMRVMRRKD 192
+KSGRL PEYRPA S TF+ ++HSNSH+K+LE+R+M+ D
Sbjct: 195 FKSGRLLPEYRPACSPTFTNEIHSNSHRKVLELRLMKVAD 234
>At2g28340 hypothetical protein
Length = 315
Score = 109 bits (272), Expect = 1e-24
Identities = 55/94 (58%), Positives = 63/94 (66%), Gaps = 15/94 (15%)
Query: 100 KKNKKTKAPLLAALDHNALGLVRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSGRLC 159
KK KK K+ L +CTHCE T TPQWR GP G KTLCNACG+R++SGRL
Sbjct: 205 KKRKKNKSRRL------------KCTHCETTTTPQWREGPNGRKTLCNACGIRFRSGRLV 252
Query: 160 PEYRPAASSTFSPDLHSNSHKKILEMRVMRRKDN 193
EYRPAAS TF P +HSN HKKI+ MR+ KDN
Sbjct: 253 LEYRPAASPTFIPTVHSNLHKKIIYMRM---KDN 283
>At3g45170 putative protein
Length = 204
Score = 102 bits (254), Expect = 1e-22
Identities = 42/75 (56%), Positives = 55/75 (73%)
Query: 122 RQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSGRLCPEYRPAASSTFSPDLHSNSHKK 181
+ C+HC KTP WR GP G TLCNACG+RY++GRL PEYRPA+S F P++HSN H+K
Sbjct: 115 KSCSHCGTRKTPLWREGPRGAGTLCNACGMRYRTGRLLPEYRPASSPDFKPNVHSNFHRK 174
Query: 182 ILEMRVMRRKDNKNS 196
++E+R R+ NS
Sbjct: 175 VMEIRRERKSSPPNS 189
>At5g56860 unknown protein
Length = 398
Score = 67.0 bits (162), Expect = 6e-12
Identities = 30/82 (36%), Positives = 44/82 (53%)
Query: 116 NALGLVRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSGRLCPEYRPAASSTFSPDLH 175
N G++R C+ C TKTP WR+GP GPK+LCNACG+R + R AA+ +
Sbjct: 224 NNNGVIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAMAAAAAAGDQEVAVA 283
Query: 176 SNSHKKILEMRVMRRKDNKNSG 197
+ L+ ++ +K N G
Sbjct: 284 PRVQQLPLKKKLQNKKKRSNGG 305
>At5g26930 unknown protein
Length = 120
Score = 62.4 bits (150), Expect = 2e-10
Identities = 23/39 (58%), Positives = 30/39 (75%)
Query: 119 GLVRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSGR 157
G +R C+ C+ TKTP WR GP GPK+LCNACG+R++ R
Sbjct: 23 GTIRCCSECKTTKTPMWRGGPTGPKSLCNACGIRHRKQR 61
>At4g36620 transcription factor like protein
Length = 211
Score = 61.6 bits (148), Expect = 3e-10
Identities = 22/35 (62%), Positives = 28/35 (79%)
Query: 120 LVRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYK 154
L R+C +C+ T TP WR GP GPK+LCNACG+R+K
Sbjct: 73 LARRCANCDTTSTPLWRNGPRGPKSLCNACGIRFK 107
>At4g26150 putative transcription factor
Length = 352
Score = 61.6 bits (148), Expect = 3e-10
Identities = 40/140 (28%), Positives = 71/140 (50%), Gaps = 16/140 (11%)
Query: 68 QDSVISNKGASTEKFPDSQIAAKKQK---LSSGESKK---NKKTKAPLLAALDHNALG-- 119
Q + G ++ K+ S++ K+K +++ +S K N + L + N
Sbjct: 136 QSPIKDMTGTNSLKWISSKVRLMKKKKAIITTSDSSKQHTNNDQSSNLSNSERQNGYNND 195
Query: 120 -LVRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYKSGRLCPEYRPAASSTFSPDLHSNS 178
++R C+ C TKTP WR+GP GPK+LCNACG+R + R AA +T + S
Sbjct: 196 CVIRICSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKA------RRAAMATATATAVSGV 249
Query: 179 HKKILEMRVMRRKDNKNSGI 198
+++ + M+ K+ ++G+
Sbjct: 250 SPPVMKKK-MQNKNKISNGV 268
>At3g50870 transcription factor-like protein
Length = 295
Score = 61.6 bits (148), Expect = 3e-10
Identities = 22/35 (62%), Positives = 28/35 (79%)
Query: 120 LVRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYK 154
L R+C +C+ T TP WR GP GPK+LCNACG+R+K
Sbjct: 150 LARRCANCDTTSTPLWRNGPRGPKSLCNACGIRFK 184
>At2g18380 putative GATA-type zinc finger transcription factor
Length = 207
Score = 61.6 bits (148), Expect = 3e-10
Identities = 26/53 (49%), Positives = 35/53 (65%), Gaps = 3/53 (5%)
Query: 102 NKKTKAPLLAALDHNALGLVRQCTHCEATKTPQWRTGPEGPKTLCNACGVRYK 154
N KT + + H+ L R+C C+ T TP WR GP+GPK+LCNACG+R+K
Sbjct: 74 NAKTSSYKKGGVAHS---LPRRCASCDTTSTPLWRNGPKGPKSLCNACGIRFK 123
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.311 0.124 0.353
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,492,527
Number of Sequences: 26719
Number of extensions: 176944
Number of successful extensions: 632
Number of sequences better than 10.0: 65
Number of HSP's better than 10.0 without gapping: 36
Number of HSP's successfully gapped in prelim test: 29
Number of HSP's that attempted gapping in prelim test: 585
Number of HSP's gapped (non-prelim): 76
length of query: 204
length of database: 11,318,596
effective HSP length: 95
effective length of query: 109
effective length of database: 8,780,291
effective search space: 957051719
effective search space used: 957051719
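The effective lengths reported above can be reproduced with simple length-adjustment arithmetic: subtract the effective HSP length once from the query and once per database sequence, then multiply the two results. The sketch below assumes this is how the figures were derived (the report does not state the formula), and the function name is hypothetical; the inputs are copied from this report.

```python
# Values copied from the statistics block of this report.
QUERY_LENGTH = 204
DATABASE_LENGTH = 11_318_596
NUM_SEQUENCES = 26_719
EFFECTIVE_HSP_LENGTH = 95


def effective_search_space(query_len, db_len, n_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumed formula: the HSP length is subtracted once from the query and
    once for every sequence in the database.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - n_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DATABASE_LENGTH, NUM_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(eff_query, eff_db, space)
```

With the inputs above, the three values agree with the "effective length of query", "effective length of database" and "effective search space" lines of the report.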
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 57 (26.6 bits)
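The bit values in parentheses above, and the bit scores and E-values in the alignments, follow from the printed Lambda and K through the standard Karlin-Altschul conversions. The sketch below is illustrative only: the function names are hypothetical, and the pairing of each cutoff with the ungapped or gapped parameter set is inferred from which pair reproduces the printed numbers, not stated in the report. Because Lambda and K are printed in rounded form, alignment-level results agree only approximately.

```python
import math

# Parameters copied from this report.
UNGAPPED = {"lam": 0.311, "K": 0.124}
GAPPED = {"lam": 0.267, "K": 0.0410}
SEARCH_SPACE = 957_051_719  # "effective search space" line


def bit_score(raw, lam, K):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(K)) / math.log(2)


def dropoff_bits(x, lam):
    """Convert an X-dropoff value to bits: lambda * X / ln 2 (no K term)."""
    return lam * x / math.log(2)


def expect(raw, lam, K, search_space):
    """E-value: search space * 2 ** (-bit score)."""
    return search_space * 2.0 ** (-bit_score(raw, lam, K))


# Cutoffs: S1 and X1 with ungapped parameters; X2, X3, S2 with gapped ones
# (assumed pairing, chosen because it matches the printed values).
print(round(bit_score(42, **UNGAPPED), 1))         # S1
print(round(bit_score(57, **GAPPED), 1))           # S2
print(round(dropoff_bits(16, UNGAPPED["lam"]), 1))  # X1
print(round(dropoff_bits(38, GAPPED["lam"]), 1))    # X2
print(round(dropoff_bits(64, GAPPED["lam"]), 1))    # X3

# Top hit At1g08010: raw score 327, printed as 130 bits, Expect = 5e-31.
top_bits = bit_score(327, **GAPPED)
top_e = expect(327, GAPPED["lam"], GAPPED["K"], SEARCH_SPACE)
print(top_bits, top_e)
```

For the top hit this gives a bit score between 130 and 131 and an E-value of the same order as the printed 5e-31; the small difference is expected from the rounded parameters.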