
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC140721.4 + phase: 0
(101 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At2g40170 ABA-regulated gene (ATEM6) 149 2e-37
At3g51810 embryonic abundant protein AtEm1 123 1e-29
At5g07530 glycine-rich protein atGRP-7 36 0.004
At2g05580 unknown protein 34 0.014
At3g05220 unknown protein 33 0.024
At1g21530 amp-binding protein, putative 33 0.024
At5g20780 putative protein 32 0.070
At5g28630 unknown protein 30 0.16
At3g50370 putative protein 30 0.16
At5g48560 putative bHLH transcription factor (bHLH078) 30 0.20
At3g26400 eukaryotic initiation factor 4B (EIF4B7) 30 0.26
At2g42560 putative seed maturation protein 30 0.26
At2g14910 unknown protein 30 0.26
At1g15830 hypothetical protein 30 0.26
At5g41520 unknown protein 29 0.35
At5g53460 NADH-dependent glutamate synthase 29 0.45
At4g02510 chloroplast protein import component Toc159-like 29 0.45
At3g28780 histone-H4-like protein 29 0.45
At2g25670 unknown protein 29 0.45
At2g14210 pseudogene 29 0.45
>At2g40170 ABA-regulated gene (ATEM6)
Length = 92
Score = 149 bits (377), Expect = 2e-37
Identities = 69/89 (77%), Positives = 83/89 (92%)
Query: 3 SKQQNRQELEEKAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGR 62
+ QQ +++L+E+AK+GETVVPGGTGGKS EAQ+HLAEGRS+GGQTRKEQLGTEGYQ+MGR
Sbjct: 2 ASQQEKKQLDERAKKGETVVPGGTGGKSFEAQQHLAEGRSRGGQTRKEQLGTEGYQQMGR 61
Query: 63 KGGLSTMEKSGGERAEEEGIDIDESKFKT 91
KGGLST +K GGE AEEEG++IDESKF+T
Sbjct: 62 KGGLSTGDKPGGEHAEEEGVEIDESKFRT 90
>At3g51810 embryonic abundant protein AtEm1
Length = 152
Score = 123 bits (309), Expect = 1e-29
Identities = 62/81 (76%), Positives = 69/81 (84%)
Query: 1 MASKQQNRQELEEKAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEM 60
MASKQ +R+EL+EKAKQGETVVPGGTGG SLEAQEHLAEGRSKGGQTRKEQLG EGYQE+
Sbjct: 1 MASKQLSREELDEKAKQGETVVPGGTGGHSLEAQEHLAEGRSKGGQTRKEQLGHEGYQEI 60
Query: 61 GRKGGLSTMEKSGGERAEEEG 81
G KGG + E+ G E +E G
Sbjct: 61 GHKGGEARKEQLGHEGYQEMG 81
Score = 92.4 bits (228), Expect = 3e-20
Identities = 47/66 (71%), Positives = 50/66 (75%)
Query: 24 GGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAEEEGID 83
GG K E E KGG+ RKEQLG EGY+EMGRKGGLSTMEKSGGERAEEEGI+
Sbjct: 84 GGEARKEQLGHEGYQEMGHKGGEARKEQLGHEGYKEMGRKGGLSTMEKSGGERAEEEGIE 143
Query: 84 IDESKF 89
IDESKF
Sbjct: 144 IDESKF 149
>At5g07530 glycine-rich protein atGRP-7
Length = 543
Score = 35.8 bits (81), Expect = 0.004
Identities = 26/88 (29%), Positives = 42/88 (47%), Gaps = 6/88 (6%)
Query: 14 KAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSG 73
KAK G+ G S E G S GG ++ + ++ ++G+K +S G
Sbjct: 293 KAKLGKKKGMSGGMSGSEEGMSGSEGGMSSGGGSKSKSKKSKLKAKLGKKKSMS-----G 347
Query: 74 GERAEEEGIDIDESKFKTGGGGGRSQNK 101
G EEG+ E +GGGGG+S+++
Sbjct: 348 GMSGSEEGMSGSEGGM-SGGGGGKSKSR 374
Score = 33.1 bits (74), Expect = 0.024
Identities = 21/75 (28%), Positives = 33/75 (44%), Gaps = 5/75 (6%)
Query: 27 GGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAEEEGIDIDE 86
G +S E G S GG ++ + ++ ++G+K G+ SGG EEG+ E
Sbjct: 263 GSESEEGMSGSEGGMSGGGGSKSKSKKSKLKAKLGKKKGM-----SGGMSGSEEGMSGSE 317
Query: 87 SKFKTGGGGGRSQNK 101
+GGG K
Sbjct: 318 GGMSSGGGSKSKSKK 332
Score = 31.2 bits (69), Expect = 0.091
Identities = 28/99 (28%), Positives = 36/99 (36%), Gaps = 16/99 (16%)
Query: 12 EEKAKQGETVVPGGTGGKSLEAQEHLAEGRSK------------GGQTRKEQLGTEGYQE 59
EE E + GG GGKS + L K GG +R E G
Sbjct: 353 EEGMSGSEGGMSGGGGGKSKSRKSKLKANLGKKKCMSGGMSGSEGGMSRSEG----GISG 408
Query: 60 MGRKGGLSTMEKSGGERAEEEGIDIDESKFKTGGGGGRS 98
G GG + K GG + G + + +G GGG S
Sbjct: 409 GGMSGGSGSKHKIGGGKHGGLGGKFGKKRGMSGSGGGMS 447
Score = 24.6 bits (52), Expect = 8.5
Identities = 21/72 (29%), Positives = 37/72 (51%), Gaps = 14/72 (19%)
Query: 29 KSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAEEEGIDIDESK 88
K L+++ +G+S GG+++ G KGG S E+ G + +EG+ E
Sbjct: 188 KMLKSKFGGKKGKSGGGKSK-----------FGGKGGKSEGEE--GMSSGDEGMSGSEGG 234
Query: 89 FKTGGGGGRSQN 100
+GG GG+S++
Sbjct: 235 M-SGGEGGKSKS 245
>At2g05580 unknown protein
Length = 302
Score = 33.9 bits (76), Expect = 0.014
Identities = 24/78 (30%), Positives = 37/78 (46%), Gaps = 11/78 (14%)
Query: 24 GGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAEEEGID 83
GG GG+ +G GGQ ++ G +G Q+ G GG + GG+ ++ G
Sbjct: 146 GGQGGQ---------KGGGGGGQGGQKGGGGQGGQKGG--GGQGGQKGGGGQGGQKGGGG 194
Query: 84 IDESKFKTGGGGGRSQNK 101
+ K GGGGG+ +K
Sbjct: 195 RGQGGMKGGGGGGQGGHK 212
Score = 32.3 bits (72), Expect = 0.041
Identities = 21/62 (33%), Positives = 31/62 (49%), Gaps = 1/62 (1%)
Query: 40 GRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAEEEGIDIDESKFKTGGGGGRSQ 99
GR +GGQ + G +G Q+ G GG + GG + ++G + K GGGGG+
Sbjct: 81 GRGQGGQ-KGGGGGGQGGQKGGGGGGQGGQKGGGGGQGGQKGGGGGQGGQKGGGGGGQGG 139
Query: 100 NK 101
K
Sbjct: 140 QK 141
Score = 31.2 bits (69), Expect = 0.091
Identities = 27/92 (29%), Positives = 40/92 (43%), Gaps = 7/92 (7%)
Query: 10 ELEEKAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTM 69
E EE++K + + G GG+ + GG + Q G G Q G+KGG
Sbjct: 68 EEEEESKNNQGGIGRGQGGQKGGGGGGQGGQKGGGGGGQGGQKGGGGGQG-GQKGG---- 122
Query: 70 EKSGGERAEEEGIDIDESKFKTGGGGGRSQNK 101
GG+ ++ G + K GGGGG+ K
Sbjct: 123 --GGGQGGQKGGGGGGQGGQKGGGGGGQGGQK 152
>At3g05220 unknown protein
Length = 541
Score = 33.1 bits (74), Expect = 0.024
Identities = 33/100 (33%), Positives = 40/100 (40%), Gaps = 21/100 (21%)
Query: 16 KQGETVVPGGTGG--------KSLEAQEHLAEGRSKGGQTRKEQL-----GTEGYQEMGR 62
K G+T G GG KS E + G+ G K + G + Y +
Sbjct: 284 KTGKTDAKSGGGGLLGFFKKGKSGNGDEKKSAGKKDGHGGNKVKSHGGGGGVQHYDSGPK 343
Query: 63 KGGLSTMEKSGGERAEEEGIDIDE--SKFKTGGGGGRSQN 100
KGG T K GG G+DIDE K GGGGG N
Sbjct: 344 KGGGGT--KGGGHG----GLDIDELMKHSKGGGGGGNKGN 377
Score = 27.3 bits (59), Expect = 1.3
Identities = 12/28 (42%), Positives = 16/28 (56%)
Query: 73 GGERAEEEGIDIDESKFKTGGGGGRSQN 100
GG + G + ++ K K GGGGG QN
Sbjct: 65 GGNNKPKGGKESNQVKGKAGGGGGGGQN 92
>At1g21530 amp-binding protein, putative
Length = 776
Score = 33.1 bits (74), Expect = 0.024
Identities = 20/59 (33%), Positives = 31/59 (51%), Gaps = 3/59 (5%)
Query: 41 RSKGGQTRKEQLGTEGYQEMGRKGGLSTME---KSGGERAEEEGIDIDESKFKTGGGGG 96
R K + + GT+ + GG +E +SGG A ++ ++IDE K+GGGGG
Sbjct: 657 RKKKALSSDDLNGTKVLEATEGTGGEEVLEAASESGGLEAAQKRVNIDEHGRKSGGGGG 715
>At5g20780 putative protein
Length = 818
Score = 31.6 bits (70), Expect = 0.070
Identities = 24/73 (32%), Positives = 33/73 (44%), Gaps = 12/73 (16%)
Query: 27 GGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAEEEGIDIDE 86
GG+ LEA E+L +G+ KG + K+ E G +KSG EE D D
Sbjct: 203 GGRELEAVENLFDGQDKGTENCKD-------DEDSSDGSSVYSQKSG-----EEDDDFDG 250
Query: 87 SKFKTGGGGGRSQ 99
K + G G S+
Sbjct: 251 DKIEDMGNKGDSK 263
>At5g28630 unknown protein
Length = 148
Score = 30.4 bits (67), Expect = 0.16
Identities = 25/100 (25%), Positives = 46/100 (46%), Gaps = 14/100 (14%)
Query: 3 SKQQNRQELEEKAKQGETVVPGGTG-GKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMG 61
SK + + +++ + E + G G + ++H KGG ++++ G G +E G
Sbjct: 44 SKDEKDGDKKKEGSKREKIAAAMVGLGATFMKKKH------KGGGKKEKRGGGGGKEEEG 97
Query: 62 RKGGLSTMEKSGGERAEEEGIDIDESKFKTGGGGGRSQNK 101
GG + GE EE +E + + GGGGG + +
Sbjct: 98 --GG-----EEEGEEEEESSSSEEEEEEEEGGGGGGDEEE 130
>At3g50370 putative protein
Length = 2152
Score = 30.4 bits (67), Expect = 0.16
Identities = 14/56 (25%), Positives = 22/56 (39%)
Query: 39 EGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAEEEGIDIDESKFKTGGG 94
+G + G + + + + GG GG E E +D S F +GGG
Sbjct: 15 DGNKYASVNLNKSYGYQSHHQYNQSGGYGRGRGGGGYAVEHERVDSSGSSFHSGGG 70
>At5g48560 putative bHLH transcription factor (bHLH078)
Length = 498
Score = 30.0 bits (66), Expect = 0.20
Identities = 25/90 (27%), Positives = 39/90 (42%), Gaps = 10/90 (11%)
Query: 14 KAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLG---TEGYQEMGRKGGLSTME 70
KA V PGG + ++ + +G+SK ++ ++ G KGG + E
Sbjct: 205 KALVSPEVTPGGEFSRK---RKSVPKGKSKENPISTASPSPSFSKTAEKNGGKGGSKSSE 261
Query: 71 KSGGERAEEEGIDIDESKFKTGGGGGRSQN 100
+ GG+R EE D +E G G G N
Sbjct: 262 EKGGKRRREEEDDEEEE----GEGEGNKSN 287
>At3g26400 eukaryotic initiation factor 4B (EIF4B7)
Length = 532
Score = 29.6 bits (65), Expect = 0.26
Identities = 29/95 (30%), Positives = 34/95 (35%), Gaps = 7/95 (7%)
Query: 3 SKQQNRQELEEKAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGR 62
S++ + L E G PGG L QE L T Q E Q
Sbjct: 54 SRKMKKMSLSEFTT-GAYTAPGGRNSVGLTQQEILQL------PTGPRQRSEEEMQPGRL 106
Query: 63 KGGLSTMEKSGGERAEEEGIDIDESKFKTGGGGGR 97
GG S+ G R + D D S GGGGGR
Sbjct: 107 GGGFSSYGGRSGGRIGRDRDDSDGSWSGGGGGGGR 141
>At2g42560 putative seed maturation protein
Length = 635
Score = 29.6 bits (65), Expect = 0.26
Identities = 19/60 (31%), Positives = 30/60 (49%), Gaps = 11/60 (18%)
Query: 24 GGTGGKSL---------EAQEHLAEGRSKGGQ--TRKEQLGTEGYQEMGRKGGLSTMEKS 72
GG GG+S+ +A+E + EG K G + K Q +E E G++ G T E++
Sbjct: 220 GGVGGRSVKDTVAEKGQQAKESVGEGAQKAGSATSEKAQRASEYATEKGKEAGNMTAEQA 279
>At2g14910 unknown protein
Length = 386
Score = 29.6 bits (65), Expect = 0.26
Identities = 23/79 (29%), Positives = 37/79 (46%), Gaps = 11/79 (13%)
Query: 1 MASKQQNRQELEEKAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEM 60
++SK+ +R + ET+ G G S EAQE++ +S+ +KE QEM
Sbjct: 191 VSSKRDSRTQ-----NLSETIDEEGLGRVSSEAQEYILRLQSQLSSVKKE------LQEM 239
Query: 61 GRKGGLSTMEKSGGERAEE 79
RK M++ GE +
Sbjct: 240 RRKNAALQMQQFVGEEKND 258
>At1g15830 hypothetical protein
Length = 483
Score = 29.6 bits (65), Expect = 0.26
Identities = 27/95 (28%), Positives = 35/95 (36%), Gaps = 17/95 (17%)
Query: 19 ETVVPG---------GTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKG----- 64
ETV PG G GG+ E G GG+ + T+G G KG
Sbjct: 266 ETVPPGRGGGGDKTNGRGGEGREEDNGGGRGAEGGGRGSTGEGVTDGGGRTGNKGGNGGS 325
Query: 65 ---GLSTMEKSGGERAEEEGIDIDESKFKTGGGGG 96
G+ T +GG E G + + GGG G
Sbjct: 326 IKIGVGTNGITGGTGGGEAGAGMQVMQGWGGGGSG 360
>At5g41520 unknown protein
Length = 180
Score = 29.3 bits (64), Expect = 0.35
Identities = 19/66 (28%), Positives = 26/66 (38%)
Query: 34 QEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAEEEGIDIDESKFKTGG 93
Q+ L GG + +G + G + G KSGGE ++ G D GG
Sbjct: 96 QKPLGRPFGGGGDRPRGPPRGDGERRFGDRDGYRGGPKSGGEYGDKAGAPADYQPGFRGG 155
Query: 94 GGGRSQ 99
GG Q
Sbjct: 156 AGGARQ 161
>At5g53460 NADH-dependent glutamate synthase
Length = 2216
Score = 28.9 bits (63), Expect = 0.45
Identities = 20/83 (24%), Positives = 39/83 (46%), Gaps = 6/83 (7%)
Query: 1 MASKQQNRQELEEKAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEM 60
M ++ ++Q +E +++ + T K LE ++ AE ++ + KE++ G
Sbjct: 1644 MKHEEVSKQAIERASEEADE-----TEEKELEEKDAFAELKNMAAASSKEEMSGNGVAAE 1698
Query: 61 GRKGGLSTMEKSGGERA-EEEGI 82
R + K+GG A E EG+
Sbjct: 1699 ARPSKVDNAVKNGGFIAYEREGV 1721
>At4g02510 chloroplast protein import component Toc159-like
Length = 1503
Score = 28.9 bits (63), Expect = 0.45
Identities = 19/83 (22%), Positives = 38/83 (44%), Gaps = 7/83 (8%)
Query: 8 RQELEEKAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGT----EGYQEMGRK 63
++++E+ GE+ + G+ ++ E SK +E +GT EG E+
Sbjct: 144 KEDVEDIKDDGESKIENGSVDVDVKQASTDGESESKVKDVEEEDVGTKKDDEGESEL--- 200
Query: 64 GGLSTMEKSGGERAEEEGIDIDE 86
GG ++ EEEG+++ +
Sbjct: 201 GGKVDVDDKSDNVIEEEGVELTD 223
>At3g28780 histone-H4-like protein
Length = 614
Score = 28.9 bits (63), Expect = 0.45
Identities = 18/59 (30%), Positives = 27/59 (45%), Gaps = 6/59 (10%)
Query: 27 GGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEM------GRKGGLSTMEKSGGERAEE 79
GG S A E EG + GGQ+ Q G+ YQ + G S++ S E++ +
Sbjct: 550 GGPSGSASESSMEGGTFGGQSMGGQAGSASYQSTNYQKTHSKSAGKSSVSHSSEEKSSD 608
Score = 25.4 bits (54), Expect = 5.0
Identities = 20/78 (25%), Positives = 25/78 (31%), Gaps = 1/78 (1%)
Query: 18 GETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERA 77
GE+ G G S E + G + GG T G G + GG A
Sbjct: 439 GESTSSGVASGGST-GSESASAGAASGGSTEANGGAAAGGSTEAGSGTSTETSSMGGGSA 497
Query: 78 EEEGIDIDESKFKTGGGG 95
G+ S T GG
Sbjct: 498 AAGGVSESSSGGSTAAGG 515
>At2g25670 unknown protein
Length = 318
Score = 28.9 bits (63), Expect = 0.45
Identities = 25/91 (27%), Positives = 41/91 (44%), Gaps = 9/91 (9%)
Query: 1 MASKQQNRQELEEKAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRK----EQLGTEG 56
+A K+ N E ++A Q + G G E +E+ A G SK + +K ++ E
Sbjct: 180 VAPKENNGLEESQEAGQEKKEDVNGEG----EKKENAAGGESKASKKKKKKDKQKEVKES 235
Query: 57 YQEMGRKGGLSTMEKSGGERAEEEG-IDIDE 86
++ + E +G E EEE ID+ E
Sbjct: 236 QEQQANNNADAVDEAAGSEPTEEESPIDVKE 266
>At2g14210 pseudogene
Length = 234
Score = 28.9 bits (63), Expect = 0.45
Identities = 21/70 (30%), Positives = 32/70 (45%)
Query: 1 MASKQQNRQELEEKAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEM 60
+AS QQ Q L+E ++ G L+ E KG + +K+QL T +E+
Sbjct: 94 VASLQQQLQYLQECHRKLVGEELSGMNANDLQNLEDQLVTSLKGVRLKKDQLMTNEIREL 153
Query: 61 GRKGGLSTME 70
RKG + E
Sbjct: 154 NRKGQIIQKE 163
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.299 0.126 0.331
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,542,460
Number of Sequences: 26719
Number of extensions: 115569
Number of successful extensions: 278
Number of sequences better than 10.0: 87
Number of HSP's better than 10.0 without gapping: 56
Number of HSP's successfully gapped in prelim test: 31
Number of HSP's that attempted gapping in prelim test: 187
Number of HSP's gapped (non-prelim): 114
length of query: 101
length of database: 11,318,596
effective HSP length: 77
effective length of query: 24
effective length of database: 9,261,233
effective search space: 222269592
effective search space used: 222269592
T: 11
A: 40
X1: 17 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (22.0 bits)
S2: 52 (24.6 bits)
Medicago: description of AC140721.4