
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0145.4
(72 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At1g78080 putative AP2 domain containing protein (At1g78080) 83 3e-17
At1g22190 unknown protein 49 6e-07
At1g36060 putative AP2 domain containing protein RAP2.4 gi|2281633 47 2e-06
At4g39780 putative protein 37 0.001
At1g64380 AP2-containing DNA-binding protein 32 0.075
At4g00580 hypothetical protein 31 0.13
At2g01800 hypothetical protein 30 0.17
At2g22200 pseudogene 30 0.22
At5g27690 unknown protein (At5g27690) 28 0.83
At3g59990 unknown protein 28 0.83
At2g31660 importin (nuclear transport factor ) like protein 28 0.83
At1g56660 hypothetical protein 28 0.83
At5g61260 putative protein 28 1.1
At1g56100 unknown protein 27 1.9
At1g15660 unknown protein 27 1.9
At2g24830 unknown protein 27 2.4
At1g80410 putative N-terminal acetyltransferase (At1g80410) 27 2.4
At2g43500 hypothetical protein 26 3.2
At2g13930 putative retroelement pol polyprotein 26 3.2
At1g47900 mysoin-like protein 26 3.2
>At1g78080 putative AP2 domain containing protein (At1g78080)
Length = 334
Score = 82.8 bits (203), Expect = 3e-17
Identities = 43/69 (62%), Positives = 52/69 (75%), Gaps = 3/69 (4%)
Query: 1 PNLRRHQGSSVGGDFGEYKPLHSSVDAKLQAICEGLAEMQKQGKTEKKSEKPQKK--SKA 58
PNLR H GS +GGDFGEYKPLHSSVDAKL+AIC+ +AE QKQ K+ K S+K +KK S
Sbjct: 208 PNLR-HNGSHIGGDFGEYKPLHSSVDAKLEAICKSMAETQKQDKSTKSSKKREKKVSSPD 266
Query: 59 TASKVAPQE 67
+ KV +E
Sbjct: 267 LSEKVKAEE 275
>At1g22190 unknown protein
Length = 261
Score = 48.5 bits (114), Expect = 6e-07
Identities = 29/52 (55%), Positives = 38/52 (72%), Gaps = 3/52 (5%)
Query: 17 EYKPLHSSVDAKLQAICEGLAE-MQKQGKTEKKSEKPQKKSKATASKVAPQE 67
EY+PL SSVDAKL+AIC+ LAE QKQ ++ KKS +K+S A K+ P+E
Sbjct: 146 EYQPLQSSVDAKLEAICQNLAETTQKQVRSTKKSSS-RKRSSTVAVKL-PEE 195
>At1g36060 putative AP2 domain containing protein RAP2.4 gi|2281633
Length = 314
Score = 46.6 bits (109), Expect = 2e-06
Identities = 33/69 (47%), Positives = 44/69 (62%), Gaps = 6/69 (8%)
Query: 1 PNLRRHQGSSVGGDFGEYKPLHSSVDAKLQAICEGLAEMQKQ-GKTEKKSEKPQKKSKAT 59
P LR GSS D GEY P+ ++VDAKL+AI LAE + Q GKTE+ S K K + ++
Sbjct: 199 PALRYQTGSSPS-DTGEYGPIQAAVDAKLEAI---LAEPKNQPGKTERTSRKRAKAAASS 254
Query: 60 ASK-VAPQE 67
A + APQ+
Sbjct: 255 AEQPSAPQQ 263
>At4g39780 putative protein
Length = 272
Score = 37.4 bits (85), Expect = 0.001
Identities = 19/40 (47%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
Query: 1 PNLRRHQGSSVGGDFGEYKPLHSSVDAKLQAICEGLAEMQ 40
P R G GG + PLHSSVDAKLQ IC+ L + +
Sbjct: 149 PQFRHEDGYYGGGSC--FNPLHSSVDAKLQEICQSLRKTE 186
>At1g64380 AP2-containing DNA-binding protein
Length = 335
Score = 31.6 bits (70), Expect = 0.075
Identities = 21/60 (35%), Positives = 31/60 (51%), Gaps = 5/60 (8%)
Query: 1 PNLRRHQGSSVGGDFGEYKPLHSSVDAKLQAICEGLAEMQKQGKTEKKSEKPQKKSKATA 60
PNL+ GD + L ++VD K+Q+IC+ + + + KKS K K S ATA
Sbjct: 192 PNLKDPSELLGLGDSSKLIALKNAVDGKIQSICQRVRK-----ERAKKSVKVSKNSSATA 246
>At4g00580 hypothetical protein
Length = 317
Score = 30.8 bits (68), Expect = 0.13
Identities = 22/57 (38%), Positives = 31/57 (53%), Gaps = 4/57 (7%)
Query: 14 DFGEYKPLHSSVDAKLQAICEGLAEM---QKQGKTEKKSEKPQKKSKATASKVAPQE 67
D E + + V+A +AI E EM +K G E KSEKP+KK++A K +E
Sbjct: 214 DDAEKEGIIGLVNATSEAI-ENETEMSVKEKDGDEEAKSEKPKKKNRAKKVKTPTKE 269
>At2g01800 hypothetical protein
Length = 304
Score = 30.4 bits (67), Expect = 0.17
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 4/57 (7%)
Query: 14 DFGEYKPLHSSVDAKLQAICEGLAEM---QKQGKTEKKSEKPQKKSKATASKVAPQE 67
D E + + V+A +AI E EM +K G E KSEKP+KK +A K +E
Sbjct: 229 DDAEKEEIIGLVNATPEAI-ENETEMSAKEKDGDEEAKSEKPKKKKRAKKVKTPTKE 284
>At2g22200 pseudogene
Length = 261
Score = 30.0 bits (66), Expect = 0.22
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 17 EYKPLHSSVDAKLQAICEGLAEMQK 41
+ PL SSVD KLQAIC+ L + ++
Sbjct: 133 DMNPLPSSVDTKLQAICKSLRKTEE 157
>At5g27690 unknown protein (At5g27690)
Length = 352
Score = 28.1 bits (61), Expect = 0.83
Identities = 14/42 (33%), Positives = 25/42 (59%)
Query: 8 GSSVGGDFGEYKPLHSSVDAKLQAICEGLAEMQKQGKTEKKS 49
G GG+F + K + + V+ +LQ +G + +K+ K +KKS
Sbjct: 142 GGGGGGNFDQVKQVVTFVNGQLQPQGDGAPKKKKKKKKKKKS 183
>At3g59990 unknown protein
Length = 439
Score = 28.1 bits (61), Expect = 0.83
Identities = 16/54 (29%), Positives = 29/54 (53%), Gaps = 2/54 (3%)
Query: 16 GEYKPLHSSVDAKLQAICEGLAEM--QKQGKTEKKSEKPQKKSKATASKVAPQE 67
G+ + L S + KL+ +G E ++ K E ++K +KK+K+ K PQ+
Sbjct: 25 GKEEQLESELSKKLEIAEDGQEENDGEEGSKAETSTKKKKKKNKSKKKKELPQQ 78
>At2g31660 importin (nuclear transport factor ) like protein
Length = 1040
Score = 28.1 bits (61), Expect = 0.83
Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 1/50 (2%)
Query: 14 DFGEYKPLHSSVDAKLQAICEGLAEMQKQGKTEKKSEKPQKKSKATASKV 63
D ++ L+ S+D QAI G+A+ + + E + EK QKK A ++ V
Sbjct: 989 DAQRFQNLNQSLDFTYQAIANGIAQHAELRRVEIEKEK-QKKLAAASTPV 1037
>At1g56660 hypothetical protein
Length = 522
Score = 28.1 bits (61), Expect = 0.83
Identities = 15/34 (44%), Positives = 18/34 (52%)
Query: 34 EGLAEMQKQGKTEKKSEKPQKKSKATASKVAPQE 67
EG + +K K EKKSEK +K K K P E
Sbjct: 355 EGETKQKKNKKKEKKSEKGEKDVKEDKKKENPLE 388
>At5g61260 putative protein
Length = 496
Score = 27.7 bits (60), Expect = 1.1
Identities = 14/38 (36%), Positives = 20/38 (51%)
Query: 28 KLQAICEGLAEMQKQGKTEKKSEKPQKKSKATASKVAP 65
K++A+ + T K S KPQ K+ +ASKV P
Sbjct: 454 KVKALIGAFETVISLQDTNKTSHKPQSKATTSASKVRP 491
>At1g56100 unknown protein
Length = 232
Score = 26.9 bits (58), Expect = 1.9
Identities = 12/28 (42%), Positives = 16/28 (56%)
Query: 40 QKQGKTEKKSEKPQKKSKATASKVAPQE 67
+K G E KSEKP+KK + S+ E
Sbjct: 157 EKDGDEEAKSEKPKKKKEQRKSRFKKME 184
>At1g15660 unknown protein
Length = 710
Score = 26.9 bits (58), Expect = 1.9
Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 3/65 (4%)
Query: 2 NLRRHQGSSVGGDFGEYKPLHSSV--DAKLQAIC-EGLAEMQKQGKTEKKSEKPQKKSKA 58
N RR +GSS K +H D +++ + E A+ Q +GK+ ++ EK KK+
Sbjct: 559 NKRRKRGSSDSNVKKRSKTVHGETGGDKQMKTLPHESRAKKQTKGKSNEREEKKPKKTLT 618
Query: 59 TASKV 63
K+
Sbjct: 619 HEGKL 623
>At2g24830 unknown protein
Length = 497
Score = 26.6 bits (57), Expect = 2.4
Identities = 17/49 (34%), Positives = 25/49 (50%), Gaps = 1/49 (2%)
Query: 20 PLHSSVDAKLQAICEGLAEMQKQGKTEKKSEKPQKKSK-ATASKVAPQE 67
P S+D L+ I G + +KQ K + K ++ K A A+K A QE
Sbjct: 336 PAKRSLDYALEHIRNGECKSEKQKKKRSRGGKRKRGKKFAEAAKAAKQE 384
>At1g80410 putative N-terminal acetyltransferase (At1g80410)
Length = 714
Score = 26.6 bits (57), Expect = 2.4
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 35 GLAEMQKQGKTEKKSEKPQKKSKATASKVA 64
G EM K +KK K QKK++A A K A
Sbjct: 582 GEDEMSKLAPAQKKKIKKQKKAEARAKKEA 611
>At2g43500 hypothetical protein
Length = 947
Score = 26.2 bits (56), Expect = 3.2
Identities = 13/40 (32%), Positives = 21/40 (52%)
Query: 17 EYKPLHSSVDAKLQAICEGLAEMQKQGKTEKKSEKPQKKS 56
E + L S+ +Q IC L + + G T+K+ KP +S
Sbjct: 482 EQQLLLDSLSGTMQRICRTLRTVSEVGSTKKEGTKPGFRS 521
>At2g13930 putative retroelement pol polyprotein
Length = 1335
Score = 26.2 bits (56), Expect = 3.2
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 45 TEKKSEKPQKKSKATASKVAPQEGC 69
TE++ E P+K+ K A +VA E C
Sbjct: 22 TEEEEEDPEKRKKRDADEVARLERC 46
>At1g47900 mysoin-like protein
Length = 1054
Score = 26.2 bits (56), Expect = 3.2
Identities = 18/56 (32%), Positives = 31/56 (55%), Gaps = 5/56 (8%)
Query: 2 NLRRHQGSSVGGDFGEYKPLHSSVDAKLQAICEGLAEMQKQ----GKTEKKSEKPQ 53
+++RH+ +S+ + E + S + +L A E LAE Q+ GK + KS +PQ
Sbjct: 900 HIQRHRNTSLVAEDDEEADIKSKQERELSAAAEKLAECQETIFVLGK-QLKSFRPQ 954
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.316 0.131 0.384
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,540,721
Number of Sequences: 26719
Number of extensions: 49713
Number of successful extensions: 386
Number of sequences better than 10.0: 38
Number of HSP's better than 10.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 350
Number of HSP's gapped (non-prelim): 43
length of query: 72
length of database: 11,318,596
effective HSP length: 48
effective length of query: 24
effective length of database: 10,036,084
effective search space: 240866016
effective search space used: 240866016
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 52 (24.6 bits)
Lotus: description of TM0145.4