
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144388.8 - phase: 0
(160 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At1g24020 pollen allergen-like protein 64 3e-11
At5g45860 putative protein 50 5e-07
At2g40330 unknown protein 45 2e-05
At5g45870 putative protein 42 1e-04
At2g38310 unknown protein 41 2e-04
At2g26040 hypothetical protein 40 4e-04
At5g05440 unknown protein 40 7e-04
At5g28010 major latex protein homolog - like 39 0.001
At1g70880 unknown protein 35 0.017
At1g70850 putative protein 35 0.023
At1g73000 hypothetical protein 33 0.086
At1g70840 unknown protein (At1g70840) 32 0.11
At1g01360 unknown protein 32 0.11
At5g46790 unknown protein 31 0.25
At1g70890 unknown protein 31 0.25
At4g18620 putative protein 31 0.33
At1g70830 unknown protein 31 0.33
At4g01026 unknown protein 29 1.2
At5g60720 unknown protein 28 2.1
At4g17870 unknown protein 28 2.1
>At1g24020 pollen allergen-like protein
Length = 155
Score = 63.9 bits (154), Expect = 3e-11
Identities = 47/162 (29%), Positives = 76/162 (46%), Gaps = 19/162 (11%)
Query: 1 MGVTTFTHEYSSTVAPSRMFTALIIDSRNLIPKLLPQFVKDVNIIEGDGGA-GSIEQVNF 59
MG++ H +P+ F + D NL PK P K + ++ GDG A GSI + +
Sbjct: 1 MGLSGVLHVEVEVKSPAEKFWVALGDGINLFPKAFPNDYKTIQVLAGDGNAPGSIRLITY 60
Query: 60 NEGSPF-KYLKQKIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEAT-----NDGGC 113
EGSP K ++I+ +D EN Y++I G E + Y F+ T DGG
Sbjct: 61 GEGSPLVKISAERIEAVDLENKSMSYSIIGG-------EMLEYYKTFKGTITVIPKDGGS 113
Query: 114 LCKMASSY-KTIGDFDVKEEDVKEGRESTIGIYEVVESYLLE 154
L K + + KT + D D ++ + ++ ++ YLL+
Sbjct: 114 LLKWSGEFEKTAHEID----DPHVIKDFAVKNFKEIDEYLLK 151
>At5g45860 putative protein
Length = 161
Score = 50.1 bits (118), Expect = 5e-07
Identities = 22/59 (37%), Positives = 36/59 (60%)
Query: 32 PKLLPQFVKDVNIIEGDGGAGSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYTMIEGD 90
P+ QFVK N+ GDGG GS+ +V G P ++ ++++D LD E+ + ++I GD
Sbjct: 35 PQAYKQFVKTCNLSSGDGGEGSVREVTVVSGLPAEFSRERLDELDDESHVMMISIIGGD 93
>At2g40330 unknown protein
Length = 215
Score = 44.7 bits (104), Expect = 2e-05
Identities = 28/125 (22%), Positives = 59/125 (46%), Gaps = 13/125 (10%)
Query: 10 YSSTVAPSRMFTALIID------------SRNLIPKLLPQFVKDVNIIEGDGG-AGSIEQ 56
++ V PS+ F+ ++ D SR P+ FVK +++ GDG GS+ +
Sbjct: 52 HTHVVGPSQCFSVVVQDVEAPVSTVWSILSRFEHPQAYKHFVKSCHVVIGDGREVGSVRE 111
Query: 57 VNFNEGSPFKYLKQKIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGGCLCK 116
V G P + ++++++D + + ++++ GD +S+ + E +DG +
Sbjct: 112 VRVVSGLPAAFSLERLEIMDDDRHVISFSVVGGDHRLMNYKSVTTVHESEEDSDGKKRTR 171
Query: 117 MASSY 121
+ SY
Sbjct: 172 VVESY 176
>At5g45870 putative protein
Length = 159
Score = 42.4 bits (98), Expect = 1e-04
Identities = 20/59 (33%), Positives = 31/59 (51%)
Query: 32 PKLLPQFVKDVNIIEGDGGAGSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYTMIEGD 90
PK FVK + GDGG GS+ +V P + +++D LD E+ + ++I GD
Sbjct: 35 PKTFKHFVKTCKLRSGDGGEGSVREVTVVSDLPASFSLERLDELDDESHVMVISIIGGD 93
>At2g38310 unknown protein
Length = 207
Score = 41.2 bits (95), Expect = 2e-04
Identities = 19/60 (31%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 32 PKLLPQFVKDVNIIEGDG-GAGSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYTMIEGD 90
P+ F+K ++I GDG GS+ QV+ G P +++D+LD E + ++++ GD
Sbjct: 77 PQAYKHFLKSCSVIGGDGDNVGSLRQVHVVSGLPAASSTERLDILDDERHVISFSVVGGD 136
>At2g26040 hypothetical protein
Length = 190
Score = 40.4 bits (93), Expect = 4e-04
Identities = 24/104 (23%), Positives = 45/104 (43%)
Query: 32 PKLLPQFVKDVNIIEGDGGAGSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYTMIEGDP 91
P+ FVK +I GDG GS+ +V G P ++++ +D ++ + + ++ G+
Sbjct: 60 PERYKHFVKRCRLISGDGDVGSVREVTVISGLPASTSTERLEFVDDDHRVLSFRVVGGEH 119
Query: 92 LGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVKEEDVK 135
+S+ +F + G + SY EED K
Sbjct: 120 RLKNYKSVTSVNEFLNQDSGKVYTVVLESYTVDIPEGNTEEDTK 163
>At5g05440 unknown protein
Length = 203
Score = 39.7 bits (91), Expect = 7e-04
Identities = 25/99 (25%), Positives = 49/99 (49%), Gaps = 6/99 (6%)
Query: 15 APSRMFTALIIDSRNLIPKLLPQFVKDVNIIEGDG-GAGSIEQVNFNEGSPFKYLKQKID 73
AP AL+ N PK+ F++ I++GDG G + +V G P ++++
Sbjct: 68 APPESVWALVRRFDN--PKVYKNFIRQCRIVQGDGLHVGDLREVMVVSGLPAVSSTERLE 125
Query: 74 VLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGG 112
+LD+E + ++++ GD +L++ A++D G
Sbjct: 126 ILDEERHVISFSVVGGD---HRLKNYRSVTTLHASDDEG 161
>At5g28010 major latex protein homolog - like
Length = 166
Score = 38.9 bits (89), Expect = 0.001
Identities = 21/81 (25%), Positives = 44/81 (53%), Gaps = 1/81 (1%)
Query: 15 APSRMFTALIIDSRNLIPKLLPQFVKDVNIIEGDGGA-GSIEQVNFNEGSPFKYLKQKID 73
AP+ +F + + + K P+ V+ ++ +G+ G GSI N+ K K++I+
Sbjct: 27 APAAIFYHIYAGRPHHVAKATPRNVQSCDLHDGEWGTVGSIVYWNYVHEGQAKVAKERIE 86
Query: 74 VLDKENLICKYTMIEGDPLGD 94
+++ E + K+ +IEGD + +
Sbjct: 87 LVEPEKKLIKFRVIEGDVMAE 107
>At1g70880 unknown protein
Length = 159
Score = 35.0 bits (79), Expect = 0.017
Identities = 29/113 (25%), Positives = 50/113 (43%), Gaps = 7/113 (6%)
Query: 1 MGVTTFTHEYSSTVAPSRMFTALIIDSRNLIPKLLPQFVKDVNIIEGD-GGAGSIEQVNF 59
+G+ T E S+V F L++ + + P ++ + EG+ G G++ N+
Sbjct: 9 VGIVETTVELKSSV---EKFHDLLVGRPHHMSNATPSNIQSAELQEGEMGQVGAVILWNY 65
Query: 60 NEGSPFKYLKQKIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGG 112
K KQ+I+ LD E Y ++EGD L E ++ F+ T G
Sbjct: 66 VHDGEAKSAKQRIESLDPEKNRITYRVVEGDLL---KEYTSFVTTFQVTPKEG 115
>At1g70850 putative protein
Length = 316
Score = 34.7 bits (78), Expect = 0.023
Identities = 21/77 (27%), Positives = 37/77 (47%), Gaps = 1/77 (1%)
Query: 15 APSRMFTALIIDSRNLIPKLLPQFVKDVNIIEGDGGA-GSIEQVNFNEGSPFKYLKQKID 73
A + F + + + K P ++ ++ EGD G GSI N+ K K++I+
Sbjct: 177 ASAEKFHHMFAGKPHHVSKATPGNIQSCDLHEGDWGTVGSIVFWNYVHDGEAKVAKERIE 236
Query: 74 VLDKENLICKYTMIEGD 90
+D E + + +IEGD
Sbjct: 237 AVDPEKNLITFRVIEGD 253
Score = 32.7 bits (73), Expect = 0.086
Identities = 25/98 (25%), Positives = 46/98 (46%), Gaps = 3/98 (3%)
Query: 31 IPKLLPQFVKDVNIIEGDGGA-GSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYTMIEG 89
+ K P ++ ++ EGD G GSI N+ K K++I+ ++ E + + +IEG
Sbjct: 37 VSKASPGNIQSCDLHEGDWGTVGSIVFWNYVHDGEAKVAKERIEAVEPEKNLITFRVIEG 96
Query: 90 DPLGDKLESIAYEVKFEATNDG-GCLCKMASSYKTIGD 126
D L + +S ++ + G G + Y+ I D
Sbjct: 97 D-LMKEYKSFLITIQVTPKHGGPGSIVHWHLEYEKISD 133
>At1g73000 hypothetical protein
Length = 209
Score = 32.7 bits (73), Expect = 0.086
Identities = 35/149 (23%), Positives = 62/149 (41%), Gaps = 36/149 (24%)
Query: 15 APSRMFTALIIDSRNLIPKLLPQFVKDVNI-IEGDG----GAGSIEQVNFNEGSPFKYLK 69
AP+ + D N P F+K I + G+G G+I +V+ G P
Sbjct: 60 APAHAIWRFVRDFAN--PNKYKHFIKSCTIRVNGNGIKEIKVGTIREVSVVSGLPASTSV 117
Query: 70 QKIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDV 129
+ ++VLD+E I + ++ G+ + S+ ++ +F V
Sbjct: 118 EILEVLDEEKRILSFRVLGGEHRLNNYRSVT----------------------SVNEFVV 155
Query: 130 KEEDVKEGRESTIGIYEVV-ESYLLENPQ 157
E+D K+ +Y VV ESY+++ PQ
Sbjct: 156 LEKDKKK------RVYSVVLESYIVDIPQ 178
>At1g70840 unknown protein (At1g70840)
Length = 171
Score = 32.3 bits (72), Expect = 0.11
Identities = 19/63 (30%), Positives = 32/63 (50%), Gaps = 1/63 (1%)
Query: 31 IPKLLPQFVKDVNIIEGDGG-AGSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYTMIEG 89
+ K P ++ + EGD G GSI N+ K K++I+ ++ E + + +IEG
Sbjct: 48 VSKATPGKIQGCELHEGDWGKVGSIVFWNYVHDGEAKVAKERIEAVEPEKNLITFRVIEG 107
Query: 90 DPL 92
D L
Sbjct: 108 DLL 110
>At1g01360 unknown protein
Length = 187
Score = 32.3 bits (72), Expect = 0.11
Identities = 19/59 (32%), Positives = 29/59 (48%), Gaps = 1/59 (1%)
Query: 32 PKLLPQFVKDVNIIEGDGGAGSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYTMIEGD 90
P+ FV +I GD GS+ +VN G P +++++LD E I +I GD
Sbjct: 59 PQKYKPFVSRCTVI-GDPEIGSLREVNVKSGLPATTSTERLELLDDEEHILGIKIIGGD 116
>At5g46790 unknown protein
Length = 221
Score = 31.2 bits (69), Expect = 0.25
Identities = 17/76 (22%), Positives = 36/76 (47%), Gaps = 1/76 (1%)
Query: 32 PKLLPQFVKDVNIIEG-DGGAGSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYTMIEGD 90
P++ F+K N+ E + G VN G P ++++D+LD + + +++ G+
Sbjct: 82 PQIYKHFIKSCNVSEDFEMRVGCTRDVNVISGLPANTSRERLDLLDDDRRVTGFSITGGE 141
Query: 91 PLGDKLESIAYEVKFE 106
+S+ +FE
Sbjct: 142 HRLRNYKSVTTVHRFE 157
>At1g70890 unknown protein
Length = 158
Score = 31.2 bits (69), Expect = 0.25
Identities = 22/91 (24%), Positives = 43/91 (47%), Gaps = 2/91 (2%)
Query: 15 APSRMFTALIIDSRNLIPKLLPQFVKDVNIIEGDGG-AGSIEQVNFNEGSPFKYLKQKID 73
A ++ F + + + + K P + + EGD G GSI + K KI+
Sbjct: 19 ASAKKFHHMFTERPHHVSKATPDKIHGCELHEGDWGKVGSIVIWKYVHDGKLTVGKNKIE 78
Query: 74 VLDKENLICKYTMIEGDPLGDKLESIAYEVK 104
+D E + + ++EGD L ++ +S A+ ++
Sbjct: 79 AVDPEKNLITFKVLEGD-LMNEYKSFAFTLQ 108
>At4g18620 putative protein
Length = 164
Score = 30.8 bits (68), Expect = 0.33
Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 6/65 (9%)
Query: 32 PKLLPQFVKDVNIIEGDGGA------GSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYT 85
P+ +FVK + G GG GS+ V G P + ++++ LD E+ + +
Sbjct: 34 PQAYQRFVKSCTMRSGGGGGKGGEGKGSVRDVTLVSGFPADFSTERLEELDDESHVMVVS 93
Query: 86 MIEGD 90
+I G+
Sbjct: 94 IIGGN 98
>At1g70830 unknown protein
Length = 335
Score = 30.8 bits (68), Expect = 0.33
Identities = 17/61 (27%), Positives = 32/61 (51%), Gaps = 1/61 (1%)
Query: 31 IPKLLPQFVKDVNIIEGDGGA-GSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYTMIEG 89
+ K P ++ ++ EGD G GSI N+ K K++I+ ++ + + + +IEG
Sbjct: 50 VSKASPGNIQGCDLHEGDWGTVGSIVFWNYVHDGEAKVAKERIEAVEPDKNLITFRVIEG 109
Query: 90 D 90
D
Sbjct: 110 D 110
Score = 28.9 bits (63), Expect = 1.2
Identities = 18/77 (23%), Positives = 36/77 (46%), Gaps = 1/77 (1%)
Query: 15 APSRMFTALIIDSRNLIPKLLPQFVKDVNIIEGDGG-AGSIEQVNFNEGSPFKYLKQKID 73
A + F + + + K P ++ ++ EGD G GSI N+ K K++I+
Sbjct: 196 ASAEKFHHMFAGKPHHVSKASPGNIQGCDLHEGDWGQVGSIVFWNYVHDREAKVAKERIE 255
Query: 74 VLDKENLICKYTMIEGD 90
++ + + +I+GD
Sbjct: 256 AVEPNKNLITFRVIDGD 272
>At4g01026 unknown protein
Length = 211
Score = 28.9 bits (63), Expect = 1.2
Identities = 14/46 (30%), Positives = 22/46 (47%)
Query: 45 IEGDGGAGSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYTMIEGD 90
+ GD G + +VN G P ++++ LD E I +I GD
Sbjct: 73 VNGDPEIGCLREVNVKSGLPATTSTERLEQLDDEEHILGINIIGGD 118
>At5g60720 unknown protein
Length = 645
Score = 28.1 bits (61), Expect = 2.1
Identities = 21/80 (26%), Positives = 38/80 (47%), Gaps = 8/80 (10%)
Query: 31 IPKLLPQFVKDVNIIEGDGGAGSIEQVNFNEGSPFKYLKQKIDV---LDKENLIC-KYTM 86
+P+LL + D ++ DGG G +EQ+ GS K++ ++ L K + C K
Sbjct: 565 LPELLLKHATDFVVLTADGGTGEMEQL----GSLVKWVCNQLPTSGSLRKSMVDCFKNPN 620
Query: 87 IEGDPLGDKLESIAYEVKFE 106
+ +E I Y+ +F+
Sbjct: 621 SKASSSSSAVEKIPYDFEFQ 640
>At4g17870 unknown protein
Length = 191
Score = 28.1 bits (61), Expect = 2.1
Identities = 17/79 (21%), Positives = 35/79 (43%), Gaps = 1/79 (1%)
Query: 32 PKLLPQFVKDVNIIEG-DGGAGSIEQVNFNEGSPFKYLKQKIDVLDKENLICKYTMIEGD 90
P+ F+K ++ + + G V G P +++D+LD E + +++I G+
Sbjct: 55 PQTYKHFIKSCSVEQNFEMRVGCTRDVIVISGLPANTSTERLDILDDERRVTGFSIIGGE 114
Query: 91 PLGDKLESIAYEVKFEATN 109
+S+ +FE N
Sbjct: 115 HRLTNYKSVTTVHRFEKEN 133
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.315 0.137 0.386
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,799,552
Number of Sequences: 26719
Number of extensions: 168238
Number of successful extensions: 297
Number of sequences better than 10.0: 28
Number of HSP's better than 10.0 without gapping: 19
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 274
Number of HSP's gapped (non-prelim): 30
length of query: 160
length of database: 11,318,596
effective HSP length: 91
effective length of query: 69
effective length of database: 8,887,167
effective search space: 613214523
effective search space used: 613214523
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 56 (26.2 bits)
Medicago: description of AC144388.8