
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149050.15 + phase: 0
(235 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E value
At3g56880 unknown protein 145 1e-35
At2g41010 unknown protein 132 2e-31
At4g15120 unknown protein 61 6e-10
At3g22160 unknown protein 59 3e-09
At5g65170 putative protein 52 4e-07
At4g39720 putative protein 49 2e-06
At1g35830 hypothetical protein 47 9e-06
At4g20000 unknown protein 43 1e-04
At1g16190 hypothetical protein, 3' partial 35 0.045
At4g37440 unknown protein 34 0.059
At1g79280 hypothetical protein 33 0.13
At1g49890 unknown protein 33 0.13
At3g15300 unknown protein 32 0.22
At2g33780 hypothetical protein 32 0.22
At1g58100 unknown protein 32 0.22
At4g39790 putative protein 32 0.29
At1g80450 unknown protein 32 0.29
At4g20950 putative protein 32 0.38
At1g25540 hypothetical protein 32 0.38
At5g44160 unknown protein 31 0.50
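
The summary lines above each end with a bit score and an E value, preceded by a gene identifier and a free-text annotation. A minimal sketch of splitting one such line into fields follows. It assumes the line layout seen in this report (identifier first, score and E value as the last two whitespace-separated tokens); the function name `parse_hit_line` and the dictionary keys are hypothetical and not part of BLAST.

```python
def parse_hit_line(line):
    """Split one summary line into identifier, annotation, bit score and E value.

    Assumes the last two whitespace-separated tokens are the bit score and
    the E value, as in the hit list of this report.
    """
    # Split from the right so annotations containing spaces stay intact.
    description, score, evalue = line.rsplit(None, 2)
    # The first token of the description is the gene identifier.
    gene_id, _, annotation = description.partition(" ")
    return {
        "id": gene_id,
        "annotation": annotation,
        "bits": float(score),
        "evalue": float(evalue),
    }


hit = parse_hit_line("At1g16190 hypothetical protein, 3' partial 35 0.045")
print(hit["id"], hit["bits"], hit["evalue"])
```

Splitting from the right keeps annotations such as "hypothetical protein, 3' partial" in one piece regardless of how many words they contain.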
>At3g56880 unknown protein
Length = 245
Score = 145 bits (367), Expect = 1e-35
Identities = 113/256 (44%), Positives = 148/256 (57%), Gaps = 32/256 (12%)
Query: 1 MASSENLAASIESWMYRPAMT-DTWL-SDYISRDAETLTKALHKSLSSS--SSPEDALSP 56
MASSE LA S++ W +R D+WL SD S D++ L KALH+S+S+S SSP S
Sbjct: 1 MASSEGLA-SVDPWSFRQNFNIDSWLLSDSFSHDSDILAKALHRSISTSTESSPLSPSSF 59
Query: 57 FLNLIKTDSATTTTTTTP--TVSSLS-ASDDSAP--------KRQR---VAAGKISKRKS 102
F DS+T +P T+S++S SD P KR+R V+ GK +KR+S
Sbjct: 60 F------DSSTAAVDFSPPQTLSNVSFGSDPEIPAASALGLGKRKRGPGVSGGKQTKRRS 113
Query: 103 RAS-KRSQTTFITADPANFRQMVQQVTGVRFGSGSNVSMAPLVKPEPHR-AVGVNGGGRF 160
R S K+SQTTFITAD ANFRQMVQQVTG +F SN AP+VKPEPHR A +
Sbjct: 114 RVSNKKSQTTFITADAANFRQMVQQVTGAKFLGSSNSIFAPIVKPEPHRLASRLPPSCGN 173
Query: 161 TTGGGCLPTLDTSAFLLQHHQQQQQTMVG-PNSDGPEMTGLGPLSFGQPIGEDAAGYDFE 219
+PTLDTS+FL HHQ+ T +G P + G + +G ++ + +
Sbjct: 174 LDRSSAVPTLDTSSFLSNHHQENIITDLGAPTGSFHHQSSAGTTTAN--VGGGSSAVELD 231
Query: 220 TFSSCFPTLESSWKVM 235
++ S FPTLE SWKVM
Sbjct: 232 SYPS-FPTLE-SWKVM 245
>At2g41010 unknown protein
Length = 238
Score = 132 bits (332), Expect = 2e-31
Identities = 99/246 (40%), Positives = 137/246 (55%), Gaps = 19/246 (7%)
Query: 1 MASSENLAASIESWMYRPAMT-DTWL-SDYISRDAETLTKALHKSLSSSSSPEDALSPFL 58
M +SE LA S++SW+YR D+WL SD S D + L +ALH ++++ + + S F
Sbjct: 1 MVTSEGLA-SVDSWLYRQGFNVDSWLLSDTFSHDNDLLARALHTTVTAPHTLTPS-SAFF 58
Query: 59 NLIKTDSATTTTTTTPTVSSLSASD--DSAPKRQR---VAAGKISKRKSRASKRSQTTFI 113
+ ++T T + TVS S + KR+R + GK +KR++RASK+SQTTFI
Sbjct: 59 DSSAVSHPSSTNTLSSTVSGASDPEIIGGGAKRKRNCLLTDGKAAKRRARASKKSQTTFI 118
Query: 114 TADPANFRQMVQQVTGVRF--GSGSNVSMAPLVKPEPHRAVGVNGGGRFTTGGGCLPTLD 171
TADP+NFRQMVQQVTG ++ S S P+VKPEP R V G + +P LD
Sbjct: 119 TADPSNFRQMVQQVTGAKYIDDSSSFGIFDPIVKPEPLRFVNKLPCGP-SDRSTAVPMLD 177
Query: 172 TSAFLLQHHQQQQQ--TMVGPNSDGPEMTGLGPLSFGQPIGEDAAGYDFETFSSCFPTLE 229
TSAFL HHQ+ NS + P + P G + +F+ + + FPTLE
Sbjct: 178 TSAFLSNHHQENLAVGNAFSGNSSSVGLPSGKPSATADPGG---SAVEFDNYPT-FPTLE 233
Query: 230 SSWKVM 235
SWKVM
Sbjct: 234 -SWKVM 238
>At4g15120 unknown protein
Length = 193
Score = 60.8 bits (146), Expect = 6e-10
Identities = 38/84 (45%), Positives = 49/84 (58%), Gaps = 6/84 (7%)
Query: 55 SPFLNLIKTDSATTTTTTTPTVSSLSASDDSAPKRQRVAAGKISKRKSRASKRSQTTFIT 114
S F N + T+T TT T ++ SA +P +RVA K ++R+SRAS+R+ TT
Sbjct: 9 SQFYNNQTFFTTATSTVTTTTTTATSADSPLSPDNRRVA--KPTRRRSRASRRTPTTLFN 66
Query: 115 ADPANFRQMVQQVTG----VRFGS 134
D ANFR MVQQ TG V FGS
Sbjct: 67 TDTANFRAMVQQFTGGPSAVAFGS 90
>At3g22160 unknown protein
Length = 192
Score = 58.5 bits (140), Expect = 3e-09
Identities = 35/92 (38%), Positives = 51/92 (55%), Gaps = 7/92 (7%)
Query: 55 SPFLNLIKTDSATTTTTTTPTVSSLSASDDSAPKRQRVAAGKISK---RKSRASKRSQTT 111
S F N +T T+TT +T ++ + S R G+++K R+SRAS+R+ TT
Sbjct: 8 SQFYNNNQTFFTTSTTASTAVTTTTAGDTTSIDSRLSPETGRVTKPTRRRSRASRRTPTT 67
Query: 112 FITADPANFRQMVQQVTG----VRFGSGSNVS 139
+ D +NFR MVQQ TG + FGSG+ S
Sbjct: 68 LLNTDTSNFRAMVQQYTGGPSAMAFGSGNTTS 99
>At5g65170 putative protein
Length = 362
Score = 51.6 bits (122), Expect = 4e-07
Identities = 43/173 (24%), Positives = 74/173 (41%), Gaps = 35/173 (20%)
Query: 45 SSSSSPEDALSPFLNLIKTDSATTTTTTTPTVSSLSASDDSAP-------KRQRVAAGKI 97
++++ P D P I ++ T + ++ ++ ++P K+ +A +
Sbjct: 79 TTATQPTDGSRPVPPPISSEQVFFTNPLQQNLRTVPNTNTTSPICSVPTDKKNGLATTRN 138
Query: 98 SKRKSRASKRSQTTFITADPANFRQMVQQVTG------------------VRFGSGSNVS 139
K++SR S+R+ TT +T D +NFR MVQ+ TG FGS S+ S
Sbjct: 139 PKKRSRVSRRAPTTVLTTDTSNFRAMVQEFTGNPSTPFTGLSSSFPRSRFDLFGSSSSSS 198
Query: 140 MAPLVKPEPHRAVGVNGGGRFTTGGGCLPTLDTSAFLLQHHQQQQQTMVGPNS 192
P+ KP PH+ + + T LP HH Q Q ++ N+
Sbjct: 199 SRPM-KPFPHKLISPS-----TLNHHYLPPSSE----YHHHHQHQNLLLNMNT 241
>At4g39720 putative protein
Length = 290
Score = 48.9 bits (115), Expect = 2e-06
Identities = 22/51 (43%), Positives = 34/51 (66%)
Query: 80 SASDDSAPKRQRVAAGKISKRKSRASKRSQTTFITADPANFRQMVQQVTGV 130
S++ S P + K +K++SRAS+R+ TT +T D +NFR MVQ+ TG+
Sbjct: 96 SSATSSLPPTNNIGVIKKTKKRSRASRRAPTTVLTTDTSNFRAMVQEFTGI 146
>At1g35830 hypothetical protein
Length = 302
Score = 47.0 bits (110), Expect = 9e-06
Identities = 29/88 (32%), Positives = 45/88 (50%), Gaps = 6/88 (6%)
Query: 43 SLSSSSSPEDALSPFLNLIKTDSATTTTTTTPTVSSLSASDDSAPKRQRVAAGKISKRKS 102
+LSSSSS F+N + +D T P P ++ + ++++
Sbjct: 50 TLSSSSSSSATYLNFVNNLISDDILNQTHLLPP------QPPPPPPPPPPSSSRNPRKRT 103
Query: 103 RASKRSQTTFITADPANFRQMVQQVTGV 130
RAS+R+ TT +T D +NFR MVQ+ TGV
Sbjct: 104 RASRRAPTTVLTTDTSNFRAMVQEFTGV 131
>At4g20000 unknown protein
Length = 208
Score = 43.1 bits (100), Expect = 1e-04
Identities = 25/67 (37%), Positives = 41/67 (60%), Gaps = 1/67 (1%)
Query: 78 SLSASDDSAPKRQRVAAGKISKRKSRASKRS-QTTFITADPANFRQMVQQVTGVRFGSGS 136
++++S S+ AA + + R+SRAS+R+ TT + A+P+NFR +VQ+ TG G S
Sbjct: 29 NIASSSGSSLGDGGAAAHEAAGRRSRASRRAIPTTLLNANPSNFRALVQKFTGRSAGGES 88
Query: 137 NVSMAPL 143
N P+
Sbjct: 89 NRRKGPV 95
>At1g16190 hypothetical protein, 3' partial
Length = 236
Score = 34.7 bits (78), Expect = 0.045
Identities = 46/188 (24%), Positives = 75/188 (39%), Gaps = 20/188 (10%)
Query: 36 LTKALHKSLSSSSSPEDALSPFLNLIKTDSATTTTTTTPTVSSLSASDDSAPKRQRVAAG 95
L L KS ++SS+ + P T S+TT + T S + +S P +++ A
Sbjct: 47 LVVMLSKSKTASSAGPSSTQPTSTTTSTISSTTLAAPSTTQSIAVPASNSTPVQEQPTA- 105
Query: 96 KISKRKSRASKRSQTTFITADPANFRQMVQQVTGVRFGSGSNVSMAPLVKP---EPHRAV 152
+S ++ +T ++ ++ QMVQQ+ + GS ++ ++ P RAV
Sbjct: 106 -----QSDTYGQAASTLVSG--SSIEQMVQQIMEMGGGSWDKETVTRALRAAYNNPERAV 158
Query: 153 GVNGGGRFTTGGGCLPTLDTSAFLLQHHQQQQQTMVGPNSDGPEMTGLGPLSFGQPIGED 212
G T+ A L ++ P S GP + L F Q D
Sbjct: 159 DY-------LYSGIPETVTIPATNLSGVGSGRELTAPPPSGGPNSSPLD--LFPQEAVSD 209
Query: 213 AAGYDFET 220
AAG D T
Sbjct: 210 AAGGDLGT 217
>At4g37440 unknown protein
Length = 471
Score = 34.3 bits (77), Expect = 0.059
Identities = 29/109 (26%), Positives = 48/109 (43%), Gaps = 18/109 (16%)
Query: 17 RPAMTDTWLSDYIS-RDAETLTKALHKSLSSSSSPEDALSPFLNLIKTDSATT------- 68
+P + + S ++S D ET L + L+S ++ P NL+KT+ A+
Sbjct: 352 KPVKSASVSSHHVSPEDDETTDILLSEILASKRREGKSIIPDKNLVKTEQASIEEGPSRP 411
Query: 69 TTTTTPTVSSLSASDDSAPKRQRVA----------AGKISKRKSRASKR 107
TP + ++S PKR+RV+ A + S RK + KR
Sbjct: 412 VRKRTPRNREIITKEESNPKRRRVSREKPKSNAVMASRFSNRKRKRGKR 460
>At1g79280 hypothetical protein
Length = 2111
Score = 33.1 bits (74), Expect = 0.13
Identities = 31/136 (22%), Positives = 52/136 (37%), Gaps = 12/136 (8%)
Query: 33 AETLTKALHKSLSSSSSPEDALSPFLNLIKTDSATTTTTTTPTVSSLSASDDSAPKRQRV 92
AE + + +PE + P + T +T TT+ T++S + + AP+ +
Sbjct: 1977 AEEAADIPNNANDQQEAPETDIKPETSAATTSPVSTAPTTSSTLAS-AITSSGAPETED- 2034
Query: 93 AAGKISKRKSRASKRSQTTFITADPANFRQMVQQVTGVRFGSGSNVSMAPLVKPEPHRAV 152
KR S T AD A ++ +++ + N P + R V
Sbjct: 2035 -----PKRAPSPGGGSSTIVTLADRAQMKRR-ERIANIVVSRAPN----PATRGARGRTV 2084
Query: 153 GVNGGGRFTTGGGCLP 168
+ GGGR GG P
Sbjct: 2085 NLRGGGRLLPRGGRAP 2100
>At1g49890 unknown protein
Length = 659
Score = 33.1 bits (74), Expect = 0.13
Identities = 24/64 (37%), Positives = 31/64 (47%)
Query: 50 PEDALSPFLNLIKTDSATTTTTTTPTVSSLSASDDSAPKRQRVAAGKISKRKSRASKRSQ 109
P LSP + + + TTTTTTT T SS S+S SA R S SR++ S
Sbjct: 37 PSRYLSPSPSHSVSSTTTTTTTTTTTTSSSSSSSSSAILRTSKRYPSPSPLLSRSTTNSA 96
Query: 110 TTFI 113
+ I
Sbjct: 97 SNSI 100
>At3g15300 unknown protein
Length = 219
Score = 32.3 bits (72), Expect = 0.22
Identities = 22/85 (25%), Positives = 35/85 (40%), Gaps = 9/85 (10%)
Query: 76 VSSLSASDDSAPKRQRVAAGKISKRKSRASKRSQTTFITADPANFRQMVQQVTGVRFGSG 135
+S+ S S+ + G + S TTF+ AD ++F+Q+VQ +TG
Sbjct: 3 ISTNPPSSSSSSVSSSIINGSLHHHIITRSDHYPTTFVQADSSSFKQVVQMLTG------ 56
Query: 136 SNVSMAPLVKPEPHRAVGVNGGGRF 160
S +P P +G G F
Sbjct: 57 ---SSSPRSPDSPRPPTTPSGKGNF 78
>At2g33780 hypothetical protein
Length = 204
Score = 32.3 bits (72), Expect = 0.22
Identities = 12/22 (54%), Positives = 19/22 (85%)
Query: 109 QTTFITADPANFRQMVQQVTGV 130
+TTFI DP++F+Q+VQ +TG+
Sbjct: 35 ETTFIRTDPSSFKQVVQLLTGI 56
>At1g58100 unknown protein
Length = 401
Score = 32.3 bits (72), Expect = 0.22
Identities = 44/178 (24%), Positives = 66/178 (36%), Gaps = 34/178 (19%)
Query: 33 AETLTKALHKSLSSSSSPEDALSPFLNLIKTDSATTTTTTTPTVSSLSASDDSAPKRQRV 92
A L A + S+ PED+ L T S T TTT + D +R R+
Sbjct: 22 ARQLVDASLSIVPRSTPPEDS-----TLATTSSTATATTTKRSTKDRHTKVDGRGRRIRM 76
Query: 93 AA------GKISKRKSRAS---------KRSQTTFITAD-----PANFRQMVQQVTGVRF 132
A ++++ S ++++ + A PANF + +
Sbjct: 77 PALCAARVFQLTRELGHKSDGETIEWLLQQAEPAIVAATGTGTIPANFSTLSVSLRS--- 133
Query: 133 GSGSNVSMAPLVKPEPHRAVGVNGGGRFTTGGGCLPTLDTSAFL-----LQHHQQQQQ 185
SGS +S P + A+G+ GGG + TS L LQHHQ Q Q
Sbjct: 134 -SGSTLSAPPSKSVPLYGALGLTHHQYDEQGGGGVFAAHTSPLLGFHHQLQHHQNQNQ 190
>At4g39790 putative protein
Length = 631
Score = 32.0 bits (71), Expect = 0.29
Identities = 25/99 (25%), Positives = 39/99 (39%)
Query: 18 PAMTDTWLSDYISRDAETLTKALHKSLSSSSSPEDALSPFLNLIKTDSATTTTTTTPTVS 77
P++T T S ++ ++ LS +S+P P NL + T ++T T T++
Sbjct: 60 PSLTATEPEKSPSHNSSYPDDSVDSPLSHNSNPNPNPKPLFNLSYMKTETASSTVTFTIN 119
Query: 78 SLSASDDSAPKRQRVAAGKISKRKSRASKRSQTTFITAD 116
LS DD + R R S F T D
Sbjct: 120 PLSDGDDDLEVTMPAFSPPPPPRPRRPETSSWDYFDTCD 158
>At1g80450 unknown protein
Length = 177
Score = 32.0 bits (71), Expect = 0.29
Identities = 20/47 (42%), Positives = 27/47 (56%), Gaps = 8/47 (17%)
Query: 110 TTFITADPANFRQMVQQVTG----VRFGSGSNVSMA----PLVKPEP 148
T F+ ADP+NFR +VQ++TG + S S VS A PL +P
Sbjct: 15 TMFVQADPSNFRNIVQKLTGAPPDISSSSFSAVSAAHQKLPLTPKKP 61
>At4g20950 putative protein
Length = 264
Score = 31.6 bits (70), Expect = 0.38
Identities = 25/78 (32%), Positives = 39/78 (49%), Gaps = 4/78 (5%)
Query: 38 KALHKSLSSSSSPEDALSPFL--NLIKTDSATTTTTTTPTVSSLSASDDSAPKRQRVAAG 95
K LSSSSS +P L L+ + S++++ TP ++ S+ D R+AAG
Sbjct: 33 KLFRSPLSSSSSSARPQAPQLVIKLLSSSSSSSSARPTPRLAISSSLDCKTHLPPRLAAG 92
Query: 96 KISK--RKSRASKRSQTT 111
SK K+ +RS+ T
Sbjct: 93 DSSKISLKTTTQRRSKVT 110
>At1g25540 hypothetical protein
Length = 809
Score = 31.6 bits (70), Expect = 0.38
Identities = 38/180 (21%), Positives = 66/180 (36%), Gaps = 5/180 (2%)
Query: 18 PAMTDTWLSDYISRDAETLTKALHKSLSSSSSPEDALSPFLNLIKTDSATTTTTTTPTVS 77
P + L + +S+ + ++ A +SS N+I T AT+ + TV
Sbjct: 338 PGGANVNLLNNLSQVRQVMSSAALAGAASSVGQSAVAMHMSNMISTGMATSLPPSQ-TVF 396
Query: 78 SLSASDDSAPKRQRVAAGKISKRKSRASKRSQTTFITADPANFRQMVQQVTGVRFGSGSN 137
S ++ G +S + + T++ A+ + Q + G+ GS S
Sbjct: 397 STGQQGITSMAGSGALMGSAQTGQSPGPNNAFSPQTTSNVASNLGVSQPMQGMNQGSHSG 456
Query: 138 VSMAPLVKPEPHRAVGVNGGGRFTTGGGCLPTLDTSAFLLQHHQQQQQTMVGPNSDGPEM 197
M + + G+ G + GG +PT Q Q Q + G NS P M
Sbjct: 457 AMMQGGISMNQNMMSGLGQGNVSSGTGGMMPTPGVG----QQAQSGIQQLGGSNSSAPNM 512
>At5g44160 unknown protein
Length = 466
Score = 31.2 bits (69), Expect = 0.50
Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 4/62 (6%)
Query: 65 SATTTTTTTPTVSSLSASDD----SAPKRQRVAAGKISKRKSRASKRSQTTFITADPANF 120
+A+ TTTTT + SL +SD +A ++A + ++ + S TT T DP+ F
Sbjct: 305 NASLTTTTTLSAPSLFSSDQPQNANANSNVNMSATALLQKAAEIGATSTTTAATNDPSTF 364
Query: 121 RQ 122
Q
Sbjct: 365 LQ 366
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.311 0.125 0.359
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,251,861
Number of Sequences: 26719
Number of extensions: 217856
Number of successful extensions: 963
Number of sequences better than 10.0: 69
Number of HSP's better than 10.0 without gapping: 33
Number of HSP's successfully gapped in prelim test: 37
Number of HSP's that attempted gapping in prelim test: 868
Number of HSP's gapped (non-prelim): 94
length of query: 235
length of database: 11,318,596
effective HSP length: 96
effective length of query: 139
effective length of database: 8,753,572
effective search space: 1216746508
effective search space used: 1216746508
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.9 bits)
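
The effective lengths and search space printed above can be cross-checked with simple arithmetic, and the bit score and E value of a hit can be approximated from its raw score with the standard Karlin-Altschul formulas. The sketch below is illustrative only: the function names are hypothetical, and because the printed Lambda and K are rounded, the computed bit score and E value will only roughly match the values in the report.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 235
DB_LETTERS = 11_318_596
DB_SEQUENCES = 26_719
HSP_LEN = 96            # "effective HSP length"
GAPPED_LAMBDA = 0.267   # rounded as printed
GAPPED_K = 0.0410       # rounded as printed


def effective_lengths(query_len, db_letters, db_sequences, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # One HSP length is subtracted per database sequence.
    eff_db = db_letters - db_sequences * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Normalized score: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, lam, k, search_space):
    """Expected number of chance hits: E = K * m * n * exp(-lambda * S)."""
    return k * search_space * math.exp(-lam * raw_score)


eff_query, eff_db, space = effective_lengths(
    QUERY_LEN, DB_LETTERS, DB_SEQUENCES, HSP_LEN
)
print(eff_query, eff_db, space)

# Top hit At3g56880 has raw score 367 (reported as 145 bits, E = 1e-35).
print(bit_score(367, GAPPED_LAMBDA, GAPPED_K))
print(expect_value(367, GAPPED_LAMBDA, GAPPED_K, space))
```

The three effective-length figures reproduce the footer exactly (139, 8,753,572 and 1,216,746,508); the bit score and E value land close to, but not exactly on, the reported 145 bits and 1e-35 because of the rounded parameters.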
Medicago: description of AC149050.15