
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC123570.3 + phase: 0 /pseudo
(1023 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At5g48120 unknown protein 738 0.0
At2g19330 putative leucine-rich-repeat protein 39 0.015
At3g12780 phosphoglycerate kinase like protein 32 2.4
At5g64660 unknown protein 31 3.1
ndhD -chloroplast genome- NADH dehydrogenase ND4 31 4.1
At1g79550 phosphoglycerate kinase (EC 2.7.2.3) like protein 30 7.0
At1g78930 30 7.0
At1g75420 unknown protein 30 7.0
At2g36810 unknown protein 30 9.1
At2g25370 hypothetical protein 30 9.1
>At5g48120 unknown protein
Length = 1152
Score = 738 bits (1906), Expect = 0.0
Identities = 438/1042 (42%), Positives = 630/1042 (60%), Gaps = 52/1042 (4%)
Query: 1 MVTGSDAKAIAQYFIQHLHFQSLGLYDRKLCFELLDCLLEHHADSVASLEEDLIFGFCAA 60
+VT D +A+A+ IQ++ Q+L L++RKL FELL+CLL+ H++++ ++ + L++ C A
Sbjct: 125 VVTDIDVQAMAKSMIQNVQVQALALHERKLAFELLECLLQQHSEAILTMGDLLVYAMCEA 184
Query: 61 IDAERDPECLMPAFHIVESLARLYPDPSGLFASFARDVFDLLEPYFPIQFTHTTSGDAHV 120
ID E+DP+CLM FH+VE LA L+P PSG AS A D+F+++ YFP+ FTHT +A++
Sbjct: 185 IDGEKDPQCLMIVFHLVELLAPLFPSPSGPLASDASDLFEVIGCYFPLHFTHTKDDEANI 244
Query: 121 HRDDLSRTLMNAFSSTPLFEPFVIPLLLQKLSSSLHSAKIDSLQYLRVCSSKYGAERIAK 180
R+DLSR L+ A SSTP FEP+ IPLLL+KLSSSL AK+DSL+ L+ C+ KYG +R+ K
Sbjct: 245 RREDLSRGLLLAISSTPFFEPYAIPLLLEKLSSSLPVAKVDSLKCLKDCALKYGVDRMKK 304
Query: 181 YVGAIWSSLKDTINTYLMEPNFSFTLAPTDGIDFPKNEVVIEALSLLQQLIVQNSSQLVS 240
+ GA+WS+LKDT + + SF + F NE+ +A+SLLQ+L+ Q+ S +
Sbjct: 305 HYGALWSALKDTFYS-STGTHLSFAIESLTSPGFEMNEIHRDAVSLLQRLVKQDIS-FLG 362
Query: 241 LIIDDKDVNFIINSIASYEMYDAVSVQEKKKLHVIGRILYIFAKTSIPSCNAVFQSLLLR 300
++DD +N + ++I Y Y + K ++ VI +IL + AK S+ SCN +F+++ R
Sbjct: 363 FVVDDTRINTVFDTIYRYPQYKEMPDPSKLEVLVISQILSVSAKASVQSCNIIFEAIFFR 422
Query: 301 MMDSLGF--SVSNIDGLKNAGILASQSVNFGFLYLCIELLAGCRELVILSEEKPGTC--- 355
+M++LG S D ++N S + G L+LCIELLA ++L++ EE T
Sbjct: 423 LMNTLGIVEKTSTGDVVQNGNSTVSTRLYHGGLHLCIELLAASKDLILGFEECSPTSGCA 482
Query: 356 ---FTILHSSSDFLFNSFCSVLAVSADRFPPDPDIYIGVKGLQILAMFNLDVFPIPKSTF 412
+++ S S L F S + S D D+Y+GVKGL + MF P+ ++ F
Sbjct: 483 NSGCSMVKSFSVPLIQVFTSAVCRSND--DSVVDVYLGVKGLLTMGMFRGGSSPVSRTEF 540
Query: 413 ENILKKFMSIIIEDFNKTILWNSTLKSLFHIGSLFQNFSESEKAMSYRSFVLDKTMELLS 472
ENIL SII KT++W LK+L IGS + ES+KAMSY S V+D + L
Sbjct: 541 ENILVTLTSIITAKSGKTVVWELALKALVCIGSFIDRYHESDKAMSYMSIVVDNLVSLAC 600
Query: 473 LDDISLPFSLKLEVLSDIGMTSMKNMLKILQGLEGAIFANLSE-------VLFIYM**LH 525
LP+ + LE S++ T K + K++QGLE A ++LS+ V Y+
Sbjct: 601 SSHCGLPYQMILEATSEVCSTGPKYVEKMVQGLEEAFCSSLSDFYIESFLVSNFYLETTC 660
Query: 526 YLIFHGQGI----MLLECYSCKLLPWYSAFQGAEEFILQFSVDIWNQAGNCMDFNSPFEE 581
+ + + I LL+C + KLLP + G E+ ++ F++ +W Q C F+ F
Sbjct: 661 QVNGNFESIDNCSQLLKCLTNKLLPRVAEIDGLEQLLVHFAISMWKQIEFCGVFSCDFNG 720
Query: 582 KGLLDATMKAMKFSVGCCSEESQNVIILKSYSILSSRTNFQLNDVQRLPLTFE-----KY 636
+ ++A M M+ VG +SQN II K+YS++SS T L ++ +PLTF +
Sbjct: 721 REFVEAAMTTMRQVVGIALVDSQNSIIQKAYSVVSSCT---LPAMESIPLTFVALEGLQR 777
Query: 637 DISLRDEGILLLFASVIIALRPKTHVPNIRGILHLFIITLLKGVVPVAQALGSMVNKLIS 696
D+S RDE IL LFASVIIA P +P+ + ++HL ++TLLKG +P AQALGSMVNKL S
Sbjct: 778 DLSSRDELILSLFASVIIAASPSASIPDAKSLIHLLLVTLLKGYIPAAQALGSMVNKLGS 837
Query: 697 KSNGAEKSDELTLEEALHIIFNTKIC----FSSDNMLQICDGSINRNEIVLTDVCLGMTN 752
S G S + +LEEA IIF+ SS+ +I GS E ++ +CLG
Sbjct: 838 GSGGTNTSRDCSLEEACAIIFHADFASGKKISSNGSAKIIVGS----ETTMSKICLGYCG 893
Query: 753 DRLLQTNAVCGLSWIGKGLLLRGHEKIKDITKILTECLISDRNSSLPLIEGLDENNEEHK 812
LQT A+ GL+WIGKGLL+RG+E++ +I +L ECL S+ N H
Sbjct: 894 SLDLQTRAITGLAWIGKGLLMRGNERVNEIALVLVECLKSN-------------NCSGHA 940
Query: 813 GDHLARKCAADAFHVLMSDAEDCLNRKFHATMRPLYKQRFFSSMMPIFLQLISRSDSSSS 872
A K AADAF ++MSD+E CLNRKFHA +RPLYKQR FS+++PI LI S +S S
Sbjct: 941 LHPSAMKHAADAFSIIMSDSEVCLNRKFHAVIRPLYKQRCFSTIVPILESLIMNSQTSLS 1000
Query: 873 RYLLLRAFARVMSVTPLIVILNDAKELISVLLDCLSMLTEDIQDKDILYGLLLVLSGMLT 932
R +L A A V+S P+ VIL++ K+L ++L+ LS+L+ D +K+ L+ LLLVLSG LT
Sbjct: 1001 RTMLHVALAHVISNVPVTVILDNTKKLQPLILEGLSVLSLDSVEKETLFSLLLVLSGTLT 1060
Query: 933 EKNESLLFKAIQIKNDVNG*FLFLPILQLVRESCIQCLVALSKLPHVRIYPLRTQVLEAI 992
+ S F I N L +Q+VRE+ IQCLVAL +LPH RIYP R +VL+AI
Sbjct: 1061 DTKASSFFPTELILESSNDNMSTLNGMQVVRETSIQCLVALLELPHRRIYPFRREVLQAI 1120
Query: 993 SKCLDDTKRSVRNEAVKCRQAW 1014
K LDD KR VR EA++CRQAW
Sbjct: 1121 EKSLDDPKRKVREEAIRCRQAW 1142
>At2g19330 putative leucine-rich-repeat protein
Length = 380
Score = 38.9 bits (89), Expect = 0.015
Identities = 48/177 (27%), Positives = 77/177 (43%), Gaps = 21/177 (11%)
Query: 200 PNFSFTLAPTDGIDFPKNEVVIEALSLLQQLIVQNSSQLVSLIIDDKDVNFIINSIASYE 259
PN S LA +D N + SL +L+ L++L + + + NSI
Sbjct: 76 PNPSLNLAQICKLDLSNNHLQTIPESLTARLL-----NLIALDVHSNQIKALPNSIGC-- 128
Query: 260 MYDAVSVQEKKKLHVIGRILYIFAKT-----SIPSCNAVFQSLLLRMMDSLGFSVSNIDG 314
+ + K L+V G L F K+ S+ NA F L+ R+ DS+GF ++N+
Sbjct: 129 ------LSKLKTLNVSGNFLVSFPKSIQHCRSLEELNANFNKLI-RLPDSIGFELTNLRK 181
Query: 315 LK-NAGILASQSVNFGFLYLCIELLAGCRELVILSEEKPGTC-FTILHSSSDFLFNS 369
L N+ L S ++ L L A L+IL ++ IL+ S +F + S
Sbjct: 182 LSINSNKLISLPISITHLTSLRVLDARLNCLMILPDDLENLINLEILNVSQNFQYLS 238
>At3g12780 phosphoglycerate kinase like protein
Length = 481
Score = 31.6 bits (70), Expect = 2.4
Identities = 44/187 (23%), Positives = 76/187 (40%), Gaps = 13/187 (6%)
Query: 61 IDAERDPECLMPAFHIVESLARLYPDPSGLFASFARDVFDLLEPYFPIQFTHTTSGDAHV 120
I+ + +C+ P VESL P+ L R F E +F + A +
Sbjct: 165 IEVTKADDCIGPE---VESLVASLPEGGVLLLENVR--FYKEEEKNDPEFAKKLASLADL 219
Query: 121 HRDDLSRTLMNAFSSTP----LFEPFVIPLLLQKLSSSLHSAKIDSLQYLRVCSSKYGAE 176
+ +D T A +ST +P V LLQK L A + R ++ G
Sbjct: 220 YVNDAFGTAHRAHASTEGVTKFLKPSVAGFLLQKELDYLVGAVSNPK---RPFAAIVGGS 276
Query: 177 RIAKYVGAIWSSLKDTINTYLMEPNFSFTLAPTDGIDFPKNEVVIEALSLLQQLIVQNSS 236
+++ +G I SL + + L+ FT G+ + V + L L +L+ + +
Sbjct: 277 KVSSKIGVI-ESLLEKCDILLLGGGMIFTFYKAQGLSVGSSLVEEDKLELATELLAKAKA 335
Query: 237 QLVSLII 243
+ VSL++
Sbjct: 336 KGVSLLL 342
>At5g64660 unknown protein
Length = 420
Score = 31.2 bits (69), Expect = 3.1
Identities = 25/104 (24%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 900 ISVLLDCLSMLTEDIQDKDILYGLLLVLSGMLTEKNESLLFKAIQIKNDVNG*FLFLPIL 959
+ + +DC +L D + L+ G++TE L K+I +D++
Sbjct: 210 VELKIDCAGLLEFIAVDAESKL-LIAERDGLITE-----LMKSISKDSDLS--------- 254
Query: 960 QLVRESCIQCLVALSKLPHVRIYPLRTQVLEAISKCLDDTKRSV 1003
+ ES + CL+A+S V++ LR +++ ++K L D+ S+
Sbjct: 255 --LIESSLSCLIAISSPKRVKLNLLREKLIGDVTKLLSDSTSSL 296
>ndhD -chloroplast genome- NADH dehydrogenase ND4
Length = 506
Score = 30.8 bits (68), Expect = 4.1
Identities = 26/125 (20%), Positives = 54/125 (42%), Gaps = 5/125 (4%)
Query: 272 LHVIGRILYIFAKTSIPSCNAVFQSLLLRMMDSLGFSVSNIDGLKNAGILAS--QSVNFG 329
L V+G I I+A ++ P + + + + +GF + I + + G+ + Q ++ G
Sbjct: 286 LLVVGTIQIIYAASTSPGQRNLKKRIAYSSVSHMGFIIIGISSITDPGLNGAILQIISHG 345
Query: 330 FLYLCIELLAGC---RELVILSEEKPGTCFTILHSSSDFLFNSFCSVLAVSADRFPPDPD 386
F+ + LAG R ++ +E G +I + F S S+ F +
Sbjct: 346 FIGAALFFLAGTSYDRIRLVYLDEMGGMAISIPKIFTMFTILSMASLALPGMSGFIAEFI 405
Query: 387 IYIGV 391
++ G+
Sbjct: 406 VFFGI 410
>At1g79550 phosphoglycerate kinase (EC 2.7.2.3) like protein
Length = 401
Score = 30.0 bits (66), Expect = 7.0
Identities = 32/139 (23%), Positives = 61/139 (43%), Gaps = 8/139 (5%)
Query: 109 QFTHTTSGDAHVHRDDLSRTLMNAFSSTP----LFEPFVIPLLLQKLSSSLHSAKIDSLQ 164
+F + A V+ +D T A +ST +P V L+QK L A + +
Sbjct: 133 EFAKKLAALADVYVNDAFGTAHRAHASTEGVAKFLKPSVAGFLMQKELDYLVGAVANPKK 192
Query: 165 YLRVCSSKYGAERIAKYVGAIWSSLKDTINTYLMEPNFSFTLAPTDGIDFPKNEVVIEAL 224
++ G +++ +G I SL +T++ L+ FT G+ + V + L
Sbjct: 193 PF---AAIVGGSKVSTKIGVI-ESLLNTVDILLLGGGMIFTFYKAQGLSVGSSLVEEDKL 248
Query: 225 SLLQQLIVQNSSQLVSLII 243
L + L+ + ++ VSL++
Sbjct: 249 DLAKSLMEKAKAKGVSLLL 267
>At1g78930
Length = 489
Score = 30.0 bits (66), Expect = 7.0
Identities = 26/71 (36%), Positives = 34/71 (47%), Gaps = 8/71 (11%)
Query: 392 KGLQILAMFNLDVFPIPKSTFENILKKFMSIIIEDFNKTILWNSTLKSLFHIGSLFQNFS 451
K LQ +F L F + + F I+KK+ +I D +KT+L LK L IG S
Sbjct: 374 KTLQKKLIF-LTRFGVSTTHFPRIIKKYPEFLIYDADKTVL--PRLKYLMEIG-----IS 425
Query: 452 ESEKAMSYRSF 462
E E A R F
Sbjct: 426 EREIAFMIRKF 436
>At1g75420 unknown protein
Length = 265
Score = 30.0 bits (66), Expect = 7.0
Identities = 16/46 (34%), Positives = 25/46 (53%), Gaps = 2/46 (4%)
Query: 38 LLEHHADSVASLEEDLIFGFCAAIDAERDPECLMPAFHIVESLARL 83
L EH +S+ EDL+FG ++ + + + AFH ESL R+
Sbjct: 50 LREHVRESLGVRNEDLLFGIINSVSRGKGQDLFLRAFH--ESLERI 93
>At2g36810 unknown protein
Length = 1071
Score = 29.6 bits (65), Expect = 9.1
Identities = 19/56 (33%), Positives = 27/56 (47%), Gaps = 2/56 (3%)
Query: 957 PILQLVRESCIQCLVALSKL--PHVRIYPLRTQVLEAISKCLDDTKRSVRNEAVKC 1010
P ++ + SC C+V + L P + TQV+ I LDD SV+ AV C
Sbjct: 801 PSTKISKGSCCCCIVRVYTLQMPSACMSHYTTQVIGVILALLDDLDESVQLTAVSC 856
>At2g25370 hypothetical protein
Length = 546
Score = 29.6 bits (65), Expect = 9.1
Identities = 30/127 (23%), Positives = 54/127 (41%), Gaps = 18/127 (14%)
Query: 662 VPNIRGILHLFIITLLKGVVPVAQALGSMVNKLISKSNGAEKSDELTLEEALHIIFNTKI 721
+PN +G +G P A+ + ++V +L K ++ +E+A+ F I
Sbjct: 134 IPNSKG----------EGSSPAAKFMDNVVYRLCFKGLVNYEAVADDVEKAVKAGFGVAI 183
Query: 722 CFSSDNML-----QICDGSINRNEIVLTDVCLGMTNDRLLQTNAV---CGLSWIGKGLLL 773
C DN+L + D I+R + + + G++ L V C WI + ++
Sbjct: 184 CDEKDNLLHEIKESLRDIEISRRGVEIMALVRGLSESFDLGMRNVVIYCDDDWIYQSIIG 243
Query: 774 RGHEKIK 780
RG K K
Sbjct: 244 RGKSKKK 250
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.326 0.141 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 21,307,093
Number of Sequences: 26719
Number of extensions: 892243
Number of successful extensions: 2670
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 2659
Number of HSP's gapped (non-prelim): 12
length of query: 1023
length of database: 11,318,596
effective HSP length: 109
effective length of query: 914
effective length of database: 8,406,225
effective search space: 7683289650
effective search space used: 7683289650
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 65 (29.6 bits)
Medicago: description of AC123570.3