
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149211.3 + phase: 0
(367 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At1g30550 unknown protein 182 2e-46
At2g41020 unknown protein 38 0.010
At3g56120 unknown protein 37 0.017
At3g19670 unknown protein 37 0.022
At3g57660 DNA-directed RNA polymerase I 190K chain - like protein 36 0.029
At4g16420 transcriptional adaptor like protein 35 0.085
At4g04670 unknown protein 34 0.14
At3g21300 unknown protein 33 0.25
At5g64150 unknown protein 32 0.42
At1g69310 putative WRKY transcription factor 32 0.72
At4g27340 putative protein 31 0.94
At4g26630 unknown protein 31 0.94
At5g53920 ribosomal protein L11 methyltransferase-like protein 30 1.6
At1g60700 unknown protein 30 1.6
At1g12800 unknown protein 30 1.6
At2g48160 unknown protein 30 2.1
At2g34850 putative UDP-galactose-4-epimerase 30 2.1
At4g32160 unknown protein 30 2.7
At3g19840 unknown protein 29 3.6
At2g43920 unknown protein 29 3.6
>At1g30550 unknown protein
Length = 398
Score = 182 bits (463), Expect = 2e-46
Identities = 91/171 (53%), Positives = 118/171 (68%), Gaps = 6/171 (3%)
Query: 197 KVKRKHRRKKLYYETEDLEFQKMPEAYSATIEKYWCQRYILFSRFDDGVKMDEEGWFSVT 256
K+K++ R KK T + + + + I KYW QRY LFSR+D G++MDEEGW+SVT
Sbjct: 164 KLKKRSRLKKEVKSTIEKDNGRRHK-----ITKYWIQRYDLFSRYDQGIEMDEEGWYSVT 218
Query: 257 PEIIAHHQASRCAGGTLIDCFTGAGGNAIQFAQRCRHVVAIDIDPLKIDYARHNAAIYRV 316
PE IA QA R G +IDCF+G GGN IQFA+ C VVAIDIDP+K++ A +NA +Y V
Sbjct: 219 PEEIAIKQAQRYRGKVVIDCFSGVGGNTIQFAKVCSSVVAIDIDPVKVELAMNNAMVYGV 278
Query: 317 DDQIDFIAGDFFVLAPKLKADTVFLSPPWGGPDYSKVVTYDMKTMLRPHDG 367
+++DF+ GDF LAP LK D VFLSPPWGGP Y +Y++ ML+P DG
Sbjct: 279 ANRVDFVIGDFIQLAPSLKGDVVFLSPPWGGPMYRDFESYNL-DMLQPRDG 328
Score = 160 bits (406), Expect = 8e-40
Identities = 79/141 (56%), Positives = 96/141 (68%), Gaps = 15/141 (10%)
Query: 227 IEKYWCQRYILFSRFDDGVKMDEEGWFSVTPEIIAHHQASRCAGGTLIDCFTGAGGNAIQ 286
I +YW QRY LFS++D G++MDEEGW+SVTPE IA QA RC G +IDCF+G G
Sbjct: 19 ISRYWIQRYDLFSKYDQGIEMDEEGWYSVTPEEIAIKQAERCRGKVVIDCFSGVG----- 73
Query: 287 FAQRCRHVVAIDIDPLKIDYARHNAAIYRVDDQIDFIAGDFFVLAPKLKADTVFLSPPWG 346
AIDIDP+KI A +NA +Y V ++IDF+ GDF LAP LK D +FLSPPWG
Sbjct: 74 ---------AIDIDPMKIALAMNNAKVYGVANRIDFVTGDFMQLAPSLKGDVLFLSPPWG 124
Query: 347 GPDYSKVVTYDMKTMLRPHDG 367
GP YSKV +Y + ML P DG
Sbjct: 125 GPTYSKVESYKL-DMLLPRDG 144
>At2g41020 unknown protein
Length = 325
Score = 37.7 bits (86), Expect = 0.010
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 22 DWMVLWDTFYGRRYFYNVKTDTSTWDPPPGME 53
+W+ +D G +YFYN +T S W+PP ++
Sbjct: 242 EWIETFDEASGHKYFYNTRTHVSQWEPPASLQ 273
Score = 28.1 bits (61), Expect = 7.9
Identities = 13/52 (25%), Positives = 22/52 (42%)
Query: 23 WMVLWDTFYGRRYFYNVKTDTSTWDPPPGMEHLAFGGCTELDDSETLKSSEE 74
W+ D G Y+YN T T W+ P + + L E +++ +E
Sbjct: 198 WVDAKDPASGATYYYNQHTGTCQWERPVELSYATSSAPPVLSKEEWIETFDE 249
>At3g56120 unknown protein
Length = 468
Score = 37.0 bits (84), Expect = 0.017
Identities = 21/51 (41%), Positives = 28/51 (54%)
Query: 270 GGTLIDCFTGAGGNAIQFAQRCRHVVAIDIDPLKIDYARHNAAIYRVDDQI 320
G T+ D F G G AI AQ+ V A D++P + Y + NA +VDD I
Sbjct: 217 GETVCDMFAGIGPFAIPAAQKGCFVYANDLNPDSVRYLKINAKFNKVDDLI 267
>At3g19670 unknown protein
Length = 960
Score = 36.6 bits (83), Expect = 0.022
Identities = 47/200 (23%), Positives = 79/200 (39%), Gaps = 9/200 (4%)
Query: 32 GRRYFYNVKTDTSTWDPPPGMEHLAFGGCTELDDSE-TLKSSEECETQSSIKQPEETLVD 90
GR+YF+N +T STW+ P + L D E + + KQ T+ +
Sbjct: 217 GRKYFFNKRTKKSTWEKPVELMTLFERADARTDWKEHSSPDGRKYYYNKITKQSTWTMPE 276
Query: 91 ENLSGNQHEEYSAEIGVAAGNLV--SDIATNSEDQFLHHPSDENLERTSCNGGVSRCSVS 148
E + E ++ G A ++ S++ T S+ P+ +TS + GV + +++
Sbjct: 277 EMKIVREQAEIASVQGPHAEGIIDASEVLTRSDTASTAAPTGLP-SQTSTSEGVEKLTLT 335
Query: 149 NTLDHVVSSNNKCSQATSEVDHTPTE----YMVIDTLELDSKSDPFMSKQEKKVKRKHRR 204
+ L S S VD + DT E D S P S K +K
Sbjct: 336 SDLKQPASVPGS-SSPVENVDRVQMSADETSQLCDTSETDGLSVPQGSGSGPKESQKPMV 394
Query: 205 KKLYYETEDLEFQKMPEAYS 224
+ E++ E Q E++S
Sbjct: 395 ESEKVESQTEEKQIHQESFS 414
>At3g57660 DNA-directed RNA polymerase I 190K chain - like protein
Length = 1670
Score = 36.2 bits (82), Expect = 0.029
Identities = 31/121 (25%), Positives = 56/121 (45%), Gaps = 4/121 (3%)
Query: 86 ETLVDENLSGNQHEEYSAEIGVAAGNLVSDIATNSEDQFLHHPSDENLERTSCNGGVSRC 145
ET D+++SG Q+E+ + G G V D+ ++++ Q + + E S +
Sbjct: 1325 ETDNDDSVSGKQNEDDGDDDG--EGTEVDDLGSDAQKQKKQETDEMDYEENSEDETNEPS 1382
Query: 146 SVSNTLDHVVSSNNKCSQATSEVDHTPTEYMVIDTLELDSKSDPFMSKQEKKVKRKHRRK 205
S+S D + S N+ ++ + E P E + E+ K + +Q KK +RK R
Sbjct: 1383 SISGVEDPEMDSENEDTEVSKEDTPEPQEESMEPQKEV--KGVKNVKEQSKKKRRKFVRA 1440
Query: 206 K 206
K
Sbjct: 1441 K 1441
>At4g16420 transcriptional adaptor like protein
Length = 487
Score = 34.7 bits (78), Expect = 0.085
Identities = 51/219 (23%), Positives = 85/219 (38%), Gaps = 24/219 (10%)
Query: 51 GMEHLAFGGCTELDDSETLKSSEECETQSS---IKQPEETLVD-ENLSGNQHEEYSAEIG 106
G+E G E+ + KS E+C + P L D +++G +E A
Sbjct: 116 GLEIYGLGNWAEVAEHVGTKSKEQCLEHYRNIYLNSPFFPLPDMSHVAGKNRKELQA--- 172
Query: 107 VAAGNLVSDIAT-NSEDQFLHHPSDENLERTSCNGGVSRC---------SVSNTLDHVVS 156
+A G + A N ++++ P +E T V R SV+N+L + +
Sbjct: 173 MAKGRIDDKKAEQNMKEEYPFSPPKVKVEDTQKESFVDRSFGGKKPVSTSVNNSLVELSN 232
Query: 157 SNNKCSQATSEVDH------TPTEYMVIDTLELDSKSDPFMSKQEKKVKRKHRRKKLYYE 210
N K + E D+ E+ DT E + K++ + RRK+ E
Sbjct: 233 YNQKREEFDPEYDNDAEQLLAEMEFKENDTPEEHELKLRVLRIYSKRLDERKRRKEFIIE 292
Query: 211 TEDLEFQKMPEAYSATIEKYWCQRYILFSRFDDGVKMDE 249
+L + E + EK C+R +F RF + DE
Sbjct: 293 -RNLLYPNPFEKDLSQEEKVQCRRLDVFMRFHSKEEHDE 330
>At4g04670 unknown protein
Length = 995
Score = 33.9 bits (76), Expect = 0.14
Identities = 23/76 (30%), Positives = 34/76 (44%), Gaps = 2/76 (2%)
Query: 268 CAGGTLIDCFTGAGGNAIQFAQRCRH--VVAIDIDPLKIDYARHNAAIYRVDDQIDFIAG 325
C ++D F G G + F R + V A + +P I+ R N V ++ + G
Sbjct: 836 CENEVVVDLFAGIGYFVLPFLVRAKAKLVYACEWNPHAIEALRRNVEANSVSERCIILEG 895
Query: 326 DFFVLAPKLKADTVFL 341
D + APK AD V L
Sbjct: 896 DNRITAPKGVADRVNL 911
>At3g21300 unknown protein
Length = 598
Score = 33.1 bits (74), Expect = 0.25
Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 1/54 (1%)
Query: 273 LIDCFTGAGGNAIQFAQRCRHVVAIDIDPLKIDYARHNAAIYRVDDQIDFIAGD 326
++D F G G + A+R +HV ++ P I A NA I +++ FI GD
Sbjct: 447 VLDLFCGTGTIGLTLARRAKHVYGYEVVPQAITDAHKNAQINGIEN-ATFIQGD 499
>At5g64150 unknown protein
Length = 377
Score = 32.3 bits (72), Expect = 0.42
Identities = 24/76 (31%), Positives = 36/76 (46%), Gaps = 5/76 (6%)
Query: 275 DCFTGAGGNAIQFAQRCR---HVVAIDIDPLKIDYARHNAAIYRVDDQIDFIAGDFFVLA 331
D TG+G AI A+ V+A D+ P+ I A HN Y ++ I+ G +F
Sbjct: 206 DLGTGSGAIAIGIAKVLGSRGRVIATDLSPVAIAVAGHNVQRYSLEGMIEVREGSWFEPL 265
Query: 332 PKLKADTVFL--SPPW 345
L+ V L +PP+
Sbjct: 266 KDLEGKLVGLVSNPPY 281
>At1g69310 putative WRKY transcription factor
Length = 287
Score = 31.6 bits (70), Expect = 0.72
Identities = 29/113 (25%), Positives = 45/113 (39%), Gaps = 18/113 (15%)
Query: 111 NLVSDIATNSEDQFLHHPSDE------NLERTSCNGG-----VSRCSVSNTLDHVVSSNN 159
N++SD N LHH SD + + T G S CS S + V+S N
Sbjct: 35 NILSDFGWN-----LHHSSDHPHSLRFDSDLTQTTGVKPTTVTSSCSSSAAVSVAVTSTN 89
Query: 160 KCSQATSEVDHTPTEYMVIDTLELDSKSDPFMSKQEKKVKRKHRRKKLYYETE 212
ATS P E + P K++KK +++ R+ + + T+
Sbjct: 90 NNPSATSSSSEDPAENSTASAEKTPPPETPV--KEKKKAQKRIRQPRFAFMTK 140
>At4g27340 putative protein
Length = 562
Score = 31.2 bits (69), Expect = 0.94
Identities = 18/70 (25%), Positives = 35/70 (49%)
Query: 275 DCFTGAGGNAIQFAQRCRHVVAIDIDPLKIDYARHNAAIYRVDDQIDFIAGDFFVLAPKL 334
D F G G A+ A+ + V A D++P +++ N+ + +++ +I+ I +A +
Sbjct: 467 DVFAGVGPIALAAARIVKRVYANDLNPHAVEFMEQNSVVNKLEKRIERIRIALSEVAVDV 526
Query: 335 KADTVFLSPP 344
K V L P
Sbjct: 527 KMRKVRLVAP 536
>At4g26630 unknown protein
Length = 763
Score = 31.2 bits (69), Expect = 0.94
Identities = 32/157 (20%), Positives = 62/157 (39%), Gaps = 5/157 (3%)
Query: 64 DDSETLKSSEECETQSSIKQPEETLVDENLSGNQHEEYSAEIGVAAGNLVSDIATNSEDQ 123
D++ET K E+ E Q E T +DE+ G + + + GV+ + V S+D
Sbjct: 79 DNAETQKMEEKVEVTKDEGQAEATNMDEDADGKKEQ---TDDGVSVEDTVMKENVESKDN 135
Query: 124 FLHHPSDENLERTSCNGGVSRCSVSNTLDHVVSSNNKCSQA-TSEVDHTPTEYMVIDTLE 182
++ + T + + + H N T ++ T +
Sbjct: 136 NYAKDDEKETKETDITEADHKKAGKEDIQHEADKANGTKDGNTGDIKEEGTLVDEDKGTD 195
Query: 183 LDSK-SDPFMSKQEKKVKRKHRRKKLYYETEDLEFQK 218
+D K + +KQ + V+ K + K +T+++E K
Sbjct: 196 MDEKVENGDENKQVENVEGKEKEDKEENKTKEVEAAK 232
>At5g53920 ribosomal protein L11 methyltransferase-like protein
Length = 371
Score = 30.4 bits (67), Expect = 1.6
Identities = 19/48 (39%), Positives = 26/48 (53%), Gaps = 1/48 (2%)
Query: 270 GGTLIDCFTGAGGNAIQFAQ-RCRHVVAIDIDPLKIDYARHNAAIYRV 316
G +D TG+G AI + V +DIDPL I+ A HNAA+ +
Sbjct: 221 GEAFLDYGTGSGILAIAALKFGAASSVGVDIDPLAINSAIHNAALNNI 268
>At1g60700 unknown protein
Length = 525
Score = 30.4 bits (67), Expect = 1.6
Identities = 17/58 (29%), Positives = 26/58 (44%), Gaps = 2/58 (3%)
Query: 58 GGCTELDDSETLKSSEECETQSSIKQPEETLVDENLSGNQHEEYSAEIGVAAGNLVSD 115
G CT LD+ ++ + + + QP TL E + G EE + + NLV D
Sbjct: 303 GACTSLDEQLYANATTSEDRDAELSQPPSTLYQEEVDG--EEEIDIDAMIRKLNLVPD 358
>At1g12800 unknown protein
Length = 767
Score = 30.4 bits (67), Expect = 1.6
Identities = 27/114 (23%), Positives = 49/114 (42%), Gaps = 6/114 (5%)
Query: 63 LDDSETLKSSEECETQSSIKQPEETLVDENLSGNQHEEYSAEIGVAAGNLVSDIATNSED 122
L D T++ E Q + TL+++ + Q E+G + G S+I NS
Sbjct: 239 LSDDLTMEEGE----QEGGTYSQYTLLEKPEARLQPVNVEEEVGDSGGVESSEIVNNSIQ 294
Query: 123 QFLHHPSDENLERTSCNGGVSRCS--VSNTLDHVVSSNNKCSQATSEVDHTPTE 174
+ P EN+E+ + GV S +N++ + N++ S ++ P E
Sbjct: 295 KPEARPELENIEKEVADSGVLESSEIENNSIPTEMQLNSEMSSEEKTINSDPLE 348
>At2g48160 unknown protein
Length = 1366
Score = 30.0 bits (66), Expect = 2.1
Identities = 29/132 (21%), Positives = 52/132 (38%), Gaps = 23/132 (17%)
Query: 58 GGCTELDDSETLKSSEECETQSSIKQPEETLVDENLSGNQHEEYSAEIGVAAGNLVSDIA 117
GGC DSE S+ + +S + E +++EN+S + E ++ + G L
Sbjct: 1038 GGC----DSEGGSDSDGGDFESVTPEHESRILEENVSSSTAERHTLILEDVDGEL----- 1088
Query: 118 TNSEDQFLHHPSDENLERTSCNGGVSRCSVSNTLDHVVSSNNKCSQATSEVDHTPTEYMV 177
+E + G C+ ++ D+ SN + Q V T ++M
Sbjct: 1089 --------------EMEDVAPPWGTENCTHTDQADNTKVSNCQLGQQHRPVFGTSHQHMS 1134
Query: 178 IDTLELDSKSDP 189
+ + L S S P
Sbjct: 1135 LSSPPLPSSSPP 1146
>At2g34850 putative UDP-galactose-4-epimerase
Length = 385
Score = 30.0 bits (66), Expect = 2.1
Identities = 19/66 (28%), Positives = 29/66 (43%), Gaps = 6/66 (9%)
Query: 252 WFSVTPEIIAHHQASRCAGGTLIDCF---TGAGGNAIQFAQRCRHVVAIDIDPLKIDYAR 308
+ VT + AH +A A + F TG G + +F + C+ +DI K+DY
Sbjct: 273 YIDVTDLVDAHVKALEKAKPRKVGIFNVGTGKGSSVKEFVEACKKATGVDI---KVDYLE 329
Query: 309 HNAAIY 314
A Y
Sbjct: 330 RRAGDY 335
>At4g32160 unknown protein
Length = 723
Score = 29.6 bits (65), Expect = 2.7
Identities = 21/101 (20%), Positives = 46/101 (44%), Gaps = 1/101 (0%)
Query: 61 TELDDSETLKSSEECETQSSIKQPEETLVDENLSGNQHEEYSAEIGVAA-GNLVSDIATN 119
T+ ++E L E ++++ ++ L D + ++ +EY+ + + GN V D T
Sbjct: 561 TDKTNAEKLLQEERKLLENTVAARKKLLSDCRILHDRLKEYNLNLSMDGNGNFVDDSTTI 620
Query: 120 SEDQFLHHPSDENLERTSCNGGVSRCSVSNTLDHVVSSNNK 160
S+ L SD+ +E G + + +D +S + +
Sbjct: 621 SDVLRLLSISDDQIEEAQLLSGFDENAAAEDIDKTLSMDTE 661
>At3g19840 unknown protein
Length = 830
Score = 29.3 bits (64), Expect = 3.6
Identities = 12/28 (42%), Positives = 18/28 (63%), Gaps = 1/28 (3%)
Query: 22 DWMVLWDTFYGRRYFYNVKTDTSTWDPP 49
DW ++ T G++Y+YN KT S+W P
Sbjct: 295 DWALV-STNDGKKYYYNNKTKVSSWQIP 321
>At2g43920 unknown protein
Length = 227
Score = 29.3 bits (64), Expect = 3.6
Identities = 27/117 (23%), Positives = 43/117 (36%), Gaps = 13/117 (11%)
Query: 241 FDDGVKMDEEGWFSVTPEIIAHHQASRCAGGTLIDCFTGAGGNAIQFAQRCRHVVAIDID 300
++DGV ++G TP I+ +S G + G G + + A R VV +DI
Sbjct: 40 WEDGVTPWDQG--RATPLILHLLDSSALPLGRTLVPGCGGGHDVVAMASPERFVVGLDIS 97
Query: 301 PLKIDYARHNAAIYRVDDQIDFIAGDFFVLAPKLKADTVF-----------LSPPWG 346
++ A + F+ D F P D +F + P WG
Sbjct: 98 DKALNKANETYGSSPKAEYFSFVKEDVFTWRPNELFDLIFDYVFFCAIEPEMRPAWG 154
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.317 0.134 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,140,810
Number of Sequences: 26719
Number of extensions: 410296
Number of successful extensions: 1359
Number of sequences better than 10.0: 33
Number of HSP's better than 10.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 23
Number of HSP's that attempted gapping in prelim test: 1335
Number of HSP's gapped (non-prelim): 46
length of query: 367
length of database: 11,318,596
effective HSP length: 101
effective length of query: 266
effective length of database: 8,619,977
effective search space: 2292913882
effective search space used: 2292913882
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (28.1 bits)
Medicago: description of AC149211.3