
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0103.2
(681 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At1g72400 hypothetical protein 35 0.14
At5g44800 helicase-like protein 33 0.53
At4g17000 hypothetical protein 33 0.53
At3g19190 hypothetical protein 33 0.53
At5g06670 kinesin heavy chain-like protein 33 0.69
At2g40950 bZIP tanscription factor like protein 33 0.69
At3g50580 proline-rich protein 32 0.90
At1g49270 hypothetical protein 32 0.90
At4g13340 extensin-like protein 32 1.2
At4g02710 predicted protein of unknown function 32 1.2
At3g21290 unknown protein 32 1.2
At4g18670 extensin-like protein 32 1.5
At2g48160 unknown protein 32 1.5
At5g52550 unknown protein 31 2.0
At2g47030 putative pectinesterase 31 2.0
At2g17110 unknown protein 31 2.0
At5g38560 putative protein 31 2.6
At4g37450 arabinogalactan protein AGP18 31 2.6
At4g36650 transcription initiation factor like protein 31 2.6
At1g76930 extensin 31 2.6
>At1g72400 hypothetical protein
Length = 946
Score = 35.0 bits (79), Expect = 0.14
Identities = 62/285 (21%), Positives = 107/285 (36%), Gaps = 49/285 (17%)
Query: 400 RDGTARFP-FYWTQGARQIIDPSEDSLTPEHKAVIDFLARLAVSDCSRVIGDKASLLKIQ 458
R ++RFP Q R + E K + L +L RV GD++S +
Sbjct: 335 RANSSRFPPASVDQIQRTRLSKENPGANDELKLKANELEKLFAEHQLRVPGDQSSSSRRG 394
Query: 459 ASGKRPLGEEDPRPS----PKRLNSSEPSGTIPAQHGLVDGSLSRPIVISSPRGPNLTSS 514
S + + ++P S KRL+ S L P+V +G L +
Sbjct: 395 KSSENQVAHKEPSHSIAATEKRLSLGGGSADF--------SKLMTPLVGDKDKGDALRRN 446
Query: 515 ANAPPPPAHTSTPSVGRISQ--LEKTLAELRQEAQLESSVKADQIASTEADLENLRSDL- 571
+ + S G+ + ++K A+LR+E L+ K ++ S + LE R+++
Sbjct: 447 LS----DLSLTDDSKGKFYEKYMKKRDAKLREEWSLKKGEKETKLKSMQEALEQSRTEMK 502
Query: 572 -------AKKDDLLSSA-----------------KTQAHLDAKALEEKLRSEDAVAAVSE 607
+++ DLLSS K Q+ D E+K R++D A+ +
Sbjct: 503 AKLSASSSERKDLLSSTRQRAEKFRSFNSRSSMKKYQSEEDEDISEQKPRAKDKAASGQQ 562
Query: 608 RGRGFFLAKAQVKHLHPLIDLSQMGAFKRVTPTGLVGPDDPPGFV 652
+Q + L P ++S +TP P G V
Sbjct: 563 SVGSISSRSSQARKLQPNRNMS-----SSITPRSAASVPKPSGKV 602
Score = 31.6 bits (70), Expect = 1.5
Identities = 32/146 (21%), Positives = 61/146 (40%), Gaps = 14/146 (9%)
Query: 446 RVIGDKASLLKIQASGKRPLGEEDPRPSPKRLNSSEPSGTIPAQHGLVDGSLSRPIVISS 505
R ++S+ K Q+ + E+ PR K + + G+I ++ S +R + +
Sbjct: 527 RSFNSRSSMKKYQSEEDEDISEQKPRAKDKAASGQQSVGSISSR-----SSQARKLQPNR 581
Query: 506 PRGPNLT--SSANAPPPPAHTSTPSVGR------ISQLEKTLAELRQEAQLESSVKADQI 557
++T S+A+ P P S S GR ++Q +EL +E SS+
Sbjct: 582 NMSSSITPRSAASVPKPSGKVSNTSSGRRRSDKSLAQSVPNFSELIKENTKPSSLAVKTT 641
Query: 558 ASTEADLENLRSDLAKKDDLLSSAKT 583
++ R+ K+D LL ++
Sbjct: 642 MRSQVKSSG-RTKNIKEDTLLQRPRS 666
>At5g44800 helicase-like protein
Length = 2228
Score = 33.1 bits (74), Expect = 0.53
Identities = 33/137 (24%), Positives = 58/137 (42%), Gaps = 33/137 (24%)
Query: 466 GEEDPRPSPKRLNSSEPSGTIPAQHGLVDGSLSRPIVISSPRGPNLTSSANAPPPPAHTS 525
G P+ P LNS+EP S I+I P ++ ++A P+
Sbjct: 2084 GSSQPKLPPHNLNSTEPLS-------------SEAIIIPPPEEDSVIAAA-----PSEAP 2125
Query: 526 TPSVGRISQLEKTLAELRQEAQLESSVKADQIASTEADLENLRSDLAKKDDLLSSAKTQA 585
PS+ I+ K+++ LES + + + DL+ + D+ + S +T
Sbjct: 2126 GPSLEGITGTTKSIS-------LESQSSEPETINQDGDLD------PETDEKVESERTPL 2172
Query: 586 HLDAKALEEKLRSEDAV 602
H D K +E+ SE+A+
Sbjct: 2173 HSDEK--QEEQESENAL 2187
>At4g17000 hypothetical protein
Length = 674
Score = 33.1 bits (74), Expect = 0.53
Identities = 25/92 (27%), Positives = 47/92 (50%), Gaps = 5/92 (5%)
Query: 532 ISQLEKTLAELRQEAQLESSVKADQIASTEADLENLRSDLAKKDDLLSSAKTQAHLDAKA 591
+ Q K L +L++ +L SSVK + +++N R+ L ++SSA ++ L K
Sbjct: 199 VFQSPKKLVKLKRSVELSSSVKKLCNGMRKLEIDNKRNGLGVNHKVVSSASSRRPL--KT 256
Query: 592 LEEKLRSEDAVAA---VSERGRGFFLAKAQVK 620
E K R D++ + + ++ +G K +VK
Sbjct: 257 REVKSRVFDSLRSQKQIDQKDKGVSTLKKRVK 288
>At3g19190 hypothetical protein
Length = 1814
Score = 33.1 bits (74), Expect = 0.53
Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 532 ISQLEKTLAELRQEAQLESSVKADQIASTEADLENLRSDLAK-KDDLLSSAKTQAHLD 588
+ +LE LA R E+ SS +A AST DL N+R ++ K ++++L +A A +D
Sbjct: 99 VDELELVLAP-RLESNKSSSNEASTSASTREDLHNIRLEIGKHENEMLMNAAKSASID 155
>At5g06670 kinesin heavy chain-like protein
Length = 997
Score = 32.7 bits (73), Expect = 0.69
Identities = 48/210 (22%), Positives = 90/210 (42%), Gaps = 31/210 (14%)
Query: 412 QGARQIIDPS--EDSLTPEHKAVIDFLARLAVSDCS---RVIGDKASLLKIQASGKRPLG 466
+G +IID + E+ T +H ++++L ++ D S + DK+S +K ++ P G
Sbjct: 537 EGTPEIIDDAFIEEKKTRKH-GLLNWL-KIKKKDSSLGGSSLSDKSSAVKSNSTPSTPQG 594
Query: 467 EEDPRPSPKRLNSSEPSGTIPAQHGLVDGSLSRPIVISSPRGPNLTSSANAPPPPAHTST 526
E + RL+ +GS +I + + T
Sbjct: 595 EGSDFHTESRLS---------------EGSALADQIIETMENREAHEDSFHEIETPETRI 639
Query: 527 PSVGRISQL---EKTLAE-LRQEAQL-----ESSVKADQIASTEADLENLRSDLAKKDDL 577
+ ++ L +KTL+E + Q+++ E + KA Q +A++ NL D+ K+D
Sbjct: 640 KMIDQMEILREQQKTLSEEMAQQSRSFKLLSEEAAKAPQNEEIKAEIINLNGDIKAKNDQ 699
Query: 578 LSSAKTQAHLDAKALEEKLRSEDAVAAVSE 607
+++ Q A ++L D V AVSE
Sbjct: 700 IATLGKQILDFVIASHDELDKSDIVQAVSE 729
>At2g40950 bZIP tanscription factor like protein
Length = 721
Score = 32.7 bits (73), Expect = 0.69
Identities = 35/159 (22%), Positives = 64/159 (40%), Gaps = 7/159 (4%)
Query: 465 LGEEDPRPSPKRLNSSEPSGTIPAQ-----HGLVDGSLSRPIVISSPRGPNLTSSANAPP 519
+ E PR S R + ++ + +P G +S SSP+ N+
Sbjct: 122 INRESPRDSDDRCSGADHNLDLPTPLSSQGSGNCGSDVSEATNESSPKSRNVAVDQKVKV 181
Query: 520 PPAHTSTPSVG-RISQLEKTLA-ELRQEAQLESSVKADQIASTEADLENLRSDLAKKDDL 577
A T+T S+ R ++++ L E R S AD A T + E R+ L + +
Sbjct: 182 EEAATTTTSITKRKKEIDEDLTDESRNSKYRRSGEDADASAVTGEEDEKKRARLMRNRES 241
Query: 578 LSSAKTQAHLDAKALEEKLRSEDAVAAVSERGRGFFLAK 616
++ + + LEEK+R+ + +F+A+
Sbjct: 242 AQLSRQRKKHYVEELEEKVRNMHSTITDLNGKISYFMAE 280
>At3g50580 proline-rich protein
Length = 265
Score = 32.3 bits (72), Expect = 0.90
Identities = 21/63 (33%), Positives = 31/63 (48%), Gaps = 15/63 (23%)
Query: 470 PRPSPKRLNSSEPSGTIPAQHGLVDGSLSRPIVISSPRGPNL---TSSANAPPPPA-HTS 525
P P+PK+ S P+ ++P P SP P+L T + PPPP+ H+S
Sbjct: 118 PHPTPKKSPSPPPTPSLPP-----------PAPKKSPSTPSLPPPTPKKSPPPPPSHHSS 166
Query: 526 TPS 528
+PS
Sbjct: 167 SPS 169
>At1g49270 hypothetical protein
Length = 699
Score = 32.3 bits (72), Expect = 0.90
Identities = 23/64 (35%), Positives = 28/64 (42%), Gaps = 8/64 (12%)
Query: 464 PLGEEDPRPSPKRLNSSEPSGTIPAQHGLVDGSLSRPIVISSPRGPNLTSSANAPPPPAH 523
P D P P++ S P P DGS S SSP P+ +SS + PPP
Sbjct: 46 PPSSPDIAPPPQQQQESPPP---PLPENSSDGSSS-----SSPPPPSDSSSQSQSPPPPS 97
Query: 524 TSTP 527
TS P
Sbjct: 98 TSPP 101
>At4g13340 extensin-like protein
Length = 760
Score = 32.0 bits (71), Expect = 1.2
Identities = 20/59 (33%), Positives = 26/59 (43%), Gaps = 3/59 (5%)
Query: 469 DPRPSPKRLNSSEPSGTIPAQHGLVDGSLSRPIVISSPRGPNLTSSANAPPPPAHTSTP 527
+P P P + S P P H P+ SSP P + S+ PPPP H S+P
Sbjct: 617 EPPPPPPCIEYSPPPPP-PVVH--YSSPPPPPVYYSSPPPPPVYYSSPPPPPPVHYSSP 672
>At4g02710 predicted protein of unknown function
Length = 1111
Score = 32.0 bits (71), Expect = 1.2
Identities = 28/92 (30%), Positives = 47/92 (50%), Gaps = 8/92 (8%)
Query: 531 RISQLEKTLAELRQEAQLESSVKADQIASTEADLENLRSDLAKKDDLLSSAKTQAHLDAK 590
R+S LE ++ AQ +S D+ AS EA+++ LR L K + S+ Q H K
Sbjct: 241 RLSNLESEVSR----AQADSRGINDRAASAEAEIQTLRETLYKLESEKESSFLQYH---K 293
Query: 591 ALEEKLRSEDAVAAVSERGRGFFLAKAQVKHL 622
L++ ED + +V+ + G +KA+ + L
Sbjct: 294 CLQKIADLEDGL-SVAHKEAGERASKAETETL 324
>At3g21290 unknown protein
Length = 1848
Score = 32.0 bits (71), Expect = 1.2
Identities = 50/234 (21%), Positives = 91/234 (38%), Gaps = 27/234 (11%)
Query: 454 LLKIQASGKRPLGEEDPRPSPKRLNSSEPSGTIPAQHGLVDGSLSRPIV---ISSPRGPN 510
L + S R ++ P PK+ P + S P + +S+ GP+
Sbjct: 265 LAHAEGSPWRMSNKQKKEPPPKKRKVDPPPVPVGGPKPSFRPGASTPTMKNRLSASPGPS 324
Query: 511 LTSSANAPP------PPAHTSTPSV------GRISQLEKTLAELRQEAQLESSVKADQIA 558
++ N PP H + +V GR++ +EK + + ++S +
Sbjct: 325 PSNQYNTPPYGIGNMAKTHAANENVTPVQTKGRVNMIEKEPSAWKNNVLRDTSGREAINV 384
Query: 559 STEADLENLRSDLAKKDDLLSSAKTQAHLD-----AKALEEKLRSEDAVAAVSERGRGFF 613
+ E DL++L D+ K+ + A +A D AK +E L+ A + R F
Sbjct: 385 NKEIDLQSLLVDILKEAPMSLKALEKAVGDKVPNPAKKIEPILKR----IANFQAPRYFL 440
Query: 614 LAKAQV---KHLHPLIDLSQMGAFKRVTPTGLVGPDDPPGFVVERFLTEEHVEL 664
+A++ K P + F G D P +VE+ T+E+V++
Sbjct: 441 KPEAELESYKKHSPDSGRNNTEKFSLCEQNGEGSLDCLPVHLVEQLSTQENVDI 494
>At4g18670 extensin-like protein
Length = 839
Score = 31.6 bits (70), Expect = 1.5
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 3/64 (4%)
Query: 470 PRPSPKRLNSSEPSGTIPAQHG--LVDGSLSRPIVISSP-RGPNLTSSANAPPPPAHTST 526
P PS +S PS P+ + G S PI+ S P GP+ SS + P PP S
Sbjct: 547 PSPSSPTPSSPIPSPPTPSTPPTPISPGQNSPPIIPSPPFTGPSPPSSPSPPLPPVIPSP 606
Query: 527 PSVG 530
P VG
Sbjct: 607 PIVG 610
>At2g48160 unknown protein
Length = 1366
Score = 31.6 bits (70), Expect = 1.5
Identities = 28/110 (25%), Positives = 51/110 (45%), Gaps = 17/110 (15%)
Query: 423 DSLTPEHKAVIDFLARLAVSDCSRVIGDKASLLKIQASGKR-------PLGEEDPRPSPK 475
+S+TPEH++ R+ + S ++ +L+ G+ P G E+ + +
Sbjct: 1054 ESVTPEHES------RILEENVSSSTAERHTLILEDVDGELEMEDVAPPWGTENCTHTDQ 1107
Query: 476 RLNSSEPSGTIPAQHGLVDGSLSRPIVISSPRGPNLTSSANAPPPPAHTS 525
N+ + + QH V G+ + + +SSP P S++ PPPPA S
Sbjct: 1108 ADNTKVSNCQLGQQHRPVFGTSHQHMSLSSPPLP----SSSPPPPPAPPS 1153
>At5g52550 unknown protein
Length = 360
Score = 31.2 bits (69), Expect = 2.0
Identities = 26/99 (26%), Positives = 50/99 (50%), Gaps = 10/99 (10%)
Query: 533 SQLEKTLAELRQEAQLESSVKADQIASTEA-----DLENLRSDLAKKDDLLSSAKTQAHL 587
++LEK ++++E QL+++V+ D + + +LE ++ KK + S T A +
Sbjct: 155 AELEKK-KQMKKEGQLDAAVEEDSAYAAKKKQEREELERIKQAERKKRRIEKSIATSAAI 213
Query: 588 DAKALEEKLRSEDAVAAVSERGRGFFLAKAQVKHLHPLI 626
A+ ++KLR + + E G A A+ LH L+
Sbjct: 214 RAELEKKKLRKLEEQRRLDEEG----AAIAEAVALHVLL 248
Score = 30.0 bits (66), Expect = 4.4
Identities = 22/82 (26%), Positives = 44/82 (52%), Gaps = 6/82 (7%)
Query: 533 SQLEKTLAELRQEAQLESSVKADQIASTEA-----DLENLRSDLAKKDDLLSSAKTQAHL 587
++LEK ++++E QLE++ + D + + +LE ++ KK+ L S T A +
Sbjct: 52 AELEKK-KQMKKEGQLEAADEEDSADAAKKKQERDELERIKQAENKKNRLEKSIATSAAI 110
Query: 588 DAKALEEKLRSEDAVAAVSERG 609
A+ ++KLR + ++E G
Sbjct: 111 MAELEKKKLRKLEEQKRLAEEG 132
>At2g47030 putative pectinesterase
Length = 588
Score = 31.2 bits (69), Expect = 2.0
Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 14/89 (15%)
Query: 237 DKASLLKIQASGKRPLGEEDPRPSPKSYLVAVWPIITVDLAQKFFRVVGARQIIDPSEDS 296
DKAS +K P+ EDP K++++A T D K G ++ S S
Sbjct: 50 DKASCVKTL----EPVKSEDPNKLIKAFMLA-----TKDELTKSSNFTGQTEVNMGS--S 98
Query: 297 LTPEHKAVIDFLARL---AVSDCSRVIGE 322
++P +KAV+D+ R+ A+ D + +I E
Sbjct: 99 ISPNNKAVLDYCKRVFMYALEDLATIIEE 127
>At2g17110 unknown protein
Length = 399
Score = 31.2 bits (69), Expect = 2.0
Identities = 25/93 (26%), Positives = 44/93 (46%), Gaps = 8/93 (8%)
Query: 516 NAPPPPAHTSTPSVGRISQLEKTLAELRQEAQLESSVKADQIASTEADLENLRSDLAKKD 575
N + TPS +S + + ++ + +A+ SSV A A EA+L A K
Sbjct: 13 NVSSKKLYEGTPSPSVVSSAQSSTSK-KAKAEASSSVTAPTYADIEAEL-------ALKS 64
Query: 576 DLLSSAKTQAHLDAKALEEKLRSEDAVAAVSER 608
LSS + HL K L +++++E+ + E+
Sbjct: 65 RNLSSTLHKLHLWEKKLYDEVKAEEKMRVNHEK 97
>At5g38560 putative protein
Length = 681
Score = 30.8 bits (68), Expect = 2.6
Identities = 20/58 (34%), Positives = 29/58 (49%), Gaps = 11/58 (18%)
Query: 470 PRPSPKRLNSSEPSGTIPAQHGLVDGSLSRPIVISSPRGPNLTSSANAPPPPAHTSTP 527
P+PSP P G P+ G S +P SP P T++ + PPPPA +++P
Sbjct: 135 PKPSPS------PPGETPSPPGETP-SPPKP----SPSTPTPTTTTSPPPPPATSASP 181
>At4g37450 arabinogalactan protein AGP18
Length = 209
Score = 30.8 bits (68), Expect = 2.6
Identities = 22/73 (30%), Positives = 31/73 (42%), Gaps = 4/73 (5%)
Query: 457 IQASGKRPLGEEDPRPSPKRLNSSEPSGTIPAQHGLVDGSLSRPIVISSPRGPNLTSSAN 516
I + K P P SP + + T PA+ S S P+ SP+ P S ++
Sbjct: 25 ISSPTKSPTTPSAPTTSPTKSPAVTSPTTAPAK--TPTASASSPV--ESPKSPAPVSESS 80
Query: 517 APPPPAHTSTPSV 529
PP P S+P V
Sbjct: 81 PPPTPVPESSPPV 93
>At4g36650 transcription initiation factor like protein
Length = 503
Score = 30.8 bits (68), Expect = 2.6
Identities = 29/121 (23%), Positives = 46/121 (37%), Gaps = 11/121 (9%)
Query: 391 HEYPEVFTHRDGTARFPFYWTQGARQIIDPSEDSLTPEHKAVIDFLARLAVSDCSRVIGD 450
H Y + D +F W G +++P+E P +D+ + + +GD
Sbjct: 371 HTYQQPKGKEDKQPKFRQPWLFGTASVMNPAEMISEPAKPNAMDYEKQQLDKQQQQQLGD 430
Query: 451 KASLLKIQASGKRPLGEEDPRPSPKRLNSSEPSGTI--PAQHGLVDGSLSRPIVISSPRG 508
K +L P+ D P P + S TI + +V GS S VI P+
Sbjct: 431 KETL---------PIYLRDHNPFPSNPSPSTGISTINWSFRPSVVPGSSSNLPVIHPPKL 481
Query: 509 P 509
P
Sbjct: 482 P 482
>At1g76930 extensin
Length = 373
Score = 30.8 bits (68), Expect = 2.6
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 500 PIVISSPRGPNLTSSANAPPPPAHTSTPSV 529
P+V SP P +PPPP H S P+V
Sbjct: 316 PVVYHSPPPPKKHYEYKSPPPPVHYSPPTV 345
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.317 0.134 0.397
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,378,754
Number of Sequences: 26719
Number of extensions: 757872
Number of successful extensions: 3242
Number of sequences better than 10.0: 61
Number of HSP's better than 10.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 53
Number of HSP's that attempted gapping in prelim test: 3086
Number of HSP's gapped (non-prelim): 190
length of query: 681
length of database: 11,318,596
effective HSP length: 106
effective length of query: 575
effective length of database: 8,486,382
effective search space: 4879669650
effective search space used: 4879669650
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 63 (28.9 bits)
Lotus: description of TM0103.2