
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0082b.2
(153 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AW560255 weakly similar to GP|10086260|gb| calmodulin-binding pr... 173 2e-44
AW560681 weakly similar to GP|10086260|gb| calmodulin-binding pr... 94 2e-30
AW980545 weakly similar to PIR|T10654|T106 hypothetical protein ... 103 2e-23
AL369319 weakly similar to PIR|T10654|T106 hypothetical protein ... 69 7e-13
TC77416 similar to GP|18139887|gb|AAL60196.1 O-linked N-acetyl g... 50 4e-07
TC89875 similar to GP|20259245|gb|AAM14358.1 putative N-terminal... 42 7e-05
TC79338 homologue to GP|20259245|gb|AAM14358.1 putative N-termin... 42 7e-05
TC87394 similar to GP|3582779|gb|AAC69180.1| peroxisomal targeti... 34 0.026
TC92121 weakly similar to GP|10177302|dbj|BAB10563. gene_id:MDC1... 31 0.17
TC89957 weakly similar to PIR|D96810|D96810 hypothetical protein... 30 0.50
AW586685 similar to PIR|T01081|T01 hypothetical protein T10P11.3... 28 1.4
TC79242 26 5.5
AL381331 weakly similar to GP|14209525|dbj contains ESTs C73715(... 26 7.2
TC87659 similar to GP|10177021|dbj|BAB10259. contains similarity... 25 9.3
TC84816 similar to GP|2313789|gb|AAD07729.1| H. pylori predicted... 25 9.3
BQ144111 25 9.3
AW776563 similar to GP|21304447|em HOBBIT protein {Arabidopsis t... 25 9.3
TC91796 GP|23498903|emb|CAD50981. hypothetical protein {Plasmodi... 25 9.3
AW775312 similar to GP|20466732|gb unknown protein {Arabidopsis ... 25 9.3
>AW560255 weakly similar to GP|10086260|gb| calmodulin-binding protein MPCBP
{Zea mays}, partial (7%)
Length = 610
Score = 173 bits (439), Expect = 2e-44
Identities = 86/154 (55%), Positives = 115/154 (73%), Gaps = 3/154 (1%)
Frame = -2
Query: 3 RDYARNLEEEIWHDLAYAYISLSQWHDAEVCLSKSKAFRQYTASRCHVIGTMHEAKGLYK 62
R+ R LE E+WHDLA Y +LS+WHDAE+CL+KS+A Y+ASR H G ++EA+GL++
Sbjct: 522 RNRDRRLEVEVWHDLANVYTALSRWHDAEICLAKSQAIDPYSASRLHSTGLLNEARGLHQ 343
Query: 63 EAVKAFRDALNIDPRHVPSLISTAVALRRWSNQSDP-VVRSFLMDALRYDRLNASAWYNL 121
EA+K+++ AL+I+P HV SLISTA LR+ QS +VRS L DAL+ D N+SAWYNL
Sbjct: 342 EALKSYKKALDIEPNHVASLISTACVLRKLGGQSSSLIVRSLLTDALKLDTTNSSAWYNL 163
Query: 122 GILHKAE--GRVLEAVECFQAANSLEETAPIEPF 153
G+L+KA+ LEA ECF+ A LEE++PIEPF
Sbjct: 162 GLLYKADLGTSALEAAECFETAVFLEESSPIEPF 61
>AW560681 weakly similar to GP|10086260|gb| calmodulin-binding protein MPCBP
{Zea mays}, partial (3%)
Length = 551
Score = 93.6 bits (231), Expect(2) = 2e-30
Identities = 50/86 (58%), Positives = 62/86 (71%), Gaps = 3/86 (3%)
Frame = -2
Query: 71 ALNIDPRHVPSLISTAVALRRWSNQSDP-VVRSFLMDALRYDRLNASAWYNLGILHKAE- 128
AL+I+P HV SLISTA LR+ QS +VRS L DAL+ D N+SAWYNLG+L+KA+
Sbjct: 400 ALDIEPNHVASLISTACVLRKLGGQSSSLIVRSLLTDALKLDTTNSSAWYNLGLLYKADL 221
Query: 129 -GRVLEAVECFQAANSLEETAPIEPF 153
LEA ECF+ A LEE++PIEPF
Sbjct: 220 GTSALEAAECFETAVFLEESSPIEPF 143
Score = 54.7 bits (130), Expect(2) = 2e-30
Identities = 24/49 (48%), Positives = 38/49 (76%), Gaps = 3/49 (6%)
Frame = -3
Query: 24 LSQWHDAEVCLSKSKAFRQYTASRCH---VIGTMHEAKGLYKEAVKAFR 69
LS+WHDAE+CL+KS+A Y+A R H G ++EA+GL++EA+K+++
Sbjct: 549 LSRWHDAEICLAKSQAIDPYSAYRLHSTVYAGLLNEARGLHQEALKSYK 403
>AW980545 weakly similar to PIR|T10654|T106 hypothetical protein T5F17.50 -
Arabidopsis thaliana, partial (11%)
Length = 562
Score = 103 bits (258), Expect = 2e-23
Identities = 46/62 (74%), Positives = 53/62 (85%)
Frame = +2
Query: 3 RDYARNLEEEIWHDLAYAYISLSQWHDAEVCLSKSKAFRQYTASRCHVIGTMHEAKGLYK 62
RD ARNLE EIWHDLA+ YISLSQWHDA CLSKSKA + Y+ASRCH +G M+EAKGL+K
Sbjct: 377 RDRARNLEVEIWHDLAHVYISLSQWHDAHACLSKSKAIKPYSASRCHALGIMYEAKGLFK 556
Query: 63 EA 64
E+
Sbjct: 557 ES 562
>AL369319 weakly similar to PIR|T10654|T106 hypothetical protein T5F17.50 -
Arabidopsis thaliana, partial (6%)
Length = 318
Score = 68.9 bits (167), Expect = 7e-13
Identities = 31/44 (70%), Positives = 38/44 (85%), Gaps = 3/44 (6%)
Frame = +3
Query: 113 LNASAWYNLGILHKAEGRV---LEAVECFQAANSLEETAPIEPF 153
LNASAWYNLG+ HKAEG++ +EA ECFQAA+SLEE+ P+EPF
Sbjct: 3 LNASAWYNLGLFHKAEGKISSLVEATECFQAAHSLEESTPLEPF 134
>TC77416 similar to GP|18139887|gb|AAL60196.1 O-linked N-acetyl glucosamine
transferase {Arabidopsis thaliana}, partial (94%)
Length = 3465
Score = 50.1 bits (118), Expect = 4e-07
Identities = 37/138 (26%), Positives = 64/138 (45%), Gaps = 8/138 (5%)
Frame = +2
Query: 12 EIWHDLAYAYISLSQWHDAEVCLSKSKAFRQYTASRCHVIGTMHEAKGLYKEAVKAFRDA 71
+ W +LA AY+ + +A C ++ A +G + +A+GL +EA + +A
Sbjct: 698 DAWSNLASAYMRKGRLTEAAQCCRQALAINPLMVDAHSNLGNLMKAQGLVQEAYSCYLEA 877
Query: 72 LNIDPRHVPSLISTAVALRRWSNQSDPVVRS--------FLMDALRYDRLNASAWYNLGI 123
L I P + A+A WSN + + S + +A++ A+ NLG
Sbjct: 878 LRIQP-------TFAIA---WSNLAGLFMESGDFNRALQYYKEAVKLKPSFPDAYLNLGN 1027
Query: 124 LHKAEGRVLEAVECFQAA 141
++KA G EA+ C+Q A
Sbjct: 1028VYKALGMPQEAIACYQHA 1081
Score = 38.9 bits (89), Expect = 8e-04
Identities = 27/99 (27%), Positives = 49/99 (49%)
Frame = +2
Query: 51 IGTMHEAKGLYKEAVKAFRDALNIDPRHVPSLISTAVALRRWSNQSDPVVRSFLMDALRY 110
+G +++A G+ +EA+ ++ AL P + + + A ++ Q D + + A+
Sbjct: 1019 LGNVYKALGMPQEAIACYQHALQTRPNYGMAYGNLA-SIHYEQGQLDMAILHY-KQAIAC 1192
Query: 111 DRLNASAWYNLGILHKAEGRVLEAVECFQAANSLEETAP 149
D A+ NLG K GRV EA++C+ SL+ P
Sbjct: 1193 DPRFLEAYNNLGNALKDVGRVEEAIQCYNQCLSLQPNHP 1309
>TC89875 similar to GP|20259245|gb|AAM14358.1 putative N-terminal
acetyltransferase {Arabidopsis thaliana}, partial (38%)
Length = 1170
Score = 42.4 bits (98), Expect = 7e-05
Identities = 29/103 (28%), Positives = 48/103 (46%), Gaps = 2/103 (1%)
Frame = +3
Query: 49 HVIGTMHEAKGLYKEAVKAFRDALNIDPRHVPSLISTAVALRRWSNQSDPV-VRSFLMDA 107
HV G ++ + Y+EA+K +R+AL IDP ++ L ++ + + S V R L+
Sbjct: 345 HVFGLLYRSDREYREAIKCYRNALRIDPENIEILRDLSLLQAQMRDLSGFVETRQQLLTL 524
Query: 108 LRYDRLNASAWYNLGILHKAEGRVLEAVECFQA-ANSLEETAP 149
R+N W + H +AVE +A +LE P
Sbjct: 525 KPNHRMN---WIGFSVAHHLNSNASKAVEILEAYEGTLENDHP 644
Score = 32.3 bits (72), Expect = 0.076
Identities = 19/81 (23%), Positives = 38/81 (46%)
Frame = +3
Query: 61 YKEAVKAFRDALNIDPRHVPSLISTAVALRRWSNQSDPVVRSFLMDALRYDRLNASAWYN 120
YK+ +KA L P H +L + L +S+ + L+ D + W+
Sbjct: 177 YKKGLKAADAILKKFPDHGETLSMKGLTLNCMDRKSEAY--ELVRQGLKNDLKSHVCWHV 350
Query: 121 LGILHKAEGRVLEAVECFQAA 141
G+L++++ EA++C++ A
Sbjct: 351 FGLLYRSDREYREAIKCYRNA 413
>TC79338 homologue to GP|20259245|gb|AAM14358.1 putative N-terminal
acetyltransferase {Arabidopsis thaliana}, partial (24%)
Length = 904
Score = 42.4 bits (98), Expect = 7e-05
Identities = 29/105 (27%), Positives = 50/105 (47%), Gaps = 2/105 (1%)
Frame = +3
Query: 49 HVIGTMHEAKGLYKEAVKAFRDALNIDPRHVPSLISTAVALRRWSNQSDPV-VRSFLMDA 107
HV G ++ + Y+EA+K +R+AL IDP ++ L ++ + + S V R L+
Sbjct: 495 HVYGLLYRSDREYREAIKCYRNALRIDPDNIEILRDLSLLQAQMRDLSGFVETRQQLLTL 674
Query: 108 LRYDRLNASAWYNLGILHKAEGRVLEAVECFQA-ANSLEETAPIE 151
R+N W + H +A+E +A +LE+ P E
Sbjct: 675 KSNHRMN---WIGFAVSHHLNSNASKAIEILEAYEGTLEDDYPPE 800
Score = 32.0 bits (71), Expect = 0.100
Identities = 19/81 (23%), Positives = 38/81 (46%)
Frame = +3
Query: 61 YKEAVKAFRDALNIDPRHVPSLISTAVALRRWSNQSDPVVRSFLMDALRYDRLNASAWYN 120
YK+ +KA L P H +L + L +S+ + L+ D + W+
Sbjct: 327 YKKGLKAADAILKKFPDHGETLSMKGLTLNCMDRKSEAY--ELVRQGLKNDLKSHVCWHV 500
Query: 121 LGILHKAEGRVLEAVECFQAA 141
G+L++++ EA++C++ A
Sbjct: 501 YGLLYRSDREYREAIKCYRNA 563
>TC87394 similar to GP|3582779|gb|AAC69180.1| peroxisomal targeting sequence
1 receptor {Nicotiana tabacum}, partial (65%)
Length = 1656
Score = 33.9 bits (76), Expect = 0.026
Identities = 26/99 (26%), Positives = 46/99 (46%)
Frame = +3
Query: 50 VIGTMHEAKGLYKEAVKAFRDALNIDPRHVPSLISTAVALRRWSNQSDPVVRSFLMDALR 109
V+G ++ Y +A+ AF AL + P+ SL + A + S QS + ++ AL
Sbjct: 789 VLGVLYNLSREYDKAIAAFEQALKLKPQDY-SLWNKLGATQANSVQSADAIAAY-QQALD 962
Query: 110 YDRLNASAWYNLGILHKAEGRVLEAVECFQAANSLEETA 148
AW N+GI + +G E++ + A ++ A
Sbjct: 963 LKPNYVRAWANMGISYANQGMYDESIRYYVRALAMNPKA 1079
>TC92121 weakly similar to GP|10177302|dbj|BAB10563.
gene_id:MDC12.18~pir||F69210~similar to unknown protein
{Arabidopsis thaliana}, partial (76%)
Length = 845
Score = 31.2 bits (69), Expect = 0.17
Identities = 24/128 (18%), Positives = 54/128 (41%)
Frame = +3
Query: 14 WHDLAYAYISLSQWHDAEVCLSKSKAFRQYTASRCHVIGTMHEAKGLYKEAVKAFRDALN 73
WH L + ++ ++ L + A + + +G + +A + ++ AL+
Sbjct: 159 WHQLGLHSLCAREFKTSQKYLKAAVACDKGCSYAWSNLGVSLQLSEEQSQAEEVYKWALS 338
Query: 74 IDPRHVPSLISTAVALRRWSNQSDPVVRSFLMDALRYDRLNASAWYNLGILHKAEGRVLE 133
+ + I + + + + + ++ +L A A+ NLG++ AEG + E
Sbjct: 339 LATKQEAHAILSNMGILYRQQKKYELAKAMFTKSLELQPGYAPAFNNLGLVFIAEGLLEE 518
Query: 134 AVECFQAA 141
A CF+ A
Sbjct: 519 AKHCFEKA 542
>TC89957 weakly similar to PIR|D96810|D96810 hypothetical protein T11I11.6
[imported] - Arabidopsis thaliana, partial (51%)
Length = 1843
Score = 29.6 bits (65), Expect = 0.50
Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 3/99 (3%)
Frame = +3
Query: 37 SKAFRQYTASRCHVIGTM-HEAKGLYKEAVKAFRDALNIDP--RHVPSLISTAVALRRWS 93
+K F T++ +I + + A G ++EAVK + A +DP R V +++ A A+
Sbjct: 1140 NKIFGMATSAYLSMISALVYLASGRFEEAVKTSQQADRVDPSNREVNAVLRRAKAV---- 1307
Query: 94 NQSDPVVRSFLMDALRYDRLNASAWYNLGILHKAEGRVL 132
S + + L A ++ + A A YN G+ H VL
Sbjct: 1308 -TSSRMSGNLLFKASKF--MEACAVYNEGLDHDPHNSVL 1415
>AW586685 similar to PIR|T01081|T01 hypothetical protein T10P11.3.2 -
Arabidopsis thaliana, partial (22%)
Length = 651
Score = 28.1 bits (61), Expect = 1.4
Identities = 16/45 (35%), Positives = 24/45 (52%)
Frame = +2
Query: 34 LSKSKAFRQYTASRCHVIGTMHEAKGLYKEAVKAFRDALNIDPRH 78
LS++ AF+ H+ HE KG A++ R AL++DP H
Sbjct: 446 LSRAIAFKA-DLHLLHLRAAFHEHKGDVLSALRDCRAALSVDPNH 577
>TC79242
Length = 1802
Score = 26.2 bits (56), Expect = 5.5
Identities = 12/36 (33%), Positives = 20/36 (55%)
Frame = +3
Query: 112 RLNASAWYNLGILHKAEGRVLEAVECFQAANSLEET 147
RLN +W +L +LH+ R+ + F A+S +T
Sbjct: 957 RLNCVSWLSLRLLHRIGERLCFTSDLFDNADSATKT 1064
>AL381331 weakly similar to GP|14209525|dbj contains ESTs C73715(E20247)
C99497(E20247)~similar to Arabidopsis thaliana
chromosome 2 At2g20670~, partial (14%)
Length = 475
Score = 25.8 bits (55), Expect = 7.2
Identities = 20/63 (31%), Positives = 26/63 (40%)
Frame = -2
Query: 34 LSKSKAFRQYTASRCHVIGTMHEAKGLYKEAVKAFRDALNIDPRHVPSLISTAVALRRWS 93
+S SK FRQ T H H ++ + +F L P L+ VALR WS
Sbjct: 345 ISVSKHFRQLTTGCRH----KHSISDMFISLIASFVACLTFVS---PELVLHKVALRSWS 187
Query: 94 NQS 96
S
Sbjct: 186 CSS 178
>TC87659 similar to GP|10177021|dbj|BAB10259. contains similarity to sorting
nexin~gene_id:MQJ2.4 {Arabidopsis thaliana}, partial
(70%)
Length = 1803
Score = 25.4 bits (54), Expect = 9.3
Identities = 13/44 (29%), Positives = 19/44 (42%)
Frame = +1
Query: 30 AEVCLSKSKAFRQYTASRCHVIGTMHEAKGLYKEAVKAFRDALN 73
A + S+ FR+ + + T+HE GL AF D N
Sbjct: 1372 ATAAVKASRLFRELNSQTVKHLDTLHEYLGLMLAVHSAFTDRTN 1503
>TC84816 similar to GP|2313789|gb|AAD07729.1| H. pylori predicted coding
region HP0659 {Helicobacter pylori 26695}, partial (3%)
Length = 807
Score = 25.4 bits (54), Expect = 9.3
Identities = 11/32 (34%), Positives = 15/32 (46%)
Frame = +2
Query: 19 YAYISLSQWHDAEVCLSKSKAFRQYTASRCHV 50
Y + SLS + + SK R YT CH+
Sbjct: 425 YNFDSLSNNDQQNLLVRSSKILRTYTEGNCHI 520
>BQ144111
Length = 748
Score = 25.4 bits (54), Expect = 9.3
Identities = 8/17 (47%), Positives = 12/17 (70%)
Frame = -2
Query: 46 SRCHVIGTMHEAKGLYK 62
SRCH++G +H K Y+
Sbjct: 414 SRCHIVGHIHRNKSHYR 364
>AW776563 similar to GP|21304447|em HOBBIT protein {Arabidopsis thaliana},
partial (15%)
Length = 342
Score = 25.4 bits (54), Expect = 9.3
Identities = 19/68 (27%), Positives = 36/68 (52%), Gaps = 3/68 (4%)
Frame = +3
Query: 61 YKEAVKAFRDALNIDPRHVPSLISTA---VALRRWSNQSDPVVRSFLMDALRYDRLNASA 117
++ A+K F+ A+ ++PR + VAL + N ++ + ALR D + +A
Sbjct: 9 HETALKNFQRAVQLNPRFAYAHTLCGHEYVAL*DFENG----IKCY-QSALRVDERHYNA 173
Query: 118 WYNLGILH 125
WY LG+++
Sbjct: 174 WYGLGMVY 197
>TC91796 GP|23498903|emb|CAD50981. hypothetical protein {Plasmodium
falciparum 3D7}, partial (0%)
Length = 836
Score = 25.4 bits (54), Expect = 9.3
Identities = 9/28 (32%), Positives = 15/28 (53%)
Frame = -3
Query: 13 IWHDLAYAYISLSQWHDAEVCLSKSKAF 40
+W + +A S +HD +C+S K F
Sbjct: 378 VWINSLFAAYITSSYHDKSLCISSHKLF 295
>AW775312 similar to GP|20466732|gb unknown protein {Arabidopsis thaliana},
partial (46%)
Length = 655
Score = 25.4 bits (54), Expect = 9.3
Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Frame = +2
Query: 13 IWHDLAY-AYISLSQWHDAEVCLSKSKAFRQYTASR 47
+WH L +Y SL QW + C+S + F +S+
Sbjct: 197 LWHRLPTGSYRSLHQWSNLRRCISSTFGFSSIYSSK 304
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.321 0.133 0.407
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,660,601
Number of Sequences: 36976
Number of extensions: 54021
Number of successful extensions: 276
Number of sequences better than 10.0: 38
Number of HSP's better than 10.0 without gapping: 266
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 271
length of query: 153
length of database: 9,014,727
effective HSP length: 88
effective length of query: 65
effective length of database: 5,760,839
effective search space: 374454535
effective search space used: 374454535
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 54 (25.4 bits)
Lotus: description of TM0082b.2