
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0117b.4
(482 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG587229 similar to PIR|T05664|T05 hypothetical protein F22I13.1... 427 e-120
TC92344 similar to PIR|T05664|T05664 hypothetical protein F22I13... 208 9e-83
TC83459 similar to GP|14334724|gb|AAK59540.1 unknown protein {Ar... 276 2e-74
TC88200 similar to GP|6671963|gb|AAF23222.1| unknown protein {Ar... 155 2e-38
TC91388 similar to GP|14334724|gb|AAK59540.1 unknown protein {Ar... 106 2e-23
TC84836 similar to PIR|T05165|T05165 hypothetical protein F18E5.... 59 4e-09
CA918371 homologue to GP|287405|dbj|BA ORF86 {Oryza sativa (japo... 32 0.046
TC81563 similar to GP|21593798|gb|AAM65765.1 unknown {Arabidopsi... 31 1.2
TC81242 31 1.2
TC89890 weakly similar to GP|17065226|gb|AAL32767.1 Unknown prot... 31 1.2
TC77620 similar to GP|10129643|emb|CAC08239. Pspzf zinc finger p... 30 1.6
BG454354 similar to GP|23307559|dbj hypothetical protein~similar... 30 1.6
TC87145 similar to PIR|E71446|E71446 probable protein kinase - A... 30 2.7
TC78401 similar to GP|6735386|emb|CAB69043.1 glutaredoxin {Arabi... 29 3.5
TC91513 similar to GP|13676415|dbj|BAB41198. hypothetical protei... 29 3.5
BF642107 29 4.6
TC84594 similar to GP|23497092|gb|AAN36641.1 hypothetical protei... 28 6.0
TC77189 homologue to GP|13124865|gb|AAK11734.1 serine/threonine/... 28 6.0
TC77860 similar to GP|21593547|gb|AAM65514.1 unknown {Arabidopsi... 28 6.0
BQ123345 28 7.8
>BG587229 similar to PIR|T05664|T05 hypothetical protein F22I13.130 -
Arabidopsis thaliana, partial (46%)
Length = 690
Score = 427 bits (1097), Expect = e-120
Identities = 205/229 (89%), Positives = 216/229 (93%)
Frame = +3
Query: 61 IESFVSLVNPSISVDCGILRDCYESFAMYCFGRYLVACLGGEERTIEFMERQGRSASKTP 120
IESFVSLVNPSISVDC ILRDCYESFAMYCFGRYLVACLGGE+RT++FME++GR+ KTP
Sbjct: 3 IESFVSLVNPSISVDCAILRDCYESFAMYCFGRYLVACLGGEDRTLDFMEKEGRATFKTP 182
Query: 121 LLLHHHSHDYKGTVNHPFPLKYFLKPWILTRRFYQIVKFGIVQYMIIKSFTSIMAVILEA 180
LL H+HS G V HPFP+KYFLKPWIL RFYQIVKFGIVQYMIIKSFT+I+AVILEA
Sbjct: 183 LLRHYHSSHSPGIVKHPFPIKYFLKPWILGPRFYQIVKFGIVQYMIIKSFTAILAVILEA 362
Query: 181 FGVYCEGEFKLGCGYPYIAVVLNFSQSWALYCLVQFYTVTKDELAHIKPLAKFLTFKSIV 240
FGVYCEGEFKLGCGYPY+AVVLNFSQSWALYCLVQFYTVTKDELAHIKPLAKFLTFKSIV
Sbjct: 363 FGVYCEGEFKLGCGYPYVAVVLNFSQSWALYCLVQFYTVTKDELAHIKPLAKFLTFKSIV 542
Query: 241 FLTWWQGVAIALLYTFGLFKSPIAQGLQSKSSVQDFIICIEMGIASIVH 289
FLTWWQGVAIALLYTFGLFKSPIAQGLQ KSSVQDFIICIEMGIASIVH
Sbjct: 543 FLTWWQGVAIALLYTFGLFKSPIAQGLQFKSSVQDFIICIEMGIASIVH 689
>TC92344 similar to PIR|T05664|T05664 hypothetical protein F22I13.130 -
Arabidopsis thaliana, partial (35%)
Length = 734
Score = 208 bits (530), Expect(2) = 9e-83
Identities = 100/110 (90%), Positives = 105/110 (94%)
Frame = +3
Query: 373 VHQAVEPVEKGITRFNEKLHRISQNIKKHDKDRRRIKDDSCIASSSPARRVIRGIDDPLL 432
VHQA EPVEKGITRFNEKL+RISQNIKKHDKD+RRIKDDSCI SSSPARRVIRGIDDPLL
Sbjct: 183 VHQAKEPVEKGITRFNEKLYRISQNIKKHDKDKRRIKDDSCIVSSSPARRVIRGIDDPLL 362
Query: 433 NGSISDSGISRAKKHRRKSGYTSAESGGESSSDQSYGAYQVRGHRWVTKE 482
NGS+SDSG+SR KKHRRKSGYTS ESGGESSSDQ+YG YQVRGHRWVTKE
Sbjct: 363 NGSVSDSGMSRGKKHRRKSGYTSGESGGESSSDQTYGGYQVRGHRWVTKE 512
Score = 117 bits (292), Expect(2) = 9e-83
Identities = 56/60 (93%), Positives = 56/60 (93%)
Frame = +2
Query: 313 GDYSADCPLDPDEIRDSERPTKLRLPTTDVDAKSGMTIRESVRDVVIGGSGYIVKDVKFT 372
GDYSADCPLDPDEIRDSERPTKLRLP DVDAKSGMTIRESVRDV IGG GYIVKDVKFT
Sbjct: 2 GDYSADCPLDPDEIRDSERPTKLRLPAPDVDAKSGMTIRESVRDVAIGGGGYIVKDVKFT 181
>TC83459 similar to GP|14334724|gb|AAK59540.1 unknown protein {Arabidopsis
thaliana}, partial (42%)
Length = 1248
Score = 276 bits (705), Expect = 2e-74
Identities = 141/267 (52%), Positives = 185/267 (68%), Gaps = 4/267 (1%)
Frame = +1
Query: 193 CGYPYIAVVLNFSQSWALYCLVQFYTVTKDELAHIKPLAKFLTFKSIVFLTWWQGVAIAL 252
C YPY+A +LNFSQ+WALYCLVQFY+V KD+L IKPLAKFLTFKSIVFLTWWQGVA+A
Sbjct: 169 CRYPYLASILNFSQTWALYCLVQFYSVIKDKLEPIKPLAKFLTFKSIVFLTWWQGVAVAF 348
Query: 253 LYTFGLFKSPIAQGLQSKSSVQDFIICIEMGIASIVHLYVFPAKPYELMGDRHPGSISVL 312
L++ G FK +AQ L+++ +QD+IICIEMG+A++VHLYVFPA PY+ G+R + +V+
Sbjct: 349 LFSMGAFKGALAQELRTR--IQDYIICIEMGVAAVVHLYVFPAVPYK-RGERCVRNAAVM 519
Query: 313 GDY-SADCPLDPDEIRDSERPTKLRLPTTDVDAKSGMTIRESVRDVVIGGSGYIVKDVKF 371
DY S P DP E RD ER + RL + K M +VRDVV+G IV D+KF
Sbjct: 520 ADYASLGTPPDPAEGRDCERSIRTRLGHHEEKEKKPMKFTHNVRDVVLGSGEIIVDDMKF 699
Query: 372 TVHQAVEPVEKGITRFNEKLHRISQNIKKHDKDRRR---IKDDSCIASSSPARRVIRGID 428
TV VEPVE+GI + N+ H+IS+N+K+HD++R+R +KDDS + R +
Sbjct: 700 TVSHVVEPVERGIAKINKTFHQISENVKRHDEERKRNTKVKDDSHLVPLESWRTEFSDVH 879
Query: 429 DPLLNGSISDSGISRAKKHRRKSGYTS 455
D L+ GS+SDSG+S K+H TS
Sbjct: 880 DKLVEGSVSDSGLSDGKRHTHSKASTS 960
>TC88200 similar to GP|6671963|gb|AAF23222.1| unknown protein {Arabidopsis
thaliana}, partial (88%)
Length = 1788
Score = 155 bits (393), Expect = 2e-38
Identities = 95/288 (32%), Positives = 149/288 (50%)
Frame = +1
Query: 11 PVWATIVASVFLLMTLALSMYLLFEHLSAYKNPEEQKFLIGVILMVPCYSIESFVSLVNP 70
P++ ++A + +AL++ ++ HL Y P Q+F++ ++ MVP Y++ SF+SLV P
Sbjct: 148 PLFFYVIAFFCTVGAIALAILHIYRHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLP 327
Query: 71 SISVDCGILRDCYESFAMYCFGRYLVACLGGEERTIEFMERQGRSASKTPLLLHHHSHDY 130
+S+ +R+ YE++ +Y F +A +GG + + GR + L+
Sbjct: 328 RLSIYFNSIREVYEAWVIYNFLSLCLAWVGGPGSVV--ISLSGRVLKPSVCLM------- 480
Query: 131 KGTVNHPFPLKYFLKPWILTRRFYQIVKFGIVQYMIIKSFTSIMAVILEAFGVYCEGEFK 190
FP P L RF + K G +Q++I+K ++ +IL A G Y +G F
Sbjct: 481 ----TCCFP------PIPLDGRFIRKCKQGCLQFVILKPILVVVTLILYAKGKYKDGNFN 630
Query: 191 LGCGYPYIAVVLNFSQSWALYCLVQFYTVTKDELAHIKPLAKFLTFKSIVFLTWWQGVAI 250
Y Y+ ++ FS + ALY L FY KD L P+ KF+ KS+VFLT+WQGV
Sbjct: 631 PKQSYLYLTIIYTFSYTMALYALALFYVACKDLLQPFNPVPKFIIIKSVVFLTYWQGVLF 810
Query: 251 ALLYTFGLFKSPIAQGLQSKSSVQDFIICIEMGIASIVHLYVFPAKPY 298
F KS Q + +Q+FIIC+EM IA++ H Y FP K Y
Sbjct: 811 -----FLAAKSGFIQDADEAALLQNFIICVEMLIAAVGHFYAFPYKEY 939
>TC91388 similar to GP|14334724|gb|AAK59540.1 unknown protein {Arabidopsis
thaliana}, partial (14%)
Length = 691
Score = 106 bits (264), Expect = 2e-23
Identities = 46/76 (60%), Positives = 63/76 (82%)
Frame = +2
Query: 13 WATIVASVFLLMTLALSMYLLFEHLSAYKNPEEQKFLIGVILMVPCYSIESFVSLVNPSI 72
W AS+F+L+ L LSMYL+FEHL+AY PEEQKFLIG+ILMVP Y++ESF+SL++ S
Sbjct: 464 WTVFSASIFVLVALVLSMYLIFEHLAAYNQPEEQKFLIGLILMVPVYALESFLSLLDSSA 643
Query: 73 SVDCGILRDCYESFAM 88
+ +C ++RDCYE+FA+
Sbjct: 644 AFNCEVIRDCYEAFAL 691
>TC84836 similar to PIR|T05165|T05165 hypothetical protein F18E5.190 -
Arabidopsis thaliana, partial (53%)
Length = 634
Score = 58.9 bits (141), Expect = 4e-09
Identities = 34/95 (35%), Positives = 56/95 (58%), Gaps = 5/95 (5%)
Frame = +3
Query: 2 IIIQLLHTPPVWATIVASVFLLM-TLALSMYLLFEHLSAYKNPEEQKFLIGVILMVPCYS 60
+++ L P T+ S F +M ++ ++ LL +HL +KNP+EQK +I +ILM P Y+
Sbjct: 108 VVMDLTQLNPAQITVYGSAFCVMLSMHFTLQLLSQHLFYWKNPKEQKAIIIIILMAPIYA 287
Query: 61 IESFVSLVNPSISVDCGIL----RDCYESFAMYCF 91
I SFV L++ S + L ++CYE+F + F
Sbjct: 288 IVSFVGLLDIRGSKEFFTLLESIKECYEAFVIAKF 392
>CA918371 homologue to GP|287405|dbj|BA ORF86 {Oryza sativa (japonica
cultivar-group)}, partial (15%)
Length = 596
Score = 32.0 bits (71), Expect(2) = 0.046
Identities = 13/13 (100%), Positives = 13/13 (100%)
Frame = +2
Query: 290 LYVFPAKPYELMG 302
LYVFPAKPYELMG
Sbjct: 557 LYVFPAKPYELMG 595
Score = 22.3 bits (46), Expect(2) = 0.046
Identities = 10/16 (62%), Positives = 13/16 (80%), Gaps = 3/16 (18%)
Frame = +1
Query: 277 IICI---EMGIASIVH 289
++CI +MGIASIVH
Sbjct: 508 VVCIYPAQMGIASIVH 555
>TC81563 similar to GP|21593798|gb|AAM65765.1 unknown {Arabidopsis
thaliana}, partial (52%)
Length = 1143
Score = 30.8 bits (68), Expect = 1.2
Identities = 16/37 (43%), Positives = 21/37 (56%)
Frame = -1
Query: 395 SQNIKKHDKDRRRIKDDSCIASSSPARRVIRGIDDPL 431
SQ + HD D CIA++S +R RG+DDPL
Sbjct: 267 SQIVSYHDTKHSPY*HDECIANTSKGKR--RGLDDPL 163
>TC81242
Length = 525
Score = 30.8 bits (68), Expect = 1.2
Identities = 16/37 (43%), Positives = 21/37 (56%)
Frame = +2
Query: 395 SQNIKKHDKDRRRIKDDSCIASSSPARRVIRGIDDPL 431
SQ + HD D CIA++S +R RG+DDPL
Sbjct: 344 SQIVSYHDTKHSPY*HDECIANTSKGKR--RGLDDPL 448
>TC89890 weakly similar to GP|17065226|gb|AAL32767.1 Unknown protein
{Arabidopsis thaliana}, partial (76%)
Length = 1013
Score = 30.8 bits (68), Expect = 1.2
Identities = 28/123 (22%), Positives = 49/123 (39%), Gaps = 1/123 (0%)
Frame = +1
Query: 279 CIEMGIASIVHLYVFPAKPYELMGDRHPGSISVLGDYSADCPLDPDEIRDSERPTKLRLP 338
CI M S V P+ D HP ++L P R T + LP
Sbjct: 43 CISMTYLSAVLSTPTSTLPFRCRIDSHPSQPNLL--------FAPP--RPKFNTTNIILP 192
Query: 339 TT-DVDAKSGMTIRESVRDVVIGGSGYIVKDVKFTVHQAVEPVEKGITRFNEKLHRISQN 397
++ D+ + + + GG G V+ +K +++ +EP+E+G E R+ +
Sbjct: 193 SSVAADSAKWRNMVSIFQGFLTGGRGNDVESLKVELYETIEPLERGAEATPEDQQRVDKI 372
Query: 398 IKK 400
+K
Sbjct: 373 ARK 381
>TC77620 similar to GP|10129643|emb|CAC08239. Pspzf zinc finger protein-like
{Arabidopsis thaliana}, partial (26%)
Length = 1958
Score = 30.4 bits (67), Expect = 1.6
Identities = 38/125 (30%), Positives = 57/125 (45%), Gaps = 9/125 (7%)
Frame = +3
Query: 360 GGSGYIVKDVKFTVHQAVEPVEKGITRFNEKLHRISQNIKKHDKDRRRIKDDSCIASSSP 419
G S Y ++++K V PV G T + L++ IKK + + SSS
Sbjct: 714 GTSRYGLRNLKCNSISDVMPV--GCTPSDSTLNKKKDAIKKRNCEGE---------SSST 860
Query: 420 ARRVIRGIDDPLLNG---------SISDSGISRAKKHRRKSGYTSAESGGESSSDQSYGA 470
AR + I+ PL++G SISDS ISR HR ++ ++ SG S +G
Sbjct: 861 ARG--KKINGPLIDGRNSVSRNGLSISDSRISRNAPHRDRAD-SNIASGRTRRSIGGHGR 1031
Query: 471 YQVRG 475
+V G
Sbjct: 1032GRVSG 1046
>BG454354 similar to GP|23307559|dbj hypothetical protein~similar to
Arabidopsis thaliana chromosome4 At4g21570, partial
(14%)
Length = 305
Score = 30.4 bits (67), Expect = 1.6
Identities = 10/29 (34%), Positives = 20/29 (68%)
Frame = +1
Query: 270 KSSVQDFIICIEMGIASIVHLYVFPAKPY 298
+ ++Q+ ++CIEM + S++ Y + A PY
Sbjct: 40 EEAMQNILVCIEMVVFSVLQQYAYHASPY 126
>TC87145 similar to PIR|E71446|E71446 probable protein kinase - Arabidopsis
thaliana, partial (74%)
Length = 1443
Score = 29.6 bits (65), Expect = 2.7
Identities = 19/49 (38%), Positives = 26/49 (52%), Gaps = 4/49 (8%)
Frame = +3
Query: 197 YIAVVLNFSQSWALYCLVQFYTVTKDE----LAHIKPLAKFLTFKSIVF 241
++ + LNFSQS +L+ L F TK + H KPL F+ K VF
Sbjct: 84 FLPLFLNFSQSSSLFNLSLFSFPTKHSSNIYIIHNKPLGNFMITK*SVF 230
>TC78401 similar to GP|6735386|emb|CAB69043.1 glutaredoxin {Arabidopsis
thaliana}, partial (68%)
Length = 706
Score = 29.3 bits (64), Expect = 3.5
Identities = 16/41 (39%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Frame = +3
Query: 216 FYTVTKDELAHIKPLAK--FLTFKSIVFLTWWQGVAIALLY 254
FY+VT +E HIK + + F T K++++ W IAL+Y
Sbjct: 501 FYSVTMNESLHIKDMIRLHFFTAKALIYKM*W----IALIY 611
>TC91513 similar to GP|13676415|dbj|BAB41198. hypothetical protein {Glycine
max}, partial (14%)
Length = 1030
Score = 29.3 bits (64), Expect = 3.5
Identities = 20/49 (40%), Positives = 25/49 (50%)
Frame = +2
Query: 417 SSPARRVIRGIDDPLLNGSISDSGISRAKKHRRKSGYTSAESGGESSSD 465
S+ A R D L N SIS S ++ K H RKS +S+ S S SD
Sbjct: 506 STSAEETSRTPVDKLDNVSISGSS-NKEKSHSRKSSSSSSSSSSSSDSD 649
>BF642107
Length = 690
Score = 28.9 bits (63), Expect = 4.6
Identities = 26/106 (24%), Positives = 42/106 (39%), Gaps = 5/106 (4%)
Frame = +3
Query: 367 KDVKFTVHQAVEP-----VEKGITRFNEKLHRISQNIKKHDKDRRRIKDDSCIASSSPAR 421
+D++ T + E +E N HR++ + D +R+KD + S AR
Sbjct: 246 EDIEITSREEQEEALVALIEHRTREVNNLRHRLAYTKNQLDDAEKRLKD----SESKLAR 413
Query: 422 RVIRGIDDPLLNGSISDSGISRAKKHRRKSGYTSAESGGESSSDQS 467
+RG + S D+G KK RR + S+ QS
Sbjct: 414 --LRGQTTKKITNSDDDNGTVAVKKERRSNSPIDRNERSYKSNKQS 545
>TC84594 similar to GP|23497092|gb|AAN36641.1 hypothetical protein
{Plasmodium falciparum 3D7}, partial (0%)
Length = 791
Score = 28.5 bits (62), Expect = 6.0
Identities = 13/23 (56%), Positives = 14/23 (60%), Gaps = 2/23 (8%)
Frame = -3
Query: 125 HHSHDYKGTVN--HPFPLKYFLK 145
HHSH Y TV HPFPL + K
Sbjct: 675 HHSHCYAPTVQFPHPFPLSLYRK 607
>TC77189 homologue to GP|13124865|gb|AAK11734.1 serine/threonine/tyrosine
kinase {Arachis hypogaea}, partial (95%)
Length = 1788
Score = 28.5 bits (62), Expect = 6.0
Identities = 16/56 (28%), Positives = 24/56 (42%), Gaps = 2/56 (3%)
Frame = +3
Query: 84 ESFAMYCFGRYLVACLGGEERTIEFMERQGRSASKTPLLLHHHSHDYK--GTVNHP 137
E+FA FG+ GE+ I+ +ER SK L+ + T+ HP
Sbjct: 672 EAFAQGAFGKLYRGTYNGEDVAIKILERPENDTSKAQLMEQQFQQEVMMLATLKHP 839
>TC77860 similar to GP|21593547|gb|AAM65514.1 unknown {Arabidopsis
thaliana}, partial (88%)
Length = 1355
Score = 28.5 bits (62), Expect = 6.0
Identities = 17/54 (31%), Positives = 24/54 (43%), Gaps = 4/54 (7%)
Frame = +1
Query: 110 ERQGRSASKTPLLLHHHSHDYKGTVNHPFPLKYFLKP----WILTRRFYQIVKF 159
E+Q S +T + HH HD + +K+ P W L RRF Q K+
Sbjct: 145 EQQQYSVIETQYIRRHHKHDLRDNQCSSALVKHIKAPVHLVWSLVRRFDQPQKY 306
>BQ123345
Length = 697
Score = 28.1 bits (61), Expect = 7.8
Identities = 13/28 (46%), Positives = 16/28 (56%), Gaps = 2/28 (7%)
Frame = -3
Query: 124 HHHSHDYKGTVNHPFPLKYFLK--PWIL 149
HH DYK + PF +K+ LK PW L
Sbjct: 233 HHLQFDYKTPMFFPFRMKWLLKLLPWYL 150
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.324 0.140 0.424
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,948,437
Number of Sequences: 36976
Number of extensions: 234753
Number of successful extensions: 1458
Number of sequences better than 10.0: 41
Number of HSP's better than 10.0 without gapping: 1439
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1451
length of query: 482
length of database: 9,014,727
effective HSP length: 100
effective length of query: 382
effective length of database: 5,317,127
effective search space: 2031142514
effective search space used: 2031142514
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 60 (27.7 bits)
Lotus: description of TM0117b.4