
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0195a.3
(660 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
CB894335 similar to GP|4406808|gb| unknown protein {Arabidopsis ... 246 2e-88
TC88389 similar to PIR|T04590|T04590 hypothetical protein F23E13... 202 2e-72
AW689980 181 8e-46
TC88388 similar to PIR|T04590|T04590 hypothetical protein F23E13... 149 3e-36
AW585318 similar to PIR|T04590|T045 hypothetical protein F23E13.... 123 3e-28
TC82237 weakly similar to GP|10140731|gb|AAG13564.1 hypothetical... 120 1e-27
TC78567 similar to PIR|T04143|T04143 CLB1 protein - tomato, part... 33 0.35
TC85985 similar to GP|5139695|dbj|BAA81686.1 expressed in cucumb... 33 0.35
TC86556 similar to PIR|T07598|T07598 proline-rich protein GPP1 -... 32 0.60
CB065264 similar to GP|1684733|emb| ORF247 protein {Pseudomonas ... 32 1.0
AW329791 weakly similar to GP|6957728|gb| hypothetical protein {... 31 1.3
TC87439 weakly similar to GP|5139695|dbj|BAA81686.1 expressed in... 30 2.3
BQ752298 similar to GP|11595639|em conserved hypothetical protei... 30 2.3
TC89471 similar to GP|18176080|gb|AAL59980.1 unknown protein {Ar... 30 3.0
TC77332 similar to GP|10177075|dbj|BAB10517. gene_id:MKP11.15~un... 30 3.0
TC77333 similar to GP|10177075|dbj|BAB10517. gene_id:MKP11.15~un... 30 3.0
TC77331 similar to GP|10177075|dbj|BAB10517. gene_id:MKP11.15~un... 30 3.0
TC86559 similar to GP|18476498|gb|AAL50314.1 ultraviolet-B-repre... 30 3.9
BQ140776 similar to SP|P35788|PYRE_ Orotate phosphoribosyltransf... 29 5.1
TC78344 similar to GP|13540405|gb|AAK29456.1 histone H1 {Lens cu... 29 5.1
>CB894335 similar to GP|4406808|gb| unknown protein {Arabidopsis thaliana},
partial (35%)
Length = 797
Score = 246 bits (629), Expect(2) = 2e-88
Identities = 124/165 (75%), Positives = 144/165 (87%)
Frame = +1
Query: 448 TSKIAVELMKGGAMMTVLSTLVAALAWPATLVTTFDLIDSKWAVAIDRSDKAGKVLAEVL 507
T +A++LMK GAMMTVLSTLV ALAWPA L+ D IDSKW +AI+RS+KAGK+LAEVL
Sbjct: 304 TLGLAMQLMKQGAMMTVLSTLVTALAWPAVLLAATDFIDSKWTIAINRSNKAGKLLAEVL 483
Query: 508 LKGLQGNRPVTLVGFSLGARVIFKCLQFLADSDGDNAGLVEKVVFLGAPISINDENWKAA 567
LKGLQGNRPVTLVG+SLGARVIF CLQ LA ++ + A LVE+VV LGAPISI DENW+AA
Sbjct: 484 LKGLQGNRPVTLVGYSLGARVIFSCLQCLAKTE-NGAELVERVVLLGAPISIRDENWEAA 660
Query: 568 RKMVAGRFVNAYSTNDWTLGITFRASLLSQGLAGIQPVDVPGIEN 612
RKMVAGRFVNAYS NDW LG+ FRASLL++GLAGI+PVD+PGI+N
Sbjct: 661 RKMVAGRFVNAYSRNDWILGVAFRASLLTKGLAGIEPVDIPGIQN 795
Score = 98.2 bits (243), Expect(2) = 2e-88
Identities = 52/91 (57%), Positives = 63/91 (69%), Gaps = 2/91 (2%)
Frame = +2
Query: 361 SFGAAGAGLTGSKMATRIGSLEEFELKEIGGVH--QGHLAVSISISGLAFEEKDFVKPWE 418
SFGAAGAGLTG + + + + G + QG L V I ISG FE++DF++PWE
Sbjct: 17 SFGAAGAGLTGKQDG*ESRGVLTSLISNLSGENHNQGRLGVEILISGFVFEKEDFIRPWE 196
Query: 419 GHYDNSERYVLQYESKNLIALSTAIQDWLTS 449
G DN ERY LQ+ESKNLIA+STAIQDWLTS
Sbjct: 197 GLNDNLERYSLQWESKNLIAVSTAIQDWLTS 289
>TC88389 similar to PIR|T04590|T04590 hypothetical protein F23E13.100 -
Arabidopsis thaliana, partial (37%)
Length = 630
Score = 202 bits (513), Expect(2) = 2e-72
Identities = 104/148 (70%), Positives = 120/148 (80%), Gaps = 2/148 (1%)
Frame = +2
Query: 280 ESIGSENNWDKWKRGGIIGAAAVT-GGTLMAITGGLAAPAIAHGLGALAPTFGSIVPFIG 338
E+ +E+ W KWKRGGIIGA +V G LMAITGGLAAPAIA GLGALAPT G+++P IG
Sbjct: 8 EAQSNESGWAKWKRGGIIGACSVNLGEXLMAITGGLAAPAIAAGLGALAPTLGTLIPVIG 187
Query: 339 AGGFAAAATATGSVAGSVAVAASFGAAGAGLTGSKMATRIGSLEEFELKEIGGVH-QGHL 397
AGGFAAAA+A G+VAGSVAVAASFGAAGAGLTG+KMA R+GS++EFE + IG H QG L
Sbjct: 188 AGGFAAAASAAGTVAGSVAVAASFGAAGAGLTGTKMARRVGSVDEFEFRAIGDNHNQGRL 367
Query: 398 AVSISISGLAFEEKDFVKPWEGHYDNSE 425
AV I +SG FEE DFV+PWEG DN E
Sbjct: 368 AVEILVSGFVFEEDDFVRPWEGQNDNLE 451
Score = 89.7 bits (221), Expect(2) = 2e-72
Identities = 44/58 (75%), Positives = 50/58 (85%)
Frame = +3
Query: 426 RYVLQYESKNLIALSTAIQDWLTSKIAVELMKGGAMMTVLSTLVAALAWPATLVTTFD 483
RY LQ+ESKNLIA+STAIQDWLTS+IA+ELMK GAMMTVLS L+ ALAWPA L+ D
Sbjct: 456 RYALQWESKNLIAVSTAIQDWLTSRIAMELMKQGAMMTVLSALLTALAWPAALLAATD 629
>AW689980
Length = 681
Score = 181 bits (459), Expect = 8e-46
Identities = 110/193 (56%), Positives = 128/193 (65%), Gaps = 15/193 (7%)
Frame = +3
Query: 3 ETPSMLLPPTHRYAAGALFALALHHSQIHQSPT------------TAQNAASVSDDPELW 50
E S +L P RYAA ALF+LALH SQ H+ PT T +++S SD+PELW
Sbjct: 81 EVTSSILSPAQRYAAAALFSLALHQSQFHERPTPSSPRTADADAPTTADSSSTSDNPELW 260
Query: 51 IHDNFGLLSPVLRFLGVDDQSSNGLKETAGSSSQFRHHLGSFLKLLAEESDANSSERLDK 110
IHDN GLL PV RFL VD+Q+ +GLKETAGSSSQFRHHLG F K+L+EE DA+S +RLDK
Sbjct: 261 IHDNSGLLLPVFRFLEVDEQAWHGLKETAGSSSQFRHHLGDFFKVLSEEGDASSKDRLDK 440
Query: 111 EAALTKAVDAMTESMSTASIAESSSGESGGHEQK--SQDYLCNTTAP-PVVEPHSDSKTK 167
EAALT AVDA SM SIA+SS GHEQK S+D LC+T V D +TK
Sbjct: 441 EAALTNAVDATEASMK--SIADSS---CRGHEQKTRSEDNLCDTELKLYAVGAEPDGETK 605
Query: 168 DSESSALLPQQKQ 180
ESSALLP Q
Sbjct: 606 --ESSALLPLXTQ 638
>TC88388 similar to PIR|T04590|T04590 hypothetical protein F23E13.100 -
Arabidopsis thaliana, partial (20%)
Length = 930
Score = 149 bits (377), Expect = 3e-36
Identities = 75/105 (71%), Positives = 87/105 (82%), Gaps = 1/105 (0%)
Frame = +3
Query: 323 LGALAPTFGSIVPFIGAGGFAAAATATGSVAGSVAVAASFGAAGAGLTGSKMATRIGSLE 382
LGALAPT G+++P IGAGGFAAAA+A G+VAGSVAVAASFGAAGAGLTG+KMA R+GS++
Sbjct: 3 LGALAPTLGTLIPVIGAGGFAAAASAAGTVAGSVAVAASFGAAGAGLTGTKMARRVGSVD 182
Query: 383 EFELKEIGGVH-QGHLAVSISISGLAFEEKDFVKPWEGHYDNSER 426
EFE + IG H QG LAV I +SG FEE DFV+PWEG DN ER
Sbjct: 183 EFEFRAIGDNHNQGRLAVEILVSGFVFEEDDFVRPWEGQNDNLER 317
>AW585318 similar to PIR|T04590|T045 hypothetical protein F23E13.100 -
Arabidopsis thaliana, partial (19%)
Length = 585
Score = 123 bits (308), Expect = 3e-28
Identities = 80/202 (39%), Positives = 126/202 (61%), Gaps = 17/202 (8%)
Frame = +2
Query: 99 ESDAN---SSERLDKEAALTKAVDAMTESMSTASIAESSSGES-GGHEQKSQDYLCNTTA 154
ESD N S++ L+ E AL+KA DA+ M + +S E +E + ++ +A
Sbjct: 2 ESDDNAA*SAQFLELELALSKAGDAVVLEMEKNAHPSNSKRERINEYEHQCRE---KYSA 172
Query: 155 PPV--------VEPHSDSKTKDSESSALL-----PQQKQASTELENVTLEKPLEEASLIS 201
P V V+ + D++ K+++++ L+ PQQ+ +++++ EKP+EE ++S
Sbjct: 173 PDVQSNPEKADVDVNLDNQ-KETDAAPLINLEDPPQQRSDNSKID----EKPIEEVLMLS 337
Query: 202 YQRKVTVLYALVSACVADTAEVDNKCCRSRQGYDARHRVSLRLIATWLGVKWNEMEAMES 261
QRKVTVLY L+SAC++D E D +C R R+GYDARHRV+LRL+ATWL VKW +MEA+E+
Sbjct: 338 DQRKVTVLYELLSACLSDLREDDKECKRRRKGYDARHRVALRLLATWLDVKWTKMEAIET 517
Query: 262 MVAFSLMNSLSEAGAKEDESIG 283
MV S M + E + ++E+ G
Sbjct: 518 MVTCSAMAIIKEQESNKEEAQG 583
>TC82237 weakly similar to GP|10140731|gb|AAG13564.1 hypothetical protein
{Oryza sativa}, partial (15%)
Length = 632
Score = 120 bits (302), Expect = 1e-27
Identities = 71/167 (42%), Positives = 102/167 (60%), Gaps = 19/167 (11%)
Frame = +1
Query: 1 MEETPS-------MLLPPTHRYAAGALFALALHHSQIHQ------------SPTTAQNAA 41
M TPS L PT RYAAGALF LALH +Q+HQ SP+++ +
Sbjct: 85 MSSTPSPSPSPSPSYLTPTQRYAAGALFGLALHQAQLHQTHPLGLSTDDFPSPSSSTSTG 264
Query: 42 SVSDDPELWIHDNFGLLSPVLRFLGVDDQSSNGLKETAGSSSQFRHHLGSFLKLLAEESD 101
+V +DP+LW+H GLL PV FL +D + +GL+ET+GSS R H+G FL+LL+EE D
Sbjct: 265 AVFEDPDLWVHHTSGLLRPVFIFLDIDSAAWSGLEETSGSSVATR-HVGPFLRLLSEEYD 441
Query: 102 ANSSERLDKEAALTKAVDAMTESMSTASIAESSSGESGGHEQKSQDY 148
++++RLD+E AL++AVDA+ S+ E +S S +K ++Y
Sbjct: 442 DDNAKRLDQELALSEAVDALVLSL------EKNSESSRSKRKKLREY 564
>TC78567 similar to PIR|T04143|T04143 CLB1 protein - tomato, partial (53%)
Length = 1159
Score = 33.1 bits (74), Expect = 0.35
Identities = 24/72 (33%), Positives = 34/72 (46%)
Frame = +3
Query: 299 AAAVTGGTLMAITGGLAAPAIAHGLGALAPTFGSIVPFIGAGGFAAAATATGSVAGSVAV 358
AA V G T+ A+ G+G + G + IGAG A +G AG+ V
Sbjct: 630 AAGVIGSTM---------DAVGSGVGLVGSGIGLVGTGIGAG---AGLVGSGIGAGAGLV 773
Query: 359 AASFGAAGAGLT 370
+ FGA G+GL+
Sbjct: 774 GSGFGAFGSGLS 809
>TC85985 similar to GP|5139695|dbj|BAA81686.1 expressed in cucumber
hypocotyls {Cucumis sativus}, partial (42%)
Length = 892
Score = 33.1 bits (74), Expect = 0.35
Identities = 28/88 (31%), Positives = 32/88 (35%), Gaps = 13/88 (14%)
Frame = -1
Query: 294 GGIIGAAAVTGGTLMAITGGLAAPAIAHGLGALAPTFGSIVPFIG-------------AG 340
GG + A G A TG AA +A G GA T G V +G AG
Sbjct: 388 GGELTGAGFGGVLTGATTGDDAAGDLATGTGAAGETAGVTVAAVGDEAGGDDFGEATGAG 209
Query: 341 GFAAAATATGSVAGSVAVAASFGAAGAG 368
F ATG+ G V V A G
Sbjct: 208 DFGLVGAATGAAEGVVTVGGDVVGADDG 125
>TC86556 similar to PIR|T07598|T07598 proline-rich protein GPP1 - potato,
partial (24%)
Length = 1250
Score = 32.3 bits (72), Expect = 0.60
Identities = 25/85 (29%), Positives = 34/85 (39%), Gaps = 6/85 (7%)
Frame = -2
Query: 294 GGIIGAAAVTGGTLMAITGGLAAPAIAHGLGALAPT----FGSIVPFIGAGGFAAAATAT 349
GG + +TGG + TGG +G T FG IG GG +A T
Sbjct: 553 GGFLNGGRITGGLYIGTTGGFGITTGGLYIGTGGRTGCSIFGGHGFLIGTGGGSAFGTGG 374
Query: 350 G--SVAGSVAVAASFGAAGAGLTGS 372
G + G ++FG LTG+
Sbjct: 373 GLYTGTGGKTGCSTFGGGHGFLTGT 299
>CB065264 similar to GP|1684733|emb| ORF247 protein {Pseudomonas stutzeri},
partial (8%)
Length = 580
Score = 31.6 bits (70), Expect = 1.0
Identities = 17/44 (38%), Positives = 21/44 (47%)
Frame = -1
Query: 438 ALSTAIQDWLTSKIAVELMKGGAMMTVLSTLVAALAWPATLVTT 481
AL Q WLTS +A + M L+ ALAWP+ L T
Sbjct: 580 ALRATAQTWLTSSLASSSAHSASPMPRLAPQTRALAWPSRLGIT 449
>AW329791 weakly similar to GP|6957728|gb| hypothetical protein {Arabidopsis
thaliana}, partial (8%)
Length = 531
Score = 31.2 bits (69), Expect = 1.3
Identities = 16/38 (42%), Positives = 19/38 (49%), Gaps = 2/38 (5%)
Frame = -2
Query: 4 TPSMLLPPTHRYAAGALFALALHH--SQIHQSPTTAQN 39
TP LL P+H Y F HH S +HQS + QN
Sbjct: 146 TPHELLSPSHPYTTKTTFYQFSHHQESSVHQSSSDTQN 33
>TC87439 weakly similar to GP|5139695|dbj|BAA81686.1 expressed in cucumber
hypocotyls {Cucumis sativus}, partial (57%)
Length = 897
Score = 30.4 bits (67), Expect = 2.3
Identities = 24/87 (27%), Positives = 36/87 (40%), Gaps = 11/87 (12%)
Frame = -3
Query: 296 IIGAAAVTGGTLMAITGGLAAPAIAHGLGALAPTFGS-----IVPFI------GAGGFAA 344
++GA A T G ++ G+ A G G G+ ++ + AGG A
Sbjct: 442 LLGAGAGTAGVVVGAATGVTAGGELTGAGVGGDFTGTGAGGGVLTGVTAGGDEAAGGVAV 263
Query: 345 AATATGSVAGSVAVAASFGAAGAGLTG 371
A A G VAG + + GA G+ G
Sbjct: 262 GAAALGDVAGGDDLGEATGAGVVGVVG 182
>BQ752298 similar to GP|11595639|em conserved hypothetical protein
{Neurospora crassa}, partial (2%)
Length = 834
Score = 30.4 bits (67), Expect = 2.3
Identities = 23/76 (30%), Positives = 33/76 (43%), Gaps = 2/76 (2%)
Frame = +2
Query: 296 IIGAAAVTGGTLMAITGGLAAPAIAHGL--GALAPTFGSIVPFIGAGGFAAAATATGSVA 353
++G G ++I GGL + ++ G GALA G ++ G GG
Sbjct: 422 MLGGRGPGGAPRLSIDGGLLS-SVPEGPPGGALARPIGGVIHAWGGGGVPVVDLTAALGG 598
Query: 354 GSVAVAASFGAAGAGL 369
G VAV A +A A L
Sbjct: 599 GGVAVFACVASAPAAL 646
>TC89471 similar to GP|18176080|gb|AAL59980.1 unknown protein {Arabidopsis
thaliana}, partial (44%)
Length = 852
Score = 30.0 bits (66), Expect = 3.0
Identities = 20/76 (26%), Positives = 33/76 (43%)
Frame = +3
Query: 102 ANSSERLDKEAALTKAVDAMTESMSTASIAESSSGESGGHEQKSQDYLCNTTAPPVVEPH 161
A +SERL K++ + K + + E SG ++Q +CN P ++PH
Sbjct: 570 AITSERLHKQSVVIKQICKLFPQRRVVIEGEKGDDCSGQYDQ-----ICNARLPRALDPH 734
Query: 162 SDSKTKDSESSALLPQ 177
S + S S + Q
Sbjct: 735 SVPSEELSASLGYMVQ 782
>TC77332 similar to GP|10177075|dbj|BAB10517. gene_id:MKP11.15~unknown
protein {Arabidopsis thaliana}, partial (12%)
Length = 636
Score = 30.0 bits (66), Expect = 3.0
Identities = 17/49 (34%), Positives = 27/49 (54%), Gaps = 2/49 (4%)
Frame = +2
Query: 122 TESMSTASIAESSSGESGGHEQKSQDY--LCNTTAPPVVEPHSDSKTKD 168
T+S TAS S G ++ KSQD + N AP V +P++ +K ++
Sbjct: 332 TKSTGTASKCHSKGGSQSENDAKSQDIPSVGNNLAPKVRKPYTITKQRE 478
>TC77333 similar to GP|10177075|dbj|BAB10517. gene_id:MKP11.15~unknown
protein {Arabidopsis thaliana}, partial (19%)
Length = 1058
Score = 30.0 bits (66), Expect = 3.0
Identities = 17/49 (34%), Positives = 27/49 (54%), Gaps = 2/49 (4%)
Frame = +1
Query: 122 TESMSTASIAESSSGESGGHEQKSQDY--LCNTTAPPVVEPHSDSKTKD 168
T+S TAS S G ++ KSQD + N AP V +P++ +K ++
Sbjct: 271 TKSTGTASKCHSKGGSQSENDAKSQDIPSVGNNLAPKVRKPYTITKQRE 417
>TC77331 similar to GP|10177075|dbj|BAB10517. gene_id:MKP11.15~unknown
protein {Arabidopsis thaliana}, partial (25%)
Length = 1835
Score = 30.0 bits (66), Expect = 3.0
Identities = 17/49 (34%), Positives = 27/49 (54%), Gaps = 2/49 (4%)
Frame = +1
Query: 122 TESMSTASIAESSSGESGGHEQKSQDY--LCNTTAPPVVEPHSDSKTKD 168
T+S TAS S G ++ KSQD + N AP V +P++ +K ++
Sbjct: 220 TKSTGTASKCHSKGGSQSENDAKSQDIPSVGNNLAPKVRKPYTITKQRE 366
>TC86559 similar to GP|18476498|gb|AAL50314.1 ultraviolet-B-repressible
protein {Pisum sativum}, partial (75%)
Length = 944
Score = 29.6 bits (65), Expect = 3.9
Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Frame = +2
Query: 301 AVTGGTLMAITGGLAAPAIAHGLGA-LAPTFGSIVPFIGAGGFAAAATATGSVAG 354
AVT T ++T + P +AH G+ L+P+ + + I AGG A G+V G
Sbjct: 242 AVTAITAASLTASMVIPDVAHAAGSDLSPSLQNFLLSIFAGG-AVLTAILGAVIG 403
>BQ140776 similar to SP|P35788|PYRE_ Orotate phosphoribosyltransferase (EC
2.4.2.10) (OPRT) (OPRTase). [)] {graminicola}, partial
(40%)
Length = 380
Score = 29.3 bits (64), Expect = 5.1
Identities = 22/93 (23%), Positives = 36/93 (38%)
Frame = -3
Query: 298 GAAAVTGGTLMAITGGLAAPAIAHGLGALAPTFGSIVPFIGAGGFAAAATATGSVAGSVA 357
G AV L+A+ G + + G G+ T A V G ++
Sbjct: 339 GLLAVEAVALLAVFGNVQVGQLVDGCGSEGDTL------------VRGAKQDIKVEGGIS 196
Query: 358 VAASFGAAGAGLTGSKMATRIGSLEEFELKEIG 390
+A S G G ++GS+EE ++E+G
Sbjct: 195 IAVSLHGLGIGXRHGSQ--QVGSVEEARIEEVG 103
>TC78344 similar to GP|13540405|gb|AAK29456.1 histone H1 {Lens culinaris},
partial (86%)
Length = 1215
Score = 29.3 bits (64), Expect = 5.1
Identities = 32/113 (28%), Positives = 41/113 (35%), Gaps = 9/113 (7%)
Frame = -1
Query: 305 GTLMAITGGLAAPAIAHG---------LGALAPTFGSIVPFIGAGGFAAAATATGSVAGS 355
G T G AA A+ G L LA F A GFA AAT T +
Sbjct: 903 GVAAFFTAGFAAAALFPGDVLVEVLEALAGLAAGFALAAGLALAAGFALAATGTALALAT 724
Query: 356 VAVAASFGAAGAGLTGSKMATRIGSLEEFELKEIGGVHQGHLAVSISISGLAF 408
A A F A + A G L G LA++++++G AF
Sbjct: 723 GAFALGFAFATGLALAAGFAFAAGLALAAGLALAGAALA--LALALALAGAAF 571
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.314 0.129 0.368
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,439,909
Number of Sequences: 36976
Number of extensions: 208466
Number of successful extensions: 967
Number of sequences better than 10.0: 48
Number of HSP's better than 10.0 without gapping: 941
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 955
length of query: 660
length of database: 9,014,727
effective HSP length: 102
effective length of query: 558
effective length of database: 5,243,175
effective search space: 2925691650
effective search space used: 2925691650
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 62 (28.5 bits)
Lotus: description of TM0195a.3