Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0152.2
         (603 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

BF644046                                                               50  3e-06
TC88472 similar to GP|19423958|gb|AAL87269.1 unknown protein {Ar...    36  0.048
BF641220 weakly similar to PIR|G86203|G86 probable N-arginine di...    35  0.11
AL388248 weakly similar to GP|6062758|gb|A NADH dehydrogenase su...    35  0.11
TC80288 weakly similar to PIR|T06029|T06029 hypothetical protein...    34  0.14
AW256780 similar to PIR|T12641|T126 NADH dehydrogenase (ubiquino...    33  0.31
TC82286 weakly similar to PIR|B96544|B96544 hypothetical protein...    33  0.41
TC92259 weakly similar to GP|10177335|dbj|BAB10684. nuclear matr...    32  0.70
AW690594 similar to GP|23498163|emb hypothetical protein {Plasmo...    32  0.70
TC87688 similar to GP|10177535|dbj|BAB10930. gene_id:K1F13.21~un...    32  0.70
TC81816 similar to GP|8096269|dbj|BAA95789.1 KED {Nicotiana taba...    32  0.70
TC79552 similar to GP|13877579|gb|AAK43867.1 putative T-complex ...    32  0.70
TC91658 similar to GP|14329812|emb|CAC40753. putative nucleosome...    32  0.70
TC81541 similar to GP|22597168|gb|AAN03471.1 unknown protein {Gl...    32  0.70
TC86145 homologue to GP|10334499|emb|CAC10211. hypothetical prot...    32  0.91
TC86146 homologue to GP|10334499|emb|CAC10211. hypothetical prot...    32  0.91
TC77101 similar to GP|15148920|gb|AAK84887.1 homeodomain leucine...    32  0.91
TC86552 similar to PIR|G85436|G85436 hypothetical protein AT4g36...    31  1.2
TC89336 weakly similar to GP|21554135|gb|AAM63215.1 unknown {Ara...    31  1.2
TC80296 similar to PIR|H86265|H86265 protein F3F19.18 [imported]...    31  1.2

>BF644046 
          Length = 597

 Score = 49.7 bits (117), Expect = 3e-06
 Identities = 24/55 (43%), Positives = 33/55 (59%)
 Frame = +3

Query: 82  HTYFFENIFTDLKCKLPLSDFTCSVLTLLNVAPTQLHCNSWAYLRAFELLCQVLG 136
           H Y F  +F D+  K P ++F C  L  LNVA +QLH N  A++  FE+ C+ LG
Sbjct: 9   HMYSF--VFEDIGFKFPFTNFECDFLKALNVASSQLHPNCCAFMCGFEISCESLG 167


>TC88472 similar to GP|19423958|gb|AAL87269.1 unknown protein {Arabidopsis
           thaliana}, partial (46%)
          Length = 1019

 Score = 35.8 bits (81), Expect = 0.048
 Identities = 31/114 (27%), Positives = 59/114 (51%), Gaps = 5/114 (4%)
 Frame = +2

Query: 437 KQLEEKEREILKMKATMKL-LDSANKVNEKKA----ADLALENERLKKHVEDLNITQKAK 491
           K++ + ERE +K +   +L LD AN ++  +     A + +E  RL K   DL    ++ 
Sbjct: 296 KRIHKAEREKMKREHLNELFLDLANALDLSEPNNGKASILIEASRLLK---DLLCQIQSL 466

Query: 492 EEELVKSKAEITHLNSSNAELKNENSKLHSEVSELKNSVLDQFEAGFAKAKEQI 545
           ++E V   +E  ++     ELK ENS L +++ +L+  +    +A  A++K  +
Sbjct: 467 KKENVSLLSESHYVTMEKNELKEENSSLETQIEKLQGEI----QARIAQSKPDL 616


>BF641220 weakly similar to PIR|G86203|G86 probable N-arginine dibasic
           convertase [imported] - Arabidopsis thaliana, partial
           (5%)
          Length = 634

 Score = 34.7 bits (78), Expect = 0.11
 Identities = 13/25 (52%), Positives = 21/25 (84%)
 Frame = +2

Query: 577 DEEDEGEEDKNEEDENVNDNEGEGE 601
           DE+DE E+D++EED+  +D+EGE +
Sbjct: 359 DEDDEEEDDEDEEDDEEDDDEGEDD 433



 Score = 30.8 bits (68), Expect = 1.6
 Identities = 13/29 (44%), Positives = 18/29 (61%)
 Frame = +2

Query: 571 ISPDTGDEEDEGEEDKNEEDENVNDNEGE 599
           I  D  +E+DE EED  E+D+   D+E E
Sbjct: 356 IDEDDEEEDDEDEEDDEEDDDEGEDDEDE 442



 Score = 29.6 bits (65), Expect = 3.5
 Identities = 10/36 (27%), Positives = 23/36 (63%)
 Frame = +2

Query: 567 DGKLISPDTGDEEDEGEEDKNEEDENVNDNEGEGEN 602
           DG +   D  +++++ E+D+ ++DE  +D + E E+
Sbjct: 347 DGSIDEDDEEEDDEDEEDDEEDDDEGEDDEDEEXED 454


>AL388248 weakly similar to GP|6062758|gb|A NADH dehydrogenase subunit II
           {Cynolebias alexandri}, partial (7%)
          Length = 417

 Score = 34.7 bits (78), Expect = 0.11
 Identities = 23/90 (25%), Positives = 43/90 (47%)
 Frame = -2

Query: 438 QLEEKEREILKMKATMKLLDSANKVNEKKAADLALENERLKKHVEDLNITQKAKEEELVK 497
           +L  KERE +K +      +   K + ++   L   NERLK+ + +     +A E E   
Sbjct: 395 ELLRKERENMKAQIASLQAEMEEKGDSEEVGTLQKHNERLKEKLANWKEKYEASETEREA 216

Query: 498 SKAEITHLNSSNAELKNENSKLHSEVSELK 527
           ++ E +  N+S   L  +  +L  ++ EL+
Sbjct: 215 AEGEASAANASVRRLTMKVLELSKQLKELQ 126


>TC80288 weakly similar to PIR|T06029|T06029 hypothetical protein T28I19.100
           - Arabidopsis thaliana, partial (14%)
          Length = 1460

 Score = 34.3 bits (77), Expect = 0.14
 Identities = 43/209 (20%), Positives = 87/209 (41%), Gaps = 16/209 (7%)
 Frame = +3

Query: 411 TKGLEIAAISKMIDLESADFDGINSAKQLEEKEREILKMKATMKLLDSANKVNEKKAADL 470
           +KG ++  + + I L    F  I   K   +K++E  K    + +    +++ +    DL
Sbjct: 183 SKGFKVKHVLQAILLLGVCFWLIYQVKHNHDKKKEFDKNDTKLPIRTETDQILKLGRKDL 362

Query: 471 ------ALENERLKKHVEDLNIT---QKAKEEELVKSKAEITHLNSSNAELKNENSKLHS 521
                 A +NE  ++  ED +I    Q  +E +  + + E  + + +  E ++   +   
Sbjct: 363 HPGKVEADKNEGHEEEEEDEHIVYNMQNKREHDEQQQEGEEGNKHETEEESEDNVHERRE 542

Query: 522 EVSELKNSVLDQFEAGFAKAKEQILFLNPQVSINL-------AGSDPYARIVDGKLISPD 574
           E  E +N    + +       E++      V I+        A +D    +VD +    +
Sbjct: 543 EQDEEENKHGAEVQEENESKSEEVEDEGGDVEIDENDHEKSEADNDREDEVVDEEKDKEE 722

Query: 575 TGDEEDEGEEDKNEEDENVNDNEGEGENH 603
            GD+E E E+ ++EE   + +N    ENH
Sbjct: 723 EGDDETENEDKEDEEKGGLVENH---ENH 800


>AW256780 similar to PIR|T12641|T126 NADH dehydrogenase (ubiquinone) (EC
           1.6.5.3) chain 5 - Brachypodium arbuscula chloroplast
           (fragment), partial (7%)
          Length = 724

 Score = 33.1 bits (74), Expect = 0.31
 Identities = 16/49 (32%), Positives = 25/49 (50%)
 Frame = -2

Query: 355 WKSLLKEFEELTSEEVTSLWDSKIDFNSLVETNLVFEADREKVKKIGLK 403
           WK L++EFE+L  + +  LW SK  +     T   F+ D   ++   LK
Sbjct: 657 WKELIEEFEKLIVKLILGLWKSKSSYERKQHTEGNFDVDTGDIEFSQLK 511


>TC82286 weakly similar to PIR|B96544|B96544 hypothetical protein F4M15.4
           [imported] - Arabidopsis thaliana, partial (5%)
          Length = 1782

 Score = 32.7 bits (73), Expect = 0.41
 Identities = 14/29 (48%), Positives = 21/29 (72%)
 Frame = +2

Query: 574 DTGDEEDEGEEDKNEEDENVNDNEGEGEN 602
           D GDEE E EE++NEE E+  ++  +GE+
Sbjct: 404 DWGDEEKEEEEEENEEKEDEAEHMNDGES 490


>TC92259 weakly similar to GP|10177335|dbj|BAB10684. nuclear matrix
           constituent protein 1 (NMCP1)-like {Arabidopsis
           thaliana}, partial (7%)
          Length = 630

 Score = 32.0 bits (71), Expect = 0.70
 Identities = 24/91 (26%), Positives = 45/91 (49%)
 Frame = +1

Query: 399 KIGLKEACQAIMTKGLEIAAISKMIDLESADFDGINSAKQLEEKEREILKMKATMKLLDS 458
           ++ LKE    + ++ LE+ A +  +  E A F+     + L+EK+ E+ K    ++    
Sbjct: 1   EVKLKEEIDLVRSQNLELLAQADKLKAEKAKFEV--EWELLDEKKEELRKEAEFIE---- 162

Query: 459 ANKVNEKKAADLALENERLKKHVEDLNITQK 489
               NE+KA    ++NER K   E  N+ ++
Sbjct: 163 ----NERKAVSTFVKNERDKLREEKENLRKQ 243


>AW690594 similar to GP|23498163|emb hypothetical protein {Plasmodium
           falciparum 3D7}, partial (10%)
          Length = 633

 Score = 32.0 bits (71), Expect = 0.70
 Identities = 13/26 (50%), Positives = 19/26 (73%)
 Frame = +3

Query: 574 DTGDEEDEGEEDKNEEDENVNDNEGE 599
           D  D+E+E EE++ EE+E  +D EGE
Sbjct: 165 DEHDDEEEEEEEEEEEEEEDDDEEGE 242



 Score = 31.6 bits (70), Expect = 0.91
 Identities = 14/28 (50%), Positives = 20/28 (71%)
 Frame = +3

Query: 574 DTGDEEDEGEEDKNEEDENVNDNEGEGE 601
           D  DEE+E EE++ EEDE+ ++ E E E
Sbjct: 117 DEHDEEEEEEEEEEEEDEHDDEEEEEEE 200



 Score = 29.3 bits (64), Expect = 4.5
 Identities = 11/25 (44%), Positives = 18/25 (72%)
 Frame = +3

Query: 577 DEEDEGEEDKNEEDENVNDNEGEGE 601
           + +DE EE++ EE+E   D++ EGE
Sbjct: 168 EHDDEEEEEEEEEEEEEEDDDEEGE 242


>TC87688 similar to GP|10177535|dbj|BAB10930. gene_id:K1F13.21~unknown
           protein {Arabidopsis thaliana}, partial (46%)
          Length = 2077

 Score = 32.0 bits (71), Expect = 0.70
 Identities = 42/196 (21%), Positives = 70/196 (35%), Gaps = 7/196 (3%)
 Frame = +2

Query: 403 KEACQAIMTKGLEIAAISKMIDLESADFDGI--NSAKQLEEKEREILKMKATMKLLDSAN 460
           K     ++  G +   I   IDL+S            Q  +   EI + K     LD   
Sbjct: 173 KSPLDELLVDGYDAEQIWHQIDLQSQPLLSTLRRRLNQFVKNPEEIAQFKVP---LDVGK 343

Query: 461 KVNEKKAADLALE-----NERLKKHVEDLNITQKAKEEELVKSKAEITHLNSSNAELKNE 515
           K+ +KK  +L  E     +E L    +D    +K K +   + + +    +  + E   +
Sbjct: 344 KLEKKKRVELEEEESDDFDEELDDDDDDFEGVEKKKAKGGSEGEDDFEEEDDEDEEGSED 523

Query: 516 NSKLHSEVSELKNSVLDQFEAGFAKAKEQILFLNPQVSINLAGSDPYARIVDGKLISPDT 575
                 E  ++K   +   E  F K  E   +L  +        D Y +           
Sbjct: 524 EDDEEDEKEKVKGGGI---EDKFLKIDELTEYLEKE-------EDNYEK----------- 640

Query: 576 GDEEDEGEEDKNEEDE 591
           G+E DE +ED  E+DE
Sbjct: 641 GEERDEADEDSEEDDE 688


>TC81816 similar to GP|8096269|dbj|BAA95789.1 KED {Nicotiana tabacum},
           partial (13%)
          Length = 663

 Score = 32.0 bits (71), Expect = 0.70
 Identities = 27/140 (19%), Positives = 60/140 (42%), Gaps = 1/140 (0%)
 Frame = +3

Query: 461 KVNEKKAADLALENERLKKHVEDLNITQK-AKEEELVKSKAEITHLNSSNAELKNENSKL 519
           K+++K A D+  +   ++K +E  ++ +   K+E+  K K + T ++    +   E  K 
Sbjct: 90  KIDDKSAGDVKEDKVEIEKDLEIKSVEKDDEKKEKKDKEKKDKTDVDEGKDKKDKEKKKK 269

Query: 520 HSEVSELKNSVLDQFEAGFAKAKEQILFLNPQVSINLAGSDPYARIVDGKLISPDTGDEE 579
             +   +K    D  E    + K++                   +   GK      G+E+
Sbjct: 270 EKKEENVKGEEEDGDEKKDKEKKKK------------------EKKEKGKEDKDKDGEEK 395

Query: 580 DEGEEDKNEEDENVNDNEGE 599
              ++ + ++D+N +D+EGE
Sbjct: 396 KSKKDKEKKKDKNEDDDEGE 455


>TC79552 similar to GP|13877579|gb|AAK43867.1 putative T-complex protein 1
           theta subunit; TCP-1-Theta {Arabidopsis thaliana},
           partial (39%)
          Length = 1063

 Score = 32.0 bits (71), Expect = 0.70
 Identities = 33/142 (23%), Positives = 63/142 (44%)
 Frame = +3

Query: 384 VETNLVFEADREKVKKIGLKEACQAIMTKGLEIAAISKMIDLESADFDGINSAKQLEEKE 443
           V++  V E    +V  +  +E   ++ T  L  +  S + D+E A  DG+N+ K +    
Sbjct: 102 VDSVSVEEIGGARVTIVKNEEGGNSVATVVLRGSTDSILDDIERAVDDGVNTYKTMCRDS 281

Query: 444 REILKMKATMKLLDSANKVNEKKAADLALENERLKKHVEDLNITQKAKEEELVKSKAEIT 503
           R +    AT   ++ A +V E    +  L+   + K  E   +  +   E    +  EI 
Sbjct: 282 RIVPGAAATE--IELAKRVKEFSFKETGLDQYAIAKFAESFEMIPRTLAENAGLNAMEI- 452

Query: 504 HLNSSNAELKNENSKLHSEVSE 525
            ++S  AE  + N+K+  ++ E
Sbjct: 453 -ISSLYAEHASGNTKVGIDLDE 515


>TC91658 similar to GP|14329812|emb|CAC40753. putative nucleosome assembly
           protein 1 {Atropa belladonna}, partial (46%)
          Length = 583

 Score = 32.0 bits (71), Expect = 0.70
 Identities = 12/28 (42%), Positives = 20/28 (70%)
 Frame = +2

Query: 574 DTGDEEDEGEEDKNEEDENVNDNEGEGE 601
           + GDEED+ ++D +++DE   D+E E E
Sbjct: 98  EDGDEEDDDDDDDDDDDEEDEDDEEEDE 181


>TC81541 similar to GP|22597168|gb|AAN03471.1 unknown protein {Glycine max},
           partial (36%)
          Length = 927

 Score = 32.0 bits (71), Expect = 0.70
 Identities = 14/51 (27%), Positives = 24/51 (46%)
 Frame = +3

Query: 302 ESNRPQKKKRKNETPESAKGKDSSQPSMEKFMVKGNPQHMTLKAGSSSAPP 352
           +S++P+  ++    PE+ K KD   P+    M+   P  M +  G    PP
Sbjct: 735 DSDKPKDAEKPKPKPEAEKPKDKPAPTAMPMMIPQMPPPMAVPVGMCYVPP 887


>TC86145 homologue to GP|10334499|emb|CAC10211. hypothetical protein {Cicer
           arietinum}, partial (91%)
          Length = 1311

 Score = 31.6 bits (70), Expect = 0.91
 Identities = 16/49 (32%), Positives = 27/49 (54%), Gaps = 5/49 (10%)
 Frame = +1

Query: 560 DPYARIVDGKLISPDTGDEEDEGEEDKNEEDENVND-----NEGEGENH 603
           D +  + DG  +  +  DE+D+ EED  +EDE+  D     + G+ EN+
Sbjct: 448 DDFDDLHDGTDVDDEDDDEDDDNEEDYEDEDEDAFDVHDHASVGDRENN 594


>TC86146 homologue to GP|10334499|emb|CAC10211. hypothetical protein {Cicer
           arietinum}, partial (95%)
          Length = 1084

 Score = 31.6 bits (70), Expect = 0.91
 Identities = 16/49 (32%), Positives = 27/49 (54%), Gaps = 5/49 (10%)
 Frame = +3

Query: 560 DPYARIVDGKLISPDTGDEEDEGEEDKNEEDENVND-----NEGEGENH 603
           D +  + DG  +  +  DE+D+ EED  +EDE+  D     + G+ EN+
Sbjct: 267 DDFDDLHDGTDVDDEDDDEDDDNEEDYEDEDEDAFDVHDHASVGDRENN 413


>TC77101 similar to GP|15148920|gb|AAK84887.1 homeodomain leucine zipper
           protein HDZ3 {Phaseolus vulgaris}, complete
          Length = 1532

 Score = 31.6 bits (70), Expect = 0.91
 Identities = 35/137 (25%), Positives = 59/137 (42%)
 Frame = +1

Query: 365 LTSEEVTSLWDSKIDFNSLVETNLVFEADREKVKKIGLKEACQAIMTKGLEIAAISKMID 424
           LTSE+V  L  S  + N L       E   +  KK+GL+    A+  +       +K ++
Sbjct: 643 LTSEQVHMLEKSFEEENKLEP-----ERKTQLAKKLGLQPRQVAVWFQNRRARWKTKQLE 807

Query: 425 LESADFDGINSAKQLEEKEREILKMKATMKLLDSANKVNEKKAADLALENERLKKHVEDL 484
               D+D + S+              + +   DS NK NEK  +++   NE+L+   +D+
Sbjct: 808 ---RDYDVLKSSYD------------SLLSTYDSINKENEKLKSEVVSLNEKLQVQAKDM 942

Query: 485 NITQKAKEEELVKSKAE 501
                  EE L + KA+
Sbjct: 943 ------LEEPLSEKKAD 975


>TC86552 similar to PIR|G85436|G85436 hypothetical protein AT4g36980
           [imported] - Arabidopsis thaliana, partial (59%)
          Length = 1785

 Score = 31.2 bits (69), Expect = 1.2
 Identities = 13/34 (38%), Positives = 24/34 (70%)
 Frame = +2

Query: 567 DGKLISPDTGDEEDEGEEDKNEEDENVNDNEGEG 600
           +GK  S  + D+++E E+D+++ED N +D+  EG
Sbjct: 680 NGKEESQISDDDDEEDEDDEDDEDFNSDDSNDEG 781


>TC89336 weakly similar to GP|21554135|gb|AAM63215.1 unknown {Arabidopsis
           thaliana}, partial (7%)
          Length = 1241

 Score = 31.2 bits (69), Expect = 1.2
 Identities = 22/118 (18%), Positives = 54/118 (45%)
 Frame = +1

Query: 428 ADFDGINSAKQLEEKEREILKMKATMKLLDSANKVNEKKAADLALENERLKKHVEDLNIT 487
           + F G    K  +E  ++ +K +  +++ +   K++E++   L  +NE+ +  ++ +   
Sbjct: 430 SQFVGEGENKSYDELLKKFIKNEEELRVSNLKLKLSEEEIIKLKNQNEKSEGQLDSVQKE 609

Query: 488 QKAKEEELVKSKAEITHLNSSNAELKNENSKLHSEVSELKNSVLDQFEAGFAKAKEQI 545
                +EL   K ++  L    AEL+     L  ++ E+ N  L       A+ ++++
Sbjct: 610 LTLNMDELEHKKGQVLELQKQKAELETHVPNLVEQL-EVANEHLKISNDEVARLRKEL 780


>TC80296 similar to PIR|H86265|H86265 protein F3F19.18 [imported] -
           Arabidopsis thaliana, partial (34%)
          Length = 1734

 Score = 31.2 bits (69), Expect = 1.2
 Identities = 15/45 (33%), Positives = 27/45 (59%), Gaps = 1/45 (2%)
 Frame = +3

Query: 560 DPYARIVDGKLISPDTGDEEDEGEE-DKNEEDENVNDNEGEGENH 603
           D   R  D +    D  + EDEG++ + +EED  ++++EG+G+ H
Sbjct: 921 DENDRSSDYETSGDDADNVEDEGDDLEDSEEDGGISEHEGDGDLH 1055


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.313    0.130    0.368 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,041,234
Number of Sequences: 36976
Number of extensions: 245893
Number of successful extensions: 1521
Number of sequences better than 10.0: 83
Number of HSP's better than 10.0 without gapping: 1288
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1419
length of query: 603
length of database: 9,014,727
effective HSP length: 102
effective length of query: 501
effective length of database: 5,243,175
effective search space: 2626830675
effective search space used: 2626830675
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 61 (28.1 bits)


Lotus: description of TM0152.2