Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0174.1
         (590 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

TC80017 similar to GP|10177301|dbj|BAB10562. gene_id:MDC12.17~un...   312  e-108
TC92121 weakly similar to GP|10177302|dbj|BAB10563. gene_id:MDC1...   332  2e-91
TC77416 similar to GP|18139887|gb|AAL60196.1 O-linked N-acetyl g...    74  2e-13
TC87394 similar to GP|3582779|gb|AAC69180.1| peroxisomal targeti...    51  1e-06
TC81564 weakly similar to PIR|E85082|E85082 hypothetical protein...    49  5e-06
TC79338 homologue to GP|20259245|gb|AAM14358.1 putative N-termin...    44  1e-04
TC89875 similar to GP|20259245|gb|AAM14358.1 putative N-terminal...    44  2e-04
TC90086 similar to PIR|G86185|G86185 hypothetical protein [impor...    38  0.013
TC80284 similar to PIR|T04740|T04740 hypothetical protein F6G17....    37  0.028
TC82644 similar to GP|21537266|gb|AAM61607.1 unknown {Arabidopsi...    37  0.028
BE249103 weakly similar to GP|6630450|gb| F23N19.10 {Arabidopsis...    36  0.048
TC92578 similar to GP|10177072|dbj|BAB10514. contains similarity...    33  0.40
AW586685 similar to PIR|T01081|T01 hypothetical protein T10P11.3...    32  0.69
TC83149 similar to GP|13561982|gb|AAK30594.1 flagelliform silk p...    31  1.2
BG449158 similar to GP|17979432|gb putative TPR repeat nuclear p...    31  1.5
BG455485                                                               31  1.5
TC79065 homologue to SP|P12204|YCF3_TOBAC Photosystem I assembly...    30  2.6
TC86990 similar to GP|22136840|gb|AAM91764.1 unknown protein {Ar...    29  4.5
CA919975 weakly similar to GP|3819697|emb| BnMAP4K alpha1 {Brass...    28  7.6

>TC80017 similar to GP|10177301|dbj|BAB10562. gene_id:MDC12.17~unknown
           protein {Arabidopsis thaliana}, partial (45%)
          Length = 879

 Score =  312 bits (799), Expect(2) = e-108
 Identities = 156/185 (84%), Positives = 170/185 (91%), Gaps = 1/185 (0%)
 Frame = +2

Query: 3   RLTNDENSQDKSLLSKDTDSTEGEGKKSHKLGKCRSRPSKTDS-LDCGGDADVDQHVQGA 61
           RL NDENSQDKSLLSKDT+S EGEGK  +KLGKCRS+PSKTDS +DCG DAD DQHVQGA
Sbjct: 167 RLINDENSQDKSLLSKDTNSNEGEGKLLNKLGKCRSKPSKTDSSIDCGADADGDQHVQGA 346

Query: 62  PSSREEKVSSMKTGLIHVARKMPKNAHAHFILGLMHQRLNQPQKAILVYEKAEEILLRPE 121
           PS+REEKVSSMKTGL+HVARKMPKNAHAHFILGLM+QRLNQPQKAIL YEKAEEILLRPE
Sbjct: 347 PSAREEKVSSMKTGLVHVARKMPKNAHAHFILGLMYQRLNQPQKAILAYEKAEEILLRPE 526

Query: 122 TEIERPDLLSLVQIHHAQCLILESSSENSSDKELEPHELKEILSKLKESVQFDIRQAAVW 181
            EI+R + L+LVQIHHAQCLI+ESSSENSSD+ELEPHEL+EI+SKLKES Q DIRQAAVW
Sbjct: 527 VEIDRAEFLALVQIHHAQCLIIESSSENSSDQELEPHELEEIISKLKESTQSDIRQAAVW 706

Query: 182 NTLGF 186
           N  GF
Sbjct: 707 NYTGF 721



 Score = 97.8 bits (242), Expect(2) = e-108
 Identities = 50/56 (89%), Positives = 53/56 (94%)
 Frame = +3

Query: 183 TLGFILLKTGRVQSAISVLSSLLAIAPENYDCLGNLGIAYLQIGNLELSAKCFQEL 238
           TLGFILLKTGRVQSAISVLSSLLAI+PENYDCLGNLGIAYLQIG+LELSAK   E+
Sbjct: 711 TLGFILLKTGRVQSAISVLSSLLAISPENYDCLGNLGIAYLQIGDLELSAKSVPEV 878


>TC92121 weakly similar to GP|10177302|dbj|BAB10563.
           gene_id:MDC12.18~pir||F69210~similar to unknown protein
           {Arabidopsis thaliana}, partial (76%)
          Length = 845

 Score =  332 bits (852), Expect = 2e-91
 Identities = 165/203 (81%), Positives = 182/203 (89%)
 Frame = +3

Query: 385 AGLAMVHKAQHEISSAYESEQHVLTEMEERAVCSLKQAVAEDPDDPVRWHQLGVHSLCTQ 444
           A L M HKAQHEIS+AYESEQ  L E+EE AV SLKQA+AEDPDD V+WHQLG+HSLC +
Sbjct: 15  ARLQMSHKAQHEISAAYESEQDGLKEIEECAVSSLKQAIAEDPDDAVQWHQLGLHSLCAR 194

Query: 445 QFKTSQKYLKAAVACDRGCSYTWSNLGVSLQLSEEPSQAEKAYKQALLLATKQQAHAILS 504
           +FKTSQKYLKAAVACD+GCSY WSNLGVSLQLSEE SQAE+ YK AL LATKQ+AHAILS
Sbjct: 195 EFKTSQKYLKAAVACDKGCSYAWSNLGVSLQLSEEQSQAEEVYKWALSLATKQEAHAILS 374

Query: 505 NLGIFYRHEKKYQRAKAMFTKSLELQPGYAPAFNNLGLVFVAEGLLEEAKYCFEKALQSD 564
           N+GI YR +KKY+ AKAMFTKSLELQPGYAPAFNNLGLVF+AEGLLEEAK+CFEKALQSD
Sbjct: 375 NMGILYRQQKKYELAKAMFTKSLELQPGYAPAFNNLGLVFIAEGLLEEAKHCFEKALQSD 554

Query: 565 PLLDAAKSNLVKVVTMSKICKGL 587
            +LDAAKSNL+KV TMSKICK L
Sbjct: 555 SMLDAAKSNLIKVATMSKICKDL 623



 Score = 30.8 bits (68), Expect = 1.5
 Identities = 14/64 (21%), Positives = 32/64 (49%)
 Frame = +3

Query: 179 AVWNTLGFILLKTGRVQSAISVLSSLLAIAPENYDCLGNLGIAYLQIGNLELSAKCFQEL 238
           A+ + +G +  +  + + A ++ +  L + P       NLG+ ++  G LE +  CF++ 
Sbjct: 363 AILSNMGILYRQQKKYELAKAMFTKSLELQPGYAPAFNNLGLVFIAEGLLEEAKHCFEKA 542

Query: 239 ILKD 242
           +  D
Sbjct: 543 LQSD 554


>TC77416 similar to GP|18139887|gb|AAL60196.1 O-linked N-acetyl glucosamine
           transferase {Arabidopsis thaliana}, partial (94%)
          Length = 3465

 Score = 73.6 bits (179), Expect = 2e-13
 Identities = 84/396 (21%), Positives = 143/396 (35%), Gaps = 13/396 (3%)
 Frame = +2

Query: 184 LGFILLKTGRVQSAISVLSSLLAIAPENYDCLGNLGIAYLQIGNLELSAKCFQELILKDQ 243
           LG I  +       ++     L I P   +C GN+  A+ + GN++L+ + +   I    
Sbjct: 509 LGAIYYQLHDFDMCVAKNEEALRIEPHFAECYGNMANAWKEKGNIDLAIRYYLIAIELRP 688

Query: 244 NHPVALVNYAALLL-----------CKYASVVAG--AGASASEGALADQVMAANVAKECL 290
           N   A  N A+  +           C+ A  +      A ++ G L         A  C 
Sbjct: 689 NFADAWSNLASAYMRKGRLTEAAQCCRQALAINPLMVDAHSNLGNLMKAQGLVQEAYSCY 868

Query: 291 LAAIKADVKSAHIWGNLAYAFSISGDHRSSSKCLEKAAKLEPNCMSTRYAVATHRMKEAE 350
           L A++     A  W NLA  F  SGD   + +  ++A KL+P+                 
Sbjct: 869 LEALRIQPTFAIAWSNLAGLFMESGDFNRALQYYKEAVKLKPS----------------- 997

Query: 351 RSQDPSELLSFGGNEMASIIRDGDSSLVELPTAWAGLAMVHKAQHEISSAYESEQHVLTE 410
                                         P A+  L  V+KA                 
Sbjct: 998 -----------------------------FPDAYLNLGNVYKA---------------LG 1045

Query: 411  MEERAVCSLKQAVAEDPDDPVRWHQLGVHSLCTQQFKTSQKYLKAAVACDRGCSYTWSNL 470
            M + A+   + A+   P+  + +  L        Q   +  + K A+ACD      ++NL
Sbjct: 1046 MPQEAIACYQHALQTRPNYGMAYGNLASIHYEQGQLDMAILHYKQAIACDPRFLEAYNNL 1225

Query: 471  GVSLQLSEEPSQAEKAYKQALLLATKQQAHAILSNLGIFYRHEKKYQRAKAMFTKSLELQ 530
            G +L+      +A + Y Q L L         L+NLG  Y        A + +  +L + 
Sbjct: 1226 GNALKDVGRVEEAIQCYNQCLSLQPNHPQ--ALTNLGNIYMEWNMVAAAASYYKATLNVT 1399

Query: 531  PGYAPAFNNLGLVFVAEGLLEEAKYCFEKALQSDPL 566
             G +  +NNL +++  +G   +A  C+ + L+ DPL
Sbjct: 1400 TGLSAPYNNLAIIYKQQGNYADAISCYNEVLRIDPL 1507



 Score = 53.5 bits (127), Expect = 2e-07
 Identities = 62/282 (21%), Positives = 107/282 (36%)
 Frame = +2

Query: 293 AIKADVKSAHIWGNLAYAFSISGDHRSSSKCLEKAAKLEPNCMSTRYAVATHRMKEAERS 352
           A++ +   A  +GN+A A+   G+   + +    A +L PN       +A+  M++   +
Sbjct: 569 ALRIEPHFAECYGNMANAWKEKGNIDLAIRYYLIAIELRPNFADAWSNLASAYMRKGRLT 748

Query: 353 QDPSELLSFGGNEMASIIRDGDSSLVELPTAWAGLAMVHKAQHEISSAYESEQHVLTEME 412
                       E A   R   +    +  A + L  + KAQ  +  AY           
Sbjct: 749 ------------EAAQCCRQALAINPLMVDAHSNLGNLMKAQGLVQEAYS---------- 862

Query: 413 ERAVCSLKQAVAEDPDDPVRWHQLGVHSLCTQQFKTSQKYLKAAVACDRGCSYTWSNLGV 472
               C L +A+   P   + W  L    + +  F  + +Y K AV         + NLG 
Sbjct: 863 ----CYL-EALRIQPTFAIAWSNLAGLFMESGDFNRALQYYKEAVKLKPSFPDAYLNLGN 1027

Query: 473  SLQLSEEPSQAEKAYKQALLLATKQQAHAILSNLGIFYRHEKKYQRAKAMFTKSLELQPG 532
              +    P +A   Y+ AL   T+        NL   +  + +   A   + +++   P 
Sbjct: 1028 VYKALGMPQEAIACYQHAL--QTRPNYGMAYGNLASIHYEQGQLDMAILHYKQAIACDPR 1201

Query: 533  YAPAFNNLGLVFVAEGLLEEAKYCFEKALQSDPLLDAAKSNL 574
            +  A+NNLG      G +EEA  C+ + L   P    A +NL
Sbjct: 1202 FLEAYNNLGNALKDVGRVEEAIQCYNQCLSLQPNHPQALTNL 1327



 Score = 52.4 bits (124), Expect = 5e-07
 Identities = 35/111 (31%), Positives = 57/111 (50%), Gaps = 1/111 (0%)
 Frame = +2

Query: 468 SNLGVSLQLSEEPSQAEKAYKQALLLATKQQAHAIL-SNLGIFYRHEKKYQRAKAMFTKS 526
           SNLG  ++      +A   Y +AL +   Q   AI  SNL   +     + RA   + ++
Sbjct: 809 SNLGNLMKAQGLVQEAYSCYLEALRI---QPTFAIAWSNLAGLFMESGDFNRALQYYKEA 979

Query: 527 LELQPGYAPAFNNLGLVFVAEGLLEEAKYCFEKALQSDPLLDAAKSNLVKV 577
           ++L+P +  A+ NLG V+ A G+ +EA  C++ ALQ+ P    A  NL  +
Sbjct: 980 VKLKPSFPDAYLNLGNVYKALGMPQEAIACYQHALQTRPNYGMAYGNLASI 1132



 Score = 51.2 bits (121), Expect = 1e-06
 Identities = 61/258 (23%), Positives = 105/258 (40%), Gaps = 14/258 (5%)
 Frame = +2

Query: 84   PKNAHAHFILGLMHQRLNQPQKAILVYEKAEEILLRPETEIERPDLLSLVQIHHAQCLIL 143
            P    A+  LG +++ L  PQ+AI  Y+ A  +  RP   +   +L S   IH+ Q    
Sbjct: 992  PSFPDAYLNLGNVYKALGMPQEAIACYQHA--LQTRPNYGMAYGNLAS---IHYEQ---- 1144

Query: 144  ESSSENSSDKELEPHELKEILSKLKESVQFDIRQAAVWNTLGFILLKTGRVQSAISVLSS 203
                           +L   +   K+++  D R    +N LG  L   GRV+ AI   + 
Sbjct: 1145 --------------GQLDMAILHYKQAIACDPRFLEAYNNLGNALKDVGRVEEAIQCYNQ 1282

Query: 204  LLAIAPENYDCLGNLGIAYLQIGNLELSAKCFQELI-----LKDQNHPVALV-----NYA 253
             L++ P +   L NLG  Y++   +  +A  ++  +     L    + +A++     NYA
Sbjct: 1283 CLSLQPNHPQALTNLGNIYMEWNMVAAAASYYKATLNVTTGLSAPYNNLAIIYKQQGNYA 1462

Query: 254  ALLLCKYASVV----AGAGASASEGALADQVMAANVAKECLLAAIKADVKSAHIWGNLAY 309
              + C Y  V+      A    + G    ++   + A +  + AI      A    NLA 
Sbjct: 1463 DAISC-YNEVLRIDPLAADGLVNRGNTYKEIGRVSDAIQDYIRAITVRPTMAEAHANLAS 1639

Query: 310  AFSISGDHRSSSKCLEKA 327
            A+  SG   ++ K   +A
Sbjct: 1640 AYKDSGHVEAAVKSYRQA 1693



 Score = 47.0 bits (110), Expect = 2e-05
 Identities = 81/407 (19%), Positives = 138/407 (33%)
 Frame = +2

Query: 178  AAVWNTLGFILLKTGRVQSAISVLSSLLAIAPENYDCLGNLGIAYLQIGNLELSAKCFQE 237
            A  W+ L    ++ GR+  A       LAI P   D   NLG      G ++ +  C+ E
Sbjct: 695  ADAWSNLASAYMRKGRLTEAAQCCRQALAINPLMVDAHSNLGNLMKAQGLVQEAYSCYLE 874

Query: 238  LILKDQNHPVALVNYAALLLCKYASVVAGAGASASEGALADQVMAANVAKECLLAAIKAD 297
             +       +A  N A L +                    D   A    KE    A+K  
Sbjct: 875  ALRIQPTFAIAWSNLAGLFM-----------------ESGDFNRALQYYKE----AVKLK 991

Query: 298  VKSAHIWGNLAYAFSISGDHRSSSKCLEKAAKLEPNCMSTRYAVATHRMKEAERSQDPSE 357
                  + NL   +   G  + +  C + A +  PN     Y +A   +      Q    
Sbjct: 992  PSFPDAYLNLGNVYKALGMPQEAIACYQHALQTRPN-----YGMAYGNLASIHYEQGQL- 1153

Query: 358  LLSFGGNEMASIIRDGDSSLVELPTAWAGLAMVHKAQHEISSAYESEQHVLTEMEERAVC 417
                            D +++    A A      +A + + +A +    V     E A+ 
Sbjct: 1154 ----------------DMAILHYKQAIACDPRFLEAYNNLGNALKDVGRV-----EEAIQ 1270

Query: 418  SLKQAVAEDPDDPVRWHQLGVHSLCTQQFKTSQKYLKAAVACDRGCSYTWSNLGVSLQLS 477
               Q ++  P+ P     LG   +       +  Y KA +    G S  ++NL +  +  
Sbjct: 1271 CYNQCLSLQPNHPQALTNLGNIYMEWNMVAAAASYYKATLNVTTGLSAPYNNLAIIYKQQ 1450

Query: 478  EEPSQAEKAYKQALLLATKQQAHAILSNLGIFYRHEKKYQRAKAMFTKSLELQPGYAPAF 537
               + A   Y + L +     A   L N G  Y+   +   A   + +++ ++P  A A 
Sbjct: 1451 GNYADAISCYNEVLRI--DPLAADGLVNRGNTYKEIGRVSDAIQDYIRAITVRPTMAEAH 1624

Query: 538  NNLGLVFVAEGLLEEAKYCFEKALQSDPLLDAAKSNLVKVVTMSKIC 584
             NL   +   G +E A   + +AL        A  NL+   T+  +C
Sbjct: 1625 ANLASAYKDSGHVEAAVKSYRQALILRTDFPEATCNLLH--TLQCVC 1759



 Score = 43.5 bits (101), Expect = 2e-04
 Identities = 56/270 (20%), Positives = 103/270 (37%)
 Frame = +2

Query: 161  KEILSKLKESVQFDIRQAAVWNTLGFILLKTGRVQSAISVLSSLLAIAPENYDCLGNLGI 220
            +E ++  + ++Q        +  L  I  + G++  AI      +A  P   +   NLG 
Sbjct: 1052 QEAIACYQHALQTRPNYGMAYGNLASIHYEQGQLDMAILHYKQAIACDPRFLEAYNNLGN 1231

Query: 221  AYLQIGNLELSAKCFQELILKDQNHPVALVNYAALLLCKYASVVAGAGASASEGALADQV 280
            A   +G +E + +C+ + +    NHP AL N   + + ++  V A               
Sbjct: 1232 ALKDVGRVEEAIQCYNQCLSLQPNHPQALTNLGNIYM-EWNMVAA--------------- 1363

Query: 281  MAANVAKECLLAAIKADVKSAHIWGNLAYAFSISGDHRSSSKCLEKAAKLEPNCMSTRYA 340
             AA+  K    A +      +  + NLA  +   G++  +  C  +  +++P   +    
Sbjct: 1364 -AASYYK----ATLNVTTGLSAPYNNLAIIYKQQGNYADAISCYNEVLRIDP-LAADGLV 1525

Query: 341  VATHRMKEAERSQDPSELLSFGGNEMASIIRDGDSSLVELPTAWAGLAMVHKAQHEISSA 400
               +  KE  R  D               I+D   ++   PT       + +A   ++SA
Sbjct: 1526 NRGNTYKEIGRVSD--------------AIQDYIRAITVRPT-------MAEAHANLASA 1642

Query: 401  YESEQHVLTEMEERAVCSLKQAVAEDPDDP 430
            Y+   HV     E AV S +QA+    D P
Sbjct: 1643 YKDSGHV-----EAAVKSYRQALILRTDFP 1717



 Score = 42.7 bits (99), Expect = 4e-04
 Identities = 25/86 (29%), Positives = 47/86 (54%)
 Frame = +2

Query: 505 NLGIFYRHEKKYQRAKAMFTKSLELQPGYAPAFNNLGLVFVAEGLLEEAKYCFEKALQSD 564
           N+   ++ +     A   +  ++EL+P +A A++NL   ++ +G L EA  C  +AL  +
Sbjct: 608 NMANAWKEKGNIDLAIRYYLIAIELRPNFADAWSNLASAYMRKGRLTEAAQCCRQALAIN 787

Query: 565 PLLDAAKSNLVKVVTMSKICKGLLKE 590
           PL+  A SNL  ++      +GL++E
Sbjct: 788 PLMVDAHSNLGNLMK----AQGLVQE 853



 Score = 38.9 bits (89), Expect = 0.006
 Identities = 34/174 (19%), Positives = 65/174 (36%)
 Frame = +2

Query: 160 LKEILSKLKESVQFDIRQAAVWNTLGFILLKTGRVQSAISVLSSLLAIAPENYDCLGNLG 219
           ++E  S   E+++     A  W+ L  + +++G    A+      + + P   D   NLG
Sbjct: 845 VQEAYSCYLEALRIQPTFAIAWSNLAGLFMESGDFNRALQYYKEAVKLKPSFPDAYLNLG 1024

Query: 220  IAYLQIGNLELSAKCFQELILKDQNHPVALVNYAALLLCKYASVVAGAGASASEGALADQ 279
              Y  +G  + +  C+Q  +    N+ +A  N A++                 +G L   
Sbjct: 1025 NVYKALGMPQEAIACYQHALQTRPNYGMAYGNLASI--------------HYEQGQLDMA 1162

Query: 280  VMAANVAKECLLAAIKADVKSAHIWGNLAYAFSISGDHRSSSKCLEKAAKLEPN 333
            ++    A  C       D +    + NL  A    G    + +C  +   L+PN
Sbjct: 1163 ILHYKQAIAC-------DPRFLEAYNNLGNALKDVGRVEEAIQCYNQCLSLQPN 1303


>TC87394 similar to GP|3582779|gb|AAC69180.1| peroxisomal targeting sequence
           1 receptor {Nicotiana tabacum}, partial (65%)
          Length = 1656

 Score = 50.8 bits (120), Expect = 1e-06
 Identities = 31/101 (30%), Positives = 57/101 (55%)
 Frame = +3

Query: 470 LGVSLQLSEEPSQAEKAYKQALLLATKQQAHAILSNLGIFYRHEKKYQRAKAMFTKSLEL 529
           LGV   LS E  +A  A++QAL L  K Q +++ + LG    +  +   A A + ++L+L
Sbjct: 792 LGVLYNLSREYDKAIAAFEQALKL--KPQDYSLWNKLGATQANSVQSADAIAAYQQALDL 965

Query: 530 QPGYAPAFNNLGLVFVAEGLLEEAKYCFEKALQSDPLLDAA 570
           +P Y  A+ N+G+ +  +G+ +E+   + +AL  +P  + A
Sbjct: 966 KPNYVRAWANMGISYANQGMYDESIRYYVRALAMNPKAENA 1088



 Score = 33.9 bits (76), Expect = 0.18
 Identities = 42/235 (17%), Positives = 99/235 (41%), Gaps = 9/235 (3%)
 Frame = +3

Query: 273 EGALADQVMAANVAKECLLAAIKADVKSAHIWGNLAYAFSISGDHRSSSKCLEKAAKLEP 332
           +G L++ V+A       L A +  + +++  W  L  A + + D + +   + +A + +P
Sbjct: 414 KGLLSEAVLA-------LEAEVLKNPENSEGWRLLGIAHAENDDDQQAIAAMMRAQEADP 572

Query: 333 NCMSTRYAVATHRMKEAERSQDPSELLSFGGNEMASIIRDGDSSLVELPTA--WAGLAMV 390
             +    A+      E E++     L  +  N      + G  +  E+  +  +A +A +
Sbjct: 573 TNLEVLLALGVSHTNELEQNAALKYLFGWLRNHP----KYGTIAPPEMSDSLYYADVARL 740

Query: 391 HKAQHEISSAYESEQHV-------LTEMEERAVCSLKQAVAEDPDDPVRWHQLGVHSLCT 443
              +  + S  +++ H+       L+   ++A+ + +QA+   P D   W++LG     +
Sbjct: 741 FN-EAAVISPDDADVHIVLGVLYNLSREYDKAIAAFEQALKLKPQDYSLWNKLGATQANS 917

Query: 444 QQFKTSQKYLKAAVACDRGCSYTWSNLGVSLQLSEEPSQAEKAYKQALLLATKQQ 498
            Q   +    + A+         W+N+G+S        ++ + Y +AL +  K +
Sbjct: 918 VQSADAIAAYQQALDLKPNYVRAWANMGISYANQGMYDESIRYYVRALAMNPKAE 1082



 Score = 30.4 bits (67), Expect = 2.0
 Identities = 43/234 (18%), Positives = 90/234 (38%), Gaps = 50/234 (21%)
 Frame = +3

Query: 52  ADVDQHVQGAPSSREEKVSSMKTGLIHVA--------RKMPKNAHAHFILGLMHQRLNQP 103
           +D++ +V G P+  +E     + GL+  A         K P+N+    +LG+ H   +  
Sbjct: 351 SDLNPYV-GHPNPLKEGQDLFRKGLLSEAVLALEAEVLKNPENSEGWRLLGIAHAENDDD 527

Query: 104 QKAILVYEKAEE-------ILLR----PETEIERPDLLSLV------------------- 133
           Q+AI    +A+E       +LL        E+E+   L  +                   
Sbjct: 528 QQAIAAMMRAQEADPTNLEVLLALGVSHTNELEQNAALKYLFGWLRNHPKYGTIAPPEMS 707

Query: 134 -QIHHAQCLILESSSENSSDKELEPH-----------ELKEILSKLKESVQFDIRQAAVW 181
             +++A    L + +   S  + + H           E  + ++  +++++   +  ++W
Sbjct: 708 DSLYYADVARLFNEAAVISPDDADVHIVLGVLYNLSREYDKAIAAFEQALKLKPQDYSLW 887

Query: 182 NTLGFILLKTGRVQSAISVLSSLLAIAPENYDCLGNLGIAYLQIGNLELSAKCF 235
           N LG     + +   AI+     L + P       N+GI+Y   G  + S + +
Sbjct: 888 NKLGATQANSVQSADAIAAYQQALDLKPNYVRAWANMGISYANQGMYDESIRYY 1049


>TC81564 weakly similar to PIR|E85082|E85082 hypothetical protein AT4g08320
           [imported] - Arabidopsis thaliana, partial (46%)
          Length = 1215

 Score = 48.9 bits (115), Expect = 5e-06
 Identities = 35/120 (29%), Positives = 56/120 (46%), Gaps = 7/120 (5%)
 Frame = +3

Query: 462 GCS-YTWSNLGVSLQLSEEPSQAEKAYKQALLL-----ATKQQAHAILSNLGIFYRHEKK 515
           GC  +   NL  SL+     +   K Y  A+ L     A  +++     N    Y    +
Sbjct: 516 GCQQFNLKNLAESLKTLGNKAMQSKQYFDAIELYNCAIAIYEKSAVYYCNRAAAYTQINR 695

Query: 516 YQRAKAMFTKSLELQPGYAPAFNNLGLVFVAEGLLEEA-KYCFEKALQSDPLLDAAKSNL 574
           Y  A     +S+E+ P Y+ A++ LGL + A+G   +A    F+KALQ DP  ++ K N+
Sbjct: 696 YTEAIQDSLRSIEIDPNYSKAYSRLGLAYYAQGNYRDAIDKGFKKALQLDPNNESVKENI 875


>TC79338 homologue to GP|20259245|gb|AAM14358.1 putative N-terminal
           acetyltransferase {Arabidopsis thaliana}, partial (24%)
          Length = 904

 Score = 44.3 bits (103), Expect = 1e-04
 Identities = 32/148 (21%), Positives = 63/148 (41%)
 Frame = +3

Query: 137 HAQCLILESSSENSSDKELEPHELKEILSKLKESVQFDIRQAAVWNTLGFILLKTGRVQS 196
           H + L ++  + N  D++ E +EL      +++ ++ D++    W+  G +       + 
Sbjct: 378 HGETLSMKGLTLNCMDRKSEAYEL------VRQGLKNDLKSHVCWHVYGLLYRSDREYRE 539

Query: 197 AISVLSSLLAIAPENYDCLGNLGIAYLQIGNLELSAKCFQELILKDQNHPVALVNYAALL 256
           AI    + L I P+N + L +L +   Q+ +L    +  Q+L+    NH +  + +A   
Sbjct: 540 AIKCYRNALRIDPDNIEILRDLSLLQAQMRDLSGFVETRQQLLTLKSNHRMNWIGFAVSH 719

Query: 257 LCKYASVVAGAGASASEGALADQVMAAN 284
                +  A     A EG L D     N
Sbjct: 720 HLNSNASKAIEILEAYEGTLEDDYPPEN 803


>TC89875 similar to GP|20259245|gb|AAM14358.1 putative N-terminal
           acetyltransferase {Arabidopsis thaliana}, partial (38%)
          Length = 1170

 Score = 43.9 bits (102), Expect = 2e-04
 Identities = 25/117 (21%), Positives = 55/117 (46%)
 Frame = +3

Query: 137 HAQCLILESSSENSSDKELEPHELKEILSKLKESVQFDIRQAAVWNTLGFILLKTGRVQS 196
           H + L ++  + N  D++ E +EL      +++ ++ D++    W+  G +       + 
Sbjct: 228 HGETLSMKGLTLNCMDRKSEAYEL------VRQGLKNDLKSHVCWHVFGLLYRSDREYRE 389

Query: 197 AISVLSSLLAIAPENYDCLGNLGIAYLQIGNLELSAKCFQELILKDQNHPVALVNYA 253
           AI    + L I PEN + L +L +   Q+ +L    +  Q+L+    NH +  + ++
Sbjct: 390 AIKCYRNALRIDPENIEILRDLSLLQAQMRDLSGFVETRQQLLTLKPNHRMNWIGFS 560


>TC90086 similar to PIR|G86185|G86185 hypothetical protein [imported] -
           Arabidopsis thaliana, partial (34%)
          Length = 894

 Score = 37.7 bits (86), Expect = 0.013
 Identities = 31/117 (26%), Positives = 55/117 (46%)
 Frame = +2

Query: 446 FKTSQKYLKAAVACDRGCSYTWSNLGVSLQLSEEPSQAEKAYKQALLLATKQQAHAILSN 505
           +K + K L+ A+      +    +L  +L    E  +A + +++A+ L  K      L N
Sbjct: 53  YKAAVKALEEAIFMKPDYADAHCDLASALHAMREDERAIEVFQKAIDL--KPGHIDALYN 226

Query: 506 LGIFYRHEKKYQRAKAMFTKSLELQPGYAPAFNNLGLVFVAEGLLEEAKYCFEKALQ 562
           L   Y    ++QRA  M+T+ L + P +  A  N  +  +  G  EEAK   ++AL+
Sbjct: 227 LSGLYMDLGRFQRASEMYTRVLAVWPNHWRAQLNKAVSLLGAGENEEAKKALKEALK 397


>TC80284 similar to PIR|T04740|T04740 hypothetical protein F6G17.110 -
           Arabidopsis thaliana, partial (31%)
          Length = 1645

 Score = 36.6 bits (83), Expect = 0.028
 Identities = 90/394 (22%), Positives = 151/394 (37%), Gaps = 16/394 (4%)
 Frame = +2

Query: 188 LLKTGRVQSAISVLSSLLAIAPENYDCLGNLGIAYLQIGNLELSAKCFQELILKDQNHPV 247
           L  T     AI +L SL++ +    D + N    Y Q   LEL     ++     Q +P+
Sbjct: 74  LCSTKDWSKAIRILDSLISQSTAIQD-ICNRAFCYSQ---LELHKHVIKDCDRAIQLNPL 241

Query: 248 ALVNYAALLLCKYASVVAGAGASA----SEGALADQVMAANVAK----ECLLAAIKADVK 299
            L  Y   +L  +A    G  A A     +G    Q  +A++ +    E LL   K  + 
Sbjct: 242 LLQAY---ILKGHAFSALGRKADALLVWEQGYEQAQHHSADLKQLIELEELLVKAKQAIN 412

Query: 300 SAHIWGNLAY--AFSISGDHRSSSKCLEKAAKLEPNCMSTRYAVATHRMKEAERSQDPSE 357
           S++    L+   A S S  +R+ ++  E  AKL  N       +    +K A++    +E
Sbjct: 413 SSNETNGLSIPQAKSDSSSNRNLTETCESQAKLSGNTSDKSEVL----LKSADKFDARNE 580

Query: 358 LLSFGGNEMASIIRDGDSSLVELPTAWAGLAMVHKAQHEISSAYESEQHVLTEMEERAVC 417
           L S GG          D  +   P       ++   +++ S   ES + VLT   E +  
Sbjct: 581 LNSEGGESSKC-----DGQVNGSPD------IIDNLRYDSSDTSESCEKVLTNSGESSDS 727

Query: 418 SLKQAVAEDPDDPVRWHQLGVHSLCTQQFKTSQKYLKAAVACDRGCSYTWSNLGVSLQLS 477
           +    +   P       +    S  + + + S+K+  A V+  +  S       V  +LS
Sbjct: 728 NDAAEILRKPS-----FKFTFPSEKSSEARKSKKFSVARVSKTKSIS-------VDFRLS 871

Query: 478 EEPSQA-EKAYKQAL-----LLATKQQAHAILSNLGIFYRHEKKYQRAKAMFTKSLELQP 531
              ++  E  Y  A+     +L         L   G  Y  +++   A A FTK+++  P
Sbjct: 872 RGIAEVNEGKYAHAISIFDQILKEDSAYPEALIGRGTAYAFKRELHSAIADFTKAIQYNP 1051

Query: 532  GYAPAFNNLGLVFVAEGLLEEAKYCFEKALQSDP 565
                A+   G    A G   EA     KAL+ +P
Sbjct: 1052 AAGEAWKRRGQARAALGEFVEAIEDLTKALEFEP 1153


>TC82644 similar to GP|21537266|gb|AAM61607.1 unknown {Arabidopsis
           thaliana}, partial (70%)
          Length = 937

 Score = 36.6 bits (83), Expect = 0.028
 Identities = 32/124 (25%), Positives = 55/124 (43%), Gaps = 1/124 (0%)
 Frame = +2

Query: 468 SNLGVSLQLSEEPSQAEKAYKQALLLATKQQAHAIL-SNLGIFYRHEKKYQRAKAMFTKS 526
           + L V  +  E  +Q E A + A  + +  +  +I  +N G+ +    KY+      TK+
Sbjct: 374 NKLFVDAKYEEALTQYELALEVAPDMPSSVEIRSICHANRGVCFLKMGKYENTVKECTKA 553

Query: 527 LELQPGYAPAFNNLGLVFVAEGLLEEAKYCFEKALQSDPLLDAAKSNLVKVVTMSKICKG 586
           LEL P Y  A    G         EEA    +K L+ DP  D A   + ++  ++ + + 
Sbjct: 554 LELNPMYVKALVRRGEAHEKLEHFEEAIADMKKILEIDPSNDQAGKAIRRLEPLAAVKRE 733

Query: 587 LLKE 590
            +KE
Sbjct: 734 KMKE 745


>BE249103 weakly similar to GP|6630450|gb| F23N19.10 {Arabidopsis thaliana},
           partial (20%)
          Length = 613

 Score = 35.8 bits (81), Expect = 0.048
 Identities = 23/82 (28%), Positives = 38/82 (46%)
 Frame = +1

Query: 493 LATKQQAHAILSNLGIFYRHEKKYQRAKAMFTKSLELQPGYAPAFNNLGLVFVAEGLLEE 552
           +A     H + SN        K+Y+ A     K +EL+P +    + LG      G  ++
Sbjct: 124 IAVDATNHVLYSNRSAAQASLKRYKEALEDAQKVVELKPDWPKGHSRLGTAKQGLGDWDD 303

Query: 553 AKYCFEKALQSDPLLDAAKSNL 574
           A   +++ALQ +P  +AAK  L
Sbjct: 304 AIDAYKRALQLEPTNEAAKKAL 369


>TC92578 similar to GP|10177072|dbj|BAB10514. contains similarity to unknown
           protein~dbj|BAA91048.1~gene_id:MKP11.12 {Arabidopsis
           thaliana}, partial (36%)
          Length = 1126

 Score = 32.7 bits (73), Expect = 0.40
 Identities = 43/228 (18%), Positives = 82/228 (35%), Gaps = 17/228 (7%)
 Frame = +3

Query: 293 AIKADVKSAHIWGNLAYAFSISGDHRSSSKCLEKAAKLEPN----------------CMS 336
           AIK + +   +W NL + +S+     ++ + + K     PN                C  
Sbjct: 411 AIK-EFEDLELWDNLIHCYSLLEKKATAVELIRKRLSERPNDPRLWCSLGDITNNDACYE 587

Query: 337 TRYAVATHRMKEAERSQDPSELLSFGGNEMASIIRDGDSSLVEL-PTAWAGLAMVHKAQH 395
               V+ +R   A+RS   S   + G  E + ++ +   S+  + P  W           
Sbjct: 588 KALEVSNNRSARAKRSLARS-AYNRGEYETSKVLWESAMSMNSMFPDGWFAFGAAALKAR 764

Query: 396 EISSAYESEQHVLTEMEERAVCSLKQAVAEDPDDPVRWHQLGVHSLCTQQFKTSQKYLKA 455
           ++               E+A+ +  +AV  DPD+   W+ +    L  ++ K +    K 
Sbjct: 765 DV---------------EKALDAFTRAVQLDPDNGEAWNNIACLHLIKKKSKEAFIAFKE 899

Query: 456 AVACDRGCSYTWSNLGVSLQLSEEPSQAEKAYKQALLLATKQQAHAIL 503
           A+   R     W N           SQA +A +  L +   ++   +L
Sbjct: 900 ALKFKRNSWQLWENYSHVAVDVGNISQALEAAQMVLDITKNKRVDTVL 1043



 Score = 30.4 bits (67), Expect = 2.0
 Identities = 32/155 (20%), Positives = 66/155 (41%)
 Frame = +3

Query: 408 LTEMEERAVCSLKQAVAEDPDDPVRWHQLGVHSLCTQQFKTSQKYLKAAVACDRGCSYTW 467
           L E +  AV  +++ ++E P+DP  W  LG  +           Y KA    +   +   
Sbjct: 468 LLEKKATAVELIRKRLSERPNDPRLWCSLGDIT------NNDACYEKALEVSNNRSARAK 629

Query: 468 SNLGVSLQLSEEPSQAEKAYKQALLLATKQQAHAILSNLGIFYRHEKKYQRAKAMFTKSL 527
            +L  S     E   ++  ++ A+ + +           G      +  ++A   FT+++
Sbjct: 630 RSLARSAYNRGEYETSKVLWESAMSMNSMFPDGWFA--FGAAALKARDVEKALDAFTRAV 803

Query: 528 ELQPGYAPAFNNLGLVFVAEGLLEEAKYCFEKALQ 562
           +L P    A+NN+  + + +   +EA   F++AL+
Sbjct: 804 QLDPDNGEAWNNIACLHLIKKKSKEAFIAFKEALK 908


>AW586685 similar to PIR|T01081|T01 hypothetical protein T10P11.3.2 -
           Arabidopsis thaliana, partial (22%)
          Length = 651

 Score = 32.0 bits (71), Expect = 0.69
 Identities = 15/27 (55%), Positives = 19/27 (69%)
 Frame = +2

Query: 536 AFNNLGLVFVAEGLLEEAKYCFEKALQ 562
           A NNLG VFV  G L++A  C+ KAL+
Sbjct: 95  ALNNLGSVFVDHGKLDQAADCYIKALK 175


>TC83149 similar to GP|13561982|gb|AAK30594.1 flagelliform silk protein
           {Argiope trifasciata}, partial (3%)
          Length = 1177

 Score = 31.2 bits (69), Expect = 1.2
 Identities = 25/95 (26%), Positives = 43/95 (44%), Gaps = 5/95 (5%)
 Frame = +2

Query: 473 SLQLSEEPSQAEKAYKQALLLATK----QQAHAI-LSNLGIFYRHEKKYQRAKAMFTKSL 527
           SL+     + A K Y  A+ L T+       +A+ LSN    +   K +  A+A    ++
Sbjct: 248 SLKSRGNAAMASKDYPSAIALYTEALSLNPGNAVYLSNRAAAHSAAKDHSSARADAEAAV 427

Query: 528 ELQPGYAPAFNNLGLVFVAEGLLEEAKYCFEKALQ 562
            + P Y  A++ LGL   A G  + A   + K ++
Sbjct: 428 AIDPAYTKAWSRLGLARFALGDAKGAMEAYGKGIE 532


>BG449158 similar to GP|17979432|gb putative TPR repeat nuclear
           phosphoprotein {Arabidopsis thaliana}, partial (26%)
          Length = 693

 Score = 30.8 bits (68), Expect = 1.5
 Identities = 17/69 (24%), Positives = 32/69 (45%)
 Frame = +1

Query: 189 LKTGRVQSAISVLSSLLAIAPENYDCLGNLGIAYLQIGNLELSAKCFQELILKDQNHPVA 248
           +K G  +SA+S    +L + P+N + L  L   Y+Q+G  +   +  ++    D     A
Sbjct: 1   IKLGDFRSALSNFEKVLEVYPDNCETLKALAYIYVQLGQTDKGHEFIRKATKIDPRDAQA 180

Query: 249 LVNYAALLL 257
            +    LL+
Sbjct: 181 FLELGELLI 207


>BG455485 
          Length = 677

 Score = 30.8 bits (68), Expect = 1.5
 Identities = 19/81 (23%), Positives = 35/81 (42%), Gaps = 3/81 (3%)
 Frame = +3

Query: 8   ENSQDKSLLSKDTDSTEGEGKKSHKLGKCRSRPSKTDSLDC---GGDADVDQHVQGAPSS 64
           ENS++  +     ++  G GK         S+  + +        G  DV+ +V G  + 
Sbjct: 30  ENSEEIEIKHNSENNNNGTGKVDTSTSTSTSKNVEVNGNTAKSGNGSGDVNVNVNGKVNK 209

Query: 65  REEKVSSMKTGLIHVARKMPK 85
             E  ++ KTG++HV    P+
Sbjct: 210 PSEGSATGKTGVVHVQELKPE 272


>TC79065 homologue to SP|P12204|YCF3_TOBAC Photosystem I assembly protein
            ycf3. [Common tobacco] {Nicotiana tabacum}, partial (70%)
          Length = 1301

 Score = 30.0 bits (66), Expect = 2.6
 Identities = 18/73 (24%), Positives = 37/73 (50%), Gaps = 1/73 (1%)
 Frame = +3

Query: 472  VSLQLSEEPSQAEKAYKQALLLATKQQAHA-ILSNLGIFYRHEKKYQRAKAMFTKSLELQ 530
            +S Q     ++A + Y +A+ L       + IL N+G+ +    ++ +A   + ++LE  
Sbjct: 1053 MSAQSEGNYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFRALERN 1232

Query: 531  PGYAPAFNNLGLV 543
            P    AFNN+ ++
Sbjct: 1233 PFLPQAFNNMAVI 1271


>TC86990 similar to GP|22136840|gb|AAM91764.1 unknown protein {Arabidopsis
           thaliana}, partial (64%)
          Length = 2354

 Score = 29.3 bits (64), Expect = 4.5
 Identities = 14/38 (36%), Positives = 21/38 (54%)
 Frame = +2

Query: 231 SAKCFQELILKDQNHPVALVNYAALLLCKYASVVAGAG 268
           SA+ F+  I   +N+PVAL+NY    +CK   +    G
Sbjct: 788 SARLFKMRIKFVKNYPVALINYTPWKICKLELIKISQG 901


>CA919975 weakly similar to GP|3819697|emb| BnMAP4K alpha1 {Brassica napus},
           partial (12%)
          Length = 768

 Score = 28.5 bits (62), Expect = 7.6
 Identities = 29/119 (24%), Positives = 49/119 (40%), Gaps = 3/119 (2%)
 Frame = -2

Query: 316 DHRSSSKCLEKAAKLEPNCMSTRYAVATHRMKEAERSQDPSELLSFGGNEMASIIRDGDS 375
           D  S     + AA L     + +     +  K   R +  S++     ++M S   D   
Sbjct: 761 DRNSMPSLKDSAANLAEAKAAIQGGRKVNARKRHSRGKINSDIQESKRDQMTSST-DSSR 585

Query: 376 SLVELPTAWAGLAMVHKAQHEISSAYESEQHVLTEMEERAVC---SLKQAVAEDPDDPV 431
           S  E   A  G++  H A  +  SA      +L+     +V    SLK+A+A+DP+ P+
Sbjct: 584 SYREYIDAQRGMSKSHYASDDEESA-----RILSSSAPLSVLLIPSLKEAIADDPEGPI 423


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.315    0.129    0.364 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 
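
The bit scores printed on the "Score =" lines can be checked against the gapped Lambda and K values above. The following is a minimal Python sketch, assuming the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, where S is the raw score shown in parentheses. The function name `bit_score` is illustrative only and is not part of BLAST. In this report, scores of 100 bits or more appear to be printed without decimals, so the computed value is compared with the integer part in those cases.

```python
import math

# Gapped Karlin-Altschul parameters copied from this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores are the parenthesised values on the "Score =" lines
    # (799, 852, 242) and the S2 cutoff (61) from the statistics block.
    for raw in (799, 852, 242, 61):
        print(raw, round(bit_score(raw), 1))
```

Because Lambda and K are printed to only three significant figures, small differences in the last digit are to be expected.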


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,178,763
Number of Sequences: 36976
Number of extensions: 227847
Number of successful extensions: 1098
Number of sequences better than 10.0: 38
Number of HSP's better than 10.0 without gapping: 1053
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1089
length of query: 590
length of database: 9,014,727
effective HSP length: 101
effective length of query: 489
effective length of database: 5,280,151
effective search space: 2581993839
effective search space used: 2581993839
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (28.1 bits)
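
The length and search-space figures above are mutually consistent, and the arithmetic can be reproduced directly. The sketch below is a hedged illustration: the function name `effective_search_space` is hypothetical, and the assumption that this BLAST version derives the numbers in exactly this way is not confirmed by the report itself. It relies on the database length being counted in codons (total letters divided by three, as TBLASTN translates nucleotide sequences) and on the effective HSP length being subtracted once from the query and once per database sequence.

```python
# Values copied from the statistics block of this report.
QUERY_LENGTH = 590
DATABASE_LETTERS = 27_044_181  # nucleotides in MTGI
NUM_SEQUENCES = 36_976
EFFECTIVE_HSP_LENGTH = 101


def effective_search_space(query_len, db_letters, n_seqs, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    # Assumption: the translated database length is letters // 3.
    db_len = db_letters // 3
    eff_query = query_len - hsp_len
    # Assumption: the HSP length correction is applied to every sequence.
    eff_db = db_len - n_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


if __name__ == "__main__":
    print(effective_search_space(
        QUERY_LENGTH, DATABASE_LETTERS, NUM_SEQUENCES, EFFECTIVE_HSP_LENGTH))
```

With the values from this report the three results agree with the "effective length of query", "effective length of database", and "effective search space" lines.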

