Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0118a.5
         (351 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

BG585866                                                               92  3e-19
BG585499                                                               72  2e-13
TC83226 weakly similar to PIR|G86419|G86419 probable reverse tra...    54  1e-07
BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At...    53  2e-07
CA919100 homologue to PIR|G90291|G902 endoglucanase precursor [i...    44  7e-05
BG586862                                                               44  1e-04
TC82520                                                                31  0.015
TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarot...    35  0.033
TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non...    33  0.16
AJ496898 similar to GP|23171295|gb CG3731-PA {Drosophila melanog...    33  0.16
AW980456                                                               32  0.28
BG452711                                                               32  0.37
BG644917 homologue to GP|10177015|dbj| ubiquitin-like protein {A...    29  0.39
AW686588                                                               31  0.82
BG586638 homologue to GP|9279730|dbj formin-like protein {Arabid...    30  1.8
BG456581                                                               29  2.4
TC88753 similar to PIR|T10586|T10586 small nuclear ribonucleopro...    29  2.4
TC89483 similar to GP|18377662|gb|AAL66981.1 unknown protein {Ar...    29  3.1
AJ501131                                                               28  4.1
TC91679 similar to PIR|E84908|E84908 hypothetical protein At2g46...    28  5.3

>BG585866 
          Length = 828

 Score = 91.7 bits (226), Expect(2) = 3e-19
 Identities = 42/131 (32%), Positives = 68/131 (51%), Gaps = 1/131 (0%)
 Frame = +3

Query: 1   IWGPSPTGCYTAREAYGW-LNNLDHEGQNGRCWQWVWKLKVPEKM*LFVWLILNKALQTN 59
           IW  +  G YTA+  Y W L+  +    N   W W+W+LK+PEK   F+WL  + A+ T 
Sbjct: 357 IWPHNSNGVYTAKSGYSWILSQTETVNYNNSSWSWIWRLKIPEKYKFFLWLACHNAVPTL 536

Query: 60  GNRFRCHLAQSPSRSRCSAAEETILHCLRDCPHSRELWLRLGATTWRSFATMDVEAWITS 119
                 ++  S   SRC   EE+  HC+RDC  S+ +W ++G ++   F++  V+ W+  
Sbjct: 537 SLLNHRNMVNSAICSRCGEHEESFFHCVRDCRFSKIIWHKIGFSSPDFFSSSSVQDWLKD 716

Query: 120 LARSNHAISFL 130
               +   +FL
Sbjct: 717 GISCHRPTTFL 749



 Score = 20.8 bits (42), Expect(2) = 3e-19
 Identities = 6/13 (46%), Positives = 8/13 (61%)
 Frame = +2

Query: 132 GIWSVWLWRNNMC 144
           G+W +W  R  MC
Sbjct: 752 GLWWIWRHRTLMC 790


>BG585499 
          Length = 792

 Score = 72.4 bits (176), Expect = 2e-13
 Identities = 37/126 (29%), Positives = 64/126 (50%), Gaps = 10/126 (7%)
 Frame = +3

Query: 32  WQWVWKLKVPEKM*LFVWLILNKALQTNGNRFRCHLAQSPSRSRCSAAEETILHCLRDCP 91
           W+ +W  + P +   F+WL+ +  + TN  R R       +   C  A+ET+LH L DC 
Sbjct: 225 WKMLWGWRGPHRTQTFMWLVAHGCILTNYRRSRWGTRVLATCPCCGNADETVLHVLCDCR 404

Query: 92  HSRELWLRLGATTW--RSFATMDVEAWI-TSLARSNHAIS-------FLSGIWSVWLWRN 141
            + ++W+RL  + W    F+  D   W+  +L++ ++ +S       F++  W +W WRN
Sbjct: 405 PASQVWIRLVPSDWITNFFSFDDCRDWVFKNLSKRSNGVSKFKWQPTFMTTCWHMWTWRN 584

Query: 142 NMCFEE 147
              FEE
Sbjct: 585 KAIFEE 602


>TC83226 weakly similar to PIR|G86419|G86419 probable reverse transcriptase 
           100033-105622 [imported] - Arabidopsis thaliana, partial
           (2%)
          Length = 885

 Score = 53.5 bits (127), Expect = 1e-07
 Identities = 28/105 (26%), Positives = 44/105 (41%), Gaps = 8/105 (7%)
 Frame = +3

Query: 1   IWGPSPTGCYTAREAYGWL--------NNLDHEGQNGRCWQWVWKLKVPEKM*LFVWLIL 52
           +W  +PTG Y+ +  Y  L        NN          W+ +W L    +  + +W IL
Sbjct: 3   MWMHNPTGIYSVKSGYNTLRTWQTQQINNTSTSSDETLIWKKIWSLHTIPRHKVLLWRIL 182

Query: 53  NKALQTNGNRFRCHLAQSPSRSRCSAAEETILHCLRDCPHSRELW 97
           N +L    +  +  +   P   RC +  ETI H    CP S+ +W
Sbjct: 183 NDSLPVRSSLRKRGIQCYPLCPRCHSKTETITHLFMSCPLSKRVW 317


>BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At2g45230
           [imported] - Arabidopsis thaliana, partial (10%)
          Length = 767

 Score = 52.8 bits (125), Expect = 2e-07
 Identities = 31/105 (29%), Positives = 48/105 (45%), Gaps = 9/105 (8%)
 Frame = -3

Query: 2   WGPSPTGCYTAREAYGWLNNL-DHEGQNGRC--------WQWVWKLKVPEKM*LFVWLIL 52
           W  S +G Y+ +  Y    N+     Q G          +Q VWK     K+  F+W  +
Sbjct: 414 WEYSKSGHYSVKSGYYVQTNIIAAANQRGTVDQPSLDDLYQRVWKYNTSPKVRHFLWRCI 235

Query: 53  NKALQTNGNRFRCHLAQSPSRSRCSAAEETILHCLRDCPHSRELW 97
           + +L T  N    H+++  S SRC    ET+ H L  CP++R +W
Sbjct: 234 SNSLPTAANMRSRHISKDGSCSRCGMESETVNHILFQCPYARLIW 100


>CA919100 homologue to PIR|G90291|G902 endoglucanase precursor [imported] -
           Sulfolobus solfataricus, partial (3%)
          Length = 789

 Score = 44.3 bits (103), Expect = 7e-05
 Identities = 34/131 (25%), Positives = 53/131 (39%), Gaps = 15/131 (11%)
 Frame = -2

Query: 33  QWVWKLKVPEKM*LFVWLILNKALQTNGNRFRCHLAQSPSR---SRCSAAEETILHCLRD 89
           + +W  +VP K+ +F W +L   L T  N     +  + +    S C A E    H    
Sbjct: 590 EMIWHRQVPLKVSVFAWRLLRDRLPTKSNLIYRGVIPTEAGLCVSGCGALESA-QHLFLS 414

Query: 90  CPHSRELWLRLGATTWRSFATMDVEA-------WITSLARSNHAISFLSGIWSVWLW--- 139
           C +   LW  +    W  F  +D          ++ S   +  + SFL  IW +  W   
Sbjct: 413 CSYFASLWSLV--RDWIGFVGVDTNVLSDHFVQFVHSTGGNKASQSFLQLIWLLCAWVLW 240

Query: 140 --RNNMCFEET 148
             RNNMCF ++
Sbjct: 239 TERNNMCFNDS 207


>BG586862 
          Length = 804

 Score = 43.5 bits (101), Expect = 1e-04
 Identities = 30/117 (25%), Positives = 52/117 (43%), Gaps = 5/117 (4%)
 Frame = -1

Query: 35  VWKLKVPEKM*LFVWLILNKALQTNGNRFRCHLAQSPSRSRCSAAEETILHCLRDCPHSR 94
           VW +K   +   F+W +L+ AL       +  +  S    RC +  ET+ H   +C  ++
Sbjct: 654 VWGIKTIPRHKSFLWRLLHNALPVKDELHKRGIRCSLLCPRCESKIETVQHLFLNCEVTQ 475

Query: 95  ELWL--RLGATTWRSFATMDVEAWITSLARSNH---AISFLSGIWSVWLWRNNMCFE 146
           + W   +LG   + S   +    WIT+    N     I+  + ++S+W  RN   FE
Sbjct: 474 KEWFGSQLG-INFHSSGVLHFHDWITNFILKNDEETIIALTALLYSIWHARNQKVFE 307


>TC82520 
          Length = 833

 Score = 31.2 bits (69), Expect(2) = 0.015
 Identities = 12/35 (34%), Positives = 20/35 (56%)
 Frame = +3

Query: 35  VWKLKVPEKM*LFVWLILNKALQTNGNRFRCHLAQ 69
           VW+  +P K+ +FVW + +  L T  N  + H+ Q
Sbjct: 186 VWQKNIPSKVSMFVWRLFHNRLPTKVNLMQRHVLQ 290



 Score = 24.3 bits (51), Expect(2) = 0.015
 Identities = 29/122 (23%), Positives = 43/122 (34%), Gaps = 15/122 (12%)
 Frame = +2

Query: 81  ETILHCLRDCPHSRELW----------LRLGATTWRSFATMDVEAWITSLARSNHAISFL 130
           ET  H    C     LW          L L A   + F      A       S   I + 
Sbjct: 329 ETATHLFLHCDIFGSLWSHVLRWLHLLLVLPADIRQFFIQFTSMAGSPRFTHSFLQIMWF 508

Query: 131 SGIWSVWLWRNNMCFEET---PWNLAEAWRRLSHVHDEMLQTSQDWSPGD--LNSLLCVR 185
           + +W +W  RNN  F+ +   P    E  +  S +  +  Q +  +S  D   + LLC+ 
Sbjct: 509 ASVWVLWKKRNNRVFQNSLSDPSTFVEQVKMHSFLWLKFQQATFSFSYHDWWKHPLLCMG 688

Query: 186 WH 187
            H
Sbjct: 689 VH 694


>TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarotenoid
           dioxygenase1 {Pisum sativum}, partial (43%)
          Length = 1865

 Score = 35.4 bits (80), Expect = 0.033
 Identities = 23/98 (23%), Positives = 38/98 (38%), Gaps = 8/98 (8%)
 Frame = -2

Query: 1   IWGPSPTGCYTAREAYGWLNNLDH-----EGQNGRCWQWVWKLKVPEKM*LFVWLILNKA 55
           +W P   G ++    Y  L NL         +    ++ +WK K P K+  F W +    
Sbjct: 439 VWKPDKEGVFSVNSCYFLLQNLRLLEDRLSYEEEVIFRELWKSKAPAKVLAFSWTLFLDR 260

Query: 56  LQTNGNRFRCHLAQSPSRSR---CSAAEETILHCLRDC 90
           + T  N  +  L +     R   C   +ET++H    C
Sbjct: 259 IPTMVNLGKRRLLRVEDSKRCVFCGCQDETVVHLFLHC 146


>TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non-LTR
           retroelement reverse transcriptase {Oryza sativa
           (japonica cultivar-group)}, partial (2%)
          Length = 1262

 Score = 33.1 bits (74), Expect = 0.16
 Identities = 25/98 (25%), Positives = 37/98 (37%), Gaps = 2/98 (2%)
 Frame = +2

Query: 2   WGPSPTGCYTAREAYGWLNNLDHEGQNGRCWQWVWKLKVPEKM*LFVWLILNKALQTNGN 61
           W   P   Y+ +  Y ++ +  H          VW   +P K+ LFVW +L   L T  N
Sbjct: 530 WLLDPVNGYSVKVFYRYITSTGHISDRSLVDD-VWHKHIPSKVSLFVWRLLRNRLPTKDN 706

Query: 62  R-FRCHLAQSPSRSRCSAAE-ETILHCLRDCPHSRELW 97
              R  L  + +   C   + E+  H    C     LW
Sbjct: 707 LVHRGVLLATNAACVCGCVDSESTTHLFLHCNVFCSLW 820


>AJ496898 similar to GP|23171295|gb CG3731-PA {Drosophila melanogaster},
           partial (35%)
          Length = 698

 Score = 33.1 bits (74), Expect = 0.16
 Identities = 16/48 (33%), Positives = 24/48 (49%), Gaps = 8/48 (16%)
 Frame = +2

Query: 120 LARSNHAISFLS--------GIWSVWLWRNNMCFEETPWNLAEAWRRL 159
           +A  N A SF S        G+W V+   + M  E   WN+ EAW+++
Sbjct: 152 VAEDNCAHSFQSFNTCYKDTGLWGVYFVSDGMTIENMVWNIQEAWKKM 295


>AW980456 
          Length = 779

 Score = 32.3 bits (72), Expect = 0.28
 Identities = 23/72 (31%), Positives = 30/72 (40%), Gaps = 6/72 (8%)
 Frame = -2

Query: 1   IWGPSPTGCYTAREAYGWL------NNLDHEGQNGRCWQWVWKLKVPEKM*LFVWLILNK 54
           IW     G Y  + AY +       ++  H   N   W  +WKLKVP K+   VW +   
Sbjct: 199 IWKDEKHGKYYVKSAYRFCVEELFDSSYLHRPGN---WSGIWKLKVPPKVQNLVWRMCRG 29

Query: 55  ALQTNGNRFRCH 66
            L T   R R H
Sbjct: 28  CLPT---RIRLH 2


>BG452711 
          Length = 672

 Score = 32.0 bits (71), Expect = 0.37
 Identities = 19/95 (20%), Positives = 35/95 (36%)
 Frame = +3

Query: 3   GPSPTGCYTAREAYGWLNNLDHEGQNGRCWQWVWKLKVPEKM*LFVWLILNKALQTNGNR 62
           G   T      + Y WL  +           W+ +L  P  + LF+  +    +      
Sbjct: 198 GSGSTAHLIVSKGYWWLMGVHTSFLGKES*NWISRLCAPSNIKLFL*QL*RDYVHFRSIL 377

Query: 63  FRCHLAQSPSRSRCSAAEETILHCLRDCPHSRELW 97
             C+L  S     C+   + +LH L  C  ++++W
Sbjct: 378 LFCNLISSNLCPICNQRSQDMLHALFSCTRAKDVW 482


>BG644917 homologue to GP|10177015|dbj| ubiquitin-like protein {Arabidopsis
           thaliana}, partial (98%)
          Length = 751

 Score = 28.9 bits (63), Expect(2) = 0.39
 Identities = 10/15 (66%), Positives = 10/15 (66%)
 Frame = +3

Query: 144 CFEETPWNLAEAWRR 158
           CF    WNL EAWRR
Sbjct: 684 CFANCKWNLGEAWRR 728



 Score = 21.6 bits (44), Expect(2) = 0.39
 Identities = 7/16 (43%), Positives = 10/16 (61%)
 Frame = +2

Query: 130 LSGIWSVWLWRNNMCF 145
           + G W VWL+   +CF
Sbjct: 626 MRGFWRVWLFWILICF 673


>AW686588 
          Length = 567

 Score = 30.8 bits (68), Expect = 0.82
 Identities = 33/131 (25%), Positives = 45/131 (34%), Gaps = 14/131 (10%)
 Frame = +1

Query: 33  QWVWKLKVPEKM*LFVWLILNKALQTNGN--RFRCHLAQSPSRSRCSAAEETILHCLRDC 90
           Q  W L VP K+ +  W ++   L T  N  R RC   ++          ET  H    C
Sbjct: 124 QTTW-L*VPLKVSILAWRLIRDRLPTKANLVRRRCLAVEAAGCVVGCGIAETANHLFLHC 300

Query: 91  PHSRELWLRLGATTWRSFATMDVE------------AWITSLARSNHAISFLSGIWSVWL 138
                +W  + A  W   +  D                 T   RS   + +L  +W VW 
Sbjct: 301 ATFGAVWQHIRA--WIGVSGADPHDLSDHFIQFITCTGHTRARRSFMQLIWLLCVWMVWN 474

Query: 139 WRNNMCFEETP 149
            RNN  F   P
Sbjct: 475 ERNNRLFN*YP 507


>BG586638 homologue to GP|9279730|dbj formin-like protein {Arabidopsis
           thaliana}, partial (1%)
          Length = 723

 Score = 29.6 bits (65), Expect = 1.8
 Identities = 15/53 (28%), Positives = 25/53 (46%)
 Frame = +2

Query: 87  LRDCPHSRELWLRLGATTWRSFATMDVEAWITSLARSNHAISFLSGIWSVWLW 139
           L +C +   LW  +    W     +D  +W+ SL   N A+  +S +W  W+W
Sbjct: 548 LENCCYV*LLWCWI----WICCGRVDSVSWLISL--ENTAVVVMSKLWGFWIW 688


>BG456581 
          Length = 683

 Score = 29.3 bits (64), Expect = 2.4
 Identities = 30/126 (23%), Positives = 46/126 (35%), Gaps = 11/126 (8%)
 Frame = +2

Query: 31  CWQW---VWKLKVPEKM*LFVWLILNKALQTNGNRFRCHLAQSPSRSRCSAAEETILHCL 87
           C  W   +W   +P       W + +  L T+ N      A     S C    ET  H  
Sbjct: 59  CAPWASTIWNSCIPPSHSFICWRLAHDRLPTDDNLSSRGCALVSMCSFCLEQVETSDHLF 238

Query: 88  RDCPHSRELW------LRLGATTWRSFATM--DVEAWITSLARSNHAISFLSGIWSVWLW 139
             C     LW      LR+G   + SF  +   +    +S  R  +  + +  + S+W  
Sbjct: 239 LRCKFVVTLWSWLCSQLRVGLD-FSSFKALLSSLPRHCSSQVRDLYVAAVVHMVHSIWWA 415

Query: 140 RNNMCF 145
           RNN+ F
Sbjct: 416 RNNVRF 433


>TC88753 similar to PIR|T10586|T10586 small nuclear
           ribonucleoprotein-associated protein homolog F9F13.90 -
           Arabidopsis thaliana, partial (77%)
          Length = 1273

 Score = 29.3 bits (64), Expect = 2.4
 Identities = 25/112 (22%), Positives = 41/112 (36%), Gaps = 5/112 (4%)
 Frame = +1

Query: 97  WLRLGATTWRSFATMDVEAWITSLARSNHAI-SFLSGIWSVWLWRNNMCFEETPWNLAEA 155
           W    A+TW +  +    +W  S A    +  S+ +G  ++W         ET W+    
Sbjct: 592 WSACYASTWSNAVSWT--SWTRSTADGEGSTASYAAG--AIWA--------ETRWSSTTI 735

Query: 156 WRRLSHVHDEMLQTSQDWSPGDLNSLLCVRWHPPAR----GGSN*MWMTVTW 203
           W   S V  E    +  WS G+  S     W+  +       S+  W +  W
Sbjct: 736 WYATSSVWAETNGATSSWSDGERTSCSSSAWNAASASSWYASSSWKWCSCVW 891


>TC89483 similar to GP|18377662|gb|AAL66981.1 unknown protein {Arabidopsis
           thaliana}, partial (35%)
          Length = 1711

 Score = 28.9 bits (63), Expect = 3.1
 Identities = 9/13 (69%), Positives = 10/13 (76%)
 Frame = +1

Query: 132 GIWSVWLWRNNMC 144
           GIWS WLW+ N C
Sbjct: 802 GIWSGWLWKKNNC 840


>AJ501131 
          Length = 451

 Score = 28.5 bits (62), Expect = 4.1
 Identities = 12/29 (41%), Positives = 14/29 (47%)
 Frame = -1

Query: 129 FLSGIWSVWLWRNNMCFEETPWNLAEAWR 157
           FL   W V+LW    CF    W+L   WR
Sbjct: 262 FLWSGWFVFLWSGFGCFVFRLWSLVSLWR 176


>TC91679 similar to PIR|E84908|E84908 hypothetical protein At2g46890
           [imported] - Arabidopsis thaliana, partial (77%)
          Length = 1156

 Score = 28.1 bits (61), Expect = 5.3
 Identities = 16/38 (42%), Positives = 20/38 (52%)
 Frame = +1

Query: 120 LARSNHAISFLSGIWSVWLWRNNMCFEETPWNLAEAWR 157
           L RSN AI  L+ +WS+ L  N    E+  W   E WR
Sbjct: 292 LWRSNIAI-LLTWVWSIRLTHNYFRREKWQWGAREDWR 402


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.342    0.147    0.572 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,732,704
Number of Sequences: 36976
Number of extensions: 277843
Number of successful extensions: 3138
Number of sequences better than 10.0: 44
Number of HSP's better than 10.0 without gapping: 3057
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3132
length of query: 351
length of database: 9,014,727
effective HSP length: 97
effective length of query: 254
effective length of database: 5,428,055
effective search space: 1378725970
effective search space used: 1378725970
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.5 bits)
S2: 59 (27.3 bits)
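
Note on the statistics above: the bit scores and E-values reported for single-HSP hits can be reproduced from the raw scores (the numbers in parentheses after each bit score) using the standard Karlin-Altschul relations, S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). The sketch below is illustrative only. It assumes the "Gapped" Lambda and K values and the "effective search space" line from this report, and the function names are hypothetical, not part of BLAST. Hits marked "Expect(2)" use sum statistics over several HSPs, so this simple formula is not expected to match them.

```python
import math

# Values copied from this report's "Gapped" statistics block and
# "effective search space" line (assumed to be the ones used for scoring).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 1378725970


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to a bit score S'."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from the single-HSP alignments listed in the report.
    for name, raw in [("BG585499", 176), ("TC83226", 127)]:
        print(f"{name}: {bit_score(raw):.1f} bits, E = {expect_value(raw):.0e}")
```

The printed values should land close to the Score and Expect lines for the same hits; small differences are possible because Lambda and K are only given to three significant digits.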

