Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0103.7
         (411 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

TC93153 similar to GP|14715220|emb|CAC44106. gag polyprotein {Ci...   141  4e-34
AL366725                                                               75  5e-14
TC87383 similar to GP|19168656|emb|CAD26175. DNA-DIRECTED RNA PO...    50  1e-06
TC87382 similar to EGAD|146423|156195 vitellogenin {Anolis pulch...    50  2e-06
BG647713 homologue to GP|15042313|gb| 232R {Chilo iridescent vir...    46  2e-05
BF521488 weakly similar to GP|18447433|gb RE28280p {Drosophila m...    37  0.018
TC92636 homologue to GP|15042313|gb|AAK82093.1 232R {Chilo iride...    34  0.091
TC87381                                                                32  0.59
TC82733 similar to GP|10177404|dbj|BAB10535. gene_id:K24M7.12~pi...    30  1.7
TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4....    30  2.2
AL382998 similar to GP|23491723|db formin homology protein A {Di...    29  2.9
BQ144502 weakly similar to GP|6523547|emb hydroxyproline-rich gl...    29  3.8
TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finge...    28  5.0
TC79732 weakly similar to GP|6729021|gb|AAF27017.1| hypothetical...    28  5.0
BQ151174 similar to GP|11036868|gb| PxORF73 peptide {Plutella xy...    28  6.5
TC91504 similar to GP|21618032|gb|AAM67082.1 unknown {Arabidopsi...    28  6.5
BG449394 similar to PIR|A83006|A830 hypothetical protein PA5121 ...    28  6.5
TC86584 similar to GP|11994215|dbj|BAB01337. contains similarity...    28  8.5

>TC93153 similar to GP|14715220|emb|CAC44106. gag polyprotein {Cicer
           arietinum}, partial (8%)
          Length = 516

 Score =  141 bits (356), Expect = 4e-34
 Identities = 64/136 (47%), Positives = 90/136 (66%)
 Frame = +2

Query: 43  LDKFLKRSPPKFEGGYNPDGAYEWVRELERIFETLVCAEPRKVAFASYLLSSEARTWWTS 102
           L+ FL+  PP F+G Y PDGA +W++E+ERIF  + C E +KV F +++L+ EA  WW S
Sbjct: 107 LETFLRNHPPTFKGRYAPDGA*KWLKEIERIFRVMQCFETQKVQFGTHMLAEEADDWWIS 286

Query: 103 VRGRITPEEGELIWEIFKNSFLEKYFPADAKCRKEMEFFELKQEAMSVGEYAAKFEELCQ 162
           +   +  ++  + W +F+  FL +YFP D + +KE+EF ELKQ  MSV EYAAKF EL  
Sbjct: 287 LLPVLEQDDAVVTWAMFRKEFLGRYFPEDVRGKKEIEFLELKQGDMSVTEYAAKFVELAT 466

Query: 163 FHPRYSTANDETSKCV 178
           F+P YS    E SKC+
Sbjct: 467 FYPHYSAETAEFSKCI 514


>AL366725 
          Length = 485

 Score = 75.1 bits (183), Expect = 5e-14
 Identities = 44/140 (31%), Positives = 66/140 (46%), Gaps = 12/140 (8%)
 Frame = +2

Query: 162 QFHPRYSTANDETSKCVKFEYGLRPDIRTVVGHDQIRNFATLVKKCRIFEENEKT*KEYL 221
           +F+P Y+    E SKC+KFE GLRPDI+  +G+ Q+R F  LV  CRI+EE+ K   + +
Sbjct: 2   KFYPHYAAETAEFSKCIKFENGLRPDIKRAIGYQQLRVFPDLVNTCRIYEEDTKAHDKVV 181

Query: 222 KGLGVNKAPKKKEEKRKPYF*PQDQGRNQFGRTWNPTG----------GFGFQGRPGGF- 270
                 +  K +  + KPY  P D+G+ +      P             +G +G      
Sbjct: 182 N----ERKTKGQ*SRPKPYSAPADKGKQRMVDDRRPKKKDAPAEIVCFNYGEKGHKSNVC 349

Query: 271 -NNTPFCGRCRRNGHRAQEC 289
                 C RC + GH   +C
Sbjct: 350 PKEIKKCVRCDKKGHIVADC 409


>TC87383 similar to GP|19168656|emb|CAD26175. DNA-DIRECTED RNA POLYMERASE II
           {Encephalitozoon cuniculi}, partial (0%)
          Length = 1247

 Score = 50.4 bits (119), Expect = 1e-06
 Identities = 38/142 (26%), Positives = 58/142 (40%), Gaps = 13/142 (9%)
 Frame = -2

Query: 1   EEMAAMARALEQLT-------AFLTEQAERARQGAGQGAANNQEEIYHGLDKF----LKR 49
           +EM  M R ++QL        A L  Q +R +       + +         +F    +K 
Sbjct: 910 QEMEKMRR*IQQLQEIVNAQQALLEAQQKRFKDHVSSSDSLSSRSSRSQRREFQMNDIK* 731

Query: 50  SPPKFEGGYNPDGAYEWVRELERIFETLVCAEPRKVAFASYLLSSEARTWWTSVRGRITP 109
             P FEG   PD   +W++ +ER+F+     E +KV   +  L   A  WW +V+ R   
Sbjct: 730 DIPDFEGNLQPDDLLDWLQIMERLFKYKEVLEEQKVKIVAAKLKKLASIWWENVKRRRKR 551

Query: 110 EEGELI--WEIFKNSFLEKYFP 129
           E    I  WE  +     KY P
Sbjct: 550 EGKSKIKTWEKMRQKLTRKYLP 485


>TC87382 similar to EGAD|146423|156195 vitellogenin {Anolis pulchellus},
           partial (7%)
          Length = 2304

 Score = 50.1 bits (118), Expect = 2e-06
 Identities = 37/142 (26%), Positives = 61/142 (42%), Gaps = 13/142 (9%)
 Frame = +2

Query: 1   EEMAAMARALEQLTAFLTEQ-----AERAR-QGAGQGAANNQEEIYHGLDKFLKRSP--- 51
           +EM  M R ++QL   +  Q     AE+ R +G    + ++     H   + L+ +    
Sbjct: 443 QEMEDMRRQIQQLQEIINAQQALLEAEQRRFEGDVSYSDSSSSRSSHSQRRQLQMNDIKV 622

Query: 52  --PKFEGGYNPDGAYEWVRELERIFETLVCAEPRKVAFASYLLSSEARTWWTSVRGRITP 109
             P FEG    D   +W++ +ER+FE     E +KV   +  L   A  WW +++ R   
Sbjct: 623 DIPDFEGNLQLDDFLDWLQTIERVFEYKEVPEEQKVKIVAAKLKKHALIWWENLKRRRKR 802

Query: 110 EEGELI--WEIFKNSFLEKYFP 129
           E    I  W+  +     KY P
Sbjct: 803 EGKSKIKTWDKMRQKLTRKYLP 868


>BG647713 homologue to GP|15042313|gb| 232R {Chilo iridescent virus}, partial
           (1%)
          Length = 726

 Score = 46.2 bits (108), Expect = 2e-05
 Identities = 35/139 (25%), Positives = 60/139 (42%), Gaps = 13/139 (9%)
 Frame = +2

Query: 2   EMAAMARALEQLTAFLTEQ-----AERARQ---GAGQGAANNQEEIYHGLD---KFLKRS 50
           EM  M R ++ L   +  Q     A+R R    G+G  +++++    H        +K  
Sbjct: 164 EMEEMRRQIQLLQETVNAQQALLEAQRRRNDDDGSGSDSSSSRSSRSHRRQTRMSKIKVD 343

Query: 51  PPKFEGGYNPDGAYEWVRELERIFETLVCAEPRKVAFASYLLSSEARTWWTSVRGRITPE 110
            P F G   PD   +W++ +ER+F+    AE +KV   +  L   A  WW +++ +   E
Sbjct: 344 IPDF*GKLQPDEFVDWLQTIERVFKYKEVAEEQKVKIVAAKLKKHASIWWKNLKRKRNCE 523

Query: 111 EGELI--WEIFKNSFLEKY 127
               I  W+  +     KY
Sbjct: 524 GKSKIKTWDKMRQKLTRKY 580


>BF521488 weakly similar to GP|18447433|gb RE28280p {Drosophila
           melanogaster}, partial (84%)
          Length = 470

 Score = 36.6 bits (83), Expect = 0.018
 Identities = 29/71 (40%), Positives = 31/71 (42%), Gaps = 1/71 (1%)
 Frame = +2

Query: 247 GRNQFGRTWNPTGGFGFQGRP-GGFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGSG 305
           GR  FG  +NP GGFG  GRP GGF    F GR  R G          GG  G     S 
Sbjct: 194 GRPGFGGGYNPYGGFG-GGRPYGGFGGGGFGGRPFRGG----------GGGGGGSASASA 340

Query: 306 NPNPTTTVGGG 316
           + N     GGG
Sbjct: 341 SAN---AAGGG 364


>TC92636 homologue to GP|15042313|gb|AAK82093.1 232R {Chilo iridescent
           virus}, partial (1%)
          Length = 772

 Score = 34.3 bits (77), Expect = 0.091
 Identities = 24/84 (28%), Positives = 39/84 (45%), Gaps = 10/84 (11%)
 Frame = +3

Query: 2   EMAAMARALEQLTAFLTEQ-----AERAR---QGAGQGAANNQEEIYHG--LDKFLKRSP 51
           EM  M R +++L   +  Q     AER R    G+   +++     +    L   +K   
Sbjct: 258 EMEEMRRQIQELQETVNAQQAILEAERRRVDDDGSSDSSSSRSSRSHRRKTLMNDIKVDI 437

Query: 52  PKFEGGYNPDGAYEWVRELERIFE 75
           P FEG   PD   +W++ +ER+FE
Sbjct: 438 PDFEGELQPDEFVDWLQAIERVFE 509


>TC87381 
          Length = 814

 Score = 31.6 bits (70), Expect = 0.59
 Identities = 17/60 (28%), Positives = 27/60 (44%)
 Frame = +3

Query: 47  LKRSPPKFEGGYNPDGAYEWVRELERIFETLVCAEPRKVAFASYLLSSEARTWWTSVRGR 106
           +K   P FEG    D   + ++ +E +FE     E  KV   +  L   A  WW +++ R
Sbjct: 621 IKVDIPDFEGELQSDEFVD*LQAIECVFEYKEIPEDHKVKVVAV*LKKHALIWWENLKRR 800


>TC82733 similar to GP|10177404|dbj|BAB10535.
           gene_id:K24M7.12~pir||S42136~similar to unknown protein
           {Arabidopsis thaliana}, partial (57%)
          Length = 710

 Score = 30.0 bits (66), Expect = 1.7
 Identities = 11/14 (78%), Positives = 11/14 (78%)
 Frame = +3

Query: 276 CGRCRRNGHRAQEC 289
           C RCRR GHRAQ C
Sbjct: 357 CLRCRRRGHRAQNC 398


>TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4.150 -
           Arabidopsis thaliana, partial (17%)
          Length = 378

 Score = 29.6 bits (65), Expect = 2.2
 Identities = 17/42 (40%), Positives = 19/42 (44%)
 Frame = +1

Query: 276 CGRCRRNGHRAQECIVVLGGQSGVQTGGSGNPNPTTTVGGGS 317
           C  C  +GH A+EC    GG  G   GG G        GGGS
Sbjct: 16  CYNCGESGHMARECTSGGGGGGGRYGGGGGGGGGGG--GGGS 135


>AL382998 similar to GP|23491723|db formin homology protein A {Dictyostelium
           discoideum}, partial (3%)
          Length = 355

 Score = 29.3 bits (64), Expect = 2.9
 Identities = 17/51 (33%), Positives = 20/51 (38%), Gaps = 6/51 (11%)
 Frame = +3

Query: 294 GGQSGVQTGGSGNPNPTTTVGGG------SNRNANVPPRDNRRVGHPANKG 338
           GG+   Q  G G P P    GGG       NR    PP   R+   P  +G
Sbjct: 18  GGEGRKQRKGGGGPPPPPGGGGGPPPPPKKNRGGGPPPGVPRKFSPPPKRG 170


>BQ144502 weakly similar to GP|6523547|emb hydroxyproline-rich glycoprotein
           DZ-HRGP {Volvox carteri f. nagariensis}, partial (39%)
          Length = 1358

 Score = 28.9 bits (63), Expect = 3.8
 Identities = 20/53 (37%), Positives = 21/53 (38%)
 Frame = +1

Query: 253 RTWNPTGGFGFQGRPGGFNNTPFCGRCRRNGHRAQECIVVLGGQSGVQTGGSG 305
           R W P G     GR G     P  G     G  A +C   LGG  G  TGG G
Sbjct: 409 RGW-PAGSGALGGRRGARGRAPRGGG--GGGGAASDCGGALGGDGGGGTGGGG 558


>TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
           {Oryza sativa (japonica cultivar-group)}, partial (16%)
          Length = 421

 Score = 28.5 bits (62), Expect = 5.0
 Identities = 10/21 (47%), Positives = 13/21 (61%)
 Frame = +3

Query: 269 GFNNTPFCGRCRRNGHRAQEC 289
           GF+    C  C+R GH A+EC
Sbjct: 327 GFSRDNLCKNCKRPGHYAREC 389


>TC79732 weakly similar to GP|6729021|gb|AAF27017.1| hypothetical protein
           {Arabidopsis thaliana}, partial (19%)
          Length = 654

 Score = 28.5 bits (62), Expect = 5.0
 Identities = 12/38 (31%), Positives = 24/38 (62%)
 Frame = +1

Query: 106 RITPEEGELIWEIFKNSFLEKYFPADAKCRKEMEFFEL 143
           ++ P +GE  W +FKN  ++ Y  A++  + ++EF E+
Sbjct: 328 KVYPRKGET-WALFKNWDIKWYMDAESHQKYDLEFVEI 438


>BQ151174 similar to GP|11036868|gb| PxORF73 peptide {Plutella xylostella
           granulovirus}, partial (42%)
          Length = 909

 Score = 28.1 bits (61), Expect = 6.5
 Identities = 16/49 (32%), Positives = 22/49 (44%), Gaps = 5/49 (10%)
 Frame = +1

Query: 295 GQSGVQTGGSGNPNPTTTVGGGSNRNANVPP-----RDNRRVGHPANKG 338
           G++  ++ G   P PT   GGG  R A  PP     R+  + G P   G
Sbjct: 568 GETHKKSPGGRTPPPTPRPGGGGARGAPAPPPQKKQREGGKSGPPRAPG 714


>TC91504 similar to GP|21618032|gb|AAM67082.1 unknown {Arabidopsis
           thaliana}, partial (37%)
          Length = 863

 Score = 28.1 bits (61), Expect = 6.5
 Identities = 13/34 (38%), Positives = 18/34 (52%), Gaps = 1/34 (2%)
 Frame = +3

Query: 95  EARTW-WTSVRGRITPEEGELIWEIFKNSFLEKY 127
           E ++W WT V G I P    L+W +    F+E Y
Sbjct: 546 EIKSWGWTEVYGYIMPFVWMLLWSLVLKFFIEVY 647


>BG449394 similar to PIR|A83006|A830 hypothetical protein PA5121 [imported] -
           Pseudomonas aeruginosa (strain PAO1), partial (1%)
          Length = 684

 Score = 28.1 bits (61), Expect = 6.5
 Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 2/33 (6%)
 Frame = -2

Query: 256 NPTGGFGFQGRPGGFNNTPFCGRC--RRNGHRA 286
           +P    G +G P  F+  PFCGR   RR G R+
Sbjct: 512 SPNSRRGGRGAPSLFSTAPFCGRSSRRRGGGRS 414


>TC86584 similar to GP|11994215|dbj|BAB01337. contains similarity to
            RNA-binding protein~gene_id:MOA2.5 {Arabidopsis
            thaliana}, partial (72%)
          Length = 1574

 Score = 27.7 bits (60), Expect = 8.5
 Identities = 12/25 (48%), Positives = 14/25 (56%)
 Frame = +2

Query: 265  GRPGGFNNTPFCGRCRRNGHRAQEC 289
            G PG FN+  FCG C  NG +   C
Sbjct: 1115 GGPGAFNSNCFCGIC--NGRKCHCC 1183


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.320    0.137    0.423 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,243,230
Number of Sequences: 36976
Number of extensions: 177919
Number of successful extensions: 868
Number of sequences better than 10.0: 36
Number of HSP's better than 10.0 without gapping: 842
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 865
length of query: 411
length of database: 9,014,727
effective HSP length: 98
effective length of query: 313
effective length of database: 5,391,079
effective search space: 1687407727
effective search space used: 1687407727
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (27.3 bits)
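
Note on the statistics above: the effective lengths, bit scores and Expect values in this report can be re-derived from the footer figures. The Python sketch below is a hedged illustration, not NCBI code. It assumes the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2, E = search space * 2^-S'), that the TBLASTN "length of database" is the nucleotide letter count divided by 3, and that the single-HSP Expect is divided by (1 - decay const), i.e. 0.9. That last step is an interpretation of the "decay const" line that happens to reproduce the reported values; it is not stated in the report. All function and constant names are hypothetical.

```python
import math

# Figures copied from the report footer.
QUERY_LEN = 411              # "length of query"
DB_LETTERS_NT = 27_044_181   # nucleotide letters in MTGI
NUM_SEQS = 36_976            # sequences in MTGI
HSP_LEN = 98                 # "effective HSP length"
LAMBDA_GAPPED = 0.267        # gapped Lambda (rounded as printed)
K_GAPPED = 0.0410            # gapped K (rounded as printed)
DECAY = 0.1                  # "decay const"; its use below is an assumption


def effective_search_space():
    """Effective query length times effective database length."""
    db_len_aa = DB_LETTERS_NT // 3               # translated length: 9,014,727
    eff_query = QUERY_LEN - HSP_LEN              # 313
    eff_db = db_len_aa - NUM_SEQS * HSP_LEN      # 5,391,079
    return eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw score (the number in parentheses) to bits."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score, decay=DECAY):
    """Approximate Expect for a single HSP.

    Dividing by (1 - decay) is an assumed correction, chosen because it
    matches the values printed in this report.
    """
    e = effective_search_space() * 2.0 ** (-bit_score(raw_score))
    return e / (1.0 - decay)


if __name__ == "__main__":
    print("effective search space:", effective_search_space())
    # Raw scores taken from the TC93153, AL366725, TC87381 and TC86584 hits.
    for raw in (356, 183, 70, 60):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.2g}")
```

Because Lambda and K are printed to only three significant digits, recomputed values agree with the report only approximately.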

