Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0212.11
         (511 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr...    57  2e-08
AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsi...    57  2e-08
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen...    54  2e-07
BG586293 weakly similar to PIR|E84473|E84 probable retroelement ...    49  5e-06
BG587156 similar to PIR|G85055|G8 probable polyprotein [imported...    48  1e-05
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate...    44  2e-04
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid...    43  3e-04
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2...    35  0.002
BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vu...    31  0.99
TC79362 WD-repeat cell cycle regulatory protein                        30  2.9
BE941052 weakly similar to PIR|B85188|B85 retrotransposon like p...    29  4.9
TC86982 similar to GP|10946337|gb|AAG24863.1 CONSTANS-like prote...    28  6.4

>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
           sativa}, partial (9%)
          Length = 542

 Score = 56.6 bits (135), Expect = 2e-08
 Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 3/105 (2%)
 Frame = +2

Query: 361 DNDFKKSASGYLIKFVGGVVALKSRLHRCIALSTFEAEFIDITEACKEFL*LKKLFQD*V 420
           D + +KS SGY      G ++  S+    +A ST EAE+I  T    + + L+++ +  V
Sbjct: 17  DTETRKSTSGYAFHLGTGAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRILE--V 190

Query: 421 L---SRTNTCCFVIVKVRFILRKSPTFHSTSKHIDVRCHWIRDAL 462
           +     T T  +   K    L K+P FH  SKHID++ H IR+ +
Sbjct: 191 MHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQFHKIRELI 325


>AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsis thaliana},
           partial (9%)
          Length = 675

 Score = 56.6 bits (135), Expect = 2e-08
 Identities = 33/101 (32%), Positives = 54/101 (52%), Gaps = 4/101 (3%)
 Frame = +1

Query: 7   MDVMKNEMKSWHDNCSFHLVKLPKGKKALENKWICMVKLESNSTSLRYKAILMVEDFRHG 66
           +  MK E K+  DN ++ LV LP  KKA+  KW+  VK   + +  ++KA L+ + F   
Sbjct: 100 LQAMKTEYKALIDNKTWDLVPLPPHKKAIGCKWVYRVKENPDGSVNKFKARLVAKGFSQT 279

Query: 67  KSVDSNE----MVKMSFIEIVLSLVATLDLEVKQIDVKAAF 103
              D  E    ++K   I ++L++  T   E++QID+  AF
Sbjct: 280 LGCDYTETFSPVIKPVTIRLILTIAITYKWEIQQIDINNAF 402


>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana, partial
           (10%)
          Length = 814

 Score = 53.5 bits (127), Expect = 2e-07
 Identities = 29/105 (27%), Positives = 52/105 (48%)
 Frame = +1

Query: 358 YVVDNDFKKSASGYLIKFVGGVVALKSRLHRCIALSTFEAEFIDITEACKEFL*LKKLFQ 417
           Y  + D +KS SG++    G  ++ K+     + LST +AE+I   E  K+ + LK +  
Sbjct: 112 YAGNVDTRKSLSGFVFTLYGTTISWKANQQSVVTLSTTQAEYIAFVEGVKDAIWLKGMIG 291

Query: 418 D*VLSRTNTCCFVIVKVRFILRKSPTFHSTSKHIDVRCHWIRDAL 462
           +  +++         +    L     +H  +KHID+R H+IRD +
Sbjct: 292 ELGITQEYVKIHCDSQSAIHLANHQVYHERTKHIDIRLHFIRDMI 426


>BG586293 weakly similar to PIR|E84473|E84 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana, partial
           (7%)
          Length = 763

 Score = 48.9 bits (115), Expect = 5e-06
 Identities = 30/97 (30%), Positives = 53/97 (53%), Gaps = 4/97 (4%)
 Frame = +2

Query: 17  WHDNCSFHLVKLPKGKKALENKWICMVKLESNSTSLRYKAILMVEDFRHGKSVDSNE--- 73
           ++   +  LVK P G K +  +WI  +K   + T ++YKA L+ + +   + +D +E   
Sbjct: 41  YYQKQTLKLVKKPTGVKPIGLRWIYKIKRNEDGTLIKYKARLVAKGYVKQQGIDFDEVFA 220

Query: 74  -MVKMSFIEIVLSLVATLDLEVKQIDVKAAFPSW*FV 109
            +V++  I ++L+L AT    +  IDVK AF +  FV
Sbjct: 221 PVVRIETI*LLLALAATNGC*IHHIDVKIAFLNGHFV 331


>BG587156 similar to PIR|G85055|G8 probable polyprotein [imported] -
           Arabidopsis thaliana, partial (17%)
          Length = 618

 Score = 47.8 bits (112), Expect = 1e-05
 Identities = 31/88 (35%), Positives = 48/88 (54%), Gaps = 4/88 (4%)
 Frame = -1

Query: 20  NCSFHLVKLPKGKKALENKWICMVKLESNSTSLRYKAILMVEDFRHGKSVDSNE----MV 75
           N +++  +LPKGKKA+ ++WI  +K +++ +  R K  L+   F      D  E    + 
Sbjct: 480 NDTWYESELPKGKKAVSSRWIFTIKYKADGSIERKKTRLVARGFTLTYGEDYIETFAPVA 301

Query: 76  KMSFIEIVLSLVATLDLEVKQIDVKAAF 103
           K+  I IVLSL   L   + Q+DVK AF
Sbjct: 300 KLHTIRIVLSLAVNLGWGLWQMDVKNAF 217


>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
           polyprotein from transposon TNT 1-94 [Contains: Protease
           (EC 3.4.23.-);, partial (7%)
          Length = 705

 Score = 43.5 bits (101), Expect = 2e-04
 Identities = 27/80 (33%), Positives = 44/80 (54%), Gaps = 1/80 (1%)
 Frame = +1

Query: 361 DNDFKKSASGYLIKFVGGVVALKSRLHRCIALSTFEAEFI-DITEACKEFL*LKKLFQD* 419
           D+D +KS +GY+    GG V+  S+L   +ALST EAE++    +  ++    K  +++ 
Sbjct: 106 DHDKRKSTTGYVFTLAGGAVSWLSKLQTVVALSTTEAEYMAAYLKHARKLFGCKG*WRNS 285

Query: 420 VLSRTNTCCFVIVKVRFILR 439
             SR    C V V+V  IL+
Sbjct: 286 GTSRNKLLCIVTVRVPCILQ 345



 Score = 35.8 bits (81), Expect = 0.040
 Identities = 14/61 (22%), Positives = 35/61 (56%)
 Frame = +3

Query: 402 ITEACKEFL*LKKLFQD*VLSRTNTCCFVIVKVRFILRKSPTFHSTSKHIDVRCHWIRDA 461
           + +ACKE + +++L ++    +     +   +    + ++P FHS +KHI ++ H++R+ 
Sbjct: 231 LPQACKEAIWMQRLMEELGHKQEQITVYCDSQSALHIARNPAFHSRTKHIGIQYHFVREV 410

Query: 462 L 462
           +
Sbjct: 411 V 413


>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
           thaliana}, partial (14%)
          Length = 778

 Score = 42.7 bits (99), Expect = 3e-04
 Identities = 26/85 (30%), Positives = 41/85 (47%), Gaps = 4/85 (4%)
 Frame = +3

Query: 10  MKNEMKSWHDNCSFHLVKLPKGKKALENKWICMVKLESNSTSLRYKAILMVEDFRHGKSV 69
           M NEM++   N ++ L  L  G K +  KWI   KL  N    +YKA L+ + +     V
Sbjct: 90  MNNEMEATERNNTWELTDLRSGAKTIGLKWIFKTKLNENGEIEKYKARLVAKGYSQQYGV 269

Query: 70  DSNE----MVKMSFIEIVLSLVATL 90
           D  E    + +   I +V++L A +
Sbjct: 270 DYTEVFAPVARWDTIRMVIALAAQI 344


>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
           Arabidopsis thaliana, partial (11%)
          Length = 732

 Score = 35.4 bits (80), Expect(2) = 0.002
 Identities = 18/43 (41%), Positives = 24/43 (54%)
 Frame = +1

Query: 358 YVVDNDFKKSASGYLIKFVGGVVALKSRLHRCIALSTFEAEFI 400
           Y  D D +KS SGY+     G V+  S+    + LST +AEFI
Sbjct: 385 YAGDLDDRKSTSGYVFMLSSGAVSWSSKKQPVVTLSTTKAEFI 513



 Score = 23.5 bits (49), Expect(2) = 0.002
 Identities = 14/46 (30%), Positives = 21/46 (45%)
 Frame = +3

Query: 288 CW*FDVCNGVHKTRYSACCW*NQEMSNKSG*RALECCEIDFEVSLW 333
           CW  DV +  HKT  + C   N ++   S *    C +   ++S W
Sbjct: 180 CWLSDVLSS-HKT*SNVCVKSN*QIHELSN*VTHACSQKSTQISQW 314


>BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vulgaris},
           partial (13%)
          Length = 494

 Score = 31.2 bits (69), Expect = 0.99
 Identities = 17/54 (31%), Positives = 31/54 (56%)
 Frame = +1

Query: 365 KKSASGYLIKFVGGVVALKSRLHRCIALSTFEAEFIDITEACKEFL*LKKLFQD 418
           ++S SGY+ KF    ++  ++     ALS++EAE+I  T A  + L L  + ++
Sbjct: 316 RRSTSGYVFKFNDAAISWCTKKQPITALSSYEAEYIAGTFATFQALWLDSVIKE 477


>TC79362 WD-repeat cell cycle regulatory protein
          Length = 1800

 Score = 29.6 bits (65), Expect = 2.9
 Identities = 11/24 (45%), Positives = 14/24 (57%)
 Frame = +2

Query: 284 LCICCW*FDVCNGVHKTRYSACCW 307
           LC+ CW       +  TRYS+CCW
Sbjct: 794 LCLFCW-------LGSTRYSSCCW 844


>BE941052 weakly similar to PIR|B85188|B85 retrotransposon like protein
           [imported] - Arabidopsis thaliana, partial (4%)
          Length = 480

 Score = 28.9 bits (63), Expect = 4.9
 Identities = 10/23 (43%), Positives = 15/23 (64%)
 Frame = +2

Query: 438 LRKSPTFHSTSKHIDVRCHWIRD 460
           L  +P +HS  KHI +  H++RD
Sbjct: 53  LTHNPVYHSRMKHISIDIHFVRD 121


>TC86982 similar to GP|10946337|gb|AAG24863.1 CONSTANS-like protein {Ipomoea
           nil}, partial (54%)
          Length = 1643

 Score = 28.5 bits (62), Expect = 6.4
 Identities = 15/54 (27%), Positives = 32/54 (58%)
 Frame = +2

Query: 321 LECCEIDFEVSLWF*LLIV*GFVLEVISLPMGLD*F*YVVDNDFKKSASGYLIK 374
           ++ C ++  +S+W  L+++ G V++ +SLP+ L     +V+N+  K    +L K
Sbjct: 632 MDICLVEKLMSIWTLLIVILGGVMKTLSLPIILIIMMSIVNNNNNKIIMVFLKK 793


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.367    0.164    0.593 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,748,498
Number of Sequences: 36976
Number of extensions: 215657
Number of successful extensions: 3437
Number of sequences better than 10.0: 24
Number of HSP's better than 10.0 without gapping: 1415
Number of HSP's successfully gapped in prelim test: 155
Number of HSP's that attempted gapping in prelim test: 1958
Number of HSP's gapped (non-prelim): 1674
length of query: 511
length of database: 9,014,727
effective HSP length: 100
effective length of query: 411
effective length of database: 5,317,127
effective search space: 2185339197
effective search space used: 2185339197
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 14 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 36 (21.7 bits)
S2: 60 (27.7 bits)


Lotus: description of TM0212.11