Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0229.10
         (277 letters)

Database: GMGI 
           63,676 sequences; 37,918,896 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

BE211208                                                               80  8e-16
CO982036                                                               80  1e-15
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete              65  3e-11
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co...    64  1e-10
BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {...    39  1e-07
BF068614 similar to GP|27817858|db OJ1081_B12.7 {Oryza sativa (j...    49  2e-06
BI321712                                                               49  2e-06
TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (F...    46  2e-05
BU548243                                                               45  3e-05
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag...    40  0.001
BU764568                                                               40  0.002
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara...    37  0.010
BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberos...    37  0.013
TC232995                                                               36  0.017
AI855899 similar to GP|2244960|emb| retrotransposon like protein...    36  0.017
BI972503 weakly similar to GP|18378607|gb polyprotein {Oryza sat...    33  0.11
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr...    33  0.11
BM731813 weakly similar to PIR|T02323|T02 nodulin-like protein [...    33  0.14
AW759031 weakly similar to GP|22093573|db polyprotein {Oryza sat...    33  0.18
BM143109                                                               32  0.31

>BE211208 
          Length = 413

 Score = 80.5 bits (197), Expect = 8e-16
 Identities = 49/121 (40%), Positives = 69/121 (56%), Gaps = 13/121 (10%)
 Frame = +2

Query: 1   DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
           DIIITG S+ L+ + +  LN  F LKQLG L++FLGIEV     G+++LTQ +YI DL  
Sbjct: 50  DIIITGRSNYLIQSLVHHLNSNFSLKQLGQLDYFLGIEVHHTPTGSVLLTQSKYICDLLH 229

Query: 61  KVNISYGEKLQIPHNRGC-------------SFYGSIAGSLQ*LTITRPEQSYSVKKVCP 107
           K +++  + +  P                  + Y S+ G+LQ  TITRPE S++  KVC 
Sbjct: 230 KTDMAEAKPISSPMVTNLRLSKNGDDLLSDPTMYRSVVGALQYPTITRPEISFAANKVCQ 409

Query: 108 F 108
           F
Sbjct: 410 F 412


>CO982036 
          Length = 674

 Score = 79.7 bits (195), Expect = 1e-15
 Identities = 53/133 (39%), Positives = 73/133 (54%), Gaps = 13/133 (9%)
 Frame = -2

Query: 1   DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
           DIIITGSS  L+ N  SKLN  F LK LG L++F+ IEVKS+ D  L+ +    I ++  
Sbjct: 631 DIIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPD--LLFSLRTSIFEIFC 458

Query: 61  KVNISYGEKLQIPHNRGC-------------SFYGSIAGSLQ*LTITRPEQSYSVKKVCP 107
           +      + +  P    C             +FY S+ G+LQ  T+ RPE S++V KVC 
Sbjct: 457 RKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVIRPEISFAVNKVCQ 278

Query: 108 FMSSLHDSHRTAI 120
           FMS+  DSH T +
Sbjct: 277 FMSNPLDSHWTEV 239


>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
          Length = 4731

 Score = 65.1 bits (157), Expect = 3e-11
 Identities = 75/309 (24%), Positives = 132/309 (42%), Gaps = 32/309 (10%)
 Frame = +1

Query: 1    DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
            DI+  G S+ ++ +F+ ++   F +  +G L +FLG++VK + D ++ L+Q RY +++  
Sbjct: 3790 DIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMED-SIFLSQSRYAKNIVK 3966

Query: 61   KVNISYGEKLQIP-------------HNRGCSFYGSIAGSLQ*LTITRPEQSYSVKKVCP 107
            K  +      + P              +   S Y S+ GSL  LT +RP+ +Y+V     
Sbjct: 3967 KFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCAR 4146

Query: 108  FMSSLHDSHRTAINASSTNSKAMSLLASSQACFSRLSFI----SSWV**C*LGHGS*RSS 163
            + ++   SH T +          S         S    +    + W      G    R S
Sbjct: 4147 YQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWA-----GSADDRKS 4311

Query: 164  FYLRCVYLLRPILIFWRS*KQTVVSGSSSEAEYRSLGLSSC-----------RYVKDPDS 212
                C YL    LI W S KQ  VS S++EAEY + G SSC            Y  + D 
Sbjct: 4312 TSGGCFYLGNN-LISWFSKKQNCVSLSTAEAEYIAAG-SSCSQLVWMKQMLKEYNVEQDV 4485

Query: 213  ITW----LKVVNLNTAPSIF*QVYHHFT*L*HYPTC*YYMELDILFFRKIVINRQLTAQH 268
            +T     +  +N++  P     V H  T          ++++   + R +V ++ +T +H
Sbjct: 4486 MTLYCDNMSAINISKNP-----VQHSRT---------KHIDIRHHYIRDLVDDKVITLKH 4623

Query: 269  IPADLQLAN 277
            +  + Q+A+
Sbjct: 4624 VDTEEQIAD 4650


>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
          Length = 4734

 Score = 63.5 bits (153), Expect = 1e-10
 Identities = 77/311 (24%), Positives = 136/311 (42%), Gaps = 34/311 (10%)
 Frame = +1

Query: 1    DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
            DI+  G S+ ++ +F+ ++   F +  +G L +FLG++VK + D ++ L+Q +Y +++  
Sbjct: 3793 DIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMED-SIFLSQSKYAKNIVK 3969

Query: 61   KVNISYGEKLQIP-------------HNRGCSFYGSIAGSLQ*LTITRPEQSYSVKKVC- 106
            K  +      + P              +   S Y S+ GSL  LT +RP+ +Y+V  VC 
Sbjct: 3970 KFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAV-GVCA 4146

Query: 107  -----PFMSSLHDSHRTAINASSTNSKAMSLLASSQACFSRLSFISSWV**C*LGHGS*R 161
                 P +S L+   R     + T+   +     S +        + W      G    R
Sbjct: 4147 RYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYC-DADWA-----GSADDR 4308

Query: 162  SSFYLRCVYLLRPILIFWRS*KQTVVSGSSSEAEYRSLGLSSC-----------RYVKDP 210
             S    C Y L   LI W S KQ  VS S++EAEY + G SSC            Y  + 
Sbjct: 4309 KSTSGGCFY-LGTNLISWFSKKQNCVSLSTAEAEYIAAG-SSCSQLVWMKQMLKEYNVEQ 4482

Query: 211  DSITW----LKVVNLNTAPSIF*QVYHHFT*L*HYPTC*YYMELDILFFRKIVINRQLTA 266
            D +T     +  +N++  P     V H  T          ++++   + R +V ++ +T 
Sbjct: 4483 DVMTLYCDNMSAINISKNP-----VQHSRT---------KHIDIRHHYIRDLVDDKVITL 4620

Query: 267  QHIPADLQLAN 277
            +H+  + Q+A+
Sbjct: 4621 EHVDTEEQIAD 4653


>BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {Vitis
           vinifera}, partial (19%)
          Length = 437

 Score = 38.9 bits (89), Expect(2) = 1e-07
 Identities = 22/58 (37%), Positives = 32/58 (54%)
 Frame = +2

Query: 1   DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL 58
           DI+ITG+    +      L   F  K LG  E+FLGIEV    DG ++++Q +Y  D+
Sbjct: 59  DIVITGNDQGKIAQLKGHLFSHFQTKDLGKFEYFLGIEVAQSKDG-IIISQRKYALDI 229



 Score = 33.9 bits (76), Expect(2) = 1e-07
 Identities = 18/37 (48%), Positives = 21/37 (56%)
 Frame = +1

Query: 81  YGSIAGSLQ*LTITRPEQSYSVKKVCPFMSSLHDSHR 117
           Y  + G L  LTITRP  S+ V  V  FM S H+ HR
Sbjct: 325 YRILVGKLIYLTITRPNISFVVGVVSQFMQSPHNDHR 435


>BF068614 similar to GP|27817858|db OJ1081_B12.7 {Oryza sativa (japonica
           cultivar-group)}, partial (3%)
          Length = 413

 Score = 49.3 bits (116), Expect = 2e-06
 Identities = 27/56 (48%), Positives = 36/56 (64%)
 Frame = +2

Query: 1   DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIR 56
           DI ITG+   L+   IS+LN  F LKQLG L++FLGIEVK L D  +   + ++ R
Sbjct: 65  DITITGNCVLLIQQLISQLNSQFALKQLGLLDYFLGIEVKYLPDKGISCPRLKWQR 232



 Score = 47.4 bits (111), Expect = 7e-06
 Identities = 27/68 (39%), Positives = 35/68 (50%), Gaps = 13/68 (19%)
 Frame = +1

Query: 56  RDL*AKVNISYGEKLQIPHNRGC-------------SFYGSIAGSLQ*LTITRPEQSYSV 102
           RDL  K  ++  + +  P    C             + YGS+ G+LQ  T+TRPE SYSV
Sbjct: 199 RDLLPKTKMAEAQPISSPMVSSCKLSKSGSDLFQDPTLYGSVVGALQYATLTRPEFSYSV 378

Query: 103 KKVCPFMS 110
            KVC FMS
Sbjct: 379 NKVCQFMS 402


>BI321712 
          Length = 399

 Score = 48.9 bits (115), Expect = 2e-06
 Identities = 35/122 (28%), Positives = 60/122 (48%), Gaps = 13/122 (10%)
 Frame = -3

Query: 1   DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
           D+I TG++ ++   F   ++  F +  +G + ++LGIEVK   D  + +TQ  Y +++  
Sbjct: 367 DLIFTGNNPSMFEEFKKDMSNEFEMTDMGLMAYYLGIEVKQ-EDKGIFITQEGYAKEVLK 191

Query: 61  KVNISYGEKLQIP---------HNRG----CSFYGSIAGSLQ*LTITRPEQSYSVKKVCP 107
           K  +     +  P         H +G     + Y S+ GSL+ LT TRP+  Y V  V  
Sbjct: 190 KFKMDDANPVGTPMECGSKLSKHEKGENVDPTLYKSLIGSLRYLTCTRPDILYVVGVVSR 11

Query: 108 FM 109
           +M
Sbjct: 10  YM 5


>TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (Fragment),
           partial (16%)
          Length = 562

 Score = 45.8 bits (107), Expect = 2e-05
 Identities = 29/115 (25%), Positives = 56/115 (48%), Gaps = 13/115 (11%)
 Frame = +1

Query: 1   DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
           D+++T   + L+  F  ++   F +  LG + +FLGIE+K  S   +++ Q +Y +++  
Sbjct: 217 DLLVTRDDARLVEEFKQEMMQAFEMTNLGLMTYFLGIEIKQ-SQNKVLICQRKYAKEILK 393

Query: 61  KVNISYGEKLQIPHNRGCSF-------------YGSIAGSLQ*LTITRPEQSYSV 102
           K  +   + +  P N+   F             Y S+ G L  LT TRP+  +++
Sbjct: 394 KFQMEECKSVSTPMNQKEKFNKVDGADKIDEGYYRSLIGCLMYLTATRPDILFAI 558


>BU548243 
          Length = 599

 Score = 45.4 bits (106), Expect = 3e-05
 Identities = 37/116 (31%), Positives = 54/116 (45%), Gaps = 16/116 (13%)
 Frame = -1

Query: 172 LRPILIFWRS*KQTVVSGSSSEAEYRSLGLSSCRYVKDPDSITWLKVVNLN-----TAPS 226
           L P LI W S KQ V + SS+EAEYRS+  +S         +TW++ + +      T P 
Sbjct: 527 LGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSA-------ELTWIQALLMELQIPFTPPV 369

Query: 227 IF*Q-----------VYHHFT*L*HYPTC*YYMELDILFFRKIVINRQLTAQHIPA 271
           I              V+H  T          +ME+D+ F  + V+++QL   HIPA
Sbjct: 368 ILCDNKSAVAIAHNLVFHSRT---------KHMEIDVFFVHEKVLSKQLQIFHIPA 228


>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment),
           partial (30%)
          Length = 687

 Score = 40.0 bits (92), Expect = 0.001
 Identities = 19/42 (45%), Positives = 28/42 (66%)
 Frame = +2

Query: 176 LIFWRS*KQTVVSGSSSEAEYRSLGLSSCRYVKDPDSITWLK 217
           L+ W+S KQTVV+ SS+EAEYRS+ + +C        + W+K
Sbjct: 104 LVSWKSKKQTVVARSSAEAEYRSMAMVTC-------ELMWIK 208


>BU764568 
          Length = 420

 Score = 39.7 bits (91), Expect = 0.002
 Identities = 20/42 (47%), Positives = 28/42 (66%)
 Frame = +3

Query: 176 LIFWRS*KQTVVSGSSSEAEYRSLGLSSCRYVKDPDSITWLK 217
           LI W+S KQ+VV+ SS+EAEYR++ L +C  +       WLK
Sbjct: 198 LISWKSKKQSVVAKSSAEAEYRAMALVTCELI-------WLK 302


>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
           thaliana}, partial (18%)
          Length = 421

 Score = 37.0 bits (84), Expect = 0.010
 Identities = 22/58 (37%), Positives = 34/58 (57%)
 Frame = -2

Query: 1   DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL 58
           DII+ G S    +   + L+L F +K LG L++FLG+EV     G + ++Q +Y  DL
Sbjct: 264 DIILAGDSIDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLG-ITISQRKYCLDL 94


>BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial
           (21%)
          Length = 421

 Score = 36.6 bits (83), Expect = 0.013
 Identities = 32/114 (28%), Positives = 50/114 (43%), Gaps = 13/114 (11%)
 Frame = +2

Query: 1   DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
           DI+IT   +  +      L   F  K L  L++FLGIEV    DG +V++Q +Y  D+  
Sbjct: 83  DIMITKKDATKIVQLKEHLFNHFQTKDLRYLKYFLGIEVAQSGDG-VVISQRKYALDILE 259

Query: 61  KVNISYGEKLQIPHNRGCSF-------------YGSIAGSLQ*LTITRPEQSYS 101
           +  +     +  P +                  Y  + G L  LTITRP+ S++
Sbjct: 260 ETGMQNCRLVDSPMDPNLKLMAYQSEVYPDPERYRRLVGKLIYLTITRPDISFA 421


>TC232995 
          Length = 1009

 Score = 36.2 bits (82), Expect = 0.017
 Identities = 18/78 (23%), Positives = 39/78 (49%)
 Frame = +2

Query: 1   DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*A 60
           DII   ++ +L   F   +   F +  +G L++FLG+++K    G + + Q +Y ++L  
Sbjct: 218 DIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*G-IFINQSKYCKELIK 394

Query: 61  KVNISYGEKLQIPHNRGC 78
           +  +   + +  P +  C
Sbjct: 395 RFGMDSAKHMSTPMSTNC 448


>AI855899 similar to GP|2244960|emb| retrotransposon like protein
           {Arabidopsis thaliana}, partial (18%)
          Length = 418

 Score = 36.2 bits (82), Expect = 0.017
 Identities = 17/31 (54%), Positives = 22/31 (70%)
 Frame = +1

Query: 81  YGSIAGSLQ*LTITRPEQSYSVKKVCPFMSS 111
           Y  I G+LQ +T+TRP  +Y+V KV  FMSS
Sbjct: 103 YRDIVGALQYVTLTRPNIAYNVNKVSEFMSS 195


>BI972503 weakly similar to GP|18378607|gb polyprotein {Oryza sativa
           (japonica cultivar-group)}, partial (5%)
          Length = 327

 Score = 33.5 bits (75), Expect = 0.11
 Identities = 30/109 (27%), Positives = 48/109 (43%), Gaps = 13/109 (11%)
 Frame = +2

Query: 5   TGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL*AKVNI 64
           TG+ +  +      L   F  K LG L++FLGIEV + S   +V++Q +Y  D+  +  +
Sbjct: 2   TGNDATKIVQLKEHLFSHFQTKDLGYLKYFLGIEV-AQSGVDVVISQRKYALDILEETGM 178

Query: 65  SYGEKLQIPHNRGCSF-------------YGSIAGSLQ*LTITRPEQSY 100
                +  P +                  Y  + G L  LTITRP+ S+
Sbjct: 179 QNCRPVDSPMDPNLKLMADQSEIYHDPERYRRLVGKLIYLTITRPDISF 325


>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
           Hopscotch polyprotein, partial (7%)
          Length = 1446

 Score = 33.5 bits (75), Expect = 0.11
 Identities = 13/32 (40%), Positives = 24/32 (74%)
 Frame = +2

Query: 176 LIFWRS*KQTVVSGSSSEAEYRSLGLSSCRYV 207
           L+ W+S K  VV+ SS+EAEY+++ +++C  +
Sbjct: 80  LVLWKSNK*NVVARSSAEAEYKAMTVATCELI 175


>BM731813 weakly similar to PIR|T02323|T02 nodulin-like protein [imported] -
           Arabidopsis thaliana, partial (2%)
          Length = 406

 Score = 33.1 bits (74), Expect = 0.14
 Identities = 29/70 (41%), Positives = 37/70 (52%), Gaps = 3/70 (4%)
 Frame = -2

Query: 127 SKAMSLLASSQACFSRLSFISSWV**C*LGHGS*RSS---FYLRCVYLLRPILIFWRS*K 183
           S+++    SS ACFS     SSW+        S  SS   F LRC  +LR  L+F+RS K
Sbjct: 309 SRSLLESKSSSACFSFFHSPSSWIIRRPSESTSTSSSNLDFLLRCSDVLRADLVFFRSSK 130

Query: 184 QTVVSGSSSE 193
               SGSSS+
Sbjct: 129 S---SGSSSD 109


>AW759031 weakly similar to GP|22093573|db polyprotein {Oryza sativa
           (japonica cultivar-group)}, partial (5%)
          Length = 430

 Score = 32.7 bits (73), Expect = 0.18
 Identities = 20/47 (42%), Positives = 25/47 (52%)
 Frame = +3

Query: 1   DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTL 47
           DI+ITG     ++   S L+  F  K LG   +FLGIEV   S G L
Sbjct: 135 DIVITGDDFDGIHRLKSHLHNKFQTKDLGPPNYFLGIEVAQSSSGFL 275


>BM143109 
          Length = 415

 Score = 32.0 bits (71), Expect = 0.31
 Identities = 16/58 (27%), Positives = 31/58 (52%)
 Frame = +1

Query: 1   DIIITGSSSALMNNFISKLNLVFLLKQLG*LEHFLGIEVKSLSDGTLVLTQFRYIRDL 58
           DII   ++ +L   F   +   F +  +  L  FLG+++K   +G + ++Q +Y +DL
Sbjct: 214 DIIFGSTNDSLCKKFSQDMQNEFEMSMMRELNFFLGLQIKQTKNG-IFISQSKYCKDL 384


  Database: GMGI
    Posted date:  Oct 22, 2004  4:58 PM
  Number of letters in database: 37,918,896
  Number of sequences in database:  63,676
  
Lambda     K      H
   0.348    0.152    0.497 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,250,689
Number of Sequences: 63676
Number of extensions: 199415
Number of successful extensions: 1516
Number of sequences better than 10.0: 53
Number of HSP's better than 10.0 without gapping: 1496
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1504
length of query: 277
length of database: 12,639,632
effective HSP length: 96
effective length of query: 181
effective length of database: 6,526,736
effective search space: 1181339216
effective search space used: 1181339216
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 14 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.8 bits)
S2: 58 (26.9 bits)
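
Note on the statistics above: the effective lengths, the search space, and the bit scores can be re-derived from the printed parameters. The sketch below is only an illustration and is not part of the BLAST output. It assumes that TBLASTN counts the database in translated letters (nucleotide letters divided by 3), that the gapped Lambda and K apply to the gapped alignments listed, and it uses the standard Karlin-Altschul relations. The function names are hypothetical. Because the printed parameters are rounded, the computed E-value only approximates the reported one (same order of magnitude as 8e-16 for BE211208).

```python
import math

# Values copied from the report footer.
QUERY_LEN = 277            # "length of query"
DB_LETTERS_NT = 37_918_896 # "Number of letters in database" (nucleotides)
NUM_SEQS = 63_676          # "Number of sequences in database"
EFF_HSP_LEN = 96           # "effective HSP length"
LAMBDA_GAPPED = 0.267      # gapped Lambda
K_GAPPED = 0.0410          # gapped K


def effective_search_space():
    """Effective query length times effective database length."""
    # Assumption: database length is expressed in translated letters.
    db_len_aa = DB_LETTERS_NT // 3                  # 12,639,632
    eff_query = QUERY_LEN - EFF_HSP_LEN             # 181
    eff_db = db_len_aa - NUM_SEQS * EFF_HSP_LEN     # 6,526,736
    return eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score):
    """Approximate E-value: K * search_space * exp(-lambda * S)."""
    return K_GAPPED * effective_search_space() * math.exp(-LAMBDA_GAPPED * raw_score)


if __name__ == "__main__":
    print(effective_search_space())        # 1181339216, as in the footer
    print(round(bit_score(197), 1))        # raw score 197 (hit BE211208)
    print(f"{expect_value(197):.1e}")      # approximate, parameters are rounded
```

With raw score 197 the bit score comes out close to the 80.5 bits reported for BE211208, and with raw score 195 close to the 79.7 bits reported for CO982036.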

