Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC144731.14 + phase: 0 /pseudo
         (1343 letters)

Database: sprot 
           164,201 sequences; 59,974,054 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from tran...   188  1e-46
COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contain...   128  1e-28
YMU0_YEAST (Q04670) Transposon Ty1 protein B                          127  2e-28
YMT5_YEAST (Q04214) Transposon Ty1 protein B                          127  2e-28
YME4_YEAST (Q04711) Transposon Ty1 protein B                          127  2e-28
YMD9_YEAST (Q03434) Transposon Ty1 protein B                          127  2e-28
YJZ9_YEAST (P47100) Transposon Ty1 protein B                          127  2e-28
YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B)       126  3e-28
YJZ7_YEAST (P47098) Transposon Ty1 protein B                          125  6e-28
YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein      70  3e-11
POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.2...    54  3e-06
POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23...    49  1e-04
POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse transcript...    49  1e-04
POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse transcript...    47  4e-04
POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.2...    46  8e-04
M300_ARATH (P93293) Hypothetical mitochondrial protein AtMg00300...    46  8e-04
GAG_SIVCZ (P17282) Gag polyprotein [Contains: Core protein p18; ...    46  8e-04
GAG_HV1MA (P04594) Gag polyprotein [Contains: Core protein p17 (...    46  8e-04
GAG_HV1BR (P03348) Gag polyprotein [Contains: Core protein p17 (...    45  0.001
GAG_HV2KR (Q74119) Gag polyprotein [Contains: Core protein p16; ...    45  0.001

>POLX_TOBAC (P10978) Retrovirus-related Pol polyprotein from
           transposon TNT 1-94 [Contains: Protease (EC 3.4.23.-);
           Reverse transcriptase (EC 2.7.7.49); Endonuclease]
          Length = 1328

 Score =  188 bits (477), Expect = 1e-46
 Identities = 172/658 (26%), Positives = 279/658 (42%), Gaps = 64/658 (9%)

Query: 15  RFTGKN-YPAWEFQFRMYVKGNKLWSHLDDVSKAPTEKAALEEWEYKDAQIISWILSSID 73
           +F G N +  W+ + R  +    L   LD  SK P    A E+W   D +  S I   + 
Sbjct: 10  KFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKA-EDWADLDERAASAIRLHLS 68

Query: 74  PQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLELEIANYKQGNLYVQEFYSGFLNLW 133
             ++NN+    TA+ +W  L+ +Y       +  L+ +        LY      G    +
Sbjct: 69  DDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQ--------LYALHMSEG--TNF 118

Query: 134 TEHSAIIHADVPKASLAAVQEVYNTSRRDQFLMKLRPEFEVVRGALLNRNPVPSLDTCVG 193
             H  + +  + + +   V+ +    +    L  L   ++ +   +L+      L     
Sbjct: 119 LSHLNVFNGLITQLANLGVK-IEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTS 177

Query: 194 ELLREEQRLLTQGTMSHDAFISEPVPVAYAAQSRGKGRDMRQVQCFTCKQFGHVARSCTA 253
            LL  E ++  +      A I+E    +Y   S   GR   + +     +          
Sbjct: 178 ALLLNE-KMRKKPENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSK-------SRV 229

Query: 254 KFCKYCKQNGHVIFDCPIRPPRRTQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQM 313
           + C  C Q GH   DCP   PR+ +  T       ++AA   +   +D   L     ++ 
Sbjct: 230 RNCYNCNQPGHFKRDCP--NPRKGKGETSGQKNDDNTAA---MVQNNDNVVLFINEEEEC 284

Query: 314 VLAALSNMGIHGKSSNVSRPWFLDSGASNHMTGSSE-YLHNLHSYHGNQQIQIADGNKLS 372
                  M + G  S     W +D+ AS+H T   + +   +    G   +++ + +   
Sbjct: 285 -------MHLSGPESE----WVVDTAASHHATPVRDLFCRYVAGDFGT--VKMGNTSYSK 331

Query: 373 ITDVGDI--------NSDFQDVLVSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSG 424
           I  +GDI            +DV   P L  NL+S   L  +     F+     + +    
Sbjct: 332 IAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTK--GS 389

Query: 425 KVIAKGPKVGRLFPLQFISSHLSLACNNVLNSYED------WHRKLGHPNSTVLSHLFKT 478
            VIAKG   G L+         +  C   LN+ +D      WH+++GH +   L  L K 
Sbjct: 390 LVIAKGVARGTLYRTN------AEICQGELNAAQDEISVDLWHKRMGHMSEKGLQILAKK 443

Query: 479 GLLGNKQVVCTASISCPVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYK 538
            L+   +   T    C  C   K   + F + + R  N  ++++SDV G   I S    K
Sbjct: 444 SLISYAKG--TTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNK 501

Query: 539 YFVTFIDDYSRFTWIYFLRSKSEVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQ 598
           YFVTFIDD SR  W+Y L++K +VF +F+KF   VE +    +K  RS++GGEY S EF+
Sbjct: 502 YFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFE 561

Query: 599 EYLQHKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEALSTVCF 656
           EY    GI  +++ P TPQ NG+AER NR +++  RS+L  A +P  FW EA+ T C+
Sbjct: 562 EYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACY 619


>COPI_DROME (P04146) Copia protein (Gag-int-pol protein) [Contains:
           Copia VLP protein; Copia protease (EC 3.4.23.-)]
          Length = 1409

 Score =  128 bits (321), Expect = 1e-28
 Identities = 101/414 (24%), Positives = 183/414 (43%), Gaps = 39/414 (9%)

Query: 256 CKYCKQNGHVIFDCPIRPPRRTQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQMVL 315
           C +C + GH+  DC         +  + L+           T+ S G +   + +     
Sbjct: 232 CHHCGREGHIKKDC--------FHYKRILNNKNKENEKQVQTATSHGIAFMVKEVNNT-- 281

Query: 316 AALSNMGIHGKSSNVSRPWFLDSGASNHMTGSSEYLHNLHSYHGNQQIQIA-DGNKLSIT 374
           + + N G           + LDSGAS+H+        +        +I +A  G  +  T
Sbjct: 282 SVMDNCG-----------FVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYAT 330

Query: 375 DVG------DINSDFQDVLVSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIA 428
             G      D     +DVL     A NL+SV +L +   ++ F ++G  + +  +G ++ 
Sbjct: 331 KRGIVRLRNDHEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISK--NGLMVV 388

Query: 429 KGPKVGRLFPLQFISSHLSLACNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVC 488
           K   +    P+    ++   A +   N++  WH + GH +   L  + +  +  ++ ++ 
Sbjct: 389 KNSGMLNNVPVINFQAYSINAKHK--NNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLN 446

Query: 489 TASISCPVCKLA---KSKTLPFPS---GAHRASNCFEMIHSDVWGMSPIASHAHYKYFVT 542
              +SC +C+     K   LPF       H     F ++HSDV G     +     YFV 
Sbjct: 447 NLELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLF-VVHSDVCGPITPVTLDDKNYFVI 505

Query: 543 FIDDYSRFTWIYFLRSKSEVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQ 602
           F+D ++ +   Y ++ KS+VFSMF+ F+   E  F   V     ++G EY+S+E +++  
Sbjct: 506 FVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCV 565

Query: 603 HKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEALSTVCF 656
            KGI    + P+TPQ NG++ER  R + +  R+++  A +   FW EA+ T  +
Sbjct: 566 KKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATY 619



 Score = 33.1 bits (74), Expect = 5.2
 Identities = 22/110 (20%), Positives = 46/110 (41%), Gaps = 4/110 (3%)

Query: 7   EKAKDFCVRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAPTEKAALEEWEYKDAQIIS 66
           +KAK     F G+ Y  W+F+ R  +    +   +D +     +    + W+  +    S
Sbjct: 2   DKAKRNIKPFDGEKYAIWKFRIRALLAEQDVLKVVDGLMPNEVD----DSWKKAERCAKS 57

Query: 67  WILSSIDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLELEIANYK 116
            I+  +    +N   S  TA+++   L  +Y + + A +  L   + + K
Sbjct: 58  TIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLLSLK 107


>YMU0_YEAST (Q04670) Transposon Ty1 protein B
          Length = 1328

 Score =  127 bits (319), Expect = 2e-28
 Identities = 98/347 (28%), Positives = 165/347 (47%), Gaps = 37/347 (10%)

Query: 336 LDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNK--LSITDVGDINSDFQD-------V 386
           LDSGAS  +  S+   H++HS   N  I + D  K  + I  +GD+   FQD       V
Sbjct: 33  LDSGASRTLIRSA---HHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKV 89

Query: 387 LVSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQ---FIS 443
           L +P +A +LLS+ +L   +    F++    V E+  G V+A   K G  + +     + 
Sbjct: 90  LHTPNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVKYGDFYWVSKKYLLP 146

Query: 444 SHLSL-ACNNVLNS-------YEDWHRKLGHPNSTVLSHLFKTGLL---GNKQVVCTASI 492
           S++S+   NNV  S       Y   HR L H N+  + +  K   +       V  +++I
Sbjct: 147 SNISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAI 206

Query: 493 S--CPVCKLAKSKTLPFPSGAH----RASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDD 546
              CP C + KS       G+      +   F+ +H+D++G       +   YF++F D+
Sbjct: 207 DYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDE 266

Query: 547 YSRFTWIYFLRSKSE--VFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHK 604
            ++F W+Y L  + E  +  +F   L +++ QFQASV + + + G EY +    ++L+  
Sbjct: 267 TTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKN 326

Query: 605 GILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEAL 651
           GI    +     + +G+AER NR LLD  R+ L  + +P   W  A+
Sbjct: 327 GITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAI 373


>YMT5_YEAST (Q04214) Transposon Ty1 protein B
          Length = 1328

 Score =  127 bits (319), Expect = 2e-28
 Identities = 98/347 (28%), Positives = 165/347 (47%), Gaps = 37/347 (10%)

Query: 336 LDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNK--LSITDVGDINSDFQD-------V 386
           LDSGAS  +  S+   H++HS   N  I + D  K  + I  +GD+   FQD       V
Sbjct: 33  LDSGASRTLIRSA---HHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKV 89

Query: 387 LVSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQ---FIS 443
           L +P +A +LLS+ +L   +    F++    V E+  G V+A   K G  + +     + 
Sbjct: 90  LHTPNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVKYGDFYWVSKKYLLP 146

Query: 444 SHLSL-ACNNVLNS-------YEDWHRKLGHPNSTVLSHLFKTGLL---GNKQVVCTASI 492
           S++S+   NNV  S       Y   HR L H N+  + +  K   +       V  +++I
Sbjct: 147 SNISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAI 206

Query: 493 S--CPVCKLAKSKTLPFPSGAH----RASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDD 546
              CP C + KS       G+      +   F+ +H+D++G       +   YF++F D+
Sbjct: 207 DYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDE 266

Query: 547 YSRFTWIYFLRSKSE--VFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHK 604
            ++F W+Y L  + E  +  +F   L +++ QFQASV + + + G EY +    ++L+  
Sbjct: 267 TTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKN 326

Query: 605 GILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEAL 651
           GI    +     + +G+AER NR LLD  R+ L  + +P   W  A+
Sbjct: 327 GITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAI 373


>YME4_YEAST (Q04711) Transposon Ty1 protein B
          Length = 1328

 Score =  127 bits (319), Expect = 2e-28
 Identities = 98/347 (28%), Positives = 165/347 (47%), Gaps = 37/347 (10%)

Query: 336 LDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNK--LSITDVGDINSDFQD-------V 386
           LDSGAS  +  S+   H++HS   N  I + D  K  + I  +GD+   FQD       V
Sbjct: 33  LDSGASRTLIRSA---HHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKV 89

Query: 387 LVSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQ---FIS 443
           L +P +A +LLS+ +L   +    F++    V E+  G V+A   K G  + +     + 
Sbjct: 90  LHTPNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVKYGDFYWVSKKYLLP 146

Query: 444 SHLSL-ACNNVLNS-------YEDWHRKLGHPNSTVLSHLFKTGLL---GNKQVVCTASI 492
           S++S+   NNV  S       Y   HR L H N+  + +  K   +       V  +++I
Sbjct: 147 SNISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAI 206

Query: 493 S--CPVCKLAKSKTLPFPSGAH----RASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDD 546
              CP C + KS       G+      +   F+ +H+D++G       +   YF++F D+
Sbjct: 207 DYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDE 266

Query: 547 YSRFTWIYFLRSKSE--VFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHK 604
            ++F W+Y L  + E  +  +F   L +++ QFQASV + + + G EY +    ++L+  
Sbjct: 267 TTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKN 326

Query: 605 GILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEAL 651
           GI    +     + +G+AER NR LLD  R+ L  + +P   W  A+
Sbjct: 327 GITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAI 373


>YMD9_YEAST (Q03434) Transposon Ty1 protein B
          Length = 1328

 Score =  127 bits (319), Expect = 2e-28
 Identities = 99/347 (28%), Positives = 165/347 (47%), Gaps = 37/347 (10%)

Query: 336 LDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNK--LSITDVGDINSDFQD-------V 386
           LDSGAS  +  S+   H++HS   N  I + D  K  + I  +GD+   FQD       V
Sbjct: 33  LDSGASRTLIRSA---HHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKV 89

Query: 387 LVSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQ---FIS 443
           L +P +A +LLS+ +L   +    F++    V E+  G V+A   K G  + +     + 
Sbjct: 90  LHTPNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVKYGDFYWVSKKYLLP 146

Query: 444 SHLSL-ACNNVLNS-------YEDWHRKLGHPNSTVLSHLFKTGLLG--NKQVVCTASI- 492
           S++S+   NNV  S       Y   HR L H N+  + +  K   +   N+  V  +S  
Sbjct: 147 SNISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDRSSAI 206

Query: 493 --SCPVCKLAKSKTLPFPSGAH----RASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDD 546
              CP C + KS       G+      +   F+ +H+D++G       +   YF++F D+
Sbjct: 207 DYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDE 266

Query: 547 YSRFTWIYFLRSKSE--VFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHK 604
            ++F W+Y L  + E  +  +F   L +++ QFQASV + + + G EY +    ++L+  
Sbjct: 267 TTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKN 326

Query: 605 GILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEAL 651
           GI    +     + +G+AER NR LLD  R+ L  + +P   W  A+
Sbjct: 327 GITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAI 373


>YJZ9_YEAST (P47100) Transposon Ty1 protein B
          Length = 1755

 Score =  127 bits (319), Expect = 2e-28
 Identities = 98/347 (28%), Positives = 165/347 (47%), Gaps = 37/347 (10%)

Query: 336 LDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNK--LSITDVGDINSDFQD-------V 386
           LDSGAS  +  S+   H++HS   N  I + D  K  + I  +GD+   FQD       V
Sbjct: 460 LDSGASRTLIRSA---HHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKV 516

Query: 387 LVSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQ---FIS 443
           L +P +A +LLS+ +L   +    F++    V E+  G V+A   K G  + +     + 
Sbjct: 517 LHTPNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVKYGDFYWVSKKYLLP 573

Query: 444 SHLSL-ACNNVLNS-------YEDWHRKLGHPNSTVLSHLFKTGLL---GNKQVVCTASI 492
           S++S+   NNV  S       Y   HR L H N+  + +  K   +       V  +++I
Sbjct: 574 SNISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAI 633

Query: 493 S--CPVCKLAKSKTLPFPSGAH----RASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDD 546
              CP C + KS       G+      +   F+ +H+D++G       +   YF++F D+
Sbjct: 634 DYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDE 693

Query: 547 YSRFTWIYFLRSKSE--VFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHK 604
            ++F W+Y L  + E  +  +F   L +++ QFQASV + + + G EY +    ++L+  
Sbjct: 694 TTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKN 753

Query: 605 GILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEAL 651
           GI    +     + +G+AER NR LLD  R+ L  + +P   W  A+
Sbjct: 754 GITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAI 800


>YCB9_YEAST (P25384) Transposon Ty2 protein B (Ty1-17 protein B)
          Length = 1770

 Score =  126 bits (317), Expect = 3e-28
 Identities = 97/347 (27%), Positives = 165/347 (46%), Gaps = 37/347 (10%)

Query: 336 LDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNK--LSITDVGDINSDFQD-------V 386
           +DSGAS  +  S+ YLH+      N +I I D  K  + I  +G+++ +FQ+        
Sbjct: 456 IDSGASQTLVRSAHYLHHATP---NSEINIVDAQKQDIPINAIGNLHFNFQNGTKTSIKA 512

Query: 387 LVSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQ---FIS 443
           L +P +A +LLS+ +L + N    F+R      E+  G V+A   K G  + L     I 
Sbjct: 513 LHTPNIAYDLLSLSELANQNITACFTRN---TLERSDGTVLAPIVKHGDFYWLSKKYLIP 569

Query: 444 SHLS-LACNNVLNS-------YEDWHRKLGHPNSTVLSHLFKTGLL-----GNKQVVCTA 490
           SH+S L  NNV  S       Y   HR LGH N   +    K   +      + +    +
Sbjct: 570 SHISKLTINNVNKSKSVNKYPYPLIHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNAS 629

Query: 491 SISCPVCKLAKSKTLPFPSGAH----RASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDD 546
           +  CP C + KS       G+      +   F+ +H+D++G       +   YF++F D+
Sbjct: 630 TYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDE 689

Query: 547 YSRFTWIYFLRSKSE--VFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHK 604
            +RF W+Y L  + E  + ++F   L +++ QF A V + + + G EY +    ++  ++
Sbjct: 690 KTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNR 749

Query: 605 GILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEAL 651
           GI +  +     + +G+AER NR LL+  R+LL  + +P   W  A+
Sbjct: 750 GITACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLPNHLWFSAV 796


>YJZ7_YEAST (P47098) Transposon Ty1 protein B
          Length = 1755

 Score =  125 bits (315), Expect = 6e-28
 Identities = 97/347 (27%), Positives = 165/347 (46%), Gaps = 37/347 (10%)

Query: 336 LDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNK--LSITDVGDINSDFQD-------V 386
           LDSGAS  +  S+   H++HS   N  I + D  K  + I  +GD+   FQD       V
Sbjct: 460 LDSGASRTLIRSA---HHIHSASSNPDINVVDAQKRNIPINAIGDLQFHFQDNTKTSIKV 516

Query: 387 LVSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQ---FIS 443
           L +P +A +LLS+ +L   +    F++    V E+  G V+A   + G  + +     + 
Sbjct: 517 LHTPNIAYDLLSLNELAAVDITACFTKN---VLERSDGTVLAPIVQYGDFYWVSKRYLLP 573

Query: 444 SHLSL-ACNNVLNS-------YEDWHRKLGHPNSTVLSHLFKTGLL---GNKQVVCTASI 492
           S++S+   NNV  S       Y   HR L H N+  + +  K   +       V  +++I
Sbjct: 574 SNISVPTINNVHTSESTRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAI 633

Query: 493 S--CPVCKLAKSKTLPFPSGAH----RASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDD 546
              CP C + KS       G+      +   F+ +H+D++G       +   YF++F D+
Sbjct: 634 DYQCPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDE 693

Query: 547 YSRFTWIYFLRSKSE--VFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHK 604
            ++F W+Y L  + E  +  +F   L +++ QFQASV + + + G EY +    ++L+  
Sbjct: 694 TTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKN 753

Query: 605 GILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEAL 651
           GI    +     + +G+AER NR LLD  R+ L  + +P   W  A+
Sbjct: 754 GITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAI 800


>YJL3_YEAST (P47024) Transposon Ty4 207.7 kDa hypothetical protein
          Length = 1803

 Score = 70.5 bits (171), Expect = 3e-11
 Identities = 94/437 (21%), Positives = 170/437 (38%), Gaps = 55/437 (12%)

Query: 256 CKYCKQNGHVIFDCPIRPPRRTQYPTQALHATTSSAAPPTITSAS-DGGSLQPEMIQQMV 314
           C YCK   H   +C  +P R        L  T   +  P I     D   L P   +Q  
Sbjct: 342 CMYCKSVFHCSINCKKKPNRN-------LGLTRPISQKPIIYKVHRDNNHLSPVQNEQKS 394

Query: 315 LAALSNMGIHGKSSNVSRPWFLDSGASNHMTGSSEYLHNLH-SYHGNQQIQIADGNKLSI 373
                      K  N  +   +D+G+  ++T     LHN   S    +   I   + +S+
Sbjct: 395 WNKTQKRS--NKVYNSKKLVIIDTGSGVNITNDKTLLHNYEDSNRSTRFFGIGKNSSVSV 452

Query: 374 TDVGDI-------NSDFQDVLVS--PGLASNLLSVGQLVDNNCNV---NFSRAGCLV--- 418
              G I       N+D + +L    P   S ++S   L      V    ++R G  +   
Sbjct: 453 KGYGYIKIKNGHNNTDNKCLLTYYVPEEESTIISCYDLAKKTKMVLSRKYTRLGNKIIKI 512

Query: 419 -QEQVSGKVIAK----------GPKVGRLFPLQFISSHLSLACNNVLNSYEDWHRKLGHP 467
             + V+G +  K            K+  + P    +S      N    + ED H+++GH 
Sbjct: 513 KTKIVNGVIHVKMNELIERPSDDSKINAIKP----TSSPGFKLNKRSITLEDAHKRMGHT 568

Query: 468 NSTVLSHLFKTGLLGNKQVVCTA--SISCPVCKLAKSKTLPFPSGA-------HRASNCF 518
               + +  K         +        C  CK++K+      +G+       H   + +
Sbjct: 569 GIQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNHSTDHEPGSSW 628

Query: 519 EMIHSDVWGMSPIASHAHYKYFVTFIDDYSRF--TWIYFLRSKSEVFSMFKKFLTYVETQ 576
            M   D++G    ++    +Y +  +D+ +R+  T  +F ++   + +  +K + YVETQ
Sbjct: 629 CM---DIFGPVSSSNADTKRYMLIMVDNNTRYCMTSTHFNKNAETILAQVRKNIQYVETQ 685

Query: 577 FQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSL 636
           F   V+   S+ G E+ + + +EY   KGI    +       NG AER  R ++    +L
Sbjct: 686 FDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIRTIITDATTL 745

Query: 637 LLQASVPPRFWVEALST 653
           L Q+++  +FW  A+++
Sbjct: 746 LRQSNLRVKFWEYAVTS 762


>POL_SFV3L (P27401) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1157

 Score = 53.9 bits (128), Expect = 3e-06
 Identities = 57/234 (24%), Positives = 93/234 (39%), Gaps = 20/234 (8%)

Query: 423  SGKVIAKGPKVGRLFPLQFISSHLSLACNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLG 482
            +G+V+   P   R+ P +     + L  +N+ ++  D         ST L    K     
Sbjct: 798  NGQVMVTRPNGKRIIPPKSDRPQIILQAHNIAHTGRD---------STFLKVSSKYWWPN 848

Query: 483  NKQVVCTASISCPVCKLAKSKTLPFPS--GAHRASNCFEMIHSDVWGMSPIASHAHYKYF 540
             ++ V      C  C +  + TL  P      R    F+    D  G  P+     Y + 
Sbjct: 849  LRKDVVKVIRQCKQCLVTNAATLAAPPILRPERPVKPFDKFFIDYIG--PLPPSNGYLHV 906

Query: 541  VTFIDDYSRFTWIYFLRSKSEVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEY 600
            +  +D  + F W+Y  ++ S   S   K L  + +   A  K+  S+ G  + S  F ++
Sbjct: 907  LVVVDSMTGFVWLYPTKAPST--SATVKALNMLTSI--AVPKVIHSDQGAAFTSATFADW 962

Query: 601  LQHKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEALSTV 654
             ++KGI  + S P  PQ +G  ERKN    D+ R L       P  W + L  V
Sbjct: 963  AKNKGIQLEFSTPYHPQSSGKVERKNS---DIKRLLTKLLVGRPAKWYDLLPVV 1013


>POL_SFV1 (P23074) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1161

 Score = 48.5 bits (114), Expect = 1e-04
 Identities = 37/137 (27%), Positives = 61/137 (44%), Gaps = 9/137 (6%)

Query: 518  FEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVFSMFKKFLTYVETQF 577
            F+  + D  G  P+     Y + +  +D  + F W+Y  ++ S   S   K L  + +  
Sbjct: 884  FDKFYIDYIG--PLPPSNGYLHVLVVVDSMTGFVWLYPTKAPST--SATVKALNMLTSI- 938

Query: 578  QASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSLL 637
             A  K+  S+ G  + S  F ++ + KGI  + S P  PQ +G  ERKN  +  +   LL
Sbjct: 939  -AIPKVLHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKNSDIKRLLTKLL 997

Query: 638  LQASVPPRFWVEALSTV 654
            +     P  W + L  V
Sbjct: 998  IGR---PAKWYDLLPVV 1011


>POL_FOAMV (P14350) Pol polyprotein [Contains: Reverse
           transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
           (RT); Integrase (IN)]
          Length = 886

 Score = 48.5 bits (114), Expect = 1e-04
 Identities = 36/127 (28%), Positives = 55/127 (42%), Gaps = 7/127 (5%)

Query: 528 MSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVFSMFKKFLTYVETQFQASVKIFRSN 587
           + P+     Y Y +  +D  + FTW+Y  ++ S   S   K L  + +   A  K+  S+
Sbjct: 684 IGPLPPSQGYLYVLVVVDGMTGFTWLYPTKAPST--SATVKSLNVLTSI--AIPKVIHSD 739

Query: 588 SGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFW 647
            G  + S  F E+ + +GI  + S P  PQ     ERKN    D+ R L       P  W
Sbjct: 740 QGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKNS---DIKRLLTKLLVGRPTKW 796

Query: 648 VEALSTV 654
            + L  V
Sbjct: 797 YDLLPVV 803


>POL_FENV1 (P31792) Pol polyprotein [Contains: Reverse
           transcriptase/ribonuclease H (EC 2.7.7.49) (EC 3.1.26.4)
           (RT); Integrase (IN)] (Fragment)
          Length = 1046

 Score = 47.0 bits (110), Expect = 4e-04
 Identities = 39/160 (24%), Positives = 69/160 (42%), Gaps = 5/160 (3%)

Query: 493 SCPVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTW 552
           +C VC+   +     P G     N    ++ ++        +A YKY + F+D +S +  
Sbjct: 736 ACKVCQQVNAGATRVPEGKRTRGNR-PGVYWEIDFTEVKPHYAGYKYLLVFVDTFSGWVE 794

Query: 553 IYFLRSKSEVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSC 612
            Y  R ++    + KK L  +  +F    K+  S++G  ++S   Q   +  GI  +  C
Sbjct: 795 AYPTRQET-AHMVAKKILEEIFPRF-GLPKVIGSDNGPAFVSQVSQGLARTLGINWKLHC 852

Query: 613 PNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEALS 652
              PQ +G  ER NR + +    L L+  +  + W   LS
Sbjct: 853 AYRPQSSGQVERMNRTIKETLTKLTLETGL--KDWRRLLS 890


>POL_BAEVM (P10272) Pol polyprotein [Contains: Protease (EC 3.4.23.-);
            Reverse transcriptase/ribonuclease H (EC 2.7.7.49) (EC
            3.1.26.4) (RT); Integrase (IN)]
          Length = 1189

 Score = 45.8 bits (107), Expect = 8e-04
 Identities = 39/161 (24%), Positives = 70/161 (43%), Gaps = 7/161 (4%)

Query: 493  SCPVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTW 552
            +C VC+   +     P+G     N    ++ ++        +A YKY + F+D +S   W
Sbjct: 879  ACKVCQQVNAGATRVPAGKRTRGNR-PGVYWEIDFTEVKPHYAGYKYLLVFVDTFS--GW 935

Query: 553  IYFLRSKSEVFSMF-KKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRS 611
            +    ++ E   +  KK L  +  +F    K+  S++G  ++S   Q   +  GI  +  
Sbjct: 936  VEAFPTRQETAHIVAKKILEEIFPRF-GLPKVIGSDNGPAFVSQVSQGLARILGINWKLH 994

Query: 612  CPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEALS 652
            C   PQ +G  ER NR + +    L L+  +  + W   LS
Sbjct: 995  CAYRPQSSGQVERMNRTIKETLTKLTLETGL--KDWRRLLS 1033


>M300_ARATH (P93293) Hypothetical mitochondrial protein AtMg00300
           (ORF145a) (ORF1451)
          Length = 145

 Score = 45.8 bits (107), Expect = 8e-04
 Identities = 29/107 (27%), Positives = 50/107 (46%), Gaps = 9/107 (8%)

Query: 425 KVIAKGPKVGRLFPLQFISSHLSLACNNVLNSYED----WHRKLGHPNSTVLSHLFKTGL 480
           + I KG +   L+ LQ     +    +N+  + +D    WH +L H +   +  L K G 
Sbjct: 36  RTILKGNRHDSLYILQ---GSVETGESNLAETAKDETRLWHSRLAHMSQRGMELLVKKGF 92

Query: 481 LGNKQVVCTASISCPVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWG 527
           L + +V  ++   C  C   K+  + F +G H   N  + +HSD+WG
Sbjct: 93  LDSSKV--SSLKFCEDCIYGKTHRVNFSTGQHTTKNPLDYVHSDLWG 137


>GAG_SIVCZ (P17282) Gag polyprotein [Contains: Core protein p18;
           Core protein p25; Core protein p16]
          Length = 508

 Score = 45.8 bits (107), Expect = 8e-04
 Identities = 26/72 (36%), Positives = 35/72 (48%), Gaps = 11/72 (15%)

Query: 234 RQVQCFTCKQFGHVARSCTA---KFCKYCKQNGHVIFDCPIRP--------PRRTQYPTQ 282
           R+++CF C + GH+AR+C A   K C  C Q GH + DC  R         P R+  P  
Sbjct: 398 RKIKCFNCGKEGHLARNCKAPRRKGCWRCGQEGHQMKDCTGRQVNFLGKGWPSRSGRPGN 457

Query: 283 ALHATTSSAAPP 294
            +   T   APP
Sbjct: 458 FVQNRTEPTAPP 469


>GAG_HV1MA (P04594) Gag polyprotein [Contains: Core protein p17
           (Matrix protein); Core protein p24 (Core antigen); Core
           protein p2; Core protein p7 (Nucleocapsid protein); Core
           protein p1; Core protein p6]
          Length = 504

 Score = 45.8 bits (107), Expect = 8e-04
 Identities = 29/97 (29%), Positives = 43/97 (43%), Gaps = 12/97 (12%)

Query: 227 RGKGRDMRQVQCFTCKQFGHVARSCTA---KFCKYCKQNGHVIFDCPIRP--------PR 275
           RG  +  ++++CF C + GH+AR+C A   K C  C + GH + DC  R         P 
Sbjct: 386 RGNFKGQKRIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCTERQANFLGKIWPS 445

Query: 276 RTQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQ 312
               P   L +     APP   S   G  ++P   Q+
Sbjct: 446 HKGRPGNFLQSRPEPTAPPA-ESFGFGEEIKPSQKQE 481


>GAG_HV1BR (P03348) Gag polyprotein [Contains: Core protein p17
           (Matrix protein); Core protein p24 (Core antigen); Core
           protein p2; Core protein p7 (Nucleocapsid protein); Core
           protein p1; Core protein p6]
          Length = 511

 Score = 45.4 bits (106), Expect = 0.001
 Identities = 48/170 (28%), Positives = 67/170 (39%), Gaps = 26/170 (15%)

Query: 152 VQEVYNTSRRDQFLMKLRPEFEVVRGALLNRNPVPSLDTCVGEL----LREEQRLLTQGT 207
           V   Y T R +Q   +++     +   LL +N  P   T +  L      EE     QG 
Sbjct: 296 VDRFYKTLRAEQASQEVK---NWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGV 352

Query: 208 --MSHDAFI-----SEPVPVAYAAQSRGKGRDMRQ-VQCFTCKQFGHVARSCTA---KFC 256
               H A +     S+    A     RG  R+ R+ V+CF C + GH+AR+C A   K C
Sbjct: 353 GGPGHKARVLAEAMSQVTNSATIMMQRGNFRNQRKIVKCFNCGKEGHIARNCRAPRKKGC 412

Query: 257 KYCKQNGHVIFDCPIRP--------PRRTQYPTQALHATTSSAAPPTITS 298
             C + GH + DC  R         P     P   L +     APP + S
Sbjct: 413 WKCGKEGHQMKDCTERQANFLGKIWPSYKGRPGNFLQSRPEPTAPPFLQS 462


>GAG_HV2KR (Q74119) Gag polyprotein [Contains: Core protein p16;
           Core protein p26]
          Length = 521

 Score = 45.1 bits (105), Expect = 0.001
 Identities = 53/210 (25%), Positives = 82/210 (38%), Gaps = 44/210 (20%)

Query: 174 VVRGALLNRNPVPSLDTCVGELLREEQRLLTQGTMSHDAFISEPVPVAYAAQSRGKGRDM 233
           V++G  +N      L  C G +    Q+         +A    P+P A AAQ R      
Sbjct: 335 VLKGLGMNPTLEEMLTACQG-IGGPGQKARLMAEALKEALAPAPIPFA-AAQQR------ 386

Query: 234 RQVQCFTCKQFGHVARSCTA---KFCKYCKQNGHVIFDCPIRP----------PRRTQYP 280
           R ++C+ C + GH AR C A   + C  C ++GHV+ +CP R            +   +P
Sbjct: 387 RTIKCWNCGKDGHSARQCRAPRRQGCWKCGKSGHVMANCPERQAGFLGIGPWGKKPRNFP 446

Query: 281 TQALHATTSSAAPPTITSASDGGSLQPEMIQQMVLAALSNMGIHGKSSNVSRPW------ 334
              +    +  APP    A     L  + +QQ         G   K   + RP+      
Sbjct: 447 VTRVPQGLTPTAPP----ADPAADLLEKYLQQ---------GRKQKEQKM-RPYKEVTED 492

Query: 335 --FLDSGASNHMTGSSEYLHNLHSYHGNQQ 362
              L+ G + H   + + LH L+S  G  Q
Sbjct: 493 LLHLEQGETPHKEATEDLLH-LNSLFGKDQ 521


  Database: sprot
    Posted date:  Nov 25, 2004 10:54 AM
  Number of letters in database: 59,974,054
  Number of sequences in database:  164,201
  
Lambda     K      H
   0.338    0.147    0.468 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 140,063,334
Number of Sequences: 164201
Number of extensions: 5456256
Number of successful extensions: 20771
Number of sequences better than 10.0: 129
Number of HSP's better than 10.0 without gapping: 31
Number of HSP's successfully gapped in prelim test: 98
Number of HSP's that attempted gapping in prelim test: 20515
Number of HSP's gapped (non-prelim): 223
length of query: 1343
length of database: 59,974,054
effective HSP length: 122
effective length of query: 1221
effective length of database: 39,941,532
effective search space: 48768610572
effective search space used: 48768610572
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 72 (32.3 bits)


Medicago: description of AC144731.14