Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0003.6
         (1307 letters)

Database: GMGI 
           63,676 sequences; 37,918,896 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete             503  e-142
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co...   499  e-141
CF922226                                                              156  6e-38
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi...   143  4e-34
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ...   135  2e-31
CO981347                                                              108  1e-29
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V...   120  4e-27
NP005470 reverse transcriptase                                        117  3e-26
TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement p...   116  6e-26
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti...   113  5e-25
BM307983                                                              112  1e-24
BI701169                                                              109  7e-24
CA937893 similar to GP|20805072|dbj retrovirus-related pol polyp...   109  9e-24
CO981879                                                               71  1e-23
AI959950                                                              107  5e-23
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei...   103  4e-22
CO983516                                                               99  1e-20
TC213365 similar to UP|Q6I8L6 (Q6I8L6) Reverse transcriptase (Fr...    97  6e-20
TC232995                                                               92  1e-18
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag...    92  1e-18
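
Each summary line above has the same layout: a sequence identifier, an optional description (cut off with `...` when it is long), a bit score, and an E-value, which BLAST writes as `e-142` when the mantissa is 1. The following is a minimal Python sketch for reading such lines. The names `Hit` and `parse_hit_line` are illustrative assumptions and are not part of BLAST.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    seq_id: str
    description: str
    bits: float
    evalue: float


# identifier, optional description, bit score, E-value (possibly "e-142")
_HIT_RE = re.compile(
    r"^(\S+)\s*(.*?)\s+(\d+(?:\.\d+)?)\s+"
    r"((?:\d+(?:\.\d+)?)?e[-+]?\d+|\d+(?:\.\d+)?)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Parse one line of the hit summary table; return None if it does not fit."""
    m = _HIT_RE.match(line)
    if m is None:
        return None
    seq_id, desc, bits, evalue = m.groups()
    if evalue.startswith("e"):
        # BLAST drops a mantissa of 1, so "e-142" means 1e-142.
        evalue = "1" + evalue
    return Hit(seq_id, desc, float(bits), float(evalue))
```

Header and blank lines do not match the pattern and yield `None`, so the function can be applied to every line between the table header and the first `>` record.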

>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
          Length = 4731

 Score =  503 bits (1295), Expect = e-142
 Identities = 337/1096 (30%), Positives = 546/1096 (49%), Gaps = 18/1096 (1%)
 Frame = +1

Query: 221  RSKSKDRKTTECYSCKQIGHWKRDC------PNRSGKSGNNSSSANVVQSDGSCSEEDLL 274
            + K   RK   C+ C + GH K  C      P+   +S N+      V    + S   L+
Sbjct: 1477 QQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSNSRKKMMWVPKHKAVS---LV 1647

Query: 275  CVSSVKCT--DAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIA 332
              +S++ +  + W LDSGCS HMT  +E+  + +     YV  GD     I GM +    
Sbjct: 1648 VHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGK---- 1815

Query: 333  LDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTA 392
            L   G+ +L++V  V  +  NLIS+  L + G++    ++  ++   K  + +M+  R+ 
Sbjct: 1816 LVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEV-LMKGSRSK 1992

Query: 393  GNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIG-- 450
             N Y             + + +D  ++WH R GHL  RGM ++  +  ++G+ +  I   
Sbjct: 1993 DNCYLWTPQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEG 2172

Query: 451  -LCKYCVLGKQCRVRF-KTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRK 508
             +C  C +GKQ ++   K  H  T  +L+ +H D+ GP +  S+ G RY     DDFSR 
Sbjct: 2173 RICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRF 2352

Query: 509  VWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHF 568
             WV F++ KSE F  FK     ++ +    IK +RSD+G E+ +  F  FC   GI   F
Sbjct: 2353 TWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEF 2532

Query: 569  SVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKV 628
            S   TPQQNG+ ER NRTL E AR +     L   LWA  +N ACY+ NR          
Sbjct: 2533 SAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTT 2712

Query: 629  AEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVK 688
              E+W G    + +  IF  P Y+    E R K+DPKS   I +GY+   + Y++++   
Sbjct: 2713 LYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRT 2892

Query: 689  KKVIVSRDVVFDE*SMLKQSDVTVVPDTEVEN-SSQDKIQVDIEETPVSPRQIVAQQQSE 747
            + V+ S +VV D+ S  ++ DV     T  +N +   K   + E +  +  +    Q  +
Sbjct: 2893 RTVMESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDESNINQPDK 3072

Query: 748  PGSDSGEVQDYTLVRDREPSRITPPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKW 807
              S   +      +   +P+R     R    ++ + +   S  +P    EA+     + W
Sbjct: 3073 RSSTRIQKMHPKELIIGDPNR-GVTTRSREVEIVSNSCFVSKIEPKNVKEALTD---EFW 3240

Query: 808  MSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KER--EKFKAHLVTKGY 865
            ++AM EE+E  K+NE   LV  P G  VIG KW++K K   T +E    + KA LV +GY
Sbjct: 3241 INAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNK---TNEEGVITRNKARLVAQGY 3411

Query: 866  SQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEG 925
            +Q +G+D+DE F+PV R  SIR++L +   +   L QMDVK+ FL+G L E++Y+EQP+G
Sbjct: 3412 TQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKG 3591

Query: 926  FSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFI 985
            F++      V +LK++LYGLKQ+PR WY+R   ++ + GYR+   D  ++V   D  + +
Sbjct: 3592 FADPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQ-DAENLM 3768

Query: 986  FLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQK 1045
               +YVDD++        +     ++  EF+M  +G     LG+++ +   +  ++LSQ 
Sbjct: 3769 IAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDS--IFLSQS 3942

Query: 1046 SYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYAVQLVV*CMLWF 1105
             Y + ++ +F M  A+H  TP   H KLS +++        G S      + ++  +L+ 
Sbjct: 3943 RYAKNIVKKFGMENASHKRTPAPTHLKLSKDEA--------GTSVDQSLYRSMIGSLLYL 4098

Query: 1106 ALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QGARCCS-ISCGICG 1164
               +      V      QA       +QV  +IL+  NG+  +      C + +  G C 
Sbjct: 4099 TASRPDITYAVGVCARYQANPKISHLTQV-KRILKYVNGTSDYGIMYCHCSNPMLVGYCD 4275

Query: 1165 L--CR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGL 1222
                  +   K  +  C Y     I W S  Q+ V++ST EAEY+A   +  + +W+  +
Sbjct: 4276 ADWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQM 4455

Query: 1223 VKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTS 1282
            +KE  VEQ  + L+CD+ SAI ++ N V H+RTKHID+R H IR+L+  + I L+ + T 
Sbjct: 4456 LKEYNVEQDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTE 4635

Query: 1283 ENTTDKLTKPVTSDKF 1298
            E   D  TK + +++F
Sbjct: 4636 EQIADIFTKALDANQF 4683
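
In a TBLASTN alignment the `Query` coordinates count amino acids, while the `Sbjct` coordinates count nucleotides of the translated database sequence. The sketch below shows how the `Frame` value and the number of translated residues can be recovered from those coordinates. It assumes the usual convention that plus-strand frames are numbered from the first base and minus-strand frames from the last base; the function names are illustrative.

```python
def tblastn_frame(sbjct_start: int, sbjct_end: int, sbjct_length: int) -> int:
    """Reading frame implied by 1-based nucleotide coordinates of an HSP."""
    if sbjct_start <= sbjct_end:
        # Plus strand: frame +1 starts at base 1, +2 at base 2, +3 at base 3.
        return (sbjct_start - 1) % 3 + 1
    # Minus strand: frame -1 starts at the last base of the sequence.
    return -((sbjct_length - sbjct_start) % 3 + 1)


def translated_residues(sbjct_start: int, sbjct_end: int) -> int:
    """Number of codons (amino acids) covered by the nucleotide span."""
    return (abs(sbjct_end - sbjct_start) + 1) // 3
```

For the alignment above (`Sbjct` 1477 to 4683, `Length = 4731`) this gives frame +1 and 1069 translated residues. For the CF922226 hit in this report (`Sbjct` 665 down to 9, `Length = 667`) it gives frame -3, matching the reported `Frame = -3`.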


>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
          Length = 4734

 Score =  499 bits (1286), Expect = e-141
 Identities = 335/1094 (30%), Positives = 540/1094 (48%), Gaps = 16/1094 (1%)
 Frame = +1

Query: 221  RSKSKDRKTTECYSCKQIGHWKRDCPNRSGKSGN---NSSSANVVQSDGSCSEEDLLCVS 277
            + K   RK   C+ C + GH K  C +  G   +   +SSS   +          L+  +
Sbjct: 1480 QQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSSSGRKMMWVPKHKIVSLVVHT 1659

Query: 278  SVKCT--DAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDD 335
            S++ +  + W LDSGCS HMT  +E+  + +     YV  GD     I GM +    L  
Sbjct: 1660 SLRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGK----LVH 1827

Query: 336  GGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNI 395
             G+ +L++V  V  +  NLIS+  L + G++    ++  ++   K  + +M+  R+  N 
Sbjct: 1828 DGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEV-LMKGSRSKDNC 2004

Query: 396  YKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIG---LC 452
            Y             + + +D  K+WH R GHL  RGM ++  +  ++G+ +  I    +C
Sbjct: 2005 YLWTPQETSYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRIC 2184

Query: 453  KYCVLGKQCRVRF-KTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWV 511
              C +GKQ ++   K  H  T  +L+ +H D+ GP +  S+ G RY     DDFSR  WV
Sbjct: 2185 GECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWV 2364

Query: 512  YFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVR 571
             F++ KS+ F  FK     ++ +    IK +RSD+G E+ +  F  FC   GI   FS  
Sbjct: 2365 NFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAA 2544

Query: 572  KTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEE 631
             TPQQNG+ ER NRTL E AR +     L   LWA  +N ACY+ NR            E
Sbjct: 2545 ITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYE 2724

Query: 632  VWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKV 691
            +W G    + +  IF  P Y+    E R K+DPKS   I +GY+   + Y++++   + V
Sbjct: 2725 IWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTV 2904

Query: 692  IVSRDVVFDE*SMLKQSDVTVVPDTEVENSSQDKIQVDIEET--PVSPRQIVAQQQSEPG 749
            + S +VV D+ +  ++ DV     T  +N +      +  E     +    + Q    P 
Sbjct: 2905 MESINVVVDDLTPARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDEPNINQPDKRPS 3084

Query: 750  SDSGEVQDYTLVRDREPSRITPPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKWMS 809
                ++    L+       +T   R    ++ + +   S  +P    EA+     + W++
Sbjct: 3085 IRIQKMHPKELIIGDPNRGVT--TRSREIEIVSNSCFVSKIEPKNVKEALTD---EFWIN 3249

Query: 810  AMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KER--EKFKAHLVTKGYSQ 867
            AM EE+E  K+NE   LV  P G  VIG KW++K K   T +E    + KA LV +GY+Q
Sbjct: 3250 AMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNK---TNEEGVITRNKARLVAQGYTQ 3420

Query: 868  HKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFS 927
             +G+D+DE F+PV R  SIR++L +   +   L QMDVK+ FL+G L E+ Y+EQP+GF 
Sbjct: 3421 IEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFV 3600

Query: 928  ETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFL 987
            +      V +LK++LYGLKQ+PR WY+R   ++ + GYR+   D  ++V   D  + +  
Sbjct: 3601 DPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQ-DAENLMIA 3777

Query: 988  LLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSY 1047
             +YVDD++        +     ++  EF+M  +G     LG+++ +   +  ++LSQ  Y
Sbjct: 3778 QIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDS--IFLSQSKY 3951

Query: 1048 VEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYAVQLVV*CMLWFAL 1107
             + ++ +F M  A+H  TP   H KLS +++        G S      + ++  +L+   
Sbjct: 3952 AKNIVKKFGMENASHKRTPAPTHLKLSKDEA--------GTSVDQSLYRSMIGSLLYLTA 4107

Query: 1108 DQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QGARCC-SISCGICGL- 1165
             +      V      QA       +QV  +IL+  NG+  +      C  S+  G C   
Sbjct: 4108 SRPDITYAVGVCARYQANPKISHLNQV-KRILKYVNGTSDYGIMYCHCSDSMLVGYCDAD 4284

Query: 1166 -CR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVK 1224
                +   K  +  C Y     I W S  Q+ V++ST EAEY+A   +  + +W+  ++K
Sbjct: 4285 WAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLK 4464

Query: 1225 ELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSEN 1284
            E  VEQ  + L+CD+ SAI ++ N V H+RTKHID+R H IR+L+  + I L+ + T E 
Sbjct: 4465 EYNVEQDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQ 4644

Query: 1285 TTDKLTKPVTSDKF 1298
              D  TK + +++F
Sbjct: 4645 IADIFTKALDANQF 4686



 Score = 45.4 bits (106), Expect = 2e-04
 Identities = 69/303 (22%), Positives = 112/303 (36%), Gaps = 50/303 (16%)
 Frame = +1

Query: 20  WQRMAKDLLAQKSLQ---KALRDEKPADIATVDWNEM---KEKAAGLITLCVSDDVMNHI 73
           W+ + K     K L    K   + KP +  T + +E+     KA   +   V  ++   I
Sbjct: 118 WKAVIKGWEHPKMLDTEGKPTNELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLI 297

Query: 74  LDLTTLKDVWDKLESLY--MSKTPMNKL-FAKQRLYSLKMQEGGDLQAHVYAFNNILADM 130
              T  KD W+ L++ +   SK  M++L     +  +LKM+E   +         I    
Sbjct: 298 NTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANAC 477

Query: 131 TRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKD--SITLDSISSTL------LPHAQ 182
           T LG  + DE     +L SLP  +D  VT +   +D  ++ +D +  +L      L    
Sbjct: 478 TALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRT 657

Query: 183 RRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAV---------------------------- 214
            ++S      S+ EG   +   D   G   AV                            
Sbjct: 658 EKKSKNLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFD 837

Query: 215 -----DSGKKKRSKSKDRKTTECYSCKQIGHWKRDCPNRSGKSGNNSSSANVVQSDGSCS 269
                +  KK   K    K  +C+ C+  GH K +CP    K     S   V +SD + S
Sbjct: 838 IRKGSEYQKKSDEKPSHSKGIQCHGCEGYGHIKAECPTHLKKQRKGLS---VCRSDDTES 1008

Query: 270  EED 272
            E++
Sbjct: 1009 EQE 1017


>CF922226 
          Length = 667

 Score =  156 bits (394), Expect = 6e-38
 Identities = 89/228 (39%), Positives = 129/228 (56%), Gaps = 13/228 (5%)
 Frame = -3

Query: 91  MSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSL 150
           M+K+ +N+L+ KQ LYS KM E   +   +  FN ++ D+  + VT+DDED+A++LLC L
Sbjct: 665 MTKSLVNRLYXKQSLYSFKMHEDRSVGEQLDLFNKLILDLENIDVTIDDEDQALLLLCYL 486

Query: 151 PGSYDHLVTTLTYGKDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGK 210
           P SY H   TL +G+DS++LD +  T L   +  +  E+   +SGEGL         RGK
Sbjct: 485 PKSYSHFKETLLFGRDSVSLDEV-QTALNSKELNERKEKKSSASGEGL-------TARGK 330

Query: 211 GKAVDSG-KKKRSKSKDRKTTE-------CYSCKQIGHWKRDCPNRSGKSGNN-----SS 257
               DS   KK+ K +++K  E       CY CK+ GH ++ CP R    G+N     S 
Sbjct: 329 TFKKDSEFDKKKQKPENQKNGEGNIFKIRCYHCKKEGHTRKVCPERQKNGGSNNRKKDSG 150

Query: 258 SANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSF 305
           +A +VQ DG  S E L+ VS       W++DSGCS+HMT ++ WF  F
Sbjct: 149 NAAIVQDDGYESAEALM-VSEKNPETKWIMDSGCSWHMTPNKSWFEQF 9


>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
           partial (34%)
          Length = 407

 Score =  143 bits (361), Expect = 4e-34
 Identities = 71/130 (54%), Positives = 90/130 (68%)
 Frame = -2

Query: 827 VQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSI 886
           V LP GK  +GC+WVY  K+  T  E ++ KA LV KGY+Q  GIDY + FSPV + T++
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPT-GEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTV 230

Query: 887 RVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLK 946
           R+ LA+ A     L Q+D+K  FLHG+LEE IY+EQP GF   G+  LVCKL RSLYGLK
Sbjct: 229 RLFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLK 50

Query: 947 QSPRQWYKRF 956
           QSPR W+ +F
Sbjct: 49  QSPRAWFGKF 20


>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
           (japonica cultivar-group)}, partial (10%)
          Length = 463

 Score =  135 bits (339), Expect = 2e-31
 Identities = 70/155 (45%), Positives = 101/155 (65%), Gaps = 3/155 (1%)
 Frame = -3

Query: 810 AMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KERE---KFKAHLVTKGYS 866
           AM EE+   ++N    LV+ P    VIG KWV++ KL     E     + KA LV KGY+
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKL----DEHGIIIRNKARLVAKGYN 291

Query: 867 QHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGF 926
           Q +GIDY+E ++PV R   IR++LA V+ M+  L QMDVK+ FL+G ++E++Y+EQP GF
Sbjct: 290 QEEGIDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGF 111

Query: 927 SETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYML 961
                   V KL+++LYGLKQ+PR WY+R  +++L
Sbjct: 110 EIPDKPTHVYKLQKALYGLKQAPRAWYERISNFLL 6


>CO981347 
          Length = 624

 Score =  108 bits (269), Expect(2) = 1e-29
 Identities = 54/128 (42%), Positives = 74/128 (57%)
 Frame = +2

Query: 520 VFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGV 579
           +F KF+     + NQ G K+K LR+DNG E+  + F  FC + GI+RH  V  TP QNG+
Sbjct: 104 IF*KFRE*HTLIGNQLGTKLKVLRTDNGLEFVLEQFNEFCRKIGIKRHKIVPHTP*QNGL 283

Query: 580 AERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPID 639
           AERMN T+ E+ RC+ L+A L K  W    N   YL+NR P ++L  K   E W+G    
Sbjct: 284 AERMNMTILERVRCMLLSARLPKTFWGEAANTTSYLINRCPSSTLGFKTPMEAWSGETT* 463

Query: 640 LSNLRIFW 647
           L  ++  W
Sbjct: 464 LFRIKGVW 487



 Score = 44.3 bits (103), Expect = 4e-04
 Identities = 23/59 (38%), Positives = 36/59 (60%), Gaps = 2/59 (3%)
 Frame = +3

Query: 637 PIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLW--DPVKKKVIV 693
           P + S L++F   A+ H+    + KLD ++  C+ IGY KGVK YKLW  +P + + I+
Sbjct: 456 PPNYSGLKVFGSLAFDHVK---QGKLDARAVKCVFIGYPKGVKRYKLWKLEPGETRCII 623



 Score = 41.6 bits (96), Expect(2) = 1e-29
 Identities = 18/36 (50%), Positives = 25/36 (69%)
 Frame = +3

Query: 486 PTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVF 521
           P++  +  G  YF+T  DDFSR+VW+Y +K KSE F
Sbjct: 3   PSRVKTHGGSSYFLTIIDDFSRRVWLYVLKNKSESF 110


>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
           vinifera}, partial (34%)
          Length = 409

 Score =  120 bits (301), Expect = 4e-27
 Identities = 65/138 (47%), Positives = 88/138 (63%)
 Frame = +3

Query: 786 LTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKK 845
           L+S   PST  EA+   +   W  AMV+EM++L+ N T  LV LP GK  +GC+WVY  K
Sbjct: 3   LSSLTVPSTIREAL---DHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVK 173

Query: 846 LAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDV 905
           +    K  ++ KA LV KGY+Q  GI+Y + FSPV   T++R+ LA+ A     L Q+D+
Sbjct: 174 VGPNGKV-DRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDI 350

Query: 906 KTTFLHGNLEEQIYIEQP 923
           K  FLHG+LEE IY+EQP
Sbjct: 351 KNAFLHGDLEEDIYMEQP 404


>NP005470 reverse transcriptase
          Length = 267

 Score =  117 bits (293), Expect = 3e-26
 Identities = 54/89 (60%), Positives = 66/89 (73%)
 Frame = +1

Query: 906 KTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGY 965
           +T FLHG LEE I ++QPEGF   G  R V +L+RSLYGLKQSPRQWY  FDS++   G+
Sbjct: 1   RTAFLHGRLEENILMKQPEGFEVQGKERYVSQLQRSLYGLKQSPRQWYMSFDSFITNQGF 180

Query: 966 RRCDYDCCVYVMSLDDGSFIFLLLYVDDM 994
           +R  YDCCVY   ++DG  I+LLLYVDDM
Sbjct: 181 KRSLYDCCVYHNKVEDGLMIYLLLYVDDM 267


>TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement pol
           polyprotein, partial (4%)
          Length = 919

 Score =  116 bits (291), Expect = 6e-26
 Identities = 63/152 (41%), Positives = 91/152 (59%), Gaps = 1/152 (0%)
 Frame = +1

Query: 444 VRSCIIGLCKYCVLGKQCRVRFKTGHH-KTKGILDYVHSDVRGPTKEPSV*GFRYFVTFT 502
           + + + G+C  C +GK+ R  F TG   + K +L  VH D+    + P+     YF+TF 
Sbjct: 307 IMNILDGVCDTCEIGKKHRESFPTGKSWRMKKLLKIVHLDLC-TVEIPTHGDNNYFITFI 483

Query: 503 DDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEEN 562
           DDFS+K+WVYF+K KSE    FK++KA  E Q G K+K L  D G EY   ++  F E++
Sbjct: 484 DDFSKKMWVYFLKQKSEACNAFKMFKAFAEKQNGCKVKALIIDKGQEYL--SYTIFFEKH 657

Query: 563 GIQRHFSVRKTPQQNGVAERMNRTLTEKARCL 594
           GIQ   + + TPQ NGV ER N+T+ +  RC+
Sbjct: 658 GIQHQLTTKYTPQHNGVTERKNKTIMDMVRCM 753


>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
          Length = 558

 Score =  113 bits (283), Expect = 5e-25
 Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 1/177 (0%)
 Frame = +1

Query: 854  EKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGN 913
            ++FKA LV K Y+Q  G DY   FSPV +   + ++ ++       L  +D K  FLHG 
Sbjct: 37   DQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFLHGY 216

Query: 914  LEEQIYIEQPEGFSETGD-GRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDC 972
            LEE++Y+EQP GF   G+   +VC+L RS YGLKQSPR W   F      I Y   + D 
Sbjct: 217  LEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAW--PFLYCGAAIWYDSHEADH 390

Query: 973  CVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGM 1029
             V+      G  I+L++YVDD+ I  +  H +  LK  L  +F  KDLG  +  LG+
Sbjct: 391  SVFYCHSPQGC-IYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558


>BM307983 
          Length = 406

 Score =  112 bits (280), Expect = 1e-24
 Identities = 58/135 (42%), Positives = 82/135 (59%), Gaps = 3/135 (2%)
 Frame = +2

Query: 836 IGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPV---VRHTSIRVVLAL 892
           +GC+W+Y  K        +++KA LV KGY Q  GIDY+E F+     ++  S       
Sbjct: 2   VGCRWIYTVKY*AD-DTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQA 178

Query: 893 VASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQW 952
               +MH  Q DVK  FLHG+LEE++Y+E P G+  +  G  VC+LK++LYGLKQSPR W
Sbjct: 179 QFGWEMH--QFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAW 352

Query: 953 YKRFDSYMLRIGYRR 967
           + RF   ML +GY++
Sbjct: 353 FGRFTQAMLSLGYKQ 397


>BI701169 
          Length = 407

 Score =  109 bits (273), Expect = 7e-24
 Identities = 52/97 (53%), Positives = 67/97 (68%)
 Frame = +2

Query: 498 FVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMH 557
           F  F DDFSRK WVYF K+K EVF  FK +KA VE ++G KIK +RSD G E+    F  
Sbjct: 113 FFLFIDDFSRKTWVYFFKHKLEVFENFKKFKAIVEKESGFKIKAMRSDRGGEF*SNEFQK 292

Query: 558 FCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCL 594
           +C+++GI+R   V ++PQQNGVAER NRT+   AR +
Sbjct: 293 YCDDHGIRRPLMVLRSPQQNGVAERKNRTILNMARSM 403


>CA937893 similar to GP|20805072|dbj retrovirus-related pol polyprotein from
           transposon TNT 1-94-like, partial (7%)
          Length = 412

 Score =  109 bits (272), Expect = 9e-24
 Identities = 53/104 (50%), Positives = 70/104 (66%)
 Frame = -2

Query: 3   GAKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDWNEMKEKAAGLIT 62
           GAKFEV +FDGTGNF LWQ+  KDLLA + L K LRD K  +   +DW E+ E+ A  I 
Sbjct: 315 GAKFEVGKFDGTGNFRLWQKRVKDLLA*QGLLKVLRDSKSNNTEALDWEEL*ERTATTIR 136

Query: 63  LCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLY 106
           LC+ D+ + H+++L    +VW KLES YM K+  NKL+  Q+LY
Sbjct: 135 LCLVDEFLYHMMELAFPGEVWKKLESQYMLKSLTNKLYLMQKLY 4


>CO981879 
          Length = 576

 Score = 70.9 bits (172), Expect(2) = 1e-23
 Identities = 43/102 (42%), Positives = 52/102 (50%)
 Frame = -1

Query: 524 FKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERM 583
           FK +   ++ Q   KIK  RSDNG EY +K+      ENGI    S   TPQQNGVAER 
Sbjct: 567 FKTFFQMIQTQFQVKIKVFRSDNGREYFNKHLSKXXLENGIIHQSSCVDTPQQNGVAERK 388

Query: 584 NRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLD 625
           NR L E AR L       K  W   I    YL N++   +L+
Sbjct: 387 NRHLXEVARALLFQNKAPKYXWGEAILTGTYLKNKNA*QNLE 262



 Score = 58.5 bits (140), Expect(2) = 1e-23
 Identities = 26/59 (44%), Positives = 37/59 (62%)
 Frame = -2

Query: 643 LRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE 701
           L+IF    +VHI   ++ KL+P++K C+ +GY    KGYK +DP  KK  V+ DV F E
Sbjct: 194 LKIFGCTVFVHIHEPNQGKLEPRAKKCVFVGYAPNQKGYKCFDPTSKKTFVTIDVTFFE 18


>AI959950 
          Length = 466

 Score =  107 bits (266), Expect = 5e-23
 Identities = 57/133 (42%), Positives = 84/133 (62%)
 Frame = -1

Query: 808 MSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQ 867
           M AM EE++  +KN  * LV+LP  K+V+G KW++  KL    K   ++KA LV KGYSQ
Sbjct: 397 MKAMQEELDQFQKNNV*KLVKLPKRKKVVGVKWIFCNKLDEDGKV-VRYKARLVAKGYSQ 221

Query: 868 HKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFS 927
            +GIDY + F+ V R   I ++L+     +M L QMDVK+ FL+G +++++Y+EQP GF 
Sbjct: 220 QEGIDYPKTFALVARLEVICILLSFATYSNMKLYQMDVKSAFLNGLIQKEVYVEQPPGFE 41

Query: 928 ETGDGRLVCKLKR 940
                + V KL +
Sbjct: 40  NETLHQHVFKLNK 2


>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
           (Fragment), partial (28%)
          Length = 865

 Score =  103 bits (258), Expect = 4e-22
 Identities = 70/183 (38%), Positives = 95/183 (51%), Gaps = 2/183 (1%)
 Frame = +2

Query: 418 KLWHMRLGH--LSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGI 475
           KL H RLGH  LS+  +M +     +K +       C+ C LGK  R   +    +    
Sbjct: 308 KLLHERLGHPHLSKLKIM-VPSLEKIKDL------FCESCQLGKHVRSSXRHVESRVDSP 466

Query: 476 LDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQT 535
              +H D+ GP +  S+  +RYFVTF D+FS+   V+ MK +SE+ + F     +++ Q 
Sbjct: 467 FLVIHXDIWGPNRVSSM-SYRYFVTFIDEFSQCTRVFLMKERSEILS-FLTSVNKIKTQF 640

Query: 536 GRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLR 595
           G+ IK LRSDN  EY       F    GI   FS   TPQQN +AER NR L E AR L 
Sbjct: 641 GKTIKILRSDNAKEYFSSVISPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLL 820

Query: 596 LNA 598
           L+A
Sbjct: 821 LHA 829


>CO983516 
          Length = 724

 Score = 99.0 bits (245), Expect = 1e-20
 Identities = 50/122 (40%), Positives = 77/122 (62%)
 Frame = +2

Query: 874 DEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGR 933
           D+ F PV R  SIR++L +   +   L QMDVK+ FL+G L E++Y+EQP+GF +     
Sbjct: 356 DKEFHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHPD 535

Query: 934 LVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDD 993
            V +LK++LYGLKQ+PR WY+R    + + GYR+   D  ++V   D  + +   +YVDD
Sbjct: 536 HVYRLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQ-DAENLMIAQIYVDD 712

Query: 994 ML 995
           ++
Sbjct: 713 IV 718


>TC213365 similar to UP|Q6I8L6 (Q6I8L6) Reverse transcriptase (Fragment),
            partial (10%)
          Length = 440

 Score = 96.7 bits (239), Expect = 6e-20
 Identities = 51/102 (50%), Positives = 66/102 (64%)
 Frame = +3

Query: 1197 VAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTK 1256
            VA+STTEA+YMA+ EAAKE +WL GL+ +L + Q    ++ DS SAI    +QV+H RTK
Sbjct: 9    VALSTTEAKYMALTEAAKEGIWLRGLINDLRINQEYANIYYDSLSAICFAKDQVHHDRTK 188

Query: 1257 HIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
            HIDVR+H IR   + R+I + KI T  N  D   K V   KF
Sbjct: 189  HIDVRYHFIR---SERRIKVHKISTLHNPADMFPKLVPKSKF 305


>TC232995 
          Length = 1009

 Score = 92.4 bits (228), Expect = 1e-18
 Identities = 47/159 (29%), Positives = 88/159 (54%)
 Frame = +2

Query: 920  IEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSL 979
            +EQP GF  +     V KL+++LYGLKQ+PR WY+R  +++L   + R   D  +++   
Sbjct: 2    VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFI-KR 178

Query: 980  DDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKK 1039
                 + + +YVDD++  + +     E    +  EF+M  +G  K  LG++I + +    
Sbjct: 179  KHNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*G-- 352

Query: 1040 LWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQS 1078
            ++++Q  Y + ++ RF M  A H+STP++ +  L  ++S
Sbjct: 353  IFINQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDES 469


>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
            (30%)
          Length = 687

 Score = 92.0 bits (227), Expect = 1e-18
 Identities = 45/120 (37%), Positives = 77/120 (63%), Gaps = 1/120 (0%)
 Frame = +2

Query: 1179 CLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGV-EQGGVQLHC 1237
            C++  G  + WKS  Q+ VA S+ EAEY ++A    E +W+   ++EL   E+  ++L+C
Sbjct: 83   CVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCELMWIKQFLQELRFCEELQMKLYC 262

Query: 1238 DSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDK 1297
            D+Q+A+++ +N V+H RTKHI++  H IRE L S++I+ + I +++   D LTK +   K
Sbjct: 263  DNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEIVTEFIGSNDQPVDILTKSLRGPK 442


  Database: GMGI
    Posted date:  Oct 22, 2004  4:58 PM
  Number of letters in database: 37,918,896
  Number of sequences in database:  63,676
  
Lambda     K      H
   0.323    0.138    0.419 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 57,975,398
Number of Sequences: 63676
Number of extensions: 840875
Number of successful extensions: 4602
Number of sequences better than 10.0: 166
Number of HSP's better than 10.0 without gapping: 4375
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4523
length of query: 1307
length of database: 12,639,632
effective HSP length: 108
effective length of query: 1199
effective length of database: 5,762,624
effective search space: 6909386176
effective search space used: 6909386176
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 65 (29.6 bits)
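
The statistics above are related by a few short formulas. The sketch below, in Python, reproduces the effective search space, the bit score of the top hit, and its E-value from the reported values. It assumes the standard Karlin-Altschul relations (bit score S' = (lambda * S - ln K) / ln 2 and E = search space * 2^(-S')) and assumes that the database length in amino acids is the nucleotide letter count divided by 3, which is consistent with the numbers in this report. Function names are illustrative.

```python
import math


def effective_search_space(query_len: int, db_letters_nt: int,
                           db_seqs: int, hsp_len: int) -> int:
    """Effective query length times effective database length."""
    db_len_aa = db_letters_nt // 3            # 37,918,896 nt -> 12,639,632
    eff_query = query_len - hsp_len           # 1307 - 108 -> 1199
    eff_db = db_len_aa - db_seqs * hsp_len    # -> 5,762,624
    return eff_query * eff_db


def bit_score(raw_score: int, lam: float, k: float) -> float:
    """Normalized (bit) score from a raw score and gapped lambda, K."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits: float, search_space: int) -> float:
    """Expected number of chance HSPs with at least this bit score."""
    return search_space * 2.0 ** (-bits)


space = effective_search_space(1307, 37_918_896, 63_676, 108)
bits = bit_score(1295, 0.267, 0.0410)   # top hit: raw score 1295
print(space, round(bits, 1), expect(503, space))
```

Because lambda and K are printed to only three significant figures, the recomputed bit score and E-value agree with the reported `503 bits` and `e-142` only up to rounding.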

