Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC144765.7 - phase: 0 
         (482 letters)

Database: uniref100 
           2,790,947 sequences; 848,049,833 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

UniRef100_Q60DN3 Integrase core domain containing protein [Oryza...   225  3e-57
UniRef100_O24438 Retrofit [Oryza longistaminata]                      214  4e-54
UniRef100_Q6ATL7 Putative polyprotein [Oryza sativa]                  214  6e-54
UniRef100_Q65X82 Putative polyprotein [Oryza sativa]                  211  4e-53
UniRef100_Q8RZ67 Putative rice retrotransposon retrofit gag/pol ...   206  1e-51
UniRef100_Q94LQ7 Putative gag-pol polyprotein [Oryza sativa]          204  5e-51
UniRef100_Q688S3 Putative polyprotein [Oryza sativa]                  203  8e-51
UniRef100_Q75G45 Putative polyprotein [Oryza sativa]                  201  5e-50
UniRef100_Q7G7H3 Putative gag-pol protein [Oryza sativa]              199  1e-49
UniRef100_Q94DD5 Putative gag/pol polyprotein [Oryza sativa]          198  3e-49
UniRef100_Q7XKV9 OSJNBa0073E02.10 protein [Oryza sativa]              187  8e-46
UniRef100_Q9SA17 F28K20.17 protein [Arabidopsis thaliana]             186  1e-45
UniRef100_Q6F356 Putative polyprotein [Oryza sativa]                  186  2e-45
UniRef100_Q94H72 Putative gag-pol protein [Oryza sativa]              182  2e-44
UniRef100_Q9SSB1 T18A20.5 protein [Arabidopsis thaliana]              175  3e-42
UniRef100_Q8W0X9 Putative copia-like retrotransposon Hopscotch p...   174  5e-42
UniRef100_Q94IU9 Copia-like polyprotein [Arabidopsis thaliana]        174  7e-42
UniRef100_Q8H7X8 Putative gag-pol polyprotein [Oryza sativa]          172  1e-41
UniRef100_Q9SLL4 Putative retroelement pol polyprotein [Arabidop...   172  3e-41
UniRef100_Q9SV56 Hypothetical protein AT4g28900 [Arabidopsis tha...   171  6e-41

>UniRef100_Q60DN3 Integrase core domain containing protein [Oryza sativa]
          Length = 1021

 Score =  225 bits (573), Expect = 3e-57
 Identities = 127/397 (31%), Positives = 218/397 (53%), Gaps = 32/397 (8%)

Query: 15  SVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSS---KSNNPAFEE 71
           +V+ KL R N+ LWK+ +LPV+RG +++GY+ G  + P   I + D     K++ PAFE 
Sbjct: 16  AVAEKLTRTNFLLWKAQILPVIRGARMEGYLTGATQAPLAVIDAKDGEATVKASKPAFEM 75

Query: 72  WQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIR 131
           W   DQ++LG++L++++ E+ TQ++  E++ Q+W     +  + +R++ +  +    +  
Sbjct: 76  WITADQQVLGFLLSTLSKEILTQVISMESAAQVWKAITEMLSSQSRARALNTRLALATTL 135

Query: 132 KGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL-SDHTTLSW 190
           KG++ + DY+ KMK LAD++  AG P+ + +LI   L GLD +Y P+V  L      +S 
Sbjct: 136 KGDLSVSDYISKMKVLADEMAFAGKPLDDEELISYVLAGLDDDYEPVVSSLVGKSEVVSL 195

Query: 191 VDLQAQLLTFESRIEQLNNLTN--LNLNATANVANKFD-----HRDNRFNSNNNWRGSNF 243
            +  +QLL+F+SR +  +   +   ++NA      K       +R  R   NNN R SN 
Sbjct: 196 AECYSQLLSFKSRQKLRHAAAHQASSVNAARRGGGKGGGYTPFNRGGRSGGNNNGRRSNN 255

Query: 244 RGWRGGRG-----RGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNA 298
            G RGGRG     RG      C +CGKT H   +C++R+D+N+   N  A +   G    
Sbjct: 256 GGGRGGRGNNNGDRGGKPHPVCHLCGKTGHVVADCWYRYDENFVPENKIAAAASYG---- 311

Query: 299 FIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV----ATC 354
                    D +WY D+GA++H+T + +K     +++GK+ +   +G  ++I     +  
Sbjct: 312 --------VDTNWYVDTGATDHITGELDKLTTREKYNGKDQIYTASGAGMDIKHIGHSVI 363

Query: 355 SSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
            +  +++ L ++L+VP   KNLL   +LA DN+ FVE
Sbjct: 364 CTPTRNIYLKNILHVPKAKKNLLFAHRLALDNHAFVE 400


>UniRef100_O24438 Retrofit [Oryza longistaminata]
          Length = 1445

 Score =  214 bits (545), Expect = 4e-54
 Identities = 132/403 (32%), Positives = 220/403 (53%), Gaps = 43/403 (10%)

Query: 15  SVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN---NPAFEE 71
           SVS KL + N+ LWK+ V   VRG +L GY+ G  K P+  ++ +   K+    NPAFE+
Sbjct: 20  SVSEKLGKANHALWKAQVSAAVRGARLLGYLNGDIKAPDAELSVTIDGKTTTKPNPAFED 79

Query: 72  WQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIR 131
           W+ANDQ +LG++L+S++ ++  Q+  C+T+ + W   ++L    TR++ +  +    + +
Sbjct: 80  WEANDQLVLGYLLSSLSRDVLIQVATCKTAAEAWRSIEALYSTGTRARAVNTRLALTNTK 139

Query: 132 KGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL---SDHTTL 188
           KG MK+ +Y+ KM+ L D++   G+P+   DL+   + GL+ +++PIV  L   SD  T+
Sbjct: 140 KGTMKIAEYVAKMRALGDEMAAGGHPLDEEDLVQYIIAGLNEDFSPIVSNLCNKSDPITV 199

Query: 189 SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNN----------- 237
              +L +QL+ FE+ ++      +      A VAN+          NNN           
Sbjct: 200 G--ELYSQLVNFETLLDLYR---STGQGGAAFVANRGRGGGGGGRGNNNNSGGGGGRSAP 254

Query: 238 -WRGSNFRGWRGGRGR---GRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQ 293
             RGS  +G RGGRGR   G+  +  CQVC K  HTA +C++RFD++Y            
Sbjct: 255 GGRGSGSQG-RGGRGRGTGGQDRRPTCQVCFKRGHTAADCWYRFDEDY-----------V 302

Query: 294 GSHNAFIASQNSVE-DYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV- 351
                  A+ NS   D +WY D+GA++H+T +  K     +++G   +   +G  ++I  
Sbjct: 303 ADEKLVAAATNSYGIDTNWYIDTGATDHITGELEKLTTKEKYNGGEQIHTASGAGMDISH 362

Query: 352 ---ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
                  +  ++++L++VLYVP   KNL+S S+LAADN+ F+E
Sbjct: 363 IGHTIVHTPSRNIHLNNVLYVPQAKKNLISASQLAADNSAFLE 405


>UniRef100_Q6ATL7 Putative polyprotein [Oryza sativa]
          Length = 1437

 Score =  214 bits (544), Expect = 6e-54
 Identities = 127/416 (30%), Positives = 219/416 (52%), Gaps = 38/416 (9%)

Query: 1   MASAANNNKND--LPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITS 58
           MAS++ NN  +  +   VS KL ++N+ +WK+ +L  +RG +L+G++ G  + P   +  
Sbjct: 1   MASSSKNNTGNPLVGQPVSEKLGKSNHAVWKAQILATIRGARLEGHLTGDDQPPAPILRR 60

Query: 59  SDSSKS---NNPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAH 115
            +  K    +NP +EEW A DQ++L ++L+SM  ++  Q+  C T+   W   Q + G+ 
Sbjct: 61  KEGEKEVVVSNPEYEEWVATDQQVLAYLLSSMTKDLLVQVATCRTAASAWSMIQGMFGSM 120

Query: 116 TRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEY 175
           TR++ I  +    +++KG+M +  Y+ KM+ LAD L   G P+ + +LI     GLD E+
Sbjct: 121 TRARTINTRLSLSTLQKGDMNITTYVGKMRALADDLMAVGKPVDDDELIGYIFAGLDDEF 180

Query: 176 NPI---VVKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRF 232
            P+   +V   D  T+   +  AQL++FE R+    +    ++N+ +    +     +R 
Sbjct: 181 EPVISTIVGRPDPVTIG--ETYAQLISFEQRLAHRRSGDQSSVNSASRSRGQPQRGGSRS 238

Query: 233 NSNNN-WRGSNFRGWRGGRGRGRSS------------KAPCQVCGKTNHTAINCFHRFDK 279
             ++N  RG+   G   GRGRG  S            +  CQ+C K  HT  +C++R+D+
Sbjct: 239 GGDSNRGRGAPSNGANRGRGRGNPSGGRANVGGGTDNRPKCQLCYKRGHTVCDCWYRYDE 298

Query: 280 NYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNS 339
           N+      A +           + +   D +WY D+GA++HVT + +K     ++HG + 
Sbjct: 299 NFVPDERFAGT-----------AVSYGVDTNWYLDTGATDHVTGELDKLTVRDKYHGNDQ 347

Query: 340 LVVGNGDKLEIVATCSSKLK----SLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
           +   +G  +EI    +S +K    +L+L DVLYVP   KNL+S  KL +DN  F+E
Sbjct: 348 VHTASGAGMEISHIGNSVVKTPSRNLHLKDVLYVPKANKNLVSAYKLTSDNLAFIE 403


>UniRef100_Q65X82 Putative polyprotein [Oryza sativa]
          Length = 1447

 Score =  211 bits (537), Expect = 4e-53
 Identities = 116/400 (29%), Positives = 212/400 (53%), Gaps = 34/400 (8%)

Query: 14  SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN------NP 67
           +++S KL ++N+ LWK+ V+  VRG +L+G++ G  K P   IT++   K        NP
Sbjct: 16  NAISEKLSKSNHALWKAQVMAAVRGARLEGHLTGATKTPNALITTTAGDKGEKEVTVRNP 75

Query: 68  AFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEF 127
            F++W A DQ++LG++L+++A ++  Q+  C T+   W   + +  + TR++ I  +   
Sbjct: 76  EFDDWVATDQQVLGFLLSTLARDVLAQVATCGTAAAAWQMLEEMYSSVTRARFINTRIAL 135

Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSDHTT 187
            + +KG + + +Y+ KMK LAD++  AG  + + DLI   + GLD  Y P++  +    T
Sbjct: 136 SNTKKGTLSINEYVSKMKALADEMTAAGKIVDDDDLISYIIAGLDDTYEPVISTIVGKDT 195

Query: 188 LSWVDLQAQLLTFESRIE-QLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNFRGW 246
           ++  +  +QLL+FE R+  +    +++NL            R     +    RG N  G 
Sbjct: 196 MTLGEAYSQLLSFEQRLALRHGGDSSVNLANRGRGGGGGQQRGGNTGNGGRGRGGNNNGA 255

Query: 247 RGGRGRGRS----------SKAPCQVCGKTNHTAINCFHRFDKNY-SRSNYSADSDKQGS 295
             GRGRG +          ++  CQ+C K  HT INC++R+D+++     Y+  +   G 
Sbjct: 256 NRGRGRGNNGGARPPGGVDNRPKCQLCYKRGHTVINCWYRYDEDFVPDEKYAGSATSYGI 315

Query: 296 HNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV---- 351
                       D +WY D+ A++HVT + +K      + G++ +   +G  +EI     
Sbjct: 316 ------------DTNWYVDTSATDHVTGELDKLTVRDRYKGQDQVHTASGAGMEISHIGH 363

Query: 352 ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
           +T  +  + ++L ++LYVPN  KNL+S ++L +DN+ ++E
Sbjct: 364 STVRTPNRDIHLRNILYVPNANKNLVSANRLVSDNSAYME 403


>UniRef100_Q8RZ67 Putative rice retrotransposon retrofit gag/pol polyprotein [Oryza
           sativa]
          Length = 1448

 Score =  206 bits (524), Expect = 1e-51
 Identities = 128/404 (31%), Positives = 214/404 (52%), Gaps = 41/404 (10%)

Query: 15  SVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN---NPAFEE 71
           SVS KL + N+ LWK+ V   V G +L GY+ G  K P   I+ +   K+    NPAFE+
Sbjct: 20  SVSEKLGKANHALWKAQVSAAVHGARLLGYLNGDIKAPNAEISVTIDGKTTTKPNPAFED 79

Query: 72  WQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIR 131
           W+ANDQ +LG++L+S++ ++  Q+  C+T+ + W   ++L    TR++ +  +    + +
Sbjct: 80  WEANDQLVLGYLLSSLSRDVLIQVATCKTAAEAWRNIEALYSTGTRARAVNTRLALTNTK 139

Query: 132 KGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL---SDHTTL 188
           KG MK+ +Y+ KM+ L D++   G P+    L+   + GL+ +++PIV  L   SD  T+
Sbjct: 140 KGTMKIAEYVAKMRALCDEMAAGGRPLDEEGLVQYIIAGLNEDFSPIVSNLCNKSDPITV 199

Query: 189 SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNN----------- 237
              +L +QL+ FE+ ++   +         AN          R N+NN+           
Sbjct: 200 G--ELYSQLVNFETLLDLYRSTGQGGAAFVANRGRGGGGGGGRGNNNNSDGGGGGGGRGA 257

Query: 238 --WRGSNFRGWRGGRGR---GRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDK 292
              RG   +G RGG GR   G+  +  CQVC K  HTA +C++RFD++Y           
Sbjct: 258 PRGRGGGGQG-RGGHGRGTGGQDRRPTCQVCFKRGHTAADCWYRFDEDY----------- 305

Query: 293 QGSHNAFIASQNSVE-DYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV 351
                   A+ NS   D +WY D+GA++H+T +  K     +++G   +   +G  ++I 
Sbjct: 306 VADEKLVAAATNSYGIDTNWYIDTGATDHITGELEKLTTKEKYNGGEQIHTASGAGMDIS 365

Query: 352 ----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
                   +  ++++L++VLYVP   KNL+S S+LAADN+ F+E
Sbjct: 366 HIGHTIVHTPSRNIHLNNVLYVPQAKKNLISASQLAADNSAFLE 409


>UniRef100_Q94LQ7 Putative gag-pol polyprotein [Oryza sativa]
          Length = 1031

 Score =  204 bits (519), Expect = 5e-51
 Identities = 129/417 (30%), Positives = 213/417 (50%), Gaps = 42/417 (10%)

Query: 1   MASAANNNKNDLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSD 60
           MA+A +N    L   +S KL + N+PLW + +L  +RG +L+ +++ T   P   I   D
Sbjct: 6   MAAAISNPLFGL--QISEKLTKQNHPLWAAQILTTLRGAQLEEHIVSTTAAPAAEIEKED 63

Query: 61  SSKSN-------NPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAG 113
             K         NP ++ W   DQ++LG++ +S++ E+  Q+    T+ Q W+    +  
Sbjct: 64  GDKDKKTKIVIPNPEYKTWFVQDQQVLGFIFSSLSREVLQQVAGARTAAQAWNMIDDMFS 123

Query: 114 AHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDS 173
             +++  I +     + +KG M + +Y+ KM++LAD++   G P+   +L+   +NGLDS
Sbjct: 124 CKSKAGTINVLLALTTTQKGPMSISEYIAKMRSLADEMAATGKPLDEEELVAYIINGLDS 183

Query: 174 EYNPIVVKLSDHTTLSWVDLQ---AQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDN 230
           E++  V  L     ++ V +    +QLL++E+RI     +    L  +AN AN+   R  
Sbjct: 184 EFDAAVEGLMATARIAPVSISHVYSQLLSYENRI----RIRQAYLTTSANAANRGGGRGG 239

Query: 231 RFNSNNN---WRGSNFRGWRG---------GRGRGRSSKAPCQVCGKTNHTAINCFHRFD 278
           R +S  N    RG   RG RG         GRGRG  ++  CQVC K  H A +C+HR+D
Sbjct: 240 RGSSTGNRGGRRGGFGRGGRGRGAPSGASQGRGRGNDTRPVCQVCHKRGHVASDCWHRYD 299

Query: 279 KNYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKN 338
            +Y         +K G      A+     D +WY D+GA++H+T Q +K      + G +
Sbjct: 300 DSY------VPDEKLGG----AATYAYGVDTNWYVDTGATDHITGQLDKLTTKERYKGTD 349

Query: 339 SLVVGNGDKLEIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
            +   +G+ + I     A   +  + L+L +VL+VP   KNL+SV KL ADN  F+E
Sbjct: 350 QIHTASGEGMSIKHVGHAIVPTPSRPLHLKNVLHVPEAAKNLVSVHKLVADNYAFLE 406


>UniRef100_Q688S3 Putative polyprotein [Oryza sativa]
          Length = 1210

 Score =  203 bits (517), Expect = 8e-51
 Identities = 128/418 (30%), Positives = 217/418 (51%), Gaps = 46/418 (11%)

Query: 1   MASAANNNKNDL-PSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSS 59
           MAS++    N L   ++S KL +NN+ LWK+ +LPV+ G +++GY+ G  + P   I   
Sbjct: 1   MASSSGAAVNPLFGQAISEKLTKNNFSLWKTHILPVICGARMEGYLTGATQVPSAEIEVK 60

Query: 60  DSSKS------NNPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAG 113
           +  K       +NPA+E W A DQ++LG++L+S++ E+  Q+ + +T+   W     L  
Sbjct: 61  EGEKGEITKKVSNPAYEAWIAADQQVLGFLLSSISKEILIQVANVDTAAHAWKMIVGLLS 120

Query: 114 AHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDS 173
             +R++ +  +    + +KGE  + DY+ KMK LAD++  AG P+ + +     L GLDS
Sbjct: 121 TQSRARALNTRIALATTQKGESSVSDYISKMKTLADEMASAGKPLDDEEFTSYILAGLDS 180

Query: 174 EYNPIV---VKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDN 230
           +Y  +V   V  S+  T+S  ++ +QLL+FE   +     TN +  + ++V +    R+N
Sbjct: 181 DYEQVVSSIVGRSEGVTIS--EVYSQLLSFEELWQ-----TNGSSGSYSSVNSANHGRNN 233

Query: 231 RFNSNNNWRGSNF--------RGWRGGRGRGRSS-----KAPCQVCGKTNHTAINCFHRF 277
               N N  G  F        RG RGG GRG        +  CQ+C K  H   +C+H +
Sbjct: 234 GGGGNFNNGGGYFSNRGRGGGRGDRGGCGRGNGGRNFKPRPTCQLCSKVGHVVADCWHCY 293

Query: 278 DKNYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGK 337
           D ++      A +   G             D +WY D+GA++H+T++  K      ++GK
Sbjct: 294 DDSFVPDARVAAAASYG------------VDSNWYVDTGATDHITNELEKLTTRDRYNGK 341

Query: 338 NSLVVGNGDKLEI----VATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
             +   +G  ++I     +T  +  ++L L ++L+VP   KNL+S  +LA DN+ FVE
Sbjct: 342 EQIHTASGSGMDIKHIGQSTIRTPTRNLYLRNILHVPRTKKNLISAHRLAVDNHAFVE 399


>UniRef100_Q75G45 Putative polyprotein [Oryza sativa]
          Length = 1431

 Score =  201 bits (510), Expect = 5e-50
 Identities = 132/417 (31%), Positives = 208/417 (49%), Gaps = 51/417 (12%)

Query: 1   MASAANNNKNDLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSD 60
           MA+A +N    L   +S KL + N+PLW + +L  +RG +L+ +++ T   P   I   D
Sbjct: 6   MAAAISNPLFGL--QISDKLTKQNHPLWAAQILTTLRGAQLEEHIVSTTAAPAAEIEKED 63

Query: 61  SSKSN-------NPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAG 113
             K         NP ++ W   DQ++LG++ +S++ E+  Q+    T+ Q W+    +  
Sbjct: 64  GDKDKKTKIVIPNPEYKTWFVQDQQVLGFIFSSLSREVLQQVAGARTAAQAWNMIDDMFS 123

Query: 114 AHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDS 173
                     KS+  +I KG M M +Y+ KM++LADK+   G P+   +L+   +NGLDS
Sbjct: 124 C---------KSKAGTINKGPMSMSEYIAKMRSLADKMAATGKPLDEEELVAYIINGLDS 174

Query: 174 EYNPIVVKLSDHTTLSWVDLQ---AQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDN 230
           E++  V  L     ++ V +    +QLL++E+RI     +    L  +AN AN+   R  
Sbjct: 175 EFDAAVEGLMATARIAPVSISHVYSQLLSYENRI----RIRQAYLTTSANAANRGGGRGG 230

Query: 231 RFNSNNN---WRGSNFRGWRG---------GRGRGRSSKAPCQVCGKTNHTAINCFHRFD 278
           R +S  N    RG   RG  G         GRGRG  ++  CQVC K  H A +C+HR+D
Sbjct: 231 RGSSTGNRGGGRGGFGRGGHGRGAPSGASQGRGRGNDTRPVCQVCHKRGHVASDCWHRYD 290

Query: 279 KNYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKN 338
            +Y         +K G      A+     D +WY D+GA++H+T Q +K      + G +
Sbjct: 291 DSY------VPDEKLGG----AATYAYGVDTNWYVDTGATDHITGQLDKLTTKERYKGTD 340

Query: 339 SLVVGNGDKLEIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
            +   +G+   I     A   +    L+L +VL+VP   KNL+SV KL ADN  F+E
Sbjct: 341 QIHTASGEGTSIKHVGHAIVPTPSHPLHLKNVLHVPEAAKNLVSVHKLVADNYAFLE 397


>UniRef100_Q7G7H3 Putative gag-pol protein [Oryza sativa]
          Length = 1219

 Score =  199 bits (506), Expect = 1e-49
 Identities = 118/397 (29%), Positives = 207/397 (51%), Gaps = 38/397 (9%)

Query: 16  VSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN------NPAF 69
           VS KL + N+ LW + VL  +RG +L+G++LGT   PE  +   +  K        NPA+
Sbjct: 19  VSEKLTKQNHSLWSAQVLTALRGARLEGHVLGTSVPPEAELEQKEGEKGEKTVRVPNPAY 78

Query: 70  EEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHS 129
            EW A DQ++LG++ +S+  E+ +Q+    T+   W   ++     + +  I ++    +
Sbjct: 79  GEWFATDQQVLGFLFSSLTREIRSQVAGAPTAAAAWKTIENTFSTRSHAGAINVRLALTT 138

Query: 130 IRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL-SDHTTL 188
            +KG+  + +Y+ KM+ L D++   G PI + +L+   +NGLDSE++P+V  L + + ++
Sbjct: 139 TQKGQSTVTEYVSKMRALGDEIAATGKPIDDEELVAYIINGLDSEFDPVVEALIAKNASV 198

Query: 189 SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWR-------GS 241
           +  ++ +QLL FE+R++       +     A   N+      R   N   R       GS
Sbjct: 199 TVAEVYSQLLGFENRVK-------IRTACAATSGNRGSGNQGRGGGNPRGRGTGRGGGGS 251

Query: 242 NFRGWRGGRGRGR---SSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNA 298
             RG   GRG GR    ++  CQVC K  H   +C+HR+D+NY         +K G    
Sbjct: 252 GGRGGGHGRGNGRGGTDNRPTCQVCHKKGHVVADCWHRYDENY------VPDEKLGG--- 302

Query: 299 FIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV----ATC 354
             A+     D +WY D+ A++H+T Q +K     ++ G + +   +G+ ++I     +  
Sbjct: 303 -AATHAYGVDTNWYVDTEATDHITGQLDKLTTREKYKGTDQIHTASGEGMDIQHIGHSYV 361

Query: 355 SSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
            +  + L+L ++L+VP  +KNL+SV +L ADN  F+E
Sbjct: 362 PTSSRPLHLKNILHVPKASKNLISVHRLVADNYAFLE 398


>UniRef100_Q94DD5 Putative gag/pol polyprotein [Oryza sativa]
          Length = 1449

 Score =  198 bits (503), Expect = 3e-49
 Identities = 125/410 (30%), Positives = 219/410 (52%), Gaps = 47/410 (11%)

Query: 14  SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFIT------SSDSSKSNNP 67
           +++S KL +NNY LWK+ VL  VRG +L+G++ GT   P   I+         ++++ NP
Sbjct: 10  NTISEKLAKNNYALWKAQVLASVRGARLEGHLTGTTAAPAITISVPGEKEGDKATRAANP 69

Query: 68  AFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEF 127
           A++EW A DQ++LG +L++++ ++  Q+  C T+   W   + +  + TR++ I  +   
Sbjct: 70  AYDEWVATDQQILGLLLSTLSKDVLAQVATCGTAAAAWSMLEEMYTSMTRARFINTRIAL 129

Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL-SDHT 186
            + +KG++ + +Y+ KM+ L D +  AG  + + DLI   + GLD  Y P++  +     
Sbjct: 130 SNTKKGDLSITEYVAKMRALGDDMTAAGKVVDDEDLISYIIAGLDDTYEPVISSIVGKSE 189

Query: 187 TLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANK---------------FDHRDNR 231
            +S+ +  +QLL+FE R    NNL +    ++AN+AN+                ++R   
Sbjct: 190 PMSFGEAFSQLLSFEQR----NNLRH-GGESSANLANRGRGTTGGNGGQRGRGGNNRGRG 244

Query: 232 FNSNNNW--RGSNFR---GWRGGR-GRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSN 285
            N  NN   RG   R   G+ GGR G G  ++  CQ+C K  HT INC++R+D+++    
Sbjct: 245 GNGGNNSANRGKGGRGNGGFNGGRQGGGVDTRPKCQLCYKRGHTVINCWYRYDEDFVPDE 304

Query: 286 YSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNG 345
             A S          A+ +   D +WY D+GA++HVT +  K      + G + +   +G
Sbjct: 305 KYAGS----------AATSYGIDTNWYVDTGATDHVTGELEKLIVRDRYKGHDQVHTASG 354

Query: 346 DKLEIVATCSSKLKS----LNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
             +EI     S +K+    ++L ++L+VP   KNL+S  +L +DN+ F+E
Sbjct: 355 AGMEISHIGHSIVKTPSRDIHLRNILHVPKANKNLVSAQRLVSDNSAFME 404


>UniRef100_Q7XKV9 OSJNBa0073E02.10 protein [Oryza sativa]
          Length = 1131

 Score =  187 bits (474), Expect = 8e-46
 Identities = 128/409 (31%), Positives = 205/409 (49%), Gaps = 33/409 (8%)

Query: 1   MASAANNNKND---LPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFIT 57
           MAS++++  +    L   VS KL R+N+ +W++ VLP VRG +L GY+ GTK+ P   IT
Sbjct: 1   MASSSSSTLSSSAVLGHPVSEKLSRDNFLVWRAQVLPAVRGAQLTGYLDGTKEVPSPEIT 60

Query: 58  SSDSSKSNN----PAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAG 113
                  +       +    +   R   W          + +L       L  +   +  
Sbjct: 61  VEKKPIHSGLRMINKYSGIYSRRFREKFWC--------KSHILRVPDKSGL--QFNEMFS 110

Query: 114 AHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDS 173
           + ++++II ++++     KG+     Y  KMK LAD++  AG  + + D++   L GLD+
Sbjct: 111 SQSKARIIQIRAQLARELKGDSSAAAYFTKMKGLADEMAAAGKKLDDDDIVSYILGGLDA 170

Query: 174 EYNPIVVKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDH-RDNRF 232
           +YNP+V  +S    +S  DL AQLL+FE+    LNN +    +++AN A++    R    
Sbjct: 171 DYNPLVASVSSKDYISLSDLYAQLLSFEA---HLNNQSEGGYHSSANSASRGGRGRGQGR 227

Query: 233 NSNNNWRGSNF------RGWRGGRGRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNY 286
                  GSNF      RG RGGRGRG  S+  CQ+CGK  HT   C+ RFD+++S ++ 
Sbjct: 228 GRGRGGFGSNFGSGFGGRG-RGGRGRGDGSRPSCQLCGKEGHTVHTCWKRFDRSFSGNDV 286

Query: 287 SADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGD 346
                 Q + +A   S   V D +WY D+ A++H+T +  K      +HG   +   N  
Sbjct: 287 IFQQHHQQAKSASAVSSYGV-DTNWYLDTAATDHITGELKKLTTKERYHGNEQVHAANSA 345

Query: 347 KLEIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
            + I         +  ++L L++VLY+P   KNL+S  +LA DN+ FVE
Sbjct: 346 GMSISHIGRTIFHTPNRNLALNNVLYIPKAKKNLVSAHRLAYDNHAFVE 394


>UniRef100_Q9SA17 F28K20.17 protein [Arabidopsis thaliana]
          Length = 1415

 Score =  186 bits (473), Expect = 1e-45
 Identities = 129/411 (31%), Positives = 202/411 (48%), Gaps = 44/411 (10%)

Query: 14  SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEE---FITSSDSSKSNNPAFE 70
           SSV++KL  +NY LWK+    ++   KL G++ G    P +    +    +S+  NP +E
Sbjct: 15  SSVTLKLTDSNYLLWKTQFESLLSSQKLIGFVNGAVNAPSQSRLVVNGEVTSEEPNPLYE 74

Query: 71  EWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQI---IYLKSEF 127
            W   DQ +  W+  +++ E+   + +  TS+Q+W    SLA    +S +     L+   
Sbjct: 75  SWFCTDQLVRSWLFGTLSEEVLGHVHNLSTSRQIW---VSLAENFNKSSVAREFSLRQNL 131

Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVV----KLS 183
             + K E     Y  + K + D L   G P+  S  I   LNGL  +Y+PI       LS
Sbjct: 132 QLLSKKEKPFSVYCREFKTICDALSSIGKPVDESMKIFGFLNGLGRDYDPITTVIQSSLS 191

Query: 184 DHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRG--- 240
              T ++ D+ +++  F+S+++      ++  +   N+  + +    ++N N   RG   
Sbjct: 192 KLPTPTFNDVVSEVQGFDSKLQSYEEAASVTPHLAFNI-ERSESGSPQYNPNQKGRGRSG 250

Query: 241 -SNFRGWRGGRGRGRSS----------KAPCQVCGKTNHTAINCFHRFDKNYSRSNYSAD 289
            +  RG    RGRG S           +  CQ+CG+T HTA+ C++RFD NY        
Sbjct: 251 QNKGRGGYSTRGRGFSQHQSSPQVSGPRPVCQICGRTGHTALKCYNRFDNNY-------- 302

Query: 290 SDKQGSHNAFIASQNSVED-YDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKL 348
              Q    AF   + S +   +W+ DS A+ HVT  TN  Q  TE+ G ++++VG+G  L
Sbjct: 303 ---QAEIQAFSTLRVSDDTGKEWHPDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTYL 359

Query: 349 EIVATCSSKLKSLN----LDDVLYVPNITKNLLSVSKLAADNNIFVEFDKN 395
            I  T S+ +KS N    L++VL VPNI K+LLSVSKL  D    V FD N
Sbjct: 360 PITHTGSTTIKSSNGKIPLNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDAN 410


>UniRef100_Q6F356 Putative polyprotein [Oryza sativa]
          Length = 1256

 Score =  186 bits (471), Expect = 2e-45
 Identities = 112/369 (30%), Positives = 194/369 (52%), Gaps = 45/369 (12%)

Query: 1   MASAANNNKNDLPSSVSV--KLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITS 58
           MAS++ ++ +     ++V  KL + NY +WK  VL V+RG +LD Y+ G  K P   I  
Sbjct: 1   MASSSQSSASGSLGGITVTEKLSKGNYLIWKVQVLAVIRGARLDSYLTGATKKPSATIII 60

Query: 59  SDSSKS---NNPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAH 115
             + K    +NPA +EW ANDQ++LG++L +M+ ++ +Q+  C ++  LW   + +  + 
Sbjct: 61  KKNEKEVEVSNPAVDEWIANDQQVLGYLLTTMSRDVLSQVATCSSAASLWSTIEGMFSSA 120

Query: 116 TRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEY 175
           TR++ I  K    + +KG++ + +Y+ KM+ LAD+L  +G P+   DLI   + GLD ++
Sbjct: 121 TRARSINTKIALTNTKKGDLGIAEYVSKMRVLADELATSGKPVDEEDLISYIIAGLDEDF 180

Query: 176 NPIV---VKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRF 232
            PI+   V  S+H +L   +  +QLL+FE R++       +    +AN+AN+   R N  
Sbjct: 181 EPIISSLVSKSEHVSLG--EAYSQLLSFEQRMK-------MRQEHSANLANRGRGRGNPG 231

Query: 233 NSNNNWRGSNFRGWRGG--RGRGRSS-------------KAPCQVCGKTNHTAINCFHRF 277
              NN +    +   GG  RGRGR +             +  CQ+C K  HT I+C++R+
Sbjct: 232 RGRNNKQPQQQQRGHGGNSRGRGRGNNSNQRQGGNGVDYRPKCQLCYKRGHTVIDCWYRY 291

Query: 278 DKNY-SRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHG 336
           D+++     Y+  +   G             D +WY D+G ++HVT +  K     ++ G
Sbjct: 292 DEDFVPDEKYAGTTASYG------------VDSNWYVDTGTTDHVTGELEKLTIRDKYKG 339

Query: 337 KNSLVVGNG 345
           ++ +   NG
Sbjct: 340 QDQVQTANG 348


>UniRef100_Q94H72 Putative gag-pol protein [Oryza sativa]
          Length = 535

 Score =  182 bits (462), Expect = 2e-44
 Identities = 112/394 (28%), Positives = 205/394 (51%), Gaps = 35/394 (8%)

Query: 19  KLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKS-----NNPAFEEWQ 73
           K  + N+ LW + +L  +RG +L+GY+ GT + P       D  K      +NP + +W 
Sbjct: 134 KFTKQNHSLWSAQILTTLRGAQLEGYITGTAEAPAAECEKEDGDKKVKTTISNPEYIKWF 193

Query: 74  ANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIRKG 133
             DQ++LG++ +S++ E+  Q+   +T+ Q W     +    +++  I +     + +KG
Sbjct: 194 TQDQQVLGFLFSSLSREVLQQVAGAKTAAQAWSMINDMFTCKSKAGAINVLLALTTTQKG 253

Query: 134 EMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL---SDHTTLSW 190
            + + +Y+ KM++L D++  AG P+ + +LI   +NGL+S+++  V  L   +    LS 
Sbjct: 254 PISISEYIAKMRSLGDEMAGAGKPLDDEELIAYIINGLNSDFDATVEGLMATARIAPLSI 313

Query: 191 VDLQAQLLTFESRIEQLNNLTNLNLNATANVAN------KFDHRDNRFNSNNNWRG---S 241
             + +QLL++E+RI     +    L  +AN AN      +  +R  R ++ +  RG    
Sbjct: 314 SHVYSQLLSYENRI----RIRQAYLTTSANAANRGGRGGRGGNRGGRSSAPHGGRGGGRG 369

Query: 242 NFRGWRGGRGRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAFIA 301
           N  G   GRGRG  ++  CQVC K  H A +C+HR+D+NY         +K G      A
Sbjct: 370 NTGGANPGRGRGNDTRPVCQVCHKRGHVASDCWHRYDENY------GPDEKLGG----AA 419

Query: 302 SQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIVATCSSKLKS- 360
           +     D +WY D+ A++H+T Q +K     ++ G + +   +G+ + +     + + + 
Sbjct: 420 TYAYGVDTNWYVDTRATDHITGQLDKLTTREKYKGTDLIHTVSGEGMNVKHIVHTIVPTP 479

Query: 361 ---LNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
              L+L ++L+VP  +KNL+ V +L ADN  F++
Sbjct: 480 SCPLHLKNILHVPQASKNLVFVHRLVADNYAFLD 513


>UniRef100_Q9SSB1 T18A20.5 protein [Arabidopsis thaliana]
          Length = 1522

 Score =  175 bits (443), Expect = 3e-42
 Identities = 121/413 (29%), Positives = 200/413 (48%), Gaps = 40/413 (9%)

Query: 11  DLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEF--ITSSD-SSKSNNP 67
           ++ + V+V L++ NY LWKS     + G  L G++ G+   P +   +T ++ +S+  NP
Sbjct: 10  NISNCVTVTLNQQNYILWKSQFESFLSGQGLLGFVTGSISAPAQTRSVTHNNVTSEEPNP 69

Query: 68  AFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEF 127
            F  W   DQ +  W+L S A ++ + +++C TS Q+W    +     + S++  L+   
Sbjct: 70  EFYTWHQTDQVVKSWLLGSFAEDILSVVVNCFTSHQVWLTLANHFNRVSSSRLFELQRRL 129

Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSD--- 184
            ++ K +  ME +L  +K++ D+L   G+P+     I   LNGL  EY PI   + +   
Sbjct: 130 QTLEKKDNTMEVFLKDLKHICDQLASVGSPVPEKMKIFSALNGLGREYEPIKTTIENSVD 189

Query: 185 -HTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNF 243
            + +LS  ++ ++L  ++ R++       ++ +   NV     H D+ +  NNN RG   
Sbjct: 190 SNPSLSLDEVASKLRGYDDRLQSYVTEPTISPHVAFNVT----HSDSGYYHNNN-RGKGR 244

Query: 244 RGWRGG------RGRG-------------RSSKAPCQVCGKTNHTAINCFHRFDKNYSRS 284
                G      RGRG              +S   CQ+CGK  H A+ C+HRFD +Y   
Sbjct: 245 SNSGSGKSSFSTRGRGFHQQISPTSGSQAGNSGLVCQICGKAGHHALKCWHRFDNSYQHE 304

Query: 285 NYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGN 344
           +              I        ++W  DS AS HVT+  +  Q    +HG +S++V +
Sbjct: 305 DL-----PMALATMRITDVTDHHGHEWIPDSAASAHVTNNRHVLQQSQPYHGSDSIMVAD 359

Query: 345 GDKLEIVATCSSKLKS----LNLDDVLYVPNITKNLLSVSKLAADNNIFVEFD 393
           G+ L I  T S  + S    + L +VL  P+I K+LLSVSKL +D    VEFD
Sbjct: 360 GNFLPITHTGSGSIASSSGKIPLKEVLVCPDIVKSLLSVSKLTSDYPCSVEFD 412


>UniRef100_Q8W0X9 Putative copia-like retrotransposon Hopscotch polyprotein [Zea
           mays]
          Length = 1313

 Score =  174 bits (441), Expect = 5e-42
 Identities = 101/346 (29%), Positives = 171/346 (49%), Gaps = 44/346 (12%)

Query: 26  PLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSNNPAFEEWQANDQRLLGWMLN 85
           P  K+ V   +RG +L+GY+ G  K P+E    +   KS NPAFEEW+A DQ++L ++L+
Sbjct: 2   PCGKAQVRAAMRGARLEGYLTGATKMPDEETVDNKGKKSPNPAFEEWEAKDQQILSYLLS 61

Query: 86  SMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMK 145
           S++ E+  Q+   +T+ + W   +++  + TR++ + L+    + +KG M + +Y  KMK
Sbjct: 62  SISREVQIQVTSAKTAAEAWHSIEAMFASQTRARAVNLRLALSTTKKGSMTVAEYYTKMK 121

Query: 146 NLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSDHT-TLSWVDLQAQLLTFESRI 204
              D++  AG P+ + +++   L GL+ E+ P+V  L      +S  DL +QLL FE+++
Sbjct: 122 GYGDEMAAAGRPLQDEEMVEYILTGLEEEFLPMVSALVTRVDPISLEDLYSQLLNFETKL 181

Query: 205 EQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNFRGWRGGRGRGRSSKAP----- 259
           + +      +   +AN+A        R    N   GS  RG RGGRG  R  +A      
Sbjct: 182 DLMRGGGEQH-QGSANMA-------GRGGRGNQRGGSGGRGQRGGRGSSRGGRASWSGGR 233

Query: 260 --------------------CQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAF 299
                               CQVC K  HTA  C+HRF++++      A +     H   
Sbjct: 234 QSNQGGYIRRSNNSSDERPVCQVCFKKGHTAARCWHRFEEDFVPDEKLAGAATNSYH--- 290

Query: 300 IASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNG 345
                   D +WY D+GA++H+T +  K     ++ G + +   +G
Sbjct: 291 -------VDTNWYTDTGATDHITGELEKLSIREKYAGGDQIHTASG 329


>UniRef100_Q94IU9 Copia-like polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score =  174 bits (440), Expect = 7e-42
 Identities = 128/411 (31%), Positives = 205/411 (49%), Gaps = 42/411 (10%)

Query: 14  SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEE---FITSSDSSKSNNPAFE 70
           SSV++KL+ +NY LWK+    ++   KL G++ G    P +    +    +S+  NP +E
Sbjct: 15  SSVTLKLNDSNYLLWKTQFESLLSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEVPNPQYE 74

Query: 71  EWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQI---IYLKSEF 127
           +W   DQ +  W+  +++ E+   + +  TS+Q+W    SLA    +S I     L+   
Sbjct: 75  DWFCTDQLVRSWLFGTLSEEVLGHVHNLTTSRQIWI---SLAENFNKSSIAREFSLRRNL 131

Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVV----KLS 183
             + K +  +  Y    K + D L   G P+  S  I   LNGL  EY+PI       LS
Sbjct: 132 QLLTKKDKSLSVYCRDFKIICDSLSSIGKPVEESMKIFGFLNGLGREYDPITTVIQSSLS 191

Query: 184 DHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRG--- 240
                ++ D+ +++  F+S+++  ++  ++N +   N   + +    ++NSN+  RG   
Sbjct: 192 KLPAPTFNDVISEVQGFDSKLQSYDDTVSVNPHLAFNT-ERSNSGAPQYNSNSRGRGRSG 250

Query: 241 -SNFRGWRGGRGRGRS---SKAP-------CQVCGKTNHTAINCFHRFDKNYSRSNYSAD 289
            +  RG    RGRG S   S +P       CQ+CG+  HTAI C++RFD NY        
Sbjct: 251 QNRGRGGYSTRGRGFSQHQSASPSSGQRPVCQICGRIGHTAIKCYNRFDNNY-------- 302

Query: 290 SDKQGSHNAFIASQNSVE-DYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKL 348
              +    AF A + S E   +WY DS A+ H+T  T+  Q+ T + G ++++VG+G  L
Sbjct: 303 -QSEVPTQAFSALRVSDETGKEWYPDSAATAHITASTSGLQNATTYEGNDAVLVGDGTYL 361

Query: 349 EIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVEFDKN 395
            I      T SS   ++ L++VL  P I K+LLSVSKL  D    V FD N
Sbjct: 362 PITHVGSTTISSSKGTIPLNEVLVCPAIQKSLLSVSKLCDDYPCGVYFDAN 412


>UniRef100_Q8H7X8 Putative gag-pol polyprotein [Oryza sativa]
          Length = 1247

 Score =  172 bits (437), Expect = 1e-41
 Identities = 109/360 (30%), Positives = 188/360 (51%), Gaps = 43/360 (11%)

Query: 14  SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFIT------SSDSSKSNNP 67
           +++S KL +NNY LWK+ VL  VRG +L+G++ GT   P   I+         +++  NP
Sbjct: 10  NTISEKLAKNNYALWKAQVLASVRGARLEGHLTGTTAAPAITISVPGEKEGDKATRVANP 69

Query: 68  AFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEF 127
           A++EW A DQ++LG +L++++ ++  Q+  C T+   W   + +  + TR++ I  +   
Sbjct: 70  AYDEWVATDQQILGLLLSTLSRDVLAQVATCGTAATAWSMLEEMYTSMTRARFINTRIAL 129

Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL-SDHT 186
            + +KG++ + +Y+ KM+ L D +  AG  + N DLI   + GLD  Y P++  +     
Sbjct: 130 SNTKKGDLSITEYVAKMRALGDDMTAAGKVVDNEDLISYIIAGLDDTYEPVISSIVGKSE 189

Query: 187 TLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANK---------------FDHRDNR 231
            +S+ +  +QLL+F    EQ NNL +    ++AN+AN+                ++R   
Sbjct: 190 PMSFGEAFSQLLSF----EQCNNLRH-GGESSANLANRGCGTTGGNGGQRGRGGNNRGRG 244

Query: 232 FNSNNNW--RGSNFR---GWRGGR-GRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSN 285
            N  NN   RG   R   G+ GGR G G  ++  CQ+C K  HT INC++R+D+++    
Sbjct: 245 GNGGNNSANRGKGGRGNGGFNGGRQGGGVDTRPKCQLCYKRGHTVINCWYRYDEDFVPDE 304

Query: 286 YSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNG 345
             A S          A+ +   D +WY D+GA++HVT +  K      + G + +   +G
Sbjct: 305 KYAGS----------AATSYGIDTNWYVDTGATDHVTGELEKLIVRDHYKGHDQVHTASG 354


>UniRef100_Q9SLL4 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1402

 Score =  172 bits (435), Expect = 3e-41
 Identities = 116/416 (27%), Positives = 194/416 (45%), Gaps = 36/416 (8%)

Query: 11  DLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSD----SSKSNN 66
           ++ + V+V L   NY LWKS     + G  L G++ G+   P +    SD    +S S N
Sbjct: 10  NISNCVTVTLTAKNYILWKSQFESFLDGQGLLGFVTGSIPAPSQTSVVSDIDGSTSASPN 69

Query: 67  PAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSE 126
           P +  W   D+ +  W+L S   ++ + +++C TS ++W    +     + S++  L+  
Sbjct: 70  PEYYTWFKTDRVVKSWLLGSFLEDILSVVVNCNTSHEVWISVANHFNRVSSSRLFELQRR 129

Query: 127 FHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSDHT 186
             ++ K +  M++YL  +K + D+L   G+P++    I   LNGL  EY PI   + +  
Sbjct: 130 LQNVSKRDKSMDEYLKDLKTICDQLASVGSPVTEKMKIFAALNGLGREYEPIKTTIENSM 189

Query: 187 TL----SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFD-HRDNRFNSNNNWRGS 241
                 S  D+  +L  ++ R++     T ++ +   N+    D +    FN+ N  +G 
Sbjct: 190 DALPGPSLEDVIPKLTGYDDRLQGYLEETAVSPHVAFNITTSDDSNASGYFNAYNRGKGK 249

Query: 242 NFRGWRGGRGRGR------------------SSKAPCQVCGKTNHTAINCFHRFDKNYSR 283
           + RG      RGR                   +   CQ+CGK  H A+ C+HRF+     
Sbjct: 250 SNRGRNSFSTRGRGFHQQISSTNSSSGSQSGGTSVVCQICGKMGHPALKCWHRFN----- 304

Query: 284 SNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVG 343
           ++Y  +   +      I         +W  DS A+ HVT+     Q    +HG ++++V 
Sbjct: 305 NSYQYEELPRALAAMRITDITDQHGNEWLPDSAATAHVTNSPRSLQQSQPYHGSDAVMVA 364

Query: 344 NGDKLEIVATCSSKLKS----LNLDDVLYVPNITKNLLSVSKLAADNNIFVEFDKN 395
           +G+ L I  T S+ L S    + L DVL  P+ITK+LLSVSKL  D    VEFD +
Sbjct: 365 DGNFLPITHTGSTNLASSSGNVPLTDVLVCPSITKSLLSVSKLTQDYPCTVEFDSD 420


>UniRef100_Q9SV56 Hypothetical protein AT4g28900 [Arabidopsis thaliana]
          Length = 1415

 Score =  171 bits (432), Expect = 6e-41
 Identities = 114/392 (29%), Positives = 190/392 (48%), Gaps = 39/392 (9%)

Query: 16  VSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPE--EFITSSDS-SKSNNPAFEEW 72
           V++KL   NY LWK      +   +L G++ G   CP     I + D  +++ NP F  W
Sbjct: 17  VTLKLSTANYLLWKIQFETWLNNQRLLGFVTGANPCPNATRSIRNGDQVTEATNPDFLTW 76

Query: 73  QANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIRK 132
             NDQ+++GW+L S++ +    +    TS+++W          + S+   L+   + + K
Sbjct: 77  VQNDQKIMGWLLGSLSEDALRSVYGLHTSREVWFSLAKKYNRVSASRKSDLQRRLNPVSK 136

Query: 133 GEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLS---DHTTLS 189
            E  M +YL  +K + D+L   G P+  ++ I   LNGL  EY  +   +    D   +S
Sbjct: 137 NEKSMLEYLNCVKQICDQLDSIGCPVPENEKIFGVLNGLGQEYMLVSTMIKGSMDTYPMS 196

Query: 190 WVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNF-RGWRG 248
           + D+  +L+ F+ +++   +                    NR  +N   +G  F +    
Sbjct: 197 FEDVVFKLINFDDKLQNGQS------------------GGNRGRNNYTTKGRGFPQQISS 238

Query: 249 GRGRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAFIASQNSVED 308
           G      ++  CQ+C K  H+A  C+ RFD  +   ++S          AF A + S + 
Sbjct: 239 GSPSDSGTRPTCQICNKYGHSAYKCWKRFDHAFQSEDFS---------KAFAAMRVSDQK 289

Query: 309 YD-WYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV----ATCSSKLKSLNL 363
            + W  DSGA++H+T+ T++ Q    + G++S++VGN D L I     A  +S   +L L
Sbjct: 290 SNPWVTDSGATSHITNSTSQLQSAQPYSGEDSVIVGNSDFLPITHIGSAVLTSNQGNLPL 349

Query: 364 DDVLYVPNITKNLLSVSKLAADNNIFVEFDKN 395
            DVL  PNITK+LLSVSKL +D    +EFD +
Sbjct: 350 RDVLVCPNITKSLLSVSKLTSDYPCVIEFDSD 381


  Database: uniref100
    Posted date:  Jan 5, 2005  1:24 AM
  Number of letters in database: 848,049,833
  Number of sequences in database:  2,790,947
  
Lambda     K      H
   0.316    0.130    0.391 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 756,671,546
Number of Sequences: 2790947
Number of extensions: 30119972
Number of successful extensions: 110599
Number of sequences better than 10.0: 684
Number of HSP's better than 10.0 without gapping: 168
Number of HSP's successfully gapped in prelim test: 532
Number of HSP's that attempted gapping in prelim test: 107714
Number of HSP's gapped (non-prelim): 1935
length of query: 482
length of database: 848,049,833
effective HSP length: 131
effective length of query: 351
effective length of database: 482,435,776
effective search space: 169334957376
effective search space used: 169334957376
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)


Medicago: description of AC144765.7