
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147002.8 + phase: 0 /pseudo
(1664 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q84MR5 Putative copia-type pol polyprotein [Oryza sativa] 133 5e-29
UniRef100_Q6ATD3 Putative polyprotein [Oryza sativa] 132 7e-29
UniRef100_Q84VI2 Gag-pol polyprotein [Glycine max] 129 8e-28
UniRef100_Q84VI4 Gag-pol polyprotein [Glycine max] 127 4e-27
UniRef100_Q84VH8 Gag-pol polyprotein [Glycine max] 127 4e-27
UniRef100_Q94H19 Putative gag-pol polyprotein, 3'-partial [Oryza... 126 5e-27
UniRef100_Q94GR9 Putative gag-pol polyprotein [Oryza sativa] 126 5e-27
UniRef100_Q9FG84 Copia-like retroelement pol polyprotein [Arabid... 126 6e-27
UniRef100_Q9C5V1 Gag/pol polyprotein [Arabidopsis thaliana] 126 6e-27
UniRef100_Q84T51 Putative copia-type pol polyprotein [Oryza sativa] 126 6e-27
UniRef100_Q7XS40 OSJNBa0069D17.3 protein [Oryza sativa] 121 2e-25
UniRef100_Q7XP45 OSJNBa0063G07.6 protein [Oryza sativa] 121 2e-25
UniRef100_Q8H8K3 Putative retroelement [Oryza sativa] 120 3e-25
UniRef100_Q94HD2 Putative retroelement [Oryza sativa] 120 3e-25
UniRef100_Q7XES3 Contains similarity to Zea mays retrotransposon... 120 4e-25
UniRef100_Q7XL83 OSJNBb0014D23.6 protein [Oryza sativa] 120 4e-25
UniRef100_Q9AY98 Putative copia-type pol polyprotein [Oryza sativa] 120 5e-25
UniRef100_Q8LMY8 Putative polyprotein [Oryza sativa] 119 8e-25
UniRef100_Q8S5D7 Putative gag-pol polyprotein [Oryza sativa] 119 8e-25
UniRef100_Q7XTX2 OSJNBa0019K04.26 protein [Oryza sativa] 118 2e-24
>UniRef100_Q84MR5 Putative copia-type pol polyprotein [Oryza sativa]
Length = 1896
Score = 133 bits (334), Expect = 5e-29
Identities = 125/487 (25%), Positives = 217/487 (43%), Gaps = 51/487 (10%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVD----DLDLDEEGAAIDRKIHTPAQKKLYKKHHKIR 56
WK M I+ L +W ++ G+D D++L E Q++L ++ ++
Sbjct: 62 WKHKMKLHIISLHPSIWKVVCTGIDVPHDDMELTSE------------QEQLIHRNDQVS 109
Query: 57 GIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDES 116
I++++ E+ K+ AKA++ +L EGS V+EAK +L + F M D E+
Sbjct: 110 NAILSALSPEEFNKVDRLEEAKAIWDTLQLAHEGSPAVREAKIELLEGRLGRFVMDDKET 169
Query: 117 IEEMYSRFQTLVSGLQILKKSYVSSDHVSK-ILRSLPSRWRPKVTAIEEAKDLNTLSVED 175
+EMY R LV+ ++ L +++ V K +LR+ R V+ I E KD L+ D
Sbjct: 170 PQEMYDRMMILVNKIKGLGSEDMTNHFVVKRLLRAFGPRNPTLVSMIHERKDFKRLTPSD 229
Query: 176 LVSSLKVHEMSLNE-HETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSV 234
++ + HEM E E + K+ A+ A KA + E S + DE+
Sbjct: 230 ILGRIVSHEMQEEEAREVRQMVKNAAM-----IKNQEVALKAKQEEESSCEESKDEE--- 281
Query: 235 KMAMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQKGCFNCKKPGHFIADCPDLQKEKFKG 294
MA++ + ++ R+ ++ K K++ ++ CFNC + GHFIADCP + K KG
Sbjct: 282 -MALIVKRFKHFLRRSGYGKERKDDDKG-KRQSKRACFNCGEYGHFIADCPKTNEAKGKG 339
Query: 295 KSKKSSFNSSKFRKQIKKSLMA-TWEDLDSESGSDKEEADDDAKAAVGLVATVSSEAVSE 353
KK + ++ MA W D E K + + + V VA SS + E
Sbjct: 340 GKKKPEC------AHVAEAHMAEVWYSEDEEDPKPKPKDKVEGEGGVATVAFKSSSSSKE 393
Query: 354 A--------ESDSEDENEVYS-------KIPRQELVDSLKELLSLFEHRTNELTDLKEKY 398
SD D++ YS K+ Q+L + S E NEL D+ + +
Sbjct: 394 RLFMAQGHNLSDDNDDSYHYSCFMAQGRKVMTQKLSHTSLNDDSSDEESDNELDDVLKSF 453
Query: 399 VDLMKQQKSTLLE-LKASEEELKGFNLISATYEDRLKSLCQKLQEKCDKGSGNKHEIALD 457
Q + L+ L + E+ L+ + + R +L + L ++C K +++ L
Sbjct: 454 SKPAMQHLAKLMRALDSKEQSLERQEELLILEKKRNLALEESLAKECAKNEQLANDLNLA 513
Query: 458 DFIMAGI 464
+ +A +
Sbjct: 514 NGSLASL 520
>UniRef100_Q6ATD3 Putative polyprotein [Oryza sativa]
Length = 1362
Score = 132 bits (333), Expect = 7e-29
Identities = 131/494 (26%), Positives = 221/494 (44%), Gaps = 65/494 (13%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVD----DLDLDEEGAAIDRKIHTPAQKKLYKKHHKIR 56
WK M ++ L +W ++ GVD D++L E Q++L ++ +
Sbjct: 63 WKHKMKLHLISLHPSIWKVVCTGVDVPHDDMELTSE------------QEQLIHRNAQAS 110
Query: 57 GIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDES 116
I++++ E+ K+ AK ++ +L EGS V+EAK +L + F M D E+
Sbjct: 111 NAILSALSPEEFNKVDGLEEAKEIWDTLQLAHEGSPAVREAKIELLEGRLGRFVMGDKET 170
Query: 117 IEEMYSRFQTLVSGLQILKKSYVSSDHVSK-ILRSLPSRWRPKVTAIEEAKDLNTLSVED 175
+EMY R LV+ ++ L +++ V K +LR+ R V+ I E KD L+ D
Sbjct: 171 PQEMYDRMMILVNKIKGLGSEDMTNHFVVKRLLRAFGPRNPTLVSMIRERKDFKRLTPSD 230
Query: 176 LVSSLKVHEMSLNE-HETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSV 234
++ + HEM E E + K+ A+ A KA + E S + DE+
Sbjct: 231 ILGRIVSHEMQEEEAREVRQMVKNAAM-----IKNQEIALKAKQEEESSCEESEDEE--- 282
Query: 235 KMAMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQKG-------CFNCKKPGHFIADCPDL 287
+ ++ ++ K FL K G Y +K+D KG CFNC + GHFIADCP
Sbjct: 283 --------MAFIVKRFKHFLRKSG-YGKGRKDDDKGKRQSKRACFNCGEYGHFIADCPKS 333
Query: 288 QKEKFKGKSKKSSFNSSKFRKQIKKSLMA-TWEDLDSESGSDK-EEADDDAKAAVGLVAT 345
+ K KG KK R + ++ MA W D E K + D G VAT
Sbjct: 334 NEAKAKGGKKKPE------RAHVAEAHMAEVWYSRDEEDPEVKPKPKPKDKVEGEGGVAT 387
Query: 346 VSSEAVSEAE-------SDSEDENEVYS-------KIPRQELVDSLKELLSLFEHRTNEL 391
V+ ++ S ++ SD +D++ YS K+ Q+ + ++ S E NEL
Sbjct: 388 VAFKSSSSSKERLFNNLSDDDDDSYHYSCFMAQGRKVMTQKPSHTSLDVDSSDEESDNEL 447
Query: 392 TDLKEKYVDLMKQQKSTLLE-LKASEEELKGFNLISATYEDRLKSLCQKLQEKCDKGSGN 450
D+ + + Q + L+ L + E+ L+ + + R +L + L ++C K
Sbjct: 448 DDVLKSFSKPAMQHLAKLMRALYSKEQSLERQEELLILEKKRNLALEESLAKECAKNEQL 507
Query: 451 KHEIALDDFIMAGI 464
+E+ L + +A +
Sbjct: 508 ANELNLANGSLASL 521
>UniRef100_Q84VI2 Gag-pol polyprotein [Glycine max]
Length = 1576
Score = 129 bits (324), Expect = 8e-28
Identities = 125/478 (26%), Positives = 207/478 (43%), Gaps = 78/478 (16%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLD-LDEEGAAIDR----KIHTPAQKKLYKKHHKI 55
WK M +F+ LD W + G + LD EG D + T + +L + K
Sbjct: 24 WKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDELKPEEDWTKEEDELALGNSKA 83
Query: 56 RGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDE 115
+ + + + ++ + AK L + EG+ KVK ++ +L ++E +MK++E
Sbjct: 84 LNALFNGVDKNIFRLINTCTVAKDACEILKSTHEGTSKVKMSRLQLLATKFENLKMKEEE 143
Query: 116 SIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVED 175
I + + + + L + V KILRSLP R+ KVTAIEEA+D+ + V++
Sbjct: 144 CIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDE 203
Query: 176 LVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVK 235
L+ SL+ E+ L++ KKSK++A S E E+ D D+DE +
Sbjct: 204 LIGSLQTFELGLSD-RAEKKSKNLAFVSN------------DEGEEDEYDLDTDEGLTNA 250
Query: 236 MAMLSNK----LEYLARKQKKFL----------SKRGSYKNFKKEDQKG--CFNCKKPGH 279
+ +L + L + ++QK + SK + K KG C C+ GH
Sbjct: 251 VVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKGIQCHGCEGYGH 310
Query: 280 FIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKAA 339
IA+CP K+ KG S S D +SE SD +D D A
Sbjct: 311 IIAECPTHLKKHRKGLSVCQS-------------------DTESEQESD---SDRDVNAL 348
Query: 340 VGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSLFEHRTNELTDLKEKYV 399
+G+ T +ED ++ S+I EL S ++L E + LK+
Sbjct: 349 IGIFET------------AEDSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIA 396
Query: 400 DLMKQQKSTLLELKASEEELKG-FNLISATYEDRLKSLCQKLQEKCDKGSGNKHEIAL 456
DL ++++ E+ ELKG +++ E+ KS+ + +KGS E+ L
Sbjct: 397 DLEAEKEAHKEEI----SELKGEVGFLNSKLENMTKSI-----KMLNKGSDTLDEVLL 445
>UniRef100_Q84VI4 Gag-pol polyprotein [Glycine max]
Length = 1574
Score = 127 bits (318), Expect = 4e-27
Identities = 125/478 (26%), Positives = 206/478 (42%), Gaps = 78/478 (16%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLD-LDEEGAAIDR----KIHTPAQKKLYKKHHKI 55
WK M +F+ LD W + G + LD EG D + T + +L + K
Sbjct: 24 WKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDELKPEEDWTKEEDELALGNSKA 83
Query: 56 RGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDE 115
+ + + + ++ + AK + L EG+ KVK ++ +L ++E +MK++E
Sbjct: 84 LNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKVKISRLQLLATKFENLKMKEEE 143
Query: 116 SIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVED 175
I + + + + L + V KILRSLP R+ KVTAIEEA+D+ + V++
Sbjct: 144 CIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDE 203
Query: 176 LVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVK 235
L+ SL+ E+ L++ KKSK++A S E E+ D ++DE +
Sbjct: 204 LIGSLQTFELGLSD-RAEKKSKNLAFVSN------------DEGEEDEYDLNTDEGLTNA 250
Query: 236 MAMLSNK----LEYLARKQKKFLSK-----RGSYKNFKKEDQK-------GCFNCKKPGH 279
+ +L + L + ++QK + R K KK D K C C+ GH
Sbjct: 251 VVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKKSDVKPSHSKGIQCHGCEGYGH 310
Query: 280 FIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKAA 339
IA+CP K+ KG S S D +SE SD +D D A
Sbjct: 311 IIAECPTHLKKHRKGLSVCQS-------------------DTESEQESD---SDRDVNAL 348
Query: 340 VGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSLFEHRTNELTDLKEKYV 399
G+ T +ED ++ S+I EL S ++L E + LK+
Sbjct: 349 TGIFET------------AEDSSDTDSEITFDELATSYRKLCIKSEKILQQEAQLKKVIA 396
Query: 400 DLMKQQKSTLLELKASEEELKG-FNLISATYEDRLKSLCQKLQEKCDKGSGNKHEIAL 456
DL ++++ E+ ELKG +++ E+ KS+ + +KGS E+ L
Sbjct: 397 DLEAEKEAHKEEI----SELKGEVGFLNSKLENMTKSI-----KMLNKGSDTLDEVLL 445
>UniRef100_Q84VH8 Gag-pol polyprotein [Glycine max]
Length = 1576
Score = 127 bits (318), Expect = 4e-27
Identities = 125/478 (26%), Positives = 205/478 (42%), Gaps = 78/478 (16%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLD-LDEEGAAIDR----KIHTPAQKKLYKKHHKI 55
WK M +F+ LD W + G + LD EG D + T + +L + K
Sbjct: 24 WKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDELKPEEDWTKEEDELALGNSKA 83
Query: 56 RGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDE 115
+ + + + ++ + AK + L EG+ KVK ++ +L ++E +MK++E
Sbjct: 84 LNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKVKMSRLQLLATKFENLKMKEEE 143
Query: 116 SIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVED 175
I + + + + L + V KILRSLP R+ KVTAIEEA+D+ + V++
Sbjct: 144 CIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDE 203
Query: 176 LVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVK 235
L+ SL+ E+ L++ KKSK++A S E E+ D D+DE +
Sbjct: 204 LIGSLQTFELGLSD-RAEKKSKNLAFVSN------------DEGEEDEYDLDTDEGLTNA 250
Query: 236 MAMLSNK----LEYLARKQKKFL----------SKRGSYKNFKKEDQKG--CFNCKKPGH 279
+ +L + L + ++QK + SK + K KG C C+ GH
Sbjct: 251 VVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKGIQCHGCEGYGH 310
Query: 280 FIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKAA 339
IA+CP K+ KG S S D +SE SD +D D A
Sbjct: 311 IIAECPTHLKKHRKGLSVCQS-------------------DTESEQESD---SDRDVNAL 348
Query: 340 VGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSLFEHRTNELTDLKEKYV 399
G+ T +ED ++ S+I EL S ++L E + LK+
Sbjct: 349 TGIFET------------AEDSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIA 396
Query: 400 DLMKQQKSTLLELKASEEELKG-FNLISATYEDRLKSLCQKLQEKCDKGSGNKHEIAL 456
DL ++++ E+ ELKG +++ E KS+ + +KGS E+ L
Sbjct: 397 DLEAEKEAHEEEI----SELKGEVGFLNSKLETMKKSI-----KMLNKGSDTLDEVLL 445
>UniRef100_Q94H19 Putative gag-pol polyprotein, 3'-partial [Oryza sativa]
Length = 1074
Score = 126 bits (317), Expect = 5e-27
Identities = 123/455 (27%), Positives = 202/455 (44%), Gaps = 49/455 (10%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLDLDEEGAAIDRKIHTPAQKKLYKKHHKIRGIIV 60
WK M + ++ +W I+E G ++ D H AQ +I+
Sbjct: 47 WKHKMKMHLKSINPSIWRIVEKGYVLQKPEDPTKEDDENEHKNAQAA---------NVIL 97
Query: 61 ASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDESIEEM 120
+++ +E+ ++ D +AK ++ +L EG+ V+E+K +L Q+E F M D ES +M
Sbjct: 98 SALSGSEFNRVDDIESAKVIWDTLRNLHEGTDSVRESKVEILKGQFERFVMLDGESPSDM 157
Query: 121 YSRFQTLVSGLQIL-KKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSS 179
Y +V+ ++ L K V K++R++ R VT I E D TL+ DL+
Sbjct: 158 YDHLSKIVNEIKGLGSKDMTDEVVVKKMVRAITLRNSTLVTIIRERPDYKTLTPHDLLGR 217
Query: 180 LKVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDED-QSVKMAM 238
+ H+M + E+ + + I S K A KA E EES S E+ +M +
Sbjct: 218 ILAHDMLV--QESKEVIQYINQSSTTSIKKEDLALKAKEEEEESRKSKSKEEIDDEEMTL 275
Query: 239 LSNKLEYLARKQ---KKFLSKRGSYKNFKKEDQKGCFNCKKPGHFIADCPDLQ-----KE 290
K R+ K SK S K + + C+ CK+PGHFIADCP L+ KE
Sbjct: 276 FVKKFGKFMRRSGFFKGSSSKHHSNKLSGRHSARVCYVCKEPGHFIADCPYLKDGSHIKE 335
Query: 291 KFKG--KSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEA---------------- 332
G K +K + R++ ++ + DSES S EE
Sbjct: 336 DKMGEKKDEKKEKHKHSKRERYGQAHLGKIFGSDSESSSSDEEGIATFAVKPSSPPRLFN 395
Query: 333 -DDDAKAAVGLVA----TVSSEAVSEAESDSEDENEVYSKIPRQELVDSL-KELLSLFEH 386
D A + L+A SS + S++E++V ++ +EL S+ KE L H
Sbjct: 396 YSSDEDAPICLMAKEPKVPSSLKSFNVDLVSDEEDDVEDEVMDEELFKSITKESL---PH 452
Query: 387 RTNELTDLKEKYVDLMKQQKSTLLELKASEEELKG 421
+ L ++++Y L ++Q++ L+ K ELKG
Sbjct: 453 LSELLGRIEDQYATL-ERQEALLIREKERSHELKG 486
>UniRef100_Q94GR9 Putative gag-pol polyprotein [Oryza sativa]
Length = 1700
Score = 126 bits (317), Expect = 5e-27
Identities = 118/470 (25%), Positives = 208/470 (44%), Gaps = 72/470 (15%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLDLDEEGAAIDRKIHTPAQKKLYKKHHKIRGIIV 60
WK M + + + +W I++ G AI T + + + + +
Sbjct: 22 WKIKMSTHLKAMSFHIWSIVDVGF----------AITGTPLTEIDHRNLQLNAQAMNALF 71
Query: 61 ASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDESIEEM 120
S+ + E+ ++S+ TA ++ L EG+ + K+AK L QYE F M ES+ +M
Sbjct: 72 NSLSQEEFDRVSNLETAYEIWNKLAEIHEGTSEYKDAKLHFLKIQYETFSMLPHESVNDM 131
Query: 121 YSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSSL 180
Y R +V+ L+ L +Y + K+LR+LP ++ VT + + D++ ++ L+ +
Sbjct: 132 YGRLNVIVNDLKGLGANYTDLEVAQKMLRALPEKYETFVTMLINS-DMSRMTPASLLGKI 190
Query: 181 KVHEM----SLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVKM 236
++M E S K IAL ++ + SK + +E +EE +M
Sbjct: 191 NTNDMYKLKKKEIEEASPSKKCIALQAEVEDQSKSKVNEVNEDLEE------------EM 238
Query: 237 AMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQKG-------CFNCKKPGHFIADCPDLQK 289
+L+ + L ++K+ RGS N +K + CF C + GHF + CP
Sbjct: 239 VLLARRFNDLLGRRKE--RGRGSNSNRRKNRRPNKTLSNLRCFECGEKGHFASKCPSKDD 296
Query: 290 EKFKGKSKKSSFNSSKFRKQIKK------SLMATW---EDLDSESGSDKEEADDDAK--- 337
+ K KKS K K++KK + + W E+ + SGS++E DD +
Sbjct: 297 DGDKSSKKKS--GGYKLMKKLKKEGKKIEAFIGEWDSNEESSASSGSEEEGGDDASSKKK 354
Query: 338 --------------AAVGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSL 383
A + L+A SS+ S ++S+S+D+ + + ELV +EL +
Sbjct: 355 KMAVVAIKEAPSLFAPLCLMAKGSSKVTSLSDSESDDDCD---DVSYDELVSMFEELHTY 411
Query: 384 FEHRTNELTDLKEKYVD---LMKQQKSTLLELKASEEELKGF--NLISAT 428
E + LK+ + L ++ K+T L S E+LK NL+S T
Sbjct: 412 SEKEIVKFKALKKDHASLEVLYEELKTTHERLTISHEKLKEAHDNLLSTT 461
Score = 43.9 bits (102), Expect = 0.042
Identities = 62/258 (24%), Positives = 104/258 (40%), Gaps = 55/258 (21%)
Query: 77 AKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDESIEEMYSRFQTL-------VS 129
A ++FA LC +GS KV + E DD S +E+ S F+ L +
Sbjct: 364 APSLFAPLCLMAKGSSKVTS------LSDSESDDDCDDVSYDELVSMFEELHTYSEKEIV 417
Query: 130 GLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVE------DLVSSLKVH 183
+ LKK + S + + + L++ R ++EA D N LS D+ S +
Sbjct: 418 KFKALKKDHASLEVLYEELKTTHERLTISHEKLKEAHD-NLLSTTQHGAHIDVDISCDLL 476
Query: 184 EMSLNEHETSKKSKSIA--------LPSKGKTSKS-----------------SKAYKASE 218
+ S H S SI+ +PS +S S + K ++
Sbjct: 477 DDSATCHIAHVASSSISTSCDNLMDMPSPCSSSSSCVSICDASLVVENNELKEQVSKLNK 536
Query: 219 SVEESPDGDSDEDQSVKMAMLSNKLEYLARKQKKFLSKRGSYKN-----FKKEDQKGCFN 273
S+E G + D+ +LS + L ++ F+ K+G + F K + K C
Sbjct: 537 SLERCFKGKNTLDK-----ILSEQRCILNKEGLGFIPKKGKKPSHHATRFVKRNGKYCSK 591
Query: 274 CKKPGHFIADCPDLQKEK 291
C++ GH ++DCP + K
Sbjct: 592 CREVGHLVSDCPGSKPPK 609
>UniRef100_Q9FG84 Copia-like retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1013
Score = 126 bits (316), Expect = 6e-27
Identities = 120/483 (24%), Positives = 204/483 (41%), Gaps = 42/483 (8%)
Query: 1 WKTNMYSFIMGLDEELWDIL-----------EDGVDDLDLDEEGAAIDRKIHTPAQKKLY 49
WK M + I GL +E W E+G D L +++ T A++
Sbjct: 24 WKVKMRALIRGLGKEAWIATSVGWKAPVVKGENGEDVLKTEDQW--------TDAEEAKA 75
Query: 50 KKHHKIRGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELF 109
+ + +I S+ + ++ ++ + +AK + L +EG+ VK ++ ML Q+E
Sbjct: 76 TANSRALSLIFNSVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSVKRSRIDMLASQFENL 135
Query: 110 RMKDDESIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLN 169
M + E+IEE + + S L K Y V K+LR LPSR+ K TA+ + D +
Sbjct: 136 TMDESENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFESKRTAMGTSLDTD 195
Query: 170 TLSVEDLVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASE--SVEESPDGD 227
T+ E++V L+ +E+ + KG SK +SE ++E D
Sbjct: 196 TIDFEEVVGMLQAYELEITS-------------GKGGYSKGVALAVSSEKNEIQELKDSM 242
Query: 228 SDEDQSVKMAMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQKGCFNCKKPGHFIADCPDL 287
S ++ AM + AR Q + K + C C+ GH A+CP L
Sbjct: 243 SMMAKNFSRAMKRVEKRGFARNQGSDRDRDRDRDRNSKRSEIQCHECQGYGHIKAECPSL 302
Query: 288 QKEKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKAAVGLVATVS 347
+++ K S+ +KF KS +S+S SD E++++D K V V +
Sbjct: 303 KRKDLK-CSECRGIGHTKFDCIGSKSKPDRSYIAESDSDSDDEDSEEDVKGFVSFVGIIE 361
Query: 348 SEAVSEAESDSE---DENEVYSKIPRQELVDSLKELLSLFEH---RTNELTDLKEKYVDL 401
+ VS SDSE ++ E+ + +D E L+E+ + E E+ V +
Sbjct: 362 DDNVSSDSSDSEVGCEKEEISADDESDVEMDVDGEFRKLYENWLVLSKEKVIWLEEKVKV 421
Query: 402 MKQQKSTLLELKASEEELKGFNLISATYEDRLKSLCQKLQEKCDK-GSGNKHEIALDDFI 460
+Q + EL + + L + E++ + L Q L + K NK LD +
Sbjct: 422 QEQIEQLKGELAVANQIKSEMILKYSAKEEKNRELSQDLSDTRKKIHMLNKGTKDLDSIL 481
Query: 461 MAG 463
AG
Sbjct: 482 AAG 484
>UniRef100_Q9C5V1 Gag/pol polyprotein [Arabidopsis thaliana]
Length = 1643
Score = 126 bits (316), Expect = 6e-27
Identities = 120/483 (24%), Positives = 204/483 (41%), Gaps = 42/483 (8%)
Query: 1 WKTNMYSFIMGLDEELWDIL-----------EDGVDDLDLDEEGAAIDRKIHTPAQKKLY 49
WK M + I GL +E W E+G D L +++ T A++
Sbjct: 24 WKVKMRALIRGLGKEAWIATSVGWKAPVVKGENGEDVLKTEDQW--------TDAEEAKA 75
Query: 50 KKHHKIRGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELF 109
+ + +I S+ + ++ ++ + +AK + L +EG+ VK ++ ML Q+E
Sbjct: 76 TANSRALSLIFNSVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSVKRSRIDMLASQFENL 135
Query: 110 RMKDDESIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLN 169
M + E+IEE + + S L K Y V K+LR LPSR+ K TA+ + D +
Sbjct: 136 TMDESENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFESKRTAMGTSLDTD 195
Query: 170 TLSVEDLVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASE--SVEESPDGD 227
T+ E++V L+ +E+ + KG SK +SE ++E D
Sbjct: 196 TIDFEEVVGMLQAYELEITS-------------GKGGYSKGVALAVSSEKNEIQELKDSM 242
Query: 228 SDEDQSVKMAMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQKGCFNCKKPGHFIADCPDL 287
S ++ AM + AR Q + K + C C+ GH A+CP L
Sbjct: 243 SMMAKNFSRAMKRVEKRGFARNQGSDRDRDRDRDRNSKRSEIQCHECQGYGHIKAECPSL 302
Query: 288 QKEKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKAAVGLVATVS 347
+++ K S+ +KF KS +S+S SD E++++D K V V +
Sbjct: 303 KRKDLK-CSECRGIGHTKFDCIGSKSKPDRSYIAESDSDSDDEDSEEDVKGFVSFVGIIE 361
Query: 348 SEAVSEAESDSE---DENEVYSKIPRQELVDSLKELLSLFEH---RTNELTDLKEKYVDL 401
+ VS SDSE ++ E+ + +D E L+E+ + E E+ V +
Sbjct: 362 DDNVSSDSSDSEVGCEKEEISADDESDVEMDVDGEFRKLYENWLVLSKEKVIWLEEKVKV 421
Query: 402 MKQQKSTLLELKASEEELKGFNLISATYEDRLKSLCQKLQEKCDK-GSGNKHEIALDDFI 460
+Q + EL + + L + E++ + L Q L + K NK LD +
Sbjct: 422 QEQIEQLKGELAVANQIKSEMILKYSAKEEKNRELSQDLSDTRKKIHMLNKGTKDLDSIL 481
Query: 461 MAG 463
AG
Sbjct: 482 AAG 484
>UniRef100_Q84T51 Putative copia-type pol polyprotein [Oryza sativa]
Length = 2027
Score = 126 bits (316), Expect = 6e-27
Identities = 120/451 (26%), Positives = 198/451 (43%), Gaps = 47/451 (10%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLDLDEEGAAIDRKIHTPAQKKLYKKHHKIRGIIV 60
WK M + ++ +W I+E G + + D H AQ I+
Sbjct: 47 WKHKMKMHLKSINPSIWRIVEKGYVLQNPENPTKEDDENEHKNAQAA---------NAIL 97
Query: 61 ASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDESIEEM 120
+++ +E+ ++ D +AK ++ +L EG+ V+E+K +L Q+E F M D ES +M
Sbjct: 98 SALSGSEFNRVDDIESAKVIWDTLRNLHEGTDSVRESKVEILKGQFERFIMLDGESPSDM 157
Query: 121 YSRFQTLVSGLQIL-KKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSS 179
Y R +V+ ++ L K V K++R++ R VT I E D TL+ DL+
Sbjct: 158 YDRLSKIVNEIKGLGSKDMTDEVVVKKMVRAITPRNSTLVTIIRERPDYKTLTPHDLLGR 217
Query: 180 LKVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSD-EDQSVKMAM 238
+ H+M + E+ + I S K A KA E EE+ S E +MA+
Sbjct: 218 ILAHDMLV--QESKDVIQYINQSSTTNIKKEDLALKAKEEEEENRKSKSKAEIDDEEMAL 275
Query: 239 LSNKLEYLARKQKKF---LSKRGSYKNFKKEDQKGCFNCKKPGHFIADCPDL------QK 289
K R+ F SK S K+ + + C+ CK+PGHFIADCP L ++
Sbjct: 276 FVKKFGKFMRRSGFFKGGSSKHYSNKSSGRHSARMCYVCKEPGHFIADCPHLKDGSHIKE 335
Query: 290 EKFKGKSKKSSFNSSKFRK--QIKKSLMATWEDLDSESGSDKEEA--------------- 332
+K KG+ K K+ K ++ + DSES S EE
Sbjct: 336 DKKKGEKKDEKKEKHKYSKWEHYGQAHLGMIFGSDSESSSSDEEGVATFAVKPSSPPRLF 395
Query: 333 --DDDAKAAVGLVA---TVSSEAVS-EAESDSEDENEVYSKIPRQELVDSLKELLSLFEH 386
D A + L+A V S S + S++E+EV ++ EL+ S+ + H
Sbjct: 396 DYSSDEDAPICLMAKEPKVPSPLKSFNVDFVSDEEDEVEDEVMDDELLKSITK--ESIPH 453
Query: 387 RTNELTDLKEKYVDLMKQQKSTLLELKASEE 417
+ L ++++Y L +Q++ + E + S E
Sbjct: 454 LSELLGRIEDQYATLERQEELLIHEKEQSHE 484
>UniRef100_Q7XS40 OSJNBa0069D17.3 protein [Oryza sativa]
Length = 1156
Score = 121 bits (304), Expect = 2e-25
Identities = 121/448 (27%), Positives = 202/448 (45%), Gaps = 47/448 (10%)
Query: 42 TPAQKKLYKKHHKIRGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALM 101
T Q++L ++ + I++++ E+ K+ AK ++ +L EGS V+EAK +
Sbjct: 4 TSEQEQLIHRNAQASKAILSALSPEEFNKVDGLEEAKEIWDTLQLAHEGSPAVREAKIEL 63
Query: 102 LVHQYELFRMKDDESIEEMYSRFQTLVSGLQILKKSYVSSDHVSK-ILRSLPSRWRPKVT 160
L + F M D E+ +EMY R LV+ ++ L +++ V K +LR+ R V+
Sbjct: 64 LEGRLGRFVMDDKETPQEMYDRMMILVNKIKGLGSEDMTNHFVVKRLLRAFGPRNPTLVS 123
Query: 161 AIEEAKDLNTLSVEDLVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESV 220
I E KD L+ D++ S+ HEM E ++ A K + A KA++
Sbjct: 124 MIRERKDFKRLTPNDILGSIVSHEMQEEEAREVRQMVKNAPMIKNQ----EVALKANQEE 179
Query: 221 EESPDGDSDEDQSVKMAMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQKG-------CFN 273
E S + DE+ L ++ ++ K FL K G Y +K+D KG CFN
Sbjct: 180 ESSCEESEDEE-----------LAFIVKRFKHFLRKSG-YGKGRKDDDKGKRQSKRACFN 227
Query: 274 CKKPGHFIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKKSLMA-TWEDLDSESGSDK-EE 331
C + GHFIADCP + K KG KK R + ++ MA W D E K +
Sbjct: 228 CGEYGHFIADCPRSNEAKAKGGKKKPK------RAHVAEAHMAEVWYSGDEEYPEVKPKP 281
Query: 332 ADDDAKAAVGLVATVSSEAVSEAE-------SDSEDENEVYS-------KIPRQELVDSL 377
D G VAT++ ++ S ++ SD +D++ YS K+ Q+ +
Sbjct: 282 KSKDKVEGEGGVATIAFKSSSSSKERLFNNLSDDDDDSYHYSCFMAQGRKVMTQKPSHTS 341
Query: 378 KELLSLFEHRTNELTD-LKEKYVDLMKQQKSTLLELKASEEELKGFNLISATYEDRLKSL 436
++ S E NEL D LK M+ + L + E+ L+ + + R L
Sbjct: 342 LDVDSSDEESDNELDDVLKSLSKPTMQHLAKLMRALDSKEQSLERQEELLILEKKRNLDL 401
Query: 437 CQKLQEKCDKGSGNKHEIALDDFIMAGI 464
+ L ++C K +++ L + +A +
Sbjct: 402 EESLAKECAKNEQLANDLNLANGSLASL 429
>UniRef100_Q7XP45 OSJNBa0063G07.6 protein [Oryza sativa]
Length = 1539
Score = 121 bits (303), Expect = 2e-25
Identities = 114/470 (24%), Positives = 211/470 (44%), Gaps = 64/470 (13%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLDLDEEGAAIDRKIHTPAQKKLYKKHHKIRGIIV 60
WK M + + + +W I++ G AI T + + + + +
Sbjct: 22 WKIKMSTHLKAMSFHIWSIVDVGF----------AITGTPLTEIDHRNLQLNAQAMNALF 71
Query: 61 ASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDESIEEM 120
S+ + E+ ++S+ TA ++ L EG+ + K+AK L QYE F M ES+ +M
Sbjct: 72 NSLSQEEFDRVSNLETAYEIWNKLAKIHEGTSEYKDAKLHFLKIQYETFSMLPHESVNDM 131
Query: 121 YSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSSL 180
Y R +V+ L+ L +Y + K+LR+LP ++ VT + + D++ ++ L+ +
Sbjct: 132 YGRLNVIVNDLKGLGANYTDLEVAKKMLRALPEKYETLVTMLINS-DMSRMTPASLLGKI 190
Query: 181 KVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPD-GDSDEDQSVKMAML 239
++M ++ KK A PSK K VE+ D +++ ++A+L
Sbjct: 191 NTNDM----YKLKKKEMEEASPSK-------KCIALQAEVEDKGKVNDVNKNLEEEIALL 239
Query: 240 SNKL-EYLARKQKKFLSKRGSYKNFKKEDQK----GCFNCKKPGHFIADCPDLQKEKFKG 294
+ + ++L R++++ + + K+ ++ CF C + GHF + CP + K
Sbjct: 240 ARRFNDFLGRRKERGKGSNSNRRRNKRPNKTLSNLRCFECGEKGHFASKCPSKDDDGDKS 299
Query: 295 KSKKSSFNSSKFRKQIKK------SLMATW---EDLDSESGSDKEEADDDAK-------- 337
KKS K K++KK + + W E+ + SGS +E+ DD +
Sbjct: 300 SKKKS--GGYKLMKKLKKEGKKIEAFIGEWDSNEESSASSGSKEEDGDDASSKKKKMAVV 357
Query: 338 ---------AAVGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSLFEHRT 388
A + L+A SS+ S ++S+S+D+ + I ELV +EL + E
Sbjct: 358 AIKEAPSLFAPLCLMAKGSSKVTSLSDSESDDDCD---DISYDELVSMFEELHAYSEKEI 414
Query: 389 NELTDLKEKYVD---LMKQQKSTLLELKASEEELKGF--NLISATYEDRL 433
+ LK+ + L ++ K++ L S E+LK NL+S T L
Sbjct: 415 VKFKALKKDHASLEVLYEELKTSHERLTISHEKLKEAHDNLLSTTQHGAL 464
>UniRef100_Q8H8K3 Putative retroelement [Oryza sativa]
Length = 1299
Score = 120 bits (302), Expect = 3e-25
Identities = 117/449 (26%), Positives = 199/449 (44%), Gaps = 73/449 (16%)
Query: 33 GAAIDRKIHTPAQKKLYKKHHKIRGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSK 92
G AI K T + + + + + S+ + E+ ++S+ TA ++ L EG+
Sbjct: 10 GFAITGKPLTEIDHRNLQLNAQAMNALFNSLSQEEFDRVSNLETAYEIWNKLAEIHEGTS 69
Query: 93 KVKEAKALMLVHQYELFRMKDDESIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSLP 152
+ K+AK L QYE F M ES+ +MY R +V+ L+ L +Y + K+LR+LP
Sbjct: 70 EYKDAKLHFLKIQYETFSMLPHESVNDMYGRLNVIVNDLKGLGANYTDLEIAQKMLRALP 129
Query: 153 SRWRPKVTAIEEAKDLNTLSVEDLVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKSSK 212
++ VT + + D++ ++ L+ + ++M ++ KK A PSK K
Sbjct: 130 EKYETLVTMLINS-DMSRMTPASLLGKINTNDM----YKLKKKEMEEASPSK-------K 177
Query: 213 AYKASESVEESPDG---DSDEDQSVKMAMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQK 269
VE+ G +++ED ++A+L+ + L ++K+ RGS N ++ +
Sbjct: 178 CIALQTEVEDKGKGKVNEANEDLEEEIALLARRFNDLLGRKKE--RGRGSNSNRRRNRRP 235
Query: 270 G-------CFNCKKPGHFIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKK------SLMA 316
CF C + GHF + CP + K KKS K K++KK + +
Sbjct: 236 NKTLSNLRCFECGEKGHFASKCPSKDDDGDKSSKKKS--GGYKLMKKLKKEGKKIEAFIG 293
Query: 317 TWE---DLDSESGSDKEEADDDAK-----------------AAVGLVATVSSEAVSEAES 356
W+ + + SGS++E DD + A + L+A SS+ S ++S
Sbjct: 294 EWDSNKESSASSGSEEEGGDDASSKKKKMAVVAIKEAPSLFAPLCLMAKGSSKVTSLSDS 353
Query: 357 DSEDENEVYSKIPRQELVDSLKELLSLFEHRTNELTDLKEKYVDLMKQQKSTLLELKASE 416
+S+D+ + S E +S+FE EL EK + K K LK
Sbjct: 354 ESDDDCDDV----------SYDEFVSMFE----ELHAYSEKEIVKFKALKKNHASLKVLY 399
Query: 417 EELKGFNLISATYEDRLKSLCQKLQEKCD 445
EELK T +RL +KL+E D
Sbjct: 400 EELK-------TSHERLTISHEKLKETHD 421
>UniRef100_Q94HD2 Putative retroelement [Oryza sativa]
Length = 1131
Score = 120 bits (302), Expect = 3e-25
Identities = 117/449 (26%), Positives = 199/449 (44%), Gaps = 73/449 (16%)
Query: 33 GAAIDRKIHTPAQKKLYKKHHKIRGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSK 92
G AI K T + + + + + S+ + E+ ++S+ TA ++ L EG+
Sbjct: 64 GFAITGKPLTEIDHRNLQLNAQAMNALFNSLSQEEFDRVSNLETAYEIWNKLAEIHEGTS 123
Query: 93 KVKEAKALMLVHQYELFRMKDDESIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSLP 152
+ K+AK L QYE F M ES+ +MY R +V+ L+ L +Y + K+LR+LP
Sbjct: 124 EYKDAKLHFLKIQYETFSMLPHESVNDMYGRLNVIVNDLKGLGANYTDLEIAQKMLRALP 183
Query: 153 SRWRPKVTAIEEAKDLNTLSVEDLVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKSSK 212
++ VT + + D++ ++ L+ + ++M ++ KK A PSK K
Sbjct: 184 EKYETLVTMLINS-DMSRMTPASLLGKINTNDM----YKLKKKEMEEASPSK-------K 231
Query: 213 AYKASESVEESPDG---DSDEDQSVKMAMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQK 269
VE+ G +++ED ++A+L+ + L ++K+ RGS N ++ +
Sbjct: 232 CIALQTEVEDKGKGKVNEANEDLEEEIALLARRFNDLLGRKKE--RGRGSNSNRRRNRRP 289
Query: 270 G-------CFNCKKPGHFIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKK------SLMA 316
CF C + GHF + CP + K KKS K K++KK + +
Sbjct: 290 NKTLSNLRCFECGEKGHFASKCPSKDDDGDKSSKKKS--GGYKLMKKLKKEGKKIEAFIG 347
Query: 317 TWE---DLDSESGSDKEEADDDAK-----------------AAVGLVATVSSEAVSEAES 356
W+ + + SGS++E DD + A + L+A SS+ S ++S
Sbjct: 348 EWDSNKESSASSGSEEEGGDDASSKKKKMAVVAIKEAPSLFAPLCLMAKGSSKVTSLSDS 407
Query: 357 DSEDENEVYSKIPRQELVDSLKELLSLFEHRTNELTDLKEKYVDLMKQQKSTLLELKASE 416
+S+D+ + S E +S+FE EL EK + K K LK
Sbjct: 408 ESDDDCDDV----------SYDEFVSMFE----ELHAYSEKEIVKFKALKKNHASLKVLY 453
Query: 417 EELKGFNLISATYEDRLKSLCQKLQEKCD 445
EELK T +RL +KL+E D
Sbjct: 454 EELK-------TSHERLTISHEKLKETHD 475
>UniRef100_Q7XES3 Contains similarity to Zea mays retrotransposon Opie-2 [Oryza
sativa]
Length = 542
Score = 120 bits (301), Expect = 4e-25
Identities = 87/338 (25%), Positives = 166/338 (48%), Gaps = 22/338 (6%)
Query: 36 IDRKIHTPAQKKLYKKHHKIRGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVK 95
I I+T A+K ++++ K R I+++ I R++Y +++ TA ++ L +G+ +K
Sbjct: 86 IPEAINTAAEKTAFEQNCKARNILLSGISRSDYDRVAHLQTAHEIWIFLSNFHQGTNNIK 145
Query: 96 EAKALMLVHQYELFRMKDDESIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSL---- 151
E + + +Y F MK E++++ SRF ++S L+ + SY ++ SKI R
Sbjct: 146 ELRRDLFKTEYIKFEMKPGEALDDYISRFNKILSDLRSVDSSYDANYPQSKIFRHFLNGL 205
Query: 152 -PSRWRPKVTAIEEAKDLNTLSVEDLVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKS 210
S W KVT+I+E+ +++TL+++ L + LK HEM++ + KS + S +
Sbjct: 206 DMSIWEMKVTSIQESVNMSTLTLDSLYTKLKTHEMNIRSRKGDSKSSVLVSSSTFLDVGA 265
Query: 211 SKAYKASESVEESPDGDSDEDQSVKMAMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQKG 270
S + + ++ + D E + +LSN +F + +N K+ +
Sbjct: 266 SSSKSSVLALFNAISDDQLEQFEDDLVLLSN----------RFSRAMKNVRNRKRGEPNR 315
Query: 271 CFNCKKPGHFIADCPDLQKEKFK--GKSKKSSFNSSK-FRKQIKKSLMATW--EDLDSES 325
CF C H + CP L + K + G+ K+ N K +++ KK W ++L
Sbjct: 316 CFECGALNHLRSHCPKLGRGKNEDDGRVKEDDVNKKKNMKEKEKKKHCMQWLVQELIKVL 375
Query: 326 GSDKEEADDDAKAAVGL--VATVSSEAVSEAESDSEDE 361
++E + K V L +A +S AV E++ D+E++
Sbjct: 376 DGSEDEDEGKGKQVVDLDFIARNASSAVDESDDDNEEK 413
>UniRef100_Q7XL83 OSJNBb0014D23.6 protein [Oryza sativa]
Length = 1475
Score = 120 bits (301), Expect = 4e-25
Identities = 120/482 (24%), Positives = 205/482 (41%), Gaps = 85/482 (17%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLDLDEEGAAIDRKIHTPAQKKLYKKHHKIRGIIV 60
WK M + + + +W I++ G AI T + + + + +
Sbjct: 22 WKIKMSTHLKAMSFYIWSIVDVGF----------AITGTPLTEIDHRNLQLNAQAMNALF 71
Query: 61 ASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDESIEEM 120
S+ + E+ ++S+ TA ++ L EG+ + K+AK L QYE F M ES+ +M
Sbjct: 72 NSLSQEEFDRVSNLETAYEIWNKLAEIHEGTSEYKDAKLHFLKIQYETFSMLPHESVNDM 131
Query: 121 YSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSSL 180
Y R +V+ L+ L +Y + K+LR+L ++ VT + + D++ ++ L+ +
Sbjct: 132 YGRLNVIVNDLKGLGANYTDLEVAQKMLRALSEKYETLVTMLINS-DMSRMTPASLLGKI 190
Query: 181 KVHEM----SLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVKM 236
++M E S K IAL ++ + SK + +E +EE +M
Sbjct: 191 NTNDMYKLKKKEMKEASPSKKCIALQAEVEDKSKSKVNEVNEDLEE------------EM 238
Query: 237 AMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQKG-------CFNCKKPGHFIADCPDLQK 289
+L+ K L ++K+ RGS N ++ + CF C + GHF + CP
Sbjct: 239 VLLARKFNDLLGRRKE--RGRGSNSNRRRNRRPNKTLSNLRCFECGEKGHFASKCPSKDD 296
Query: 290 EKFKGKSKKSSFNSSKFRKQIKK------SLMATWEDLDSESGSD--KEEADDDAK---- 337
+ K KKS K K++KK + + W+ + S S +E+ DDA
Sbjct: 297 DGDKSSKKKS--GGYKLMKKLKKEGKKIEAFIGEWDSNEESSASSGFEEQCSDDASSKKK 354
Query: 338 --------------AAVGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSL 383
A + L+A SS+ S ++S+S+D+ + S EL+S+
Sbjct: 355 KMAVVAIKEAPSLFAPLCLMAKGSSKVTSLSDSESDDDCDDV----------SYDELVSM 404
Query: 384 FEHRTNELTDLKEKYVDLMKQQKSTLLELKASEEELKGFNLISATYEDRLKSLCQKLQEK 443
FE EL EK + K K L+ EELK T +RL +KL+E
Sbjct: 405 FE----ELHAYSEKEIVKFKALKKDHASLEVLYEELK-------TSHERLTISHEKLKEA 453
Query: 444 CD 445
D
Sbjct: 454 HD 455
>UniRef100_Q9AY98 Putative copia-type pol polyprotein [Oryza sativa]
Length = 1778
Score = 120 bits (300), Expect = 5e-25
Identities = 123/486 (25%), Positives = 210/486 (42%), Gaps = 76/486 (15%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVD----DLDLDEEGAAIDRKIHTPAQKKLYKKHHKIR 56
WK M ++ L +W ++ G+D D++L E Q++L ++ +
Sbjct: 63 WKHKMKLHLISLHPNIWKVVCTGIDIPHDDMELTSE------------QEQLIHRNAQAS 110
Query: 57 GIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDES 116
I+ ++ E+ K+ AK ++ +L EGS V+EAK +L + F M D E+
Sbjct: 111 NAILFALSPEEFNKIDGLVEAKEIWDTLQLAHEGSPAVREAKIELLEERLGRFVMDDKET 170
Query: 117 IEEMYSRFQTLVSGLQILKKSYVSSDH-VSKILRSLPSRWRPKVTAIEEAKDLNTLSVED 175
+E+Y R LV+ ++ L +++ V K+LR+ R V+ I E KD L+ D
Sbjct: 171 PQEIYDRMMILVNKIKGLGSEDMTNHFVVKKLLRAFGPRNPTLVSMIRERKDFKRLTPSD 230
Query: 176 LVSSLKVHEMSLNE-HETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSV 234
++ + HEM E E + K+ A+ ++Q V
Sbjct: 231 ILGRIVSHEMQEEEAREVRQMVKNAAM---------------------------IKNQEV 263
Query: 235 KMAMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQKG-------CFNCKKPGHFIADCPDL 287
+ ++ ++ + K FL K G Y +K+D KG CFNC + GHFIADCP
Sbjct: 264 ALKAKQEEMAFIVKIFKHFLRKSG-YGKGRKDDDKGKRQSRRACFNCGEYGHFIADCPKS 322
Query: 288 QKEKFKGKSKKSSFNSSKFRKQIKKSLMA-TWEDLDSESGSDK-EEADDDAKAAVGLVAT 345
+ K KG KK + ++ MA W D E K + D G VAT
Sbjct: 323 NEAKAKGGKKKPE------HAHVAEAHMAEVWYSGDEEDPEVKPKPRSKDKVEGEGGVAT 376
Query: 346 VSSEAVSEAE-------SDSEDENEVYS-------KIPRQELVDSLKELLSLFEHRTNEL 391
V+ ++ S ++ SD +D++ YS K+ Q+ + ++ S E NEL
Sbjct: 377 VAFKSSSSSKERLFNNLSDDDDDSYHYSCFMAQGCKVMTQKPSQTSLDVDSSDEESDNEL 436
Query: 392 TDLKEKYVDLMKQQKSTLLE-LKASEEELKGFNLISATYEDRLKSLCQKLQEKCDKGSGN 450
D+ + + Q + L+ L + E+ L+ + + R +L + L +KC K
Sbjct: 437 DDVLKSFSKPAMQHLAKLMRALDSKEQSLERQEELLILEKKRNLTLEESLAKKCAKNEQL 496
Query: 451 KHEIAL 456
+E+ L
Sbjct: 497 ANELNL 502
>UniRef100_Q8LMY8 Putative polyprotein [Oryza sativa]
Length = 1584
Score = 119 bits (298), Expect = 8e-25
Identities = 115/470 (24%), Positives = 206/470 (43%), Gaps = 72/470 (15%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLDLDEEGAAIDRKIHTPAQKKLYKKHHKIRGIIV 60
WK M + + + +W I+ G AI T + + + + +
Sbjct: 22 WKIKMSTHLKAMSFHIWSIVYVGF----------AITGTPLTEIDHRNLQLNAQAMNALF 71
Query: 61 ASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDESIEEM 120
S+ + E+ ++S+ TA ++ L EG+ + K+AK L QYE F M ES+ +M
Sbjct: 72 NSLSQEEFDRVSNLETAYEIWNKLAEIHEGTSEYKDAKLHFLKIQYETFYMLPHESVNDM 131
Query: 121 YSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSSL 180
Y R +V+ L+ L +Y + K+LR+LP ++ VT + + D++ ++ L+ +
Sbjct: 132 YGRLNVIVNDLKGLGANYTDLEVAQKMLRALPEKYETLVTMLINS-DMSRMTPASLLGKI 190
Query: 181 KVHEM----SLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVKM 236
++M E S K IAL ++ + SK + +E +EE +M
Sbjct: 191 NTNDMYKLKKKEMEEASPSKKCIALQAEVEDKSKSKVNEVNEDLEE------------EM 238
Query: 237 AMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQKG-------CFNCKKPGHFIADCPDLQK 289
+L+ + L ++K+ RGS N ++ + CF + GHF + CP
Sbjct: 239 ILLARRFNDLLERRKE--RGRGSNSNRRRNRRLNKTLSNLRCFEYGEKGHFASKCPSKDD 296
Query: 290 EKFKGKSKKSSFNSSKFRKQIKK------SLMATW---EDLDSESGSDKEEADDDAK--- 337
+ K KKS K K++KK + + W E+ + SGS++E DD +
Sbjct: 297 DGDKSSKKKS--GGYKLMKKLKKEGKKIEAFIGEWDSNEESSASSGSEEEGGDDASSKKK 354
Query: 338 --------------AAVGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSL 383
A + L+A SS+ S ++S+S+D+ + + +ELV +EL +
Sbjct: 355 KMVVVAIKEAPSLFAPLCLMAKGSSKVTSLSDSESDDDCD---DVSYEELVSMFEELHAY 411
Query: 384 FEHRTNELTDLKEKYVD---LMKQQKSTLLELKASEEELKGF--NLISAT 428
E + L + Y L ++ K++ L S E+LK NL+S T
Sbjct: 412 SEKEIVKFKALNKDYASLDVLYEELKTSNERLTISHEKLKEAHDNLLSTT 461
>UniRef100_Q8S5D7 Putative gag-pol polyprotein [Oryza sativa]
Length = 1627
Score = 119 bits (298), Expect = 8e-25
Identities = 130/537 (24%), Positives = 230/537 (42%), Gaps = 80/537 (14%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLDLDEEGAAIDRKIHTPAQKKLYKKHHKIRGIIV 60
WK M + + + +W I++ G AI T + + + +
Sbjct: 22 WKIKMSTHLKAMSFHIWSIVDVGF----------AITGTPLTEIDHCNLQLNAQAMNALF 71
Query: 61 ASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDESIEEM 120
S + E+ ++S+ TA ++ L EG+ + K+AK L QYE F M ES+ +M
Sbjct: 72 NSFSQEEFDRVSNLETAYEIWNKLAEIHEGTSEYKDAKLHFLKIQYETFSMLPHESVNDM 131
Query: 121 YSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSSL 180
Y R +V+ L+ L +Y + K+LR+LP ++ VT + + D++ ++ L+ +
Sbjct: 132 YGRLNVIVNDLKGLGATYTDLEVAQKMLRALPKKYETLVTMLINS-DMSRMTPASLLGKI 190
Query: 181 KVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVKMAMLS 240
++M ++ KK A PSK K E +S ++++D ++A+L+
Sbjct: 191 NTNDM----YKLKKKEMEEASPSK----KCIALQAEVEDKGKSKVNEANDDLKEEIALLA 242
Query: 241 NKLEYLARKQKKFLSKRGSYKNFKKEDQKG-------CFNCKKPGHFIADCPDLQKEKFK 293
+ L ++K+ + RGS N ++ + CF C + GHF + CP + K
Sbjct: 243 RRFNDLLGRRKERV--RGSNSNRRRNRRPNKTLSNLRCFECGEKGHFASKCPSKDDDGDK 300
Query: 294 GKSKKSSFNSSKFRKQIKK------SLMATW---EDLDSESGSDKEEADDDAK------- 337
KKS K K++KK + + W E+ + SGS++E DD +
Sbjct: 301 SSKKKS--GGYKLMKKLKKEGKKIEAFIGEWDSNEESSASSGSEEEGGDDASSKKKKMAV 358
Query: 338 ----------AAVGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSLFEHR 387
A + L+A SS+ S ++S+S+D+ + S EL+S+FE
Sbjct: 359 VVIKEAPTLFAPLCLMAKGSSKVTSLSDSESDDDCDDV----------SYDELVSMFE-- 406
Query: 388 TNELTDLKEKYVDLMKQQKSTLLELKASEEELKGFNLISATYEDRLKSLCQKLQEKCD-- 445
EL EK + K K L+ EELK + +L S CD
Sbjct: 407 --ELHAYSEKEIVKFKALKMDHASLEVLYEELKTSHERLTISHKKLNSSSSSCVSICDAS 464
Query: 446 ---KGSGNKHEIA-LDDFIMAGIDRSKVASMIYSTYK---NKGKGIGYSEEKSKEYS 495
+ + K ++A L+ + + I S + NK +G+G+ +KSK+ S
Sbjct: 465 LVVENNELKEQVAKLNKSLERCFNGKNTLDKILSEQRCILNK-EGLGFIPKKSKKPS 520
>UniRef100_Q7XTX2 OSJNBa0019K04.26 protein [Oryza sativa]
Length = 559
Score = 118 bits (295), Expect = 2e-24
Identities = 110/447 (24%), Positives = 203/447 (44%), Gaps = 48/447 (10%)
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLDLDEEGAAIDRKIHTPAQKKLYKKHHKIRGIIV 60
WK M ++ L +W ++ G+D + D E T Q++L ++ + I+
Sbjct: 62 WKHKMKLHLISLHPSIWKVVCTGIDVPNDDME--------LTSEQEQLIHRNAQASNTIL 113
Query: 61 ASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDESIEEM 120
+++ E+ K AK ++ +L EGS V+E K +L + F M D E+ +EM
Sbjct: 114 SALSPKEFNKFDGLEEAKEIWDTLQLAHEGSPAVREDKIELLEGRLGRFVMDDKETPQEM 173
Query: 121 YSRFQTLVSGLQILKKSYVSSDHVSK-ILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSS 179
Y R LV+ ++ +++ V K +LR+ R V+ I E KD L+ D++
Sbjct: 174 YDRMMILVNKIKGFGSEDMTNHFVVKRLLRAFGPRNPTLVSMIRERKDFKRLTPSDILGR 233
Query: 180 LKVHEMSLNE-HETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVKMAM 238
+ HEM E E + K+ A+ + + ++K + S E
Sbjct: 234 IVSHEMQEEEAREVRQMVKNAAMIKNQEVALNAKQEEESSCEE----------------- 276
Query: 239 LSNKLEYLARKQKKFLSKRGSYKNFKKEDQKGCFNCKKPGHFIADCPDLQKEKFKGKSKK 298
K++ L+ ++ + ++ K K++ ++ CFNC + GHFIADCP + K KG KK
Sbjct: 277 ---KIQALSSQEWLWQGRKDDDKG-KRQSERACFNCGEYGHFIADCPKTNEAKAKGDKKK 332
Query: 299 SSFNSSKFRKQIKKSLMATWEDLDSESGSDK-EEADDDAKAAVGLVATVSSEAVSEAE-- 355
+ +K ++ W D E+ DK + D G +ATV+ ++ S ++
Sbjct: 333 -----PERAHVVKAHMVEVWYSEDEEASEDKPKPKPKDKIEGEGGIATVAFKSSSSSKEC 387
Query: 356 -----SDSEDENEVYSKIPRQ--ELVDSLKELLSLFEHRTNELTDLKEKYVDLMKQQKST 408
SD +D++ YS Q +++ SL + ++E T L++ V+ Q+K
Sbjct: 388 LFNNLSDDDDDSYHYSCFMAQGRKVMTQKPSHTSLDDDSSDEETSLRD--VNETLQEKFA 445
Query: 409 LLELKASEEELKGFNLISATYEDRLKS 435
+L+ + E++ L S+T + + S
Sbjct: 446 ILDKSHKDLEVQLDTLWSSTSQPNVVS 472
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.357 0.156 0.559
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,204,897,609
Number of Sequences: 2790947
Number of extensions: 81036558
Number of successful extensions: 727156
Number of sequences better than 10.0: 2884
Number of HSP's better than 10.0 without gapping: 267
Number of HSP's successfully gapped in prelim test: 2832
Number of HSP's that attempted gapping in prelim test: 709807
Number of HSP's gapped (non-prelim): 12631
length of query: 1664
length of database: 848,049,833
effective HSP length: 141
effective length of query: 1523
effective length of database: 454,526,306
effective search space: 692243564038
effective search space used: 692243564038
T: 11
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.7 bits)
S2: 82 (36.2 bits)
Medicago: description of AC147002.8