
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC135504.5 - phase: 0 /pseudo
(1283 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultiv... 400 e-109
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi... 390 e-106
gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cult... 387 e-105
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-... 382 e-104
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi... 380 e-103
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-... 378 e-103
ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sa... 377 e-102
emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana] ... 370 e-100
gb|AAP53029.1| putative retrotransposon-related protein [Oryza s... 368 e-100
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi... 367 1e-99
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu... 360 2e-97
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi... 359 3e-97
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu... 359 3e-97
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi... 353 2e-95
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1... 350 2e-94
gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cult... 350 2e-94
gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis ... 347 1e-93
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas] 345 4e-93
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense] 328 6e-88
emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir|... 328 9e-88
>ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultivar-group)]
gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza
sativa (japonica cultivar-group)]
Length = 1181
Score = 400 bits (1027), Expect = e-109
Identities = 228/589 (38%), Positives = 349/589 (58%), Gaps = 24/589 (4%)
Query: 5 KWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVSAII 64
K+D+ F LW+VKMRA+L Q++ +AL + + + EK + + KA+S I
Sbjct: 5 KYDLPLLDRDTRFSLWQVKMRAVLAQQELDDALSGFDKRTQDWSNDEK-KRDRKAMSYIH 63
Query: 65 LCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLTE 124
L L + +L+EV +E TA +W KL+ + MTK L + LKQ+L+ +++ + +M+ L+
Sbjct: 64 LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSA 123
Query: 125 FNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITKE 184
F +I+ DL +++V +++D AL LLC+LP S+ NF+DT+LY ++ T+TL+EV L KE
Sbjct: 124 FKEIVADLESMEVKYDEKDLALILLCSLPSSYANFRDTILYSRD-TLTLKEVYDALHAKE 182
Query: 185 LTKFKDLKVDDSG---EGLNVSRGRNQNRGKGKGKNSKSKSRSKGDG-NKTKYK-CFICH 239
K K + + S EGL V RG Q + KS S +G ++ +YK C C
Sbjct: 183 KMK-KMVPSEGSNSQAEGL-VVRGSQQEKNTNNKSRDKSSSSYRGRSKSRGRYKSCKYCK 240
Query: 240 NPGHFKKDC--PERKDNGGGNPSVQLASKDEGC------ESAGALTVTSW----EPEKGW 287
GH C + KD G + ++EG E + A + ++ + W
Sbjct: 241 RDGHDISKCWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTSDQW 300
Query: 288 VLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVR 347
+LD+ C+YH+ P + +F T E+ +GG V +G++ C++ GIGT+++KMFD L DVR
Sbjct: 301 ILDTACTYHMCPNRDWFATYEVVQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVR 360
Query: 348 YIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHA 406
+IP L+R+LIS+ D GY G+++++ G+LV+ K S K LY L+G+TI+ +
Sbjct: 361 HIPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKASIKSANLYHLQGTTILGNV 420
Query: 407 SV--PSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVK 464
+ S+ D T LWH+RLGH+SE GL EL+K+GLL + ++KL FC++C GK +VK
Sbjct: 421 ATVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRGLLDGQSISKLKFCEHCIFGKHKRVK 480
Query: 465 FGVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKF 524
F H + + VHSDL GPA ++GG Y +I+DDYSR+VW Y LK+K AF F
Sbjct: 481 FNTSTHTTEGILDYVHSDLWGPARKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFNVF 540
Query: 525 KEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
KEW +VE Q K+K+LRTDNG EF + F +C+ +GI RH V +T
Sbjct: 541 KEWKTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHT 589
Score = 99.0 bits (245), Expect = 9e-19
Identities = 43/91 (47%), Positives = 65/91 (71%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
EAIWL+G+ E+ C+ I CDSQSAI L Q++HERTKHI++R HF+R +I ++
Sbjct: 1087 EAIWLRGLYTELCGVTSCINIFCDSQSAICLTKDQMFHERTKHIDVRYHFIRGVIAEGDV 1146
Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
KV K+++ +NPAD+ TK +P ++F+ C L+
Sbjct: 1147 KVCKISTHDNPADMMTKPVPATKFELCSSLV 1177
>emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana]
gi|11278366|pir||T47492 copia-like polyprotein -
Arabidopsis thaliana
Length = 1363
Score = 390 bits (1003), Expect = e-106
Identities = 233/611 (38%), Positives = 346/611 (56%), Gaps = 51/611 (8%)
Query: 3 GSKWDIEKFTGSNLFGLWKVKM---------RAILIQEKCVEALKREAQMSAHLTPAEKT 53
G++ ++EKF G + +WK K+ A+L + + +R+++ S E+
Sbjct: 3 GARIEVEKFDGRGDYTMWKEKLLAHIDMLGLSAVLRESETPMGKERDSEKSDEDEKEERE 62
Query: 54 EMND------KAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQL 107
+M KA S I+L + D++LR++ +ET+A +M LD LYM+K+L +R LKQ+L
Sbjct: 63 KMEAFEEKKRKARSTIVLSVSDRVLRKIKKETSAAAMLEALDRLYMSKALPNRIYLKQKL 122
Query: 108 YFYRMMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGK 167
Y ++M E+ I + EF I+ DL N++V + DED+A+ LL +LP+ F+ KDT+ Y
Sbjct: 123 YSFKMSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLKDTLKYSS 182
Query: 168 EGTI-TLEEVQATLITKELTKFKDLKVDDSG--EGLNV-----SRGRNQNRGKGKGKNSK 219
T+ +L+EV A + ++EL +F +K G EGL V +RGR++ + KGKGK SK
Sbjct: 183 GKTVLSLDEVAAAIYSREL-EFGSVKKSIKGQAEGLYVKDKAENRGRSEQKDKGKGKRSK 241
Query: 220 SKSRSKGDGNKTKYKCFICHNPGHFKKDCPERKD----NGGGNPSVQLASKD---EGC-- 270
SKS K C+IC GH K CP + N G N K EG
Sbjct: 242 SKS---------KRGCWICGEDGHLKSTCPNKNKPQFKNQGSNKGESSGGKGNLVEGSVN 292
Query: 271 --ESAG-----ALTVTSWEPEKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKAC 323
ESAG AL+ T E W++D+GC YH++ ++ + E + E GG VR+GN
Sbjct: 293 FVESAGMFVSEALSSTDIHLEDEWIMDTGCIYHMTHKREWLEDFDEEAGGSVRMGNKSIS 352
Query: 324 KIQGIGTIRLKMFDDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGAL 383
+++G+GT+R+ + L++VRYIP++ RNL+S+ F+ G+ E G++RI G
Sbjct: 353 RVKGVGTVRIVNDNGLTVTLQNVRYIPDMDRNLLSLGTFEKAGHKFESENGMLRIKSGNQ 412
Query: 384 VIAKGSKIHGLYILEGSTIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGN 443
V+ +G + LYIL G S+ D T LWH RL H+S++ + L K+G L
Sbjct: 413 VLLEGRRYDTLYILHGKPA-TDESLAVARANDDTVLWHRRLCHMSQKNMSLLIKKGFLDK 471
Query: 444 EKLNKLDFCDNCTLGKQHKVKFGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSII 502
+K++ LD C++C G+ K+ F + H + + E VHSDL G P + G YF S I
Sbjct: 472 KKVSMLDTCEDCIYGRAKKIGFNLAQHDTKKKLEYVHSDLWGAPTVPMSLGNCQYFISFI 531
Query: 503 DDYSRRVWVYILKNKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKK 562
DDY+R+VWVY LK K +AFEKF W LVENQ G ++K LRTDNG EF F+ FC +K
Sbjct: 532 DDYTRKVWVYFLKTKDEAFEKFVSWISLVENQSGERVKTLRTDNGLEFCNRMFDGFCEEK 591
Query: 563 GIKRHRIVAYT 573
G +RHR AYT
Sbjct: 592 GFQRHRTCAYT 602
Score = 108 bits (271), Expect = 9e-22
Identities = 48/102 (47%), Positives = 75/102 (73%)
Query: 1179 CVYIVWYNSNLEAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLH 1238
C Y+ + EAIWLKG++ + G Q V+I CDSQSAI L+ + ++HERTKHI+++ H
Sbjct: 1257 CEYMSLTEAVKEAIWLKGLLKDFGYEQKNVEIFCDSQSAIALSKNNVHHERTKHIDVKFH 1316
Query: 1239 FVRDMIETKEIKVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
F+R++I +++V K+++E+NPADIFTK LP ++F+ LD +
Sbjct: 1317 FIREIIADGKVEVSKISTEKNPADIFTKVLPVNKFQTALDFL 1358
>gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37535452|ref|NP_922028.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|22094359|gb|AAM91886.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1280
Score = 387 bits (994), Expect = e-105
Identities = 224/589 (38%), Positives = 335/589 (56%), Gaps = 24/589 (4%)
Query: 5 KWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVSAII 64
K+D+ F LW+VKMRA+L Q+ +AL + + + EK + + KA+S I
Sbjct: 40 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEKKK-DRKAMSYIH 98
Query: 65 LCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLTE 124
L L + +L+EV +E TA +W KL+ + MTK L + LKQ+L+ +++ + +M+ L+
Sbjct: 99 LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLST 158
Query: 125 FNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITKE 184
F +I+ DL +I+V ++ED L LLC+LP S+ NF+DT+LY + T+ L+EV L KE
Sbjct: 159 FKEIVADLESIEVKYDEEDLGLILLCSLPSSYANFRDTILYSHD-TLILKEVYDALHAKE 217
Query: 185 LTKFKDLKVDDSG---EGLNVSRGRNQNRGKGKGKNSKSKSRSKGDG-NKTKYK-CFICH 239
K K + + S EGL V RGR Q + KS S +G ++ +YK C C
Sbjct: 218 KMK-KMVPSEGSNSQAEGL-VVRGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKSCKYCK 275
Query: 240 NPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPE------------KGW 287
GH +C + +D K E A +T + E W
Sbjct: 276 RDGHDISECWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDTELLVAYAGCAQTSDQW 335
Query: 288 VLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVR 347
+LD+ +YH+ P + +F T E +GG V +G++ C++ GIGT+++KMFD L DVR
Sbjct: 336 ILDTAWTYHMCPNRDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGYIRTLSDVR 395
Query: 348 YIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHA 406
+IP L+R+LIS+ D GY G+++++ G+LV+ K K LY L G+TI+ +
Sbjct: 396 HIPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTILGNV 455
Query: 407 SV--PSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVK 464
+ S+ D T LWH+RLGH+SE GL EL+K+ LL + + KL FC++C GK +VK
Sbjct: 456 AAVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRELLDGQSIGKLKFCEHCIFGKHKRVK 515
Query: 465 FGVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKF 524
F H + + VHSDL GPA ++GG Y +I+DDYSR+VW Y LK+K AF+ F
Sbjct: 516 FNTSTHTTEGILDYVHSDLWGPACKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDVF 575
Query: 525 KEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
KEW +VE Q K+K+LRTDNG EF + F +C+ +GI H V +T
Sbjct: 576 KEWKTMVERQTEKKVKILRTDNGMEFCSKIFKSYCKSEGIVHHYTVPHT 624
Score = 93.2 bits (230), Expect = 5e-17
Identities = 41/91 (45%), Positives = 63/91 (69%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
EAIWL+G+ E+ C+ I CDSQSAI L Q++HERTKHI++R H +R +I ++
Sbjct: 1186 EAIWLRGLYTELCGVTSCINIFCDSQSAICLTKDQMFHERTKHIDVRYHIIRGVIVEGDV 1245
Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
KV K+++ +NPAD+ TK + ++F+ C L+
Sbjct: 1246 KVCKISTHDNPADMMTKPVSATKFELCSSLV 1276
>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
sativa (japonica cultivar-group)]
Length = 2340
Score = 382 bits (980), Expect = e-104
Identities = 219/588 (37%), Positives = 333/588 (56%), Gaps = 22/588 (3%)
Query: 5 KWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVSAII 64
K+D+ F LW+VKMRA+L Q+ +AL + + + EK + + KA+S I
Sbjct: 212 KYDLPLLYRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTHDWSNDEK-KRDRKAMSYIH 270
Query: 65 LCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLTE 124
L L + +L+EV +E A +W KL+ + MTK L + LKQ L+ +++ + +M+ L+
Sbjct: 271 LHLSNNILQEVLKEEIAAGLWLKLEQICMTKDLTSKMHLKQTLFLHKLQDDGSVMDHLSA 330
Query: 125 FNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITKE 184
F +II DL +++V ++ED L LLC+LP S+ NF+DT+LY ++ T+TL+EV L KE
Sbjct: 331 FKEIIADLESMEVKYDEEDLGLILLCSLPSSYANFRDTILYSRD-TLTLKEVYDALHVKE 389
Query: 185 LTKFKDLKVDDSG---EGLNVSRGRNQNRGKGKGKNSKSKSRSKGDGNKTKYK-CFICHN 240
K K + + S EGL V + + K + ++ S S ++ +YK C C
Sbjct: 390 KMK-KMVPSEGSNSQAEGLIVWGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKSCKYCKR 448
Query: 241 PGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPE------------KGWV 288
GH +C + D K E A +T + E W+
Sbjct: 449 DGHDIFECWKLHDKDKRTGKYVPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTSDQWI 508
Query: 289 LDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRY 348
L++ C YH+ P + +F T E + G V +G++ C++ GIGT+++KMFD L DVR+
Sbjct: 509 LNTACIYHMCPNRDWFATYEAVQVGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVRH 568
Query: 349 IPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHAS 407
IP L+R+LIS+ D GY G+++++ G+LV+ K K LY L G+TI+ + +
Sbjct: 569 IPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTILGNVA 628
Query: 408 V--PSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKF 465
S+ D T LWH+RLGH++E GL EL+K+GLL + + KL FC++C GK +VKF
Sbjct: 629 AVSDSLSNSDATNLWHMRLGHMTEIGLAELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKF 688
Query: 466 GVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFK 525
H + + VHSDL GPA ++GG Y +I+DDYSR+VW Y LK+K AF+ FK
Sbjct: 689 NTSTHTTEGILDYVHSDLWGPARKTSFGGTRYMMTIVDDYSRKVWPYFLKHKYQAFDVFK 748
Query: 526 EWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
EW +VE Q K+K+LRTDNG EF + F +C+ +GI RH V +T
Sbjct: 749 EWKTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHT 796
Score = 96.3 bits (238), Expect = 6e-18
Identities = 41/91 (45%), Positives = 63/91 (69%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
EAIWL+G+ + C+ I CDSQSAI L Q++HERTKHI++R HF+R +I ++
Sbjct: 1445 EAIWLRGLYTVLCAVTSCINIFCDSQSAICLTKDQMFHERTKHIDVRYHFIRGLIAEGDV 1504
Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
K+ K++ +NPAD+ TK +P ++F+ C L+
Sbjct: 1505 KICKISIHDNPADMMTKPVPATKFELCSSLV 1535
>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301696|pir||F84486 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1356
Score = 380 bits (977), Expect = e-103
Identities = 224/599 (37%), Positives = 333/599 (55%), Gaps = 42/599 (7%)
Query: 7 DIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKT------------- 53
++EKF G + +WK K+ A + ALK + +++
Sbjct: 7 EVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEKLEKFEA 66
Query: 54 --EMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYR 111
E KA SAI+L + D++LR++ +E+TA +M LD LYM+K+L +R KQ+LY ++
Sbjct: 67 LEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQKLYSFK 126
Query: 112 MMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGK-EGT 170
M E+ + + EF +II DL N++V + DED+A+ LL ALP++F+ KDT+ Y +
Sbjct: 127 MSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKYSSGKSI 186
Query: 171 ITLEEVQATLITKEL---TKFKDLKVDDSG---EGLNVSRGRNQNRGKGKGKNSKSKSRS 224
+TL+EV A + +KEL + K +KV G + N ++G+ + +GKGKGK KSK
Sbjct: 187 LTLDEVAAAIYSKELELGSVKKSIKVQAEGLYVKDKNENKGKGEQKGKGKGKKGKSKK-- 244
Query: 225 KGDGNKTKYKCFICHNPGHFKKDCPERKDNGGGNPSVQLASKDEG----CESAG-----A 275
K C+ C GHF+ CP + V G E+AG A
Sbjct: 245 -------KPGCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKGNLAEAAGYYVSEA 297
Query: 276 LTVTSWEPEKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKM 335
L+ T E W+LD+GCSYH++ ++ +F + GG VR+GN +++G+GTIR+K
Sbjct: 298 LSSTEVHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKTVSRVRGVGTIRVKN 357
Query: 336 FDDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLY 395
D +L +VRYIP++ RNL+S+ F+ GY E G++RI G V+ G + LY
Sbjct: 358 SDGLTIVLTNVRYIPDMDRNLLSLGTFEKAGYKFESEDGILRIKAGNQVLLTGRRYDTLY 417
Query: 396 ILEGSTIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNC 455
+L +A S+ V D T LWH RL H+S++ + L ++G L +K++ LD C++C
Sbjct: 418 LLNWKP-VASESLAVVKRADDTVLWHQRLCHMSQKNMEILVRKGFLDKKKVSSLDVCEDC 476
Query: 456 TLGKQHKVKFGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSIIDDYSRRVWVYIL 514
GK + F + H + E +HSDL G P + G YF SIIDD++R+VWVY +
Sbjct: 477 IYGKAKRKSFSLAHHDTKEKLEYIHSDLWGAPFVPLSLGKCQYFMSIIDDFTRKVWVYFM 536
Query: 515 KNKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
K K +AFEKF EW LVENQ ++K LRTDNG EF + F+ FC GI RHR AYT
Sbjct: 537 KTKDEAFEKFVEWVNLVENQTDRRVKTLRTDNGLEFCNKLFDGFCESIGIHRHRTCAYT 595
Score = 103 bits (257), Expect = 4e-20
Identities = 46/91 (50%), Positives = 70/91 (76%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
EAIWLKG++ + G Q V+I CDSQSAI L+ + ++HERTKHI+++ HF+R++I +
Sbjct: 1261 EAIWLKGLLKDFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVKYHFIREIISDGTV 1320
Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
+V K+++E+NPADIFTK L S+F+ L+L+
Sbjct: 1321 EVLKISTEKNPADIFTKVLAVSKFQAALNLL 1351
>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
sativa (japonica cultivar-group)]
Length = 1373
Score = 378 bits (971), Expect = e-103
Identities = 226/586 (38%), Positives = 332/586 (56%), Gaps = 23/586 (3%)
Query: 5 KWDIEKFTGSNLFGLWKVKMRAILIQEKCV-EALKREAQMSAHLTPAEKTEMNDKAVSAI 63
K+D+ F LW+VKMR IL Q EAL + A T AE+ + KA++ I
Sbjct: 2 KFDLPLLNYDTRFSLWQVKMRGILAQTHDYDEALDNFGKRRAEWT-AEEIRKDQKALALI 60
Query: 64 ILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLT 123
L L + +L+E E T+ +W KL+ + M+K L + +K +L+ +M E ++ +
Sbjct: 61 QLHLHNDILQECLTEKTSAELWLKLESICMSKDLTSKMQMKMKLFTLKMKEEDSVITHMA 120
Query: 124 EFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITK 183
EF KI+ DL +++V +DED L LLC+LP S+ NF+DT+L ++ +TL+EV L K
Sbjct: 121 EFKKIVADLVSMEVKYDDEDLGLLLLCSLPNSYANFRDTILLSRD-ELTLKEVYDALQNK 179
Query: 184 ELTKF---KDLKVDDSGEGLNVSRGRNQNRGKG-KGKNSKSKSRSKGDGNKTKYKCFICH 239
E K D GE L+V RGR +NR K + + +S+SK GNK K+ C C
Sbjct: 180 EKMKIMVQNDGSSSSKGEALHV-RGRTENRTSNEKNYDRRGRSKSKPPGNK-KF-CVYCK 236
Query: 240 NPGHFKKDCP-----ERKDNGGGNPSVQLASKDEGCESAGALTVTSW--EPEKGWVLDSG 292
H +C ERK+ G SV A+ + +S L V + W+LDS
Sbjct: 237 LKNHNIDECKKVQAKERKNKKDGKVSVASAAASDD-DSGDCLVVFAGCVAGHDEWILDSA 295
Query: 293 CSYHISPRKGYFETLE-LEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYIPE 351
CS+HI ++ +F + + +++G VVR+G++ C I GIG++++K D LK+VRYIP
Sbjct: 296 CSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYIPG 355
Query: 352 LRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHG-LYILEGSTIIAHASVPS 410
+ RNLIS+S D GY GV+++S G+LV KG LY+L G T+ S +
Sbjct: 356 MSRNLISLSTLDAEGYKYSGSDGVLKVSKGSLVCLKGDVNSAKLYVLRGCTLTGSDSAAA 415
Query: 411 VDTLDI---TKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGV 467
T D T LWH+RLGH+S G+ EL K+ LL +K+ FC++C GK +V+F
Sbjct: 416 AITNDEPSKTNLWHMRLGHMSHLGMTELMKRNLLKGCTSSKIKFCEHCIFGKHKRVQFNT 475
Query: 468 GVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEW 527
VH + + VH+DL GP+ + GG Y +IIDDYSR+VW Y LK+K D F FK W
Sbjct: 476 SVHTTKGTLDYVHADLWGPSKKPSLGGARYMLTIIDDYSRKVWPYFLKHKDDTFTAFKNW 535
Query: 528 DILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
+++E Q K+K+LRTDNG EF FN++CR++GI RH + +T
Sbjct: 536 KVMIERQTERKVKLLRTDNGGEFCSHAFNDYCRQEGIVRHHTIPHT 581
Score = 78.6 bits (192), Expect = 1e-12
Identities = 32/56 (57%), Positives = 44/56 (78%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIE 1245
E IWLKG+ E+ + C+ +HCDS+SAI+L Q++HERTKHI+I+ HFVRD+IE
Sbjct: 1239 ELIWLKGLYAELSGVESCISLHCDSESAIYLTKDQMFHERTKHIDIKYHFVRDVIE 1294
>ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sativa]
gi|14029020|gb|AAK52561.1| Putative retroelement pol
polyprotein [Oryza sativa]
Length = 1326
Score = 377 bits (969), Expect = e-102
Identities = 228/590 (38%), Positives = 336/590 (56%), Gaps = 33/590 (5%)
Query: 5 KWDIEKFTGSNLFGLWKVKMRAILIQEKCV-EALKREAQMSAHLTPAEKTEMNDKAVSAI 63
K+D+ F LW+VKMRAIL Q + EAL+ + + AE+ + KA+ I
Sbjct: 5 KYDLPLLDYKTRFSLWQVKMRAILAQTSDLDEALESFGKKKSTEWTAEEKRKDRKALLLI 64
Query: 64 ILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLT 123
L L + +L+EV +E TA +W KL+ + M+K L + +K +L+ +++ ES ++ ++
Sbjct: 65 QLHLSNDILQEVLQEKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHIS 124
Query: 124 EFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITK 183
F +I+ DL +I+V +DED L LLC+LP S+ NF+DT+L ++ +TL EV L +
Sbjct: 125 VFKEIVVDLVSIEVQFDDEDLGLLLLCSLPSSYANFRDTILLSRD-ELTLAEVYEALQNR 183
Query: 184 ELTKFKDLKVDDS----GEGLNVSRGRNQNRGKGKGKN---SKSKSRSKGDGNKTKYKCF 236
E K K + D+ GE L V RGR++ R + S+S+ RSK G K C
Sbjct: 184 E--KMKGMVQSDASSSKGEALQV-RGRSEQRTYNDSSDRDKSQSRGRSKSRGKKF---CK 237
Query: 237 ICHNPGHFKKDC------PERKDNGGGNPSVQLASKDEG-CESAGALTVTSWEPEKGWVL 289
C HF ++C +RK +G + + D G C A V S + W+L
Sbjct: 238 YCKKKNHFIEECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGCVASHDE---WIL 294
Query: 290 DSGCSYHISPRKGYFETLE-LEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRY 348
D+ CS+HI + +F + + ++ G VVR+G++ +I GIG++++K D LKDVR+
Sbjct: 295 DTACSFHICINRDWFSSYKSVQNGDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRH 354
Query: 349 IPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHAS 407
IP + RNLIS+S D GY GV+++S G+LV G LY+L GST+ H S
Sbjct: 355 IPGMARNLISLSTLDAEGYKYSSSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTL--HGS 412
Query: 408 VP----SVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKV 463
V S D T LWH+RLGH+SE G+ EL K+ LL K+ FC++C GK +V
Sbjct: 413 VTAAAVSKDEPIKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGKMKFCEHCVFGKHKRV 472
Query: 464 KFGVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEK 523
KF VH++ + VH+DL GP+ GG Y +IIDDYSR+VW Y LK+K D F
Sbjct: 473 KFNTSVHRTKGILDYVHTDLWGPSRKAYLGGARYMLTIIDDYSRKVWPYFLKHKDDTFAA 532
Query: 524 FKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
FKEW + +E Q ++KVLRTDNG EF + F+++CRK+GI RH + YT
Sbjct: 533 FKEWKVRIERQTEKEVKVLRTDNGGEFCSDAFDDYCRKEGIVRHHTIPYT 582
Score = 64.3 bits (155), Expect = 3e-08
Identities = 25/55 (45%), Positives = 39/55 (70%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMI 1244
E++WLKG+ E+ C+ + CDSQSAI L ++HER+KHI+I+ H+V D++
Sbjct: 1097 ESVWLKGLFAELCGVDSCINLFCDSQSAICLTKDHMFHERSKHIDIKYHYVHDVV 1151
>emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana]
gi|4539406|emb|CAB40039.1| putative retrotransposon
[Arabidopsis thaliana] gi|7444416|pir||T04181
hypothetical protein F7L13.40 - Arabidopsis thaliana
Length = 1230
Score = 370 bits (951), Expect = e-100
Identities = 222/598 (37%), Positives = 323/598 (53%), Gaps = 56/598 (9%)
Query: 7 DIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEK-------------T 53
++EKF G + LWK K+ A + AL+ +S L E+
Sbjct: 7 EMEKFDGHGDYTLWKEKLMAHMDLLGLTVALRETQSVSDPLESEEEGKESEKGDKEALME 66
Query: 54 EMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMM 113
E KA S I+L + D++LR+ +E TA SM LD LYM+K+L +R LKQ+LY Y+M
Sbjct: 67 EKRQKARSTIVLSVSDQVLRKSKKEKTAPSMLEALDKLYMSKALPNRIYLKQKLYSYKMQ 126
Query: 114 ESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGK-EGTIT 172
E+ + + EF ++I DL N +V + DED+A+ LL +LP+ F+ KDT+ YG T++
Sbjct: 127 ENLSVEGNIDEFLRLIADLENTNVLVSDEDQAILLLMSLPKQFDQLKDTLKYGSGRTTLS 186
Query: 173 LEEVQATLITKELTKFKDLK-VDDSGEGLNVS---RGRNQNRGKGKGKNSKSKSRSKGDG 228
++EV A + +KEL + K + EGL V R + K KG +S+SRSKG
Sbjct: 187 VDEVVAAIYSKELELGSNKKSIRGQAEGLYVKDKPETRGMSEQKEKGNKGRSRSRSKGWK 246
Query: 229 NKTKYKCFICHNPGHFKKDCPER-------KDNGGGNPSVQLASKDEGCESAG-----AL 276
C+IC GHFK CP + KD G+ K E +G AL
Sbjct: 247 G-----CWICGEEGHFKTSCPNKGKQQNKGKDQASGSKGEAATIKGNTSEGSGYYVSEAL 301
Query: 277 TVTSWEPEKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMF 336
T WV+D+GC+YH++ +K +FE L + GG VR+GN K +
Sbjct: 302 HSTDVNLGNEWVMDTGCNYHMTHKKEWFEELSEDAGGTVRMGNKSTSKFR---------- 351
Query: 337 DDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLYI 396
V+YIP++ RNL+S+ + GY + GV+ + G + GS+ LY+
Sbjct: 352 ---------VKYIPDMDRNLLSMGTLEEHGYSFESKNGVLVVKEGTRTLLIGSRHEKLYL 402
Query: 397 LEGSTIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCT 456
L+G ++H S+ D T LWH RLGH+S++ + L K+G L +K++KL+ C++C
Sbjct: 403 LQGKPEVSH-SMTVERRNDDTVLWHRRLGHISQKNMDILVKKGYLDGKKVSKLELCEDCI 461
Query: 457 LGKQHKVKFGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSIIDDYSRRVWVYILK 515
GK ++ F V H + VHSDL G P+ + G YF S ID YSR+ WVY LK
Sbjct: 462 YGKARRLSFVVATHNTEDKLNYVHSDLWGAPSVPLSLGKCQYFISFIDVYSRKTWVYFLK 521
Query: 516 NKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
+K +AF F EW ++VENQ G K+K+LR DNG EF +QFN+FC++KGI RH+ AYT
Sbjct: 522 HKDEAFGTFAEWSVMVENQTGRKIKILRIDNGLEFCNQQFNDFCKEKGIVRHQTCAYT 579
Score = 93.2 bits (230), Expect = 5e-17
Identities = 43/88 (48%), Positives = 64/88 (71%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
EAIWLKG++ + G Q V+I CDSQSAI L+ + ++H+RTKHI+I+ H +R++I +
Sbjct: 1135 EAIWLKGLLQDFGYEQKTVEIFCDSQSAIALSKNNVHHDRTKHIDIKYHKIREVIADGVV 1194
Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCL 1277
+V+K+ + N ADIFTK +P S+FK L
Sbjct: 1195 EVKKICTLVNSADIFTKVVPVSKFKTAL 1222
>gb|AAP53029.1| putative retrotransposon-related protein [Oryza sativa (japonica
cultivar-group)] gi|37532880|ref|NP_920742.1| putative
retrotransposon-related protein [Oryza sativa (japonica
cultivar-group)] gi|22655747|gb|AAN04164.1| Putative
retrotransposon protein [Oryza sativa (japonica
cultivar-group)] gi|16905223|gb|AAL31093.1| putative
retrotransposon-related protein [Oryza sativa]
Length = 1229
Score = 368 bits (945), Expect = e-100
Identities = 218/590 (36%), Positives = 333/590 (55%), Gaps = 23/590 (3%)
Query: 5 KWDIEKFTGSNLFGLWKVKMRAILIQEKCV-EALKREAQMSAHLTPAEKTEMNDKAVSAI 63
K+D+ F LW+VKMRA+L Q + EAL+ + AE+ + KA+S I
Sbjct: 2 KYDLPLQDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 61
Query: 64 ILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLT 123
L L + +L++V +E TA +W KL+ + M+K L + +K +L+ +++ ES ++ ++
Sbjct: 62 QLHLSNDILQKVLQEKTAAELWFKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHIS 121
Query: 124 EFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITK 183
F +II DL +++V +DED L LLC+LP + NF+DT+L ++ +TL EV L +
Sbjct: 122 VFKEIIADLVSMEVQFDDEDLGLLLLCSLPSLYANFRDTILLSRD-ELTLAEVYEALQNR 180
Query: 184 ELTKFKDLKVDDS----GEGLNVSRGRNQNRGKGKGKN---SKSKSRSKGDGNKTKYKCF 236
E K K + D+ G+ L V RGR++ R + S+S+ RSK G K C
Sbjct: 181 E--KMKGMVQSDASSSKGKALQV-RGRSEQRTYNDSNDRDKSQSRGRSKSRGKKF---CK 234
Query: 237 ICHNPGHFKKDC--PERKDNGGGNPSVQLASKDEGCESAGALTVTSW--EPEKGWVLDSG 292
C HF ++C + K+ + + + E +SA L + W+LD+
Sbjct: 235 YCKKKNHFIEECWKLQNKEKRKSDGKASVVTSAENSDSADCLVFFAGCVASHDEWILDTA 294
Query: 293 CSYHISPRKGYFETLE-LEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYIPE 351
C + I + +F + + ++ G VVR+G+N +I GIG++++K D LKDVR+IP
Sbjct: 295 CLFLICINRDWFSSHKSVQNGDVVRMGDNNPREIMGIGSVQIKTHDGMTRTLKDVRHIPG 354
Query: 352 LRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAH--ASV 408
+ RNLIS+S D GY GV+++S G+LV G LY+L GST+ A+
Sbjct: 355 MARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSLTAAA 414
Query: 409 PSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGVG 468
S D T LWH+RLGH+SE G+ EL K+ LL + FC++C GK +VKF
Sbjct: 415 VSKDEPSKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNTS 474
Query: 469 VHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEWD 528
VH++ + VH+DL GP+ + GG Y +IIDDYSR+VW Y LK+K D F FKEW
Sbjct: 475 VHRTKGILDYVHADLWGPSRKPSLGGACYMLTIIDDYSRKVWPYFLKHKDDTFAAFKEWK 534
Query: 529 ILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYTSTERS 578
+++E Q ++KVLRTDNG EF + F+++CRK+GI RH + YT + S
Sbjct: 535 VMIERQAEKEVKVLRTDNGGEFCSDAFDDYCRKEGIGRHHTIPYTPQQNS 584
Score = 60.1 bits (144), Expect = 5e-07
Identities = 25/55 (45%), Positives = 38/55 (68%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMI 1244
E++WLKG+ E+ + + CDSQS I L QI+HERTK+I+I+ H+V D++
Sbjct: 1174 ESVWLKGLFAELCRVDSYINLFCDSQSVICLTKDQIFHERTKYIDIKYHYVCDVV 1228
>gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301702|pir||E84601 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1333
Score = 367 bits (943), Expect = 1e-99
Identities = 227/590 (38%), Positives = 333/590 (55%), Gaps = 46/590 (7%)
Query: 7 DIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDK-------- 58
++EKF G + +WK K+ A L ALK E + + + TE +K
Sbjct: 7 EVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLRREL 66
Query: 59 -------AVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYR 111
A SAI+L + D++LR++ +E +A +M LD LYM+K+L +R KQ+LY ++
Sbjct: 67 LEEKRRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQKLYSFK 126
Query: 112 MMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGK-EGT 170
M E+ I + EF +II DL N +V + DED+A+ LL +LP+ F+ +DT+ YG T
Sbjct: 127 MSENLSIEGNIDEFLRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKYGLGRVT 186
Query: 171 ITLEEVQATLITKELTKFKDLK-VDDSGEGLNV-----SRGRNQNRGKGKGKNSKSKSRS 224
++L+EV A + +KEL + K + EGL V +RGR + RG N+ KSRS
Sbjct: 187 LSLDEVVAAIYSKELELGSNKKSIKGQAEGLFVKEKTETRGRTEQRGNN---NNNKKSRS 243
Query: 225 KGDGNKTKYKCFICHNPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPE 284
K +++K C+IC NG N S+ G + AL+ T E
Sbjct: 244 K---SRSKKGCWICGE-----------SSNGSSN-----YSEANGLYVSEALSSTDIHLE 284
Query: 285 KGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLK 344
WV+D+GCSYH++ ++ +FE L + GG VR+GN K++GIGTIR+K L
Sbjct: 285 DEWVMDTGCSYHMTYKREWFEDLNEDAGGSVRMGNKTVSKVRGIGTIRVKNEAGMVVRLT 344
Query: 345 DVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLYILEGSTIIA 404
+VRYIPE+ RNL+S+ F+ GY ++E G + I G V+ + + LY+L+ +
Sbjct: 345 NVRYIPEMDRNLLSLGTFEKSGYSFKLENGTLSIIAGDSVLLTVRRCYTLYLLQWRPV-T 403
Query: 405 HASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVK 464
S+ V D T LWH RLGH+S++ + L K+GLL +K++KL+ C++C GK ++
Sbjct: 404 EESLSVVKRQDDTILWHRRLGHMSQKNMDLLLKKGLLDKKKVSKLETCEDCIYGKAKRIG 463
Query: 465 FGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEK 523
F + H + E VHSDL G P+ + G YF S IDDY+R+V +Y LK K +AF+K
Sbjct: 464 FNLAQHDTREKLEYVHSDLWGAPSVPFSLGKCQYFISFIDDYTRKVRIYFLKTKDEAFDK 523
Query: 524 FKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
F EW LVENQ ++K LRTDNG EF F+EFC +KGI HR AYT
Sbjct: 524 FVEWANLVENQTDKRIKTLRTDNGLEFCNRSFDEFCSQKGILWHRTCAYT 573
Score = 99.0 bits (245), Expect = 9e-19
Identities = 44/91 (48%), Positives = 67/91 (73%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
EA+W+KG++ E G Q V+I CDSQSAI L+ + ++HERTKHI++R ++RD+I +
Sbjct: 1238 EAVWMKGLLKEFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVRYQYIRDIIANGDG 1297
Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
V K+ +E+NPADIFTK +P ++F+ L L+
Sbjct: 1298 DVVKIDTEKNPADIFTKIVPVNKFQAALTLL 1328
>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1243
Score = 360 bits (924), Expect = 2e-97
Identities = 213/578 (36%), Positives = 328/578 (55%), Gaps = 30/578 (5%)
Query: 5 KWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVSAII 64
K+D+ F LW+VKMRA+L Q+ +AL + + + EK + + KA+S I
Sbjct: 5 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEK-KRDRKAISYIH 63
Query: 65 LCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLTE 124
L L + +L+EV +E TA +W KL+ + MTK L + LKQ+L+ +++ + + +M+ L+
Sbjct: 64 LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHLSA 123
Query: 125 FNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITKE 184
F +I+ DL +++V +++D L LLC+LP S+ NF+ T+LY ++ T+TL+EV KE
Sbjct: 124 FKEIVADLESMEVKYDEDDLGLILLCSLPSSYANFRGTILYSRD-TLTLKEVYDAFHAKE 182
Query: 185 LTKFKDLKVDDSG----EGLNVSRGRNQNRGKGKGKNSKSKSRSKG-DGNKTKYK-CFIC 238
K K + + EGL V RGR Q + KS S +G ++ +YK C C
Sbjct: 183 --KMKKMVTSEGSNSQAEGL-VVRGRQQKKNTKNQSRDKSSSSYRGRTKSRGRYKSCKYC 239
Query: 239 HNPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPEKGWVLDSGCSYHIS 298
GH +C + +D K E A +T + E V +GC+ +
Sbjct: 240 KRDGHDISECWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDAEL-LVAYAGCAQ--T 296
Query: 299 PRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYIPELRRNLIS 358
+ +F T E +GG V +G++ C++ GIGT+++KMFD L DV++IP L+R+LIS
Sbjct: 297 SDQDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNLKRSLIS 356
Query: 359 ISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHASV--PSVDTLD 415
+ G+++++ G+LV+ K K LY L G+TI+ + + S+ D
Sbjct: 357 LY-------------GILKVTKGSLVVMKVDIKSANLYHLRGTTILGNVAAVFDSLSNSD 403
Query: 416 ITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGVGVHKSSRP 475
T LWH+RLGH+SE GL EL+K+GLL + + KL FC++C GK +VKF H +
Sbjct: 404 ATNLWHMRLGHMSEIGLAELSKRGLLDGQSIRKLKFCEHCIFGKHKRVKFNTSTHTTEGI 463
Query: 476 FE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEWDILVENQI 535
+ VHSDL GPA ++GG Y +I+DDYSR+VW Y LK+K AF+ FKEW +VE Q
Sbjct: 464 LDYVHSDLWGPAHKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDGFKEWKTMVERQT 523
Query: 536 GTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
K+K+LRTDNG EF + F +C+ +GI H +T
Sbjct: 524 ERKVKILRTDNGMEFCSKIFKSYCKSEGIVCHYTAPHT 561
Score = 40.4 bits (93), Expect = 0.39
Identities = 18/40 (45%), Positives = 26/40 (65%)
Query: 1181 YIVWYNSNLEAIWLKGMIGEMGISQGCVKIHCDSQSAIHL 1220
Y+ + + EAIWL+G+ E+ C+ I CDSQSAI+L
Sbjct: 1201 YMAIFEACKEAIWLRGLYTELCGVTSCINIFCDSQSAIYL 1240
>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301697|pir||B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1335
Score = 359 bits (922), Expect = 3e-97
Identities = 207/551 (37%), Positives = 302/551 (54%), Gaps = 28/551 (5%)
Query: 38 KREAQMSAHLTPAEKTEMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSL 97
KR+A A L E DKA + I L + DK+LR++ TA W LD L+M +SL
Sbjct: 34 KRDADEVARL------ERCDKAKNVIFLNVADKVLRKIELCKTAAEAWETLDRLFMIRSL 87
Query: 98 AHRQCLKQQLYFYRMMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFE 157
HR + Y ++M E+K I E + +F KI+ DL ++ +++ DE +A+ LL +LP ++
Sbjct: 88 PHRVYTQLSFYTFKMQENKKIDENIDDFLKIVADLNHLQIDVTDEVQAILLLSSLPARYD 147
Query: 158 NFKDTMLYGKEGT-ITLEEVQATLITKELTKFKDLKVDDSGEGLNVSRGRNQNRGKGKGK 216
+TM Y + L++V KE ++ + G + +RGR + +G
Sbjct: 148 GLVETMKYSNSREKLRLDDVMVAARDKERELSQNNRPVVEG---HFARGRPDGKNNNQGN 204
Query: 217 NSKSKSRSKG-DGNKTKYKCFICHNPGHFKKDC------PERKDNGGGNPSVQLASKDEG 269
K++SRSK DG + C+IC GHFKK C + K G N LA E
Sbjct: 205 KGKNRSRSKSADGKRV---CWICGKEGHFKKQCYKWIERNKSKQQGSDNGESSLAKSTEA 261
Query: 270 CESAGALTVTSW------EPEKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKAC 323
A L T WVLD+GCS+H++PRK +F+ + G V++GN+
Sbjct: 262 FNPAMVLLATDETLVVTDSIANEWVLDTGCSFHMTPRKDWFKDFKELSSGYVKMGNDTYS 321
Query: 324 KIQGIGTIRLKMFDDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGAL 383
++GIG+I+++ D +L DVRY+P + RNLIS+ + G + + G+++I G
Sbjct: 322 PVKGIGSIKIRNSDGSQVILTDVRYMPNMTRNLISLGTLEDRGCWFKSQDGILKIVKGCS 381
Query: 384 VIAKGSKIHGLYILEGSTIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGN 443
I KG K LYIL+G T S S + D T LWH RLGH+S++G+ L K+G L
Sbjct: 382 TILKGQKRDTLYILDGVTEEGE-SHSSAEVKDETALWHSRLGHMSQKGMEILVKKGCLRR 440
Query: 444 EKLNKLDFCDNCTLGKQHKVKFGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSII 502
E + +L+FC++C GKQH+V F H + VHSDL G P + G YF S +
Sbjct: 441 EVIKELEFCEDCVYGKQHRVSFAPAQHVTKEKLAYVHSDLWGSPHNPASLGNSQYFISFV 500
Query: 503 DDYSRRVWVYILKNKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKK 562
DDYSR+VW+Y L+ K +AFEKF EW +VENQ K+K LRTDNG E+ F +FC+++
Sbjct: 501 DDYSRKVWIYFLRKKDEAFEKFVEWKKMVENQSDRKVKKLRTDNGLEYCNHYFEKFCKEE 560
Query: 563 GIKRHRIVAYT 573
GI RH+ AYT
Sbjct: 561 GIVRHKTCAYT 571
Score = 108 bits (271), Expect = 9e-22
Identities = 51/91 (56%), Positives = 72/91 (79%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
EA+WLKG E+G SQ V++H DSQSAI LA + ++HERTKHI+IRLHF+RD+I I
Sbjct: 1240 EALWLKGFAAELGHSQDYVEVHSDSQSAITLAKNSVHHERTKHIDIRLHFIRDIICAGLI 1299
Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
KV K+A+E NPA+IFTK++P ++F+ L+++
Sbjct: 1300 KVVKIATECNPANIFTKTVPLAKFEGALNML 1330
>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|53370655|gb|AAU89150.1| integrase core domain
containing protein [Oryza sativa (japonica
cultivar-group)] gi|40538906|gb|AAR87163.1| putative
polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1322
Score = 359 bits (922), Expect = 3e-97
Identities = 215/586 (36%), Positives = 328/586 (55%), Gaps = 25/586 (4%)
Query: 5 KWDIEKFTGSNLFGLWKVKMRAILIQEKCV-EALKREAQMSAHLTPAEKTEMNDKAVSAI 63
K+D+ F LW+VKMRA+L Q + EAL+ + AE+ + KA+S I
Sbjct: 5 KYDLPLLDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 64
Query: 64 ILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLT 123
L L + +L+EV ++ TA +W KL+ + M+K L + +K +L+ +++ ES ++ ++
Sbjct: 65 QLHLSNDILQEVLQKKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLHESGSVLNHIS 124
Query: 124 EFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITK 183
F +I+ DL +++V +DED L LLC+LP S+ NF+ T+L ++ +TL EV L +
Sbjct: 125 VFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRHTILLSRD-ELTLAEVYEALQNR 183
Query: 184 ELTK--FKDLKVDDSGEGLNVSRGRNQNRGKGKGKN---SKSKSRSKGDGNKTKYKCFIC 238
E K + GE L V RGR++ R + S+S+ RSK G K C C
Sbjct: 184 EKMKGMVQSYASSSKGEALQV-RGRSEQRTYNDSNDHDKSQSRGRSKSRGKKF---CKYC 239
Query: 239 HNPGHFKKDC------PERKDNGGGNPSVQLASKDEG-CESAGALTVTSWEPEKGWVLDS 291
HF ++C +RK +G + + D G C A V S + W+LD+
Sbjct: 240 KKKNHFIEECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGYVASHDE---WILDT 296
Query: 292 GCSYHISPRKGYFETLE-LEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYIP 350
CS+HI + +F + + ++ VVR+G++ +I GIG++++K D LKDVR+IP
Sbjct: 297 ACSFHICINRDWFSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIP 356
Query: 351 ELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHASVP 409
+ RNLIS+S D GY GV+++S G+LV G LY+L GST+ +
Sbjct: 357 GMARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAA 416
Query: 410 SV--DTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGV 467
+V D T LWH+RLGH+SE G+ EL K+ LL + FC++C GK +VKF
Sbjct: 417 AVTKDEPSKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNT 476
Query: 468 GVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEW 527
VH++ + VH+DL GP+ + GG Y +IIDDYSR+ W Y LK+K D F FKE
Sbjct: 477 SVHRTKGILDYVHADLWGPSRKPSLGGARYMLTIIDDYSRKEWPYFLKHKDDTFAAFKER 536
Query: 528 DILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
+++E Q ++KVL TDNG EF + F+++CRK+GI RH + YT
Sbjct: 537 KVMIERQTEKEVKVLCTDNGGEFCSDAFDDYCRKEGIVRHHTIPYT 582
Score = 99.0 bits (245), Expect = 9e-19
Identities = 41/94 (43%), Positives = 66/94 (69%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
E++WLKG+ E+ C+ + CDSQSAI L Q++HERTKHI+I+ H+VRD++ ++
Sbjct: 1228 ESVWLKGLFAELCGVDSCINLFCDSQSAICLTKDQMFHERTKHIDIKYHYVRDIVAQGKL 1287
Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLINFI 1283
KV K++ +NPAD+ TK +P ++F+ C L+ +
Sbjct: 1288 KVCKISIHDNPADMMTKPIPVAKFELCSSLVGIV 1321
>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1342
Score = 353 bits (906), Expect = 2e-95
Identities = 223/599 (37%), Positives = 321/599 (53%), Gaps = 55/599 (9%)
Query: 7 DIEKFTGSNLFGLWKVKMRAILI-----------QEKCVEALKREAQMSAHLTPAEKT-- 53
++EKF G + LWK K+ A + +E VE E + P T
Sbjct: 7 EVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDPETATSK 66
Query: 54 -------EMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQ 106
E KA S IIL LG+ +LR+V ++ TA M LD L+M KSL +R LKQ+
Sbjct: 67 LEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNRIYLKQR 126
Query: 107 LYFYRMMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYG 166
LY Y+M E+ + E + +F K+I DL N+ V + DED+A+ LL +LPR F+ K+T+ Y
Sbjct: 127 LYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLKETLKYC 186
Query: 167 KEGTITLEEVQATLITKELTKFKDLK-VDDSGEGLNV-SRGRNQNRGKGKGKNSKSKSRS 224
K T+ LEE+ + + +K L K + ++ +GL V RGR++ RGKG KN KS+S+S
Sbjct: 187 KT-TLHLEEITSAIRSKILELGASGKLLKNNSDGLFVQDRGRSETRGKGPNKN-KSRSKS 244
Query: 225 KGDGNKTKYKCFICHNPGHFKKDC---PERKDNGGGNPSVQLASKDEGCESAGALTVT-- 279
KG G C+IC GHFKK C ER G + + ++ A AL V+
Sbjct: 245 KGAGK----TCWICGKEGHFKKQCYVWKERNKQGSTSERGEASTVTARVTDAAALVVSRA 300
Query: 280 ----SWEPEKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKM 335
+ W+LD+GCS+H++ RK + + G VR+GN+ +++GIG +R+K
Sbjct: 301 LLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMGNDTYSEVKGIGDVRIKN 360
Query: 336 FDDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLY 395
D LL DVRYIPE+ +NLIS+ + G ++G++ I L + G K LY
Sbjct: 361 EDGSTILLTDVRYIPEMSKNLISLGTLEDKGCWFESKKGILTIFKNDLTVLTGKKESTLY 420
Query: 396 ILEGSTIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNC 455
L+G+T+ A+V + D T LWH RLGH+ +GL L +G
Sbjct: 421 FLQGTTLAGEANVIDKEK-DETSLWHSRLGHIGAKGLQVLVSKG---------------- 463
Query: 456 TLGKQHKVKFGVGVHKSSRPFE*VHSDLLGPA*VK-TYGGGSYFTSIIDDYSRRVWVYIL 514
L K + FG H + + VHSDL G V + G YF + IDD++RR W+Y +
Sbjct: 464 HLDKNIMISFGAAKHVTKDKLDYVHSDLWGSTNVPFSIGKCQYFITFIDDFTRRTWIYFI 523
Query: 515 KNKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
+ K +AF KF EW +ENQ KLK+L TDNG EF ++F+ FCRK+G+ RHR AYT
Sbjct: 524 RTKDEAFSKFVEWKTQIENQQDKKLKILITDNGLEFCNQEFDSFCRKEGVIRHRTCAYT 582
Score = 105 bits (262), Expect = 1e-20
Identities = 47/91 (51%), Positives = 68/91 (74%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
EAIWL+G+ EMG Q V++ CDSQSAI L+ + ++HERTKHI++R HF+R+ I EI
Sbjct: 1247 EAIWLRGLAAEMGFEQDAVEVMCDSQSAIALSKNSVHHERTKHIDVRYHFIREKIADGEI 1306
Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
+V K+++ NPADIFTK++P S+ + L L+
Sbjct: 1307 QVVKISTTWNPADIFTKTVPVSKLQEALKLL 1337
>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease]
Length = 1328
Score = 350 bits (897), Expect = 2e-94
Identities = 211/594 (35%), Positives = 318/594 (53%), Gaps = 41/594 (6%)
Query: 3 GSKWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVSA 62
G K+++ KF G N F W+ +MR +LIQ+ + L +++ + + +++++A SA
Sbjct: 3 GVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASA 62
Query: 63 IILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQL 122
I L L D ++ + E TA +W +L+ LYM+K+L ++ LK+QLY M E + L
Sbjct: 63 IRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHL 122
Query: 123 TEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLIT 182
FN +I LAN+ V +E+EDKA+ LL +LP S++N T+L+GK TI L++V + L+
Sbjct: 123 NVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKT-TIELKDVTSALLL 181
Query: 183 KELTKFKDLKVDDSGEGL-NVSRGRNQNRGKGKGKNSKSKSRSKGDGNKTKYKCFICHNP 241
E + K ++ G+ L RGR+ R S ++ +SK C+ C+ P
Sbjct: 182 NEKMRKKP---ENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQP 238
Query: 242 GHFKKDCPERKDNGG----------------GNPSVQL-ASKDEGCESAGALTVTSWEPE 284
GHFK+DCP + G N +V L +++E C PE
Sbjct: 239 GHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSG-------PE 291
Query: 285 KGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLK 344
WV+D+ S+H +P + F + G V++GN KI GIG I +K +LK
Sbjct: 292 SEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLK 351
Query: 345 DVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLY-----ILEG 399
DVR++P+LR NLIS D GY + R++ G+LVIAKG LY I +G
Sbjct: 352 DVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQG 411
Query: 400 STIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGK 459
A + SVD LWH R+GH+SE+GL LAK+ L+ K + CD C GK
Sbjct: 412 ELNAAQDEI-SVD------LWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGK 464
Query: 460 QHKVKFGVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSD 519
QH+V F + + V+SD+ GP +++ GG YF + IDD SR++WVYILK K
Sbjct: 465 QHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQ 524
Query: 520 AFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
F+ F+++ LVE + G KLK LR+DNG E+ +F E+C GI+ + V T
Sbjct: 525 VFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGT 578
Score = 103 bits (257), Expect = 4e-20
Identities = 44/100 (44%), Positives = 72/100 (72%)
Query: 1181 YIVWYNSNLEAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFV 1240
YI + E IWLK + E+G+ Q ++CDSQSAI L+ + +YH RTKHI++R H++
Sbjct: 1224 YIAATETGKEMIWLKRFLQELGLHQKEYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWI 1283
Query: 1241 RDMIETKEIKVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
R+M++ + +KV K+++ ENPAD+ TK +PR++F+ C +L+
Sbjct: 1284 REMVDDESLKVLKISTNENPADMLTKVVPRNKFELCKELV 1323
>gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1241
Score = 350 bits (897), Expect = 2e-94
Identities = 195/501 (38%), Positives = 289/501 (56%), Gaps = 23/501 (4%)
Query: 93 MTKSLAHRQCLKQQLYFYRMMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCAL 152
MTK L + LKQ+L+ +++ + +M+ L+ F +I+ DL +++V ++ED L LLC+L
Sbjct: 1 MTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLESMEVKYDEEDLGLILLCSL 60
Query: 153 PRSFENFKDTMLYGKEGTITLEEVQATLITKELTKFKDLKVDDSG---EGLNVSRGRNQN 209
P S+ NF+DT+LY ++ T+TL+EV L KE K K + + S EGL V RGR Q
Sbjct: 61 PSSYANFRDTILYSRD-TLTLKEVYDALHAKEKMK-KMVPSEGSNSQAEGL-VVRGRQQE 117
Query: 210 RGKGKGKNSKSKSRSKGDG-NKTKYK-CFICHNPGHFKKDCPERKDNGGGNPSVQLASKD 267
+ KS S +G ++ +YK C C GH +C + +D K
Sbjct: 118 KNTNNKSRDKSSSIYRGRSKSRGRYKSCKYCKRDGHDISECWKLQDKDKRTRKYIPKGKK 177
Query: 268 EGCESAGALTVTSWEPE------------KGWVLDSGCSYHISPRKGYFETLELEEGGVV 315
E A +T + E W+LD+ C+YH+ P + +F T E +GG V
Sbjct: 178 EEEGKAAVVTDEKSDAELLVAYAGCAQTSDQWILDTACTYHMCPNRDWFATYEAVQGGTV 237
Query: 316 RLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGV 375
+G++ C++ GIGT+++KMFD L DVR+IP L+R+LIS+ D GY G+
Sbjct: 238 LMGDDTPCEVAGIGTVQIKMFDGCIRTLLDVRHIPNLKRSLISLCTLDRKGYKYSGGDGI 297
Query: 376 MRISHGALVIAKGS-KIHGLYILEGSTIIAHASV--PSVDTLDITKLWHLRLGHVSERGL 432
++++ G+LV+ K K LY L G+TI+ + + S+ D T LWH+RLGH+SE GL
Sbjct: 298 LKVTKGSLVVMKADIKYANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHMSEIGL 357
Query: 433 VELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGVGVHKSSRPFE*VHSDLLGPA*VKTY 492
EL+K+GLL + + KL FC++C GK +VKF H + + VHSDL GPA ++
Sbjct: 358 AELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLWGPARKTSF 417
Query: 493 GGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVL 552
GG Y +I+DDYSR+VW Y LK+K AF+ FKEW +VE Q K+K+LRTDNG E
Sbjct: 418 GGARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVKILRTDNGMELCS 477
Query: 553 EQFNEFCRKKGIKRHRIVAYT 573
+ F +C+ +GI RH V +T
Sbjct: 478 KIFKSYCKSEGIVRHYTVPHT 498
Score = 95.5 bits (236), Expect = 1e-17
Identities = 41/91 (45%), Positives = 63/91 (69%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
E IWL+G+ E+ C+ I CDSQSAI L Q++HERTKHI++R HF+R +I ++
Sbjct: 1147 EVIWLRGLYTELCGVTSCINIFCDSQSAICLTKDQMFHERTKHIDLRYHFIRGVIAEGDV 1206
Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
KV K+++ +NP D+ TK +P ++F+ C L+
Sbjct: 1207 KVCKISTHDNPVDMMTKPVPATKFELCSSLV 1237
>gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana]
gi|25301707|pir||E86490 hypothetical protein F28L22.3 -
Arabidopsis thaliana
Length = 1356
Score = 347 bits (890), Expect = 1e-93
Identities = 205/591 (34%), Positives = 320/591 (53%), Gaps = 29/591 (4%)
Query: 7 DIEKFTGSNLFGLWKVKMRAIL------------IQEKCVEALKREAQMSA--------- 45
+I+ F G F LWK++++A L K V K EA+ +
Sbjct: 9 EIKVFNGDRDFSLWKIRIQAQLGVLGLKDTLTDFSLTKTVPLTKSEAKQESGDGESSGTK 68
Query: 46 HLTPAEKTEMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQ 105
+ K E +++A + II + D +L +V+ T +W L+ YM SL +R +
Sbjct: 69 EVPDPVKIEQSEQAKNIIINHISDVVLLKVNHYATTADLWATLNKKYMETSLPNRIYTQL 128
Query: 106 QLYFYRMMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLY 165
+LY ++M+ + I + + EF +I+ +L ++++ +++E +A+ +L +LP S K T+ Y
Sbjct: 129 KLYSFKMVSTMTIDQNVDEFLRIVAELGSLEIQVDEEVQAILILNSLPASHIQLKHTLKY 188
Query: 166 GKEGTITLEEV--QATLITKELTKFKDLKVDDSGEGLNVSRGRNQNRGKGKGKNSKSKSR 223
G + T+T+++V A + +EL + DL + RGR R KG K +SR
Sbjct: 189 GNK-TLTVQDVTSSAKSLERELAEAVDLDKGQAAVLYTTERGRPLVRNNQKGGQGKGRSR 247
Query: 224 SKGDGNKTKYKCFICHNPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEP 283
S +KTK C+ C GH KKDC RK + E + AL+V
Sbjct: 248 SN---SKTKVPCWYCKKEGHVKKDCYSRKKKMESEGQGEAGVITEKLVFSEALSVNEQMV 304
Query: 284 EKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLL 343
+ W+LDSGC+ H++ R+ +F + + + + LG++ + + QG GTIR+ +L
Sbjct: 305 KDLWILDSGCTSHMTSRRDWFISFQEKGNTTILLGDDHSVESQGQGTIRIDTHGGTIKIL 364
Query: 344 KDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLYILEGSTII 403
++V+Y+P LRRNLIS D LGY G +R +GS +GLY+L+GST++
Sbjct: 365 ENVKYVPHLRRNLISTGTLDKLGYRHEGGEGKVRYFKNNKTALRGSLSNGLYVLDGSTVM 424
Query: 404 AHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKV 463
+ D + T LWH RLGH+S L LA +GL+ +++N+L+FC++C +GK KV
Sbjct: 425 SELCNAETDKVK-TALWHSRLGHMSMNNLKVLAGKGLIDRKEINELEFCEHCVMGKSKKV 483
Query: 464 KFGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFE 522
F VG H S VH+DL G P + G YF SIIDD +R+VW+Y LK+K + F+
Sbjct: 484 SFNVGKHTSEDALSYVHADLWGSPNVTPSISGKQYFLSIIDDKTRKVWLYFLKSKDETFD 543
Query: 523 KFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
KF EW LVENQ+ K+K LRTDNG EF +F+ +C++ GI+RHR YT
Sbjct: 544 KFCEWKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCKEHGIERHRTCTYT 594
Score = 102 bits (255), Expect = 6e-20
Identities = 44/103 (42%), Positives = 72/103 (69%)
Query: 1181 YIVWYNSNLEAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFV 1240
YI + EA+W+KG++ +MG+ Q VKI CDSQSAI L+ + +YHERTKHI++R +++
Sbjct: 1251 YIALAEAAKEAMWIKGLLQDMGMQQDKVKIWCDSQSAICLSKNSVYHERTKHIDVRFNYI 1310
Query: 1241 RDMIETKEIKVEKVASEENPADIFTKSLPRSRFKHCLDLINFI 1283
RD++E+ ++ V K+ + NP D TK +P ++FK L ++ +
Sbjct: 1311 RDVVESGDVDVLKIHTSRNPVDALTKCIPVNKFKSALGVLKLM 1353
>dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]
Length = 1298
Score = 345 bits (886), Expect = 4e-93
Identities = 217/586 (37%), Positives = 329/586 (56%), Gaps = 32/586 (5%)
Query: 2 MGSKWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEK-TEMNDKAV 60
M +K++IEKF G N F LWK+K++AIL ++ C+ A+ ++ T +K +EMN+ A+
Sbjct: 1 MAAKFEIEKFNGKN-FSLWKLKVKAILRKDNCLAAI---SERPVDFTDDKKWSEMNEDAM 56
Query: 61 SAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIME 120
+ + L + D +L + + TA +W+ L+ LY KSL ++ LK++LY RM ES + E
Sbjct: 57 ADLYLSIADGVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTE 116
Query: 121 QLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFE----NFKDTMLYGKEGTITLEEV 176
L N + L ++ +E +++A LL +LP S++ N + +L + ++V
Sbjct: 117 HLNTLNTLFSQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNIL---TDYLVFDDV 173
Query: 177 QATLITKELTKF--KDLKVD-DSGEGLNVSRGRNQNRGKGKGKNSKSKSRSKGDGNKTKY 233
A ++ +E + +D +V+ E L V RGR+ RG+ G+ +SKS +K
Sbjct: 174 AAAVLEEESRRKNKEDRQVNLQQAEALTVMRGRSTERGQSSGRG-RSKS------SKKNL 226
Query: 234 KCFICHNPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPEKG----WVL 289
C+ C GH KKDC N NP +AS + + + E K W++
Sbjct: 227 TCYNCGKKGHLKKDCWNLAQNS--NPQGNVASTSDDGSALCCEASIAREGRKRFADIWLI 284
Query: 290 DSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYI 349
DSG +YH++ RK +F E GG V ++ A +I GIGTI+LKM+D ++DVR++
Sbjct: 285 DSGATYHMTSRKEWFHHYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHV 344
Query: 350 PELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHG-LYILEGSTII-AHAS 407
L++NL+S + D ++GVM+I GALV+ KG KI LY+L+G T+ A AS
Sbjct: 345 KGLKKNLLSYGILDNSATQIETQKGVMKIFQGALVVMKGEKIAANLYMLKGETLQEAEAS 404
Query: 408 VPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGV 467
V + D T LWH +LGH+S++G+ L +Q L+ L C++C KQH++KF
Sbjct: 405 VAACSP-DSTLLWHQKLGHMSDQGMKILVEQKLIPGLTKVSLPLCEHCITSKQHRLKFST 463
Query: 468 GVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEW 527
+ E VHSD+ A V + GG YF S IDDYSRR WVY +K KSD F FK +
Sbjct: 464 SNSRGKVVLELVHSDVW-QAPVPSLGGAKYFVSFIDDYSRRCWVYPIKKKSDVFATFKAF 522
Query: 528 DILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
VE G K+K RTDNG E+ E+F++FC+K+GIKR VAYT
Sbjct: 523 KARVELDSGKKIKCFRTDNGGEYTSEEFDDFCKKEGIKRQFTVAYT 568
Score = 86.3 bits (212), Expect = 6e-15
Identities = 37/96 (38%), Positives = 61/96 (63%)
Query: 1181 YIVWYNSNLEAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFV 1240
Y+ ++ EAIWLK ++ E+G Q V + CDSQSA+HLA + +H RTKHI ++ HF+
Sbjct: 1194 YVAATQASKEAIWLKMLLEELGHKQEFVSLFCDSQSALHLARNPAFHSRTKHIRVQYHFI 1253
Query: 1241 RDMIETKEIKVEKVASEENPADIFTKSLPRSRFKHC 1276
R+ ++ + ++K+ + +N AD TK + +F C
Sbjct: 1254 REKVKEGTVDLQKIHTADNVADFLTKIINVDKFTWC 1289
>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
Length = 1328
Score = 328 bits (842), Expect = 6e-88
Identities = 203/590 (34%), Positives = 311/590 (52%), Gaps = 32/590 (5%)
Query: 3 GSKWDIEKFTGSN-LFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVS 61
G K+++ KF G +F +W+ +M+ +LIQ+ +AL +++ + + E+++KA S
Sbjct: 3 GVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAAS 62
Query: 62 AIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQ 121
AI L L D ++ + E +A +W KL+ LYM+K+L ++ LK+QLY M E +
Sbjct: 63 AIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSH 122
Query: 122 LTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLI 181
L N +I LAN+ V +E+EDK + LL +LP S++ T+L+GK+ +I L++V + L+
Sbjct: 123 LNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKD-SIQLKDVTSALL 181
Query: 182 TKELTKFKDLKVDDSGEG-LNVSRGRNQNRGKGKGKNSKSKSRSKGDGNKTKYKCFICHN 240
E + K ++ G+ + SRGR+ R S ++ +SK C+ C
Sbjct: 182 LNEKMRKKP---ENHGQVFITESRGRSYQRSSSNYGRSGARGKSKVRSKSKARNCYNCDQ 238
Query: 241 PGHFKKDCPERKDNGG--------GNPSVQLASKDEGC----ESAGALTVTSWEPEKGWV 288
PGHFK+DCP K G N + + + D+ E + + E E WV
Sbjct: 239 PGHFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGTESE--WV 296
Query: 289 LDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRY 348
+D+ SYH +P + F + G V++GN KI GIG I K +LKDVR+
Sbjct: 297 VDTAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLKDVRH 356
Query: 349 IPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLY-----ILEGSTII 403
+P+LR NLIS D GY R++ GALVIAKG LY I +G
Sbjct: 357 VPDLRMNLISGIALDQDGYENYFANQKWRLTKGALVIAKGVARGTLYRTNAEICQGELNA 416
Query: 404 AHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKV 463
AH + LWH R+GH SE+GL L+K+ L+ K + C+ GKQH+V
Sbjct: 417 AHEE-------NSADLWHKRMGHTSEKGLQILSKKSLISFTKGTTIKPCNYWLFGKQHRV 469
Query: 464 KFGVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEK 523
F + S + V+SD+ GP +++ GG YF + IDD SR++WVYI + K F+
Sbjct: 470 SFQTSSERKSNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIFRAKDQVFQV 529
Query: 524 FKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
F+++ LVE + G K K LRTDNG E+ +F E+C GI+ + V T
Sbjct: 530 FQKFHALVERETGRKRKRLRTDNGGEYTSREFEEYCSNHGIRHEKTVPGT 579
Score = 79.3 bits (194), Expect = 8e-13
Identities = 34/91 (37%), Positives = 60/91 (65%)
Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
E +WLK + E G+ Q ++C+SQSA+ L+ +YH TKHI++R H++R+M++ +
Sbjct: 1233 EMLWLKRFLQEHGLHQKEYVVYCESQSAMDLSKKAMYHATTKHIDMRYHWIREMVDDGSL 1292
Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
+V K+ + ENPAD+ TK + +F+ +L+
Sbjct: 1293 QVVKIPTSENPADMVTKVVQNEKFELWKELV 1323
>emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir||S05465
retrovirus-related polyprotein - Arabidopsis thaliana
retrotransposon Ta1-3
Length = 1291
Score = 328 bits (840), Expect = 9e-88
Identities = 187/525 (35%), Positives = 292/525 (55%), Gaps = 6/525 (1%)
Query: 52 KTEMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYR 111
K E ++ A++ II +GD +LR++ +A MW L+ YM SL +R ++ + Y ++
Sbjct: 83 KIEKSENAMNIIIAHVGDAVLRKIDHCKSAAEMWETLNKQYMETSLPNRIYVQLKFYSFK 142
Query: 112 MMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTI 171
M ++K I E + EF KI+ +L+++++N+ +E +A+ L L + K T+ YG + +
Sbjct: 143 MNDTKSINENVNEFLKIVAELSSLEINVVEEVRAILFLNRLSSRYSQLKHTLKYGNKA-L 201
Query: 172 TLEEV--QATLITKELTKFKDLKVDDSGEGLNVSRGRNQNRGKGKGKNSKSKSRSKGDGN 229
+L++V A + +EL + K+ + S R R Q R + K + + RSK + N
Sbjct: 202 SLKDVISAARSLERELNEQKETDKNTSTVLYTNERSRPQTRNQNHNKGGQGRGRSKSNSN 261
Query: 230 KTKYKCFICHNPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPEKGWVL 289
K C+ C GH KKD RK + E + AL+V WVL
Sbjct: 262 -AKLTCWYCKKEGHVKKDYFARKRKLESENPGEAGVITEKLVFSEALSVNDLAVRDIWVL 320
Query: 290 DSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYI 349
DSGC+ H+S R+ +F + + G + LG++ + K QG G+I+++ L++V+Y+
Sbjct: 321 DSGCTSHMSARRDWFCSFREDGGPTILLGDDHSVKSQGQGSIKIETHGGTIIGLENVKYV 380
Query: 350 PELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLYILEGSTIIAHASVP 409
PELRRNLIS D GY G +R +G ++GLYIL+G+T+++ V
Sbjct: 381 PELRRNLISTGTLDKRGYKHEGGDGKVRYFKNQKTALRGELVNGLYILDGNTVLSETCVA 440
Query: 410 SVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGVGV 469
+ T+LWH RLGH+ + LA +GL+ E++ LDFC+NC +GK KV F VG
Sbjct: 441 E-GSKGKTELWHSRLGHIGLNNMKVLAGKGLVSKEEIRVLDFCENCVMGKAKKVSFNVGK 499
Query: 470 HKSSRPFE*VHSDLLGPA*VK-TYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEWD 528
H S VH+DL G V + G YF SIIDD +R+VW+Y L++K + F++F EW
Sbjct: 500 HNSEDVLRYVHADLWGSTNVTPSLSGNKYFLSIIDDKTRKVWLYFLRSKDETFDRFCEWK 559
Query: 529 ILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
LVENQ K+K LRTDNG EF +F+ +C++ GI+RH+ YT
Sbjct: 560 ELVENQQNKKVKCLRTDNGLEFCNLKFDAYCKEHGIERHKTCTYT 604
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.343 0.153 0.518
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,022,752,737
Number of Sequences: 2540612
Number of extensions: 82392245
Number of successful extensions: 360308
Number of sequences better than 10.0: 1326
Number of HSP's better than 10.0 without gapping: 934
Number of HSP's successfully gapped in prelim test: 394
Number of HSP's that attempted gapping in prelim test: 355441
Number of HSP's gapped (non-prelim): 3048
length of query: 1283
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1143
effective length of database: 507,674,714
effective search space: 580272198102
effective search space used: 580272198102
T: 11
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.5 bits)
S2: 81 (35.8 bits)
Medicago: description of AC135504.5