
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0220.7
(720 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1... 579 e-163
gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas] 573 e-162
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense] 559 e-157
pir||T02206 hypothetical protein - common tobacco retrotransposo... 550 e-155
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas] 509 e-143
ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sa... 471 e-131
ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultiv... 464 e-129
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-... 464 e-129
gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cult... 462 e-128
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu... 461 e-128
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-... 451 e-125
gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cult... 439 e-121
ref|XP_476137.1| putative polyprotein [Oryza sativa (japonica cu... 436 e-120
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi... 430 e-119
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu... 422 e-116
gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis ... 413 e-114
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi... 407 e-112
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi... 404 e-111
emb|CAB79135.1| putative transposable element [Arabidopsis thali... 401 e-110
emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir|... 398 e-109
>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease]
Length = 1328
Score = 579 bits (1492), Expect = e-163
Identities = 305/714 (42%), Positives = 446/714 (61%), Gaps = 27/714 (3%)
Query: 5 KVKIERFDGRD-FGFWKMLMEDYLYQKMLYQPLT--GKKPNDMKQEDWDLLDRQALGVIR 61
K ++ +F+G + F W+ M D L Q+ L++ L KKP+ MK EDW LD +A IR
Sbjct: 5 KYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASAIR 64
Query: 62 LTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINS 121
L LS +V NI++E T + L ++Y NK++L ++L+ L M EG + H+N
Sbjct: 65 LHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNV 124
Query: 122 FNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSE 181
FN +I+QL+++ + + E + LL SLP S+ T + + +LK D L+L+E
Sbjct: 125 FNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELK-DVTSALLLNE 183
Query: 182 DIRRKDSGESSNTFGSALNTESRGRGSQKS-HNQSQGRGRSKSRGRSQTRVRNDITCWNC 240
+R+K + G AL TE RGR Q+S +N + R KS+ RS++RVRN C+NC
Sbjct: 184 KMRKKPENQ-----GQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRN---CYNC 235
Query: 241 DRKGHFTNQCKAPRKKKNYQKR*DDDESANAATEEVADTLI--------CSLDSPVDSWV 292
++ GHF C PRK K +D++ A + + ++ L P WV
Sbjct: 236 NQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWV 295
Query: 293 IDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRH 352
+D+ AS H P ++L Y+ G FG V + + I GIGDI I+++ G L +VRH
Sbjct: 296 VDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRH 355
Query: 353 VPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYM----VAEEDMIA 408
VP ++ NLIS LD +GY + F W++TKG+LV+A+G RG+LY + + ++ A
Sbjct: 356 VPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQGELNA 415
Query: 409 VTEAINSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAG 468
+ I S +WH+R+GHMSEKG++I+A K +S K + C++C+ GKQ +VSF +
Sbjct: 416 AQDEI-SVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSS 474
Query: 469 RKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWK 528
+ K L+LV++DV GP ++S+GG++Y+VTFIDD++RK+WVY LK+K VF VF+K+
Sbjct: 475 ER-KLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFH 533
Query: 529 TEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLN 588
VE +TG K+K L+SDNGGEY S+EF+++CS +GIR KT+PGTP+ NGVAERMNRT+
Sbjct: 534 ALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIV 593
Query: 589 ERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGC 648
E+ R M + LPK FW +A+ TA YLINR PSVPL +++PE VW KEVS SHLKVFGC
Sbjct: 594 EKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGC 653
Query: 649 VSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNES 702
++ + ++R KLD K+I C FIGYG + +GYR WD +K+IRS +V F ES
Sbjct: 654 RAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRES 707
>gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]
Length = 1415
Score = 573 bits (1477), Expect = e-162
Identities = 298/718 (41%), Positives = 439/718 (60%), Gaps = 22/718 (3%)
Query: 1 LEEGKVKIERFDGRDFGFWKMLMEDYLYQKMLYQPL-TGKKPNDMKQEDWDLLDRQALGV 59
+E + R +GR++ WK M+D L+ K L+ P+ KP +M E+WD +Q G
Sbjct: 1 METNTSNMVRLNGRNYHIWKAKMKDLLFVKKLHLPVFASAKPENMSDEEWDFEHQQVCGY 60
Query: 60 IRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHI 119
IR + NV +I+NE L L +Y NK+ L++++ N+R EG + +H+
Sbjct: 61 IRQWVEDNVLNHIINETHARSLWNKLETLYASKTGNNKLFLLKQMMNIRYREGTLINDHV 120
Query: 120 NSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLIL 179
N F ++ QLS + I F++E++ L LL +LPDSW +++NSA + + + ++ IL
Sbjct: 121 NDFQGVLDQLSGMGIKFEDEVLGLWLLNTLPDSWETFRVSLTNSAPNGVVTMEYVKSGIL 180
Query: 180 SEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWN 239
+E+ RR+ S ++S + L T+ RGR QK RGR KSR +S++R + DI C
Sbjct: 181 NEEARRR-SQDTSTSQSDILVTDDRGRNKQKGQ-----RGRDKSRSKSRSRYK-DIECHY 233
Query: 240 CDRKGHFTNQC-KAPRKKKNYQKR*DDDESANAATEEVADTLICSLDSPVD------SWV 292
C +K H K R+KK K D ++ N AD L+ D+ ++ +W+
Sbjct: 234 CGKKSHIKKYSFKWKREKKQDNK---DGDTGNQVATVRADLLVACDDNVINVACHETTWI 290
Query: 293 IDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRH 352
+DSGA++H P KE ++Y G FG++ + + + + G G + + +SNGT L NV+H
Sbjct: 291 VDSGAAYHVTPRKEFFTSYTPGDFGELRMGNDGQVKVTGTGTVCLETSNGTKLVLKNVKH 350
Query: 353 VPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYMV---AEEDMIAV 409
P I+ NLIS G+LDD+G+ FG G WK+TKG+LVVARG K +LY + +D + V
Sbjct: 351 APDIRLNLISTGKLDDDGFCCFFGDGHWKITKGSLVVARGNKSSNLYSLQSSVSDDSVNV 410
Query: 410 TEAINSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGR 469
E +S +WH+RLGHMS KG+ +A K K+S +K L C HC+ GKQR+VSF
Sbjct: 411 VEKECASELWHKRLGHMSVKGIDYLAKKSKLSGVKEAKLDKCVHCLAGKQRRVSFMSHPP 470
Query: 470 KSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKT 529
KSE L+L+H+DV GP V+SLGG+ Y+VTFIDD +RK+WVY LK KSDV VFK++
Sbjct: 471 TRKSEPLDLIHSDVCGPMKVRSLGGASYFVTFIDDYSRKLWVYTLKHKSDVLGVFKEFHA 530
Query: 530 EVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNE 589
VE QTG K+K +++DNGGEY F ++C GIR KT P P+ NG+AERMNRT+ E
Sbjct: 531 LVERQTGKKLKCIRTDNGGEY-CGPFDEYCRRYGIRHQKTPPKIPQLNGLAERMNRTIME 589
Query: 590 RARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCV 649
R RCM + LP FW +A++TA ++IN P + L ++P++VW GK+VS HL+VFGC
Sbjct: 590 RVRCMLDDAKLPSSFWAEAVSTAVHVINLSPVIALKNEVPDKVWCGKDVSYDHLRVFGCK 649
Query: 650 SYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKD 707
++V + D+R KLD K +C FIGYG D +GYR +D +K++RS +V F E+ +D
Sbjct: 650 AFVHVPRDERSKLDSKTRQCIFIGYGFDEFGYRLYDPVEKKLVRSRDVVFFENQTIED 707
>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
Length = 1328
Score = 559 bits (1441), Expect = e-157
Identities = 293/715 (40%), Positives = 437/715 (60%), Gaps = 28/715 (3%)
Query: 5 KVKIERFDGRD--FGFWKMLMEDYLYQKMLYQPLTGK--KPNDMKQEDWDLLDRQALGVI 60
K ++ +F+G F W+ M+D L Q+ L++ L GK KP MK EDW+ LD +A I
Sbjct: 5 KYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAASAI 64
Query: 61 RLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHIN 120
RL L+ +V NIV+E++ + L N+Y NK++L ++L+ L M EG + H+N
Sbjct: 65 RLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSHLN 124
Query: 121 SFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILS 180
N +I+QL+++ + + E + LL SLP S+ T + + +LK D L+L+
Sbjct: 125 VLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKDSIQLK-DVTSALLLN 183
Query: 181 EDIRRKDSGESSNTFGSALNTESRGRGSQKSH-NQSQGRGRSKSRGRSQTRVRNDITCWN 239
E +R+K G TESRGR Q+S N + R KS+ RS+++ RN C+N
Sbjct: 184 EKMRKKPENH-----GQVFITESRGRSYQRSSSNYGRSGARGKSKVRSKSKARN---CYN 235
Query: 240 CDRKGHFTNQCKAPRKKKNYQKR*DDDESANAATEEVADTLIC--------SLDSPVDSW 291
CD+ GHF C P++ K +D++ A + D ++ L W
Sbjct: 236 CDQPGHFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGTESEW 295
Query: 292 VIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVR 351
V+D+ AS+H P ++L Y+ G +G V + + I GIGDI +++ G L +VR
Sbjct: 296 VVDTAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLKDVR 355
Query: 352 HVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYM----VAEEDMI 407
HVP ++ NLIS LD +GY F W++TKG LV+A+G RG+LY + + ++
Sbjct: 356 HVPDLRMNLISGIALDQDGYENYFANQKWRLTKGALVIAKGVARGTLYRTNAEICQGELN 415
Query: 408 AVTEAINSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKA 467
A E NS+ +WH+R+GH SEKG++I++ K +S K + C + + GKQ +VSF +
Sbjct: 416 AAHEE-NSADLWHKRMGHTSEKGLQILSKKSLISFTKGTTIKPCNYWLFGKQHRVSFQTS 474
Query: 468 GRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKW 527
+ KS L+LV++DV GP ++S+GG++Y+VTFIDD++RK+WVY ++K VF VF+K+
Sbjct: 475 SER-KSNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIFRAKDQVFQVFQKF 533
Query: 528 KTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTL 587
VE +TG K K L++DNGGEY S+EF+++CS +GIR KT+PGTP+ NGVAERMNRT+
Sbjct: 534 HALVERETGRKRKRLRTDNGGEYTSREFEEYCSNHGIRHEKTVPGTPQHNGVAERMNRTI 593
Query: 588 NERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFG 647
E+ R M + LPK FW +A+ TA YLINR PSVPL++ +PE VW KE+S SHLKVFG
Sbjct: 594 VEKVRSMLRMAKLPKTFWGEAVRTACYLINRSPSVPLEFDIPERVWTNKEMSYSHLKVFG 653
Query: 648 CVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNES 702
C ++ + ++R KLD K++ C FIGYG + +GYR WD +K+IRS +V F ES
Sbjct: 654 CKAFAHVPKEQRTKLDDKSVPCIFIGYGDEEFGYRLWDLVKKKVIRSRDVIFRES 708
>pir||T02206 hypothetical protein - common tobacco retrotransposon Tto1
gi|1167523|dbj|BAA11674.1| ORF(AA 1-1338) [Nicotiana
tabacum]
Length = 1338
Score = 550 bits (1418), Expect = e-155
Identities = 292/724 (40%), Positives = 428/724 (58%), Gaps = 22/724 (3%)
Query: 1 LEEGKVKIERFDGRDFGFWKMLMEDYLYQKMLYQPL-TGKKPNDMKQEDWDLLDRQALGV 59
+E K+ +G ++ W+ M+D L+ ++ P+ + +KP D EDW+ Q G
Sbjct: 1 MEARTSKMVNLNGTNYHLWRNKMKDLLFVTKMHLPVFSSQKPEDKSDEDWEFEHNQVCGY 60
Query: 60 IRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHI 119
IR + NV +I L L +Y NK+ + +L ++ EG +V +H+
Sbjct: 61 IRQFVEDNVYNHISGVTHARSLWDKLEELYASKTGNNKLFYLTKLMQVKYVEGTTVADHL 120
Query: 120 NSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLIL 179
N I+ QLS + I FD+E++ L +L +LP+SW +++NSA + + + ++ IL
Sbjct: 121 NEIQGIVDQLSGMGIKFDDEVLALMVLATLPESWETLKVSITNSAPNGVVNMETVKSGIL 180
Query: 180 SEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWN 239
+E++RR+ G SS+ L +RGR KS + R KSRG+S ++ C
Sbjct: 181 NEEMRRRSQGTSSSQ-SEVLAVTTRGRSQNKSQSN-----RDKSRGKSNKFA--NVECHY 232
Query: 240 CDRKGHFTNQCKAPR--KKKNYQKR*DDDESANAATEE------VADTLICSLDSPVDSW 291
C +KGH C+ + +KKN K+ +ES++ T V D I +L + +W
Sbjct: 233 CKKKGHIKRFCRQFQNDQKKNKGKKVKPEESSDDETNSFGEFNVVYDDDIINLTTQEMTW 292
Query: 292 VIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVR 351
VIDSGA+ H P +EL S+Y G FG+V + + +VG GD+ + + NG L +VR
Sbjct: 293 VIDSGATIHATPRRELFSSYTLGDFGRVKMGNANFSTVVGKGDVCLETMNGMKLLLRDVR 352
Query: 352 HVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYMVA---EEDMIA 408
HVP ++ NLIS+ +LD+EGY TF G WK+TKG+L+VARG K+ LY+ + +I
Sbjct: 353 HVPDMRLNLISVDKLDEEGYCNTFHNGQWKLTKGSLMVARGTKQSKLYVTQASISQQVIN 412
Query: 409 VTEAINSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAG 468
V E ++ +WH+RLGHMSEK M + K + L + L C C+ GKQ +VSF +
Sbjct: 413 VAENDSNIKLWHRRLGHMSEKSMARLVKKNALPGLNQIQLKKCADCLAGKQNRVSFKRFP 472
Query: 469 RKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWK 528
+ L+LVH+DV GP KSLGG+RY+VTFIDD +RK WVY LK+K VF VFK++
Sbjct: 473 PSRRQNVLDLVHSDVCGPFK-KSLGGARYFVTFIDDHSRKTWVYTLKTKDQVFQVFKQFL 531
Query: 529 TEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLN 588
T VE +TG K+K +++DNGGEY Q F +C E+GIR T P TP+ NG+AERMNRTL
Sbjct: 532 TLVERETGKKLKCIRTDNGGEYQGQ-FDAYCKEHGIRHQFTPPKTPQLNGLAERMNRTLI 590
Query: 589 ERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGC 648
ER RC+ S LPK FW +A+ TAAY++N P VPL Y+ PE++W G+++S L+VFGC
Sbjct: 591 ERTRCLLSHSKLPKAFWGEALVTAAYVLNHSPCVPLQYKAPEKIWLGRDISYDQLRVFGC 650
Query: 649 VSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDR 708
+YV + D+R KLD K +C FIGYG DM GY+F+D +K++RS +V F E +D
Sbjct: 651 KAYVHVPKDERSKLDVKTRECVFIGYGQDMLGYKFYDPVEKKLVRSRDVVFVEDQTIEDI 710
Query: 709 SSAE 712
E
Sbjct: 711 DKVE 714
>dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]
Length = 1298
Score = 509 bits (1312), Expect = e-143
Identities = 279/716 (38%), Positives = 418/716 (57%), Gaps = 32/716 (4%)
Query: 5 KVKIERFDGRDFGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWDLLDRQALGVIRLTL 64
K +IE+F+G++F WK+ ++ L + ++ + + + W ++ A+ + L++
Sbjct: 4 KFEIEKFNGKNFSLWKLKVKAILRKDNCLAAISERPVDFTDDKKWSEMNEDAMADLYLSI 63
Query: 65 SKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNT 124
+ V +I +KT ++ L+ +YE NK+ L R+L+ LRM E SVTEH+N+ NT
Sbjct: 64 ADGVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTEHLNTLNT 123
Query: 125 IISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSEDIR 184
+ SQL+S+ + + LLQSLPDS+ + ++N+ + L FDD+ +L E+ R
Sbjct: 124 LFSQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNILTDYLVFDDVAAAVLEEESR 183
Query: 185 RKDSGESS-NTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRK 243
RK+ + N + T RGR +++ QS GRGRSKS + ++TC+NC +K
Sbjct: 184 RKNKEDRQVNLQQAEALTVMRGRSTERG--QSSGRGRSKSS-------KKNLTCYNCGKK 234
Query: 244 GHFTNQCKAPRKKKNYQKR*DDDESANAATEEVADTLICSLDSP-------VDSWVIDSG 296
GH C + N Q A+T + L C D W+IDSG
Sbjct: 235 GHLKKDCWNLAQNSNPQGN-------VASTSDDGSALCCEASIAREGRKRFADIWLIDSG 287
Query: 297 ASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGI 356
A++H KE +Y G VY D L+I+GIG I ++ +GT+ T+ +VRHV G+
Sbjct: 288 ATYHMTSRKEWFHHYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHVKGL 347
Query: 357 KRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKK-RGSLYMVAEEDMIAVTEAI-- 413
K+NL+S G LD+ G K+ +G LVV +G+K +LYM+ E + ++
Sbjct: 348 KKNLLSYGILDNSATQIETQKGVMKIFQGALVVMKGEKIAANLYMLKGETLQEAEASVAA 407
Query: 414 ---NSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRK 470
+S+ +WHQ+LGHMS++GMKI+ + + L V L +CEHCI KQ ++ FS + +
Sbjct: 408 CSPDSTLLWHQKLGHMSDQGMKILVEQKLIPGLTKVSLPLCEHCITSKQHRLKFSTSNSR 467
Query: 471 SKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTE 530
K LELVH+DVW APV SLGG++Y+V+FIDD +R+ WVY +K KSDVF+ FK +K
Sbjct: 468 GKVV-LELVHSDVW-QAPVPSLGGAKYFVSFIDDYSRRCWVYPIKKKSDVFATFKAFKAR 525
Query: 531 VENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNER 590
VE +G KIK ++DNGGEY S+EF FC + GI+ T+ TP+QNGVAERMNRTL ER
Sbjct: 526 VELDSGKKIKCFRTDNGGEYTSEEFDDFCKKEGIKRQFTVAYTPQQNGVAERMNRTLLER 585
Query: 591 ARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVS 650
R M +GL K FW +A+NTA YL+NR PS ++ + P E+W GK V S+L +FG +
Sbjct: 586 TRAMLRAAGLEKSFWAEAVNTACYLVNRAPSTAIELKTPMEMWTGKPVDYSNLHIFGSIV 645
Query: 651 YVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYK 706
Y + ++ + KLDPK+ KC F+GY + GYR WD K++ S +V F E L +
Sbjct: 646 YAMYNAQEITKLDPKSRKCRFLGYADGVKGYRLWDPTAHKVVISRDVIFVEDRLQR 701
>ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sativa]
gi|14029020|gb|AAK52561.1| Putative retroelement pol
polyprotein [Oryza sativa]
Length = 1326
Score = 471 bits (1213), Expect = e-131
Identities = 272/709 (38%), Positives = 408/709 (57%), Gaps = 29/709 (4%)
Query: 16 FGFWKMLMEDYLYQKM-LYQPLT--GKKPNDMKQEDWDLLDRQALGVIRLTLSKNVAFNI 72
F W++ M L Q L + L GKK + + DR+AL +I+L LS ++ +
Sbjct: 17 FSLWQVKMRAILAQTSDLDEALESFGKKKSTEWTAEEKRKDRKALLLIQLHLSNDILQEV 76
Query: 73 VNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSV 132
+ EKT A+L L ++ +K+H+ +LF+ ++ E SV HI+ F I+ L S+
Sbjct: 77 LQEKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHISVFKEIVVDLVSI 136
Query: 133 KITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESS 192
++ FD+E + L LL SLP S+A + S RD + L E ++ ++S
Sbjct: 137 EVQFDDEDLGLLLLCSLPSSYANFRDTILLS-RDELTLAEVYEALQNREKMKGMVQSDAS 195
Query: 193 NTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCKA 252
++ G AL RGR Q+++N S R +S+SRGRS++R + C C +K HF +C
Sbjct: 196 SSKGEALQV--RGRSEQRTYNDSSDRDKSQSRGRSKSRGKK--FCKYCKKKNHFIEECW- 250
Query: 253 PRKKKNYQKR*DDDESANAATEEVADTLICSLD-----SPVDSWVIDSGASFHTIPSKEL 307
K +N +KR D +++ + E +D+ C + + D W++D+ SFH +++
Sbjct: 251 --KLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGCVASHDEWILDTACSFHICINRDW 308
Query: 308 LSNYICGKFGKVY-LADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQL 366
S+Y + G V + D P +IVGIG + I++ +G TL +VRH+PG+ RNLIS+ L
Sbjct: 309 FSSYKSVQNGDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPGMARNLISLSTL 368
Query: 367 DDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYMVAEEDMI--AVTEAINS------SSI 418
D EGY + GG KV+KG+LV G + V + +VT A S +++
Sbjct: 369 DAEGYKYSSSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAAAVSKDEPIKTNL 428
Query: 419 WHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLEL 478
WH RLGHMSE GM + + + + CEHC+ GK ++V F+ + ++K L+
Sbjct: 429 WHMRLGHMSELGMAELMKRNLLDGCTQGKMKFCEHCVFGKHKRVKFNTSVHRTKGI-LDY 487
Query: 479 VHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLK 538
VHTD+WGP+ LGG+RY +T IDD +RKVW YFLK K D F+ FK+WK +E QT +
Sbjct: 488 VHTDLWGPSRKAYLGGARYMLTIIDDYSRKVWPYFLKHKDDTFAAFKEWKVRIERQTEKE 547
Query: 539 IKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQS 598
+K L++DNGGE+ S F +C + GI TIP TP+QNGVAERMNRT+ +ARCM +
Sbjct: 548 VKVLRTDNGGEFCSDAFDDYCRKEGIVRHHTIPYTPQQNGVAERMNRTIISKARCMLSNA 607
Query: 599 GLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDK 658
+ K FW +A NTA YLINR PS+PL+ + P EVW G S L+VFGC +Y +D+
Sbjct: 608 RMNKRFWAEAANTACYLINRSPSIPLNKKTPIEVWSGMPADYSQLRVFGCTAYAHVDN-- 665
Query: 659 RDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKD 707
KL+P+AIKC F+GYGS + Y+ W+ + K +V FN+SV++ D
Sbjct: 666 -GKLEPRAIKCLFLGYGSGVKRYKLWNPETNKTFMRRSVVFNKSVMFND 713
>ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultivar-group)]
gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza
sativa (japonica cultivar-group)]
Length = 1181
Score = 464 bits (1193), Expect = e-129
Identities = 271/732 (37%), Positives = 424/732 (57%), Gaps = 45/732 (6%)
Query: 16 FGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWD----LLDRQALGVIRLTLSKNVAFN 71
F W++ M L Q+ L L+G D + +DW DR+A+ I L LS N+
Sbjct: 17 FSLWQVKMRAVLAQQELDDALSGF---DKRTQDWSNDEKKRDRKAMSYIHLHLSNNILQE 73
Query: 72 IVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSS 131
++ E+T A L L + +K+HL ++LF ++ + SV +H+++F I++ L S
Sbjct: 74 VLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLES 133
Query: 132 VKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLI-LSEDIRRKDSGE 190
+++ +D + + L LL SLP S+A + S RD L ++ D + E +++ E
Sbjct: 134 MEVKYDEKDLALILLCSLPSSYANFRDTILYS-RDT-LTLKEVYDALHAKEKMKKMVPSE 191
Query: 191 SSNTFGSALNTESRGRGSQK---SHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFT 247
SN+ L RGSQ+ ++N+S+ + S RGRS++R R +C C R GH
Sbjct: 192 GSNSQAEGLVV----RGSQQEKNTNNKSRDKSSSSYRGRSKSRGRYK-SCKYCKRDGHDI 246
Query: 248 NQC-KAPRKKKNYQK-----R*DDDESANAATEEVADTLI------CSLDSPVDSWVIDS 295
++C K K K K + +++ A T+E +D + C+ S D W++D+
Sbjct: 247 SKCWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTS--DQWILDT 304
Query: 296 GASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPG 355
++H P+++ + Y + G V + D P ++ GIG + I+ +G + TL +VRH+P
Sbjct: 305 ACTYHMCPNRDWFATYEVVQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVRHIPN 364
Query: 356 IKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGK-KRGSLYMVAEEDMIA----VT 410
+KR+LIS+ LD +GY + G G KVTKG+LVV + K +LY + ++ V+
Sbjct: 365 LKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKASIKSANLYHLQGTTILGNVATVS 424
Query: 411 EAINSS---SIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKA 467
+++++S ++WH RLGHMSE G+ ++ +G + L CEHCI GK ++V F+ +
Sbjct: 425 DSLSNSDATNLWHMRLGHMSEIGLAELSKRGLLDGQSISKLKFCEHCIFGKHKRVKFNTS 484
Query: 468 GRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKW 527
++ L+ VH+D+WGPA S GG+RY +T +DD +RKVW YFLK K F+VFK+W
Sbjct: 485 THTTEGI-LDYVHSDLWGPARKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFNVFKEW 543
Query: 528 KTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTL 587
KT VE QT K+K L++DNG E+ S+ FK +C GI T+P TP+QNGVAERMNR +
Sbjct: 544 KTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRII 603
Query: 588 NERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFG 647
+ARCM +GLPK FW +A++TA YLINR PS + + P EVW G + S LKVFG
Sbjct: 604 ISKARCMLSNAGLPKQFWAEAVSTACYLINRSPSY-ANKKTPIEVWSGSPANYSDLKVFG 662
Query: 648 CVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKD 707
C +Y +D+ KL+P+AIKC F+GY S + GY+ W + +K++ S NV F+ES++ D
Sbjct: 663 CTAYAHVDN---GKLEPRAIKCIFLGYPSSVKGYKLWCPETKKVVISRNVVFHESIMLHD 719
Query: 708 RSSAESMSSSKQ 719
+ S S++
Sbjct: 720 KPSTNVPVESQE 731
>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
sativa (japonica cultivar-group)]
Length = 2340
Score = 464 bits (1193), Expect = e-129
Identities = 271/729 (37%), Positives = 418/729 (57%), Gaps = 38/729 (5%)
Query: 16 FGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWD----LLDRQALGVIRLTLSKNVAFN 71
F W++ M L Q+ L L+G D + DW DR+A+ I L LS N+
Sbjct: 224 FSLWQVKMRAVLAQQDLDDALSGF---DKRTHDWSNDEKKRDRKAMSYIHLHLSNNILQE 280
Query: 72 IVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSS 131
++ E+ A L L + +K+HL + LF ++ + SV +H+++F II+ L S
Sbjct: 281 VLKEEIAAGLWLKLEQICMTKDLTSKMHLKQTLFLHKLQDDGSVMDHLSAFKEIIADLES 340
Query: 132 VKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLI-LSEDIRRKDSGE 190
+++ +D E + L LL SLP S+A + S RD L ++ D + + E +++ E
Sbjct: 341 MEVKYDEEDLGLILLCSLPSSYANFRDTILYS-RDT-LTLKEVYDALHVKEKMKKMVPSE 398
Query: 191 SSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQC 250
SN+ L R + + + NQS+ + S RGRS++R R +C C R GH +C
Sbjct: 399 GSNSQAEGLIVWGRQQ-EKNTKNQSRDKSSSSYRGRSKSRGRYK-SCKYCKRDGHDIFEC 456
Query: 251 -----KAPRKKKNYQKR*DDDES-ANAATEEVADTLI------CSLDSPVDSWVIDSGAS 298
K R K K ++E A T+E +D + C+ S D W++++
Sbjct: 457 WKLHDKDKRTGKYVPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTS--DQWILNTACI 514
Query: 299 FHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKR 358
+H P+++ + Y + G V + D P ++ GIG + I+ +G + TL +VRH+P +KR
Sbjct: 515 YHMCPNRDWFATYEAVQVGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVRHIPNLKR 574
Query: 359 NLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGK-KRGSLYMVAEEDMI----AVTEAI 413
+LIS+ LD +GY + G G KVTKG+LVV + K +LY + ++ AV++++
Sbjct: 575 SLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTILGNVAAVSDSL 634
Query: 414 NSS---SIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRK 470
++S ++WH RLGHM+E G+ ++ +G + L CEHCI GK ++V F+ +
Sbjct: 635 SNSDATNLWHMRLGHMTEIGLAELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHT 694
Query: 471 SKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTE 530
++ L+ VH+D+WGPA S GG+RY +T +DD +RKVW YFLK K F VFK+WKT
Sbjct: 695 TEGI-LDYVHSDLWGPARKTSFGGTRYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTM 753
Query: 531 VENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNER 590
VE QT K+K L++DNG E+ S+ FK +C GI T+P TP+QNGVAERMNRT+ +
Sbjct: 754 VERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISK 813
Query: 591 ARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVS 650
ARC+ +GLPK FW +A++TA YLINR PS +D + P EVW G + S L+VFGC +
Sbjct: 814 ARCLLSNAGLPKQFWAEAVSTACYLINRSPSYAIDKKTPIEVWSGSPANYSDLRVFGCTA 873
Query: 651 YVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDRSS 710
Y +D+ KL+P+AIKC F+GY S + GY+ W + +K++ S NV F+ESV+ D+ S
Sbjct: 874 YAHVDN---GKLEPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHESVMLHDKPS 930
Query: 711 AESMSSSKQ 719
S++
Sbjct: 931 TNVPVESQE 939
>gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37535452|ref|NP_922028.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|22094359|gb|AAM91886.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1280
Score = 462 bits (1188), Expect = e-128
Identities = 270/730 (36%), Positives = 419/730 (56%), Gaps = 40/730 (5%)
Query: 16 FGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWD----LLDRQALGVIRLTLSKNVAFN 71
F W++ M L Q+ L L+G D + +DW DR+A+ I L LS N+
Sbjct: 52 FSLWQVKMRAVLAQQDLDDALSGF---DKRTQDWSNDEKKKDRKAMSYIHLHLSNNILQE 108
Query: 72 IVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSS 131
++ E+T A L L + +K+HL ++LF ++ + SV +H+++F I++ L S
Sbjct: 109 VLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSTFKEIVADLES 168
Query: 132 VKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLI-LSEDIRRKDSGE 190
+++ +D E + L LL SLP S+A + S + L ++ D + E +++ E
Sbjct: 169 IEVKYDEEDLGLILLCSLPSSYANFRDTILYS--HDTLILKEVYDALHAKEKMKKMVPSE 226
Query: 191 SSNTFGSALNTESRGRGSQKS-HNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQ 249
SN+ L RGR +K+ NQS+ + S RGRS++R R +C C R GH ++
Sbjct: 227 GSNSQAEGLVV--RGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYK-SCKYCKRDGHDISE 283
Query: 250 C-KAPRKKKNYQK-----R*DDDESANAATEEVADTLI------CSLDSPVDSWVIDSGA 297
C K K K K + +++ A T+E +DT + C+ S D W++D+
Sbjct: 284 CWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDTELLVAYAGCAQTS--DQWILDTAW 341
Query: 298 SFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIK 357
++H P+++ + Y + G V + D P ++ GIG + I+ +G + TL +VRH+P +K
Sbjct: 342 TYHMCPNRDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGYIRTLSDVRHIPNLK 401
Query: 358 RNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGK-KRGSLYMVAEEDMI----AVTEA 412
R+LIS+ LD +GY + G G KVTKG+LVV + K +LY + ++ AV+++
Sbjct: 402 RSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTILGNVAAVSDS 461
Query: 413 INSS---SIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGR 469
+++S ++WH RLGHMSE G+ ++ + + L CEHCI GK ++V F+ +
Sbjct: 462 LSNSDATNLWHMRLGHMSEIGLAELSKRELLDGQSIGKLKFCEHCIFGKHKRVKFNTSTH 521
Query: 470 KSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKT 529
++ L+ VH+D+WGPA S GG+RY +T +DD +RKVW YFLK K F VFK+WKT
Sbjct: 522 TTEGI-LDYVHSDLWGPACKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKT 580
Query: 530 EVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNE 589
VE QT K+K L++DNG E+ S+ FK +C GI T+P TP+QNGVAERMN +
Sbjct: 581 MVERQTEKKVKILRTDNGMEFCSKIFKSYCKSEGIVHHYTVPHTPQQNGVAERMNMAIIS 640
Query: 590 RARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCV 649
+ARCM + LPK FW +A++T YLINR PS D + P EVW G + S L+VFGC
Sbjct: 641 KARCMLSNADLPKQFWAEAVSTTCYLINRSPSYATDKKTPIEVWSGSPANYSDLRVFGCT 700
Query: 650 SYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDRS 709
+Y +D+ KL+P+AIKC F+GY S + GY+ W + +K++ S NV F+ESV+ D+
Sbjct: 701 AYAHVDN---GKLEPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHESVILHDKP 757
Query: 710 SAESMSSSKQ 719
S S++
Sbjct: 758 STNVPVESQE 767
>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|53370655|gb|AAU89150.1| integrase core domain
containing protein [Oryza sativa (japonica
cultivar-group)] gi|40538906|gb|AAR87163.1| putative
polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1322
Score = 461 bits (1185), Expect = e-128
Identities = 269/709 (37%), Positives = 407/709 (56%), Gaps = 29/709 (4%)
Query: 16 FGFWKMLMEDYLYQKM-LYQPLT--GKKPNDMKQEDWDLLDRQALGVIRLTLSKNVAFNI 72
F W++ M L Q L + L GKK + DR+AL +I+L LS ++ +
Sbjct: 17 FSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLIQLHLSNDILQEV 76
Query: 73 VNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSV 132
+ +KT A+L L ++ +K+H+ +LF+ ++ E SV HI+ F I++ L S+
Sbjct: 77 LQKKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLHESGSVLNHISVFKEIVADLVSM 136
Query: 133 KITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESS 192
++ FD+E + L LL SLP S+A + S RD + L E ++ +S
Sbjct: 137 EVQFDDEDLGLLLLCSLPSSYANFRHTILLS-RDELTLAEVYEALQNREKMKGMVQSYAS 195
Query: 193 NTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCKA 252
++ G AL RGR Q+++N S +S+SRGRS++R + C C +K HF +C
Sbjct: 196 SSKGEALQV--RGRSEQRTYNDSNDHDKSQSRGRSKSRGKK--FCKYCKKKNHFIEECW- 250
Query: 253 PRKKKNYQKR*DDDESANAATEEVADTLICSLD-----SPVDSWVIDSGASFHTIPSKEL 307
K +N +KR D +++ + E +D+ C + + D W++D+ SFH +++
Sbjct: 251 --KLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGYVASHDEWILDTACSFHICINRDW 308
Query: 308 LSNYICGKFGKVY-LADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQL 366
S+Y + V + D P +IVGIG + I++ +G TL +VRH+PG+ RNLIS+ L
Sbjct: 309 FSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPGMARNLISLSTL 368
Query: 367 DDEGYHTTFGGGAWKVTKGNLVVARGKKRGS-LYMVAEEDM------IAVT-EAINSSSI 418
D EGY + GG KV+KG+LV G + LY++ + AVT + + +++
Sbjct: 369 DAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAAAVTKDEPSKTNL 428
Query: 419 WHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLEL 478
WH RLGHMSE GM + + + ++ CEHC+ GK ++V F+ + ++K L+
Sbjct: 429 WHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNTSVHRTKGI-LDY 487
Query: 479 VHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLK 538
VH D+WGP+ SLGG+RY +T IDD +RK W YFLK K D F+ FK+ K +E QT +
Sbjct: 488 VHADLWGPSRKPSLGGARYMLTIIDDYSRKEWPYFLKHKDDTFAAFKERKVMIERQTEKE 547
Query: 539 IKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQS 598
+K L +DNGGE+ S F +C + GI TIP TP+QNGVAERMNRT+ +ARCM +
Sbjct: 548 VKVLCTDNGGEFCSDAFDDYCRKEGIVRHHTIPYTPQQNGVAERMNRTIISKARCMLSNA 607
Query: 599 GLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDK 658
+ K FW +A NTA YLINR PS+PL+ + P E+W G S L+VFGC +Y +D+
Sbjct: 608 RMNKRFWAEAANTACYLINRSPSIPLNKKTPIEIWSGMPADYSQLRVFGCTAYAHVDN-- 665
Query: 659 RDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKD 707
KL+P+AIKC F+GYGS + GY+ W+ + K S NV FNE V++ D
Sbjct: 666 -GKLEPRAIKCLFLGYGSGVKGYKLWNPETNKTFMSRNVIFNEFVMFND 713
>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
sativa (japonica cultivar-group)]
Length = 1373
Score = 451 bits (1161), Expect = e-125
Identities = 265/717 (36%), Positives = 396/717 (54%), Gaps = 33/717 (4%)
Query: 16 FGFWKMLMEDYLYQKMLYQPLT---GKKPNDMKQEDWDLLDRQALGVIRLTLSKNVAFNI 72
F W++ M L Q Y GK+ + E+ D++AL +I+L L ++
Sbjct: 14 FSLWQVKMRGILAQTHDYDEALDNFGKRRAEWTAEEIRK-DQKALALIQLHLHNDILQEC 72
Query: 73 VNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSV 132
+ EKT+A+L L ++ +K+ + +LF L+M E +SV H+ F I++ L S+
Sbjct: 73 LTEKTSAELWLKLESICMSKDLTSKMQMKMKLFTLKMKEEDSVITHMAEFKKIVADLVSM 132
Query: 133 KITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSED---IRRKDSG 189
++ +D+E + L LL SLP+S+A + S + LK ++ D + +++ I ++ G
Sbjct: 133 EVKYDDEDLGLLLLCSLPNSYANFRDTILLSRDELTLK--EVYDALQNKEKMKIMVQNDG 190
Query: 190 ESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQ 249
SS+ G AL+ R + RGRSKS+ + C C K H ++
Sbjct: 191 SSSSK-GEALHVRGRTENRTSNEKNYDRRGRSKSKPPGNKKF-----CVYCKLKNHNIDE 244
Query: 250 CKAPRKKKNYQKR*DDDESANAAT--EEVADTLICSLDSPV--DSWVIDSGASFHTIPSK 305
CK + K+ K+ A+AA ++ D L+ D W++DS SFH +
Sbjct: 245 CKKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGCVAGHDEWILDSACSFHICTKR 304
Query: 306 ELLSNYICGKFGKVY-LADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIG 364
S+Y + G V + D P IVGIG + I++ +G TL NVR++PG+ RNLIS+
Sbjct: 305 NWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYIPGMSRNLISLS 364
Query: 365 QLDDEGYHTTFGGGAWKVTKGNLVVARGKK--------RGSLYMVAEEDMIAVT-EAINS 415
LD EGY + G KV+KG+LV +G RG ++ A+T + +
Sbjct: 365 TLDAEGYKYSGSDGVLKVSKGSLVCLKGDVNSAKLYVLRGCTLTGSDSAAAAITNDEPSK 424
Query: 416 SSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEK 475
+++WH RLGHMS GM + + + + CEHCI GK ++V F+ + +K
Sbjct: 425 TNLWHMRLGHMSHLGMTELMKRNLLKGCTSSKIKFCEHCIFGKHKRVQFNTSVHTTKGT- 483
Query: 476 LELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQT 535
L+ VH D+WGP+ SLGG+RY +T IDD +RKVW YFLK K D F+ FK WK +E QT
Sbjct: 484 LDYVHADLWGPSKKPSLGGARYMLTIIDDYSRKVWPYFLKHKDDTFTAFKNWKVMIERQT 543
Query: 536 GLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMR 595
K+K L++DNGGE+ S F +C + GI TIP TP+QNGVAERMNRT+ RARCM
Sbjct: 544 ERKVKLLRTDNGGEFCSHAFNDYCRQEGIVRHHTIPHTPQQNGVAERMNRTIISRARCML 603
Query: 596 IQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLID 655
+ + K FW +A +TA YLINR PS+PL+ + P EVW G S LKVFGC +Y +D
Sbjct: 604 SHARMNKRFWAEAASTACYLINRSPSIPLNKKTPIEVWSGTPADYSQLKVFGCTAYAHVD 663
Query: 656 SDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDRSSAE 712
+ KL+P+A+KC F+GYGS + GY+ W+ + K S +V FNESV++ + +E
Sbjct: 664 N---GKLEPRAVKCLFLGYGSGVKGYKLWNPETGKTFMSRSVVFNESVMFTNSLPSE 717
>gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1241
Score = 439 bits (1129), Expect = e-121
Identities = 249/646 (38%), Positives = 386/646 (59%), Gaps = 33/646 (5%)
Query: 96 NKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAA 155
+K+HL ++LF ++ + SV +H+++F I++ L S+++ +D E + L LL SLP S+A
Sbjct: 7 SKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLESMEVKYDEEDLGLILLCSLPSSYAN 66
Query: 156 TVTAVSNSARDNKLKFDDIRDLI-LSEDIRRKDSGESSNTFGSALNTESRGRGSQKS-HN 213
+ S RD L ++ D + E +++ E SN+ L RGR +K+ +N
Sbjct: 67 FRDTILYS-RDT-LTLKEVYDALHAKEKMKKMVPSEGSNSQAEGLVV--RGRQQEKNTNN 122
Query: 214 QSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQC-----KAPRKKKNYQKR*DDDES 268
+S+ + S RGRS++R R +C C R GH ++C K R +K K ++E
Sbjct: 123 KSRDKSSSIYRGRSKSRGRYK-SCKYCKRDGHDISECWKLQDKDKRTRKYIPKGKKEEEG 181
Query: 269 -ANAATEEVADTLI------CSLDSPVDSWVIDSGASFHTIPSKELLSNYICGKFGKVYL 321
A T+E +D + C+ S D W++D+ ++H P+++ + Y + G V +
Sbjct: 182 KAAVVTDEKSDAELLVAYAGCAQTS--DQWILDTACTYHMCPNRDWFATYEAVQGGTVLM 239
Query: 322 ADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWK 381
D P ++ GIG + I+ +G + TL +VRH+P +KR+LIS+ LD +GY + G G K
Sbjct: 240 GDDTPCEVAGIGTVQIKMFDGCIRTLLDVRHIPNLKRSLISLCTLDRKGYKYSGGDGILK 299
Query: 382 VTKGNLVVARGK-KRGSLYMVAEEDMI----AVTEAINSS---SIWHQRLGHMSEKGMKI 433
VTKG+LVV + K +LY + ++ AV++++++S ++WH RLGHMSE G+
Sbjct: 300 VTKGSLVVMKADIKYANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHMSEIGLAE 359
Query: 434 MASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLG 493
++ +G + L CEHCI GK ++V F+ + ++ L+ VH+D+WGPA S G
Sbjct: 360 LSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGI-LDYVHSDLWGPARKTSFG 418
Query: 494 GSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQ 553
G+RY +T +DD +RKVW YFLK K F VFK+WKT VE QT K+K L++DNG E S+
Sbjct: 419 GARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVKILRTDNGMELCSK 478
Query: 554 EFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAA 613
FK +C GI T+P TP+QNGVAERMNRT+ +ARCM + LPK FW +A++TA
Sbjct: 479 IFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISKARCMLSNASLPKQFWAEAVSTAC 538
Query: 614 YLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIG 673
YLINR PS +D + P EVW G + S L+VFGC +Y +D+ KL+P+ IKC F+G
Sbjct: 539 YLINRSPSYAIDKKTPIEVWSGSPANYSDLRVFGCTAYAHVDN---GKLEPRVIKCIFLG 595
Query: 674 YGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDRSSAESMSSSKQ 719
Y S + GY+ W + +K++ S NV F+ES++ D+ S S++
Sbjct: 596 YLSGVKGYKLWCPETKKVVISRNVVFHESIMLHDKPSTNVPVESQE 641
>ref|XP_476137.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475101|gb|AAT44170.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|46576026|gb|AAT01387.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1175
Score = 436 bits (1120), Expect = e-120
Identities = 241/620 (38%), Positives = 373/620 (59%), Gaps = 28/620 (4%)
Query: 103 RLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSN 162
+LF+ ++ E S+ HI+ F I++ L S+++ FD+E + L LL SLP S+A +
Sbjct: 2 KLFSHKLQESGSILNHISVFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRDTILL 61
Query: 163 SARDNKLKFDDIRDLILS-EDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRS 221
S ++L ++ + + + E ++ ++S++ G AL RGR Q+++N S R ++
Sbjct: 62 SR--SELTLAEVYEALQNREKMKGMVQSDASSSKGEALQV--RGRSEQRTYNDSNDRDKN 117
Query: 222 KSRGRSQTRVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDESANAATEEVADTLI 281
+SRGRS++R + C C +K HF +C K +N +KR D +++ + + +D+
Sbjct: 118 QSRGRSKSRGKK--FCKYCKKKNHFIEECW---KLQNKEKRKSDGKASVVTSADNSDSGD 172
Query: 282 CSLDSPV-----DSWVIDSGASFHTIPSKELLSNYICGKFGKVY-LADGKPLDIVGIGDI 335
C + V D W++D+ SFH +++ S+Y + G V + D P +IVGIG +
Sbjct: 173 CLVVFVVCVSSHDEWILDTTCSFHICINRDWFSSYKSVQNGDVVRMGDDNPREIVGIGSV 232
Query: 336 DIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKR 395
I++ +G TL +VRH+P + RNLIS+ LD EGY + GG KV+KG+LV G
Sbjct: 233 QIKTHDGMTRTLKDVRHIPRMARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMN 292
Query: 396 GS-LYMVAEEDMIA-VTEAINS------SSIWHQRLGHMSEKGMKIMASKGKMSNLKHVD 447
+ LY++ + VT A+ S +++WH RLGHMSE GM + + + +
Sbjct: 293 SANLYVLRGSTLHGYVTAAVVSKDEPSKTNMWHMRLGHMSELGMAELMKRNLLDGCTQGN 352
Query: 448 LGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTR 507
+ CEHC+ GK ++V F+ + ++K L+ VH D+WGP+ SLGG+RY +T IDD +R
Sbjct: 353 MKFCEHCVFGKHKRVKFNTSVHRTKGI-LDYVHADLWGPSRKPSLGGARYMLTIIDDYSR 411
Query: 508 KVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMI 567
KVW YFLK K D F+ FK+WK ++ QT ++K L++DNGG + S F +C + GI M
Sbjct: 412 KVWPYFLKHKDDTFAAFKEWKVMIKRQTEKEVKVLRTDNGGGFCSDAFDDYCRKEGIVMH 471
Query: 568 KTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQ 627
TIP TP+QNGVAERMNRT+ +ARCM + + K FW +A TA YLINR PS+ L+ +
Sbjct: 472 HTIPYTPQQNGVAERMNRTIISKARCMLSNARMNKRFWAEAAKTACYLINRSPSISLNKK 531
Query: 628 LPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQ 687
P EVW G + S L+VFGC +Y +++ KL+P+AIKC F+GYGS + GY+ W+ +
Sbjct: 532 TPIEVWSGMPANYSQLRVFGCTAYAHVNN---GKLEPRAIKCLFLGYGSGVKGYKLWNPE 588
Query: 688 NRKIIRSINVTFNESVLYKD 707
K S +V FNESV++ D
Sbjct: 589 TNKTFMSRSVVFNESVMFND 608
>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301697|pir||B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1335
Score = 430 bits (1106), Expect = e-119
Identities = 253/698 (36%), Positives = 393/698 (56%), Gaps = 38/698 (5%)
Query: 34 QPLTGKK---PNDMKQEDWDLLDR-----QALGVIRLTLSKNVAFNIVNEKTTADLMKAL 85
+PLT ++ P K+ D D + R +A VI L ++ V I KT A+ + L
Sbjct: 19 KPLTEEEEEDPEKRKKRDADEVARLERCDKAKNVIFLNVADKVLRKIELCKTAAEAWETL 78
Query: 86 SNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSL 145
++ ++V+ + +M E + E+I+ F I++ L+ ++I +E+ + L
Sbjct: 79 DRLFMIRSLPHRVYTQLSFYTFKMQENKKIDENIDDFLKIVADLNHLQIDVTDEVQAILL 138
Query: 146 LQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESSNTFGSALNTESRG 205
L SLP + V + S KL+ DD+ ++ + D K+ S N +RG
Sbjct: 139 LSSLPARYDGLVETMKYSNSREKLRLDDV--MVAARD---KERELSQNNRPVVEGHFARG 193
Query: 206 RGSQKSHNQ-SQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQC-KAPRKKKNYQKR* 263
R K++NQ ++G+ RS+S+ RV CW C ++GHF QC K + K+ Q+
Sbjct: 194 RPDGKNNNQGNKGKNRSRSKSADGKRV-----CWICGKEGHFKKQCYKWIERNKSKQQGS 248
Query: 264 DDDESANAATEEV---------ADTLICSLDSPVDSWVIDSGASFHTIPSKELLSNYICG 314
D+ ES+ A + E D + DS + WV+D+G SFH P K+ ++
Sbjct: 249 DNGESSLAKSTEAFNPAMVLLATDETLVVTDSIANEWVLDTGCSFHMTPRKDWFKDFKEL 308
Query: 315 KFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTT 374
G V + + + GIG I IR+S+G+ L +VR++P + RNLIS+G L+D G
Sbjct: 309 SSGYVKMGNDTYSPVKGIGSIKIRNSDGSQVILTDVRYMPNMTRNLISLGTLEDRGCWFK 368
Query: 375 FGGGAWKVTKGNLVVARGKKRGSLYMV----AEEDMIAVTEAINSSSIWHQRLGHMSEKG 430
G K+ KG + +G+KR +LY++ E + + E + +++WH RLGHMS+KG
Sbjct: 369 SQDGILKIVKGCSTILKGQKRDTLYILDGVTEEGESHSSAEVKDETALWHSRLGHMSQKG 428
Query: 431 MKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWG-PAPV 489
M+I+ KG + +L CE C+ GKQ +VSF+ A +K EKL VH+D+WG P
Sbjct: 429 MEILVKKGCLRREVIKELEFCEDCVYGKQHRVSFAPAQHVTK-EKLAYVHSDLWGSPHNP 487
Query: 490 KSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGE 549
SLG S+Y+++F+DD +RKVW+YFL+ K + F F +WK VENQ+ K+K L++DNG E
Sbjct: 488 ASLGNSQYFISFVDDYSRKVWIYFLRKKDEAFEKFVEWKKMVENQSDRKVKKLRTDNGLE 547
Query: 550 YDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAI 609
Y + F+KFC E GI KT TP+QNG+AER+NRT+ ++ R M +SG+ K FW +A
Sbjct: 548 YCNHYFEKFCKEEGIVRHKTCAYTPQQNGIAERLNRTIMDKVRSMLSRSGMEKKFWAEAA 607
Query: 610 NTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKC 669
+TA YLINR PS +++ LPEE W G LS L+ FGC++Y+ D + KL+P++ K
Sbjct: 608 STAVYLINRSPSTAINFDLPEEKWTGALPDLSSLRKFGCLAYIHAD---QGKLNPRSKKG 664
Query: 670 FFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKD 707
F Y + GY+ W +++K + S NV F E V++KD
Sbjct: 665 IFTSYPEGVKGYKVWVLEDKKCVISRNVIFREQVMFKD 702
>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1243
Score = 422 bits (1085), Expect = e-116
Identities = 257/723 (35%), Positives = 396/723 (54%), Gaps = 54/723 (7%)
Query: 16 FGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWD----LLDRQALGVIRLTLSKNVAFN 71
F W++ M L Q+ L L+G D + +DW DR+A+ I L LS N+
Sbjct: 17 FSLWQVKMRAVLAQQDLDDALSGF---DKRTQDWSNDEKKRDRKAISYIHLHLSNNILQE 73
Query: 72 IVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSS 131
++ E+T A L L + +K+HL ++LF ++ + SV +H+++F I++ L S
Sbjct: 74 VLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHLSAFKEIVADLES 133
Query: 132 VKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLI-LSEDIRRKDSGE 190
+++ +D + + L LL SLP S+A + S RD L ++ D E +++ + E
Sbjct: 134 MEVKYDEDDLGLILLCSLPSSYANFRGTILYS-RDT-LTLKEVYDAFHAKEKMKKMVTSE 191
Query: 191 SSNTFGSALNTESRGRGSQKS-HNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQ 249
SN+ L RGR +K+ NQS+ + S RGR+++R R +C C R GH ++
Sbjct: 192 GSNSQAEGLVV--RGRQQKKNTKNQSRDKSSSSYRGRTKSRGRYK-SCKYCKRDGHDISE 248
Query: 250 C-----KAPRKKKNYQKR*DDDESANAATEEVADTLICSLDSPVDSWVIDSGASFHTIPS 304
C K R K K ++E A D D+ ++ + A
Sbjct: 249 CWKLQDKDKRTGKYIPKGKKEEEGKAAVVT----------DEKSDAELLVAYAGCAQTSD 298
Query: 305 KELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIG 364
++ + Y + G V + D P ++ GIG + I+ +G + TL +V+H+P +KR+LIS+
Sbjct: 299 QDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNLKRSLISLY 358
Query: 365 QLDDEGYHTTFGGGAWKVTKGNLVVARGK-KRGSLYMVAEEDMIAVTEAI-------NSS 416
G KVTKG+LVV + K +LY + ++ A+ +++
Sbjct: 359 -------------GILKVTKGSLVVMKVDIKSANLYHLRGTTILGNVAAVFDSLSNSDAT 405
Query: 417 SIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKL 476
++WH RLGHMSE G+ ++ +G + L CEHCI GK ++V F+ + ++ L
Sbjct: 406 NLWHMRLGHMSEIGLAELSKRGLLDGQSIRKLKFCEHCIFGKHKRVKFNTSTHTTEGI-L 464
Query: 477 ELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTG 536
+ VH+D+WGPA S GG+RY +T +DD +RKVW YFLK K F FK+WKT VE QT
Sbjct: 465 DYVHSDLWGPAHKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDGFKEWKTMVERQTE 524
Query: 537 LKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRI 596
K+K L++DNG E+ S+ FK +C GI T P TP+QN VAERMNRT+ +ARCM
Sbjct: 525 RKVKILRTDNGMEFCSKIFKSYCKSEGIVCHYTAPHTPQQNDVAERMNRTIISKARCMLS 584
Query: 597 QSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDS 656
+GLPK FW +A++TA YLINR P +D + P EVW G + S L+VFGC +Y +D+
Sbjct: 585 NAGLPKQFWAEAVSTACYLINRSPGYAIDKKTPIEVWSGSPTNYSDLRVFGCTAYAHVDN 644
Query: 657 DKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDRSSAESMSS 716
KL+P+AIKC F+GY S + GY+ W + +K++ S NV F+ESV+ D+ S
Sbjct: 645 ---GKLEPRAIKCIFLGYASGVKGYKLWCPETKKVVISRNVVFHESVILHDKPSTNVPVE 701
Query: 717 SKQ 719
S++
Sbjct: 702 SQE 704
>gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana]
gi|25301707|pir||E86490 hypothetical protein F28L22.3 -
Arabidopsis thaliana
Length = 1356
Score = 413 bits (1062), Expect = e-114
Identities = 252/747 (33%), Positives = 401/747 (52%), Gaps = 72/747 (9%)
Query: 5 KVKIERFDG-RDFGFWKMLMEDYLYQKMLYQPLTGKK--------PNDMKQEDWD----- 50
+V+I+ F+G RDF WK+ ++ L L LT ++ KQE D
Sbjct: 7 RVEIKVFNGDRDFSLWKIRIQAQLGVLGLKDTLTDFSLTKTVPLTKSEAKQESGDGESSG 66
Query: 51 ----------LLDRQALGVIRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHL 100
QA +I +S V + + TTADL L+ Y + N+++
Sbjct: 67 TKEVPDPVKIEQSEQAKNIIINHISDVVLLKVNHYATTADLWATLNKKYMETSLPNRIYT 126
Query: 101 IRRLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSW------- 153
+L++ +M ++ ++++ F I+++L S++I D E+ + +L SLP S
Sbjct: 127 QLKLYSFKMVSTMTIDQNVDEFLRIVAELGSLEIQVDEEVQAILILNSLPASHIQLKHTL 186
Query: 154 -----AATVTAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESSNTFGSALNTESRGRGS 208
TV V++SA+ + + + DL D G+++ L T RGR
Sbjct: 187 KYGNKTLTVQDVTSSAKSLERELAEAVDL---------DKGQAA-----VLYTTERGRPL 232
Query: 209 QKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDES 268
++ NQ G+G+ +SR S+T+V CW C ++GH C + +KK + + +
Sbjct: 233 VRN-NQKGGQGKGRSRSNSKTKV----PCWYCKKEGHVKKDCYSRKKKMESEGQGE---- 283
Query: 269 ANAATEEVADTLICSLDSPV--DSWVIDSGASFHTIPSKELLSNYICGKFGKVYLADGKP 326
A TE++ + S++ + D W++DSG + H ++ ++ + L D
Sbjct: 284 AGVITEKLVFSEALSVNEQMVKDLWILDSGCTSHMTSRRDWFISFQEKGNTTILLGDDHS 343
Query: 327 LDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGN 386
++ G G I I + GT+ L NV++VP ++RNLIS G LD GY G G + K N
Sbjct: 344 VESQGQGTIRIDTHGGTIKILENVKYVPHLRRNLISTGTLDKLGYRHEGGEGKVRYFKNN 403
Query: 387 LVVARGKKRGSLYM-----VAEEDMIAVTEAINSSSIWHQRLGHMSEKGMKIMASKGKMS 441
RG LY+ V E A T+ + ++ +WH RLGHMS +K++A KG +
Sbjct: 404 KTALRGSLSNGLYVLDGSTVMSELCNAETDKVKTA-LWHSRLGHMSMNNLKVLAGKGLID 462
Query: 442 NLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWG-PAPVKSLGGSRYYVT 500
+ +L CEHC++GK +KVSF+ G+ + + L VH D+WG P S+ G +Y+++
Sbjct: 463 RKEINELEFCEHCVMGKSKKVSFN-VGKHTSEDALSYVHADLWGSPNVTPSISGKQYFLS 521
Query: 501 FIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCS 560
IDD TRKVW+YFLKSK + F F +WK+ VENQ K+K L++DNG E+ + F +C
Sbjct: 522 IIDDKTRKVWLYFLKSKDETFDKFCEWKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCK 581
Query: 561 ENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGP 620
E+GI +T TP+QNGVAERMNRT+ E+ RC+ +SG+ ++FW +A TAAYLINR P
Sbjct: 582 EHGIERHRTCTYTPQQNGVAERMNRTIMEKVRCLLNKSGVEEVFWAEAAATAAYLINRSP 641
Query: 621 SVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYG 680
+ +++ +PEE+W ++ HL+ FG ++YV D + KL P+A+K FF+GY + G
Sbjct: 642 ASAINHNVPEEMWLNRKPGYKHLRKFGSIAYVHQD---QGKLKPRALKGFFLGYPAGTKG 698
Query: 681 YRFWDEQNRKIIRSINVTFNESVLYKD 707
Y+ W + K + S NV F ESV+Y+D
Sbjct: 699 YKVWLLEEEKCVISRNVVFQESVVYRD 725
>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301696|pir||F84486 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1356
Score = 407 bits (1045), Expect = e-112
Identities = 241/750 (32%), Positives = 398/750 (52%), Gaps = 43/750 (5%)
Query: 1 LEEGKVKIERFDGR-DFGFWKMLMEDYLYQKMLYQPL-----TGKKPNDMKQEDWDLLDR 54
+ ++++E+FDGR D+ WK + ++ L L TG+K + + + D D ++
Sbjct: 1 MSTARIEVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEK 60
Query: 55 ------------QALGVIRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIR 102
+A I L+++ V I E T A ++ AL +Y N+++ +
Sbjct: 61 LEKFEALEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQ 120
Query: 103 RLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSN 162
+L++ +M E SV +I+ F II+ L ++ + +E + LL +LP ++ +
Sbjct: 121 KLYSFKMSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKY 180
Query: 163 SARDNKLKFDDIRDLILSEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSK 222
S+ + L D++ I S+++ +S L K N+++G+G K
Sbjct: 181 SSGKSILTLDEVAAAIYSKELELGSVKKSIKVQAEGLYV--------KDKNENKGKGEQK 232
Query: 223 SRGRSQT-RVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDES-------ANAATE 274
+G+ + + + CW C +GHF + C K + Q + ES A AA
Sbjct: 233 GKGKGKKGKSKKKPGCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKGNLAEAAGY 292
Query: 275 EVADTLICSLDSPVDSWVIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGD 334
V++ L + D W++D+G S+H +E + G V + + + G+G
Sbjct: 293 YVSEALSSTEVHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKTVSRVRGVGT 352
Query: 335 IDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKK 394
I +++S+G L NVR++P + RNL+S+G + GY G ++ GN V+ G++
Sbjct: 353 IRVKNSDGLTIVLTNVRYIPDMDRNLLSLGTFEKAGYKFESEDGILRIKAGNQVLLTGRR 412
Query: 395 RGSLYMV----AEEDMIAVTEAINSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGV 450
+LY++ + +AV + + + +WHQRL HMS+K M+I+ KG + K L V
Sbjct: 413 YDTLYLLNWKPVASESLAVVKRADDTVLWHQRLCHMSQKNMEILVRKGFLDKKKVSSLDV 472
Query: 451 CEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWG-PAPVKSLGGSRYYVTFIDDSTRKV 509
CE CI GK ++ SFS A +K EKLE +H+D+WG P SLG +Y+++ IDD TRKV
Sbjct: 473 CEDCIYGKAKRKSFSLAHHDTK-EKLEYIHSDLWGAPFVPLSLGKCQYFMSIIDDFTRKV 531
Query: 510 WVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKT 569
WVYF+K+K + F F +W VENQT ++K+L++DNG E+ ++ F FC GI +T
Sbjct: 532 WVYFMKTKDEAFEKFVEWVNLVENQTDRRVKTLRTDNGLEFCNKLFDGFCESIGIHRHRT 591
Query: 570 IPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLP 629
TP+QNGVAERMNRT+ E+ R M SGLPK FW +A +T LIN+ PS L++++P
Sbjct: 592 CAYTPQQNGVAERMNRTIMEKVRSMLSDSGLPKRFWAEATHTTVLLINKTPSSALNFEIP 651
Query: 630 EEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNR 689
++ W G S+L+ +GCV++V D KL+P+A K IGY + GY+ W R
Sbjct: 652 DKKWSGNPPVYSYLRRYGCVAFVHTDD---GKLEPRAKKGVLIGYPVGVKGYKVWILDER 708
Query: 690 KIIRSINVTFNESVLYKDRSSAESMSSSKQ 719
K + S N+ F E+ +YKD + S+++
Sbjct: 709 KCVVSRNIIFQENAVYKDLMQRQENVSTEE 738
>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1342
Score = 404 bits (1039), Expect = e-111
Identities = 238/756 (31%), Positives = 395/756 (51%), Gaps = 66/756 (8%)
Query: 1 LEEGKVKIERFDGR-DFGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWDLLDR----- 54
+ G+ ++E+FDG D+ WK + ++ L + L ++ ++ ++ D
Sbjct: 1 MSSGRAEVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDP 60
Query: 55 -----------------QALGVIRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANK 97
+A I L+L NV ++ +KT A ++K L ++ N+
Sbjct: 61 ETATSKLEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNR 120
Query: 98 VHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATV 157
++L +RL+ +M E ++ E++N F +IS L +VK+ +E + LL SLP +
Sbjct: 121 IYLKQRLYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLK 180
Query: 158 TAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQG 217
+ L ++I I S+ + SG+ L + RGR + ++
Sbjct: 181 ETLKYCK--TTLHLEEITSAIRSKILELGASGKLLKNNSDGLFVQDRGRSETRGKGPNKN 238
Query: 218 RGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDESANAATEEVA 277
+ RSKS+G +T CW C ++GHF QC K++N Q + A+ T V
Sbjct: 239 KSRSKSKGAGKT-------CWICGKEGHFKKQCYV-WKERNKQGSTSERGEASTVTARVT 290
Query: 278 DT--------LICSLDSPVDSWVIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDI 329
D L+ + D+W++D+G SFH K+ + ++ GKV + + ++
Sbjct: 291 DAAALVVSRALLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMGNDTYSEV 350
Query: 330 VGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVV 389
GIGD+ I++ +G+ L +VR++P + +NLIS+G L+D+G G + K +L V
Sbjct: 351 KGIGDVRIKNEDGSTILLTDVRYIPEMSKNLISLGTLEDKGCWFESKKGILTIFKNDLTV 410
Query: 390 ARGKKRGSLYMVAEEDMIAVTEAINS----SSIWHQRLGHMSEKGMKIMASKGKMSNLKH 445
GKK +LY + + I+ +S+WH RLGH+ KG++++ SKG H
Sbjct: 411 LTGKKESTLYFLQGTTLAGEANVIDKEKDETSLWHSRLGHIGAKGLQVLVSKG------H 464
Query: 446 VDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVK-SLGGSRYYVTFIDD 504
+D K +SF A +K +KL+ VH+D+WG V S+G +Y++TFIDD
Sbjct: 465 LD----------KNIMISFGAAKHVTK-DKLDYVHSDLWGSTNVPFSIGKCQYFITFIDD 513
Query: 505 STRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGI 564
TR+ W+YF+++K + FS F +WKT++ENQ K+K L +DNG E+ +QEF FC + G+
Sbjct: 514 FTRRTWIYFIRTKDEAFSKFVEWKTQIENQQDKKLKILITDNGLEFCNQEFDSFCRKEGV 573
Query: 565 RMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPL 624
+T TP+QNGVAERMNRT+ + RCM +SGL K FW +A +TA +LIN+ PS +
Sbjct: 574 IRHRTCAYTPQQNGVAERMNRTIMNKVRCMLSESGLGKQFWAEAASTAVFLINKSPSSSI 633
Query: 625 DYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFW 684
++ +PEE W G LK FG V+Y+ D + KL+P+A K F+GY + ++ W
Sbjct: 634 EFDIPEEKWTGHPPDYKILKKFGSVAYIHSD---QGKLNPRAKKGIFLGYPDGVKRFKVW 690
Query: 685 DEQNRKIIRSINVTFNESVLYKDRSSAESMSSSKQL 720
++RK + S ++ F E+ +YK+ + KQL
Sbjct: 691 LLEDRKCVVSRDIVFQENQMYKELQKNDMSEEDKQL 726
>emb|CAB79135.1| putative transposable element [Arabidopsis thaliana]
gi|3402755|emb|CAA20201.1| putative transposable element
[Arabidopsis thaliana] gi|7444415|pir||T05178
hypothetical protein T6K22.90 - Arabidopsis thaliana
Length = 1308
Score = 401 bits (1031), Expect = e-110
Identities = 241/746 (32%), Positives = 389/746 (51%), Gaps = 67/746 (8%)
Query: 5 KVKIERFDG-RDFGFWKMLME-------------DYLYQKMLYQPLTGKKPNDMKQEDWD 50
KV+I+ F+G RDF WK+ +E D+ K + + KK ++ + ++ D
Sbjct: 6 KVEIKTFNGDRDFSLWKIRIEAQLGVLGLKPALSDFTLTKTILVVKSEKKESESEDDETD 65
Query: 51 LLDR-------------QALGVIRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANK 97
QA I ++ V + + T A+L L+ ++ + N+
Sbjct: 66 SKKTEEVPDPIKFEQSDQAKNFIINHITDTVLLKVQHCVTAAELWATLNKLFMETSLPNR 125
Query: 98 VHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATV 157
++ RL++ +M + S+ ++ + F I+++L S++I E+ + +L SLP S+
Sbjct: 126 IYTQLRLYSFKMVDNLSIDQNTDEFLRIVAELGSLQIQVGEEVQAILILNSLPPSYIQLK 185
Query: 158 TAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESSNTF----GSALNTESRGRGSQKSHN 213
+ K ++D++ S ++ E T +AL T RGR Q +
Sbjct: 186 HTLKYGN-----KTLSVQDVVSSAKSLERELSEQKETIRAPASTALYTAERGR-PQTKNT 239
Query: 214 QSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDESANAAT 273
Q QG+GR +S +S+ +TCW C ++GH C A ++K + + A T
Sbjct: 240 QGQGKGRGRSNSKSR------LTCWFCKKEGHVKKDCYAGKRKLENEGQ----GKAGVIT 289
Query: 274 EEVADTLICSL--DSPVDSWVIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVG 331
E++ + S+ D WVIDSG ++H + S + + + L D ++ G
Sbjct: 290 EKLVYSEALSMYDQEAKDKWVIDSGCTYHMTSRMDWFSEFNENETTMILLGDDHTVESKG 349
Query: 332 IGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVAR 391
G + + + G++ L NVR VP ++RNLIS G LD GY G G + K N
Sbjct: 350 SGTVKVNTHGGSIRVLKNVRFVPNLRRNLISTGTLDKLGYKHEGGDGKVRFYKENKTALC 409
Query: 392 GKKRGSLYMVAEEDMIAVTEAINSSS----IWHQRLGHMSEKGMKIMASKGKMSNLKHVD 447
G LY++ ++ + S+ +WH RLGHMS MKI+A KG + +
Sbjct: 410 GNLVNGLYVLDGHTVVNENCNVEGSNEKTELWHCRLGHMSLNNMKILAEKGLLEKKDIKE 469
Query: 448 LGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTR 507
L CE+C++GK +K+SF+ G+ E L +H D+WG +Y+++ IDD +R
Sbjct: 470 LSFCENCVMGKSKKLSFN-VGKHITDEVLGYIHADLWG---------KQYFLSIIDDKSR 519
Query: 508 KVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMI 567
KVW+ FLK+K + F F +WK VENQ K+K L++DNG E+ + +F +FC +NGI
Sbjct: 520 KVWLMFLKTKDETFERFCEWKELVENQVNKKVKILRTDNGLEFCNLKFDEFCKQNGIERH 579
Query: 568 KTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQ 627
+T TP+QNGVA+RMNRTL E+ RC+ +SGL ++FW +A TAAYL+NR P+ +D+
Sbjct: 580 RTCTYTPQQNGVAKRMNRTLMEKVRCLLNESGLEEVFWAEAAATAAYLVNRSPASAVDHN 639
Query: 628 LPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQ 687
+PEE+W K+ HL+ FGC++YV +D + KL P+A+K F+GY GY+ W
Sbjct: 640 VPEELWLDKKPGYKHLRRFGCIAYVHLD---QGKLKPRALKGVFLGYPQGTKGYKVWLLD 696
Query: 688 NRKIIRSINVTFNESVLYKD-RSSAE 712
K + S N+ FNE+ +YKD R S+E
Sbjct: 697 EEKCVISRNIVFNENQVYKDIRESSE 722
>emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir||S05465
retrovirus-related polyprotein - Arabidopsis thaliana
retrotransposon Ta1-3
Length = 1291
Score = 398 bits (1023), Expect = e-109
Identities = 238/741 (32%), Positives = 386/741 (51%), Gaps = 48/741 (6%)
Query: 12 DGRDFGFWKMLMEDYL-----------YQKMLYQPLT---GKKPNDMKQEDWDLLDRQAL 57
+ DF WK M+ +L + + P+ GKK D ++ +
Sbjct: 20 ENSDFSLWKTCMKAHLGLAGLKGIIDDFDLTMTVPIPKSEGKKIEDGDEQGDSSQTKIVP 79
Query: 58 GVIRLTLSKNVAFNIV-------------NEKTTADLMKALSNMYEKPFAANKVHLIRRL 104
++++ S+N A NI+ + K+ A++ + L+ Y + N++++ +
Sbjct: 80 DLVKIEKSEN-AMNIIIAHVGDAVLRKIDHCKSAAEMWETLNKQYMETSLPNRIYVQLKF 138
Query: 105 FNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSA 164
++ +M + S+ E++N F I+++LSS++I E+ + L L ++ +
Sbjct: 139 YSFKMNDTKSINENVNEFLKIVAELSSLEINVVEEVRAILFLNRLSSRYSQLKHTLKYGN 198
Query: 165 RDNKLKFDDIRDLILSEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSR 224
+ LK D+ S + + E+ + L T R R ++ N ++G + R
Sbjct: 199 KALSLK--DVISAARSLERELNEQKETDKNTSTVLYTNERSRPQTRNQNHNKG---GQGR 253
Query: 225 GRSQTRVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDESANAATEEVADTLICSL 284
GRS++ +TCW C ++GH A ++K + + A TE++ + S+
Sbjct: 254 GRSKSNSNAKLTCWYCKKEGHVKKDYFARKRKLESE----NPGEAGVITEKLVFSEALSV 309
Query: 285 DSPV--DSWVIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNG 342
+ D WV+DSG + H ++ ++ + L D + G G I I + G
Sbjct: 310 NDLAVRDIWVLDSGCTSHMSARRDWFCSFREDGGPTILLGDDHSVKSQGQGSIKIETHGG 369
Query: 343 TLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYMVA 402
T+ L NV++VP ++RNLIS G LD GY G G + K RG+ LY++
Sbjct: 370 TIIGLENVKYVPELRRNLISTGTLDKRGYKHEGGDGKVRYFKNQKTALRGELVNGLYILD 429
Query: 403 EEDMIAVTEAINSSS----IWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGK 458
+++ T S +WH RLGH+ MK++A KG +S + L CE+C++GK
Sbjct: 430 GNTVLSETCVAEGSKGKTELWHSRLGHIGLNNMKVLAGKGLVSKEEIRVLDFCENCVMGK 489
Query: 459 QRKVSFSKAGRKSKSEKLELVHTDVWGPAPVK-SLGGSRYYVTFIDDSTRKVWVYFLKSK 517
+KVSF+ G+ + + L VH D+WG V SL G++Y+++ IDD TRKVW+YFL+SK
Sbjct: 490 AKKVSFN-VGKHNSEDVLRYVHADLWGSTNVTPSLSGNKYFLSIIDDKTRKVWLYFLRSK 548
Query: 518 SDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQN 577
+ F F +WK VENQ K+K L++DNG E+ + +F +C E+GI KT TP+QN
Sbjct: 549 DETFDRFCEWKELVENQQNKKVKCLRTDNGLEFCNLKFDAYCKEHGIERHKTCTYTPQQN 608
Query: 578 GVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKE 637
GVAERMNRT+ E+ RCM +SGL + FW +A TAAYLINR P+ +D+ +PEE+W K+
Sbjct: 609 GVAERMNRTIMEKVRCMLNESGLGEEFWAEAAATAAYLINRSPASAIDHNVPEELWLNKK 668
Query: 638 VSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINV 697
HL+ FG ++YV ID + KL P+A+K FIGY + GY+ W + K + S NV
Sbjct: 669 PGYKHLRRFGSIAYVHID---QGKLKPRALKGIFIGYPAGTKGYKIWLLEEHKCVISRNV 725
Query: 698 TFNESVLYKDRSSAESMSSSK 718
F+E +YKD E + S+
Sbjct: 726 LFHEESVYKDTMKKERVVESE 746
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.317 0.134 0.396
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,206,169,069
Number of Sequences: 2540612
Number of extensions: 51343581
Number of successful extensions: 158306
Number of sequences better than 10.0: 6385
Number of HSP's better than 10.0 without gapping: 5447
Number of HSP's successfully gapped in prelim test: 946
Number of HSP's that attempted gapping in prelim test: 145834
Number of HSP's gapped (non-prelim): 10608
length of query: 720
length of database: 863,360,394
effective HSP length: 135
effective length of query: 585
effective length of database: 520,377,774
effective search space: 304420997790
effective search space used: 304420997790
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)
Lotus: description of TM0220.7