
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC145330.14 - phase: 0
(1199 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
emb|CAD41085.2| OSJNBb0011N17.2 [Oryza sativa (japonica cultivar...   774   0.0
gb|AAP54332.1| putative copia-like polyprotein [Oryza sativa (ja...   665   0.0
ref|XP_470422.1| putative polyprotein [Oryza sativa (japonica cu...   662   0.0
gb|AAL68641.1| polyprotein [Oryza sativa (japonica cultivar-group)]   658   0.0
ref|XP_470025.1| putative polyprotein [Oryza sativa (japonica cu...   639   0.0
ref|NP_918613.1| polyprotein [Oryza sativa (japonica cultivar-gr...   637   0.0
gb|AAT40550.1| putative receptor kinase [Solanum demissum]            593   e-168
emb|CAE05707.2| OSJNBb0065J09.3 [Oryza sativa (japonica cultivar...   555   e-156
emb|CAA36616.1| unnamed protein product [Solanum tuberosum] gi|4...   529   e-148
emb|CAB81200.1| putative retrotransposon polyprotein [Arabidopsi...   517   e-145
pir||F86470 probable retroelement polyprotein [imported] - Arabi...   513   e-143
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop...   510   e-143
gb|AAU89728.1| putative retroelement pol polyprotein-like [Solan...   509   e-142
pir||E96608 probable retroelement polyprotein F25P12.89 [importe...   509   e-142
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi...   506   e-141
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...   506   e-141
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi...   504   e-141
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf...   503   e-140
gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi...   500   e-139
gb|AAR24647.1| At2g23330 [Arabidopsis thaliana] gi|22655202|gb|A...   500   e-139
>emb|CAD41085.2| OSJNBb0011N17.2 [Oryza sativa (japonica cultivar-group)]
gi|50925209|ref|XP_472906.1| OSJNBb0011N17.2 [Oryza
sativa (japonica cultivar-group)]
Length = 1262
Score = 774 bits (1999), Expect = 0.0
Identities = 483/1257 (38%), Positives = 676/1257 (53%), Gaps = 181/1257 (14%)
Query: 92 NILKFEVQ----RLDGKNYMEWSQTVRLILDGKGKLGFITGEIQMP-STTDPNYRFWKSE 146
+ILK E+ +L KNY+ WS+ LIL KG G++TGE++ P +T+ ++ W +
Sbjct: 38 SILKIELMQNEIKLGVKNYLSWSRRALLILKTKGLEGYVTGEVKEPENTSSVEWKTWSTT 97
Query: 147 NSTIIAWLLSTMESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSD 206
NS ++AWLL+++ I + +A +W + YS N + + + ++ +Q +
Sbjct: 98 NSLVVAWLLTSLIPAIATTVETISSASEMWKTLTKLYSGEGNVMLMVEAQEKISALRQGE 157
Query: 207 RDVTTYYNELMALWQELDLCYDDNWRCTEDSVLFLKRQ-ENDRVFVLLAGLNNCLDEVRG 265
R V Y EL +LW +LD YD D + +K+ E RV L GLN + R
Sbjct: 158 RSVAEYVAELKSLWSDLDH-YDPLGLEHSDCIAKMKKWVERRRVIEFLKGLNPEFEGRRD 216
Query: 266 RILGRIPLPTLQETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDH 325
+ + LPTL E + + +EE ++ V+ + + + +G E + C +
Sbjct: 217 AMFHQTTLPTLDEAIAAMAQEELKKKVLPSAAPCSPSPTYAIVQGKETRE-------CFN 269
Query: 326 CNRQWHTRDTCWK----LHGKPPNWKKKGGKEGRALQATTSDQEHQSSSSSFPFTKEQLD 381
C H C +G+ + G + GR ++ + L+
Sbjct: 270 CGEMGHLMRDCHAPRKPTYGRGRGVDRGGTRGGRGYAGRSNRGRGYGYRGDYKANAVTLE 329
Query: 382 QLYKMFGSQTPSCSIAQI-----GNFPNTALVSVKPS-PTWIIDSGATDHMTGESSLFAS 435
+ S T ++A G+F N A +S+ S +WI+DSGA+ H+TG S F S
Sbjct: 330 E----GSSGTTPDNVANFAHSTSGSF-NQAFMSMNTSHSSWILDSGASRHVTGMSGEFTS 384
Query: 436 YSPCAGNHK--IKIADGSLSAIAGKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLD 493
Y P + HK I+ ADG+ CQ + TGK +G GL+YLD
Sbjct: 385 YKPYSFAHKETIQTADGT-------SCQ-----------ERRTGKKLGIGIMRDGLWYLD 426
Query: 494 ---------------------NGPDFKDQPQQIMLSQNSETNIGAPKENAQESIMELNPL 532
NG + + ++++ + PK E++M L
Sbjct: 427 RRGTNEDVCALMASTSKEVTENGVAERKNRHLLEIARSLMYTMNVPKFLWSEAVMTAAYL 486
Query: 533 VNEVPTNS-----------------------GATFSNQNHNEQILDIDSNEEEPEMPQQN 569
+N P+ G T ++H I +D + +
Sbjct: 487 INRTPSRILGMKTPYEMIFGKNEFVVPPRVFGCTCFVRDHRPSIGKLDPRAVKCIFIGYS 546
Query: 570 DSNKETK-------NRFNSSDPIWKGNV------------------YERRDHKRGDEGPI 604
S K K F S D ++ +V R DH + EG I
Sbjct: 547 SSQKGYKCWSPSERRTFVSMDVTFRESVPFYGEKTDISSLFVDLDDLTRGDHDQQKEGEI 606
Query: 605 L----------------------QPCQESE---PRNDPNHHPNPGKSSIPKCRGKSSSIT 639
L P QE E P + N + +P +
Sbjct: 607 LGLKENEQSKGKIVVGEIPCAIGDPVQEQEWRKPHEEENLQVYTRRMRLPTTQQVEVDDQ 666
Query: 640 TSDD-------------------PDLHIPIAIRKPVRSCTKHP----------------- 663
SDD + ++PIAIRK +RS P
Sbjct: 667 VSDDLTHVQVSSESGGEQIEIREEESNLPIAIRKGMRSNAGKPPQRYGFEIGDESGDEND 726
Query: 664 MAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKIMT 723
+A +VSY++LSS++ AF + L++ IPK+ +EA + P+W +A+L+E+ ALEKNKTW +++
Sbjct: 727 IANYVSYTSLSSTYKAFVASLNSAIIPKDWKEAKQDPRWHQAMLDELEALEKNKTWDLVS 786
Query: 724 LPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRVL 783
P GK V CKWV+ VK N D VERYKARLVAKG++Q YGIDY ETFAPVAK++TVR +
Sbjct: 787 YPNGKKVVNCKWVYAVKQNPDGKVERYKARLVAKGYSQTYGIDYDETFAPVAKMSTVRTI 846
Query: 784 LSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFED-KFGLNVCKLQKSLYGLKQSP 842
+S AVN DWPL+QLDVKNAFL+GDLQEEVYM+ PPGF + V +L+KSLYGLKQSP
Sbjct: 847 ISCAVNFDWPLHQLDVKNAFLHGDLQEEVYMEIPPGFATLQTKGKVLRLKKSLYGLKQSP 906
Query: 843 RAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKK 902
RAWF++F ++ GY Q DHT+F S D I IL VYVDD+I+TG+D E+ RLK+
Sbjct: 907 RAWFDRFRRAMCAMGYKQCNGDHTVFYHHSGD-HITILAVYVDDMIITGNDCSEITRLKQ 965
Query: 903 NLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNA 962
NL+KEFE+KDLG LKYFLG+E+ARS +GIV+SQRKY LDLL +TGM GCRPA TP++ N
Sbjct: 966 NLSKEFEVKDLGQLKYFLGIEIARSPRGIVLSQRKYALDLLSDTGMLGCRPASTPVDQNH 1025
Query: 963 KLWEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRY 1022
KL + PV+ RYQRLVG+LIYL HTRPDI ++VS+VS++MH P H++AVYRILRY
Sbjct: 1026 KLCAESGNPVNKERYQRLVGRLIYLCHTRPDITYAVSMVSRYMHDPRSGHMDAVYRILRY 1085
Query: 1023 LKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVA 1082
LK +PGKGL+FKK +V + DADWA DR+ST+GYC +V GNLV+WRSKKQ VV+
Sbjct: 1086 LKGSPGKGLWFKKNGHLEVEGYCDADWASCPDDRRSTSGYCVFVGGNLVSWRSKKQPVVS 1145
Query: 1083 RSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKH 1142
RS+AEAE+RAM+ + ELLW++ LL EL L +D P+KL+CD+K+AISIA+NPVQHDRTKH
Sbjct: 1146 RSTAEAEYRAMSVSLSELLWLRNLLSELMLPVDTPMKLWCDNKSAISIANNPVQHDRTKH 1205
Query: 1143 IEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYAPT 1199
+E+DR FIKEK+ G + L +V S Q AD TK L K+GM +IY P+
Sbjct: 1206 VELDRFFIKEKLDEGVLELEFVMSGGQVADCFTKGLGVKECNSSCDKMGMIDIYHPS 1262
>gb|AAP54332.1| putative copia-like polyprotein [Oryza sativa (japonica
cultivar-group)] gi|37535486|ref|NP_922045.1| putative
copia-like polyprotein [Oryza sativa (japonica
cultivar-group)] gi|22094347|gb|AAM91874.1| putative
copia-like polyprotein [Oryza sativa (japonica
cultivar-group)]
Length = 894
Score = 665 bits (1716), Expect = 0.0
Identities = 329/573 (57%), Positives = 427/573 (74%), Gaps = 19/573 (3%)
Query: 645 DLHIPIAIRKPVRSCTKHP-----------------MAKFVSYSNLSSSFAAFTSQLSTV 687
+ ++PIAIRK VRS P +A +VSY++L S++ AF + L++V
Sbjct: 323 ETNLPIAIRKGVRSNAGKPPQRYGFEAQGVNDDENNIANYVSYASLLSTYKAFVTSLNSV 382
Query: 688 EIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTV 747
EIP + +EA + P+W +A+LEE+ ALEKNKTW ++ P GK V CKWV+TVK N D V
Sbjct: 383 EIPNDWREAKQDPRWHQAMLEELEALEKNKTWDLVPFPKGKKVVNCKWVYTVKQNPDENV 442
Query: 748 ERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGD 807
ERYKARLVAKG++Q YGIDY ETFAPVAK++TVR L+S A N DWPL+QLDVKNAFL+GD
Sbjct: 443 ERYKARLVAKGYSQTYGIDYDETFAPVAKMSTVRTLISCAANFDWPLHQLDVKNAFLHGD 502
Query: 808 LQEEVYMDSPPGFE-DKFGLNVCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHT 866
LQEEVYM+ PPGF + V +L+KSLYGLKQSPRAWF++F ++ GY Q DHT
Sbjct: 503 LQEEVYMEIPPGFATSQTEGKVLRLKKSLYGLKQSPRAWFDRFRRAMCGMGYKQCNGDHT 562
Query: 867 LFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVAR 926
+F R N G IL+VYVDD+I+TGDD +E+ RLK+NL+KEFE+KDLG LKYFLG+E+AR
Sbjct: 563 VFYRH-NRGLKTILVVYVDDMIITGDDCLEISRLKQNLSKEFEVKDLGQLKYFLGIEIAR 621
Query: 927 SRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQRLVGKLIY 986
S +GIV+SQRKY+LDLL +TGM GCRPA T +E N KL + PV+ RYQRLVG+LIY
Sbjct: 622 SPRGIVLSQRKYVLDLLSDTGMLGCRPASTLIEQNHKLCAESGDPVNKERYQRLVGRLIY 681
Query: 987 LAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTD 1046
L HTRPDI ++VSVVS++MH P H++ VYRILRYLK++PGKG++FKK D+ + D
Sbjct: 682 LCHTRPDITYAVSVVSRYMHDPRSGHMDVVYRILRYLKASPGKGIWFKKNGHLDMEGYCD 741
Query: 1047 ADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKL 1106
ADW + DR+ST+GYC ++ GNLV+WRSKK+ VV+RS+AEAE+R+M+ + ELLW++ L
Sbjct: 742 ADWGSCLDDRRSTSGYCVFIGGNLVSWRSKKESVVSRSTAEAEYRSMSMSLSELLWLKNL 801
Query: 1107 LEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTS 1166
L ELKL +KL+CD+K+AI+IA+NPVQHDRTKH+EIDR FIKE++ GT+ L +V S
Sbjct: 802 LAELKLSTSTSMKLWCDNKSAINIANNPVQHDRTKHVEIDRFFIKERMDEGTLNLGFVNS 861
Query: 1167 NEQTADILTKSLARPNFERLIVKLGMTNIYAPT 1199
EQ D LTK+L K+GM +IY P+
Sbjct: 862 GEQVVDSLTKALGARECTSSCSKMGMIDIYRPS 894
>ref|XP_470422.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|27573360|gb|AAO20078.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1299
Score = 662 bits (1708), Expect = 0.0
Identities = 332/570 (58%), Positives = 421/570 (73%), Gaps = 19/570 (3%)
Query: 648 IPIAIRKPVRSCTKHP-----------------MAKFVSYSNLSSSFAAFTSQLSTVEIP 690
+PIAIRK VRS P ++ +VSY +LSS++ AF + L +V+IP
Sbjct: 731 LPIAIRKSVRSNAGKPPLRYGFEAQDEGDDENNISNYVSYDSLSSTYKAFIASLDSVQIP 790
Query: 691 KNVQEALKIPKWKEAVLEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERY 750
K+ +EA + P+W +A+L+E+ ALEKNKTW ++ P GK V CKWV+TVK N D VERY
Sbjct: 791 KDWREAKQDPRWHQAMLDELEALEKNKTWDLVPFPKGKKIVNCKWVYTVKQNPDGKVERY 850
Query: 751 KARLVAKGFTQAYGIDYSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQE 810
KARLVAKG++Q YGIDY ETFAPVAK++TVR L+S A N DWPL+QLDVKNAFL+ DLQE
Sbjct: 851 KARLVAKGYSQTYGIDYDETFAPVAKMSTVRTLISCAANFDWPLHQLDVKNAFLHRDLQE 910
Query: 811 EVYMDSPPGFE-DKFGLNVCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFM 869
EVYMD PPGF + V +L+KSLYGLKQSPRAWF++F ++ Y Q DHT+F
Sbjct: 911 EVYMDVPPGFATSQTKGKVLRLKKSLYGLKQSPRAWFDRFRRAMCAMDYKQCNGDHTVFY 970
Query: 870 RFSNDGKIAILIVYVDDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRK 929
S D I IL VYVDD+I+TG+D +E+ RLK+NL+KEFE+KDLG L+YFLG+E+ARS +
Sbjct: 971 HHSGD-HITILAVYVDDMIITGNDCLEITRLKRNLSKEFEVKDLGQLRYFLGIEIARSPR 1029
Query: 930 GIVVSQRKYILDLLEETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQRLVGKLIYLAH 989
GIV+SQRKY+LDLL ETGM GC P TP++ N KL + PV+ RYQRLVG+LIYL H
Sbjct: 1030 GIVISQRKYVLDLLSETGMLGCCPVSTPIDQNHKLCAESGDPVNRERYQRLVGRLIYLCH 1089
Query: 990 TRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADW 1049
TRPDI ++VS+VS++MH P H+EAVYRILRYLK +PGKGL+FKK + + DADW
Sbjct: 1090 TRPDITYAVSMVSRYMHDPRSSHMEAVYRILRYLKGSPGKGLWFKKNGHLKIEGYCDADW 1149
Query: 1050 AGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEE 1109
A + DR+ST+GYC YV GNLV+WRSKKQ VV+RS+AEAE+RAMA + ELLW++ LL E
Sbjct: 1150 ASCLDDRRSTSGYCVYVGGNLVSWRSKKQSVVSRSTAEAEYRAMAASLSELLWLRNLLVE 1209
Query: 1110 LKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQ 1169
LK+ + P+KL CD+K+AI+IA+NPVQHDRTKH+EIDR FIKEK+ G + L +VTS Q
Sbjct: 1210 LKILGNTPMKLLCDNKSAINIANNPVQHDRTKHVEIDRFFIKEKLDEGVLELGFVTSGGQ 1269
Query: 1170 TADILTKSLARPNFERLIVKLGMTNIYAPT 1199
AD LTK L K+GM +IY P+
Sbjct: 1270 VADCLTKGLGVKECNCSCDKMGMIDIYHPS 1299
Score = 123 bits (309), Expect = 3e-26
Identities = 104/425 (24%), Positives = 189/425 (44%), Gaps = 42/425 (9%)
Query: 100 RLDG-KNYMEWSQTVRLILDGKGKLGFITGEIQMPST-TDPNYRFWKSENSTIIAWLLST 157
+L+G KNY+ WS+ LIL KG G++TGEI+ P + ++ W + NS ++AWLL++
Sbjct: 50 KLEGVKNYLSWSRRALLILKTKGLEGYVTGEIKEPENISSVEWKTWSTTNSLVVAWLLTS 109
Query: 158 MESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELM 217
+ I + +A +W + YS N + + + ++ +Q +R V Y EL
Sbjct: 110 LIPAIATTVETISSASEMWKTLTNLYSGEGNVMLMVEAQEKISVLRQGERSVAEYVAELK 169
Query: 218 ALWQELDLCYDDNWRCTEDSVLFLKRQ-ENDRVFVLLAGLNNCLDEVRGRILGRIPLPTL 276
LW +LD YD D + +++ E RV L GLN+ + R + + LP+L
Sbjct: 170 HLWSDLD-HYDPLGLEHPDCIAKMRKWIERRRVIEFLKGLNSEFEGRRDAMFHQTTLPSL 228
Query: 277 QETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDHCNRQWH-TRDT 335
E + + +EE ++ V+ + S + +V + E + C +C H RD
Sbjct: 229 DEAIAAMAQEELKKKVLPSATPSSPSPTYVVAQSKETRE-------CFNCGEMGHLIRD- 280
Query: 336 CWKLHGKPPNWKKKGGKEGRALQATTSDQEHQSSSSSFPFTKEQLDQLYKMFGSQTPSCS 395
++ KP + + G G A + + + + + + + S + S +
Sbjct: 281 -YRAPRKPSYGRGRFGDRGGA-RGGRGYAGRGNRGRGYEYRSDHRANVVTLEESCSGSTN 338
Query: 396 I-------AQIGNFPNTALVSVKPS-PTWIIDSGATDHMTGESSLFASYSPCAGNHKIKI 447
+ + GN N A +S+ S WI+DSGA+ H+T + +
Sbjct: 339 VDVANLVHSSSGN-SNQAFMSINSSHSNWILDSGASRHVT-----------------VNL 380
Query: 448 ADGSLSAIAGKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLDNGPDFKDQPQQIML 507
S S + +C+ + +C+ ++ TGK +G + GL+YLD +D +M
Sbjct: 381 VSIS-SLVDHMNCRVSLDRENCLIQERETGKKLGIGVRRDGLWYLDRKETSEDVCLALMA 439
Query: 508 SQNSE 512
+ E
Sbjct: 440 PTSEE 444
>gb|AAL68641.1| polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1472
Score = 658 bits (1698), Expect = 0.0
Identities = 329/575 (57%), Positives = 425/575 (73%), Gaps = 19/575 (3%)
Query: 643 DPDLHIPIAIRKPVRSCTKHP-----------------MAKFVSYSNLSSSFAAFTSQLS 685
+ + ++PIAIRK +RS P +A +VSY++LSS++ AF + L+
Sbjct: 899 EEESNLPIAIRKGMRSNAGKPPQRYGFEIGDESGDENDIANYVSYTSLSSTYRAFVASLN 958
Query: 686 TVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDN 745
+ IPK+ +EA + P+W +A+L+E+ ALEKNKTW +++ P GK V CKWV+ VK N D
Sbjct: 959 SAIIPKDWKEAKQDPRWHQAMLDELEALEKNKTWDLVSYPNGKKVVNCKWVYAVKQNPDG 1018
Query: 746 TVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLN 805
VERYKARLVAKG++Q YGIDY ETFAPVAK++TVR ++S AVN DWPL+QLDVKNAFL+
Sbjct: 1019 KVERYKARLVAKGYSQTYGIDYDETFAPVAKMSTVRTIISCAVNFDWPLHQLDVKNAFLH 1078
Query: 806 GDLQEEVYMDSPPGFED-KFGLNVCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSD 864
GDLQEEVYM+ PPGF + V +L+KSLYGLKQSPRAWF++F ++ GY Q D
Sbjct: 1079 GDLQEEVYMEIPPGFATLQTKGKVLRLKKSLYGLKQSPRAWFDRFRRAMCAMGYKQCNGD 1138
Query: 865 HTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEV 924
HT+F S D I IL VYVDD+I+TG+D E+ RLK+NL+KEFE+KDLG LKYFLG+E+
Sbjct: 1139 HTVFYHHSGD-HITILAVYVDDMIITGNDCSEITRLKQNLSKEFEVKDLGQLKYFLGIEI 1197
Query: 925 ARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQRLVGKL 984
ARS +GIV+SQRKY LDLL +TGM GCRPA TP++ N KL + PV+ RYQRLVG+L
Sbjct: 1198 ARSPRGIVLSQRKYALDLLSDTGMLGCRPASTPVDQNHKLCAESGNPVNKERYQRLVGRL 1257
Query: 985 IYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIF 1044
IYL HTRPDI ++VS+VS++MH P H++AVYRILRYLK +PGKGL+FKK +V +
Sbjct: 1258 IYLCHTRPDITYAVSMVSRYMHDPRSGHMDAVYRILRYLKGSPGKGLWFKKNGHLEVEGY 1317
Query: 1045 TDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQ 1104
DA WA DR+ST+GYC +V GNLV+WRSKKQ VV+RS+AEAE+RAM+ + ELLW++
Sbjct: 1318 CDAHWASCPDDRRSTSGYCVFVGGNLVSWRSKKQPVVSRSTAEAEYRAMSVSLSELLWLR 1377
Query: 1105 KLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYV 1164
LL EL L +D P+KL+CD+K+AISIA+NPVQHDRTKH+E+DR FIKEK+ G + L +V
Sbjct: 1378 NLLSELMLPVDTPMKLWCDNKSAISIANNPVQHDRTKHVELDRFFIKEKLDEGVLELEFV 1437
Query: 1165 TSNEQTADILTKSLARPNFERLIVKLGMTNIYAPT 1199
S Q AD TK L K+GM +IY P+
Sbjct: 1438 MSGGQVADCFTKGLGVKECNSSCDKMGMIDIYHPS 1472
Score = 147 bits (372), Expect = 2e-33
Identities = 118/455 (25%), Positives = 201/455 (43%), Gaps = 66/455 (14%)
Query: 92 NILKFEVQ----RLDG-KNYMEWSQTVRLILDGKGKLGFITGEIQMP-STTDPNYRFWKS 145
+ILK E+ +L+G KNY+ WS+ LIL KG G++TGE++ P +T+ ++ W +
Sbjct: 38 SILKIELMQNEIKLEGVKNYLSWSRRALLILKTKGLEGYVTGEVKEPENTSSVEWKTWST 97
Query: 146 ENSTIIAWLLSTMESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQS 205
NS ++AWLL+++ I + +A +W + YS N + + + ++ +Q
Sbjct: 98 TNSLVVAWLLTSLIPAIATTVETISSASEMWKTLTKLYSGEGNVMLMVEAQEKISALRQG 157
Query: 206 DRDVTTYYNELMALWQELDLCYDDNWRCTEDSVLFLKR-QENDRVFVLLAGLNNCLDEVR 264
+R V Y EL +LW +LD YD D + +K+ E RV L GLN + R
Sbjct: 158 ERSVAEYVAELKSLWSDLD-HYDPLGLEHSDCIAKMKKWVERRRVIEFLKGLNPEFEGRR 216
Query: 265 GRILGRIPLPTLQETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCD 324
+ + LPTL E + + +EE ++ V+ + + + +G E + C
Sbjct: 217 DAMFHQTTLPTLDEAIAAMAQEELKKKVLPSAAPCSPSPTYAIVQGKETRE-------CF 269
Query: 325 HCNRQWHTRDTCW----KLHGKPPNWKKKGGKEGRALQATTSDQEHQSSSSSFPFTKEQL 380
+C H C +G+ + G + GR ++ + L
Sbjct: 270 NCGEMGHLMRDCHAPRKPTYGRGRGVDRGGTRGGRGYAGRSNRGRGYGYRGDYKANAVTL 329
Query: 381 DQLYKMFGSQTPSCSIAQI-----GNFPNTALVSVKPS-PTWIIDSGATDHMTGESSLFA 434
++ S T ++A G+F N A +S+ S +WI+DSGA+ H+TG S F
Sbjct: 330 EE----GSSGTTPDNVANFAHSTSGSF-NQAFMSMNTSHSSWILDSGASRHVTGMSGEFT 384
Query: 435 SYSPCAGNHK--IKIADGSLSAIAGK---------------------------------- 458
SY P + HK I+ ADG+ + G+
Sbjct: 385 SYKPYSFAHKETIQTADGTSCQVKGEGIVQCTPSITLSSVLYVHSFPVNLISISSLVDNM 444
Query: 459 DCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLD 493
DC+ + +C+ ++ TGK +G + GL+YLD
Sbjct: 445 DCRVSLDRENCLIQERRTGKKLGIGIRRDGLWYLD 479
>ref|XP_470025.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|30103001|gb|AAP21414.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1393
Score = 639 bits (1648), Expect = 0.0
Identities = 341/675 (50%), Positives = 449/675 (66%), Gaps = 37/675 (5%)
Query: 557 DSNEEEPEM-PQQNDSNK------------ETKNRFNSSDPIWKGNVYERRDH-KRGDEG 602
D+ +E+ EM P + D + E R D + VY+RR +G++
Sbjct: 724 DTQDEDREMVPHEEDGEEGEVVVGTIPCPMEGAERVKQKDVL----VYQRRRFDSQGEKR 779
Query: 603 PILQPCQESEPRNDPNHHPNPGKSSIPKCRGKSSSITTSDDPDLH---IPIAIRKPVRSC 659
L Q E + P +S P S + P L +P+ R+ RS
Sbjct: 780 KGLVQSQIEELPHPKCPVPESSQSLSPPASLASLETIGNTSPTLEHVELPLVQRRETRSN 839
Query: 660 T--------------KHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEA 705
H +A +++YS++S ++ F + L T+ IPK+ + A + PKWK+A
Sbjct: 840 AGRPPIRLGFEHLSFMHDIANYITYSHVSPAYKTFIASLQTMPIPKDWKCAKQDPKWKDA 899
Query: 706 VLEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGI 765
+ EE+ AL KNKTW+++ LP K VGCKWVFTVK + V+RYKARLVAKG++Q YGI
Sbjct: 900 MKEELNALVKNKTWELVKLPPEKRAVGCKWVFTVKQTPEGKVDRYKARLVAKGYSQTYGI 959
Query: 766 DYSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFG 825
DY ETFAPVAK+ TVR L+S AVN WPL+QLDVKNAFL+GDL EEVYM+ PPGF +
Sbjct: 960 DYDETFAPVAKMGTVRALVSCAVNFGWPLHQLDVKNAFLHGDLHEEVYMEIPPGFGNSQT 1019
Query: 826 LN-VCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYV 884
+ VCKL+KSLYGLKQSPRAWF++F +V GY Q DHT+F + I IL VYV
Sbjct: 1020 VGKVCKLKKSLYGLKQSPRAWFDRFRHAVCDMGYSQCNGDHTVFYKHRGT-HITILAVYV 1078
Query: 885 DDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLE 944
DDI++TGDD+ E+ LK+ L K FE+KDLG L+YFLG+E+ARS KGIV+SQRKY+LDLL
Sbjct: 1079 DDIVITGDDVEEIRCLKERLGKAFEVKDLGPLRYFLGIEIARSSKGIVLSQRKYVLDLLT 1138
Query: 945 ETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQF 1004
+TGM GCR + TP++ N +L + PVD YQRLVG+LIYL HTRPDI+++VSVVS++
Sbjct: 1139 DTGMLGCRASTTPIDRNHQLCAQSGDPVDKEAYQRLVGRLIYLCHTRPDISYAVSVVSRY 1198
Query: 1005 MHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCA 1064
MH P HL+ V++ILRYLK PGKGL+F+K +V + DADWA S+ DR+ST+GYC
Sbjct: 1199 MHDPRTGHLDVVHKILRYLKGTPGKGLWFRKNGHLNVEGYCDADWASSMDDRRSTSGYCV 1258
Query: 1065 YVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDS 1124
+V GNLV+WRSKKQ VVARS+AEAE+RAMA + E+LW++ LL EL++ + L CD+
Sbjct: 1259 FVGGNLVSWRSKKQAVVARSTAEAEYRAMALSLSEMLWMRSLLTELRVLRSDTVMLHCDN 1318
Query: 1125 KAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFE 1184
K+AISIA+NPVQHDRTKH+EIDR FIKEKI SG + L Y+ S EQ AD LTK L +
Sbjct: 1319 KSAISIANNPVQHDRTKHVEIDRFFIKEKIDSGVLRLEYIKSCEQLADCLTKGLGPSEIQ 1378
Query: 1185 RLIVKLGMTNIYAPT 1199
+ K+GM +I+ P+
Sbjct: 1379 SICNKMGMIDIFCPS 1393
Score = 138 bits (347), Expect = 1e-30
Identities = 119/451 (26%), Positives = 195/451 (42%), Gaps = 68/451 (15%)
Query: 100 RLDG-KNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPN-YRFWKSENSTIIAWLLST 157
RL+G KNY+ W + +L+L KG F+ + PS + +R W + NST+++WL+++
Sbjct: 55 RLEGSKNYLSWCRRAQLMLRAKGVDHFLLESCEEPSDKESQAWRTWNTTNSTVVSWLMTS 114
Query: 158 MESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELM 217
+ +I + + A VW + YS N + + ++++ KQ R V Y +EL
Sbjct: 115 VAPSIGRMIEAIQNAAVVWKTLSNMYSGEGNVMMMVEAQNKVENLKQEGRTVQEYASELQ 174
Query: 218 ALWQELDLCYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQ 277
LW +LD + +D V+ K + RV L GLN ++ R + + LPT++
Sbjct: 175 QLWADLDHYDPLQLKHEDDIVIGNKWLQRRRVIHFLKGLNKEFEDRRAAMFHQATLPTME 234
Query: 278 ETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDHCNRQWHTRDTCW 337
E S + +EE R +M G + SA + N E C +C + H C
Sbjct: 235 EAISAMVQEEMRLRLMRGTNPI---RSAYIAADNRE---------CYNCGQVGHVSYNCP 282
Query: 338 KL------------HG----------------KPPNWKKKGGKEGRALQATTSDQEH--Q 367
HG + +GG+ G + Q + +
Sbjct: 283 TSRNIGGRGSIRGGHGGTRGGFRGDRGVFGGNRGGRGGDRGGRVGGRGRGRGVPQANAVK 342
Query: 368 SSSSSFPFTKEQLDQLYKMFGSQT--PSCSIAQIGNFPN----------TALVSVKPSP- 414
+ EQ+ Q + ++T S + GNF N AL S P
Sbjct: 343 EDGKAVTLIGEQVTQWEEWQKNKTNESSNTTTHFGNFANYAQVGEGTQAQALASTYRHPI 402
Query: 415 TWIIDSGATDHMTGESSLFASYSPCAGNHKIKIADGS-----------LSAIAGKDCQAN 463
WIIDSGA+ H+TG + F SY+P + I+IADG+ SAI C
Sbjct: 403 DWIIDSGASKHVTGLHNTFTSYTPYIHSETIQIADGTSKPIHVNLLSISSAIDQLKCIVV 462
Query: 464 FFHSHCIFKDLNTGKMIGSAKKSGGLYYLDN 494
F + C+F++ TG+ IG+ + GL+Y+++
Sbjct: 463 FDENSCLFQEKGTGRRIGTGVRRDGLWYINH 493
>ref|NP_918613.1| polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1554
Score = 637 bits (1642), Expect = 0.0
Identities = 341/672 (50%), Positives = 447/672 (65%), Gaps = 37/672 (5%)
Query: 557 DSNEEEPEM-PQQNDSNK------------ETKNRFNSSDPIWKGNVYERRDH-KRGDEG 602
D+ +E+ EM P + D + E R D + VY+RR +G++
Sbjct: 885 DTQDEDREMVPHEEDGEEGEVVVGTIPCPMEGAERVKQKDVL----VYQRRRFDSQGEKR 940
Query: 603 PILQPCQESEPRNDPNHHPNPGKSSIPKCRGKSSSITTSDDPDLH---IPIAIRKPVRS- 658
L Q E + P +S P S + P L +P+A R+ RS
Sbjct: 941 KGLVQSQIEELPHQKCPVPESSQSLSPPASLASLETIGNTSPTLEHVELPLAQRRETRSN 1000
Query: 659 -------------CTKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEA 705
+ H +A +++YS++S ++ F + L TV IPK+ + A + PKWK+A
Sbjct: 1001 AGRPPIRLGFEHLSSMHDIANYITYSHVSPAYKTFIASLQTVPIPKDWKCAKQDPKWKDA 1060
Query: 706 VLEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGI 765
+ EE+ AL KNKTW+++ LP K VGCKWVFTVK + V+ YKARLVAKG++Q YGI
Sbjct: 1061 MKEELNALVKNKTWELVKLPPEKRAVGCKWVFTVKQTPEGKVDMYKARLVAKGYSQTYGI 1120
Query: 766 DYSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFG 825
DY ETFAPVAK+ TVR L+S AVN WPL+QLDVKNAFL+GDL EEVYM+ PPGF +
Sbjct: 1121 DYDETFAPVAKMGTVRALVSCAVNFGWPLHQLDVKNAFLHGDLHEEVYMEIPPGFGNSQT 1180
Query: 826 LN-VCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYV 884
+ VCKL+KSLYGLKQSPRAWF++F +V GY Q DHT+F + I IL VYV
Sbjct: 1181 VGKVCKLKKSLYGLKQSPRAWFDRFRHAVCDMGYSQCNGDHTVFYKHRGT-HITILAVYV 1239
Query: 885 DDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLE 944
DDI++TGDD+ E+ LK+ L K FE+KDLG L+YFLG+E+ARS KGIV+SQRKY+LDLL
Sbjct: 1240 DDIVITGDDVEEIRCLKERLGKAFEVKDLGPLRYFLGIEIARSSKGIVLSQRKYVLDLLT 1299
Query: 945 ETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQF 1004
+TGM GCR + TP++ N +L + PVD YQRLVG+LIYL HTRPDI+++VSVVS++
Sbjct: 1300 DTGMLGCRASTTPIDRNHQLCAQSGDPVDKEAYQRLVGRLIYLCHTRPDISYAVSVVSRY 1359
Query: 1005 MHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCA 1064
MH P HL+ V++ILRYLK PGKGL+F+K +V + DADWA S+ DR+ST+GYC
Sbjct: 1360 MHDPRTGHLDVVHKILRYLKGTPGKGLWFRKNGHLNVEGYCDADWASSMDDRRSTSGYCV 1419
Query: 1065 YVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDS 1124
+V GNLV+WRSKKQ VVARS+AEAE+RAMA E+LW++ LL EL++ + L CD+
Sbjct: 1420 FVGGNLVSWRSKKQAVVARSTAEAEYRAMALSFSEMLWMRSLLTELRVLRSDTVMLHCDN 1479
Query: 1125 KAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFE 1184
K+AISIA+NPVQHDRTKH+EIDR FIKEKI SG + L Y+ S EQ AD LTK L +
Sbjct: 1480 KSAISIANNPVQHDRTKHVEIDRFFIKEKIDSGVLRLEYIKSCEQLADCLTKGLGPSEIQ 1539
Query: 1185 RLIVKLGMTNIY 1196
+ K+GM +I+
Sbjct: 1540 SVCNKMGMIDIF 1551
Score = 126 bits (317), Expect = 4e-27
Identities = 120/474 (25%), Positives = 195/474 (40%), Gaps = 91/474 (19%)
Query: 100 RLDG-KNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPN-YRFWKSENSTIIAWLLST 157
RL+G KNY+ W + +L+L KG F+ + PS + +R W + NST+++WL++
Sbjct: 55 RLEGSKNYLSWCRRAQLMLRAKGVDHFLLESCEEPSDKESQAWRTWNTTNSTVVSWLMTL 114
Query: 158 MESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELM 217
+ +I + + A VW + YS N + + ++++ KQ R V Y +EL
Sbjct: 115 VAPSIGRMIEAIQNAAVVWKTLSNMYSGEGNVMMMVEAQNKVENLKQEGRTVQEYASELQ 174
Query: 218 ALWQELDLCYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQ 277
LW +LD + +D V+ K + RV L GLN ++ R + + LPT++
Sbjct: 175 QLWADLDHYDPLLLKHEDDIVIGNKWLQRRRVIHFLKGLNKEFEDRRAAMFHQATLPTME 234
Query: 278 ETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDHCNRQWHTRDTCW 337
E S + +EE R +M G + SA + N E C +C + H C
Sbjct: 235 EAISAMVQEEMRLRLMRGTNPI---RSAYIAADNRE---------CYNCGQVGHVSYNCP 282
Query: 338 KL------------HG----------------KPPNWKKKGGKEGRALQATTSDQEHQSS 369
HG + +GG+ G + Q + ++
Sbjct: 283 TSRNIGGRGSIRGGHGGTRGGFGGDRGGFGGNRGGRGGDRGGRVGGRGRGRGVPQANAAT 342
Query: 370 SSSFPFT--KEQLDQLYKMFGSQT--PSCSIAQIGNFPN----------TALVSVKPSP- 414
T EQ+ Q + ++T S + GNF N AL S P
Sbjct: 343 EDGKAVTLIGEQVTQWEEWQKNKTNESSNTTTHFGNFANYAQVGEGTQAQALASTYRHPI 402
Query: 415 TWIIDSGATDHMTGESSLFASYSPCAGNHKIKIADGS----------------------- 451
WIIDSGA+ H+TG + F SY+P + I+IADG+
Sbjct: 403 DWIIDSGASKHVTGLHNTFTSYTPYIHSETIQIADGTSKPIHGIGSVECTSSMNLSSVLH 462
Query: 452 -----------LSAIAGKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLDN 494
SAI C F + C+F++ TG+ IG+ + GL+Y+++
Sbjct: 463 VPSFPVNLLSVSSAIDQLKCIVVFDENSCLFQEKWTGRRIGTGVRRDGLWYINH 516
>gb|AAT40550.1| putative receptor kinase [Solanum demissum]
Length = 1358
Score = 593 bits (1530), Expect = e-168
Identities = 315/600 (52%), Positives = 421/600 (69%), Gaps = 19/600 (3%)
Query: 610 ESEPRNDPNHHPN-----PGKSSIPKCRGKSSSITTSDDPDLHIPIAIRKPVRSCTKHPM 664
ES+P ++HP+ P +P S++T++ P+ + P+ + + P
Sbjct: 766 ESQPYYTSSNHPDVSMVLPIPQVLPVPTFVESTVTSTS------PVVV-PPLLTYHRRPR 818
Query: 665 AKFVSYSNLSSSFAAFTSQLSTVEIPKNVQ--EALKIPKWKEAVLEEMRALEKNKTWKIM 722
V + + A T+ L P +Q EAL W++A+++EM AL K+ TW+++
Sbjct: 819 PTLVPDDSCHAPDPAPTADLPPPSQPLALQKGEALSHSGWRQAMVDEMSALHKSGTWELV 878
Query: 723 TLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRV 782
+LPAGK+TVGC+WV+ VK D V+R KARLVAKG+TQ +G+DYS+TFAPVAK+ +VR+
Sbjct: 879 SLPAGKSTVGCRWVYAVKIGPDGQVDRLKARLVAKGYTQIFGLDYSDTFAPVAKIASVRL 938
Query: 783 LLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGF--EDKFGLNVCKLQKSLYGLKQ 840
LS+A WPL+QLD+KNAFL+GDL+EEVYM+ PPGF + + VC+L++SLYGLKQ
Sbjct: 939 FLSMAAVRHWPLHQLDIKNAFLHGDLEEEVYMEQPPGFVAQGESSSLVCRLRRSLYGLKQ 998
Query: 841 SPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRL 900
SPRAWF KF+ +++ G ++ +DH++F R S + L+VYVDDI++TG+D + L
Sbjct: 999 SPRAWFGKFSTVIQEFGMTRSGADHSVFYRHSAPSRCIYLVVYVDDIVITGNDQDGITDL 1058
Query: 901 KKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMEL 960
K++L K F+ KDLG LKYFLG+EVA+SR GIV+SQRKY LD+LEETGM GCRP DTPM+
Sbjct: 1059 KQHLFKHFQTKDLGRLKYFLGIEVAQSRSGIVISQRKYALDILEETGMMGCRPVDTPMDP 1118
Query: 961 NAKLWEKGNVPV-DIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRI 1019
N KL P+ + RY+RLVGKL YL TRPDI+F VSVVSQFM SP + H EAV RI
Sbjct: 1119 NVKLLPGQGEPLSNPERYRRLVGKLNYLTVTRPDISFPVSVVSQFMTSPCDSHWEAVVRI 1178
Query: 1020 LRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQG 1079
LRY+KS PGKGL F+ + +TDADWAGS DR+ST+GYC V GNLV+W+SKKQ
Sbjct: 1179 LRYIKSAPGKGLLFEDQGHEHIIGYTDADWAGSPSDRRSTSGYCVLVGGNLVSWKSKKQN 1238
Query: 1080 VVARSSAEAEFRAMAQGICELLWIQKLLEELKL-KIDLPLKLFCDSKAAISIAHNPVQHD 1138
VVARSSAE+E+RAMA CEL+WI++LL ELK K+D ++L CD++AA+ IA NPV H+
Sbjct: 1239 VVARSSAESEYRAMATATCELVWIKQLLGELKFGKVD-KMELVCDNQAALHIASNPVFHE 1297
Query: 1139 RTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYAP 1198
RTKHIEID HF++EKI+SG I +V SN+Q ADI TKSL P + KLG ++YAP
Sbjct: 1298 RTKHIEIDCHFVREKILSGDIVTKFVKSNDQLADIFTKSLTCPRINYICNKLGTYDLYAP 1357
Score = 125 bits (315), Expect = 7e-27
Identities = 131/469 (27%), Positives = 206/469 (42%), Gaps = 91/469 (19%)
Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITG---EIQMPSTTDPNYRFWKSENSTIIAWLLST 157
L NY+ W+ +V L G+G +T E+ + + T K++ + A L S
Sbjct: 16 LGSSNYLSWASSVELWCKGQGVQDHLTNKAYEVDVKAKTSEEDAKAKAQWEKVDAQLCSL 75
Query: 158 MESTI--KKPYMFLP--TAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYY 213
+ +I K +F P T VW+ +A Y++ + SR +D+ SRL K+ + D++TY
Sbjct: 76 LWRSIDFKLMPLFRPFQTCYTVWEKARALYTN--DISRFYDVISRLTNLKKQESDMSTYL 133
Query: 214 NELMALWQELDLCYDDNWRCTEDSVLFLKRQENDRVFVL---LAGLNNCLDEVRGRILGR 270
++ A+ +E D K+QE+ + L LAGL D VR +IL
Sbjct: 134 GQVQAVMEEFDTLMPVTTNVE-------KQQEHRQTLFLVLTLAGLPPDHDSVRDQILAS 186
Query: 271 IPLPTLQETFSEVRREEARQSVMMGKSASITESSALVTKGNEE-----------GKRDGK 319
+PT+ E FS + R A S + S ++ +SS L ++ E+ G R GK
Sbjct: 187 PTVPTIDELFSRLLRLAAPPSHKVVSSPTV-DSSILASQTFEKRTYQSMENRRGGGRFGK 245
Query: 320 -KPFCDHCNRQWHTRDTCWKLHGKPPNWKKKGGKE------GRALQATT-----SDQEHQ 367
+ C HC++ HTRD C+ LHG PP++ KE RA + T+ Q +Q
Sbjct: 246 PRSKCSHCHKPGHTRDICYILHGPPPSYDPIVLKEYNEFLRNRASKQTSPPVAYGAQPNQ 305
Query: 368 SSSSSFPFTKEQLDQLYKMFGSQT---------PSCSIAQIGNFPNTALVSVKPS-PTWI 417
S+++ E + L QT P S+A GN + A VS + TW+
Sbjct: 306 PSNNAHIAQTEYDEFLQYRANKQTSPQVVSVAQPDVSVA--GN--SFACVSQSSTLGTWV 361
Query: 418 IDSGATDHMTGESSLFASYSPCAGNHKIKIADG------------SLSAIA--------- 456
+DSGA+DH++G SL + I +A+G LS++
Sbjct: 362 MDSGASDHISGNKSLLSDIVYSQSLPAITLANGIQTKPKGVGKAKPLSSVTLDSVLYVPG 421
Query: 457 -------------GKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYL 492
C FF + +D +TG+MIG+ +S GLYYL
Sbjct: 422 SPFNLASVSRLTKALHCSITFFDDFFLMQDRSTGQMIGTGHESQGLYYL 470
>emb|CAE05707.2| OSJNBb0065J09.3 [Oryza sativa (japonica cultivar-group)]
Length = 1015
Score = 555 bits (1429), Expect = e-156
Identities = 286/503 (56%), Positives = 362/503 (71%), Gaps = 21/503 (4%)
Query: 615 NDPNHHPNPGKSSIPKCRGKSSSITTSDDPDLHIPIAIRKPVRSCTKHP----------- 663
ND + + + + G+ S I+ + ++PIA RK +RS P
Sbjct: 400 NDQSSYQSDPIQENTETGGEESEISGEES---NLPIANRKGIRSTAGKPPIRYGFEEVEE 456
Query: 664 -----MAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKT 718
+A +VSYS+LS ++ AF + ++ IPK+ +EA PKW EA++EEM ALEKNKT
Sbjct: 457 ENGNDIANYVSYSSLSPAYRAFIASFQSIVIPKDWREAKNDPKWHEAMMEEMSALEKNKT 516
Query: 719 WKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLN 778
W+++ P GK V CKWV VK + VERYKARLVAKG++Q YGIDY ETFAPVAK++
Sbjct: 517 WELVPFPTGKKVVSCKWVNAVKQDPFGKVERYKARLVAKGYSQTYGIDYDETFAPVAKMS 576
Query: 779 TVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFE-DKFGLNVCKLQKSLYG 837
TVR L+S A N DWPL QLDVKNAFL+GDLQEEVYM+ PPGF + V +L+KSLYG
Sbjct: 577 TVRTLISCAANFDWPLYQLDVKNAFLHGDLQEEVYMEIPPGFSTSQTKGKVLRLKKSLYG 636
Query: 838 LKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEM 897
LKQSPRAWF++F ++ GY Q DHTLF R KIAIL VYVDDII+TGDD E+
Sbjct: 637 LKQSPRAWFDRFRRAMCGMGYKQCNGDHTLFYRHRGK-KIAILAVYVDDIIITGDDTQEI 695
Query: 898 DRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTP 957
+LK+N++KEFE+KDLG LKYFLG+E+ARS +GIV+SQRKY+LDLL +TGM GCRPA TP
Sbjct: 696 AQLKENISKEFEVKDLGQLKYFLGIEIARSPRGIVLSQRKYVLDLLCDTGMLGCRPASTP 755
Query: 958 MELNAKLWEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVY 1017
+E N KL + PV+ RYQRLVG+LIYL HTRPDI ++VSVVS++MH P H++AVY
Sbjct: 756 IEQNHKLCAELGDPVNKERYQRLVGRLIYLCHTRPDITYAVSVVSRYMHDPRSGHMDAVY 815
Query: 1018 RILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKK 1077
RILRYLK +PGKGL+FKK V + DADWA S+ DR+ST+GYC +V GNLV+WRSKK
Sbjct: 816 RILRYLKGSPGKGLWFKKNGHLGVEGYCDADWASSLDDRRSTSGYCVFVGGNLVSWRSKK 875
Query: 1078 QGVVARSSAEAEFRAMAQGICEL 1100
Q VV+RS+AEAE+RAM+ IC +
Sbjct: 876 QPVVSRSTAEAEYRAMSGCICRI 898
Score = 42.4 bits (98), Expect = 0.095
Identities = 27/97 (27%), Positives = 43/97 (43%), Gaps = 20/97 (20%)
Query: 399 IGNFPNTA--LVSVKPSPTWIIDSGATDHMTGESSLFASYSPCAGNHKIKIADGSLSAIA 456
+GN N A + P TWI+DSGA+ H+T + + S S +
Sbjct: 22 LGNLVNLAHPMQMQVPHSTWILDSGASRHVT-----------------VNLVSIS-SLVD 63
Query: 457 GKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLD 493
DC +C+ ++ TG+ +G + GL+YLD
Sbjct: 64 HMDCWVTLDRENCLIEERRTGRKLGIGIRQNGLWYLD 100
>emb|CAA36616.1| unnamed protein product [Solanum tuberosum] gi|421955|pir||S25787
hypothetical protein 4 - potato transposon Tst1
Length = 390
Score = 529 bits (1363), Expect = e-148
Identities = 248/356 (69%), Positives = 304/356 (84%), Gaps = 1/356 (0%)
Query: 799 VKNAFLNGDLQEEVYMDSPPGFEDKFGLNVCKLQKSLYGLKQSPRAWFEKFTWSVKKQGY 858
+KN FLNG L+EEVYMD PPGFE K+ +C+L++SLYGLKQSPRAWFE+FT VK+QGY
Sbjct: 1 MKNVFLNGHLEEEVYMDPPPGFEGKYKSKICRLRRSLYGLKQSPRAWFERFTQFVKRQGY 60
Query: 859 MQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKY 918
+Q Q+DHT+F R S +GK +LIVYVDDIILTGDD+VE+ LK+ LA EFEIKDLG LKY
Sbjct: 61 VQGQADHTMFTRHSLEGKTTVLIVYVDDIILTGDDVVEIKNLKERLASEFEIKDLGPLKY 120
Query: 919 FLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQ 978
FLGMEVARS+KGI+VSQRKY+LDLL+ETGMSGCRP +TP++ N K ++G + +D G+YQ
Sbjct: 121 FLGMEVARSKKGIIVSQRKYVLDLLKETGMSGCRPTETPIDPNLKFVKEGKL-IDKGQYQ 179
Query: 979 RLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTND 1038
RLVGKLIYL+HTRPDI+F+VS+V QFMH P EEH EAVYRILRYLKS+PGKGL+FKK
Sbjct: 180 RLVGKLIYLSHTRPDISFAVSLVIQFMHYPREEHQEAVYRILRYLKSSPGKGLFFKKNEQ 239
Query: 1039 RDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGIC 1098
R + +TDADWAGS IDR+ST+GYC +VWGNLVTWRSKKQ VVARSSAEAE+R+MA GIC
Sbjct: 240 RSLEAYTDADWAGSSIDRRSTSGYCTFVWGNLVTWRSKKQNVVARSSAEAEYRSMALGIC 299
Query: 1099 ELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKHIEIDRHFIKEKI 1154
E+LW+++ LEEL+ + P+KL+CD+KAAISIAHNPVQHDRTKH+E+ +K ++
Sbjct: 300 EILWLKRFLEELRRPVSFPMKLYCDNKAAISIAHNPVQHDRTKHVEVTDTSLKRRL 355
>emb|CAB81200.1| putative retrotransposon polyprotein [Arabidopsis thaliana]
gi|4539373|emb|CAB40067.1| putative retrotransposon
polyprotein [Arabidopsis thaliana] gi|7486142|pir||T04294
hypothetical protein F25I24.200 - Arabidopsis thaliana
Length = 1203
Score = 517 bits (1332), Expect = e-145
Identities = 262/584 (44%), Positives = 391/584 (66%), Gaps = 18/584 (3%)
Query: 603 PILQPCQESE-PRNDPNHHPNPGKSSIPKCRGKSSSITTS-DDPDLHIPIAIRKPVRSCT 660
PI +P + ++ P +H N S+P S + +TS + P IP P + T
Sbjct: 445 PIARPKRNAKAPAYLSEYHCN----SVPFLSSLSPTTSTSIETPSSSIP-----PKKITT 495
Query: 661 KHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWK 720
+PM+ +SY L+ F ++ + PK +A+K KW A EE+ ALE+NKTW
Sbjct: 496 PYPMSTAISYDKLTPLFHSYICAYNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWI 555
Query: 721 IMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTV 780
+ +L GKN VGCKWVFT+KYN D ++ERYKARLVA+GFTQ GIDY ETF+PVAK +V
Sbjct: 556 VESLTEGKNVVGCKWVFTIKYNPDGSIERYKARLVAQGFTQQEGIDYMETFSPVAKFGSV 615
Query: 781 RVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLN-----VCKLQKSL 835
++LL LA W L Q+DV NAFL+G+L EE+YM P G+ G++ VC+L KSL
Sbjct: 616 KLLLGLAAATGWSLTQMDVSNAFLHGELDEEIYMSLPQGYTPPTGISLPSKPVCRLLKSL 675
Query: 836 YGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIV 895
YGLKQ+ R W+++ + ++Q+ +D+T+F++ S I +++VYVDD+++ +D
Sbjct: 676 YGLKQASRQWYKRLSSVFLGANFIQSPADNTMFVKVSCTS-IIVVLVYVDDLMIASNDSS 734
Query: 896 EMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPAD 955
++ LK+ L EF+IKDLG ++FLG+E+ARS +GI V QRKY +LLE+ G+SGC+P+
Sbjct: 735 AVENLKELLRSEFKIKDLGPARFFLGLEIARSSEGISVCQRKYAQNLLEDVGLSGCKPSS 794
Query: 956 TPMELNAKLW-EKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLE 1014
PM+ N L E G + + Y+ LVG+L+YL TRPDI F+V +SQF+ +P + H++
Sbjct: 795 IPMDPNLHLTKEMGTLLPNATSYRELVGRLLYLCITRPDITFAVHTLSQFLSAPTDIHMQ 854
Query: 1015 AVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWR 1074
A +++LRYLK NPG+GL + +++ ++ F+DADW R+S TG+C Y+ +L+TW+
Sbjct: 855 AAHKVLRYLKGNPGQGLMYSASSELCLNGFSDADWGTCKDSRRSVTGFCIYLGTSLITWK 914
Query: 1075 SKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNP 1134
SKKQ VV+RSS E+E+R++AQ CE++W+Q+LL++L + + P KLFCD+K+A+ +A NP
Sbjct: 915 SKKQSVVSRSSTESEYRSLAQATCEIIWLQQLLKDLHVTMTCPAKLFCDNKSALHLATNP 974
Query: 1135 VQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSL 1178
V H+RTKHIEID H ++++I +G + +V + Q ADILTK L
Sbjct: 975 VFHERTKHIEIDCHTVRDQIKAGKLKTLHVPTGNQLADILTKPL 1018
>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
gi|9989049|gb|AAG10812.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1404
Score = 513 bits (1321), Expect = e-143
Identities = 263/539 (48%), Positives = 367/539 (67%), Gaps = 6/539 (1%)
Query: 662 HPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKI 721
HP S + + AF S++S IP+ +EA+++ +W++A+ +E+ A+++N TW
Sbjct: 864 HPFQATCSLALVPLDHQAFLSKISEHWIPQTYEEAMEVKEWRDAIADEINAMKRNHTWDE 923
Query: 722 MTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVR 781
LP GK TV +WVFT+KY S+ +ERYK RLVA+GFTQ YG DY ETFAPVAKL+TVR
Sbjct: 924 DDLPKGKKTVSSRWVFTIKYKSNGDIERYKTRLVARGFTQTYGSDYMETFAPVAKLHTVR 983
Query: 782 VLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLN-VCKLQKSLYGLKQ 840
V+L+LA NL W L Q+DVKNAFL G+L+++VYM PPG ED + V +L+K++YGLKQ
Sbjct: 984 VVLALATNLSWGLWQMDVKNAFLQGELEDDVYMTPPPGLEDTIPCDKVLRLRKAIYGLKQ 1043
Query: 841 SPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRL 900
SPRAW+ K + ++K G+ +++SDHTLF S G I ++++YVDD+I+TGD+ +D
Sbjct: 1044 SPRAWYHKLSRTLKDHGFKKSESDHTLFTLQSPQG-IVVVLIYVDDLIITGDNKDGIDST 1102
Query: 901 KKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMEL 960
K L F+IKDLG LKYFLG+EV RS G+ +SQRKY LDLL ETG +PA TP+E
Sbjct: 1103 KTFLKSCFDIKDLGELKYFLGIEVCRSNAGLFLSQRKYTLDLLNETGFMDAKPARTPLED 1162
Query: 961 NAKLWEKGNVPV----DIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAV 1016
K+ KG D Y++LVGKLIYL +TRPDI F+V+ VSQ M P H V
Sbjct: 1163 GYKVNRKGEKEDEKFGDAPLYRKLVGKLIYLTNTRPDICFAVNQVSQHMKVPMVYHWNMV 1222
Query: 1017 YRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSK 1076
RILRYLK + G+G++ K + ++ + DAD+AG DR+S TGYC ++ GNL TW++K
Sbjct: 1223 ERILRYLKGSSGQGIWMGKNSSTEIVGYCDADYAGDRGDRRSKTGYCTFIGGNLATWKTK 1282
Query: 1077 KQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQ 1136
KQ VV+ SSAE+E+RAM + EL W++ LL++L ++ +P+ + CD+KAAI IA N V
Sbjct: 1283 KQKVVSCSSAESEYRAMRKLTNELTWLKALLKDLGIEQHMPITMHCDNKAAIYIASNSVF 1342
Query: 1137 HDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNI 1195
H+RTKHIE+D H ++EKI+ G Y S +Q ADI TK+ + + KLG+ ++
Sbjct: 1343 HERTKHIEVDCHKVREKIIEGVTLPCYTRSEDQLADIFTKAASLKVCNFIHGKLGLVDL 1401
Score = 167 bits (423), Expect = 2e-39
Identities = 120/435 (27%), Positives = 195/435 (44%), Gaps = 51/435 (11%)
Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITG--------EIQMPSTTDPNYRFWKSENSTIIA 152
L G NY+ WS+T + +L G+G + E + T P W E+ ++A
Sbjct: 13 LQGGNYLTWSRTTKTVLCGRGLWSHVISSQAPKEDKEEEETETISPEEEKWFQEDQAVLA 72
Query: 153 WLLSTMESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTY 212
L +++E++I + Y + TAK +WD +K Y + N +R+F++K + + Q D + T +
Sbjct: 73 LLQNSLETSILEGYSYCETAKELWDTLKNVYGNESNLTRVFEVKKAINELSQEDLEFTKH 132
Query: 213 YNELMALWQELDLCYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNCLDEVRGRILGRIP 272
+ + +LW EL T D + +R+E D+VF LL LN +++ +L
Sbjct: 133 FGKFRSLWSELKSLRPG----TLDPKILHERREQDKVFGLLLTLNPGYNDLIKHLLRSEK 188
Query: 273 LPTLQETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPF-CDHCNRQWH 331
LP+L E S++++E+ + GKS IT + V K + +K CDHC ++ H
Sbjct: 189 LPSLDEVCSKIQKEQGSTGLFGGKSELITANKGEVVANKGVYKNEDRKLLTCDHCKKKGH 248
Query: 332 TRDTCWKLHGKPPNWKKKGGKEGRALQATTSDQEHQSSSSSFPFTKEQLDQLYKMFGSQT 391
T+D CW LH P+ K K+ RA + + +E + SS T + +
Sbjct: 249 TKDKCWLLH---PHLKPAKFKDSRAHFSQETHEEQSQAGSSKGETSTSFGDYVRKSDLEA 305
Query: 392 PSCSIAQIGNFPNTALVSVKPSPTWIIDSGATDHMTGESSLFASYSPCAGNHKIKIADGS 451
SI + S S + +IDSGA+ HM S+L + P G+ + IA+G
Sbjct: 306 LIKSIVSLKE-SGITFSSQTSSGSIVIDSGASHHMISNSNLLDNIEPALGH--VIIANGD 362
Query: 452 LSAIAG--------KD------------------------CQANFFHSHCIFKDLNTGKM 479
I G KD C A F + F+D+ TGK+
Sbjct: 363 KVPIEGIGNLKLFNKDSKAFFMPKFTSNLLSVKRTTRDLNCYAIFGPNDVYFQDIETGKV 422
Query: 480 IGSAKKSGGLYYLDN 494
IG G LY L++
Sbjct: 423 IGEGGSKGELYVLED 437
>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
(gb|U12626). [Arabidopsis thaliana]
gi|25301690|pir||G96722 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana
Length = 1315
Score = 510 bits (1314), Expect = e-143
Identities = 260/613 (42%), Positives = 399/613 (64%), Gaps = 23/613 (3%)
Query: 603 PILQPCQESEPRNDPNHHPNPGKSSIPKCRGKSSSITTSDDPDLHIPIAIRKP------- 655
P L P + ++ + +P+ SS+ S+ T++ P+ + + RK
Sbjct: 704 PDLNPTPPMQRQSSDHVNPSDSSSSVEIL---PSANPTNNVPEPSVQTSHRKAKKPAYLQ 760
Query: 656 ------VRSCTKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEE 709
V S T H + KF+SY ++ + F + L + P N EA K+ W++A+ E
Sbjct: 761 DYYCHSVVSSTPHEIRKFLSYDRINDPYLTFLACLDKTKEPSNYTEAEKLQVWRDAMGAE 820
Query: 710 MRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSE 769
LE TW++ +LPA K +GC+W+F +KYNSD +VERYKARLVA+G+TQ GIDY+E
Sbjct: 821 FDFLEGTHTWEVCSLPADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNE 880
Query: 770 TFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLN-- 827
TF+PVAKLN+V++LL +A L QLD+ NAFLNGDL EE+YM P G+ + G +
Sbjct: 881 TFSPVAKLNSVKLLLGVAARFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYASRQGDSLP 940
Query: 828 ---VCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYV 884
VC+L+KSLYGLKQ+ R W+ KF+ ++ G++Q+ DHT F++ S DG ++VY+
Sbjct: 941 PNAVCRLKKSLYGLKQASRQWYLKFSSTLLGLGFIQSYCDHTCFLKIS-DGIFLCVLVYI 999
Query: 885 DDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLE 944
DDII+ ++ +D LK + F+++DLG LKYFLG+E+ RS KGI +SQRKY LDLL+
Sbjct: 1000 DDIIIASNNDAAVDILKSQMKSFFKLRDLGELKYFLGLEIVRSDKGIHISQRKYALDLLD 1059
Query: 945 ETGMSGCRPADTPMELNAKL-WEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQ 1003
ETG GC+P+ PM+ + + G V++G Y+RL+G+L+YL TRPDI F+V+ ++Q
Sbjct: 1060 ETGQLGCKPSSIPMDPSMVFAHDSGGDFVEVGPYRRLIGRLMYLNITRPDITFAVNKLAQ 1119
Query: 1004 FMHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYC 1063
F +P + HL+AVY+IL+Y+K G+GL++ T++ + ++ +AD+ R+ST+GYC
Sbjct: 1120 FSMAPRKAHLQAVYKILQYIKGTIGQGLFYSATSELQLKVYANADYNSCRDSRRSTSGYC 1179
Query: 1064 AYVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCD 1123
++ +L+ W+S+KQ VV++SSAEAE+R+++ EL+W+ L+EL++ + P LFCD
Sbjct: 1180 MFLGDSLICWKSRKQDVVSKSSAEAEYRSLSVATDELVWLTNFLKELQVPLSKPTLLFCD 1239
Query: 1124 SKAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNF 1183
++AAI IA+N V H+RTKHIE D H ++E+++ G L ++ + Q AD TK L +F
Sbjct: 1240 NEAAIHIANNHVFHERTKHIESDCHSVRERLLKGLFELYHINTELQIADPFTKPLYPSHF 1299
Query: 1184 ERLIVKLGMTNIY 1196
RLI K+G+ NI+
Sbjct: 1300 HRLISKMGLLNIF 1312
Score = 151 bits (381), Expect = 1e-34
Identities = 112/402 (27%), Positives = 185/402 (45%), Gaps = 48/402 (11%)
Query: 117 LDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMESTIKKPYMFLPTAKNVW 176
++ K KLGF+ G I P DP + W+ NS + +WLL+++ I ++ PTA +W
Sbjct: 5 IEAKNKLGFVDGSIPKPDDDDPYCKIWRRCNSMVKSWLLNSVSKEIYTSILYFPTAAAIW 64
Query: 177 DAVKATYSDIQNSS--RIFDLKSRLWQAKQSDRDVTTYYNELMALWQELDLCYDDNWRCT 234
K Y+ SS R++ L+ ++ +Q + D+++Y+ LW+EL R
Sbjct: 65 ---KDLYTRFHKSSLPRLYKLRQQIHSLRQGNLDLSSYHTRTQTLWEEL-TSLQAVPRTV 120
Query: 235 EDSVLFLKRQENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQETFSEVRREEARQSVMM 294
ED L +E +RV L GLN+C D VR +IL + LP+L E F+ + ++E ++S +
Sbjct: 121 ED---LLIERETNRVIDFLMGLNDCYDTVRSQILMKKTLPSLSEVFNMIDQDETQRSARI 177
Query: 295 GKSASITESSALVTKGNEEGKRDG------KKPFCDHCNRQWHTRDTCWKLHGKPPNWKK 348
+ +T S V+ + + +G ++P C +C+R H DTC+K HG P ++K
Sbjct: 178 STTPGMTSSVFPVSNQSSQSALNGDTYQKKERPVCSYCSRPGHVEDTCYKKHGYPTSFKS 237
Query: 349 KGG--KEGRALQATTSDQE--HQSSSSSFPFTKEQLDQLYKMFGS--QTPSCSIAQIGNF 402
K K + A +E + +S S+ T Q+ QL S Q PS +
Sbjct: 238 KQKFVKPSISANAAIGSEEVVNNTSVSTGDLTTSQIQQLVSFLSSKLQPPSTPVQ----- 292
Query: 403 PNTALVSVKPSPTWIIDSGATDHMTGESSLFASYSPCAGNHKI----------KIADGSL 452
P +SV P+ S ++G L G H I K S+
Sbjct: 293 PEVHSISVSSDPS---SSSTVCPISGSVHL--------GRHLILNDVLFIPQFKFNLLSV 341
Query: 453 SAIA-GKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLD 493
S++ C+ F + C+ +D M+G K+ LY +D
Sbjct: 342 SSLTKSMGCRIWFDETSCVLQDATRELMVGMGKQVANLYIVD 383
>gb|AAU89728.1| putative retroelement pol polyprotein-like [Solanum tuberosum]
Length = 1476
Score = 509 bits (1311), Expect = e-142
Identities = 253/550 (46%), Positives = 368/550 (66%), Gaps = 14/550 (2%)
Query: 662 HPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKI 721
+P++ + YS LSS++ + + S P+ +A +W A+ EE++ALE NKTW++
Sbjct: 909 YPISDNIDYSCLSSTYQCYIASSSVETEPQFYYQAANDCRWVHAMKEEIQALEDNKTWEV 968
Query: 722 MTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVR 781
++LP GK +GCKWV+ +KY + +ER+KARLVAKG+ Q G+DY ETF+PV K+ T+R
Sbjct: 969 VSLPKGKKAIGCKWVYKIKYKASGEIERFKARLVAKGYNQKEGLDYQETFSPVVKMVTLR 1028
Query: 782 VLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFE-DKFG-LNVCKLQKSLYGLK 839
+L+LAV+ W + Q+DV NAFL GDL EEVYM P GF+ DK G VC+L KSLYGLK
Sbjct: 1029 TVLTLAVSKGWDIQQMDVYNAFLQGDLIEEVYMQLPQGFQYDKTGDPKVCRLLKSLYGLK 1088
Query: 840 QSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDR 899
Q+ R W K T ++ G+ Q+ D++L ++ + DG I I+++YVDD+++TG + +D
Sbjct: 1089 QASRQWNVKLTTALLAAGFQQSHLDYSLMLKRTADG-IVIVLIYVDDLLITGSSLQLIDD 1147
Query: 900 LKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPME 959
K+ L F+IKDLG L+YFLGME AR+ G+++ QRKY L+L+ + G+ G +P+ TP+E
Sbjct: 1148 AKQVLKANFKIKDLGTLRYFLGMEFARNASGMLMHQRKYALELISDLGLGGSKPSVTPVE 1207
Query: 960 LNAKLWEK-----------GNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSP 1008
L+ KL + ++ D YQRLVG+L+YL TRPDI+F+V +SQFMH+P
Sbjct: 1208 LHLKLTTREFDLHVGSSGADSLLADPTEYQRLVGRLLYLTITRPDISFAVQHLSQFMHAP 1267
Query: 1009 YEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWG 1068
H+EA R+++Y+K PG GLY + + DADW + RKS TGY
Sbjct: 1268 KVSHMEAAIRVVKYVKQAPGLGLYMAVQTADTLQAYCDADWGSCINTRKSITGYMIQFGS 1327
Query: 1069 NLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAI 1128
L++W+SKKQ ++RSSAEAE+R++A + EL+W+ L +EL + + LP+ L+CDSKAAI
Sbjct: 1328 ALLSWKSKKQPTISRSSAEAEYRSLASTVAELVWLTGLFKELDMPLSLPVSLYCDSKAAI 1387
Query: 1129 SIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIV 1188
IA NPV H+RTKHI+ID HFI+EK+ +G + + Y+ + EQ ADILTK L+ L+
Sbjct: 1388 QIAANPVFHERTKHIDIDCHFIREKVQAGLVMIHYLPTQEQPADILTKGLSSAQHSYLVS 1447
Query: 1189 KLGMTNIYAP 1198
KLG+ NI+ P
Sbjct: 1448 KLGLKNIFIP 1457
Score = 118 bits (295), Expect = 1e-24
Identities = 121/511 (23%), Positives = 197/511 (37%), Gaps = 121/511 (23%)
Query: 98 VQRLDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDP-NYRFWKSENSTIIAWLLS 156
+Q +NY WS+ ++L L K K+GFI G ++ + + W N+ +++WL++
Sbjct: 19 IQLTGMENYSLWSRAMQLTLLTKNKMGFIDGSLRRDDFKEELEKKQWDRCNAMVLSWLMN 78
Query: 157 TMESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNEL 216
+ + + +F A VW+ +K + + N SRIF L + Q V+ YY++L
Sbjct: 79 NVSTDLVSGILFRSNATLVWNDLKERFDKV-NMSRIFHLHKAIVTHVQGVSPVSVYYSKL 137
Query: 217 MALWQELDLCYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNCLDEVRGRILGRIPLPTL 276
LW E D E SV + ++ L GLN+ + R +IL P P++
Sbjct: 138 KDLWDEYDSILPPPSCDCEKSVDYTDSMLRQKLLQFLMGLNDNYGQARSQILMMNPSPSV 197
Query: 277 QETFSEVRREEARQSVMMGKSASITESSALVT-------------KGNEEGKRDG----- 318
+ ++ + ++E+++S + S + +AL T +G+ G +G
Sbjct: 198 NQCYAMIVQDESQRS--LSGSGQTIDPTALFTHRPGGSGFGSQGSQGSGNGSSNGNSHRF 255
Query: 319 ----------------------KKPFCDHCNRQWHTRDTCWKLHGKPPNWKKKG------ 350
K C HCN Q HT+DTC++L G P ++K K
Sbjct: 256 HKGGNIYCDFCNMKGHIRANCNKLKHCTHCNMQGHTKDTCYQLIGYPADYKGKKKANIVT 315
Query: 351 ---------------------------GKEGRALQATTSDQEHQSSS-------SSFP-F 375
G +Q T + H S S S P F
Sbjct: 316 APSLPQMQHNNFNNNLNYPMQYTGDGIGHFVSPMQFTGNTNGHSSGSIAGNFGPGSVPQF 375
Query: 376 TKEQLDQLYKMFGSQTPSCSIAQI-GNFPNTA-LVSVKPSPTWIIDSGATDHMTGESSLF 433
T Q + + +M S S A + G F ++ S S WI+DSGATDHM ++L
Sbjct: 376 TPSQYNNILQMLNKPMLSESSANVAGIFAGSSHCNSNTHSSAWIVDSGATDHMVSNTTLL 435
Query: 434 ASYSPCAGNHKIKIADG--------SLSAIAGKD-------------------------- 459
+ K+++ G S + G D
Sbjct: 436 NHGLSVSHPGKVQLPTGDSAVVTHSGSSQLTGGDVVKNVLCVPTFQFNLLSVSKLTKELN 495
Query: 460 CQANFFHSHCIFKDLNTGKMIGSAKKSGGLY 490
C FF I +DL TGK+ ++ GLY
Sbjct: 496 CCVIFFPDFFIIQDLFTGKVKEIGEEINGLY 526
>pir||E96608 probable retroelement polyprotein F25P12.89 [imported] - Arabidopsis
thaliana gi|9954746|gb|AAG09097.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1486
Score = 509 bits (1310), Expect = e-142
Identities = 255/553 (46%), Positives = 370/553 (66%), Gaps = 2/553 (0%)
Query: 647 HIPIAIRKPVRSCTKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAV 706
++ + +P S T +P+ ++S S S ++ A+ +++ P+N EA+ WK AV
Sbjct: 934 YVTTLLHQPFPSATPYPLDNYISSSRFSDNYQAYILAITSGNEPRNYNEAMLDDHWKGAV 993
Query: 707 LEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGID 766
E+ +LE TW + LP GK +GCKWVF +KY SD T+ER+KARLV G Q G+D
Sbjct: 994 SHEIGSLENLGTWTVEDLPPGKKALGCKWVFRLKYKSDGTLERHKARLVVLGNNQTEGLD 1053
Query: 767 YSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGL 826
Y+ETFAPVAK+ TVR L V+LDW ++Q+DV NAFL+GDL EEVYM PPGF
Sbjct: 1054 YTETFAPVAKMVTVRAFLQQVVSLDWEVHQMDVHNAFLHGDLDEEVYMQFPPGFRTGDKT 1113
Query: 827 NVCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDD 886
VC+L+KSLYGLKQ+PR WF K T ++K G++Q SD++LF+ N ++ +L VYVDD
Sbjct: 1114 KVCRLRKSLYGLKQAPRCWFAKLTSALKNYGFIQDISDYSLFIFHKNGVRLHVL-VYVDD 1172
Query: 887 IILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEET 946
+I+TG I + K L+ F +KDLG L+YFLG+EVARS +GI + QRKY LD++ ET
Sbjct: 1173 LIITGTTIAVITEFKHYLSSCFYMKDLGILRYFLGIEVARSPEGIYLCQRKYALDIITET 1232
Query: 947 GMSGCRPADTPMELNAKL-WEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFM 1005
G+ G +PA P++ N KL + G D RY+RLVG++IYLA TRP++++ + ++SQFM
Sbjct: 1233 GLLGVKPASFPLDQNHKLAFATGETIDDPLRYRRLVGRIIYLATTRPELSYVIHILSQFM 1292
Query: 1006 HSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAY 1065
H+P H EA R++RYLKS+PG+G+ + +S + D+D+ +S TG+
Sbjct: 1293 HNPKPAHWEAALRVVRYLKSSPGQGILLRANTPLVLSAWCDSDFGACPHSDRSLTGWFIQ 1352
Query: 1066 VWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSK 1125
+ G+ ++W+++KQ VV+RSSAEAE+RAMA+ + E++WI++LL L + P L DS
Sbjct: 1353 LGGSPLSWKTQKQNVVSRSSAEAEYRAMAETVSEIIWIRELLPALGIPCTAPTTLHSDSL 1412
Query: 1126 AAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFER 1185
+AIS+A NPV H RTKH+ D HFI++++V+GTI +V++ Q ADILTK+L R F
Sbjct: 1413 SAISLAANPVYHARTKHVRRDVHFIRDELVNGTIATKHVSTTSQLADILTKALGRKEFAD 1472
Query: 1186 LIVKLGMTNIYAP 1198
+ KLG+ N++ P
Sbjct: 1473 FLAKLGICNLHIP 1485
Score = 134 bits (338), Expect = 1e-29
Identities = 110/452 (24%), Positives = 183/452 (40%), Gaps = 68/452 (15%)
Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMES 160
L G NY EW+ +RL L + K GF G I P TDP++ W + N+ +++W+ T++
Sbjct: 42 LRGPNYDEWATNLRLALKARKKFGFADGSIPQPVETDPDFEDWTANNALVVSWMKLTIDE 101
Query: 161 TIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMALW 220
T+ L + +W ++ + ++N R+ LK+ L +Q + TYY L LW
Sbjct: 102 TVSTSMSHLDDSHELWTHIQKRFG-VKNGQRVQRLKTELATCRQKGVAIETYYGRLSQLW 160
Query: 221 QELDLCYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNCL-DEVRGRILGRIPLPTLQET 279
+ L D T D V K +E D++ L GL+ + V+ +L R+PLP+L+E
Sbjct: 161 RSL---ADYQQAKTMDDV--RKEREEDKLHQFLMGLDESVYGAVKSALLSRVPLPSLEEA 215
Query: 280 FSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDHCNRQWHTRDTCWKL 339
++ + ++E +S+ + + + + + + + C +C R H + C+KL
Sbjct: 216 YNALTQDEESKSLSRLHNERV-DGVSFAVQTTSRPRDSSENRVCSNCGRVGHLAEQCFKL 274
Query: 340 HGKPP------NWKKKGGKEGRALQATTSDQEH-QSSSSSFPFTKEQLDQLYKMFGSQTP 392
G PP K L + Q H + SS + + + +P
Sbjct: 275 IGYPPWLEEKLRLKNTASSSRGGLSSFKGKQSHGRGSSINHVASSGMAANVVTNSSLTSP 334
Query: 393 SCSIAQIG---------NFPNTALVSVKPS-----------PTWIIDSGATDHMTGESSL 432
S +IG T L K + +WIIDSGAT+HMTG +
Sbjct: 335 LTSDDRIGLSGLNDSQWKILQTILEERKSTSNDHQSGKYFLESWIIDSGATNHMTGSLAF 394
Query: 433 FASYSPCA--------GNHKIKIADGSLSAIAGKDCQANFF----HSH------------ 468
+ G GS+ + D Q F H H
Sbjct: 395 LRNVCDMPPVLIKLPDGRFTTATKQGSVQLGSSLDLQDVLFVDGLHCHLISVSQLTRTRR 454
Query: 469 ---------CIFKDLNTGKMIGSAKKSGGLYY 491
CI +D T +IG+ ++ GLY+
Sbjct: 455 CIFQITDKVCIVQDRTTLMLIGAGRELNGLYF 486
>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301694|pir||E84535 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1454
Score = 506 bits (1303), Expect = e-141
Identities = 260/589 (44%), Positives = 387/589 (65%), Gaps = 9/589 (1%)
Query: 615 NDPNHHPNPGKSSIPKCRGKSSSITTSDDPDLHIPIAIRKPVRSCTKHPMAKFVSYSNLS 674
+D H P+ S I + SS P H+ ++S K+P++ +SYS +S
Sbjct: 866 SDTTHSPSSLPSQISDLPPQISSQRVRKPP-AHLNDYHCNTMQSDHKYPISSTISYSKIS 924
Query: 675 SSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKIMTLPAGKNTVGCK 734
S + + ++ + IP N EA +W EAV E+ A+EK TW+I TLP GK VGCK
Sbjct: 925 PSHMCYINNITKIPIPTNYAEAQDTKEWCEAVDAEIGAMEKTNTWEITTLPKGKKAVGCK 984
Query: 735 WVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRVLLSLAVNLDWPL 794
WVFT+K+ +D +ERYKARLVAKG+TQ G+DY++TF+PVAK+ T+++LL ++ + W L
Sbjct: 985 WVFTLKFLADGNLERYKARLVAKGYTQKEGLDYTDTFSPVAKMTTIKLLLKVSASKKWFL 1044
Query: 795 NQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGL-----NVCKLQKSLYGLKQSPRAWFEKF 849
QLDV NAFLNG+L+EE++M P G+ ++ G+ V +L++S+YGLKQ+ R WF+KF
Sbjct: 1045 KQLDVSNAFLNGELEEEIFMKIPEGYAERKGIVLPSNVVLRLKRSIYGLKQASRQWFKKF 1104
Query: 850 TWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKKNLAKEFE 909
+ S+ G+ + DHTLF++ DG+ I++VYVDDI++ +L + L + F+
Sbjct: 1105 SSSLLSLGFKKTHGDHTLFLKM-YDGEFVIVLVYVDDIVIASTSEAAAAQLTEELDQRFK 1163
Query: 910 IKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNAKL-WEKG 968
++DLG LKYFLG+EVAR+ GI + QRKY L+LL+ TGM C+P PM N K+ + G
Sbjct: 1164 LRDLGDLKYFLGLEVARTTAGISICQRKYALELLQSTGMLACKPVSVPMIPNLKMRKDDG 1223
Query: 969 NVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRYLKSNPG 1028
++ DI +Y+R+VGKL+YL TRPDI F+V+ + QF +P HL A YR+L+Y+K G
Sbjct: 1224 DLIEDIEQYRRIVGKLMYLTITRPDITFAVNKLCQFSSAPRTTHLTAAYRVLQYIKGTVG 1283
Query: 1029 KGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVARSSAEA 1088
+GL++ ++D + F D+DWA R+STT + +V +L++WRSKKQ V+RSSAEA
Sbjct: 1284 QGLFYSASSDLTLKGFADSDWASCQDSRRSTTSFTMFVGDSLISWRSKKQHTVSRSSAEA 1343
Query: 1089 EFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKHIEIDRH 1148
E+RA+A CE++W+ LL L+ +P+ L+ DS AAI IA NPV H+RTKHI++D H
Sbjct: 1344 EYRALALATCEMVWLFTLLVSLQASPPVPI-LYSDSTAAIYIATNPVFHERTKHIKLDCH 1402
Query: 1149 FIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYA 1197
++E++ +G + L +V + +Q ADILTK L FE L K+ + NI++
Sbjct: 1403 TVRERLDNGELKLLHVRTEDQVADILTKPLFPYQFEHLKSKMSILNIFS 1451
Score = 158 bits (399), Expect = 1e-36
Identities = 130/473 (27%), Positives = 211/473 (44%), Gaps = 84/473 (17%)
Query: 100 RLDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTME 159
RLD NY +WS + + LD K K GFI G + P +D N+R W NS + +WLL+++
Sbjct: 78 RLDETNYGDWSVAMLISLDAKNKTGFIDGTLSRPLESDLNFRLWSRCNSMVKSWLLNSVS 137
Query: 160 STIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMAL 219
I + + + A ++W + + + ++ N R ++L + +Q ++ YY L L
Sbjct: 138 PQIYRSILRMNDASDIWRDLNSRF-NVTNLPRTYNLTQEIQDFRQGTLSLSEYYTRLKTL 196
Query: 220 WQELDLCYDDNWRCTEDSVLFLKRQ-ENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQE 278
W +LD + CT + L+++ E ++ LAGLN VR +I+ + LP+L E
Sbjct: 197 WDQLDSTEALDEPCTCGKAMRLQQKAEQAKIVKFLAGLNESYAIVRRQIIAKKALPSLGE 256
Query: 279 TFSEVRREEARQSVM-------------MGKSASITESSALVTKGNEEGKRDGKKPFCDH 325
+ + ++ ++QS + +S S+ + V G +G +P C
Sbjct: 257 VYHILDQDNSQQSFSNVVAPPAAFQVSEITQSPSMDPTVCYVQNGPNKG-----RPICSF 311
Query: 326 CNRQWHTRDTCWKLHGKPPNWKKKGGKEGRALQ---------ATTSDQEHQSSSSSFPFT 376
NR H + C+K HG PP + K GK G LQ A +S+ S +
Sbjct: 312 YNRVGHIAERCYKKHGFPPGFTPK-GKAGEKLQKPKPLAANVAESSEVNTSLESMVGNLS 370
Query: 377 KEQLDQLYKMFGSQ---TP-----SCSIAQIGNF-----PNT----ALVSVK----PSPT 415
KEQL Q MF SQ TP + S +Q N P+T +++V S T
Sbjct: 371 KEQLQQFIAMFSSQLQNTPPSTYATASTSQSDNLGICFSPSTYSFIGILTVARHTLSSAT 430
Query: 416 WIIDSGATDHMTGESSLFASYS---------------PCAGNHKIKIADG---------- 450
W+IDSGAT H++ + SLF+S +G +K+ D
Sbjct: 431 WVIDSGATHHVSHDRSLFSSLDTSVLSAVNLPTGPTVKISGVGTLKLNDDILLKNVLFIP 490
Query: 451 -------SLSAIAGK-DCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLDNG 495
S+S++ + F + C +DL G+M+G ++ LY LD G
Sbjct: 491 EFRLNLISISSLTDDIGSRVIFDKNSCEIQDLIKGRMLGQGRRVANLYLLDVG 543
>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1491
Score = 506 bits (1302), Expect = e-141
Identities = 251/545 (46%), Positives = 369/545 (67%), Gaps = 2/545 (0%)
Query: 656 VRSCTKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEK 715
++ +++P+ ++ S+ F + ++ + PK+ +EA+K+ W +A+ +E+ ALE
Sbjct: 948 IQGNSQYPLTDYIFDECFSAGHKVFLAAITANDEPKHFKEAVKVKVWNDAMYKEVDALEV 1007
Query: 716 NKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVA 775
NKTW I+ LP GK +G +WV+ K+N+D TVERYKARLV +G Q G DY+ETFAPV
Sbjct: 1008 NKTWDIVDLPTGKVAIGSQWVYKTKFNADGTVERYKARLVVQGNNQIEGEDYTETFAPVV 1067
Query: 776 KLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLNVCKLQKSL 835
K+ TVR LL L W + Q+DV NAFL+GDL+EEVYM PPGF VC+L+KSL
Sbjct: 1068 KMTTVRTLLRLVAANQWEVYQMDVHNAFLHGDLEEEVYMKLPPGFRHSHPDKVCRLRKSL 1127
Query: 836 YGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIV 895
YGLKQ+PR WF+K + ++K+ G++Q D++ F +S G ++VYVDD+I+ G+D
Sbjct: 1128 YGLKQAPRCWFKKLSDALKRFGFIQGYEDYSFFS-YSCKGIELRVLVYVDDLIICGNDEY 1186
Query: 896 EMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPAD 955
+ + K+ L + F +KDLG LKYFLG+EV+R GI +SQRKY LD++ ++G G RPA
Sbjct: 1187 MVQKFKEYLGRCFSMKDLGKLKYFLGIEVSRGPDGIFLSQRKYALDIISDSGTLGARPAY 1246
Query: 956 TPMELNAKLW-EKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLE 1014
TP+E N L + G + D ++RLVG+L+YL HTRP++++SV V+SQFM +P E HLE
Sbjct: 1247 TPLEQNHHLASDDGPLLQDPKPFRRLVGRLLYLLHTRPELSYSVHVLSQFMQAPREAHLE 1306
Query: 1015 AVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWR 1074
A RI+RYLK +PG+G+ D + ++ D+D+ + R+S + Y + G+ ++W+
Sbjct: 1307 AAMRIVRYLKGSPGQGILLSSNKDLTLEVYCDSDFQSCPLTRRSLSAYVVLLGGSPISWK 1366
Query: 1075 SKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNP 1134
+KKQ V+ SSAEAE+RAM+ + E+ W+ KLL+EL + + P +LFCDSKAAISIA NP
Sbjct: 1367 TKKQDTVSHSSAEAEYRAMSVALKEIKWLNKLLKELGITLAAPTRLFCDSKAAISIAANP 1426
Query: 1135 VQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTN 1194
V H+RTKHIE D H +++ + G I +V ++EQ ADI TK+L R F L+ KLG+ N
Sbjct: 1427 VFHERTKHIERDCHSVRDAVRDGIITTHHVRTSEQLADIFTKALGRNQFIYLMSKLGIQN 1486
Query: 1195 IYAPT 1199
++ PT
Sbjct: 1487 LHTPT 1491
Score = 140 bits (354), Expect = 2e-31
Identities = 115/460 (25%), Positives = 194/460 (42%), Gaps = 78/460 (16%)
Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMES 160
L G NY EWS + L K K GFI G I P +P+Y W++ NS I+ W+ +++E
Sbjct: 44 LTGDNYNEWSTEMLNALQAKRKTGFINGSISKPPLDNPDYENWQAVNSMIVGWIRASIEP 103
Query: 161 TIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMALW 220
+K F+ A +W +K +S + N R+ +K++L +Q + V YY L LW
Sbjct: 104 KVKSTVTFISDAHQLWSELKQRFS-VGNKVRVHQIKAQLAACRQDGQPVIDYYGRLCKLW 162
Query: 221 QELDLCYDDN----WRCTEDSVLF-LKRQENDRVFVLLAGLNNC-LDEVRGRILGRIPLP 274
+E + CT + L K +E +++ + GL++ + ++ P P
Sbjct: 163 EEFQIYKPITVCKCGLCTCGATLEPSKEREEEKIHQFVLGLDDSRFGGLSATLIAMDPFP 222
Query: 275 TLQETFSEVRREEARQSVMMGKSAS------ITESSALVTKGNEEG---KRDGKKPFCDH 325
+L E +S V REE R + + + +T S + G + K + C H
Sbjct: 223 SLGEIYSRVVREEQRLASVQIREQQQSAIGFLTRQSEVTADGRTDSSIIKSRDRSVLCSH 282
Query: 326 CNRQWHTRDTCWKLHGKPPNWKKK---GGK----------------EGRALQATTSDQEH 366
C R H + CW++ G P W ++ GG+ GR T+
Sbjct: 283 CGRSGHEKKDCWQIVGFPDWWTERTNGGGRGSSSRGRGGRSSGSNNSGRGRGQVTAAHAT 342
Query: 367 QSSSSSFP-FTKEQLDQLYKMFGSQTPSCSIAQIGNFPNTALVSVKPSPTWIIDSGATDH 425
S+ SSFP FT +QL + +M ++ S G + I+D+GA+ H
Sbjct: 343 TSNLSSFPEFTPDQLRVITQMIQNKNNGTSDKLSGKMKLGDV---------ILDTGASHH 393
Query: 426 MTGESSLFA-----------------SYSPCAGNHK------------IKIADGSLSAIA 456
MTG+ SL +++ G K + + SL +++
Sbjct: 394 MTGQLSLLTNIVTIPSCSVGFADDRKTFAISMGTFKLSETVSLSNVLYVPALNCSLISVS 453
Query: 457 GK----DCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYL 492
C A F + C+ +D + +IG+ ++ G+YYL
Sbjct: 454 KLVKQIKCLALFTDTICVLQDRFSRTLIGTGEERDGVYYL 493
>gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301698|pir||C84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1501
Score = 504 bits (1297), Expect = e-141
Identities = 251/538 (46%), Positives = 365/538 (67%), Gaps = 2/538 (0%)
Query: 663 PMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKIM 722
P+ +VS + SSS A+ + ++ PK+ +EA++I W +A+ E+ ALE NKTW I+
Sbjct: 965 PLTDYVSDAAFSSSHRAYLAAITDNVEPKHFKEAVQIKVWNDAMFTEVDALEINKTWDIV 1024
Query: 723 TLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRV 782
LP GK +G +WVF KYNSD TVERYKARLV +G Q G DY ETFAPV ++ TVR
Sbjct: 1025 DLPPGKVAIGSQWVFKTKYNSDGTVERYKARLVVQGNKQVEGEDYKETFAPVVRMTTVRT 1084
Query: 783 LLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLNVCKLQKSLYGLKQSP 842
LL W + Q+DV NAFL+GDL+EEVYM PPGF VC+L+KSLYGLKQ+P
Sbjct: 1085 LLRNVAANQWEVYQMDVHNAFLHGDLEEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAP 1144
Query: 843 RAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKK 902
R WF+K + S+ + G++Q+ D++LF N+ ++ +LI YVDD+++ G+D + + K
Sbjct: 1145 RCWFKKLSDSLLRFGFVQSYEDYSLFSYTRNNIELRVLI-YVDDLLICGNDGYMLQKFKD 1203
Query: 903 NLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNA 962
L++ F +KDLG LKYFLG+EV+R +GI +SQRKY LD++ ++G G RPA TP+E N
Sbjct: 1204 YLSRCFSMKDLGKLKYFLGIEVSRGPEGIFLSQRKYALDVIADSGNLGSRPAHTPLEQNH 1263
Query: 963 KLW-EKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILR 1021
L + G + D Y+RLVG+L+YL HTRP++++SV V++QFM +P E H +A R++R
Sbjct: 1264 HLASDDGPLLSDPKPYRRLVGRLLYLLHTRPELSYSVHVLAQFMQNPREAHFDAALRVVR 1323
Query: 1022 YLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVV 1081
YLK +PG+G+ D + ++ D+DW + R+S + Y + G+ ++W++KKQ V
Sbjct: 1324 YLKGSPGQGILLNADPDLTLEVYCDSDWQSCPLTRRSISAYVVLLGGSPISWKTKKQDTV 1383
Query: 1082 ARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTK 1141
+ SSAEAE+RAM+ + E+ W++KLL+EL ++ P +L+CDSKAAI IA NPV H+RTK
Sbjct: 1384 SHSSAEAEYRAMSYALKEIKWLRKLLKELGIEQSTPARLYCDSKAAIHIAANPVFHERTK 1443
Query: 1142 HIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYAPT 1199
HIE D H +++ + G I +V + EQ AD+ TK+L R F L+ KLG+ N++ PT
Sbjct: 1444 HIESDCHSVRDAVRDGIITTQHVRTTEQLADVFTKALGRNQFLYLMSKLGVQNLHTPT 1501
Score = 139 bits (350), Expect = 6e-31
Identities = 118/467 (25%), Positives = 193/467 (41%), Gaps = 80/467 (17%)
Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMES 160
L+G NY +W+ + L K K GFI G I P DPNY W + NS I+ W+ +++E
Sbjct: 49 LNGDNYNQWATEMLNALQAKRKTGFINGTIPRPPPNDPNYENWTAVNSMIVGWIRTSIEP 108
Query: 161 TIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMALW 220
+K F+ A +W +K +S + N RI ++++L +Q + V YY L LW
Sbjct: 109 KVKATVTFISDAHLLWKDLKQRFS-VGNKVRIHQIRAQLSSCRQDGQAVIEYYGRLSNLW 167
Query: 221 QELDL------CYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNC-LDEVRGRILGRIPL 273
+E ++ C RC S K +E +++ + GL+ + ++ PL
Sbjct: 168 EEYNIYKPVTVCTCGLCRCGATSEP-TKEREEEKIHQFVLGLDESRFGGLCATLINMDPL 226
Query: 274 PTLQETFSEVRREEARQS---VMMGKSASI-----------------TESSALVTKGNEE 313
P+L E +S V REE R + V K ++ + S + T G+
Sbjct: 227 PSLGEIYSRVIREEQRLASVHVREQKEEAVGFLARREQLDHHSRVDASSSRSEHTGGSRS 286
Query: 314 GKRDGKKPFCDHCNRQWHTRDTCWKLHGKPPNWKK--------------KGGKEGRALQA 359
+ C +C R H + CW++ G P W + +G GR
Sbjct: 287 NSIIKGRVTCSNCGRTGHEKKECWQIVGFPDWWSERNGGRGSNGRGRGGRGSNGGRGQGQ 346
Query: 360 TTSDQEHQSSSSSFP-FTKEQLDQLYKMFGSQTPSCSIAQIGNFPNTALVSVKPSPTWII 418
+ S+SS FP FT+E + L ++ ++ S S + N + L I+
Sbjct: 347 VMAAHATSSNSSVFPEFTEEHMRVLSQLVKEKSNSGSTS---NNNSDRLSGKTKLGDIIL 403
Query: 419 DSGATDHMTGESSLFASYSP--------CAGNHKIKIADGSLS----------------- 453
DSGA+ HMTG S + P G+ ++ G L+
Sbjct: 404 DSGASHHMTGTLSSLTNVVPVPPCPVGFADGSKAFALSVGVLTLSNTVSLTNVLFVPSLN 463
Query: 454 --------AIAGKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYL 492
+ C A F + C +D ++ +IGS ++ GG+YYL
Sbjct: 464 CTLISVSKLLKQTQCLATFTDTLCFLQDRSSKTLIGSGEERGGVYYL 510
>gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm, score:
11.19) [Arabidopsis thaliana] gi|7486705|pir||T01879
hypothetical protein F8M12.17 - Arabidopsis thaliana
Length = 1633
Score = 503 bits (1295), Expect = e-140
Identities = 261/596 (43%), Positives = 386/596 (63%), Gaps = 34/596 (5%)
Query: 603 PILQPCQESE-PRNDPNHHPNPGKSSIPKCRGKSSSITTS-DDPDLHIPIAIRKPVRSCT 660
PI +P + ++ P +H N S+P S + +TS + P IP P + T
Sbjct: 859 PIARPKRNAKAPAYLSEYHCN----SVPFLSSLSPTTSTSIETPSSSIP-----PKKITT 909
Query: 661 KHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWK 720
+PM+ +SY L+ F ++ + PK +A+K KW A EE+ ALE+NKTW
Sbjct: 910 PYPMSTAISYDKLTPLFHSYICAYNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWI 969
Query: 721 IMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTV 780
+ +L GKN VGCKWVFT+KYN D ++ERYKARLVA+GFTQ GIDY ETF+PVAK +V
Sbjct: 970 VESLTEGKNVVGCKWVFTIKYNPDGSIERYKARLVAQGFTQQEGIDYMETFSPVAKFGSV 1029
Query: 781 RVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLN-----VCKLQKSL 835
++LL LA W L Q+DV NAFL+G+L EE+YM P G+ G++ VC+L KSL
Sbjct: 1030 KLLLGLAAATGWSLTQMDVSNAFLHGELDEEIYMSLPQGYTPPTGISLPSKPVCRLLKSL 1089
Query: 836 YGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIV 895
YGLKQ+ R W+++ + ++Q+ +D+T+F++ S I +++VYVDD+++ +D
Sbjct: 1090 YGLKQASRQWYKRLSSVFLGANFIQSPADNTMFVKVSCT-SIIVVLVYVDDLMIASNDSS 1148
Query: 896 EMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPAD 955
++ LK+ L EF+IKDLG ++FLG+E+ARS +GI V QRKY +LLE+ G+SGC+P+
Sbjct: 1149 AVENLKELLRSEFKIKDLGPARFFLGLEIARSSEGISVCQRKYAQNLLEDVGLSGCKPSS 1208
Query: 956 TPMELNAKLW-EKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLE 1014
PM+ N L E G + + Y+ LVG+L+YL TRPDI F+V +SQF+ +P + H++
Sbjct: 1209 IPMDPNLHLTKEMGTLLPNATSYRELVGRLLYLCITRPDITFAVHTLSQFLSAPTDIHMQ 1268
Query: 1015 AVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWR 1074
A +++LRYLK NPG+ DADW R+S TG+C Y+ +L+TW+
Sbjct: 1269 AAHKVLRYLKGNPGQ----------------DADWGTCKDSRRSVTGFCIYLGTSLITWK 1312
Query: 1075 SKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNP 1134
SKKQ VV+RSS E+E+R++AQ CE++W+Q+LL++L + + P KLFCD+K+A+ +A NP
Sbjct: 1313 SKKQSVVSRSSTESEYRSLAQATCEIIWLQQLLKDLHVTMTCPAKLFCDNKSALHLATNP 1372
Query: 1135 VQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKL 1190
V H+RTKHIEID H ++++I +G + +V + Q ADILTK L F L+ ++
Sbjct: 1373 VFHERTKHIEIDCHTVRDQIKAGKLKTLHVPTGNQLADILTKPLHPGPFHSLLKRI 1428
Score = 141 bits (356), Expect = 1e-31
Identities = 113/452 (25%), Positives = 196/452 (43%), Gaps = 82/452 (18%)
Query: 105 NYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMESTIKK 164
++ W +++ + L+ + KLGFI G I P +Y W N T+ WL++++ I +
Sbjct: 57 DFHSWRRSIWMALNVRNKLGFIDGTIVKPPLDHRDYGAWSRCNDTVSTWLMNSVSKKIGQ 116
Query: 165 PYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMALWQE-- 222
+F+PTA+ +W + + + ++ R++D++ RL + +Q D++ YY EL LW+E
Sbjct: 117 SLLFIPTAEGIWKNMLSRFKQ-DDAPRVYDIEQRLSKIEQGSMDISAYYTELQTLWEEHK 175
Query: 223 --LDLCYDDNWRCTEDSVLFLKR-QENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQET 279
+DL RC D+ + +R Q+ V L GLN ++ R IL P+ T++E
Sbjct: 176 NYVDLPVCTCGRCECDAAVKWERLQQRSHVTKFLMGLNESYEQTRRHILMLKPIRTIEEA 235
Query: 280 FSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDHCNRQWHTRDTCWKL 339
F+ V ++E ++++ + + L K P C +C + HT C+K+
Sbjct: 236 FNIVTQDERQKAIR--PTPKVDNQDQL------------KLPLCTNCGKVGHTVQKCYKI 281
Query: 340 HGKPPNWKKKGGKEGRALQATTSDQEHQSSSSSFPFTKEQLDQLYKMFGSQ--------- 390
G PP +K +Q Q Q S P ++ + L F +Q
Sbjct: 282 IGYPPGYKAATSYRQPQIQTQPRMQMPQQSQ---PRMQQPIQHLISQFNAQVRVQEPAAT 338
Query: 391 -----TPSCSIAQIG-----------NFPNT-----------------ALVSVKPSPTWI 417
+P+ +I + G FP+T +L +V S WI
Sbjct: 339 SIYTSSPTATITEHGLMAQTSTSGTIPFPSTSLKYENNNLTFQNHTLSSLQNVLSSDAWI 398
Query: 418 IDSGATDHMTGESSLFASYSPCAGNHKIKIADGSLSAI--AGKDCQANFFHSHCI----- 470
IDSGA+ H+ + ++F +G + + +G+ AI G C + H +
Sbjct: 399 IDSGASSHVCSDLTMFRELIHVSG-VTVTLPNGTRVAITHTGTICITSTLILHNVLLVPD 457
Query: 471 FK---------DLNTGKMIGSAKKSGGLYYLD 493
FK +L G MIG K LY L+
Sbjct: 458 FKFNLISVCCLELTRGLMIGRGKTYNNLYILE 489
>gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|7444418|pir||T00499 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1496
Score = 500 bits (1288), Expect = e-139
Identities = 249/541 (46%), Positives = 370/541 (68%), Gaps = 3/541 (0%)
Query: 660 TKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTW 719
T +P++ F++ S S++ AF + + PK+ ++A+ I +W EA+ +E+ ALE N TW
Sbjct: 957 TLYPLSDFLTNSGYSANHIAFMAAILDSNEPKHFKDAILIKEWCEAMSKEIDALEANHTW 1016
Query: 720 KIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNT 779
I LP GK + KWV+ +KYNSD T+ER+KARLV G Q G+D+ ETFAPVAKL T
Sbjct: 1017 DITDLPHGKKAISSKWVYKLKYNSDGTLERHKARLVVMGNHQKEGVDFKETFAPVAKLTT 1076
Query: 780 VRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLNVCKLQKSLYGLK 839
VR +L++A DW ++Q+DV NAFL+GDL+EEVYM PPGF+ VC+L+KSLYGLK
Sbjct: 1077 VRTILAVAAAKDWEVHQMDVHNAFLHGDLEEEVYMRLPPGFKCSDPSKVCRLRKSLYGLK 1136
Query: 840 QSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDR 899
Q+PR WF K + +++ G+ Q+ D++LF N I ++VYVDD+I+ G+++ +DR
Sbjct: 1137 QAPRCWFSKLSTALRNIGFTQSYEDYSLF-SLKNGDTIIHVLVYVDDLIVAGNNLDAIDR 1195
Query: 900 LKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPME 959
K L K F +KDLG LKYFLG+EV+R G +SQRKY LD+++ETG+ GC+P+ P+
Sbjct: 1196 FKSQLHKCFHMKDLGKLKYFLGLEVSRGPDGFCLSQRKYALDIVKETGLLGCKPSAVPIA 1255
Query: 960 LNAKLWE-KGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYR 1018
LN KL G V + +Y+RLVG+ IYL TRPD++++V ++SQFM +P H EA R
Sbjct: 1256 LNHKLASITGPVFTNPEQYRRLVGRFIYLTITRPDLSYAVHILSQFMQAPLVAHWEAALR 1315
Query: 1019 ILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQ 1078
++RYLK +P +G++ + + ++ + D+D+ + R+S + Y Y+ + ++W++KKQ
Sbjct: 1316 LVRYLKGSPAQGIFLRSDSSLIINAYCDSDYNACPLTRRSLSAYVVYLGDSPISWKTKKQ 1375
Query: 1079 GVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHD 1138
V+ SSAEAE+RAMA + EL W++ LL++L + P+KL CDS+AAI IA NPV H+
Sbjct: 1376 DTVSYSSAEAEYRAMAYTLKELKWLKALLKDLGVHHSSPMKLHCDSEAAIHIAANPVFHE 1435
Query: 1139 RTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYAP 1198
RTKHIE D H +++ ++ I ++ + +Q AD+LTKSL RP FERL+ LG+T+ Y P
Sbjct: 1436 RTKHIESDCHKVRDAVLDKLITTEHIYTEDQVADLLTKSLPRPTFERLLSTLGVTD-YVP 1494
Query: 1199 T 1199
+
Sbjct: 1495 S 1495
Score = 151 bits (382), Expect = 1e-34
Identities = 107/360 (29%), Positives = 164/360 (44%), Gaps = 39/360 (10%)
Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMES 160
L NY EWS+ ++ L K KLGFI G I P+ DP W + NS I+ W+ ++++
Sbjct: 39 LKENNYAEWSEELQNFLRAKQKLGFIDGSIPKPAA-DPELSLWIAINSMIVGWIRTSIDP 97
Query: 161 TIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMALW 220
TI+ F+ A +W+ ++ +S + N R LK + Q + V YY L+ LW
Sbjct: 98 TIRSTVGFVSEASQLWENLRRRFS-VGNGVRKTLLKDEIAACTQDGQPVLAYYGRLIKLW 156
Query: 221 QELDLCYDDNWRCTEDSVLFL-KRQENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQET 279
+EL Y C ++ + K +E+DRV L GL++ +R I PLP L +
Sbjct: 157 EELQN-YKSGRECKCEAASDIEKEREDDRVHKFLLGLDSRFSSIRSSITDIEPLPDLYQV 215
Query: 280 FSEVRREEARQSVMMGKSASITESSALVTKGNEEGK-RDGKKPFCDHCNRQWHTRDTCWK 338
+S V REE + K TE+ + + + RD FC HCNR+ H C+
Sbjct: 216 YSRVVREEQNLNASRTKDVVKTEAIGFSVQSSTTPRFRDKSTLFCTHCNRKGHEVTQCFL 275
Query: 339 LHGKPPNWKKKGGKE------GRALQATTSDQEHQSSSSSFPFTK--------------- 377
+HG P W ++ +E GR S + SS P T+
Sbjct: 276 VHGYPDWWLEQNPQENQPSTRGRGSNGRGSSSGRGGNRSSAPTTRGRGRANNAQAAAPTV 335
Query: 378 -----EQLDQLYKMFGSQTPSCSIAQIGNFPNTALVSVKPSPTWIIDSGATDHMTGESSL 432
+Q+ QL + +Q PS S ++ NT L +ID+GA+ HMTG+ S+
Sbjct: 336 SGDGNDQIAQLISLLQAQRPSSSSERLSG--NTCLTD------GVIDTGASHHMTGDCSI 387
>gb|AAR24647.1| At2g23330 [Arabidopsis thaliana] gi|22655202|gb|AAM98191.1| unknown
protein [Arabidopsis thaliana]
Length = 776
Score = 500 bits (1288), Expect = e-139
Identities = 249/541 (46%), Positives = 370/541 (68%), Gaps = 3/541 (0%)
Query: 660 TKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTW 719
T +P++ F++ S S++ AF + + PK+ ++A+ I +W EA+ +E+ ALE N TW
Sbjct: 237 TLYPLSDFLTNSGYSANHIAFMAAILDSNEPKHFKDAILIKEWCEAMSKEIDALEANHTW 296
Query: 720 KIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNT 779
I LP GK + KWV+ +KYNSD T+ER+KARLV G Q G+D+ ETFAPVAKL T
Sbjct: 297 DITDLPHGKKAISSKWVYKLKYNSDGTLERHKARLVVMGNHQKEGVDFKETFAPVAKLTT 356
Query: 780 VRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLNVCKLQKSLYGLK 839
VR +L++A DW ++Q+DV NAFL+GDL+EEVYM PPGF+ VC+L+KSLYGLK
Sbjct: 357 VRTILAVAAAKDWEVHQMDVHNAFLHGDLEEEVYMRLPPGFKCSDPSKVCRLRKSLYGLK 416
Query: 840 QSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDR 899
Q+PR WF K + +++ G+ Q+ D++LF N I ++VYVDD+I+ G+++ +DR
Sbjct: 417 QAPRCWFSKLSTALRNIGFTQSYEDYSLF-SLKNGDTIIHVLVYVDDLIVAGNNLDAIDR 475
Query: 900 LKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPME 959
K L K F +KDLG LKYFLG+EV+R G +SQRKY LD+++ETG+ GC+P+ P+
Sbjct: 476 FKSQLHKCFHMKDLGKLKYFLGLEVSRGPDGFCLSQRKYALDIVKETGLLGCKPSAVPIA 535
Query: 960 LNAKLWE-KGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYR 1018
LN KL G V + +Y+RLVG+ IYL TRPD++++V ++SQFM +P H EA R
Sbjct: 536 LNHKLASITGPVFTNPEQYRRLVGRFIYLTITRPDLSYAVHILSQFMQAPLVAHWEAALR 595
Query: 1019 ILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQ 1078
++RYLK +P +G++ + + ++ + D+D+ + R+S + Y Y+ + ++W++KKQ
Sbjct: 596 LVRYLKGSPAQGIFLRSDSSLIINAYCDSDYNACPLTRRSLSAYVVYLGDSPISWKTKKQ 655
Query: 1079 GVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHD 1138
V+ SSAEAE+RAMA + EL W++ LL++L + P+KL CDS+AAI IA NPV H+
Sbjct: 656 DTVSYSSAEAEYRAMAYTLKELKWLKALLKDLGVHHSSPMKLHCDSEAAIHIAANPVFHE 715
Query: 1139 RTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYAP 1198
RTKHIE D H +++ ++ I ++ + +Q AD+LTKSL RP FERL+ LG+T+ Y P
Sbjct: 716 RTKHIESDCHKVRDAVLDKLITTEHIYTEDQVADLLTKSLPRPTFERLLSTLGVTD-YVP 774
Query: 1199 T 1199
+
Sbjct: 775 S 775
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.315 0.133 0.399
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,171,928,404
Number of Sequences: 2540612
Number of extensions: 102696103
Number of successful extensions: 993617
Number of sequences better than 10.0: 7121
Number of HSP's better than 10.0 without gapping: 5365
Number of HSP's successfully gapped in prelim test: 1998
Number of HSP's that attempted gapping in prelim test: 651344
Number of HSP's gapped (non-prelim): 97214
length of query: 1199
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1059
effective length of database: 507,674,714
effective search space: 537627522126
effective search space used: 537627522126
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 81 (35.8 bits)