
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0334a.1
(1167 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi... 503 e-140
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi... 498 e-139
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi... 493 e-137
emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7... 488 e-136
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t... 456 e-126
gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi... 454 e-126
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi... 454 e-125
dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis t... 451 e-125
emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis... 449 e-124
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop... 441 e-122
emb|CAB10526.1| retrotransposon like protein [Arabidopsis thalia... 438 e-121
gb|AAF79879.1| T7N9.5 [Arabidopsis thaliana] 432 e-119
gb|AAU89728.1| putative retroelement pol polyprotein-like [Solan... 424 e-117
gb|AAG50751.1| polyprotein, putative [Arabidopsis thaliana] gi|2... 421 e-116
gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thalia... 414 e-114
pir||E96608 probable retroelement polyprotein F25P12.89 [importe... 410 e-112
pir||F86470 probable retroelement polyprotein [imported] - Arabi... 340 1e-91
gb|AAL68641.1| polyprotein [Oryza sativa (japonica cultivar-group)] 329 3e-88
gb|AAT38747.1| putative polyprotein [Solanum demissum] 304 1e-80
dbj|BAB10743.1| retroelement pol polyprotein-like [Arabidopsis t... 297 2e-78
>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301701|pir||E84589 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1461
Score = 503 bits (1296), Expect = e-140
Identities = 308/877 (35%), Positives = 466/877 (53%), Gaps = 51/877 (5%)
Query: 61 SPYYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKFINGEIPIAVPGDANYE 120
SP+++HS ++P +++S L+ Y W+ +M+ L AKNK F++G +P + D N+
Sbjct: 64 SPFFLHSADHPGLSIISHRLDETTYGDWSVAMRISLDAKNKLGFVDGSLPRPLESDPNFR 123
Query: 121 AWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLVRIAELQNEIGNL 180
W RCN+++ SW+LNSV+ I SI+ + +A D WRDL DRF+ L R L EI +L
Sbjct: 124 LWSRCNSMVKSWLLNSVSPQIYRSILRLNDATDIWRDLFDRFNLTNLPRTYNLTQEIQDL 183
Query: 181 KQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAIRFLLGL 240
+Q T+S+++YYT +KTLW++L+ + PC C E+ ++FL GL
Sbjct: 184 RQGTMSLSEYYTLLKTLWDQLDSTEALDD-----PCTCGKAVRLYQKAEKAKIMKFLAGL 238
Query: 241 NETFSVVNSQILMSNPLPPIAKVVSLAMQHERQSETGENEESKSLVKMAEGKKTYGKGKA 300
NE++++V QI+ LP +A+V + Q Q + +++E ++ +
Sbjct: 239 NESYAIVRRQIIAKKALPSLAEVYHILDQDNSQKGFFNVVAPPAAFQVSE--VSHSPITS 296
Query: 301 PGSSSSGSGYKSTGKYCTHCKKPGHTVDVCYRLHGYPT--TSKSKPAFNNVSHINNIT*G 358
P SG C+ C + GH + CY+ HG+P T K K +
Sbjct: 297 PEIMYVQSGPNKGRPTCSFCNRVGHIAERCYKKHGFPPGFTPKGKSSDKPPKPQAVAAQV 356
Query: 359 YDSEDDQEESSKSQRGNGDLFTADQYKTIMAMIQ-----QVTA--TSASQKHTESGKAFA 411
S D ++ GN F+ DQ + ++A+ Q+ + T++SQ S ++ A
Sbjct: 357 TLSPDKMTGQLETLAGN---FSPDQIQNLIALFSSQLQPQIVSPQTASSQHEASSSQSVA 413
Query: 412 N---LVTKSAVCGANTAGNGKHSTLSYSSQYDSRDWIIDSGASDHICFNKLCFDTLNRIK 468
L + S C H++LS S W+IDSGA+ H+ ++ F TL+
Sbjct: 414 PSGILFSPSTYCFIGILAVS-HNSLS------SDTWVIDSGATHHVSHDRKLFQTLDTSI 466
Query: 469 PVSVRLPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVVFGEF 528
V LP G +++ G V I K I+L +VL++PEF NLIS+ L V+F
Sbjct: 467 VSFVNLPTGPNVRISGVGTVLINKDIILQNVLFIPEFRLNLISISSLTTDLGTRVIFDPS 526
Query: 529 TCFVQEKHTKKRIGLGSLKEDDLYHLVVTSPSSTCFAIPNVVERISDSDQLIPPGALWHF 588
C +Q+ +G G + +LY L SP+ + A+ +V ++WH
Sbjct: 527 CCQIQDLTKGLTLGEGK-RIGNLYVLDTQSPAISVNAVVDV--------------SVWHK 571
Query: 589 RLGHLSHDRILALNALYPSI--DVSKHFVCDICHLAKLKRKMFPDSLHNAKCNFDLLHMD 646
RLGH S R+ +L+ + + K C +CHLAK K+ FP + + F+LLH+D
Sbjct: 572 RLGHPSFSRLDSLSEVLGTTRHKNKKSAYCHVCHLAKQKKLSFPSANNICNSTFELLHID 631
Query: 647 IWGPISVSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKTQFSHHVKVI 706
+WGP SV +V ++YFLT++DDHSR W+ LLK K +V F+ LV+ Q+ VK +
Sbjct: 632 VWGPFSVETVEGYKYFLTIVDDHSRATWIYLLKSKSDVLTVFPAFIDLVENQYDTRVKSV 691
Query: 707 RSDNGPEFMLHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIARALLFQSHLPKKMW 766
RSDN E +FY A GI+ SC TP+QN VERKHQHILN+ARAL+FQS++ W
Sbjct: 692 RSDNAKELAFTEFYKAKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSNMSLPYW 751
Query: 767 GYAILHAVFLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRSKLDS 826
G +L AVFL++R S +L NK+P+E+L GK D +L+ FG LC++STSS R K
Sbjct: 752 GDCVLTAVFLINRTPSALLSNKTPFEVL-TGKLPDYSQLKTFGCLCYSSTSSKQRHKFLP 810
Query: 827 RARKCVFLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILPYKSATSTAWTCLDLEVKDQ 886
R+R CVFLG+ G KG+ LLDL S+ + +SR+V FHE++ P S+ +A T D+
Sbjct: 811 RSRACVFLGYPFGFKGYKLLDLESNVVHISRNVEFHEELFPLASSQQSATTASDVFTPMD 870
Query: 887 SSSSLNIPETTSAAQELISDNENFSN---T*LPCHTQ 920
SS N T+ IS + S T P H Q
Sbjct: 871 PLSSGN-SITSHLPSPQISPSTQISKRRITKFPAHLQ 906
>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301694|pir||E84535 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1454
Score = 498 bits (1283), Expect = e-139
Identities = 299/873 (34%), Positives = 456/873 (51%), Gaps = 66/873 (7%)
Query: 12 GIRALTRVLPSQSSATMVRQASNNNNNGVQQAHGVQQPHVVLEPAQNPISPYYIHSGENP 71
G ++TR S + T + N +G +A + +P Q SP+++HS ++P
Sbjct: 17 GTSSVTRKSRSTGAVTTPPNSPPVNRSGASRALTSSESG---DPTQ---SPFFLHSADHP 70
Query: 72 SATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKFINGEIPIAVPGDANYEAWDRCNNLIHS 131
++S L+ NY W+ +M L AKNK FI+G + + D N+ W RCN+++ S
Sbjct: 71 GLNIISHRLDETNYGDWSVAMLISLDAKNKTGFIDGTLSRPLESDLNFRLWSRCNSMVKS 130
Query: 132 WILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLVRIAELQNEIGNLKQNTLSVNDYY 191
W+LNSV+ I SI+ + +A D WRDL RF+ L R L EI + +Q TLS+++YY
Sbjct: 131 WLLNSVSPQIYRSILRMNDASDIWRDLNSRFNVTNLPRTYNLTQEIQDFRQGTLSLSEYY 190
Query: 192 TEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAIRFLLGLNETFSVVNSQI 251
T +KTLW++L+ + + PC C + EQ ++FL GLNE++++V QI
Sbjct: 191 TRLKTLWDQLDSTEALDE-----PCTCGKAMRLQQKAEQAKIVKFLAGLNESYAIVRRQI 245
Query: 252 LMSNPLPPIAKVVSLAMQHERQSETGENEESKSLVKMAEGKKTYGKGKAPGSSSSGSGYK 311
+ LP + +V + Q Q + +++E T P +G
Sbjct: 246 IAKKALPSLGEVYHILDQDNSQQSFSNVVAPPAAFQVSE--ITQSPSMDPTVCYVQNGPN 303
Query: 312 STGKYCTHCKKPGHTVDVCYRLHGYPT-----------TSKSKPAFNNVSHINNIT*GYD 360
C+ + GH + CY+ HG+P K KP NV+ + +
Sbjct: 304 KGRPICSFYNRVGHIAERCYKKHGFPPGFTPKGKAGEKLQKPKPLAANVAESSEVN---- 359
Query: 361 SEDDQEESSKSQRGNGDLFTADQYKTIMAMIQ-QVTATSASQKHTESGKAFANLVTKSAV 419
S +S GN + +Q + +AM Q+ T S T S NL +
Sbjct: 360 ------TSLESMVGN---LSKEQLQQFIAMFSSQLQNTPPSTYATASTSQSDNL----GI 406
Query: 420 CGANTAGNGKHSTLSYSSQYDSRDWIIDSGASDHICFNKLCFDTLNRIKPVSVRLPNGIS 479
C + + + S W+IDSGA+ H+ ++ F +L+ +V LP G +
Sbjct: 407 CFSPSTYSFIGILTVARHTLSSATWVIDSGATHHVSHDRSLFSSLDTSVLSAVNLPTGPT 466
Query: 480 IQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVVFGEFTCFVQEKHTKK 539
++ G + + ILL +VL++PEF NLIS+ L V+F + +C +Q+ +
Sbjct: 467 VKISGVGTLKLNDDILLKNVLFIPEFRLNLISISSLTDDIGSRVIFDKNSCEIQDLIKGR 526
Query: 540 RIGLGSLKEDDLYHLVVTSPSSTCFAIPNVVERISDSDQLIPPGALWHFRLGHLSHDRIL 599
+G G + +LY L V S + A+ ++ ++WH RLGH S R
Sbjct: 527 MLGQGR-RVANLYLLDVGDQSISVNAVVDI--------------SMWHRRLGHASLQR-- 569
Query: 600 ALNALYPSIDVSKHF-----VCDICHLAKLKRKMFPDSLHNAKCNFDLLHMDIWGPISVS 654
L+A+ S+ ++H C +CHLAK ++ FP S K FDLLH+D+WGP SV
Sbjct: 570 -LDAISDSLGTTRHKNKGSDFCHVCHLAKQRKLSFPTSNKVCKEIFDLLHIDVWGPFSVE 628
Query: 655 SVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKTQFSHHVKVIRSDNGPEF 714
+V ++YFLT++DDHSR W+ LLK K EV F+ V+ Q+ VK +RSDN PE
Sbjct: 629 TVEGYKYFLTIVDDHSRATWMYLLKTKSEVLTVFPAFIQQVENQYKVKVKAVRSDNAPEL 688
Query: 715 MLHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIARALLFQSHLPKKMWGYAILHAV 774
FY+ GI+ SC TP+QN VERKHQHILN+ARAL+FQS +P +WG +L AV
Sbjct: 689 KFTSFYAEKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSQVPLSLWGDCVLTAV 748
Query: 775 FLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRSKLDSRARKCVFL 834
FL++R S++L NK+PYEIL G + ++LR FG LC++STS R K R+R C+FL
Sbjct: 749 FLINRTPSQLLMNKTPYEILTGTAPV-YEQLRTFGCLCYSSTSPKQRHKFQPRSRACLFL 807
Query: 835 GFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILP 867
G+ G KG+ L+DL S+ +F+SR+V FHE++ P
Sbjct: 808 GYPSGYKGYKLMDLESNTVFISRNVQFHEEVFP 840
>gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301698|pir||C84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1501
Score = 493 bits (1268), Expect = e-137
Identities = 286/827 (34%), Positives = 439/827 (52%), Gaps = 49/827 (5%)
Query: 60 ISPYYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKFINGEIPIAVPGDANY 119
+SPY + S +NP A + S+ LNG NYN WA M L AK K FING IP P D NY
Sbjct: 29 VSPYTLASSDNPGAVISSVELNGDNYNQWATEMLNALQAKRKTGFINGTIPRPPPNDPNY 88
Query: 120 EAWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLVRIAELQNEIGN 179
E W N++I WI S+ + ++ F+ +A W+DL+ RFS G VRI +++ ++ +
Sbjct: 89 ENWTAVNSMIVGWIRTSIEPKVKATVTFISDAHLLWKDLKQRFSVGNKVRIHQIRAQLSS 148
Query: 180 LKQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAIRFLLG 239
+Q+ +V +YY + LWEE Y+P+ C C + CRC A RE++ +F+LG
Sbjct: 149 CRQDGQAVIEYYGRLSNLWEEYNIYKPVTVCTCGL-CRCGATSEPTKEREEEKIHQFVLG 207
Query: 240 LNET-FSVVNSQILMSNPLPPIAKVVSLAMQHERQSET----GENEESKSLVKMAEGKKT 294
L+E+ F + + ++ +PLP + ++ S ++ E++ + + EE+ + E
Sbjct: 208 LDESRFGGLCATLINMDPLPSLGEIYSRVIREEQRLASVHVREQKEEAVGFLARREQLDH 267
Query: 295 YGKGKAPGSSSSGSGYKSTGKY------CTHCKKPGHTVDVCYRLHGYPTTSKSKPAFNN 348
+ + A S S +G + C++C + GH C+++ G+P +
Sbjct: 268 HSRVDASSSRSEHTGGSRSNSIIKGRVTCSNCGRTGHEKKECWQIVGFPDWWSER----- 322
Query: 349 VSHINNIT*GYDSEDDQEESSKSQRGNGDLFTADQYKTIMAMIQQVTATSASQKHTESGK 408
N G + S RG G + A + ++ + T E +
Sbjct: 323 -----NGGRGSNGRGRGGRGSNGGRGQGQVMAAHATSSNSSVFPEFT--------EEHMR 369
Query: 409 AFANLVTKSAVCGANTAGNGKHSTLSYSSQYDSRDWIIDSGASDHICFNKLCFDTLNRIK 468
+ LV + + G+ + N S + D I+DSGAS H+ + +
Sbjct: 370 VLSQLVKEKSNSGSTSNNNSDR----LSGKTKLGDIILDSGASHHMTGTLSSLTNVVPVP 425
Query: 469 PVSVRLPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVVFGEF 528
P V +G GV++++ + LT+VL+VP LISV KL K+ +C F +
Sbjct: 426 PCPVGFADGSKAFALSVGVLTLSNTVSLTNVLFVPSLNCTLISVSKLLKQTQCLATFTDT 485
Query: 529 TCFVQEKHTKKRIGLGSLKEDDLYHLVVTSPSSTCFAIPNVVERISDSDQLIPPGALWHF 588
CF+Q++ +K IG G + +Y+L +P+ A NV DSDQ ALWH
Sbjct: 486 LCFLQDRSSKTLIGSGE-ERGGVYYLTDVTPAKIHTA--NV-----DSDQ-----ALWHQ 532
Query: 589 RLGHLSHDRILALNALYPSIDVSKHFVCDICHLAKLKRKMFPDSLHNAKCNFDLLHMDIW 648
RLGH S + +L + CD+C AK R++FP+S++ + F L+H D+W
Sbjct: 533 RLGHPSFSVLSSLPLFSKTSSTVTSHSCDVCFRAKQTREVFPESINKTEECFSLIHCDVW 592
Query: 649 GPISVSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKTQFSHHVKVIRS 708
GP V + YFLT++DD+SR VW LL K EV+Q + NF+ + QF VK++RS
Sbjct: 593 GPYRVPASCGAVYFLTIVDDYSRAVWTYLLLEKSEVRQVLTNFLKYAEKQFGKTVKMVRS 652
Query: 709 DNGPEFM-LHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIARALLFQSHLPKKMWG 767
DNG EFM L ++ +GI+HQ SCV TPQQNGRVERKH+HILN+ARALLFQ+ LP K WG
Sbjct: 653 DNGTEFMCLSSYFRENGIIHQTSCVGTPQQNGRVERKHRHILNVARALLFQASLPIKFWG 712
Query: 768 YAILHAVFLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRSKLDSR 827
+IL A +L++R S +L ++PYE+L G K + +LRVFGS C+ + + K R
Sbjct: 713 ESILTAAYLINRTPSSILSGRTPYEVLHGSKPV-YSQLRVFGSACYVHRVTRDKDKFGQR 771
Query: 828 ARKCVFLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILPYKSATST 874
+R C+F+G+ G KG+ + D+ +E VSRDV F E++ PY S+
Sbjct: 772 SRSCIFVGYPFGKKGWKVYDIERNEFLVSRDVIFREEVFPYAGVNSS 818
>emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7488558|pir||T14517
hypothetical protein 1 - wild cabbage transposon Melmoth
Length = 1131
Score = 488 bits (1256), Expect = e-136
Identities = 280/847 (33%), Positives = 440/847 (51%), Gaps = 78/847 (9%)
Query: 49 PHVVLEPAQNPISPYYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKFINGE 108
P ++P Q+P+ ++H+ ++P ++SL L+G NY+ W +MK L AKNK F++G
Sbjct: 4 PPESMDPNQSPL---FMHNADHPGLQLISLKLDGSNYDDWNAAMKIALDAKNKIGFVDGT 60
Query: 109 IPIAVPGDANYEAWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLV 168
+ D + W RCN+++ SW+LNSV+ I SI+ + +A D WRDL RF L
Sbjct: 61 LTRPDTSDPTFRLWSRCNSMVKSWLLNSVSPQIYRSILRLNDAADIWRDLHGRFHMTNLP 120
Query: 169 RIAELQNEIGNLKQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFR 228
R L EI +LKQ ++S++DYYT +KTLW+ LE PC C E +
Sbjct: 121 RTFNLTQEIQDLKQGSMSLSDYYTTLKTLWDNLESVDEPD-----TPCVCGNAEKLQKKV 175
Query: 229 EQDNAIRFLLGLNETFSVVNSQILMSNPLPPIAKVVSLAMQHERQSETGENEESKSLVKM 288
++ ++FL GLN++++++ QI+M LP + +V ++ Q + Q +
Sbjct: 176 DRAKIVKFLAGLNDSYAIIRRQIIMKKVLPSLVEVYNILDQDDSQKGFS--------TAI 227
Query: 289 AEGKKTYGKGKAPGSSSSGSGYKSTGK-----YCTHCKKPGHTVDVCYRLHGYPTTSKSK 343
+ P + +G Y TG C+ C + GH + CY+ HG+P SK
Sbjct: 228 TPAAFNVSENVPPPMAEAGICYVQTGPNKGRPICSFCNRVGHIAERCYKKHGFPPGFVSK 287
Query: 344 PAFNNVSHINNIT*GYDSEDDQEESSKSQRGNGDLFTADQYKTIMAMIQQVTATSASQKH 403
Y S+ + K ++ + + M + S++
Sbjct: 288 ---------------YKSQSSGDRLQKPKQVAAQVSFSPPNSGQSPMTMDHLVGNHSKEQ 332
Query: 404 TESGKAFANLVTKSAVCGANTAGNGK----HSTLSYSSQ--------------YDSRDWI 445
+ A + + G+N A + K +S +S++ + WI
Sbjct: 333 LQQFIALFSSQLPNVTMGSNEASSSKQPMDNSGISFNPTTLVFIGLLTVSRHTLANETWI 392
Query: 446 IDSGASDHICFNKLCFDTLNRIKPVSVRLPNGISIQTCYAGVVSITKQILLTHVLYVPEF 505
IDSGA+ H+C ++ + +++ +V LPNG+ ++ G+V + + I L +VLY+PEF
Sbjct: 393 IDSGATHHVCHDRSMYTSIDITTTSNVNLPNGMIVKISGVGIVQLNEHITLHNVLYIPEF 452
Query: 506 TYNLISVHKLAKRNRCEVVFGEFTCFVQEKHTKKRIGLGSLKEDDLYHLVVTSPSSTCFA 565
NL+S+ L +V+F +C +Q+ IG G + +LY L V S A
Sbjct: 453 RLNLLSISSLTSDIGSQVIFDVSSCAIQDPTKGWTIGQGR-RVANLYVLDVKSSPMKINA 511
Query: 566 IPNVVERISDSDQLIPPGALWHFRLGHLSHDRILALNALYPSIDVSKH-----FVCDICH 620
+ ++ +LWH RLGH S+ R L+ + ++ +KH C +CH
Sbjct: 512 VVDI--------------SLWHKRLGHPSYTR---LDKISEALGTTKHKNKGDAHCHVCH 554
Query: 621 LAKLKRKMFPDSLHNAKCNFDLLHMDIWGPISVSSVHSHRYFLTVLDDHSRFVWVILLKR 680
LAK K+ + H +F LLH+D+WGP SV ++ ++YFLT++DDHSR W+ LL+
Sbjct: 555 LAKQKKLSYSSQNHICTASFQLLHVDVWGPFSVETLEGYKYFLTIVDDHSRATWIYLLQS 614
Query: 681 KGEVQQ*IKNFVALVKTQFSHHVKVIRSDNGPEFMLHDFYSAHGILHQRSCVNTPQQNGR 740
K +V FV ++TQ++ +K +R DN PE + + GI+ SC T +QN
Sbjct: 615 KSDVLHIFPTFVNQIETQYNTKIKSVRRDNAPELSFTELFKEKGIVSYHSCPETLEQNSV 674
Query: 741 VERKHQHILNIARALLFQSHLPKKMWGYAILHAVFLMSRIQSKVLENKSPYEILFGGKKI 800
+ERKHQH+LN+ARAL+FQS +P + WG +L A FL++R S +L NKSPYE+L GK
Sbjct: 675 LERKHQHLLNVARALMFQSQVPLQYWGDCVLTAAFLINRTPSPLLANKSPYEVLM-GKAP 733
Query: 801 DLQELRVFGSLCFASTSSSSRSKLDSRARKCVFLGFKQGVKGFVLLDLNSHEIFVSRDVT 860
+LR FG LC+ STS R K R+R CVFLG+ G KG+ LLDL S++I++SR+VT
Sbjct: 734 QYDQLRTFGCLCYGSTSPKQRHKFMPRSRACVFLGYPSGYKGYKLLDLESNKIYISRNVT 793
Query: 861 FHEQILP 867
FHE I P
Sbjct: 794 FHEDIFP 800
>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1491
Score = 456 bits (1172), Expect = e-126
Identities = 279/832 (33%), Positives = 427/832 (50%), Gaps = 45/832 (5%)
Query: 41 QQAHGVQQPHVVLEPAQNPISPYYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKN 100
++ P +P +SPY + S +NP A + S+ L G NYN W+ M L AK
Sbjct: 5 EEVSSATHPRTNQQPDVTKVSPYTLASSDNPGAMISSVMLTGDNYNEWSTEMLNALQAKR 64
Query: 101 KFKFINGEIPIAVPGDANYEAWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRD 160
K FING I + +YE W N++I WI S+ + +++ F+ +A W +L+
Sbjct: 65 KTGFINGSISKPPLDNPDYENWQAVNSMIVGWIRASIEPKVKSTVTFISDAHQLWSELKQ 124
Query: 161 RFSQGVLVRIAELQNEIGNLKQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEA 220
RFS G VR+ +++ ++ +Q+ V DYY + LWEE + Y+PI C+C + C C A
Sbjct: 125 RFSVGNKVRVHQIKAQLAACRQDGQPVIDYYGRLCKLWEEFQIYKPITVCKCGL-CTCGA 183
Query: 221 IEHAKMFREQDNAIRFLLGLNET-FSVVNSQILMSNPLPPIAKVVSLAMQHERQSETGEN 279
RE++ +F+LGL+++ F +++ ++ +P P + ++ S ++ E++ + +
Sbjct: 184 TLEPSKEREEEKIHQFVLGLDDSRFGGLSATLIAMDPFPSLGEIYSRVVREEQRLASVQI 243
Query: 280 EESKSLVKMAEGKKTYGKGKAPGSSSSGSGYKSTGK--YCTHCKKPGHTVDVCYRLHGYP 337
E + + A G T + S KS + C+HC + GH C+++ G+P
Sbjct: 244 REQQ---QSAIGFLTRQSEVTADGRTDSSIIKSRDRSVLCSHCGRSGHEKKDCWQIVGFP 300
Query: 338 TTSKSKPAFNNVSHINNIT*GYDSEDDQEESSKSQRGNGDLFTADQYKTIMAMIQQVTAT 397
+ + G S S+ S RG G QVTA
Sbjct: 301 DWWTERTNGGGRGSSSRGRGGRSS-----GSNNSGRGRG----------------QVTAA 339
Query: 398 SASQKHTESGKAFANLVTKSAVCGANTAGNGKHSTLSYSSQYDSRDWIIDSGASDHICFN 457
A+ + S F + NG L S + D I+D+GAS H+
Sbjct: 340 HATTSNLSSFPEFTPDQLRVITQMIQNKNNGTSDKL--SGKMKLGDVILDTGASHHMTGQ 397
Query: 458 KLCFDTLNRIKPVSVRLPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHKLAK 517
+ I SV + G +++ + L++VLYVP +LISV KL K
Sbjct: 398 LSLLTNIVTIPSCSVGFADDRKTFAISMGTFKLSETVSLSNVLYVPALNCSLISVSKLVK 457
Query: 518 RNRCEVVFGEFTCFVQEKHTKKRIGLGSLKEDDLYHLVVTSPSSTCFAIPNVVERISDSD 577
+ +C +F + C +Q++ ++ IG G + D +Y+L + ++ + V+ +D
Sbjct: 458 QIKCLALFTDTICVLQDRFSRTLIGTGE-ERDGVYYLTDAATTTV-----HKVDVTTDH- 510
Query: 578 QLIPPGALWHFRLGHLSHDRILALNALYPSIDVSKHFVCDICHLAKLKRKMFPDSLHNAK 637
ALWH RLGH S + +L S CD+C AK R++FPDS + +
Sbjct: 511 ------ALWHQRLGHPSFSVLSSLPLFSGSSCSVSSRSCDVCFRAKQTREVFPDSSNKST 564
Query: 638 CNFDLLHMDIWGPISVSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKT 697
F L+H D+WGP V S YFLT++DD SR VW LL K EV+ + NF+A +
Sbjct: 565 DCFSLIHCDVWGPYRVPSSCGAVYFLTIVDDFSRSVWTYLLLAKSEVRSVLTNFLAYTEK 624
Query: 698 QFSHHVKVIRSDNGPEFM-LHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIARALL 756
QF VK+IRSDNG EFM L ++ GI+HQ SCV TPQQNGRVERKH+HILN++RALL
Sbjct: 625 QFGKSVKIIRSDNGTEFMCLSSYFKEQGIVHQTSCVGTPQQNGRVERKHRHILNVSRALL 684
Query: 757 FQSHLPKKMWGYAILHAVFLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLCFAST 816
FQ+ LP K WG A++ A +L++R S + SPYE+L G K D +LRVFGS C+A
Sbjct: 685 FQASLPIKFWGEAVMTAAYLINRTPSSIHNGLSPYELLHGCKP-DYDQLRVFGSACYAHR 743
Query: 817 SSSSRSKLDSRARKCVFLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILPY 868
+ + K R+R C+F+G+ G KG+ + DL+++E VSRDV F E + PY
Sbjct: 744 VTRDKDKFGERSRLCIFVGYPFGQKGWKVYDLSTNEFIVSRDVVFRENVFPY 795
>gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|7444418|pir||T00499 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1496
Score = 454 bits (1169), Expect = e-126
Identities = 298/822 (36%), Positives = 415/822 (50%), Gaps = 62/822 (7%)
Query: 63 YYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKFINGEIPIAVPGDANYEAW 122
Y I++ +NP A + S+ L NY W+ ++ L AK K FI+G IP D W
Sbjct: 22 YLINASDNPGALISSVVLKENNYAEWSEELQNFLRAKQKLGFIDGSIPKPA-ADPELSLW 80
Query: 123 DRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLVRIAELQNEIGNLKQ 182
N++I WI S+ +I +++ FV A W +LR RFS G VR L++EI Q
Sbjct: 81 IAINSMIVGWIRTSIDPTIRSTVGFVSEASQLWENLRRRFSVGNGVRKTLLKDEIAACTQ 140
Query: 183 NTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAIRFLLGLNE 242
+ V YY + LWEEL+ Y+ +C+C EA + RE D +FLLGL+
Sbjct: 141 DGQPVLAYYGRLIKLWEELQNYKSGRECKC------EAASDIEKEREDDRVHKFLLGLDS 194
Query: 243 TFSVVNSQILMSNPLPPIAKVVSLAMQHERQSETGENEESKSLVKMAEGKKTYGKGKAPG 302
FS + S I PLP + +V S ++ E+ +K +VK T G +
Sbjct: 195 RFSSIRSSITDIEPLPDLYQVYSRVVREEQNLNASR---TKDVVK------TEAIGFSVQ 245
Query: 303 SSSSGSGYKSTGKYCTHCKKPGHTVDVCYRLHGYPTT-SKSKPAFNNVSHINNIT*GYDS 361
SS++ + +CTHC + GH V C+ +HGYP + P N S + G S
Sbjct: 246 SSTTPRFRDKSTLFCTHCNRKGHEVTQCFLVHGYPDWWLEQNPQENQPSTRGRGSNGRGS 305
Query: 362 EDDQ---EESSKSQRGNGDLFTADQYKTIMAMIQQVTATSASQKHTESGKAFANLVTKSA 418
+ S+ + RG G A Q A + S + +L+
Sbjct: 306 SSGRGGNRSSAPTTRGRGRANNA-----------QAAAPTVSGDGNDQIAQLISLLQ--- 351
Query: 419 VCGANTAGNGKHSTLSYSSQYDSRDWIIDSGASDHICFNKLCFDTLNRIKPVSVRLPNGI 478
A S+ S D +ID+GAS H+ + + I P V P+G
Sbjct: 352 ------AQRPSSSSERLSGNTCLTDGVIDTGASHHMTGDCSILVDVFDITPSPVTKPDGK 405
Query: 479 SIQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVVFGEFTCFVQEKHTK 538
+ Q G + + L VL+VP+F LISV KL K+ +F + CF+Q++ +
Sbjct: 406 ASQATKCGTLLLHDSYKLHDVLFVPDFDCTLISVSKLLKQTSSIAIFTDTFCFLQDRFLR 465
Query: 539 KRIGLGSLKEDDLYHLVVTSP----SSTCFAIPNVVERISDSDQLIPPGALWHFRLGHLS 594
IG G +E Y V +P +S+ FAI G LWH RLGH S
Sbjct: 466 TLIGAGEEREGVYYFTGVLAPRVHKASSDFAIS---------------GDLWHRRLGHPS 510
Query: 595 HDRILALNALYPSID-VSKHFVCDICHLAKLKRKMFPDSLHNAKCNFDLLHMDIWGPISV 653
+L+L S K CD C +K R++FP S + F L+H D+WGP
Sbjct: 511 TSVLLSLPECNRSSQGFDKIDSCDTCFRSKQTREVFPISNNKTMECFSLIHGDVWGPYRT 570
Query: 654 SSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKTQFSHHVKVIRSDNGPE 713
S YFLT++DD+SR VW L+ K EV Q IKNF A+ + QF VK R+DNG E
Sbjct: 571 PSTTGAVYFLTLVDDYSRSVWTYLMSSKTEVSQLIKNFCAMSERQFGKQVKAFRTDNGTE 630
Query: 714 FM-LHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIARALLFQSHLPKKMWGYAILH 772
FM L ++ HGILHQ SCV+TPQQNGRVERKH+HILN+ARA LFQ +LP K WG +IL
Sbjct: 631 FMCLTPYFQTHGILHQTSCVDTPQQNGRVERKHRHILNVARACLFQGNLPVKFWGESILT 690
Query: 773 AVFLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRSKLDSRARKCV 832
A L++R S VL+ K+PYE+LF G++ LR FG LC+A ++ K SR+RKCV
Sbjct: 691 ATHLINRTPSAVLKGKTPYELLF-GERPSYDMLRSFGCLCYAHIRPRNKDKFTSRSRKCV 749
Query: 833 FLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILPYKSATST 874
F+G+ G K + + DL + +IF SRDV FHE I PY +AT +
Sbjct: 750 FIGYPHGKKAWRVYDLETGKIFASRDVRFHEDIYPYATATQS 791
>gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301695|pir||D84481 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1413
Score = 454 bits (1167), Expect = e-125
Identities = 279/832 (33%), Positives = 426/832 (50%), Gaps = 45/832 (5%)
Query: 41 QQAHGVQQPHVVLEPAQNPISPYYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKN 100
++ P +P +SPY + S +NP A + S+ L G NYN W+ M L AK
Sbjct: 5 EEVSSATHPRTNQQPDVTKVSPYTLASSDNPGAMISSVMLTGDNYNEWSTKMLNALQAKR 64
Query: 101 KFKFINGEIPIAVPGDANYEAWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRD 160
K FING I + +YE W N++I WI S+ + +++ F+ +A W +L+
Sbjct: 65 KTGFINGSISKPPLDNPDYENWQAVNSMIVGWIRASIEPKVKSTVTFICDAHQLWSELKQ 124
Query: 161 RFSQGVLVRIAELQNEIGNLKQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEA 220
RFS G V + +++ ++ +Q+ V DYY + LWEE + Y+PI C+C + C C A
Sbjct: 125 RFSVGNKVHVHQIKTQLAACRQDGQPVIDYYGRLCKLWEEFQIYKPITVCKCGL-CTCGA 183
Query: 221 IEHAKMFREQDNAIRFLLGLNET-FSVVNSQILMSNPLPPIAKVVSLAMQHERQSETGEN 279
RE++ +F+LGL+++ F +++ ++ +P P + ++ S ++ E++ + +
Sbjct: 184 TLEPSKEREEEKIHQFVLGLDDSRFGGLSATLIAMDPFPSLGEIYSRVVREEQRLASVQI 243
Query: 280 EESKSLVKMAEGKKTYGKGKAPGSSSSGSGYKSTGK--YCTHCKKPGHTVDVCYRLHGYP 337
E + + A G T + S KS + C+HC + GH C+++ G+P
Sbjct: 244 REQQ---QSAIGFLTRQSEVTADGRTDSSIIKSRDRSVLCSHCGRSGHEKKDCWQIVGFP 300
Query: 338 TTSKSKPAFNNVSHINNIT*GYDSEDDQEESSKSQRGNGDLFTADQYKTIMAMIQQVTAT 397
+ + G S S+ S RG G QVTA
Sbjct: 301 DWWTERTNGGGRGSSSRGRGGRSS-----GSNNSGRGRG----------------QVTAA 339
Query: 398 SASQKHTESGKAFANLVTKSAVCGANTAGNGKHSTLSYSSQYDSRDWIIDSGASDHICFN 457
A+ + F + NG L S + D I+D+GAS H+
Sbjct: 340 HATTSNLSPFPEFTPDQLRVITQMIQNKNNGTSDKL--SGKMKLGDVILDTGASHHMTGQ 397
Query: 458 KLCFDTLNRIKPVSVRLPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHKLAK 517
+ I SV +G G +++ + L++VLYVP +LISV KL K
Sbjct: 398 LSLLTNIVTIPSCSVGFADGRKTFAISMGTFKLSETVSLSNVLYVPALNCSLISVSKLVK 457
Query: 518 RNRCEVVFGEFTCFVQEKHTKKRIGLGSLKEDDLYHLVVTSPSSTCFAIPNVVERISDSD 577
+ +C +F + C +Q++ ++ IG G + D +Y+L T ++T V ++
Sbjct: 458 QIKCLALFTDTICVLQDRFSRTLIGTGE-ERDGVYYL--TDAATT------TVHKV---- 504
Query: 578 QLIPPGALWHFRLGHLSHDRILALNALYPSIDVSKHFVCDICHLAKLKRKMFPDSLHNAK 637
+ ALWH RLGH S + +L S CD+C AK R++FPDS + +
Sbjct: 505 DITTDHALWHQRLGHPSFSVLSSLPLFSGSSCSVSSRSCDVCFRAKQTREVFPDSSNKST 564
Query: 638 CNFDLLHMDIWGPISVSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKT 697
F L+H D+WGP V S YFLT++DD SR VW LL K EV+ + NF+A +
Sbjct: 565 DCFSLIHCDVWGPYRVPSSCGAVYFLTIVDDFSRSVWTYLLLAKSEVRSVLTNFLAYTEK 624
Query: 698 QFSHHVKVIRSDNGPEFM-LHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIARALL 756
QF VK+IRSDNG EFM L ++ GI+HQ SCV TPQQNGRVERKH+HILN++RALL
Sbjct: 625 QFGKSVKIIRSDNGTEFMCLSSYFKEQGIVHQTSCVGTPQQNGRVERKHRHILNVSRALL 684
Query: 757 FQSHLPKKMWGYAILHAVFLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLCFAST 816
FQ+ LP K WG A++ A +L++R S + SPYE+L G K D +LRVFGS C+A
Sbjct: 685 FQASLPIKFWGEAVMTAAYLINRTPSSIHNGLSPYELLHGCKP-DYDQLRVFGSACYAHR 743
Query: 817 SSSSRSKLDSRARKCVFLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILPY 868
+ + K R+R C+F+G+ G KG+ + DL+++E VSRDV F E + PY
Sbjct: 744 VTRDKDKFGERSRLCIFVGYPFGQKGWKVYDLSTNEFIVSRDVVFRENVFPY 795
>dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1098
Score = 451 bits (1161), Expect = e-125
Identities = 289/843 (34%), Positives = 417/843 (49%), Gaps = 76/843 (9%)
Query: 55 PAQNPISPYYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKFINGEIPIAVP 114
P P SPY I + +NP A + S+ L NY+ WA + L AK K F++G IP
Sbjct: 7 PVTTP-SPYGITASDNPGALISSVILKEDNYSEWAEELMNSLQAKQKLGFLDGTIPKPTT 65
Query: 115 GDANYEAWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLVRIAELQ 174
A +W N++I WI S+ +I +++ FV +A D W L+ RFS G VR L+
Sbjct: 66 EPA-LSSWKAANSMIIGWIRTSIDPTIRSTVAFVSDAKDLWDSLKQRFSNGNGVRKQLLK 124
Query: 175 NEIGNLKQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAI 234
+EI KQ+ SV YY + LWEEL+ Y+ C C EA RE D
Sbjct: 125 DEILACKQDGQSVLVYYGRLTKLWEELQNYKTSRTCTC------EAAPDIAKEREDDKVH 178
Query: 235 RFLLGLNETFSVVNSQILMSNPLPPIAKVVSLAMQHERQSETGENEESKSLVKMAEGKKT 294
+FLL L+E F + S I + +PLP + +V S + E+ ++ + +
Sbjct: 179 QFLLNLDERFRPIRSTITVQDPLPALNQVYSRVIHEEQNLNASRIKDDIKTEAVGFTVQA 238
Query: 295 YGKGKAPGSSS-SGSGYKSTGKY-CTHCKKPGHTVDVCYRLHGYPT-----TSKSKPAFN 347
P ++ S ++ CTH + GH + C+ +HGYP + A
Sbjct: 239 TPLPPTPQVAAVSAPRFRDRSSLTCTHYHRQGHDITECFLVHGYPDWWLEQNGSNGSAGR 298
Query: 348 NVSHINNIT*GYDSEDDQEESSKSQRGNGDL-------------FTADQYKTIMAMIQQV 394
S N G ++ + SS S RG G ADQ +++++Q
Sbjct: 299 GTSGRGNNGRGNNNRGGRSSSSGS-RGKGRANAASTHPPPTSTPSNADQINQLISLLQAQ 357
Query: 395 TATSASQKHTESGKAFANLVTKSAVCGANTAGNGKHSTLSYSSQYDSRDWIIDSGASDHI 454
++SQK SGK F V IID+GAS H+
Sbjct: 358 NPATSSQKL--SGKTFTTYV------------------------------IIDTGASHHM 385
Query: 455 CFNKLCFDTLNRIKPVSVRLPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHK 514
+ + I P V P+G + + G +++ +L VL+VP+F LISV K
Sbjct: 386 TGDITLLTNVEDIIPSPVTKPDGTASRATKRGTLALHNAYVLPDVLFVPDFNCTLISVAK 445
Query: 515 LAKRNRCEVVFGEFTCFVQEKHTKKRIGLGSLKEDDLYHLVVTSPSSTCFAIPNVVERIS 574
L K C +F + CF+Q++ T+ IG G +E Y V + R++
Sbjct: 446 LLKHTGCVAIFTDTLCFLQDRFTRTLIGAGEEREGVYYFTGV------------LAARVN 493
Query: 575 DSDQLIPPGALWHFRLGHLSHDRILALNALYPSI-DVSKHFVCDICHLAKLKRKMFPDSL 633
+ LWH RLGH S +L+ S D+ CDIC+ AK R++F SL
Sbjct: 494 KGFKESSSATLWHHRLGHPSTGVLLSFPEFASSSSDLEIIKSCDICYRAKQAREVFSPSL 553
Query: 634 HNAKCNFDLLHMDIWGPISVSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVA 693
+ F+L+H D+WGP + YFLT++DD SR VW L+ K EV + I+NF A
Sbjct: 554 NKTTVCFELIHCDVWGPYRTPASCGSVYFLTIVDDFSRSVWTFLMAEKSEVSRLIRNFCA 613
Query: 694 LVKTQFSHHVKVIRSDNGPEFM-LHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIA 752
+ + QF +K + SDNG EFM L F+ GI+HQ SCV+T QQNGRVERKH+HILN+A
Sbjct: 614 MSERQFCKSIKTVHSDNGTEFMCLKSFFQEQGIIHQTSCVDTRQQNGRVERKHRHILNVA 673
Query: 753 RALLFQSHLPKKMWGYAILHAVFLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLC 812
R LFQSHLP+K G +IL A+ L++R +K+L KSPYE+LFG + LR FG LC
Sbjct: 674 RTCLFQSHLPRKFRGESILTAIHLINRTPTKILHGKSPYEVLFGSRP-SYSALRTFGCLC 732
Query: 813 FASTSSSSRSKLDSRARKCVFLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILPYKSAT 872
+A + + K R+R+CVF+G+ G KG+ L DL ++ FVSRDV F E + PY +
Sbjct: 733 YAHYRARDKDKFSERSRRCVFVGYPYGKKGWRLYDLEKNKFFVSRDVVFQETVFPYGTIE 792
Query: 873 STA 875
S++
Sbjct: 793 SSS 795
>emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
gi|7268152|emb|CAB78488.1| retrovirus-related like
polyprotein [Arabidopsis thaliana]
gi|7488175|pir||G71406 probable retrovirus-related
polyprotein - Arabidopsis thaliana
Length = 1489
Score = 449 bits (1156), Expect = e-124
Identities = 292/889 (32%), Positives = 432/889 (47%), Gaps = 123/889 (13%)
Query: 55 PAQNPISPYYIHSGENPSATVVSLPLN-GRNYNAWA*SMKRVLVAKNKFKFINGEIPIAV 113
P +PYY+HS ++ +VS L ++++W S+ L +NK FING I
Sbjct: 26 PVDQYENPYYLHSADHAGLILVSDRLTTASDFHSWRRSILMALNVRNKLGFINGTITKPP 85
Query: 114 PGDANYEAWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLVRIAEL 173
++ AW RCN+++ +W++NSV I S++++ W +L RF Q RI ++
Sbjct: 86 EDHRDFGAWSRCNDIVSTWLMNSVDKKIGQSLLYIATVQGIWNNLLSRFKQDDAPRIFDI 145
Query: 174 QNEIGNLKQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNA 233
+ ++ ++Q ++ ++ YYT + TLWEE Y +P C C C C+A + +++
Sbjct: 146 EQKLSKIEQGSMDISTYYTALLTLWEEHRNYVELPVCTCG-RCECDAAVKWEHLQQRSRV 204
Query: 234 IRFLLGLNETFSVVNSQILMSNPLPPIAKVVSLAMQHERQSETGENEESKSLVKMAEGKK 293
+FL LNE F ILM P+P I + ++ Q ERQ S+ +
Sbjct: 205 TKFLKELNEGFDQTRRHILMLKPIPTIKEAFNMVTQDERQRNVKPLTRVDSVA--FQNTS 262
Query: 294 TYGKGKAPGSSSSGSGYKSTGKYCTHCKKPGHTVDVCYRLHGYPTTSKS----------- 342
+ + ++ + + CTHC K GHT+ CY++HGYP K+
Sbjct: 263 MINEDENAYVAAYNTVRPNQKPICTHCGKVGHTIQKCYKVHGYPPGMKTGNTGYTYKPNP 322
Query: 343 --------------------KPAFNNVSHINNI------T*GYDSEDDQEESSKSQRGNG 376
+P N++ N + T Y SE + + G+
Sbjct: 323 QLHVQPRMPMMPQPRMQFPAQPYTNSMQKANVVAQVYAETGAYPSEGYSQAPMMNPYGSY 382
Query: 377 DL--------------FTADQYKTIMAMIQ---QVTATSASQKH-----TESGKAFANLV 414
+ FT Q + +++ Q QV +AS + T S F L
Sbjct: 383 PMPHITHGGNNLSLQDFTPQQIEQMISQFQAQVQVPEPAASSSNPSPLATVSEHGFMALT 442
Query: 415 TKSAVCGANTAGNGKHS---------TLSYSSQYDSRD-WIIDSGASDHICFNKLCFDTL 464
+ S + + K+ TLS ++ D WIIDSGAS H+C + F L
Sbjct: 443 STSGTIIPFPSTSLKYENNDLKFQNHTLSALQKFLPSDAWIIDSGASSHVCSDLAMFREL 502
Query: 465 NRIKPVSVRLPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVV 524
+ +G V IT++++L +VL+VP+F +NL+SV L K C
Sbjct: 503 KSV-----------------SGTVHITQKLILHNVLHVPDFKFNLMSVSSLVKTISCSAH 545
Query: 525 FGEFTCFVQEKHTKKRIGLGSLKEDDLYHLVV--TSPSSTCFAIPNVVERISDSDQLIPP 582
F C +QE IG G L + LY L TSPS++ A + ++
Sbjct: 546 FYVDCCLIQELSQGLMIGRGRLYHN-LYILETENTSPSTSTPAA------CLFTGSVLND 598
Query: 583 GALWHFRLGHLSHDRILALNALYPSIDVSKHFVCDICHLAKLKRKMFPDSLHNAKCNFDL 642
G LWH RLGH PS V L KLKR + + A FDL
Sbjct: 599 GHLWHQRLGH-------------PSSVV----------LQKLKRLAYISHNNLASNPFDL 635
Query: 643 LHMDIWGPISVSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKTQFSHH 702
+H+DIWGP S+ S+ RYFLTV+DD +R WV +L+ K +V F+ LV TQF+
Sbjct: 636 VHLDIWGPFSIESIEGFRYFLTVVDDCTRTTWVYMLRNKKDVSSVFPEFIKLVSTQFNAK 695
Query: 703 VKVIRSDNGPEFMLHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIARALLFQSHLP 762
+K IRSDN PE + HG+LH SC TPQQN VERKHQHILN+ARALLFQS++P
Sbjct: 696 IKAIRSDNAPELGFTEIVKEHGMLHHFSCAYTPQQNSVVERKHQHILNVARALLFQSNIP 755
Query: 763 KKMWGYAILHAVFLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRS 822
+ W + AVFL++R+ S +L NKSPYE++ K+ D L+ FG LCF ST++ R+
Sbjct: 756 MQYWSDCVTTAVFLINRLPSPLLNNKSPYELIL-NKQPDYSLLKNFGCLCFVSTNAHERT 814
Query: 823 KLDSRARKCVFLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILPYKSA 871
K RAR CVFLG+ G KG+ +LDL SH + VSR+V F E + P+K++
Sbjct: 815 KFTPRARACVFLGYPSGYKGYKVLDLESHSVTVSRNVVFKEHVFPFKTS 863
>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
(gb|U12626). [Arabidopsis thaliana]
gi|25301690|pir||G96722 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana
Length = 1315
Score = 441 bits (1135), Expect = e-122
Identities = 265/780 (33%), Positives = 401/780 (50%), Gaps = 93/780 (11%)
Query: 92 MKRVLVAKNKFKFINGEIPIAVPGDANYEAWDRCNNLIHSWILNSVTSSIANSIVFVENA 151
M + AKNK F++G IP D + W RCN+++ SW+LNSV+ I SI++ A
Sbjct: 1 MTTSIEAKNKLGFVDGSIPKPDDDDPYCKIWRRCNSMVKSWLLNSVSKEIYTSILYFPTA 60
Query: 152 CDAWRDLRDRFSQGVLVRIAELQNEIGNLKQNTLSVNDYYTEIKTLWEELEQYRPIPQCR 211
W+DL RF + L R+ +L+ +I +L+Q L ++ Y+T +TLWEEL + +P+
Sbjct: 61 AAIWKDLYTRFHKSSLPRLYKLRQQIHSLRQGNLDLSSYHTRTQTLWEELTSLQAVPR-- 118
Query: 212 CVVPCRCEAIEHAKMFREQDNAIRFLLGLNETFSVVNSQILMSNPLPPIAKVVSLAMQHE 271
+E + RE + I FL+GLN+ + V SQILM LP +++V ++
Sbjct: 119 --------TVEDLLIERETNRVIDFLMGLNDCYDTVRSQILMKKTLPSLSEVFNMI---- 166
Query: 272 RQSETGENEESKSLVKMAEGKKTYGKGKAPGSSSSGSGYKSTGKYCTHCKKPGHTVDVCY 331
Q ET + + M + + + + K C++C +PGH D CY
Sbjct: 167 DQDETQRSARISTTPGMTSSVFPVSNQSSQSALNGDTYQKKERPVCSYCSRPGHVEDTCY 226
Query: 332 RLHGYPTTSKSKPAFNNVSHINNIT*GYDSEDDQEESSKSQRGNGDLFTADQYKTIMAMI 391
+ HGYPT+ KSK F S N G +E + + GDL T+ I
Sbjct: 227 KKHGYPTSFKSKQKFVKPSISANAAIG-----SEEVVNNTSVSTGDLTTSQ--------I 273
Query: 392 QQVTATSASQKHTESGKAFANLVTKSAVCGANTAGNGKHSTLSYSSQYDSRDWIIDSGAS 451
QQ+ + +S+ S T + ++S SS D +S
Sbjct: 274 QQLVSFLSSKLQPPS-----------------TPVQPEVHSISVSS---------DPSSS 307
Query: 452 DHICFNKLCFDTLNRIKPVSVRLPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYNLIS 511
+C P+S G V + + ++L VL++P+F +NL+S
Sbjct: 308 STVC-------------PIS--------------GSVHLGRHLILNDVLFIPQFKFNLLS 340
Query: 512 VHKLAKRNRCEVVFGEFTCFVQEKHTKKRIGLGSLKEDDLYHLVVTSPSSTCFAIPNVVE 571
V L K C + F E +C +Q+ + +G+G + +LY + + S S V
Sbjct: 341 VSSLTKSMGCRIWFDETSCVLQDATRELMVGMGK-QVANLYIVDLDSLSHPGTDSSITVA 399
Query: 572 RISDSDQLIPPGALWHFRLGHLSHDRILALNAL--YPSIDVSKHFVCDICHLAKLKRKMF 629
++ D LWH RLGH S ++ +++L +P + F C +CH++K K F
Sbjct: 400 SVTSHD-------LWHKRLGHPSVQKLQPMSSLLSFPKQKNNTDFHCRVCHISKQKHLPF 452
Query: 630 PDSLHNAKCNFDLLHMDIWGPISVSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IK 689
+ + FDL+H+D WGP SV + +RYFLT++DD+SR WV LL+ K +V I
Sbjct: 453 VSHNNKSSRPFDLIHIDTWGPFSVQTHDGYRYFLTIVDDYSRATWVYLLRNKSDVLTVIP 512
Query: 690 NFVALVKTQFSHHVKVIRSDNGPEFMLHDFYSAHGILHQRSCVNTPQQNGRVERKHQHIL 749
FV +V+ QF +K +RSDN PE FY + GI+ SC TPQQN VERKHQHIL
Sbjct: 513 TFVTMVENQFETTIKGVRSDNAPELNFTQFYHSKGIVPYHSCPETPQQNSVVERKHQHIL 572
Query: 750 NIARALLFQSHLPKKMWGYAILHAVFLMSRIQSKVLENKSPYEILFGGKKIDLQE-LRVF 808
N+AR+L FQSH+P WG IL AV+L++R+ + +LE+K P+E+L K + + ++VF
Sbjct: 573 NVARSLFFQSHIPISYWGDCILTAVYLINRLPAPILEDKCPFEVL--TKTVPTYDHIKVF 630
Query: 809 GSLCFASTSSSSRSKLDSRARKCVFLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILPY 868
G LC+ASTS R K RA+ C F+G+ G KG+ LLDL +H I VSR V FHE++ P+
Sbjct: 631 GCLCYASTSPKDRHKFSPRAKACAFIGYPSGFKGYKLLDLETHSIIVSRHVVFHEELFPF 690
>emb|CAB10526.1| retrotransposon like protein [Arabidopsis thaliana]
gi|7268497|emb|CAB78748.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444421|pir||A71444 probable
LTR retrotransposon - Arabidopsis thaliana
Length = 1433
Score = 438 bits (1127), Expect = e-121
Identities = 284/856 (33%), Positives = 423/856 (49%), Gaps = 118/856 (13%)
Query: 53 LEPAQNPISPYYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKFINGEIPIA 112
+E N SPY++HS ++P +VS L+G NYN W+ +M+ L AKNK F++G +P
Sbjct: 58 IESYDNAHSPYFLHSSDHPGLNIVSHILDGTNYNNWSIAMRMSLDAKNKLSFVDGSLPRP 117
Query: 113 VPGDANYEAWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLVRIAE 172
D ++ W RCN+++ +W+LN VT + W DL RF L R +
Sbjct: 118 DVSDRMFKIWSRCNSMVKTWLLNVVT--------------EMWNDLFSRFRVSNLPRKYQ 163
Query: 173 LQNEIGNLKQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDN 232
L+ I LKQ L ++ YYT+ KTLWE+L R + V C CE ++ E
Sbjct: 164 LEQSIHTLKQGNLDLSTYYTKKKTLWEQLANTRVLT----VRKCNCEHVKELLEEAETSR 219
Query: 233 AIRFLLGLNETFSVVNSQILMSNPLPPIAKVVSLAMQHERQSETGENEESK--------- 283
I+FL+GLN+ F+ + QIL P P + ++ ++ Q E Q G S
Sbjct: 220 IIQFLMGLNDNFAHIRGQILNMKPRPGLTEIYNMLDQDESQRLVGNPTLSNPTAAFQVQA 279
Query: 284 -----SLVKMAEGKKTYGKGKAPGSSSSGSGYKSTGKYCTHCKKPGHTVDVCYRLHGYPT 338
S V MA+G +Y K K C++C K GH VD CY+ HGYP
Sbjct: 280 SPIIDSQVNMAQG--SYKKPK-----------------CSYCNKLGHLVDKCYKKHGYPP 320
Query: 339 TSKSKPAFNNVSHINNIT*GYDSEDDQEESSKSQRGNGDLFTADQYKTIMAMIQ-QVTAT 397
SK + I + E+ + + + F+ DQ +T+++ + ++
Sbjct: 321 GSK----WTKGQTIGSTNLASTQLQPVNETPNEKTDSYEEFSTDQIQTMISYLSTKLHIA 376
Query: 398 SASQKHTESGKAFA---NLVTKSAVCG------ANTAGNGKHSTLSYSSQYDSRDWIIDS 448
SAS T S + + ++ S + G +N + S++S R W+IDS
Sbjct: 377 SASPMPTTSSASISASPSVPMISQISGTFLSLFSNAYYDMLISSVSQEPAVSPRGWVIDS 436
Query: 449 GASDHICFNKLCFDTLNRIKPVSVRLPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYN 508
GA+ H+ N+ + ++ VRLPN +++ G + ++ I L +VLY+PEF +N
Sbjct: 437 GATHHVTHNRDLYLNFRSLENTFVRLPNDCTVKIAGIGFIQLSDAISLHNVLYIPEFKFN 496
Query: 509 LISVHKLAKRNRCEVVFGEFTCFVQEKHTKKRIGLGSLKEDDLYHLVVTSPSSTCF--AI 566
LIS E + IG GS + +LY L + T
Sbjct: 497 LIS----------------------ELTKELMIGRGS-QVGNLYVLDFNENNHTVSLKGT 533
Query: 567 PNVVERISDSDQLIPPGALWHFRLGHLSHDRILALNALYPSIDVSKH--------FVCDI 618
++ S ++ WH RLGH ++ +I L+ + ++ V K VC +
Sbjct: 534 TSMCPEFSVCSSVVVDSVTWHKRLGHPAYSKIDLLSDVL-NLKVKKINKEHSPVCHVCHV 592
Query: 619 CHLAKLKRKMFPDSLHNAKCNFDLLHMDIWGPISVSSVHSHRYFLTVLDDHSRFVWVILL 678
CHL+K K F + FDL+H+D WGP SV + + W+ LL
Sbjct: 593 CHLSKQKHLSFQSRQNMCSAAFDLVHIDTWGPFSVPTNDA--------------TWIYLL 638
Query: 679 KRKGEVQQ*IKNFVALVKTQFSHHVKVIRSDNGPEFMLHDFYSAHGILHQRSCVNTPQQN 738
K K +V F+ +V TQ+ +K +RSDN E D ++AHGI+ SC TP+QN
Sbjct: 639 KNKSDVLHVFPAFINMVHTQYQTKLKSVRSDNAHELKFTDLFAAHGIVAYHSCPETPEQN 698
Query: 739 GRVERKHQHILNIARALLFQSHLPKKMWGYAILHAVFLMSRIQSKVLENKSPYEILFGGK 798
VERKHQHILN+ARALLFQS++P + WG +L AVFL++R+ + VL NKSPYE L K
Sbjct: 699 SVVERKHQHILNVARALLFQSNIPLEFWGDCVLTAVFLINRLPTPVLNNKSPYEKL---K 755
Query: 799 KID--LQELRVFGSLCFASTSSSSRSKLDSRARKCVFLGFKQGVKGFVLLDLNSHEIFVS 856
I + L+ FG LC++STS R K + RAR CVFLG+ G KG+ LLD+ +H + +S
Sbjct: 756 NIPPAYESLKTFGCLCYSSTSPKQRHKFEPRARACVFLGYPLGYKGYKLLDIETHAVSIS 815
Query: 857 RDVTFHEQILPYKSAT 872
R V FHE I P+ S+T
Sbjct: 816 RHVIFHEDIFPFISST 831
>gb|AAF79879.1| T7N9.5 [Arabidopsis thaliana]
Length = 1436
Score = 432 bits (1111), Expect = e-119
Identities = 278/887 (31%), Positives = 446/887 (49%), Gaps = 92/887 (10%)
Query: 19 VLPSQSSATMVRQASNNNNNGVQQAHGVQQPHVVLEPAQNPISPYYIHSGENPSATVVSL 78
+L Q T+ +Q + ++ V +G + V + +NP+ +HS ++P ++V+
Sbjct: 23 ILEDQKLTTLKQQLAQRSD--VYGGNGFK----VSDSGENPL---LLHSSDHPGLSIVAH 73
Query: 79 PLNGRNYNAWA*SMKRVLVAKNKFKFINGEIPIAVPGDANYEAWDRCNNLIHSWILNSVT 138
L+G NYN+W+ +M+ L AKNK F++G + D+ + W RCN+++++
Sbjct: 74 ILDGSNYNSWSIAMRISLDAKNKLGFVDGSLLRPSVDDSTFRIWSRCNSMVNN------- 126
Query: 139 SSIANSIVFVENACDAWRDLRDRFSQGVLVRIAELQNEIGNLKQNTLSVNDYYTEIKTLW 198
L R +L+ + L+Q L ++ Y+T+ KTLW
Sbjct: 127 ----------------------------LPRRYQLEQAVMTLQQGKLDLSTYFTKKKTLW 158
Query: 199 EELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAIRFLLGLNETFSVVNSQILMSNPLP 258
E+L + R V C C+ ++ E I+FL+GL++ F+ + SQI P P
Sbjct: 159 EQLANTKS----RSVKKCDCDQVKELLEEAETSRVIQFLMGLSDDFNTIRSQIFNMKPRP 214
Query: 259 PIAKVVSLAMQHERQSETGENEESKSLVKMAEGKKTYGKGKAPGSSSSGSGYKSTGKYCT 318
+ ++ ++ Q E Q G +S A +T G + G K CT
Sbjct: 215 GLNEIYNMLDQDESQRLVGFAAKSVPSPSPA-AFQTQGVLNDQNTILLAQGNFKKPK-CT 272
Query: 319 HCKKPGHTVDVCYRLHGYPTTSKSKPAFNNVSHINNIT*GYDSEDDQEESSKSQRGNGDL 378
HC + GHTVD CY++HGYP V N + D + Q + S G+ +
Sbjct: 273 HCNRIGHTVDKCYKVHGYPPGHPRAKENTYVGSTNLAS--TDQIETQAPPTMSATGH-ET 329
Query: 379 FTADQYKTIMAMI----QQVTATSASQKHTESGK----AFANLVTKSAVCGANTAGNGKH 430
+ D + +++ + Q + TS K S + + + K+ +N +
Sbjct: 330 MSNDHIQQLISYLSTKLQSPSITSCFDKAIASSSNPVPSISQITDKAIASSSNPVPSISQ 389
Query: 431 STLSYSSQYDS------------------RDWIIDSGASDHICFNKLCFDTLNRIKPVSV 472
T ++ S YDS R W+IDSGAS H+ + + T + V
Sbjct: 390 ITGTFFSLYDSTYYEMLTSSIPIETELSLRAWVIDSGASHHVTHERNLYHTYKALDRTFV 449
Query: 473 RLPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVVFGEFTCFV 532
RLPNG +++ G + +T + L +VL++PEF +NL+SV L K + +V F C +
Sbjct: 450 RLPNGHTVKIEGTGFIQLTDALSLHNVLFIPEFKFNLLSVSVLTKTLQSKVSFTSDECMI 509
Query: 533 QEKHTKKRIGLGSLKEDDLYHLVVTSPSSTCFAIP--NVVERISDSDQLIPPGALWHFRL 590
Q + +G GS + +LY L + + P +V + + ++ WH RL
Sbjct: 510 QALTKELMLGKGS-QVGNLYILNLDKSLVDVSSFPGKSVCSSVKNESEM------WHKRL 562
Query: 591 GHLSHDRILALN--ALYPSIDVSKHFV-CDICHLAKLKRKMFPDSLHNAKCNFDLLHMDI 647
GH S +I L+ + P ++K C +CHL+K K F H + F+L+H+D
Sbjct: 563 GHPSFAKIDTLSDVLMLPKQKINKDSSHCHVCHLSKQKHLPFKSVNHIREKAFELVHIDT 622
Query: 648 WGPISVSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKTQFSHHVKVIR 707
WGP SV +V S+RYFLT++DD SR W+ LLK+K +V +F+ +V+TQ+ V +R
Sbjct: 623 WGPFSVPTVDSYRYFLTIVDDFSRATWIYLLKQKSDVLTVFPSFLKMVETQYHTKVCSVR 682
Query: 708 SDNGPEFMLHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIARALLFQSHLPKKMWG 767
SDN E ++ ++ GI C TP+QN VERKHQH+LN+ARAL+FQS +P + WG
Sbjct: 683 SDNAHELKFNELFAKEGIKADHPCPETPEQNFVVERKHQHLLNVARALMFQSGIPLEYWG 742
Query: 768 YAILHAVFLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRSKLDSR 827
+L AVFL++R+ S V+ N++PYE L GK D L+ FG LC+ STS SR+K D R
Sbjct: 743 DCVLTAVFLINRLLSPVINNETPYERLTKGKP-DYSSLKAFGCLCYCSTSPKSRTKFDPR 801
Query: 828 ARKCVFLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILPYKSATST 874
A+ C+FLG+ G KG+ LLD+ ++ + +SR V F+E I P+ S+ T
Sbjct: 802 AKACIFLGYPMGYKGYKLLDIETYSVSISRHVIFYEDIFPFASSNIT 848
>gb|AAU89728.1| putative retroelement pol polyprotein-like [Solanum tuberosum]
Length = 1476
Score = 424 bits (1091), Expect = e-117
Identities = 278/863 (32%), Positives = 430/863 (49%), Gaps = 120/863 (13%)
Query: 73 ATVVSLPLNG-RNYNAWA*SMKRVLVAKNKFKFINGEIPIA-VPGDANYEAWDRCNNLIH 130
A + + L G NY+ W+ +M+ L+ KNK FI+G + + + WDRCN ++
Sbjct: 14 AVQIGIQLTGMENYSLWSRAMQLTLLTKNKMGFIDGSLRRDDFKEELEKKQWDRCNAMVL 73
Query: 131 SWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLVRIAELQNEIGNLKQNTLSVNDY 190
SW++N+V++ + + I+F NA W DL++RF + + RI L I Q V+ Y
Sbjct: 74 SWLMNNVSTDLVSGILFRSNATLVWNDLKERFDKVNMSRIFHLHKAIVTHVQGVSPVSVY 133
Query: 191 YTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAIRFLLGLNETFSVVNSQ 250
Y+++K LW+E + P P C C +++++ Q ++FL+GLN+ + SQ
Sbjct: 134 YSKLKDLWDEYDSILPPPSCDCE-----KSVDYTDSMLRQ-KLLQFLMGLNDNYGQARSQ 187
Query: 251 ILMSNPLPPIAKVVSLAMQHERQ---SETGENEESKSLVKMAEGKKTYGKGKAPGS---S 304
ILM NP P + + ++ +Q E Q S +G+ + +L G +G + GS S
Sbjct: 188 ILMMNPSPSVNQCYAMIVQDESQRSLSGSGQTIDPTALFTHRPGGSGFGSQGSQGSGNGS 247
Query: 305 SSGSGYK----------------------STGKYCTHCKKPGHTVDVCYRLHGYPTTSKS 342
S+G+ ++ + K+CTHC GHT D CY+L GYP K
Sbjct: 248 SNGNSHRFHKGGNIYCDFCNMKGHIRANCNKLKHCTHCNMQGHTKDTCYQLIGYPADYKG 307
Query: 343 K--------PAFNNVSHIN-----NIT*GYDSE-------------DDQEESSKSQRGN- 375
K P+ + H N N Y + + SS S GN
Sbjct: 308 KKKANIVTAPSLPQMQHNNFNNNLNYPMQYTGDGIGHFVSPMQFTGNTNGHSSGSIAGNF 367
Query: 376 --GDL--FTADQYKTIMAMIQQVTATSASQKHTESGKAFANLVTKSAVCGANTAGNGKHS 431
G + FT QY I+ M+ + + ES A + S+ C +NT HS
Sbjct: 368 GPGSVPQFTPSQYNNILQMLNKPMLS-------ESSANVAGIFAGSSHCNSNT-----HS 415
Query: 432 TLSYSSQYDSRDWIIDSGASDHICFNKLCFDT-LNRIKPVSVRLPNGISIQTCYAGVVSI 490
+ WI+DSGA+DH+ N + L+ P V+LP G S ++G +
Sbjct: 416 SA----------WIVDSGATDHMVSNTTLLNHGLSVSHPGKVQLPTGDSAVVTHSGSSQL 465
Query: 491 TKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVVFGEFTCFVQEKHTKKRIGLGSLKEDD 550
T ++ +VL VP F +NL+SV KL K C V+F +Q+ T K +G ++
Sbjct: 466 TGGDVVKNVLCVPTFQFNLLSVSKLTKELNCCVIFFPDFFIIQDLFTGKVKEIG----EE 521
Query: 551 LYHLVVTSPSSTCFAIPNVVERISDSDQLIPPGALWHFRLGHLSHDRILALNALYPSIDV 610
+ L +T P + I ++ +WH RLGH+ +L ++ S
Sbjct: 522 INGLYITRPHQHHDTSKKTLAAIKGCEE----AEMWHKRLGHIPMS-VLRKIKMFDSPQK 576
Query: 611 SKHFVCDICHLAKLKRKMFPDSLHNAKCNFDLLHMDIWGPISVSSVHSHRYFLTVLDDHS 670
CD+C LA+ R FP S ++ FDL+H+D+WGP ++ + RYFLTV+DDHS
Sbjct: 577 LVLPSCDVCPLARQVRLPFPISQSRSENCFDLIHLDVWGPYKAATHNKMRYFLTVVDDHS 636
Query: 671 RFVWVILLKRKGEVQQ*IKNFVALVKTQFSHHVKVIRSDNGPEF---MLHDFYSAHGILH 727
R+ W+ L+ K +V ++NF+ ++ TQF +K+ RSDNG EF + +HGI+H
Sbjct: 637 RWTWIFLMHLKSDVSTVLQNFILMIDTQFGQKIKIFRSDNGTEFFNAQCDGLFKSHGIVH 696
Query: 728 QRSCVNTPQQNGRVERKHQHILNIARALLFQSHLPKKMWGYAILHAVFLMSRIQSKVLEN 787
Q SC +TPQQNG VER+H+HIL ARAL FQ HLP + WG +L AV +++RI S VL N
Sbjct: 697 QSSCPHTPQQNGVVERRHKHILETARALRFQGHLPIRFWGECVLSAVHIINRIPSSVLHN 756
Query: 788 KSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRSKLDSRARKCVFLGFKQGVKGFVLLD 847
KSP+E+++ + DL +RV G LC A+ ++ ++ KG+ L D
Sbjct: 757 KSPFELMY-KRSPDLSYMRVIGCLCHATNLVNTSTQ-----------------KGYKLYD 798
Query: 848 LNSHEIFVSRDVTFHEQILPYKS 870
L FVSRD+ F+E + P++S
Sbjct: 799 LEHQHFFVSRDMVFNEAVFPFQS 821
>gb|AAG50751.1| polyprotein, putative [Arabidopsis thaliana]
gi|25301686|pir||F96610 probable polyprotein T8L23.26
[imported] - Arabidopsis thaliana
Length = 1468
Score = 421 bits (1082), Expect = e-116
Identities = 276/850 (32%), Positives = 414/850 (48%), Gaps = 88/850 (10%)
Query: 49 PHVVLEPAQNPISPYYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKFINGE 108
P V+E + ISPY + + +N A + L NY WA K L ++ KF F++G
Sbjct: 8 PPSVIE-VRRTISPYDLTAADNSGAVISHPILKTNNYEEWACGFKTALRSRKKFGFLDGT 66
Query: 109 IPIAVPGDANYEAWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLV 168
IP + G + E W N L+ SW+ ++ S + +I + A D W +R RFS
Sbjct: 67 IPQPLDGSPDLEDWLTINALLVSWMKMTIDSELLTNISHRDVARDLWEQIRKRFSVSNGP 126
Query: 169 RIAELQNEIGNLKQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFR 228
+ +++ ++ KQ ++V YY ++ +W+ + YRP+ C+C C C + +R
Sbjct: 127 KNQKMKADLATCKQEGMTVEGYYGKLNKIWDNINSYRPLRICKCG-RCICNLGTDQEKYR 185
Query: 229 EQDNAIRFLLGLNET-FSVVNSQILMSNPLPPIAKVVSLAMQHERQ-SETGENEESKSLV 286
E D ++L GLNET F + S + PLP + +V ++ Q E + NEE +
Sbjct: 186 EDDMVHQYLYGLNETKFHTIRSSLTSRVPLPGLEEVYNIVRQEEDMVNNRSSNEERTDVT 245
Query: 287 KMAEGKKTYGKGKAPGSSSSGSGYKSTGKYCTHCKKPGHTVDVCYRLHGYP--------- 337
A + + + + S K CTHC + GH+ + C+ L GYP
Sbjct: 246 AFAVQMRP--RSEVISEKFANSEKLQNKKLCTHCNRGGHSPENCFVLIGYPEWWGDRPRG 303
Query: 338 ------TTSKSK----PAFNN----VSHINNIT*GY--DSEDDQEESSKSQRGNGDLFTA 381
+TS+ + P FN +++N + G SE + S R T
Sbjct: 304 KSNSNGSTSRGRGRFGPGFNGGQPRPTYVNVVMTGPFPSSEHVNRVITDSDRDAVSGLTD 363
Query: 382 DQYKTIMAMIQQVTATSASQKHTESGKAFANLVTKSAVCGANTAGNGKHSTLSYSSQYDS 441
+Q++ ++ ++ + + S H T+S C T+
Sbjct: 364 EQWRGVVKLLNAGRSDNKSNAHE----------TQSGTCSLFTS---------------- 397
Query: 442 RDWIIDSGASDHICFNKLCFDTLNRIKPVSVRLPNGISIQTCYAGVVSITKQILLTHVLY 501
WI+D+GAS H+ N + + PV + L +G G V + ++L V Y
Sbjct: 398 --WILDTGASHHMTGNLELLSDMRSMSPVLIILADGNKRVAVSEGTVRLGSHLILKSVFY 455
Query: 502 VPEFTYNLISVHKLAKRNRCEVVFGEFTCFVQEKHTKKRIGLGSLKEDDLYHLVVTSPSS 561
V E +LISV ++ N C ++ T+ +G + S
Sbjct: 456 VKELESDLISVGQMMDENHCV-----------DRTTRMVTRIGKREN-----------GS 493
Query: 562 TCFAIPNVVERISDSDQLIPPGALWHFRLGHLSHDRILAL--NALYPSIDVSKHFVCDIC 619
CF + S + P LWH RLGH S D+I+ L L S VCD C
Sbjct: 494 FCFRGMENAAAVHTSVKA--PFDLWHRRLGHAS-DKIVNLLPRELLSSGKEILENVCDTC 550
Query: 620 HLAKLKRKMFPDSLHNAKCNFDLLHMDIWGPISVSSVHSHRYFLTVLDDHSRFVWVILLK 679
AK R FP S + + +F L+H D+WGP S RYFLT++DD+SR VWV L+
Sbjct: 551 MRAKQTRDTFPLSDNRSMDSFQLIHCDVWGPYRAPSYSGARYFLTIVDDYSRGVWVYLMT 610
Query: 680 RKGEVQQ*IKNFVALVKTQFSHHVKVIRSDNGPEFM-LHDFYSAHGILHQRSCVNTPQQN 738
K E Q+ +K+F+ALV+ QF +K++RSDNG EF+ + +++ GI H+ SCV TP QN
Sbjct: 611 DKSETQKHLKDFIALVERQFDTEIKIVRSDNGTEFLCMREYFLHKGIAHETSCVGTPHQN 670
Query: 739 GRVERKHQHILNIARALLFQSHLPKKMWGYAILHAVFLMSRIQSKVLENKSPYEILFGGK 798
GRVERKH+HILNIARAL FQS+LP + WG IL A +L++R S +L+ KSPYE+L+
Sbjct: 671 GRVERKHRHILNIARALRFQSYLPIQFWGECILSAAYLINRTPSMLLQGKSPYEMLYKTA 730
Query: 799 KIDLQELRVFGSLCFASTSSSSRSKLDSRARKCVFLGFKQGVKGFVLLDLNSHEIFVSRD 858
LRVFGSLC+A + K +R+R+CVF+G+ G KG+ L DL + FVSRD
Sbjct: 731 P-KYSHLRVFGSLCYAHNQNHKGDKFAARSRRCVFVGYPHGQKGWRLFDLEEQKFFVSRD 789
Query: 859 VTFHEQILPY 868
V F E PY
Sbjct: 790 VIFQETEFPY 799
>gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thaliana]
gi|25403501|pir||H86486 protein Ty1/copia-element
polyprotein [imported] - Arabidopsis thaliana
Length = 1152
Score = 414 bits (1064), Expect = e-114
Identities = 264/823 (32%), Positives = 413/823 (50%), Gaps = 61/823 (7%)
Query: 61 SPYYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKFINGEIPIAVPGDANYE 120
SPYY+H ++P + + LNG NY WA + L AK K FI+G + +Y
Sbjct: 23 SPYYLHPSDHPHHVLTPMLLNGENYERWAKLTRNNLQAKQKLGFIDGTLTKPSSDSPDYP 82
Query: 121 AWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLVRIAELQNEIGNL 180
W + N+++ W+ S+ + SI V+NA W LR R+S G R+ +L+ +I
Sbjct: 83 RWLQTNSMLVGWLYASLDPQVQKSISVVDNARVMWESLRTRYSVGNASRVHQLKYDIVAC 142
Query: 181 KQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAIRFLLGL 240
+Q+ + +Y+ ++K +W++L+ Y P+ C C P + ++ R+ + +FL+GL
Sbjct: 143 RQDGQTAANYFGKLKVMWDDLDDYEPLLTCCCNRPSCTHRVRQSQR-RDHERIHQFLMGL 201
Query: 241 NET-FSVVNSQIL---MSNPLPPIAKVVSLAMQHERQ-SETGENEESKSLVKMAEGKKTY 295
+ F + IL + + + S + ER + T EE V A
Sbjct: 202 DAAKFGTSRTNILGRLSRDDNISLDSIYSEIIAEERHLTITRSKEERVDAVGFAVQTGVN 261
Query: 296 GKGKAPGSSSSGSGYKSTGKYCTHCKKPGHTVDVCYRLHGYPTTSKSKPAFNNVSHINNI 355
++ G CTHC + H+ D C++LHG P K + + S
Sbjct: 262 AIASVTRVNNMGP--------CTHCGRSNHSADTCFKLHGVPEWYTEK--YGDTS----- 306
Query: 356 T*GYDSEDDQEESSKSQ---RGNGDLFTADQYKTIMAMIQQVTATSASQKHTESGKAFAN 412
S + SS + RG+G+ + A+ +T + E+ A N
Sbjct: 307 -----SGRGRGRSSTPRGRGRGHGNSYKANNAQTSHPSSSASEFSDIPGVSKEAWSAIRN 361
Query: 413 LVTKSAVCGANTAGNGKHSTLSYSSQYDSRDWIIDSGASDHICFNKLCFDTLNRIKPVSV 472
L+ + S+ S + + D++IDSGAS H+ + I V
Sbjct: 362 LLKQDTAT----------SSEKLSGKTNCVDFLIDSGASHHMTGFLDLLTEIYEIPHSVV 411
Query: 473 RLPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVVFGEFTCFV 532
LPN G + + + LTHVL+VP+ + LISV +L + C +F + C +
Sbjct: 412 VLPNAKHTIATKKGTLILGANMKLTHVLFVPDLSCTLISVARLLRELHCFAIFTDKVCVI 471
Query: 533 QEKHTKKRIGLGSLKEDDLYHLVVTSPSSTCFAIPNVVERISDSDQLIPPGALWHFRLGH 592
Q++ +K IG+G+ + + +YHL +T NVV+ ++ ALWH RLGH
Sbjct: 472 QDRTSKMLIGVGT-ESNGVYHLQRAEVVATS---ANVVKWKTNK-------ALWHMRLGH 520
Query: 593 LSHDRILALNALYPSID------VSKHFVCDICHLAKLKRKMFPDSLHNAKCNFDLLHMD 646
S L+++ PS++ +CD+C AK R F +S + A+ F +H D
Sbjct: 521 PSSK---VLSSVLPSLEDFDSCSSDLKTICDVCVRAKQTRASFSESFNKAEECFSFIHYD 577
Query: 647 IWGPISVSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKTQFSHHVKVI 706
+WGP +S YFLT++DDHSR VW+ L+ K EV ++ F+A+ QF+ VK +
Sbjct: 578 VWGPYKHASSCGAHYFLTIVDDHSRAVWIHLMLAKSEVASLLQQFIAMASRQFNKQVKTV 637
Query: 707 RSDNGPEFM-LHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIARALLFQSHLPKKM 765
RS+NG EFM L +++ GI+HQ SCV T QQNGRVERKH+HILN+AR+LLFQ+ LP
Sbjct: 638 RSNNGTEFMSLKSYFAERGIVHQISCVYTHQQNGRVERKHRHILNVARSLLFQAELPISF 697
Query: 766 WGYAILHAVFLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRSKLD 825
W ++L A +L++R + +L+ K+PY+IL+ + LRVFGSLCFA + K
Sbjct: 698 WEESVLTAAYLINRTPTPILDGKTPYKILY-SQPPSYASLRVFGSLCFARKHTGRLDKFQ 756
Query: 826 SRARKCVFLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILPY 868
R RKC+F+G+ G KG+ + D+ S FVSRDV F E I P+
Sbjct: 757 ERGRKCIFVGYPHGQKGWRIYDIESQIFFVSRDVVFQEDIFPF 799
>pir||E96608 probable retroelement polyprotein F25P12.89 [imported] -
Arabidopsis thaliana gi|9954746|gb|AAG09097.1| Putative
retroelement polyprotein [Arabidopsis thaliana]
Length = 1486
Score = 410 bits (1054), Expect = e-112
Identities = 266/851 (31%), Positives = 430/851 (50%), Gaps = 60/851 (7%)
Query: 60 ISPYYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKFINGEIPIAVPGDANY 119
ISPY + SG+NP + L G NY+ WA +++ L A+ KF F +G IP V D ++
Sbjct: 22 ISPYDLTSGDNPGTLISKPLLRGPNYDEWATNLRLALKARKKFGFADGSIPQPVETDPDF 81
Query: 120 EAWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLVRIAELQNEIGN 179
E W N L+ SW+ ++ +++ S+ ++++ + W ++ RF R+ L+ E+
Sbjct: 82 EDWTANNALVVSWMKLTIDETVSTSMSHLDDSHELWTHIQKRFGVKNGQRVQRLKTELAT 141
Query: 180 LKQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAIRFLLG 239
+Q +++ YY + LW L Y+ + + ++ + RE+D +FL+G
Sbjct: 142 CRQKGVAIETYYGRLSQLWRSLADYQ-----------QAKTMDDVRKEREEDKLHQFLMG 190
Query: 240 LNET-FSVVNSQILMSNPLPPIAKVVSLAMQHERQSETGENEESKSLVKMAEGKKTYGKG 298
L+E+ + V S +L PLP + + + Q +EESKSL ++ ++ G
Sbjct: 191 LDESVYGAVKSALLSRVPLPSLEEAYNALTQ---------DEESKSLSRL-HNERVDGVS 240
Query: 299 KAPGSSSSGSGYKSTGKYCTHCKKPGHTVDVCYRLHGYPTTSKSKPAFNNVSHINNIT*G 358
A ++S S + C++C + GH + C++L GYP + K N +
Sbjct: 241 FAVQTTSRPRD-SSENRVCSNCGRVGHLAEQCFKLIGYPPWLEEKLRLKNTA-------- 291
Query: 359 YDSEDDQEESSKSQRGNGDLFTADQYKTIMAMIQQVTATSASQKHTESGKAFANLVTKSA 418
S S K ++ +G + + + VT +S + T + + + S
Sbjct: 292 -SSSRGGLSSFKGKQSHGRGSSINHVASSGMAANVVTNSSLTSPLTSDDRIGLSGLNDSQ 350
Query: 419 VCGANTAGNGKHSTLS--YSSQYDSRDWIIDSGASDHICFNKLCFDTLNRIKPVSVRLPN 476
T + ST + S +Y WIIDSGA++H+ + + + PV ++LP+
Sbjct: 351 WKILQTILEERKSTSNDHQSGKYFLESWIIDSGATNHMTGSLAFLRNVCDMPPVLIKLPD 410
Query: 477 GISIQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVVFGEFTCFVQEKH 536
G G V + + L VL+V +LISV +L + RC + C VQ++
Sbjct: 411 GRFTTATKQGSVQLGSSLDLQDVLFVDGLHCHLISVSQLTRTRRCIFQITDKVCIVQDRT 470
Query: 537 TKKRIGLGSLKEDDLYHLVVTSPSSTCFAIPNVVERISDSDQLIPPGALWHFRLGHLSHD 596
T IG G +L L T A+ + + +P LWH RLGH S
Sbjct: 471 TLMLIGAGR----ELNGLYFFRGVETAAAV---------TSKALPSSQLWHQRLGHPSSK 517
Query: 597 RILALNALYPSIDVSKHF----VCDICHLAKLKRKMFPDSLHNAKCNFDLLHMDIWGPIS 652
+ L P DV+ C+IC AK R FP S + F+L+H D+WGP
Sbjct: 518 AL----HLLPFSDVTSSTFDSKTCEICIQAKQTRDPFPLSSNKTSFAFELVHCDLWGPYR 573
Query: 653 VSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKTQFSHHVKVIRSDNGP 712
+S+ RYFLT++DD+SR VW+ LL K E + +KNF+ALV+ Q++ ++K+IRSDNG
Sbjct: 574 TTSICGSRYFLTLVDDYSRAVWLYLLPSKQEAPKHLKNFIALVERQYTTNIKMIRSDNGS 633
Query: 713 EFM-LHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIARALLFQSHLPKKMWGYAIL 771
EF+ L DF++ GI+H+ SCV TPQQNGRVERKH+HILN+ARAL FQS LP + W Y L
Sbjct: 634 EFICLSDFFAQKGIIHETSCVGTPQQNGRVERKHRHILNVARALRFQSGLPIEFWSYCAL 693
Query: 772 HAVFLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRSKLDSRARKC 831
A +L++R + +L+ K+P+E+++ + LQ +R+FG +C+ K SR+ K
Sbjct: 694 TAAYLINRTPTPLLKGKTPFELIY-NRPPPLQHIRIFGCICYVHNLKHGGDKFASRSNKS 752
Query: 832 VFLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQILPYKSATSTAWTCLDLEVKDQS---S 888
+FLG+ KG+ + ++ + + VSRDV F E + + + LD + D S
Sbjct: 753 IFLGYPFAKKGWRVYNIETGVVSVSRDVVFRETEFHFPISVMDSSPSLDPVLVDSSELEE 812
Query: 889 SSLNIPETTSA 899
S+ P T S+
Sbjct: 813 ISMTPPVTPSS 823
>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
gi|9989049|gb|AAG10812.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1404
Score = 340 bits (873), Expect = 1e-91
Identities = 272/912 (29%), Positives = 433/912 (46%), Gaps = 122/912 (13%)
Query: 69 ENPSATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKF-INGEIPIAVPGDANYEA------ 121
E + ++ L G NY W+ + K VL + + I+ + P + E
Sbjct: 2 ETSQKVITTVILQGGNYLTWSRTTKTVLCGRGLWSHVISSQAPKEDKEEEETETISPEEE 61
Query: 122 -WDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRF-SQGVLVRIAELQNEIGN 179
W + + + + + NS+ +SI + E A + W L++ + ++ L R+ E++ I
Sbjct: 62 KWFQEDQAVLALLQNSLETSILEGYSYCETAKELWDTLKNVYGNESNLTRVFEVKKAINE 121
Query: 180 LKQNTLSVNDYYTEIKTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAIRFLLG 239
L Q L ++ + ++LW EL+ RP I H + REQD LL
Sbjct: 122 LSQEDLEFTKHFGKFRSLWSELKSLRP--------GTLDPKILHER--REQDKVFGLLLT 171
Query: 240 LNETFSVVNSQILMSNPLPPIAKVVSLAMQHERQSETGENEESKSLVKMAEGKKTYGKGK 299
LN ++ + +L S LP + +V S + Q TG L+ +G+ KG
Sbjct: 172 LNPGYNDLIKHLLRSEKLPSLDEVCSKIQKE--QGSTGLFGGKSELITANKGEVVANKGV 229
Query: 300 APGSSSSGSGYKSTGKY---CTHCKKPGHTVDVCYRLHGYPTTSKSKPAFNNVSHINNIT 356
YK+ + C HCKK GHT D C+ LH + +K K ++ +H
Sbjct: 230 ----------YKNEDRKLLTCDHCKKKGHTKDKCWLLHPHLKPAKFK---DSRAHF---- 272
Query: 357 *GYDSEDDQEESSKSQRGNGDLFTADQYKTIMAMIQQVTATSASQKHTESGKAFANLVTK 416
S++ EE S++ G E+ +F + V K
Sbjct: 273 ----SQETHEEQSQAGSSKG----------------------------ETSTSFGDYVRK 300
Query: 417 SAV-CGANTAGNGKHSTLSYSSQYDSRDWIIDSGASDHICFNKLCFDTLNRIKPV--SVR 473
S + + + K S +++SSQ S +IDSGAS H+ N + L+ I+P V
Sbjct: 301 SDLEALIKSIVSLKESGITFSSQTSSGSIVIDSGASHHMISNS---NLLDNIEPALGHVI 357
Query: 474 LPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVVFGEFTCFVQ 533
+ NG + G + + + + ++P+FT NL+SV + + C +FG + Q
Sbjct: 358 IANGDKVPIEGIGNLKLFNKD--SKAFFMPKFTSNLLSVKRTTRDLNCYAIFGPNDVYFQ 415
Query: 534 EKHTKKRIGLGSLKEDDLYHLVVTSP-SSTCFAIPNVVERISDSDQLIPPGALWHFRLGH 592
+ T K IG G K +LY L SP SS+CF+ S S I LWH RLGH
Sbjct: 416 DIETGKVIGEGGSK-GELYVLEDLSPNSSSCFS--------SKSHLGISFNTLWHARLGH 466
Query: 593 LSHDRILALNALYPSIDVSKHFVCDICHLAKLKRKMFPDSLHNAKCNFDLLHMDIWGPIS 652
H R AL + P+I H C+ C L K + +FP SL + FDL+H D+W
Sbjct: 467 -PHTR--ALKLMLPNISFD-HTSCEACILGKHCKSVFPKSLTIYEKCFDLVHSDVWTSPC 522
Query: 653 VSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKTQFSHHVKVIRSDNGP 712
VS +++YF+T +++ S++ W+ LL K V + NF V QF+ +KV R+DNG
Sbjct: 523 VSR-DNNKYFVTFINEKSKYTWITLLPSKDRVFEAFTNFETYVTNQFNAKIKVFRTDNGG 581
Query: 713 EF---MLHDFYSAHGILHQRSCVNTPQQNGRVERKHQHILNIARALLFQSHLPKKMWGYA 769
E+ D + GI+HQ SC TPQQNG ERK++H++ +AR+++F + +PK+ WG A
Sbjct: 582 EYTSQKFRDHLAKRGIIHQTSCPYTPQQNGVAERKNRHLMEVARSMMFHTSVPKRFWGDA 641
Query: 770 ILHAVFLMSRIQSKVLENKSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRSKLDSRAR 829
+L A +L++R +KVL + SP+E+L K + LRVFG +CF RSKLD+++
Sbjct: 642 VLTACYLINRTPTKVLSDLSPFEVLNNTKPF-IDHLRVFGCVCFVLIPGEQRSKLDAKST 700
Query: 830 KCVFLGFKQGVKGFVLLDLNSHEIFVSRDVTF-----------HEQILPYKSATS----T 874
KC+FLG+ KG+ D + F+SRDV F E + +TS T
Sbjct: 701 KCMFLGYSTTQKGYKCFDPTKNRTFISRDVKFLENQDYNNKKDWENLKDLTHSTSDRVET 760
Query: 875 AWTCLDLEVKDQSSSSLNIPETTSAAQELISDNENFSNT*LPCHTQSETIIEEE--NTQS 932
LD D +S++ + PE T ++L +NE S H ++ T ++E+ NTQ
Sbjct: 761 LKFLLDHLGNDSTSTTQHQPEMTQDQEDLNQENEEVSLQ----HQENLTHVQEDPPNTQE 816
Query: 933 ET-LIEEEEDDT 943
+ ++E +DD+
Sbjct: 817 HSEHVQEIQDDS 828
>gb|AAL68641.1| polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1472
Score = 329 bits (844), Expect = 3e-88
Identities = 236/818 (28%), Positives = 390/818 (46%), Gaps = 89/818 (10%)
Query: 83 RNYNAWA*SMKRVLVAKNKFKFINGEIPIAV-PGDANYEAWDRCNNLIHSWILNSVTSSI 141
+NY +W+ +L K ++ GE+ ++ W N+L+ +W+L S+ +I
Sbjct: 55 KNYLSWSRRALLILKTKGLEGYVTGEVKEPENTSSVEWKTWSTTNSLVVAWLLTSLIPAI 114
Query: 142 ANSIVFVENACDAWRDLRDRFS-QGVLVRIAELQNEIGNLKQNTLSVNDYYTEIKTLWEE 200
A ++ + +A + W+ L +S +G ++ + E Q +I L+Q SV +Y E+K+LW +
Sbjct: 115 ATTVETISSASEMWKTLTKLYSGEGNVMLMVEAQEKISALRQGERSVAEYVAELKSLWSD 174
Query: 201 LEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAIRFLLGLNETFSVVNSQILMSNPLPPI 260
L+ Y P+ + I K + E+ I FL GLN F + LP +
Sbjct: 175 LDHYDPLGLEHS------DCIAKMKKWVERRRVIEFLKGLNPEFEGRRDAMFHQTTLPTL 228
Query: 261 AKVVSLAMQHERQSETGENEESKS---LVKMAEGKKTYGKGKAPGSSSSGSGYKSTGKYC 317
+ ++ Q E + + + S + +GK+T + C
Sbjct: 229 DEAIAAMAQEELKKKVLPSAAPCSPSPTYAIVQGKET--------------------REC 268
Query: 318 THCKKPGHTVDVCYRLHGYPTTSKSKPAFNNVSHINNIT*GYDSEDDQEESSKSQRGNGD 377
+C + GH + C+ + KP + ++ + + + +S RG G
Sbjct: 269 FNCGEMGHLMRDCH--------APRKPTYGRGRGVDR----GGTRGGRGYAGRSNRGRGY 316
Query: 378 LFTADQYKTIMAMIQQVTATSASQKHTESGKA-FANLVTKSAVCGANTAGNGKHSTLSYS 436
+ D YK VT S T A FA+ +T+G+ + +S +
Sbjct: 317 GYRGD-YKA-----NAVTLEEGSSGTTPDNVANFAH----------STSGSFNQAFMSMN 360
Query: 437 SQYDSRDWIIDSGASDHICFNKLCFDTLNRIKPVS------VRLPNGISIQTCYAGVVSI 490
+ + S WI+DSGAS H+ F + KP S ++ +G S Q G+V
Sbjct: 361 TSHSS--WILDSGASRHVTGMSGEFTSY---KPYSFAHKETIQTADGTSCQVKGEGIVQC 415
Query: 491 TKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVVFGEFTCFVQEKHTKKRIGLGSLKEDD 550
T I L+ VLYV F NLIS+ L C V C +QE+ T K++G+G ++ D
Sbjct: 416 TPSITLSSVLYVHSFPVNLISISSLVDNMDCRVSLDRENCLIQERRTGKKLGIG-IRRDG 474
Query: 551 LYHLVVTSPSSTCFAIPNVVERISDSDQLIPPGALWHFRLGHLSHDRILALNALYPS--I 608
L++L + A+ ++ + + + L H RLGH+S + ++ ++P
Sbjct: 475 LWYLDRRGTNEDVCAL------MASTSKEVTEVLLLHCRLGHISFE---IMSKMFPVEFS 525
Query: 609 DVSKHF-VCDICHLAKLKRKMFPDSLHNAKCNFDLLHMDIWGPISVSSVHSHRYFLTVLD 667
V KH +CD C K R + + F L+H D+W V S+ +YF+T +D
Sbjct: 526 KVDKHMLICDACEYGKHTRTSYVSRGLRSILPFMLIHSDVWTS-PVVSMSGMKYFVTFID 584
Query: 668 DHSRFVWVILLKRKGEVQQ*IKNFVALVKTQFSHHVKVIRSDNGPEFMLHDF---YSAHG 724
+SR W+ L++ K EV + +NF A +K F+ V+ IR+DNG E+M +F S G
Sbjct: 585 CYSRMTWLYLMRHKDEVLKCFQNFYAYIKNHFNARVQFIRTDNGGEYMNSEFGHFLSLEG 644
Query: 725 ILHQRSCVNTPQQNGRVERKHQHILNIARALLFQSHLPKKMWGYAILHAVFLMSRIQSKV 784
ILHQ SC +TP QNG ERK++H+L IAR+L++ ++PK +W A++ A +L++R S++
Sbjct: 645 ILHQTSCPDTPPQNGVAERKNRHLLEIARSLMYTMNVPKFLWSEAVMTAAYLINRTPSRI 704
Query: 785 LENKSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRSKLDSRARKCVFLGFKQGVKGFV 844
L K+PYE++FG + + RVFG CF S KLD RA KC+F+G+ KG+
Sbjct: 705 LGMKTPYEMIFGKNEFVVPP-RVFGCTCFVRDHRPSIGKLDPRAVKCIFIGYSSSQKGYK 763
Query: 845 LLDLNSHEIFVSRDVTFHEQILPYKSATSTAWTCLDLE 882
+ FVS DVTF E + Y T + +DL+
Sbjct: 764 CWSPSERRTFVSMDVTFRESVPFYGEKTDISSLFVDLD 801
>gb|AAT38747.1| putative polyprotein [Solanum demissum]
Length = 1336
Score = 304 bits (778), Expect = 1e-80
Identities = 257/810 (31%), Positives = 367/810 (44%), Gaps = 72/810 (8%)
Query: 80 LNGRNYNAWA*SMKRVLVAKNKFKFINGEIPIAVPGDANYEAWDRCNNLIHSWILNSVTS 139
LNG NY W+ ++ L + K + + P D +AW R + + I+NS+
Sbjct: 23 LNGSNYLDWSRKIRIYLRSVEKDDHLIQDPPT----DDAKKAWLRDDARLILQIINSID- 77
Query: 140 SIANSIVFVENACDAWRDLRDRFS-----QGVLVRIAELQNEIGNLKQNTLSVNDYYTEI 194
N +V + N C+ ++L D +G L RI E+ ++ S+ Y+ E
Sbjct: 78 ---NEVVGLVNHCEFVKELMDYLEYLYSGKGNLSRIYEVSKAFYRSEKEAKSLTTYFMEF 134
Query: 195 KTLWEELEQYRPIPQCRCVVPCRCEAIEHAKMFREQDNAIRFLLGLNETFSVVNSQILMS 254
K +EEL P I+ + REQ + FL GL F SQIL S
Sbjct: 135 KKTYEELNVLLPFST----------DIKVQQAQREQMAIMSFLAGLPSEFETAKSQILSS 184
Query: 255 NPLPPIAKVVSLAMQHERQSETGENEESKSLV-KMAEGKKTYGKGKAPGSSSSGSGYKST 313
+ + + V S ++ E T N+++ LV K G+ G+ + + K
Sbjct: 185 SEITSLKDVFSQVLRTE---STPANQQTNVLVAKGGGGRNNAGRWNNNNDAGKWNNNKDG 241
Query: 314 GKYCTHCKKPG---HTVDVCYRLHGYPTTSKSKPAFNNVSHINNIT*GYDSEDDQ---EE 367
K+ H G H D G + +NN NN +++++ +E
Sbjct: 242 EKW-NHNNDAGKWNHNNDA-----GRWNNKNNVGVWNNNKEGNNDAGRWNNDNTCRYCKE 295
Query: 368 SSKSQRGNGDLFTADQYKTIMAMIQQVTATSASQKH-TESGKAFANLVTKSAVCGANTAG 426
+R L +Q A V ATS+S T S +A L A +
Sbjct: 296 PGHIRRNCKKLQLHNQQTQTAA----VAATSSSPSTVTISADEYARLTKYQESMPAPSLN 351
Query: 427 NGKHSTLSYSSQYDSRDWIIDSGASDHICFNKLCFDTLNRIK-PVSVRLPNGISIQTCYA 485
+ L SS +WIIDSGA+DH+ N F K P SV + +G S +
Sbjct: 352 ESGNKCLISSSS----NWIIDSGATDHMTGNPKFFSKFQAHKVPSSVTIVDGSSYTIEGS 407
Query: 486 GVVSITKQILLTHVLYVPEFTYNLISVHKLAKRNRCEVVFGEFTCFVQEKHTKKRIGLGS 545
G V+ T I L+ VL +P +NLISV KL K +C V C Q+ TK+ IG
Sbjct: 408 GTVNHTSSITLSSVLGLPSHAFNLISVSKLTKELKCFVSLYPDHCLFQDLMTKQIIGKRH 467
Query: 546 LKEDDLYHLVV-TSPSSTCFAIPNVVERISDSDQLIPPGALWHFRLGHLSHDRILALNAL 604
+ D LY L T PS C +I + E H RLGH S + L L
Sbjct: 468 VS-DGLYILDEWTPPSVACSSIVSPFEA--------------HCRLGHPS---LPVLKKL 509
Query: 605 YPSIDVSKHFVCDICHLAKLKR-KMFPDSLHNAKCNFDLLHMDIWGPISVSSVHSHRYFL 663
P C+ CH AK R + P + A F+L+H D+WGP V S RYF+
Sbjct: 510 CPQFHNVPSIDCESCHFAKHHRISLSPRNNKRANFAFELVHSDVWGPCPVVSKVGFRYFV 569
Query: 664 TVLDDHSRFVWVILLKRKGEVQQ*IKNFVALVKTQFSHHVKVIRSDNGPEFM---LHDFY 720
T +DD SR W+ +K + EV NF A +KTQF+ V ++RSDN EFM ++
Sbjct: 570 TFMDDFSRMTWIYFMKNRSEVFSHFSNFCAEIKTQFNASVHILRSDNAREFMSASFQNYM 629
Query: 721 SAHGILHQRSCVNTPQQNGRVERKHQHILNIARALLFQSHLPKKMWGYAILHAVFLMSRI 780
+ +GILHQ SCV+TP QNG ERK++H+L AR LLFQ +PK+ W + A FL++R+
Sbjct: 630 NQYGILHQSSCVDTPSQNGVAERKNRHLLETARVLLFQMKVPKQFWADTVSTASFLINRM 689
Query: 781 QSKVLENKSPYEILFGGKKIDLQELRVFGSLCFASTSSSSRSKLDSRARKCVFLGFKQGV 840
S VL PY +LF K + E +VFGS C+ +KLD +A KCVFLG+ +
Sbjct: 690 PSTVLNGDIPYGVLFPNKPLFPLEPKVFGSTCYVRDVRPHITKLDPKALKCVFLGYSRLQ 749
Query: 841 KGFVLLDLNSHEIFVSRDVTFHEQILPYKS 870
KG+ + VS DV F E I + S
Sbjct: 750 KGYRCYSPTLNRYMVSIDVVFSESISFFSS 779
>dbj|BAB10743.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1109
Score = 297 bits (760), Expect = 2e-78
Identities = 172/431 (39%), Positives = 238/431 (54%), Gaps = 18/431 (4%)
Query: 450 ASDHICFNKLCFDTLNRIKPVSVRLPNGISIQTCYAGVVSITKQILLTHVLYVPEFTYNL 509
AS H+ N + + PV + L +G G V + ++L V YV E +L
Sbjct: 169 ASHHMTGNLELLSDMRSMSPVLIILADGNKRVAVSEGTVRLGSHLILKSVFYVKELESDL 228
Query: 510 ISVHKLAKRNRCEVVFGEFTCFVQEKHTKKRIGLGSLKEDDLYHLVVTSPSSTCFAIPNV 569
ISV ++ N C V + +Q++ T+ G+G + S CF
Sbjct: 229 ISVGQMMDENHCVVQLADHFLVIQDRTTRMVTGIGKREN-----------GSFCFRGMEN 277
Query: 570 VERISDSDQLIPPGALWHFRLGHLSHDRILAL--NALYPSIDVSKHFVCDICHLAKLKRK 627
+ S + P LWH RLGH S D+I+ L L S VCD C AK R
Sbjct: 278 AAAVHTSVKA--PFDLWHRRLGHAS-DKIVNLLPRELLSSGKEILENVCDTCMRAKQTRD 334
Query: 628 MFPDSLHNAKCNFDLLHMDIWGPISVSSVHSHRYFLTVLDDHSRFVWVILLKRKGEVQQ* 687
FP S + + +F L+H D+WGP S RYFLT++DD+SR VWV L+ K E Q+
Sbjct: 335 TFPLSDNRSMDSFQLIHCDVWGPYRTPSYSGARYFLTIVDDYSRGVWVYLMTDKSETQKH 394
Query: 688 IKNFVALVKTQFSHHVKVIRSDNGPEFM-LHDFYSAHGILHQRSCVNTPQQNGRVERKHQ 746
+K+F+ALV+ QF +K +RSDNG EF+ + +++ GI H+ SCV TP QNGRVERKH+
Sbjct: 395 LKDFIALVERQFDTEIKTVRSDNGTEFLCMREYFLHKGITHETSCVGTPHQNGRVERKHR 454
Query: 747 HILNIARALLFQSHLPKKMWGYAILHAVFLMSRIQSKVLENKSPYEILFGGKKIDLQELR 806
HILNIARAL FQS+LP + WG IL A +L++R S +L+ KSPYE+L+ + LR
Sbjct: 455 HILNIARALRFQSYLPIQFWGECILSAAYLINRTPSMLLQGKSPYEMLYKTAP-NYSHLR 513
Query: 807 VFGSLCFASTSSSSRSKLDSRARKCVFLGFKQGVKGFVLLDLNSHEIFVSRDVTFHEQIL 866
VFGSLC+A + K +R+R+CVF+G+ G KG+ L DL + FVSRDV F E
Sbjct: 514 VFGSLCYAHNQNHKGDKFVARSRRCVFVGYPHGQKGWRLFDLEEQKFFVSRDVIFQETEF 573
Query: 867 PYKSATSTAWT 877
PY + +T
Sbjct: 574 PYSKMSCNRFT 584
Score = 89.0 bits (219), Expect = 9e-16
Identities = 46/162 (28%), Positives = 81/162 (49%), Gaps = 1/162 (0%)
Query: 49 PHVVLEPAQNPISPYYIHSGENPSATVVSLPLNGRNYNAWA*SMKRVLVAKNKFKFINGE 108
P V+E + ISPY + + +N A + L NY WA K L ++ KF F++G
Sbjct: 8 PPSVIE-VRRTISPYDLTAADNSGAVISHPILKTNNYEEWACGFKTALRSRKKFGFLDGT 66
Query: 109 IPIAVPGDANYEAWDRCNNLIHSWILNSVTSSIANSIVFVENACDAWRDLRDRFSQGVLV 168
IP + G + E W N L+ SW+ ++ S + +I + A D W +R RF
Sbjct: 67 IPQPLDGSPDLEDWLTINALLVSWMKMTIDSELLTNISHRDVARDLWEQIRKRFFVSNGP 126
Query: 169 RIAELQNEIGNLKQNTLSVNDYYTEIKTLWEELEQYRPIPQC 210
+ +++ ++ KQ +++ YY ++ +W+ + YRP+ C
Sbjct: 127 KNQKMKADLATCKQEGMTMEGYYGKLNKIWDNINSYRPLRIC 168
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.331 0.141 0.442
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,850,557,645
Number of Sequences: 2540612
Number of extensions: 75101886
Number of successful extensions: 253963
Number of sequences better than 10.0: 1057
Number of HSP's better than 10.0 without gapping: 672
Number of HSP's successfully gapped in prelim test: 385
Number of HSP's that attempted gapping in prelim test: 249403
Number of HSP's gapped (non-prelim): 2188
length of query: 1167
length of database: 863,360,394
effective HSP length: 139
effective length of query: 1028
effective length of database: 510,215,326
effective search space: 524501355128
effective search space used: 524501355128
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 81 (35.8 bits)
Lotus: description of TM0334a.1