
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0321.5
(1411 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] 711 0.0
gb|AAK51235.1| polyprotein [Arabidopsis thaliana] 707 0.0
gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis ... 701 0.0
gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein ... 701 0.0
emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|32692... 698 0.0
gb|AAV24907.1| hypothetical protein [Oryza sativa (japonica cult... 692 0.0
ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (jap... 682 0.0
emb|CAB81170.1| retrotransposon like protein [Arabidopsis thalia... 678 0.0
gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (ja... 670 0.0
ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (jap... 669 0.0
dbj|BAB10876.1| polyprotein [Arabidopsis thaliana] 659 0.0
gb|AAC02664.1| polyprotein [Arabidopsis thaliana] 652 0.0
gb|AAC02666.1| polyprotein [Arabidopsis thaliana] 649 0.0
gb|AAC02669.1| polyprotein [Arabidopsis thaliana] 648 0.0
gb|AAF99727.1| F17L21.7 [Arabidopsis thaliana] 640 0.0
gb|AAC02672.1| polyprotein [Arabidopsis arenosa] gi|7522104|pir|... 601 e-170
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi... 573 e-161
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ... 573 e-161
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t... 565 e-159
gb|AAG50751.1| polyprotein, putative [Arabidopsis thaliana] gi|2... 548 e-154
>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 711 bits (1835), Expect = 0.0
Identities = 388/874 (44%), Positives = 525/874 (59%), Gaps = 36/874 (4%)
Query: 547 FFPVYLVLSVEAQDWFL*CSCSLQSVCGKSISRSIKVFQSDNGTEFTNNKVHTLFASSGV 606
FFP+ + + F+ + Q + + IK FQSD G EFT+NK+ F G+
Sbjct: 549 FFPLRM------KSKFISVFIAYQKLVENQLGTKIKEFQSDGGGEFTSNKLKEHFREHGI 602
Query: 607 LHRFSCPHTQAQNGRVEQKHRHVLELGLAMLYHSHVPTRYWVDAFSTAVYIINRVPSKVL 666
HR SCP+T QNG E+KHRH++ELGL+MLYHSH P ++WV+AF TA Y+ N +PS VL
Sbjct: 603 HHRISCPYTPQQNGVAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLSNLLPSSVL 662
Query: 667 SNQIPFQLLFHVAPTYANFHPFGCRVFPCLRPYMNNKFSPRSTPCIFFGYSSHHKGFKCF 726
P++ LF Y FG +PCLRP NKF PRS C+F GY + +KG++C
Sbjct: 663 KEISPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYRCL 722
Query: 727 NPAISLTYVSHHAQFDEFCFPFAGSHSS---PHDLVVSTFYEPASGSPPSPHPVVLVAPE 783
P Y+S H FDE FPF + S + + ++ +PPS P
Sbjct: 723 YPPTGKVYISRHVIFDEAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTPPS-------VPS 775
Query: 784 SSL-PIGSLSCPSCDDSDVLPALPVPPADAPP---SPAPPSPTQTATPPVGTQAPPASPP 839
S L P+ P S+ P + +A + T++ AP +
Sbjct: 776 SQLQPLARQMTPMAT-SENQPMMNYETEEAVNVNMETSSDEETESNDEFDHEVAPVLNDQ 834
Query: 840 TTPPVVAQVPLAPIAARPRTRSQNGIFRPNPRYALVHAQPTGILTAFHTVTKPKGFKSAM 899
+ Q L + TRS++GI +PNPRYAL+ ++ + +PK +AM
Sbjct: 835 NEDNALGQGSLENLHPMI-TRSKDGIQKPNPRYALIVSKSS--------FDEPKTITTAM 885
Query: 900 KHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGTVERLKARLVAQ 959
KHP W AA+ DE+ +H TW+LVP N++ KWVF+TK DGT+++LKARLVA+
Sbjct: 886 KHPSWNAAVMDEIDRIHMLNTWSLVPATEDMNILTSKWVFKTKLKPDGTIDKLKARLVAK 945
Query: 960 GFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHGHLTETVYMEQP 1019
GF Q G DY TFSPVV+ T+RL+L N+W L QLDV NAFLHG L E V+M QP
Sbjct: 946 GFDQEEGVDYLETFSPVVRTATIRLVLDTATANEWPLKQLDVSNAFLHGELQEPVFMFQP 1005
Query: 1020 PGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADPSLFFFYKGHTT 1079
GFVDP P HVCRL KALYGLKQAPRAWF S+FLL GF CS +DPSLF ++ +
Sbjct: 1006 SGFVDPNKPNHVCRLTKALYGLKQAPRAWFDTFSNFLLDFGFECSTSDPSLFVCHQNGQS 1065
Query: 1080 LYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITYTPDGLFLGQAK 1139
L LL+YVDDI+LTGSD L+ + + LN F++K LG YFLG+EI +GLFL Q
Sbjct: 1066 LILLLYVDDILLTGSDQLLMDKLLQALNNRFSMKDLGPPRYFLGIEIESYNNGLFLHQHA 1125
Query: 1140 YAHDLLSRAMMLEASHVSTPLAAGSHLVS-SGEAYFDPTHYRSLVGALQYLTITRPDLSY 1198
YA D+L +A M E + + TPL HL + E + +PT++RSL G LQYLTITRPD+ Y
Sbjct: 1126 YASDILHQAGMTECNPMPTPLP--QHLEDLNSEPFEEPTYFRSLAGKLQYLTITRPDIQY 1183
Query: 1199 AVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSDADWARCTDTR 1258
AVN + Q + APT F +KRILRYV GT + GL R+ +P + G+ D+D+A C DTR
Sbjct: 1184 AVNFICQRMHAPTNSDFGLLKRILRYVKGTINMGLPIRKHHNPVLSGFCDSDYAGCKDTR 1243
Query: 1259 RSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNLLHELRVRLSA 1318
RST G+ I LG L+SWSAK+QPT++ SS E+EYRA+++TA E+ W+ +LL +L +
Sbjct: 1244 RSTTGFCILLGSTLISWSAKRQPTISHSSTEAEYRALSDTAREITWISSLLRDLGISQHQ 1303
Query: 1319 TPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPTSLQLADIFTK 1378
+ DN SA++++ NP HK +KH D D H++ E VA G + +H+P ++QLAD+FTK
Sbjct: 1304 PTRVFCDNLSAVYLSANPALHKRSKHFDKDFHYIRERVALGLIETQHIPATIQLADVFTK 1363
Query: 1379 ALPRPLFEIFRSKLRVG---LNPTLSLKGVLEMN 1409
+LPR F R+KL V ++PT SLK +E N
Sbjct: 1364 SLPRRPFITLRAKLGVSASPVSPTPSLKEGVEDN 1397
>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
Length = 1453
Score = 707 bits (1824), Expect = 0.0
Identities = 380/861 (44%), Positives = 513/861 (59%), Gaps = 17/861 (1%)
Query: 556 VEAQDWFL*CSCSLQSVCGKSISRSIKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHT 615
++A+ F + Q++ + IKVFQSD G EFT+N + G+ HR SCP+T
Sbjct: 553 LKAKSDFFAVFVAFQNLVENQFNTKIKVFQSDGGGEFTSNLMKKHLTDCGIQHRISCPYT 612
Query: 616 QAQNGRVEQKHRHVLELGLAMLYHSHVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLL 675
QNG E+KHRH +ELGL+M++HSH P ++WV+AF TA ++ N +PS L N P + L
Sbjct: 613 PQQNGIAERKHRHFVELGLSMMFHSHTPLQFWVEAFFTASFLSNMLPSPSLGNVSPLEAL 672
Query: 676 FHVAPTYANFHPFGCRVFPCLRPYMNNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYV 735
P YA FG +PCLRP +KF PRS C+F GY+S +KG++C P Y+
Sbjct: 673 LKQKPNYAMLRVFGTACYPCLRPLGEHKFEPRSLQCVFLGYNSQYKGYRCLYPPTGRVYI 732
Query: 736 SHHAQFDEFCFPFAGSHSSPHDLVVSTFYEPASGSPPSPHPVVLVAPESSLPIGSLSCPS 795
S H FDE FPF + S+ S P ++ E I SL+ P
Sbjct: 733 SRHVIFDEETFPFKQKYQFLVPQYESSLLSAWQSSIPQADQSLIPQAEEG-KIESLAKPP 791
Query: 796 CDDSDVLPALPVPPADAPPS-----PAPPSPTQTATPPVGTQAPPASPPTTPPVVAQVPL 850
+ + PA S +T T + + + V +V
Sbjct: 792 SIQKNTIQDTTTQPAILTEGVLNEEEEEDSFEETETESLNEETHTQNDEAEVTVEEEVQQ 851
Query: 851 APIAARPRT-RSQNGIFRPNPRYALVHAQPTGILTAFHTVTKPKGFKSAMKHP*WLAAME 909
P P T RS+ GI + N RYAL LT+ +V +PK A+ HP W A+
Sbjct: 852 EPENTHPMTTRSKAGIHKSNTRYAL--------LTSKFSVEEPKSIDEALNHPGWNNAVN 903
Query: 910 DELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGTVERLKARLVAQGFTQIPGFDY 969
DE+ +H TW+LV N++G +WVF+TK DG+V++LKARLVA+GF Q G DY
Sbjct: 904 DEMRTIHMLHTWSLVQPTEDMNILGCRWVFKTKLKPDGSVDKLKARLVAKGFHQEEGLDY 963
Query: 970 SLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHGHLTETVYMEQPPGFVDPRFPT 1029
TFSPVV+ T+RL+L W + QLDV NAFLHG L E VYM QPPGFVD P+
Sbjct: 964 LETFSPVVRTATIRLVLDVATAKGWNIKQLDVSNAFLHGELKEPVYMLQPPGFVDQEKPS 1023
Query: 1030 HVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADPSLFFFYKGHTTLYLLVYVDDI 1089
+VCRL KALYGLKQAPRAWF +S++LL GFSCS++DPSLF ++K TL LL+YVDDI
Sbjct: 1024 YVCRLTKALYGLKQAPRAWFDTISNYLLDFGFSCSKSDPSLFTYHKNGKTLVLLLYVDDI 1083
Query: 1090 ILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITYTPDGLFLGQAKYAHDLLSRAM 1149
+LTGSD +LL + + LN F++K LG YFLG+EI +P+GLFL Q YA D+L +A
Sbjct: 1084 LLTGSDHNLLQELLMSLNKRFSMKDLGAPSYFLGVEIESSPEGLFLHQTAYAKDILHQAA 1143
Query: 1150 MLEASHVSTPLAAGSHLVSSGEAYFDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQA 1209
M + + TPL ++S + + +PT++RSL G LQYLTITRPD+ +AVN + Q + +
Sbjct: 1144 MSNCNSMPTPLPQHIENLNS-DLFPEPTYFRSLAGKLQYLTITRPDIQFAVNFICQRMHS 1202
Query: 1210 PTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSDADWARCTDTRRSTYGYAIFLG 1269
PT F +KRILRYV GT H GL ++ + +++ YSD+DWA C +TRRST G+ LG
Sbjct: 1203 PTTADFGLLKRILRYVKGTIHLGLHIKKNQNLSLVAYSDSDWAGCKETRRSTTGFCTLLG 1262
Query: 1270 DNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNLLHELRVRLSATPLLLSDNQSA 1329
NL+SWSAK+Q TV++SS E+EYRA+ A EL WL LL ++ V + L+ DN SA
Sbjct: 1263 CNLISWSAKRQETVSKSSTEAEYRALTAVAQELTWLSFLLRDIGVTQTHPTLVKCDNLSA 1322
Query: 1330 LFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPTSLQLADIFTKALPRPLFEIFR 1389
++++ NP H +KH D D H++ E VA G + +H+ +LQLADIFTK LPR F R
Sbjct: 1323 VYLSANPALHNRSKHFDTDYHYIREQVALGLVETKHISATLQLADIFTKPLPRRAFIDLR 1382
Query: 1390 SKLRVGLNPTLSLKG-VLEMN 1409
KL V PT SL+G V EM+
Sbjct: 1383 IKLGVAEPPTTSLRGNVSEMS 1403
>gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis thaliana]
gi|25301689|pir||C96578 hypothetical protein T18A20.5
[imported] - Arabidopsis thaliana
Length = 1522
Score = 701 bits (1810), Expect = 0.0
Identities = 383/892 (42%), Positives = 519/892 (57%), Gaps = 66/892 (7%)
Query: 570 QSVCGKSISRSIKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHV 629
Q + + IK+FQ D G EF +++ G+ SCP+T QNG E+KHRH+
Sbjct: 568 QKLVENQLGHKIKIFQCDGGGEFISSQFLKHLQDHGIQQNMSCPYTPQQNGMAERKHRHI 627
Query: 630 LELGLAMLYHSHVPTRYWVDAFSTAVYIINRVPSKVL-SNQIPFQLLFHVAPTYANFHPF 688
+ELGL+M++ S +P +YW+++F TA ++IN +P+ L +N+ P+Q L+ AP Y+ F
Sbjct: 628 VELGLSMIFQSKLPLKYWLESFFTANFVINLLPTSSLDNNESPYQKLYGKAPEYSALRVF 687
Query: 689 GCRVFPCLRPYMNNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPF 748
GC +P LR Y + KF PRS C+F GY+ +KG++C P Y+S H FDE PF
Sbjct: 688 GCACYPTLRDYASTKFDPRSLKCVFLGYNEKYKGYRCLYPPTGRIYISRHVVFDENTHPF 747
Query: 749 AG--SHSSPHD---LVVSTFYEPASGSPPSPHPVVLVAPESSLPIGSLSCPSCDDSDVLP 803
SH P D L+ + F +P P +S P+ S+ P D P
Sbjct: 748 ESIYSHLHPQDKTPLLEAWFKSFHHVTPTQPD-------QSRYPVSSIPQPETTDLSAAP 800
Query: 804 ALPVPPADAPP---------------SPAPP---------------SPTQTATPPVGTQA 833
A P S +P SPT ++ P ++
Sbjct: 801 ASVAAETAGPNASDDTSQDNETISVVSGSPERTTGLDSASIGDSYHSPTADSSHPSPARS 860
Query: 834 PPASPPTTPPVV---AQVPLAPIAARPR--TRSQNGIFRPNPRYALVHAQPTGILTAFHT 888
PAS P P+ AQ AP+ TR + GI +PN RY L LT +
Sbjct: 861 SPASSPQGSPIQMAPAQQVQAPVTNEHAMVTRGKEGISKPNKRYVL--------LTHKVS 912
Query: 889 VTKPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGT 948
+ +PK A+KHP W AM++E+ + TWTLVP NV+G WVFRTK H+DG+
Sbjct: 913 IPEPKTVTEALKHPGWNNAMQEEMGNCKETETWTLVPYSPNMNVLGSMWVFRTKLHADGS 972
Query: 949 VERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHG 1008
+++LKARLVA+GF Q G DY T+SPVV+ TVRLIL + W+L Q+DVKNAFLHG
Sbjct: 973 LDKLKARLVAKGFKQEEGIDYLETYSPVVRTPTVRLILHVATVLKWELKQMDVKNAFLHG 1032
Query: 1009 HLTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADP 1068
LTETVYM QP GFVD P HVC L+K+LYGLKQ+PRAWF R S+FLL GF CS DP
Sbjct: 1033 DLTETVYMRQPAGFVDKSKPDHVCLLHKSLYGLKQSPRAWFDRFSNFLLEFGFICSLFDP 1092
Query: 1069 SLFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITY 1128
SLF + + + LL+YVDD+++TG++ LT +A LN EF +K +G++ YFLG++I
Sbjct: 1093 SLFVYSSNNDVILLLLYVDDMVITGNNSQSLTHLLAALNKEFRMKDMGQVHYFLGIQIQT 1152
Query: 1129 TPDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHLVSS-GEAYFDPTHYRSLVGALQ 1187
GLF+ Q KYA DLL A M S + TPL VS+ E + DPT++RSL G LQ
Sbjct: 1153 YDGGLFMSQQKYAEDLLITASMANCSPMPTPLPLQLDRVSNQDEVFSDPTYFRSLAGKLQ 1212
Query: 1188 YLTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAV---- 1243
YLT+TRPD+ +AVN V Q + P++ F +KRILRY+ GT G+ + SS V
Sbjct: 1213 YLTLTRPDIQFAVNFVCQKMHQPSVSDFNLLKRILRYIKGTVSMGIQYNSNSSSVVSAYE 1272
Query: 1244 -----LGYSDADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANT 1298
YSD+D+A C +TRRS GY F+G N++SWS+KKQPTV+RSS E+EYR+++ T
Sbjct: 1273 SDYDLSAYSDSDYANCKETRRSVGGYCTFMGQNIISWSSKKQPTVSRSSTEAEYRSLSET 1332
Query: 1299 ASELVWLLNLLHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCELVAS 1358
ASE+ W+ ++L E+ V L TP L DN SA+++ NP H KH D+D H++ E VA
Sbjct: 1333 ASEIKWMSSILREIGVSLPDTPELFCDNLSAVYLTANPAFHARTKHFDVDHHYIRERVAL 1392
Query: 1359 GRLAVRHVPTSLQLADIFTKALPRPLFEIFRSKLRVGLNPTLSLKGVLEMNS 1410
L V+H+P LQLADIFTK+LP F R KL V PT SL+G + N+
Sbjct: 1393 KTLVVKHIPGHLQLADIFTKSLPFEAFTRLRFKLGVDFPPTPSLRGCISTNN 1444
>gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein [Zea mays]
gi|7444442|pir||T02087 gag/pol polyprotein - maize
retrotransposon Hopscotch
Length = 1439
Score = 701 bits (1809), Expect = 0.0
Identities = 397/866 (45%), Positives = 519/866 (59%), Gaps = 64/866 (7%)
Query: 567 CSLQSVCGKSISRSIKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKH 626
C Q + + R I FQSD G E+ H F + G+ H+ SCPHT QNG E+KH
Sbjct: 563 CEFQHLVERMFGRKIIAFQSDWGGEYEKLNAH--FKTIGIHHQVSCPHTHQQNGAAERKH 620
Query: 627 RHVLELGLAMLYHSHVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFH 686
RH++E+GLA+L S +P +YW AF AVY+INR PSK +++ P L P Y++
Sbjct: 621 RHIVEVGLALLAQSSMPLKYWDHAFLAAVYLINRTPSKTIAHDTPLHKLTGATPDYSSLR 680
Query: 687 PFGCRVFPCLRPYMNNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCF 746
FGC +P LRPY +K RST C+F GYS+ HKGFKC + + Y+S FDE F
Sbjct: 681 IFGCACWPNLRPYNQHKLQFRSTRCVFLGYSNMHKGFKCLDISTGRIYISRDVVFDEHVF 740
Query: 747 PFAGSHSS------------PHDLVVSTFY-EPASGSPPSPHPVVLVAPE-----SSLPI 788
PFA + + PHD + + A+ P S P+ +A S +P
Sbjct: 741 PFASLNKNAGVKYTSEVLLLPHDSCGNNMLTDHANNLPGSSSPLPFLAQHFLQGNSEVPT 800
Query: 789 GS---LSCPSCDDSDV------LPALPVPPADAPPS-------PAPPSPTQTATPPVGTQ 832
+ ++ P+ ++V +P+ VP A P+ PAP + + ++ PPV T+
Sbjct: 801 SNNTAMALPASGPNEVSVPPALVPSSLVPAASPAPTGVSANAEPAPEADSLSSGPPVATE 860
Query: 833 A----PPASPPTTPP---VVAQVP-LAPI-AARPRTRSQNGIFRPNP------RYALVHA 877
+ P A P P V Q P AP+ AA PRTR Q+GI +P RY A
Sbjct: 861 SVTGVPDADPLLQAPGSSVAHQTPDSAPLSAAAPRTRLQHGISKPKQFTDGTVRYGNAAA 920
Query: 878 QPTGILTAFHTVTKPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKW 937
+ +T+P A+ P W AAME E AL KN TWTLVP T N++ KW
Sbjct: 921 R----------ITEPSSVSEALADPQWRAAMEAEFQALQKNNTWTLVPPDRTRNLIDCKW 970
Query: 938 VFRTKFHSDGTVERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLH 997
VF+ K+++DG+++RLKARLVA+GF Q G DY TFSPVVK +T+RL+LS V W L
Sbjct: 971 VFKVKYNADGSIDRLKARLVAKGFKQQYGIDYDDTFSPVVKHSTIRLVLSLAVSQKWSLR 1030
Query: 998 QLDVKNAFLHGHLTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLL 1057
QLDV+NAFLHG L ETVYM+QPPGF D P + C L K+LYGLKQ PRAW+ RLS L
Sbjct: 1031 QLDVQNAFLHGILEETVYMKQPPGFADTTHPNYHCHLQKSLYGLKQRPRAWYSRLSEKLQ 1090
Query: 1058 RHGFSCSRADPSLFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGK 1117
GF S+AD SLF + T +Y+LVYVDDII+TGS P + +A+L +FAIK LG
Sbjct: 1091 SLGFVPSKADVSLFIYNAHSTAIYILVYVDDIIITGSSPHAIDNVLAKLKDDFAIKDLGD 1150
Query: 1118 LGYFLGLEITYTPDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHLVSSGEAYFDP- 1176
L YFLG+E+ DGL L Q KYA DLL R M V TP+A L +S P
Sbjct: 1151 LHYFLGIEVHRKGDGLLLCQEKYARDLLKRVGMECCKPVHTPVATSEKLSASAGTLLSPE 1210
Query: 1177 --THYRSLVGALQYLTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLT 1234
T YRS+VGALQYLT+TRPDLSYA+N V QFL APT H+ AVKRILR + T GLT
Sbjct: 1211 ETTKYRSVVGALQYLTLTRPDLSYAINRVCQFLHAPTDLHWTAVKRILRNIQHTIGLGLT 1270
Query: 1235 FRRTSSPAVLGYSDADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRA 1294
R + S + +SDADWA C D R+ST GYA+FLG NL+SW++KKQ TV+RSS E+EY+A
Sbjct: 1271 IRPSLSLMLSAFSDADWAGCPDDRKSTGGYALFLGPNLISWNSKKQSTVSRSSTEAEYKA 1330
Query: 1295 MANTASELVWLLNLLHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCE 1354
MAN +E++WL +LLHEL +RL+ P L DN A +++ P+ + KHI++D HFV +
Sbjct: 1331 MANATAEVIWLQSLLHELGIRLTGIPRLWCDNLGATYLSSKPIFNARTKHIEVDFHFVRD 1390
Query: 1355 LVASGRLAVRHVPTSLQLADIFTKAL 1380
V S +L +R + T+ Q+AD FTKAL
Sbjct: 1391 RVLSKKLDIRLISTNDQVADGFTKAL 1416
>emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|3269282|emb|CAA19715.1|
putative protein [Arabidopsis thaliana]
gi|7444417|pir||T05745 hypothetical protein M4I22.20 -
Arabidopsis thaliana
Length = 1318
Score = 698 bits (1801), Expect = 0.0
Identities = 372/862 (43%), Positives = 504/862 (58%), Gaps = 50/862 (5%)
Query: 577 ISRSIKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAM 636
+S+ I VFQ D G EF ++K S G+ + SCPHT QNG E+KHRH++ELGL+M
Sbjct: 386 LSQKISVFQCDGGGEFVSHKFLQHLQSHGIQQQLSCPHTPQQNGLAERKHRHLVELGLSM 445
Query: 637 LYHSHVPTRYWVDAFSTAVYIINRVPSKVLSNQI-PFQLLFHVAPTYANFHPFGCRVFPC 695
L+ SHVP ++WV+AF TA ++IN +P+ L I P++ L+ P Y + FG FP
Sbjct: 446 LFQSHVPHKFWVEAFFTANFLINLLPTSALKESISPYEKLYDKKPDYTSLRSFGSACFPT 505
Query: 696 LRPYMNNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSSP 755
LR Y NKF+P S C+F GY+ +KG++C P Y+S H FDE +PF+ ++
Sbjct: 506 LRDYAENKFNPCSLKCVFLGYNEKYKGYRCLYPPTGRLYISRHVIFDESVYPFSHTYKHL 565
Query: 756 HDLVVSTFYEPASGSPPSPHPVVLVAPESSLPIGSLSCPSCDDSDVLPALPVPPADAPPS 815
H + S SP P +P S P+ + S P LP P+
Sbjct: 566 HPQPRTPLLAAWLRSSDSPAPSTSTSPSSRSPLFT--------SADFPPLPQRKTPLLPT 617
Query: 816 PAPPSPTQTATPPVGTQAPPASPPTTPPV----------------------------VAQ 847
P S A+ Q+P T V Q
Sbjct: 618 LVPISSVSHASNITTQQSPDFDSERTTDFDSASIGDSSHSSQAGSDSEETIQQASVNVHQ 677
Query: 848 VPLAPIAARPRTRSQNGIFRPNPRYALVHAQPTGILTAFHTVT--KPKGFKSAMKHP*WL 905
+ TR++ GI +PNPRY + H V+ +PK +A+KHP W
Sbjct: 678 THASTNVHPMVTRAKVGISKPNPRYVFLS----------HKVSYPEPKTVTAALKHPGWT 727
Query: 906 AAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGTVERLKARLVAQGFTQIP 965
AM +E+ + TW+LVP S +V+G KWVFRTK H+DGT+ +LKAR+VA+GF Q
Sbjct: 728 GAMTEEIGNCSETQTWSLVPYKSDMHVLGSKWVFRTKLHADGTLNKLKARIVAKGFLQEE 787
Query: 966 GFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHGHLTETVYMEQPPGFVDP 1025
G DY T+SPVV+ TVRL+L +W + Q+DVKNAFLHG L ETVYM QP GFVDP
Sbjct: 788 GIDYLETYSPVVRTPTVRLVLHLATALNWDIKQMDVKNAFLHGDLKETVYMTQPAGFVDP 847
Query: 1026 RFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADPSLFFFYKGHTTLYLLVY 1085
P HVC L+K++YGLKQ+PRAWF + S+FLL GF CS++DPSLF + + + LL+Y
Sbjct: 848 SKPDHVCLLHKSIYGLKQSPRAWFDKFSTFLLEFGFFCSKSDPSLFIYAHNNNLILLLLY 907
Query: 1086 VDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITYTPDGLFLGQAKYAHDLL 1145
VDD+++TG+ LT +A LN EF + +G+L YFLG+++ +GLF+ Q KYA DLL
Sbjct: 908 VDDMVITGNSSQTLTSLLAALNKEFRMTDMGQLHYFLGIQVQRQQNGLFMSQQKYAEDLL 967
Query: 1146 SRAMMLEASHVSTPLAAGSHLVSSGEAYF-DPTHYRSLVGALQYLTITRPDLSYAVNTVS 1204
A M + + TPL V E F DPT++RS+ G LQYLT+TRPD+ +AVN V
Sbjct: 968 IAASMEHCTPLPTPLPVQLDRVPHQEELFSDPTYFRSIAGKLQYLTLTRPDIQFAVNFVC 1027
Query: 1205 QFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSDADWARCTDTRRSTYGY 1264
Q + PT+ F +KRILRY+ GT G+++ R S + YSD+DW C TRRS G
Sbjct: 1028 QKMHQPTISDFHLLKRILRYIKGTITMGISYSRDSPTLLQAYSDSDWGNCKQTRRSVGGL 1087
Query: 1265 AIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNLLHELRVRLSATPLLLS 1324
F+G NL+SWS+KK PTV+RSS E+EY+++++ ASE++WL LL ELR+ L TP L
Sbjct: 1088 CTFMGTNLVSWSSKKHPTVSRSSTEAEYKSLSDAASEILWLSTLLRELRIPLPDTPELFC 1147
Query: 1325 DNQSALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPTSLQLADIFTKALPRPL 1384
DN SA+++ NP H KH D+D HFV E VA L V+H+P S Q+ADIFTK+LP
Sbjct: 1148 DNLSAVYLTANPAFHARTKHFDIDFHFVRERVALKALVVKHIPGSEQIADIFTKSLPYEA 1207
Query: 1385 FEIFRSKLRVGLNPTLSLKGVL 1406
F R KL V L+PT SL+G +
Sbjct: 1208 FIHLRGKLGVTLSPTPSLRGTI 1229
>gb|AAV24907.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
gi|51854440|gb|AAU10819.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1679
Score = 692 bits (1787), Expect = 0.0
Identities = 386/875 (44%), Positives = 514/875 (58%), Gaps = 69/875 (7%)
Query: 580 SIKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAMLYH 639
+I+ Q DNG EF N+ T F ++GV R SCPHT QNG+ E+ R + + +ML+
Sbjct: 813 NIRSIQCDNGREFDNSAARTFFLTNGVHLRMSCPHTSPQNGKAERILRSLNNIVRSMLFQ 872
Query: 640 SHVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFPCLRPY 699
+ +P +WV+A TA ++INR P+K L P L+ P+Y++ FGC+ +P L
Sbjct: 873 AKLPGSFWVEALHTATHLINRHPTKTLDRHTPHFALYGTHPSYSHLRVFGCKCYPNLSAT 932
Query: 700 MNNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAG--------- 750
+K +PRST C+F GY +HKG++CF+P + +S H FDE FPF
Sbjct: 933 TPHKLAPRSTMCVFLGYPLYHKGYRCFDPLSNRVIISRHVVFDEHSFPFTELTNGVSNAT 992
Query: 751 ------SHSSPHDLVVSTFYEPA------SGSPPSPHPVVLVAPES-SLPIGSLSCPSCD 797
++P + PA + S P H + P S + P+ + PS
Sbjct: 993 DLDFLEDFTAPAQAPIGATRRPAVAPTTQTASSPMVHGLERPPPCSPTRPVSTPGGPSSP 1052
Query: 798 DSDVLPALPVP----PADAPPSP--APPSPTQTATPPVGTQAPPASPPTTPPVV------ 845
DS + P P P PA P P A P+P+ + P T A PPT P +
Sbjct: 1053 DSRLGPPSPTPALIGPASTSPGPPSAGPAPSASTCPASSTWETVARPPTPPGLPRLDGPH 1112
Query: 846 ------------------AQVPLAPIAARP-------RTRSQNGIFRPNPRYALVHAQPT 880
A PL+ + P TR+++G +P R L HA P
Sbjct: 1113 LPPAPHVPRRLRSVRATGAPTPLSGLEISPVVNDHVMTTRAKSGHHKPVHRLNL-HAAPL 1171
Query: 881 GILTAFHTVTKPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFR 940
++ PK +++A+ P W AAME+E +AL N TW LVPRP+ NVV KW+F+
Sbjct: 1172 SLV--------PKTYRAALADPLWRAAMEEEYNALLANRTWDLVPRPAGVNVVTGKWIFK 1223
Query: 941 TKFHSDGTVERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLD 1000
KFH+DG+++R KAR V +GFTQ PG D+ TFSPVVK TVR +LS V DW +HQLD
Sbjct: 1224 HKFHADGSLDRYKARWVLRGFTQRPGVDFDETFSPVVKPATVRTVLSLAVSRDWPVHQLD 1283
Query: 1001 VKNAFLHGHLTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHG 1060
VKNAFLHG L ETVY QPPGFVD P VC LNK+LYGLKQAPRAW+ R ++FL G
Sbjct: 1284 VKNAFLHGTLQETVYCTQPPGFVDSAKPDMVCCLNKSLYGLKQAPRAWYSRFTTFLQSIG 1343
Query: 1061 FSCSRADPSLFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGY 1120
F +++D SLF ++G+ T+YLL+YVDDI+LT S +LL I+ L EF++K LG L +
Sbjct: 1344 FVEAKSDTSLFILHRGNDTVYLLLYVDDIVLTASSRTLLHWTISALQGEFSMKDLGALHH 1403
Query: 1121 FLGLEITYTPDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHLVSS-GEAYFDPTHY 1179
FLG+ +T GL L Q +Y D+L RA M + +TP+ + L SS G DPT +
Sbjct: 1404 FLGVSVTRNSAGLVLSQRQYCIDILERAGMADCKPCNTPVDTTAKLSSSDGPPVADPTDF 1463
Query: 1180 RSLVGALQYLTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTS 1239
RSL GALQYLT TRPD+SYAV V + P H A+KRIL Y+ G+ GL +R+S
Sbjct: 1464 RSLAGALQYLTFTRPDISYAVQQVCLHMHDPREPHLAALKRILHYIRGSVDLGLHIQRSS 1523
Query: 1240 SPAVLGYSDADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTA 1299
+ + YSDADWA C DTRRST GYA+FLGDNL+SWS+K+Q TV+RSS E+EYRA+AN
Sbjct: 1524 ACDLAVYSDADWAGCPDTRRSTSGYAVFLGDNLVSWSSKRQHTVSRSSAEAEYRAVANAV 1583
Query: 1300 SELVWLLNLLHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCELVASG 1359
+E+ WL LL EL S L+ DN SA++++ NPV H+ KH+++D HFV E VA G
Sbjct: 1584 AEVTWLRQLLQELHSPPSRATLVYCDNVSAVYLSSNPVQHQRTKHVEIDLHFVRERVAVG 1643
Query: 1360 RLAVRHVPTSLQLADIFTKALPRPLFEIFRSKLRV 1394
+ V HVPT+ Q ADIFTK LP P+F FRS L V
Sbjct: 1644 AVRVLHVPTTSQYADIFTKGLPTPVFTEFRSSLNV 1678
>ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1373
Score = 682 bits (1761), Expect = 0.0
Identities = 384/873 (43%), Positives = 505/873 (56%), Gaps = 63/873 (7%)
Query: 580 SIKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAMLYH 639
+IK Q DNG EF N+ T F S GV R SCP+T QNGR E+ R + + ++L+
Sbjct: 509 TIKSVQCDNGREFDNSPARTFFLSHGVAFRMSCPYTSQQNGRAERSLRTLNNILRSLLFQ 568
Query: 640 SHVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFPCLRPY 699
+ +P YWV+A TA ++NR+P+K LS+ P+ L+ PTY + FGC +P +
Sbjct: 569 ACLPPVYWVEALHTATLLVNRIPTKTLSSSTPYFHLYSTQPTYDHLRVFGCACYPNMSST 628
Query: 700 MNNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSSP---- 755
+K +PRS+ C+F GYSS HKG++C + S H FDE FPFA +SP
Sbjct: 629 APHKLAPRSSLCVFLGYSSEHKGYRCLELGSNRIITSRHVVFDESFFPFADMSTSPMASS 688
Query: 756 --------HDLVV----STFYEPASGS-----------PPSPH------PVVLVAPESSL 786
++L + F + S PP+P P L PE+
Sbjct: 689 ALDIFLDDNELTAQPPRAKFVHAGTSSAARGAVEPSTPPPAPSSIGPRSPATLAGPEAGP 748
Query: 787 PIGS-LSCPSCDDSDVLPALPVPPADAPPSP-----APPSPTQTATPPVGTQAPPASPPT 840
GS + + PA P+ A + AP + T TP + A A+PP
Sbjct: 749 HGGSPAGAATSQPGAISPARTAAPSAATSTTRAVTSAPRAATSGTTPSLSPLAGTAAPPP 808
Query: 841 TPPVVAQ-------------VPLAPI--AARPRTRSQNGIFRPNPRYALVHAQPTGILTA 885
V A V +AP+ A RTR + G+ +P R L HA P +
Sbjct: 809 RAEVAASSTAATGRTLATRPVSIAPVDNAHSMRTRGKAGMAQPVDRLNL-HAAPLSPV-- 865
Query: 886 FHTVTKPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHS 945
P+ + A+ P W AAM+ E AL N TW+LVPRP N+V KW+FR K HS
Sbjct: 866 ------PRSVREALSDPNWRAAMQAEFDALLANDTWSLVPRPRGVNLVTGKWIFRHKLHS 919
Query: 946 DGTVERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAF 1005
DG+++R KAR V +GFTQ PG DY TFSPVVK TVR++LS + DW +HQLDVKNAF
Sbjct: 920 DGSLDRYKARWVLRGFTQRPGVDYDETFSPVVKPATVRVVLSLALSQDWPIHQLDVKNAF 979
Query: 1006 LHGHLTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSR 1065
LHG L+ETVY QP GF DP VCRLNK+LYGLKQAPRAW R +S L+ GF ++
Sbjct: 980 LHGTLSETVYCIQPTGFADPSHADLVCRLNKSLYGLKQAPRAWHHRFASHLISLGFIEAQ 1039
Query: 1066 ADPSLFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLE 1125
+D SLF +G+ T+ LL+YVDDI+LT S SLL Q IA L EFA+ +G L +FLG+
Sbjct: 1040 SDSSLFIHRRGNDTVLLLLYVDDIVLTASSASLLQQVIAALQREFAMTDMGPLHHFLGIT 1099
Query: 1126 ITYTPDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHLVSSGEAYFDPTHYRSLVGA 1185
+T GLFL Q +Y+ D+L RA M E STP+ S L + G D T YRSL GA
Sbjct: 1100 VTRFASGLFLSQRQYSQDILERAGMGECKPCSTPVDVHSKLSADGPPVADSTQYRSLAGA 1159
Query: 1186 LQYLTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLG 1245
LQYLT TRPD+++AV V ++ P H A+KRILRY+ GT GLT RR+ ++
Sbjct: 1160 LQYLTFTRPDIAFAVQQVCLYMHDPREPHLAALKRILRYIQGTLSLGLTMRRSPPTDLVV 1219
Query: 1246 YSDADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWL 1305
Y+DADWA C DTRRST GYA+FLGDNL+SWS+K+Q TV+RSS E+EYRA+AN +E WL
Sbjct: 1220 YTDADWAGCPDTRRSTSGYAVFLGDNLVSWSSKRQHTVSRSSAEAEYRAVANGVAEATWL 1279
Query: 1306 LNLLHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRH 1365
LL EL ++ DN SA++++ NPV H+ KH+++D HFV E VA G + V H
Sbjct: 1280 RQLLMELHRPPRTATVVYCDNVSAMYLSSNPVQHQRTKHVEIDLHFVREKVALGHVRVLH 1339
Query: 1366 VPTSLQLADIFTKALPRPLFEIFRSKLRVGLNP 1398
VPT+ Q AD+FTK LP LF+ FR+ L + P
Sbjct: 1340 VPTTSQYADVFTKGLPTSLFQEFRTSLTISDAP 1372
>emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana]
gi|4539447|emb|CAB40035.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444419|pir||T04204
hypothetical protein T4F9.150 - Arabidopsis thaliana
Length = 1515
Score = 678 bits (1749), Expect = 0.0
Identities = 367/879 (41%), Positives = 514/879 (57%), Gaps = 50/879 (5%)
Query: 570 QSVCGKSISRSIKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHV 629
Q + I +FQ D G EF + K AS G+ SCPHT QNG E++HR++
Sbjct: 565 QQLVENQYQHKIAMFQCDGGGEFVSYKFVAHLASCGIKQLISCPHTPQQNGIAERRHRYL 624
Query: 630 LELGLAMLYHSHVPTRYWVDAFSTAVYIINRVPSKVLS-NQIPFQLLFHVAPTYANFHPF 688
ELGL++++HS VP + WV+AF T+ ++ N +PS LS N+ P+++L P Y F
Sbjct: 625 TELGLSLMFHSKVPHKLWVEAFFTSNFLSNLLPSSTLSDNKSPYEMLHGTPPVYTALRVF 684
Query: 689 GCRVFPCLRPYMNNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPF 748
G +P LRPY NKF P+S C+F GY++ +KG++C +P Y+ H FDE FP+
Sbjct: 685 GSACYPYLRPYAKNKFDPKSLLCVFLGYNNKYKGYRCLHPPTGKVYICRHVLFDERKFPY 744
Query: 749 AGSHSSPHDLVVSTFYEP---------ASGSPPSPHPVVLVAPE----SSLPIGSL---- 791
+ +S + S + S PS + ++ P SS+P G
Sbjct: 745 SDIYSQFQTISGSPLFTAWQKGFSSTALSRETPSTNVEDIIFPSATVSSSVPTGCAPNIA 804
Query: 792 SCPSCDDSDVLPA--LPVPPADAPPSPAPPSPTQTATPP----------VGTQAPPASP- 838
+ D DV A + VPP+ + P P ++ + + + P S
Sbjct: 805 ETATAPDVDVAAAHDMVVPPSPITSTSLPTQPEESTSDQNHYSTDSETAISSAMTPQSIN 864
Query: 839 ---------PTTPPVVAQVPLAPIAARPR-TRSQNGIFRPNPRYALVHAQPTGILTAFHT 888
P V++ AP + P TR+++GI +PNP+YAL +
Sbjct: 865 VSLFEDSDFPPLQSVISSTTAAPETSHPMITRAKSGITKPNPKYALFSVKSN-------- 916
Query: 889 VTKPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGT 948
+PK K A+K W AM +E+ +H+ TW LVP ++G KWVF+TK +SDG+
Sbjct: 917 YPEPKSVKEALKDEGWTNAMGEEMGTMHETDTWDLVPPEMVDRLLGCKWVFKTKLNSDGS 976
Query: 949 VERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHG 1008
++RLKARLVA+G+ Q G DY T+SPVV++ TVR IL +N W L QLDVKNAFLH
Sbjct: 977 LDRLKARLVARGYEQEEGVDYVETYSPVVRSATVRSILHVATINKWSLKQLDVKNAFLHD 1036
Query: 1009 HLTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADP 1068
L ETV+M QPPGF DP P +VC+L KA+Y LKQAPRAWF + SS+LL++GF CS +DP
Sbjct: 1037 ELKETVFMTQPPGFEDPSRPDYVCKLKKAIYDLKQAPRAWFDKFSSYLLKYGFICSFSDP 1096
Query: 1069 SLFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITY 1128
SLF + KG ++LL+YVDD+ILTG++ LL Q + L+ EF +K +G L YFLG++ Y
Sbjct: 1097 SLFVYLKGRDVMFLLLYVDDMILTGNNDVLLQQLLNILSTEFRMKDMGALHYFLGIQAHY 1156
Query: 1129 TPDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHLVSSGEAYFDPTHYRSLVGALQY 1188
DGLFL Q KY DLL A M + S + TPL L + + + +PT++R L G LQY
Sbjct: 1157 HNDGLFLSQEKYTSDLLVNAGMSDCSSMPTPLQL-DLLQGNNKPFPEPTYFRRLAGKLQY 1215
Query: 1189 LTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSD 1248
LT+TRPD+ +AVN V Q + APTM F +KRIL Y+ GT G+ + + YSD
Sbjct: 1216 LTLTRPDIQFAVNFVCQKMHAPTMSDFHLLKRILHYLKGTMTMGINLSSNTDSVLRCYSD 1275
Query: 1249 ADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNL 1308
+DWA C DTRRST G+ FLG N++SWSAK+ PTV++SS E+EYR ++ ASE+ W+ L
Sbjct: 1276 SDWAGCKDTRRSTGGFCTFLGYNIISWSAKRHPTVSKSSTEAEYRTLSFAASEVSWIGFL 1335
Query: 1309 LHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPT 1368
L E+ + P + DN SA++++ NP H +KH +D ++V E VA G L V+H+P
Sbjct: 1336 LQEIGLPQQQIPEMYCDNLSAVYLSANPALHSRSKHFQVDYYYVRERVALGALTVKHIPA 1395
Query: 1369 SLQLADIFTKALPRPLFEIFRSKLRVGLNPTLSLKGVLE 1407
S QLADIFTK+LP+ F R KL V L P SL+G ++
Sbjct: 1396 SQQLADIFTKSLPQAPFCDLRFKLGVVLPPDTSLRGCIK 1434
>gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|37530764|ref|NP_919684.1| putative
copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|20042923|gb|AAM08751.1| Putative
copia-type polyprotein [Oryza sativa (japonica
cultivar-group)]
Length = 1803
Score = 670 bits (1728), Expect = 0.0
Identities = 361/821 (43%), Positives = 482/821 (57%), Gaps = 46/821 (5%)
Query: 581 IKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAMLYHS 640
+ Q+DNG E+ + + +L + G + R SCP++ QNG+ E+ R + + ML HS
Sbjct: 617 VLALQTDNGKEYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDCVRTMLVHS 676
Query: 641 HVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFPCLRPYM 700
P +W +A TA+++INR P + + P+QLL PTY + FGC +P
Sbjct: 677 AAPLSFWAEALQTAMHLINRRPCRATGSLKPYQLLLGAPPTYDHLRVFGCLCYPNTIATA 736
Query: 701 NNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSSPHDLVV 760
+K SPRS C+F GY + H+G++C++ + S H F E FPF + S
Sbjct: 737 PHKLSPRSLACVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVFPFRDAPS------- 789
Query: 761 STFYEPASGSPPSPHP-----VVLVAPESSLPIGSLSCPSCDDSDVLPALPVPPADAPPS 815
P +PP P V+L AP + V P P DA
Sbjct: 790 -----PRPSAPPPPDHGDDTIVLLPAPAQHV--------------VTPVGTAPAHDAASP 830
Query: 816 PAPPSPTQTATPPVGTQAPPASPPTTPPVVAQVPLAPIAARPRTRSQNGIFRPNPRYALV 875
P+P S T ++ P APP SP T+ P A P + TR++ GI +PNPRYA+
Sbjct: 831 PSPASSTPSSAAPAHDVAPPPSPETSSPASASPPRHAMT----TRARAGISKPNPRYAM- 885
Query: 876 HAQPTGILTAFHTVTK-PKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVG 934
TA T++ P + A++ P W AAM+ E AL N TWTLVPRP ++
Sbjct: 886 --------TATSTLSPTPSSVRVALRDPNWRAAMQAEFDALLANRTWTLVPRPPGARIIT 937
Query: 935 IKWVFRTKFHSDGTVERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDW 994
KWVF+TK H+DG++++ KAR V +GF Q PG D+ TFSPVVK T+R +L+ + W
Sbjct: 938 GKWVFKTKLHADGSLDKYKARWVVRGFNQRPGVDFGETFSPVVKPATIRTVLTLISSKQW 997
Query: 995 QLHQLDVKNAFLHGHLTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSS 1054
HQLDV NAFLHGHL E V +QP GF D P VC L+++LYGL+QAPRAWF+R +
Sbjct: 998 PAHQLDVSNAFLHGHLQERVLCQQPTGFEDAARPADVCLLSRSLYGLRQAPRAWFKRFAD 1057
Query: 1055 FLLRHGFSCSRADPSLFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKY 1114
GF SRADPSLF +G T YLL+YVDD+IL+ S SLL + I RL AEF +K
Sbjct: 1058 HATSLGFVQSRADPSLFVLRRGSDTAYLLLYVDDMILSASSSSLLQRIIDRLQAEFKVKD 1117
Query: 1115 LGKLGYFLGLEITYTPDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHLVS-SGEAY 1173
+G L YFLG+E+ T DG L Q+KYA D+L RA M V+TP A L S G +
Sbjct: 1118 MGPLKYFLGIEVQRTADGFVLSQSKYATDVLERAGMANCKAVATPADAKPKLSSDEGPLF 1177
Query: 1174 FDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGL 1233
D + YRS+ GALQYLT+TRPD++YAV V + AP H +KRILRY+ GT FGL
Sbjct: 1178 QDSSWYRSIAGALQYLTLTRPDIAYAVQQVCLHMHAPREAHVTLLKRILRYIKGTAAFGL 1237
Query: 1234 TFRRTSSPAVLGYSDADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYR 1293
R ++SP + +SDADWA C DTRRST G+ IFLGD+L+SWS+K+Q TV+RSS E+EYR
Sbjct: 1238 HLRASTSPTLTAFSDADWAGCPDTRRSTSGFCIFLGDSLISWSSKRQTTVSRSSAEAEYR 1297
Query: 1294 AMANTASELVWLLNLLHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVC 1353
+AN +E WL LL EL R+ + DN S+++M++NPV HK KHI+LD HFV
Sbjct: 1298 GVANAVAECTWLRQLLGELHCRVPQATIAYCDNISSVYMSKNPVHHKRTKHIELDIHFVR 1357
Query: 1354 ELVASGRLAVRHVPTSLQLADIFTKALPRPLFEIFRSKLRV 1394
E VA G L V +P++ Q AD+FTK LP +F FR+ L V
Sbjct: 1358 EKVALGELRVLPIPSAHQFADVFTKGLPSSMFNEFRASLCV 1398
>ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1090
Score = 669 bits (1727), Expect = 0.0
Identities = 372/838 (44%), Positives = 487/838 (57%), Gaps = 28/838 (3%)
Query: 581 IKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAMLYHS 640
+K FQ+DNG EF N + T AS G R SCP+T QNG+ E+ R + +L +
Sbjct: 242 LKSFQADNGREFVNTAITTFLASRGTQLRLSCPYTSPQNGKAERMLRTINNSIRTLLIQA 301
Query: 641 HVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFPCLRPYM 700
+P YW +A +TA Y++NR PS + +PFQLL P +++ FGC +P L
Sbjct: 302 SMPPSYWAEALATATYLLNRRPSSSIHQSLPFQLLHRTIPDFSHLRVFGCLCYPNLSATT 361
Query: 701 NNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSH--SSPHDL 758
+K SPRST C+F GY + HKG++C + + +S H FDE FPFA + +S D
Sbjct: 362 PHKLSPRSTACVFLGYPTSHKGYRCLDLSTHRIIISRHVVFDESQFPFAATPPAASSFDF 421
Query: 759 VVSTFY--EPASGSPPSPHPVV-----------LVAPESSLPIGSLSCPSCDDSDVLPAL 805
++ + S P P+ L P L G+++ S S P +
Sbjct: 422 LLQGLSPADAPSLEVEQPRPLTVAPSTEVEQPYLPLPSRRLSAGTVTVASEAPSAGAPLV 481
Query: 806 PVPPADAPPSPAPPSPTQTATPP---VGTQAPPAS-PPTTPPVVAQVPLAPIAARPRTRS 861
ADA P P + T P V T+ P + PP++ V AP TRS
Sbjct: 482 GTSSADATP-PGSATRASTIVSPFRHVYTRRPVTTVPPSSSTAVTNAVAAPQPHSMVTRS 540
Query: 862 QNGIFRPNPRYALVHAQPTGILTAFHTVTKPKGFKSAMKHP*WLAAMEDELSALHKNCTW 921
Q+G RP R Q P + SA+ P W AAM DE L N TW
Sbjct: 541 QSGSLRPVDRLTYTATQAAASPV-------PANYHSALADPNWRAAMADEYKELVDNGTW 593
Query: 922 TLVPRPSTTNVVGIKWVFRTKFHSDGTVERLKARLVAQGFTQIPGFDYSLTFSPVVKATT 981
LV RP N+ KW+F+ KFHSDG++ R KAR V +G++Q G DY TFSPVVK T
Sbjct: 594 RLVSRPPRANIATGKWIFKHKFHSDGSLARYKARWVVRGYSQQHGIDYDETFSPVVKLAT 653
Query: 982 VRLILSHVVLNDWQLHQLDVKNAFLHGHLTETVYMEQPPGFVDPRFPTHVCRLNKALYGL 1041
+R++LS W +HQLDVKNAFLHGHL ETVY +QP GFVDP P VC L K+LYGL
Sbjct: 654 IRVVLSIAASRAWPIHQLDVKNAFLHGHLKETVYCQQPSGFVDPTAPDAVCLLQKSLYGL 713
Query: 1042 KQAPRAWFQRLSSFLLRHGFSCSRADPSLFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQ 1101
KQAPRAW+QR ++++ + GF S +D SLF + G YLL+YVDDIILT S +LL Q
Sbjct: 714 KQAPRAWYQRFATYIRQMGFMPSASDTSLFVYKDGDRIAYLLLYVDDIILTASTTTLLQQ 773
Query: 1102 FIARLNAEFAIKYLGKLGYFLGLEITYTPDGLFLGQAKYAHDLLSRAMMLEASHVSTPLA 1161
ARL++EFA+ LG L +FLG+ + +PDGLFL Q +YA DLL RA M E STP+
Sbjct: 774 LTARLHSEFAMTDLGDLHFFLGISVKRSPDGLFLSQRQYAVDLLQRAGMAECHSTSTPVD 833
Query: 1162 AGSHL-VSSGEAYFDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQAPTMEHFQAVKR 1220
+ L + G DP+ YRS+ GALQYLT+TRPDL+YAV V F+ P H VKR
Sbjct: 834 THAKLSATDGLPVADPSAYRSIAGALQYLTLTRPDLAYAVQQVCLFMHDPREPHLALVKR 893
Query: 1221 ILRYVCGTQHFGLTFRRTSSPAVLGYSDADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQ 1280
ILRYV G+ GL ++ YSDADWA C ++RRST GY ++LGDNL+SWS+K+Q
Sbjct: 894 ILRYVKGSLSIGLHIGSGPIQSLTAYSDADWAGCPNSRRSTSGYCVYLGDNLVSWSSKRQ 953
Query: 1281 PTVARSSCESEYRAMANTASELVWLLNLLHELRVRLSATPLLLSDNQSALFMAQNPVAHK 1340
TV+RSS E+EYRA+A+ +E WL LL EL V +++ ++ DN SA++M NPV H+
Sbjct: 954 TTVSRSSAEAEYRAVAHAVAECCWLRQLLQELHVPIASATIVYCDNVSAVYMTANPVHHR 1013
Query: 1341 HAKHIDLDCHFVCELVASGRLAVRHVPTSLQLADIFTKALPRPLFEIFRSKLRVGLNP 1398
KHI++D HFV E VA G++ V +VP+S Q ADI TK LP LF FRS L V P
Sbjct: 1014 RTKHIEIDIHFVREKVALGQVRVLYVPSSHQFADIMTKGLPVQLFTDFRSSLCVRAPP 1071
>dbj|BAB10876.1| polyprotein [Arabidopsis thaliana]
Length = 1429
Score = 659 bits (1699), Expect = 0.0
Identities = 373/864 (43%), Positives = 492/864 (56%), Gaps = 59/864 (6%)
Query: 581 IKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAMLYHS 640
I+ SDNG EF + A+ G+ H S PHT NG E+KHRH++E GLA+L H+
Sbjct: 569 IRTLYSDNGGEFIG--LRPFLAAHGISHLTSPPHTPEHNGLAERKHRHIVETGLALLTHA 626
Query: 641 HVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFPCLRPYM 700
+P +W AF+TAVY+INR+P++VL P+ LF ++P Y FGC +P LRPY
Sbjct: 627 SLPKTFWTYAFATAVYLINRMPTEVLQGTSPYVKLFQMSPNYLKLRVFGCLCYPWLRPYN 686
Query: 701 NNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSSPHDL-- 758
NK RST C+F GYS + C + A + Y S H QF E FPFA +S D
Sbjct: 687 TNKLEARSTMCVFLGYSLTQSAYLCLDIATNRIYTSRHVQFVESSFPFASPRTSETDSTQ 746
Query: 759 ---------------------------VVSTFYEP-----ASGSPPSPHPVVLVAPESSL 786
+ F+ P + SPPS H + A SS
Sbjct: 747 TMSQPTTTNVIPLLQRPPHIAPPTALPLCPIFHSPPHSPSSPASPPSEHVPLTAASSSSN 806
Query: 787 PIGSLSCPSCDDSDVL-PALPVP---PADAPPSPAPPSPTQTATPPVGTQAPPASP---- 838
I + S V P P P + SP SP T T PP SP
Sbjct: 807 AINDDNISSTGQVSVSGPTSQSPHTTPTNQNTSPLSKSPNPTNTNQSQNSTPPTSPTTSV 866
Query: 839 ----PTTPPVVAQVPLAPIAARP---RTRSQNGIFRPNPRYALVHAQPTGILTAFHTVTK 891
PT P+ PL P RTR++N I +P ++ L T + ++ T+
Sbjct: 867 HQHSPTPSPLPQNPPLPPPPQNDHPMRTRAKNQITKPKTKFNLT----TSLTSSKPTI-- 920
Query: 892 PKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGTVER 951
P A+K P W AM +E++A KN TW LV +V+ KW+F K++ DG++ R
Sbjct: 921 PTTVAQALKDPNWRNAMSEEINAQMKNHTWDLVSPEEAKHVISCKWIFTLKYNVDGSIAR 980
Query: 952 LKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHGHLT 1011
KARLVA+GF Q G DYS TFSPV+K+TT+R +L V +W +HQ+D+ NAFL G L
Sbjct: 981 YKARLVARGFNQQYGIDYSETFSPVIKSTTIRTVLEVAVKRNWSIHQVDINNAFLQGTLN 1040
Query: 1012 ETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADPSLF 1071
E VY+ QPPGF+D P+HVCRLNKALYGLKQAPRAW+Q L FLL+ GF S AD SLF
Sbjct: 1041 EEVYVSQPPGFIDRDRPSHVCRLNKALYGLKQAPRAWYQELRRFLLQAGFVNSLADASLF 1100
Query: 1072 FFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITYTPD 1131
+ + +T +Y+LVYVDDII+ G + +L+ F A L + F++K LG L YFLG+E T T
Sbjct: 1101 IYNRHNTFMYVLVYVDDIIIAGEN-ALVQAFNASLASRFSLKDLGPLSYFLGIEATRTSR 1159
Query: 1132 GLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHL-VSSGEAYFDPTHYRSLVGALQYLT 1190
GL L Q KY DLL + ML+ VSTP++ L + SG A D T YR+++G+LQYL
Sbjct: 1160 GLHLMQRKYITDLLKKHNMLDTKPVSTPMSPTPKLSLLSGTALDDATEYRTVLGSLQYLA 1219
Query: 1191 ITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSDAD 1250
TRPD+++AVN +SQF+ PT EH+QA KRILRY+ GT+ G+ R + + +SDAD
Sbjct: 1220 FTRPDIAFAVNRLSQFMHRPTNEHWQAAKRILRYLAGTKSHGIFLRSDTPLTIHAFSDAD 1279
Query: 1251 WARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNLLH 1310
W D ST Y ++ G + +SWS+KKQ +VARSS E+EYRA+ANTASEL WL +LL
Sbjct: 1280 WGCDLDAYLSTNAYIVYFGGSPVSWSSKKQRSVARSSTEAEYRAVANTASELRWLCSLLL 1339
Query: 1311 ELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPTSL 1370
E+ + + P++ DN A ++ NPV H KH+ LD HFV + SG L V HV T
Sbjct: 1340 EMGISQTTVPVIYCDNIGATYLCANPVFHSRMKHVALDYHFVRGYIQSGALRVSHVSTKD 1399
Query: 1371 QLADIFTKALPRPLFEIFRSKLRV 1394
QLAD TK LPRP F SK+ V
Sbjct: 1400 QLADALTKPLPRPRFTELNSKIGV 1423
>gb|AAC02664.1| polyprotein [Arabidopsis thaliana]
Length = 1451
Score = 652 bits (1682), Expect = 0.0
Identities = 377/870 (43%), Positives = 495/870 (56%), Gaps = 60/870 (6%)
Query: 581 IKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAMLYHS 640
I SDNG EF + + AS G+ H + PHT NG E+KHRH++E GL +L +
Sbjct: 588 IGTLYSDNGGEFI--AMRSFLASHGISHMTTPPHTPELNGISERKHRHIVETGLTLLSTA 645
Query: 641 HVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFPCLRPYM 700
+P YW AF+TAVY+INR+ + VL N+ P+ LF P Y FGC FP LRPY
Sbjct: 646 SMPKEYWSYAFATAVYLINRMLTPVLGNESPYVKLFGQPPNYLKLRIFGCLCFPWLRPYT 705
Query: 701 NNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSS---PHD 757
+K RS PC+ GYS + C + A Y S H QF E FPF+ + S P D
Sbjct: 706 AHKLDNRSVPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFAESSFPFSTTSPSVTPPSD 765
Query: 758 ---------LVVSTFYEPASGSPPS------PH----------PVVLVAPESSL-PIGSL 791
+ V P + +PPS PH P + P SL P +
Sbjct: 766 PPLSQDTRPVSVPLLARPLTTAPPSSPSCSAPHRSPSQSENLSPPAPLQPSLSLSPTSPI 825
Query: 792 SCPSCDDSDVL-------------PALPVPPADAPPSPAPPSPTQTATPPVGTQAPPASP 838
+ PS + ++ P P P P SP SP +++P + P SP
Sbjct: 826 TSPSLSEESLVGHNSETGPTGSSPPLSPQPQRPQPQSPQSTSP-HSSSPQPNSPNPQHSP 884
Query: 839 PTTPPVVAQVPL---------APIAARPRTRSQNGIFRPNPRYALVHAQPTGILTAFHTV 889
+ P + P PI RTRS+N I +PNP++A + +PT +
Sbjct: 885 RSLTPTLTSSPSPSPPPNPNPPPIQHTMRTRSKNNIVKPNPKFANLATKPTPLKPII--- 941
Query: 890 TKPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGTV 949
PK A+ P W AM DE++A +N T+ LVP NVVG KWVF K+ S+G +
Sbjct: 942 --PKTVVEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVVGCKWVFTLKYLSNGVL 999
Query: 950 ERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHGH 1009
+R KARLVA+GF Q G D+ TFSPV+K+TTVR +L V W + Q+DV NAFL G
Sbjct: 1000 DRYKARLVAKGFHQQYGHDFKETFSPVIKSTTVRSVLHIAVSKGWSIRQIDVNNAFLQGT 1059
Query: 1010 LTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADPS 1069
L++ VY+ QPPGFVD HVCRL KALYGLKQAPRAW+Q L S+LL GF S AD S
Sbjct: 1060 LSDEVYVTQPPGFVDKDNAHHVCRLYKALYGLKQAPRAWYQELRSYLLTQGFVNSVADTS 1119
Query: 1070 LFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITYT 1129
LF T LY+LVYVDD+++TGSD +++T+FIA L A F++K LG++ YFLG+E T T
Sbjct: 1120 LFTLRHERTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLGIEATRT 1179
Query: 1130 PDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHL-VSSGEAYFDPTHYRSLVGALQY 1188
GL L Q +Y DLL + ML A V TP++ L ++SG+ P+ YR+++G+LQY
Sbjct: 1180 SKGLHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAVLGSLQY 1239
Query: 1189 LTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSD 1248
L TRPD++YAVN +SQ++ PT H+QA KRILRY+ GT G+ R + + YSD
Sbjct: 1240 LLFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLTLHAYSD 1299
Query: 1249 ADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNL 1308
ADWA D ST Y ++LG N +SWS+KKQ VARSS E+EYRA+AN SE+ W+ +L
Sbjct: 1300 ADWAGDIDNYNSTNAYILYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEIRWVCSL 1359
Query: 1309 LHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPT 1368
L EL + LS+ P++ DN A +++ NPV H KHI LD HFV E V +G L V HV T
Sbjct: 1360 LTELGITLSSPPVVYCDNVGATYLSANPVFHSRMKHIALDFHFVRESVQAGALRVTHVST 1419
Query: 1369 SLQLADIFTKALPRPLFEIFRSKLRVGLNP 1398
QLAD TK LPR F SK+ V P
Sbjct: 1420 KDQLADALTKPLPRQPFTTLISKIGVAKAP 1449
>gb|AAC02666.1| polyprotein [Arabidopsis thaliana]
Length = 1451
Score = 649 bits (1673), Expect = 0.0
Identities = 376/870 (43%), Positives = 494/870 (56%), Gaps = 60/870 (6%)
Query: 581 IKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAMLYHS 640
I SDNG EF + + AS G+ H + PHT NG E+KHRH++E GL +L +
Sbjct: 588 IGTLYSDNGGEFI--AMRSFLASHGISHMTTPPHTPELNGISERKHRHIVETGLTLLSTA 645
Query: 641 HVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFPCLRPYM 700
+P YW AF+TAVY+INR+ + VL N+ P+ LF P Y FGC FP LRPY
Sbjct: 646 SMPKEYWSYAFATAVYLINRMLTPVLGNESPYVKLFGQPPNYLKLRIFGCLCFPWLRPYT 705
Query: 701 NNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSS---PHD 757
+K RS PC+ GYS + C + A Y S H QF E FPF+ + S P D
Sbjct: 706 AHKLDNRSVPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFAESSFPFSTTSPSVTPPSD 765
Query: 758 ---------LVVSTFYEPASGSPPS------PH----------PVVLVAPESSL-PIGSL 791
+ V P + +PPS PH P + P SL P +
Sbjct: 766 PPLSQDTRPVSVPLLARPLTTAPPSSPSCSAPHRSPSQSENLSPPAPLQPSLSLSPTSPI 825
Query: 792 SCPSCDDSDVL-------------PALPVPPADAPPSPAPPSPTQTATPPVGTQAPPASP 838
+ PS + ++ P P P P SP SP +++P + P SP
Sbjct: 826 TSPSLSEESLVGHNSETGPTGSSPPLSPQPQRPQPQSPQSTSP-HSSSPQPNSPNPQHSP 884
Query: 839 PTTPPVVAQVPL---------APIAARPRTRSQNGIFRPNPRYALVHAQPTGILTAFHTV 889
+ P + P PI RTRS+N I +PNP++A + +PT +
Sbjct: 885 RSLTPTLTSSPSPSPPPNPNPPPIQHTMRTRSKNNIVKPNPKFANLATKPTPLKPII--- 941
Query: 890 TKPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGTV 949
PK A+ P W AM DE++A +N T+ LVP NVVG KWVF K+ S+G +
Sbjct: 942 --PKTVVEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVVGCKWVFTLKYLSNGVL 999
Query: 950 ERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHGH 1009
+R KARLVA+GF Q G D+ TFSPV+K+TTVR +L V W + Q+DV NAFL G
Sbjct: 1000 DRYKARLVAKGFHQQYGHDFKETFSPVIKSTTVRSVLHIAVSKGWSIRQIDVNNAFLQGT 1059
Query: 1010 LTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADPS 1069
L++ VY+ QPPGFVD HVCRL KALYGLKQAPRAW+Q L S+LL GF S AD S
Sbjct: 1060 LSDEVYVTQPPGFVDKDNAHHVCRLYKALYGLKQAPRAWYQELRSYLLTQGFVNSVADTS 1119
Query: 1070 LFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITYT 1129
LF T LY+LVYVDD+++TGSD +++T+FIA L A F++K LG++ YFLG+E T T
Sbjct: 1120 LFTLRHERTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLGIEATRT 1179
Query: 1130 PDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHL-VSSGEAYFDPTHYRSLVGALQY 1188
GL L Q +Y DLL + ML A V TP++ L ++SG+ P+ YR+++G+LQY
Sbjct: 1180 SKGLHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAVLGSLQY 1239
Query: 1189 LTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSD 1248
L TRPD++YAVN +SQ++ PT H+QA KRILRY+ GT G+ R + + YSD
Sbjct: 1240 LLFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLTLHAYSD 1299
Query: 1249 ADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNL 1308
ADWA D ST Y ++LG N +SWS+KKQ VARSS E+EYRA+AN SE+ W+ +L
Sbjct: 1300 ADWAGDIDNYNSTNAYILYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEIRWVCSL 1359
Query: 1309 LHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPT 1368
L EL + LS+ P++ DN A +++ NPV KHI LD HFV E V +G L V HV T
Sbjct: 1360 LTELGITLSSPPVVYCDNVGATYLSANPVFDSRMKHIALDFHFVRESVQAGALRVTHVST 1419
Query: 1369 SLQLADIFTKALPRPLFEIFRSKLRVGLNP 1398
QLAD TK LPR F SK+ V P
Sbjct: 1420 KDQLADALTKPLPRQPFTTLISKIGVAKAP 1449
>gb|AAC02669.1| polyprotein [Arabidopsis thaliana]
Length = 1451
Score = 648 bits (1671), Expect = 0.0
Identities = 376/870 (43%), Positives = 493/870 (56%), Gaps = 60/870 (6%)
Query: 581 IKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAMLYHS 640
I SDNG EF + + AS G+ H + PHT NG E+KHRH++E GL +L +
Sbjct: 588 IGTLYSDNGGEFI--AMRSFLASHGISHMTTPPHTPELNGISERKHRHIVETGLTLLSTA 645
Query: 641 HVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFPCLRPYM 700
+P YW AF+TAVY+INR+ + VL N+ P+ LF P Y FGC FP LRPY
Sbjct: 646 SMPKEYWSYAFATAVYLINRMLTPVLGNESPYVKLFGQPPNYLKLRIFGCLCFPWLRPYT 705
Query: 701 NNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSS---PHD 757
+K RS PC+ GYS + C + A Y S H QF E FPF+ + S P D
Sbjct: 706 AHKLDNRSVPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFAESSFPFSTTSPSVTPPSD 765
Query: 758 ---------LVVSTFYEPASGSPPS------PH----------PVVLVAPESSL-PIGSL 791
+ V P + +PPS PH P + P SL P +
Sbjct: 766 PPLSQDTRPVSVPLLARPLTTAPPSSPSCSAPHRSPSQSENLSPPAPLQPSLSLSPTSPI 825
Query: 792 SCPSCDDSDVL-------------PALPVPPADAPPSPAPPSPTQTATPPVGTQAPPASP 838
+ PS + ++ P P P P SP SP +++P + P SP
Sbjct: 826 TSPSLSEESLVGHNSETGPTGSSPPLSPQPQRPQPQSPQSTSP-HSSSPQPNSPNPQHSP 884
Query: 839 PTTPPVVAQVPL---------APIAARPRTRSQNGIFRPNPRYALVHAQPTGILTAFHTV 889
+ P + P PI RTRS+N I +PNP++A + +PT +
Sbjct: 885 RSLTPTLTSSPSPSPPPNPNPPPIQHTMRTRSKNNIVKPNPKFANLATKPTPLKPII--- 941
Query: 890 TKPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGTV 949
PK A+ P W AM DE++A +N T+ LVP NVVG KWVF K+ S+G +
Sbjct: 942 --PKTVVEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVVGCKWVFTLKYLSNGVL 999
Query: 950 ERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHGH 1009
+R KARLVA+GF Q G D+ TFSPV+K TTVR +L V W + Q+DV NAFL G
Sbjct: 1000 DRYKARLVAKGFHQQYGHDFKETFSPVIKLTTVRSVLHIAVSKGWSIRQIDVNNAFLQGT 1059
Query: 1010 LTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADPS 1069
L++ VY+ QPPGFVD HVCRL KALYGLKQAPRAW+Q L S+LL GF S AD S
Sbjct: 1060 LSDEVYVTQPPGFVDKDNAHHVCRLYKALYGLKQAPRAWYQELRSYLLTQGFVNSVADTS 1119
Query: 1070 LFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITYT 1129
LF T LY+LVYVDD+++TGSD +++T+FIA L A F++K LG++ YFLG+E T T
Sbjct: 1120 LFTLRHERTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLGIEATRT 1179
Query: 1130 PDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHL-VSSGEAYFDPTHYRSLVGALQY 1188
GL L Q +Y DLL + ML A V TP++ L ++SG+ P+ YR+++G+LQY
Sbjct: 1180 SKGLHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAVLGSLQY 1239
Query: 1189 LTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSD 1248
L TRPD++YAVN +SQ++ PT H+QA KRILRY+ GT G+ R + + YSD
Sbjct: 1240 LLFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLTLHAYSD 1299
Query: 1249 ADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNL 1308
ADWA D ST Y ++LG N +SWS+KKQ VARSS E+EYRA+AN SE+ W+ +L
Sbjct: 1300 ADWAGDIDNYNSTNAYILYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEIRWVCSL 1359
Query: 1309 LHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPT 1368
L EL + LS+ P++ DN A +++ NPV KHI LD HFV E V +G L V HV T
Sbjct: 1360 LTELGITLSSPPVVYCDNVGATYLSANPVFDSRMKHIALDFHFVRESVQAGALRVTHVST 1419
Query: 1369 SLQLADIFTKALPRPLFEIFRSKLRVGLNP 1398
QLAD TK LPR F SK+ V P
Sbjct: 1420 KDQLADALTKPLPRQPFTTLISKIGVAKAP 1449
>gb|AAF99727.1| F17L21.7 [Arabidopsis thaliana]
Length = 1534
Score = 640 bits (1650), Expect = 0.0
Identities = 367/869 (42%), Positives = 489/869 (56%), Gaps = 66/869 (7%)
Query: 581 IKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAMLYHS 640
I+ SDNG EF + + G+ H S PHT NG E+KHRH+LE GL +L +
Sbjct: 679 IRTLYSDNGGEFI--ALRQFLLTHGISHLTSLPHTPEHNGIAERKHRHILETGLTLLTQA 736
Query: 641 HVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFPCLRPYM 700
+PT YW AF TAVY+INR+PS VL+N+ P+ LF +P Y FGC FP LRPY
Sbjct: 737 SIPTSYWTYAFGTAVYLINRLPSSVLNNESPYSKLFKTSPNYLKLRVFGCSCFPWLRPYT 796
Query: 701 NNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPF------AGSHSS 754
N+K RS PC+F GYS + C + + Y S H QF E FPF + S+SS
Sbjct: 797 NHKLERRSQPCVFLGYSLTQSAYLCLDRSSGRVYTSRHVQFVEDQFPFSISDTHSVSNSS 856
Query: 755 PHDLVVSTFYEPASGSPPSPHPVVLVAPESSLPIGS----------LSCPSCDDSDVL-- 802
P + S P+ S P ++ AP S P+ S S S ++DV+
Sbjct: 857 PEEASPSCHQPPSRIPIQSSSPPLVQAPSSLPPLSSDSHRRPNAETSSSSSSTNNDVVVS 916
Query: 803 ----------------------------PALPVPPADAP---PSPAPPSPTQTATPPVGT 831
P+ + + P PSP P + + ++P T
Sbjct: 917 KDNTQVDNRNNFIGPTSSSSAQSQNNSNPSSSIQTQNEPNPSPSPTPQNSSPESSPSSST 976
Query: 832 QAPPASP-PTTPPVVAQVPLAPIAARPRTRSQNGIFRPNPRYALVHAQPTGILTAFHTVT 890
A P P PP P+ RTR++N I +P + +L+ T
Sbjct: 977 SATSTVPNPPPPPPTNNHPM-------RTRAKNHITKPKTKLSLL------AKTVQTRPQ 1023
Query: 891 KPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGTVE 950
P A++ W AM +E++A +N T+ LVP NV+ KW+F K+ +GT++
Sbjct: 1024 IPNTVNQALRDEKWRNAMGEEINAQIRNNTFELVPPKPNQNVISTKWIFTLKYLPNGTLD 1083
Query: 951 RLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHGHL 1010
R KARLVA+GF Q G YS TFSPVVK+ T+RL+L V W + QLDV NAFL G L
Sbjct: 1084 RYKARLVARGFRQQYGLHYSETFSPVVKSLTIRLVLQLAVSRSWTIKQLDVNNAFLQGTL 1143
Query: 1011 TETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADPSL 1070
T+ VY+ QPPGF+DP P HVCRL KALYGLKQAPRAW+Q L +F+ GF+ S AD S+
Sbjct: 1144 TDEVYVTQPPGFIDPDRPHHVCRLKKALYGLKQAPRAWYQELRNFVCSLGFTNSLADTSV 1203
Query: 1071 FFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITYTP 1130
F + +Y LVYVDDII+TGS +L+ FI L+ F++K L YFLG+E T T
Sbjct: 1204 FVYINDIQIVYCLVYVDDIIVTGSSDALVMAFITALSRRFSLKDPTDLVYFLGIEATRTS 1263
Query: 1131 DGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHL-VSSGEAYFDPTHYRSLVGALQYL 1189
GL L Q KY +DLLSR ML+A VSTP+A L + SG A +P YR+++G+LQYL
Sbjct: 1264 QGLHLMQHKYVYDLLSRMKMLDAKPVSTPMATHPKLSLYSGIALDEPGEYRTVIGSLQYL 1323
Query: 1190 TITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSDA 1249
TRPD++YAVN +SQF+ PT H+QA KR+LRY+ GT G+ R S ++ +SDA
Sbjct: 1324 AFTRPDIAYAVNRLSQFMHRPTDIHWQAAKRVLRYLAGTATHGILLRSNSPLSLHAFSDA 1383
Query: 1250 DWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNLL 1309
DWA D ST Y ++LG ++WS+KKQ VARSS E+EYRA+ANT SE+ W+ +LL
Sbjct: 1384 DWAGDNDDFVSTNAYIVYLGSTPIAWSSKKQKGVARSSTEAEYRAVANTTSEIRWVCSLL 1443
Query: 1310 HELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPTS 1369
EL + L P++ DN A +++ NPV H KH+ LD HF+ + V++G L V H+ T
Sbjct: 1444 TELGITLPKMPVIYCDNVGATYLSANPVFHSRMKHLALDYHFIRDNVSAGALRVSHISTH 1503
Query: 1370 LQLADIFTKALPRPLFEIFRSKLRVGLNP 1398
QLAD TK LPR F F SK+ V P
Sbjct: 1504 DQLADALTKPLPRQHFLQFSSKIGVSKLP 1532
>gb|AAC02672.1| polyprotein [Arabidopsis arenosa] gi|7522104|pir||T31353 polyprotein
- Arabidopsis arenosa Evelknievel retrotransposon
(fragment)
Length = 1390
Score = 601 bits (1550), Expect = e-170
Identities = 350/816 (42%), Positives = 464/816 (55%), Gaps = 66/816 (8%)
Query: 581 IKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAMLYHS 640
I SDNG EF + + AS G+ H + PHT NG E+KHRH++E GL +L +
Sbjct: 588 IGTLYSDNGGEFI--ALRSFLASHGISHMTTPPHTPELNGISERKHRHIVETGLTLLSTA 645
Query: 641 HVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFPCLRPYM 700
+ YW AF+TAVY+INR+ + VL N+ P+ LF P Y FGC FP LRPY
Sbjct: 646 SMSKEYWSYAFTTAVYLINRMLTPVLGNESPYMKLFGQPPNYLKLRVFGCLCFPWLRPYT 705
Query: 701 NNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSS---PHD 757
+K RS PC+ GYS + C + A Y S H QF E FPF+ + S P D
Sbjct: 706 AHKLDNRSMPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFAESIFPFSTTSPSVTPPSD 765
Query: 758 ---------LVVSTFYEPASGSPPS------PH-----PVVL-------VAPESSLPIGS 790
+ V P + +PPS PH P +L +P SS P
Sbjct: 766 PPLSQDTRPISVPILARPLTTAPPSSPSCSAPHRSPSQPGILSPSAPFQPSPPSS-PTSP 824
Query: 791 LSCPSCDD-------SDVLPALPVPPADAPP-----------SPAPPSPTQTATP----P 828
++ PS + + P PP P SP P SP +P P
Sbjct: 825 ITSPSLSEESHVGHNQETGPTGSSPPVSPQPQSEQSTSPRSTSPQPNSPHTQHSPRSITP 884
Query: 829 VGTQAPPASPPTTPPVVAQVPLAPIAARPRTRSQNGIFRPNPRYALVHAQPTGILTAFHT 888
T +P SPP P P PI RTRS+N I +PNP++A + +PT +
Sbjct: 885 ALTPSPSPSPPPNPN-----PPPPIQHTMRTRSKNNIVKPNPKFANLATKPTPLKPII-- 937
Query: 889 VTKPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGT 948
PK A+ P W AM DE++A +N T+ LVP NV+G KWVF K+ +G
Sbjct: 938 ---PKTVAEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVIGCKWVFTLKYLPNGV 994
Query: 949 VERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHG 1008
++R KARLVA+GF Q G D+ TFSPV+K+TTVR +L V W + Q+DV NAFL G
Sbjct: 995 LDRYKARLVAKGFHQQYGHDFKETFSPVIKSTTVRSVLHVAVSKGWSIRQIDVNNAFLQG 1054
Query: 1009 HLTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADP 1068
L++ VY+ QPPGFVD P HVCRL+KALYGLKQAPRAW+Q L S+LL GF S AD
Sbjct: 1055 TLSDEVYVMQPPGFVDKDNPHHVCRLHKALYGLKQAPRAWYQELRSYLLTQGFVNSIADT 1114
Query: 1069 SLFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITY 1128
SLF T LY+LVYVDD+++TGSD +++T+FIA L A F++K LG++ YFLG+E T
Sbjct: 1115 SLFTLRHKRTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLGIEATR 1174
Query: 1129 TPDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHL-VSSGEAYFDPTHYRSLVGALQ 1187
T GL L Q +Y DLL + ML A V TP++ L ++SG P+ YR+++G+LQ
Sbjct: 1175 TSKGLHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGTPLDKPSEYRAVLGSLQ 1234
Query: 1188 YLTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYS 1247
YL+ TRPD++YAVN +SQ++ PT H+QA KRILRY+ GT G+ R + + YS
Sbjct: 1235 YLSFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLKLHAYS 1294
Query: 1248 DADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLN 1307
DADWA TD ST Y ++LG +SWS+KKQ VARSS E+EYRA+AN SE+ W+ +
Sbjct: 1295 DADWAGDTDNYNSTNAYILYLGSTPISWSSKKQNGVARSSTEAEYRAVANATSEIRWVCS 1354
Query: 1308 LLHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAK 1343
LL EL + LS+ P++ DN A +++ NPV K
Sbjct: 1355 LLTELGITLSSPPVVYCDNVGATYLSANPVFDSRMK 1390
>gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301698|pir||C84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1501
Score = 573 bits (1476), Expect = e-161
Identities = 338/866 (39%), Positives = 474/866 (54%), Gaps = 58/866 (6%)
Query: 575 KSISRSIKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGL 634
K +++K+ +SDNGTEF + + F +G++H+ SC T QNGRVE+KHRH+L +
Sbjct: 641 KQFGKTVKMVRSDNGTEFMC--LSSYFRENGIIHQTSCVGTPQQNGRVERKHRHILNVAR 698
Query: 635 AMLYHSHVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFP 694
A+L+ + +P ++W ++ TA Y+INR PS +LS + P+++L P Y+ FG +
Sbjct: 699 ALLFQASLPIKFWGESILTAAYLINRTPSSILSGRTPYEVLHGSKPVYSQLRVFGSACYV 758
Query: 695 CLRPYMNNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSS 754
+KF RS CIF GY KG+K ++ + VS F E FP+AG +SS
Sbjct: 759 HRVTRDKDKFGQRSRSCIFVGYPFGKKGWKVYDIERNEFLVSRDVIFREEVFPYAGVNSS 818
Query: 755 PHDLVVSTFYEPASGSPPSPHPVVLVAPESSLPIGSLSCPSCDDSDVLPALPVPPADAPP 814
+ ST S P + V S+ C +V+ V ++ P
Sbjct: 819 T---LASTSLPTVSEDDDWAIPPLEV--RGSIDSVETERVVCTTDEVVLDTSVSDSEIPN 873
Query: 815 SPAPPSPTQTATPPVGTQAPPASPPTTP---PVVAQVPLAPIAARPRTRSQNGIFRPNPR 871
P T ++P + + + PTTP PV + +P++P P+ R P P+
Sbjct: 874 QEFVPDDTPPSSPLSVSPSGSPNTPTTPIVVPVASPIPVSP----PKQRKSKRATHPPPK 929
Query: 872 ---YAL---------VHAQPT------------------------------GILTAFHTV 889
Y L +HA P L A
Sbjct: 930 LNDYVLYNAMYTPSSIHALPADPSQSSTVPGKSLFPLTDYVSDAAFSSSHRAYLAAITDN 989
Query: 890 TKPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGTV 949
+PK FK A++ W AM E+ AL N TW +V P +G +WVF+TK++SDGTV
Sbjct: 990 VEPKHFKEAVQIKVWNDAMFTEVDALEINKTWDIVDLPPGKVAIGSQWVFKTKYNSDGTV 1049
Query: 950 ERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHGH 1009
ER KARLV QG Q+ G DY TF+PVV+ TTVR +L +V N W+++Q+DV NAFLHG
Sbjct: 1050 ERYKARLVVQGNKQVEGEDYKETFAPVVRMTTVRTLLRNVAANQWEVYQMDVHNAFLHGD 1109
Query: 1010 LTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADPS 1069
L E VYM+ PPGF P VCRL K+LYGLKQAPR WF++LS LLR GF S D S
Sbjct: 1110 LEEEVYMKLPPGFRHSH-PDKVCRLRKSLYGLKQAPRCWFKKLSDSLLRFGFVQSYEDYS 1168
Query: 1070 LFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITYT 1129
LF + + + L +L+YVDD+++ G+D +L +F L+ F++K LGKL YFLG+E++
Sbjct: 1169 LFSYTRNNIELRVLIYVDDLLICGNDGYMLQKFKDYLSRCFSMKDLGKLKYFLGIEVSRG 1228
Query: 1130 PDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHLVSS-GEAYFDPTHYRSLVGALQY 1188
P+G+FL Q KYA D+++ + L + TPL HL S G DP YR LVG L Y
Sbjct: 1229 PEGIFLSQRKYALDVIADSGNLGSRPAHTPLEQNHHLASDDGPLLSDPKPYRRLVGRLLY 1288
Query: 1189 LTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSD 1248
L TRP+LSY+V+ ++QF+Q P HF A R++RY+ G+ G+ + Y D
Sbjct: 1289 LLHTRPELSYSVHVLAQFMQNPREAHFDAALRVVRYLKGSPGQGILLNADPDLTLEVYCD 1348
Query: 1249 ADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNL 1308
+DW C TRRS Y + LG + +SW KKQ TV+ SS E+EYRAM+ E+ WL L
Sbjct: 1349 SDWQSCPLTRRSISAYVVLLGGSPISWKTKKQDTVSHSSAEAEYRAMSYALKEIKWLRKL 1408
Query: 1309 LHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPT 1368
L EL + S L D+++A+ +A NPV H+ KHI+ DCH V + V G + +HV T
Sbjct: 1409 LKELGIEQSTPARLYCDSKAAIHIAANPVFHERTKHIESDCHSVRDAVRDGIITTQHVRT 1468
Query: 1369 SLQLADIFTKALPRPLFEIFRSKLRV 1394
+ QLAD+FTKAL R F SKL V
Sbjct: 1469 TEQLADVFTKALGRNQFLYLMSKLGV 1494
>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37534632|ref|NP_921618.1| putative pol polyprotein
[Oryza sativa (japonica cultivar-group)]
Length = 1688
Score = 573 bits (1476), Expect = e-161
Identities = 331/848 (39%), Positives = 459/848 (54%), Gaps = 59/848 (6%)
Query: 578 SRSIKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGLAML 637
S +I++F+SD+G E+ +N S G L + SCP AQNG E+KHRH++E +L
Sbjct: 441 SSAIRIFRSDSGGEYMSNAFREFLVSQGTLPQLSCPGAHAQNGVAERKHRHIIETARTLL 500
Query: 638 YHSHVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFPCLR 697
S VP +W +A STAVY+IN PS L + P ++LF P Y + FGC + L
Sbjct: 501 IASFVPAHFWAEAISTAVYLINMQPSSSLQGRSPGEVLFGSPPRYDHLRVFGCTCYVLLA 560
Query: 698 PYMNNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSSPHD 757
P K + +S C+F GYS HKG++C++P+ +S FDE + P
Sbjct: 561 PRERTKLTAQSVECVFLGYSLEHKGYRCYDPSARRIRISRDVTFDE---------NKP-- 609
Query: 758 LVVSTFYEPASGSPPSPHPVVLVAPESSLPIGSLSCPSCDDSDVLPALPVPPADAPPSPA 817
F+ ++ P SP E+S I L P + LP+ P+ P+ +P P+
Sbjct: 610 -----FFYSSTNQPSSP--------ENS--ISFLYLPPIPSPESLPSSPITPSPSPIPPS 654
Query: 818 PPSPTQTATPPVGTQAPPASPPTTP-PVVAQVPLAP-----------IAARPRTRSQNGI 865
PSPT PP P SPP + P + P P + RP+ +++
Sbjct: 655 VPSPTYVPPPPPSPSPSPVSPPPSHIPASSSPPHVPSTITLDTFPFHYSRRPKIPNESQP 714
Query: 866 FRPN--------------PRYAL----VHAQPTGILTAFHTVTKPKGFKSAMKHP*WLAA 907
+P PRY L P V +P ++ A+ P W A
Sbjct: 715 SQPTLEDPTCSVDDSSPAPRYNLRARDALRAPNRDDFVVGVVFEPSTYQEAIVLPHWKLA 774
Query: 908 MEDELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGTVERLKARLVAQGFTQIPGF 967
M +EL+AL + TW +VP PS + KWV++ K SDG VER KARLVA+GF Q G
Sbjct: 775 MSEELAALERTNTWDVVPLPSHAVPITCKWVYKVKTKSDGQVERYKARLVARGFQQAHGR 834
Query: 968 DYSLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHGHLTETVYMEQPPGFVDPRF 1027
DY TF+PV TTVR +++ W + Q+DVKNAFLHG L E VYM PPG P
Sbjct: 835 DYDETFAPVAHMTTVRTLIAVAATRSWTISQMDVKNAFLHGDLHEEVYMHPPPGVEAP-- 892
Query: 1028 PTHVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADPSLFFFYKGHTTLYLLVYVD 1087
P HV RL +ALYGLKQAPRAWF R SS +L GFS S DP+LF LL+YVD
Sbjct: 893 PGHVFRLRRALYGLKQAPRAWFARFSSVVLAAGFSPSDHDPALFIHTSSRGRTLLLLYVD 952
Query: 1088 DIILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITYTPDGLFLGQAKYAHDLLSR 1147
D+++TG D + +L+ +F + LG L YFLG+E+T T DG +L Q +Y DLL++
Sbjct: 953 DMLITGDDLEYIAFVKGKLSEQFMMSDLGPLSYFLGIEVTSTVDGYYLSQHRYIEDLLAQ 1012
Query: 1148 AMMLEASHVSTPLAAGSHLVSS-GEAYFDPTHYRSLVGALQYLTITRPDLSYAVNTVSQF 1206
+ + ++ +TP+ L S+ G DP+ YR LVG+L YLT+TRPD++YAV+ +SQF
Sbjct: 1013 SGLTDSRTTTTPMELHVRLRSTDGTPLDDPSRYRHLVGSLVYLTVTRPDIAYAVHILSQF 1072
Query: 1207 LQAPTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSDADWARCTDTRRSTYGYAI 1266
+ AP H+ + R+LRY+ GT L + +S + +SD+ WA RRS GY I
Sbjct: 1073 VSAPISVHYGHLLRVLRYLRGTTTQCLFYAASSPLQLRAFSDSTWASDPIDRRSVTGYCI 1132
Query: 1267 FLGDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNLLHELRVRLSATPLLLSDN 1326
FLG +LL+W +KKQ V+RSS E+E RA+A T SE+VWL LL + V LL DN
Sbjct: 1133 FLGTSLLTWKSKKQTAVSRSSTEAELRALATTTSEIVWLRWLLADFGVSCDVPTPLLCDN 1192
Query: 1327 QSALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPTSLQLADIFTKALPRPLFE 1386
A+ +A +P+ H+ KHI +D F +A+ +VP+ LQ+AD FTKA R
Sbjct: 1193 TGAIQIANDPIKHELTKHIGVDASFTRSHCQQSTIALHYVPSELQVADFFTKAQTREHHR 1252
Query: 1387 IFRSKLRV 1394
+ KL V
Sbjct: 1253 LHLLKLNV 1260
>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1491
Score = 565 bits (1457), Expect = e-159
Identities = 347/879 (39%), Positives = 467/879 (52%), Gaps = 77/879 (8%)
Query: 575 KSISRSIKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGL 634
K +S+K+ +SDNGTEF + + F G++H+ SC T QNGRVE+KHRH+L +
Sbjct: 624 KQFGKSVKIIRSDNGTEFMC--LSSYFKEQGIVHQTSCVGTPQQNGRVERKHRHILNVSR 681
Query: 635 AMLYHSHVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFP 694
A+L+ + +P ++W +A TA Y+INR PS + + P++LL P Y FG +
Sbjct: 682 ALLFQASLPIKFWGEAVMTAAYLINRTPSSIHNGLSPYELLHGCKPDYDQLRVFGSACYA 741
Query: 695 CLRPYMNNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSS 754
+KF RS CIF GY KG+K ++ + + VS F E FP+A +
Sbjct: 742 HRVTRDKDKFGERSRLCIFVGYPFGQKGWKVYDLSTNEFIVSRDVVFRENVFPYATNEGD 801
Query: 755 PHDLVVSTFYEPASGSPPSPHPVVLVAPESSLPIGSLSCPSCDDS--------------- 799
T Y P P + E LP +L D++
Sbjct: 802 -------TIYTPPVTCP-------ITYDEDWLPFTTLEDRGSDENSLSDPPVCVTDVSES 847
Query: 800 ----DVLPALPVPPADAPPSPAPP-SPTQTATPPVGTQAPPA--SPPT--TPPVVAQVPL 850
D +LP P D P SP+ +PTQT T + +P SPP T P++ P
Sbjct: 848 DTEHDTPQSLPTP-VDDPLSPSTSVTPTQTPTNSSSSTSPSTNVSPPQQDTTPIIENTPP 906
Query: 851 --------------------------APIAARPRTRSQNGIFRPNPRYALVHAQPTGILT 884
P P T + + N +Y L +
Sbjct: 907 RQGKRQVQQLARLKDYILYNASCTPNTPHVLSPSTSQSSSSIQGNSQYPLTDYIFDECFS 966
Query: 885 AFHTV--------TKPKGFKSAMKHP*WLAAMEDELSALHKNCTWTLVPRPSTTNVVGIK 936
A H V +PK FK A+K W AM E+ AL N TW +V P+ +G +
Sbjct: 967 AGHKVFLAAITANDEPKHFKEAVKVKVWNDAMYKEVDALEVNKTWDIVDLPTGKVAIGSQ 1026
Query: 937 WVFRTKFHSDGTVERLKARLVAQGFTQIPGFDYSLTFSPVVKATTVRLILSHVVLNDWQL 996
WV++TKF++DGTVER KARLV QG QI G DY+ TF+PVVK TTVR +L V N W++
Sbjct: 1027 WVYKTKFNADGTVERYKARLVVQGNNQIEGEDYTETFAPVVKMTTVRTLLRLVAANQWEV 1086
Query: 997 HQLDVKNAFLHGHLTETVYMEQPPGFVDPRFPTHVCRLNKALYGLKQAPRAWFQRLSSFL 1056
+Q+DV NAFLHG L E VYM+ PPGF P VCRL K+LYGLKQAPR WF++LS L
Sbjct: 1087 YQMDVHNAFLHGDLEEEVYMKLPPGFRHSH-PDKVCRLRKSLYGLKQAPRCWFKKLSDAL 1145
Query: 1057 LRHGFSCSRADPSLFFFYKGHTTLYLLVYVDDIILTGSDPSLLTQFIARLNAEFAIKYLG 1116
R GF D S F + L +LVYVDD+I+ G+D ++ +F L F++K LG
Sbjct: 1146 KRFGFIQGYEDYSFFSYSCKGIELRVLVYVDDLIICGNDEYMVQKFKEYLGRCFSMKDLG 1205
Query: 1117 KLGYFLGLEITYTPDGLFLGQAKYAHDLLSRAMMLEASHVSTPLAAGSHLVSS-GEAYFD 1175
KL YFLG+E++ PDG+FL Q KYA D++S + L A TPL HL S G D
Sbjct: 1206 KLKYFLGIEVSRGPDGIFLSQRKYALDIISDSGTLGARPAYTPLEQNHHLASDDGPLLQD 1265
Query: 1176 PTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQAPTMEHFQAVKRILRYVCGTQHFGLTF 1235
P +R LVG L YL TRP+LSY+V+ +SQF+QAP H +A RI+RY+ G+ G+
Sbjct: 1266 PKPFRRLVGRLLYLLHTRPELSYSVHVLSQFMQAPREAHLEAAMRIVRYLKGSPGQGILL 1325
Query: 1236 RRTSSPAVLGYSDADWARCTDTRRSTYGYAIFLGDNLLSWSAKKQPTVARSSCESEYRAM 1295
+ Y D+D+ C TRRS Y + LG + +SW KKQ TV+ SS E+EYRAM
Sbjct: 1326 SSNKDLTLEVYCDSDFQSCPLTRRSLSAYVVLLGGSPISWKTKKQDTVSHSSAEAEYRAM 1385
Query: 1296 ANTASELVWLLNLLHELRVRLSATPLLLSDNQSALFMAQNPVAHKHAKHIDLDCHFVCEL 1355
+ E+ WL LL EL + L+A L D+++A+ +A NPV H+ KHI+ DCH V +
Sbjct: 1386 SVALKEIKWLNKLLKELGITLAAPTRLFCDSKAAISIAANPVFHERTKHIERDCHSVRDA 1445
Query: 1356 VASGRLAVRHVPTSLQLADIFTKALPRPLFEIFRSKLRV 1394
V G + HV TS QLADIFTKAL R F SKL +
Sbjct: 1446 VRDGIITTHHVRTSEQLADIFTKALGRNQFIYLMSKLGI 1484
>gb|AAG50751.1| polyprotein, putative [Arabidopsis thaliana] gi|25301686|pir||F96610
probable polyprotein T8L23.26 [imported] - Arabidopsis
thaliana
Length = 1468
Score = 548 bits (1412), Expect = e-154
Identities = 323/846 (38%), Positives = 463/846 (54%), Gaps = 38/846 (4%)
Query: 575 KSISRSIKVFQSDNGTEFTNNKVHTLFASSGVLHRFSCPHTQAQNGRVEQKHRHVLELGL 634
+ IK+ +SDNGTEF + + F G+ H SC T QNGRVE+KHRH+L +
Sbjct: 628 RQFDTEIKIVRSDNGTEFLCMREY--FLHKGIAHETSCVGTPHQNGRVERKHRHILNIAR 685
Query: 635 AMLYHSHVPTRYWVDAFSTAVYIINRVPSKVLSNQIPFQLLFHVAPTYANFHPFGCRVFP 694
A+ + S++P ++W + +A Y+INR PS +L + P+++L+ AP Y++ FG +
Sbjct: 686 ALRFQSYLPIQFWGECILSAAYLINRTPSMLLQGKSPYEMLYKTAPKYSHLRVFGSLCYA 745
Query: 695 CLRPYMNNKFSPRSTPCIFFGYSSHHKGFKCFNPAISLTYVSHHAQFDEFCFPFAGSHSS 754
+ + +KF+ RS C+F GY KG++ F+ +VS F E FP++ +
Sbjct: 746 HNQNHKGDKFAARSRRCVFVGYPHGQKGWRLFDLEEQKFFVSRDVIFQETEFPYSKMSCN 805
Query: 755 PHD------LVVSTFYEPASG---------SPPSPHPVVLVAP-------ESSLPIGSLS 792
D V F E A G + P V P ESS P +S
Sbjct: 806 EEDERVLVDCVGPPFIEEAIGPRTIIGRNIGEATVGPNVATGPIIPEINQESSSPSEFVS 865
Query: 793 CPSCDDSDVLPALPVPPADAPPSPAPPSPTQTATPPVGTQAPPASPPTTPPVVAQVPLAP 852
S D L + V AD P S P+P Q TQ P V+ ++P
Sbjct: 866 LSSLDP--FLASSTVQTADLPLSSTTPAPIQLRRSSRQTQKPMKLKNFVTNTVSVESISP 923
Query: 853 IAARPRTRSQNGIFRPNPRYALVH---AQPTGILTAFHTVTKPKGFKSAMKHP*WLAAME 909
A S + ++ P +Y H + L A +P + AM W AM
Sbjct: 924 EA------SSSSLY-PIEKYVDCHRFTSSHKAFLAAVTAGMEPTTYNEAMVDKAWREAMS 976
Query: 910 DELSALHKNCTWTLVPRPSTTNVVGIKWVFRTKFHSDGTVERLKARLVAQGFTQIPGFDY 969
E+ +L N T+++V P +G KWV++ K+ SDG +ER KARLV G Q G DY
Sbjct: 977 AEIESLRVNQTFSIVNLPPGKRALGNKWVYKIKYRSDGAIERYKARLVVLGNCQKEGVDY 1036
Query: 970 SLTFSPVVKATTVRLILSHVVLNDWQLHQLDVKNAFLHGHLTETVYMEQPPGFVDPRFPT 1029
TF+PV K +TVRL L DW +HQ+DV NAFLHG L E VYM+ P GF P+
Sbjct: 1037 DETFAPVAKMSTVRLFLGVAAARDWHVHQMDVHNAFLHGDLKEEVYMKLPQGFQCDD-PS 1095
Query: 1030 HVCRLNKALYGLKQAPRAWFQRLSSFLLRHGFSCSRADPSLFFFYKGHTTLYLLVYVDDI 1089
VCRL+K+LYGLKQAPR WF +LSS L ++GF+ S +D SLF + +++LVYVDD+
Sbjct: 1096 KVCRLHKSLYGLKQAPRCWFSKLSSALKQYGFTQSLSDYSLFSYNNDGIFVHVLVYVDDL 1155
Query: 1090 ILTGSDPSLLTQFIARLNAEFAIKYLGKLGYFLGLEITYTPDGLFLGQAKYAHDLLSRAM 1149
I++GS P + QF + L + F +K LG L YFLG+E++ G +L Q KY D++S
Sbjct: 1156 IISGSCPDAVAQFKSYLESCFHMKDLGLLKYFLGIEVSRNAQGFYLSQRKYVLDIISEMG 1215
Query: 1150 MLEASHVSTPLAAGSHL-VSSGEAYFDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQ 1208
+L A + PL L +S+ D + YR LVG L YL +TRP+LSY+V+T++QF+Q
Sbjct: 1216 LLGARPSAFPLEQNHKLSLSTSPLLSDSSRYRRLVGRLIYLVVTRPELSYSVHTLAQFMQ 1275
Query: 1209 APTMEHFQAVKRILRYVCGTQHFGLTFRRTSSPAVLGYSDADWARCTDTRRSTYGYAIFL 1268
P +H+ A R++RY+ G+ TS+ + G+ D+D+A C TRRS GY + L
Sbjct: 1276 NPRQDHWNAAIRVVRYLKSNPGQGILLSSTSTLQINGWCDSDYAACPLTRRSLTGYFVQL 1335
Query: 1269 GDNLLSWSAKKQPTVARSSCESEYRAMANTASELVWLLNLLHELRVRLSATPLLLSDNQS 1328
GD +SW KKQPTV+RSS E+EYRAMA EL+WL +L++L V + SD++S
Sbjct: 1336 GDTPISWKTKKQPTVSRSSAEAEYRAMAFLTQELMWLKRVLYDLGVSHVQAMRIFSDSKS 1395
Query: 1329 ALFMAQNPVAHKHAKHIDLDCHFVCELVASGRLAVRHVPTSLQLADIFTKALPRPLFEIF 1388
A+ ++ NPV H+ KH+++DCHF+ + + G +A VP+ QLADI TKAL F
Sbjct: 1396 AIALSVNPVQHERTKHVEVDCHFIRDAILDGIIATSFVPSHKQLADILTKALGEKEVRYF 1455
Query: 1389 RSKLRV 1394
KL +
Sbjct: 1456 LRKLGI 1461
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.347 0.151 0.556
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,300,015,958
Number of Sequences: 2540612
Number of extensions: 97859622
Number of successful extensions: 1023097
Number of sequences better than 10.0: 8453
Number of HSP's better than 10.0 without gapping: 3237
Number of HSP's successfully gapped in prelim test: 5832
Number of HSP's that attempted gapping in prelim test: 804446
Number of HSP's gapped (non-prelim): 63231
length of query: 1411
length of database: 863,360,394
effective HSP length: 141
effective length of query: 1270
effective length of database: 505,134,102
effective search space: 641520309540
effective search space used: 641520309540
T: 11
A: 40
X1: 14 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.7 bits)
S2: 82 (36.2 bits)
Lotus: description of TM0321.5