
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0296a.5
(754 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi... 308 4e-82
emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7... 301 4e-80
gb|AAF79879.1| T7N9.5 [Arabidopsis thaliana] 299 3e-79
gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi... 296 2e-78
emb|CAB10526.1| retrotransposon like protein [Arabidopsis thalia... 294 6e-78
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi... 291 5e-77
gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thalia... 291 7e-77
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t... 290 2e-76
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi... 289 3e-76
pir||E96608 probable retroelement polyprotein F25P12.89 [importe... 288 3e-76
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf... 276 2e-72
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi... 276 2e-72
gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsi... 261 6e-68
gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsi... 248 7e-64
gb|AAG50751.1| polyprotein, putative [Arabidopsis thaliana] gi|2... 244 1e-62
gb|AAD41979.1| putative retroelement pol polyprotein [Arabidopsi... 243 2e-62
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop... 243 2e-62
emb|CAB79159.1| LTR retrotransposon like protein [Arabidopsis th... 243 2e-62
dbj|BAB10743.1| retroelement pol polyprotein-like [Arabidopsis t... 242 3e-62
emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis... 240 1e-61
>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301701|pir||E84589 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1461
Score = 308 bits (789), Expect = 4e-82
Identities = 224/668 (33%), Positives = 337/668 (49%), Gaps = 57/668 (8%)
Query: 99 CCEVVKIFKQYRENDCVLCFLRGLNDNFAAARS*ILLMEPLPTLSKICSMIIQQERHLGI 158
C + V+++ Q E ++ FL GLN+++A R I+ + LP+L+++ ++ Q G
Sbjct: 216 CGKAVRLY-QKAEKAKIMKFLAGLNESYAIVRRQIIAKKALPSLAEVYHILDQDNSQKGF 274
Query: 159 -GVLHEPAVMAVHTGNTQGTHSNSNRGRSNSQYSSSKSNV*R-LCTHCGRQNHTVETCFL 216
V+ PA V S+S Y S N R C+ C R H E C+
Sbjct: 275 FNVVAPPAAFQV------SEVSHSPITSPEIMYVQSGPNKGRPTCSFCNRVGHIAERCYK 328
Query: 217 KHGYPSNF-PNGYKPEKAYIPTS-GGESHSSSDQDEPPSSSL--ELTREYLQGLLALL-- 270
KHG+P F P G +K P + + S D+ +L + + +Q L+AL
Sbjct: 329 KHGFPPGFTPKGKSSDKPPKPQAVAAQVTLSPDKMTGQLETLAGNFSPDQIQNLIALFSS 388
Query: 271 ---PQ--SKPTAVAQ--TPNLKPISTSNLVSSHN-------VIANNIGMVNSQWIMDSGA 316
PQ S TA +Q + + ++ S ++ S + + ++ + + W++DSGA
Sbjct: 389 QLQPQIVSPQTASSQHEASSSQSVAPSGILFSPSTYCFIGILAVSHNSLSSDTWVIDSGA 448
Query: 317 TDHIASSLSFF----SSYYSIKHVPVSLPNRAHAMKTI*GQYFFHLP*LYIMFSMYLNFL 372
T H++ F +S S ++P R + T+ L + + LN +
Sbjct: 449 THHVSHDRKLFQTLDTSIVSFVNLPTGPNVRISGVGTVLINKDIILQNVLFIPEFRLNLI 508
Query: 373 SILFLYTSLPKY*ITD*FSLIPCV*FWTRPL*R*LGELRSRMGYTSWSFHLSLVSNHFHS 432
SI L T L I D C LGE + R+G +L ++ +
Sbjct: 509 SISSLTTDLGTRVIFD----PSCCQIQDLTKGLTLGEGK-RIG------NLYVLDTQSPA 557
Query: 433 VNMSTVSSSNKLPNIWHHRLGHPSQPCYISLQQMYPPITTSPCKN-----CDTCHFAKQK 487
++++ V + +WH RLGHPS + L + + T+ KN C CH AKQK
Sbjct: 558 ISVNAVVDVS----VWHKRLGHPS---FSRLDSLSEVLGTTRHKNKKSAYCHVCHLAKQK 610
Query: 488 RLSFSLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEAR 547
+LSF N I + ELLH+D+WGP+SV ++ G KYF IV D SR TWI L+K+KS+
Sbjct: 611 KLSFPSANNICNSTFELLHIDVWGPFSVETVEGYKYFLTIVDDHSRATWIYLLKSKSDVL 670
Query: 548 AALQTFILYSQRQYGSLVKIVRLDNGVEFAMTDFYSQHGVQHQLSCVKTPQQNGVVERKH 607
FI + QY + VK VR DN E A T+FY G+ SC +TP+QN VVERKH
Sbjct: 671 TVFPAFIDLVENQYDTRVKSVRSDNAKELAFTEFYKAKGIVSFHSCPETPEQNSVVERKH 730
Query: 608 QHILNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLK 667
QHILN+ARALM QSN++L +W V+ AV L+N P+ +LS +PF +L PD LK
Sbjct: 731 QHILNVARALMFQSNMSLPYWGDCVLTAVFLINRTPSALLSNKTPFEVLTGKLPDYSQLK 790
Query: 668 VFGSLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETI 727
FG LCY+++ + R KF R+R C+ LG G KGY + DL++ V +SRNV F+E +
Sbjct: 791 TFGCLCYSSTSSKQRHKFLPRSRACVFLGY-PFGFKGYKLLDLESNVVHISRNVEFHEEL 849
Query: 728 FPFQATDK 735
FP ++ +
Sbjct: 850 FPLASSQQ 857
>emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7488558|pir||T14517
hypothetical protein 1 - wild cabbage transposon Melmoth
Length = 1131
Score = 301 bits (772), Expect = 4e-80
Identities = 210/678 (30%), Positives = 329/678 (47%), Gaps = 62/678 (9%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDNFAAARS*ILLMEPLPTLSKICSMIIQQERHLG 157
C C + ++ + ++ FL GLND++A R I++ + LP+L ++ +++ Q + G
Sbjct: 163 CVCGNAEKLQKKVDRAKIVKFLAGLNDSYAIIRRQIIMKKVLPSLVEVYNILDQDDSQKG 222
Query: 158 IGVLHEPAVMAVHTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGRQNHTVETCFLK 217
PA V + + G Q +K +C+ C R H E C+ K
Sbjct: 223 FSTAITPAAFNV---SENVPPPMAEAGICYVQTGPNKGRP--ICSFCNRVGHIAERCYKK 277
Query: 218 HGYPSNFPNGYKPEKAYIPTSGGESHSSSDQDEPPSSSLEL----------TREYLQGLL 267
HG+P F + YK + + + ++ PP+S ++E LQ +
Sbjct: 278 HGFPPGFVSKYKSQSSGDRLQKPKQVAAQVSFSPPNSGQSPMTMDHLVGNHSKEQLQQFI 337
Query: 268 ALLPQSKPTAVA----QTPNLKPISTSNLVSSHNVIA-------NNIGMVNSQWIMDSGA 316
AL P + + +P+ S + + + + + N WI+DSGA
Sbjct: 338 ALFSSQLPNVTMGSNEASSSKQPMDNSGISFNPTTLVFIGLLTVSRHTLANETWIIDSGA 397
Query: 317 TDHIASSLSFFSSYYSIKHVPVSLPNRAHAMKTI*G--QYFFHLP*LYIMF--SMYLNFL 372
T H+ S ++S V+LPN + G Q H+ +++ LN L
Sbjct: 398 THHVCHDRSMYTSIDITTTSNVNLPNGMIVKISGVGIVQLNEHITLHNVLYIPEFRLNLL 457
Query: 373 SILFLYTSLPKY*ITD*FS--LIPCV*FWTRPL*R*LGELRSRMGYTSWSFHLSLVSNHF 430
SI L + + I D S + WT +G+ R V+N +
Sbjct: 458 SISSLTSDIGSQVIFDVSSCAIQDPTKGWT------IGQGRR-------------VANLY 498
Query: 431 HSVNMSTVSSSNKLPNI--WHHRLGHPSQPCYISLQQMYPPITTSPCKN-----CDTCHF 483
S+ N + +I WH RLGHPS Y L ++ + T+ KN C CH
Sbjct: 499 VLDVKSSPMKINAVVDISLWHKRLGHPS---YTRLDKISEALGTTKHKNKGDAHCHVCHL 555
Query: 484 AKQKRLSFSLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAK 543
AKQK+LS+S +N I +LLHVD+WGP+SV ++ G KYF IV D SR TWI L+++K
Sbjct: 556 AKQKKLSYSSQNHICTASFQLLHVDVWGPFSVETLEGYKYFLTIVDDHSRATWIYLLQSK 615
Query: 544 SEARAALQTFILYSQRQYGSLVKIVRLDNGVEFAMTDFYSQHGVQHQLSCVKTPQQNGVV 603
S+ TF+ + QY + +K VR DN E + T+ + + G+ SC +T +QN V+
Sbjct: 616 SDVLHIFPTFVNQIETQYNTKIKSVRRDNAPELSFTELFKEKGIVSYHSCPETLEQNSVL 675
Query: 604 ERKHQHILNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDI 663
ERKHQH+LN+ARALM QS + L++W V+ A L+N P+ +L+ SP+ +L P
Sbjct: 676 ERKHQHLLNVARALMFQSQVPLQYWGDCVLTAAFLINRTPSPLLANKSPYEVLMGKAPQY 735
Query: 664 EHLKVFGSLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIF 723
+ L+ FG LCY ++ R KF R+R C+ LG G KGY + DL++ ++++SRNV F
Sbjct: 736 DQLRTFGCLCYGSTSPKQRHKFMPRSRACVFLGY-PSGYKGYKLLDLESNKIYISRNVTF 794
Query: 724 YETIFPFQATDKVSLSSL 741
+E IFP K+ SSL
Sbjct: 795 HEDIFPMAKHQKMDESSL 812
>gb|AAF79879.1| T7N9.5 [Arabidopsis thaliana]
Length = 1436
Score = 299 bits (765), Expect = 3e-79
Identities = 226/693 (32%), Positives = 330/693 (47%), Gaps = 74/693 (10%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDNFAAARS*ILLMEPLPTLSKICSMIIQQE--RH 155
C C+ VK + E V+ FL GL+D+F RS I M+P P L++I +M+ Q E R
Sbjct: 172 CDCDQVKELLEEAETSRVIQFLMGLSDDFNTIRSQIFNMKPRPGLNEIYNMLDQDESQRL 231
Query: 156 LGIGVLHEPAVMAVHTGNTQGTHSNSNRGR-SNSQYSSSKSNV*RLCTHCGRQNHTVETC 214
+G P+ TQG ++ N + + K CTHC R HTV+ C
Sbjct: 232 VGFAAKSVPSPSPA-AFQTQGVLNDQNTILLAQGNFKKPK------CTHCNRIGHTVDKC 284
Query: 215 FLKHGYPSNFPNGYKPEKAYIPTSGGESHSSSDQDEPPSSSLE----LTREYLQGLLALL 270
+ HGYP P E Y+ ++ S + PP+ S ++ +++Q L++ L
Sbjct: 285 YKVHGYPPGHPRA--KENTYVGSTNLASTDQIETQAPPTMSATGHETMSNDHIQQLISYL 342
Query: 271 PQSKPT---------AVAQTPNLKP----------ISTSNLVSSHNVIANNI-------- 303
+ A+A + N P S+SN V S + I
Sbjct: 343 STKLQSPSITSCFDKAIASSSNPVPSISQITDKAIASSSNPVPSISQITGTFFSLYDSTY 402
Query: 304 -GMVNSQ-----------WIMDSGATDHIASSLSFFSSYYSIKHVPVSLPNRAHAMKTI* 351
M+ S W++DSGA+ H+ + + +Y ++ V LPN H +K I
Sbjct: 403 YEMLTSSIPIETELSLRAWVIDSGASHHVTHERNLYHTYKALDRTFVRLPN-GHTVK-IE 460
Query: 352 GQYFFHLP*LYIMFSMYL------NFLSILFLYTSLP-KY*ITD*FSLIPCV*FWTRPL* 404
G F L + ++ N LS+ L +L K T +I + T+ L
Sbjct: 461 GTGFIQLTDALSLHNVLFIPEFKFNLLSVSVLTKTLQSKVSFTSDECMIQAL---TKELM 517
Query: 405 R*LGELRSRMGYTSWSFHLSLVSNHFHSVNMSTVSSSNKLPNIWHHRLGHPSQPCYISLQ 464
G + + L VS+ S SS +WH RLGHPS +L
Sbjct: 518 LGKGSQVGNLYILNLDKSLVDVSSF---PGKSVCSSVKNESEMWHKRLGHPSFAKIDTLS 574
Query: 465 Q--MYPPITTSP-CKNCDTCHFAKQKRLSFSLRNTIYGTVLELLHVDIWGPYSVASITGA 521
M P + +C CH +KQK L F N I EL+H+D WGP+SV ++
Sbjct: 575 DVLMLPKQKINKDSSHCHVCHLSKQKHLPFKSVNHIREKAFELVHIDTWGPFSVPTVDSY 634
Query: 522 KYFHIIVYDKSRFTWIKLMKAKSEARAALQTFILYSQRQYGSLVKIVRLDNGVEFAMTDF 581
+YF IV D SR TWI L+K KS+ +F+ + QY + V VR DN E +
Sbjct: 635 RYFLTIVDDFSRATWIYLLKQKSDVLTVFPSFLKMVETQYHTKVCSVRSDNAHELKFNEL 694
Query: 582 YSQHGVQHQLSCVKTPQQNGVVERKHQHILNIARALMLQSNLTLEFWSFAVIHAVCLMNL 641
+++ G++ C +TP+QN VVERKHQH+LN+ARALM QS + LE+W V+ AV L+N
Sbjct: 695 FAKEGIKADHPCPETPEQNFVVERKHQHLLNVARALMFQSGIPLEYWGDCVLTAVFLINR 754
Query: 642 LPTKVLSYSSPFIILHNLKPDIEHLKVFGSLCYATSLTAHRTKFPLRARKCIVLGQKEGG 701
L + V++ +P+ L KPD LK FG LCY ++ RTKF RA+ CI LG G
Sbjct: 755 LLSPVINNETPYERLTKGKPDYSSLKAFGCLCYCSTSPKSRTKFDPRAKACIFLGYPM-G 813
Query: 702 VKGYLMFDLKTREVFLSRNVIFYETIFPFQATD 734
KGY + D++T V +SR+VIFYE IFPF +++
Sbjct: 814 YKGYKLLDIETYSVSISRHVIFYEDIFPFASSN 846
>gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|7444418|pir||T00499 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1496
Score = 296 bits (758), Expect = 2e-78
Identities = 207/663 (31%), Positives = 317/663 (47%), Gaps = 75/663 (11%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDNFAAARS*ILLMEPLPTLSKICSMIIQQERHLG 157
C CE ++ RE+D V FL GL+ F++ RS I +EPLP L ++ S ++++E++L
Sbjct: 168 CKCEAASDIEKEREDDRVHKFLLGLDSRFSSIRSSITDIEPLPDLYQVYSRVVREEQNLN 227
Query: 158 IGVLHEPAVMAVHTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGRQNHTVETCFLK 217
+ + Q + + R +S CTHC R+ H V CFL
Sbjct: 228 ASRTKDVVKTEAIGFSVQSSTTPRFRDKSTL-----------FCTHCNRKGHEVTQCFLV 276
Query: 218 HGYPS---------NFPN-------------GYKPEKAYIPTSGGESHSSSDQDEPPSSS 255
HGYP N P+ G ++ PT+ G +++ Q P+ S
Sbjct: 277 HGYPDWWLEQNPQENQPSTRGRGSNGRGSSSGRGGNRSSAPTTRGRGRANNAQAAAPTVS 336
Query: 256 LELTREYLQGLLALLPQSKPTAVAQTPNLKPISTSNLVSSHNVIANNIGMVNSQWIMDSG 315
+ + Q L++LL +P+ S+S +S + + + + +D+G
Sbjct: 337 GDGNDQIAQ-LISLLQAQRPS-----------SSSERLSGNTCLTDGV--------IDTG 376
Query: 316 ATDHIASSLSFFSSYYSIKHVPVSLPNRAHAMKTI*GQYFFH-----LP*LYIM-FSMYL 369
A+ H+ S + I PV+ P+ + T G H L++ F L
Sbjct: 377 ASHHMTGDCSILVDVFDITPSPVTKPDGKASQATKCGTLLLHDSYKLHDVLFVPDFDCTL 436
Query: 370 NFLSILFLYTSLPKY*ITD*FSLIPCV*FWTRPL*R*LGELRSRMGYTSWSFHLSLVSNH 429
+S L TS TD F + R L +G R G + +++
Sbjct: 437 ISVSKLLKQTSSIAI-FTDTFCFLQ-----DRFLRTLIGAGEEREGVY---YFTGVLAPR 487
Query: 430 FHSVNMSTVSSSNKLPNIWHHRLGHPSQPCYISLQQMYPPITT-SPCKNCDTCHFAKQKR 488
H + S + +WH RLGHPS +SL + +CDTC +KQ R
Sbjct: 488 VHKASSDFAISGD----LWHRRLGHPSTSVLLSLPECNRSSQGFDKIDSCDTCFRSKQTR 543
Query: 489 LSFSLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARA 548
F + N L+H D+WGPY S TGA YF +V D SR W LM +K+E
Sbjct: 544 EVFPISNNKTMECFSLIHGDVWGPYRTPSTTGAVYFLTLVDDYSRSVWTYLMSSKTEVSQ 603
Query: 549 ALQTFILYSQRQYGSLVKIVRLDNGVEF-AMTDFYSQHGVQHQLSCVKTPQQNGVVERKH 607
++ F S+RQ+G VK R DNG EF +T ++ HG+ HQ SCV TPQQNG VERKH
Sbjct: 604 LIKNFCAMSERQFGKQVKAFRTDNGTEFMCLTPYFQTHGILHQTSCVDTPQQNGRVERKH 663
Query: 608 QHILNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLK 667
+HILN+ARA + Q NL ++FW +++ A L+N P+ VL +P+ +L +P + L+
Sbjct: 664 RHILNVARACLFQGNLPVKFWGESILTATHLINRTPSAVLKGKTPYELLFGERPSYDMLR 723
Query: 668 VFGSLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETI 727
FG LCYA ++ KF R+RKC+ +G G K + ++DL+T ++F SR+V F+E I
Sbjct: 724 SFGCLCYAHIRPRNKDKFTSRSRKCVFIGYPH-GKKAWRVYDLETGKIFASRDVRFHEDI 782
Query: 728 FPF 730
+P+
Sbjct: 783 YPY 785
>emb|CAB10526.1| retrotransposon like protein [Arabidopsis thaliana]
gi|7268497|emb|CAB78748.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444421|pir||A71444 probable
LTR retrotransposon - Arabidopsis thaliana
Length = 1433
Score = 294 bits (753), Expect = 6e-78
Identities = 219/702 (31%), Positives = 333/702 (47%), Gaps = 111/702 (15%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDNFAAARS*ILLMEPLPTLSKICSMIIQQE--RH 155
C CE VK + E ++ FL GLNDNFA R IL M+P P L++I +M+ Q E R
Sbjct: 203 CNCEHVKELLEEAETSRIIQFLMGLNDNFAHIRGQILNMKPRPGLTEIYNMLDQDESQRL 262
Query: 156 LGIGVLHEP-AVMAVHTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGRQNHTVETC 214
+G L P A V + N +G Y K C++C + H V+ C
Sbjct: 263 VGNPTLSNPTAAFQVQASPIIDSQVNMAQG----SYKKPK------CSYCNKLGHLVDKC 312
Query: 215 FLKHGYP--SNFPNGYKPEKAYI------PTSGGESHSSSDQDEPPSSSLELTREYLQGL 266
+ KHGYP S + G + P + + + +E + ++ YL
Sbjct: 313 YKKHGYPPGSKWTKGQTIGSTNLASTQLQPVNETPNEKTDSYEEFSTDQIQTMISYLSTK 372
Query: 267 LAL-----LPQSKPTAVAQTPNLKPISTSN--------------LVSSHNVIANNIGMVN 307
L + +P + +++ +P++ IS + L+SS ++ +
Sbjct: 373 LHIASASPMPTTSSASISASPSVPMISQISGTFLSLFSNAYYDMLISS---VSQEPAVSP 429
Query: 308 SQWIMDSGATDHIASSLSFFSSYYSIKHVPVSLPNRAHAMKTI*GQYFFHLP*LYIMFSM 367
W++DSGAT H+ + + ++ S+++ V LPN G +I S
Sbjct: 430 RGWVIDSGATHHVTHNRDLYLNFRSLENTFVRLPNDCTVKIAGIG---------FIQLSD 480
Query: 368 YLNFLSILFLYTSLPKY*ITD*FSLIPCV*FWTRPL*R*LGELRSRMGYTSWSFHLSLVS 427
++ ++L++ P++ F+LI + EL G + ++ +
Sbjct: 481 AISLHNVLYI----PEFK----FNLISEL----------TKELMIGRGSQVGNLYVLDFN 522
Query: 428 NHFHSVNMS---------TVSSSNKLPNI-WHHRLGHPSQPCYISLQQM----------- 466
+ H+V++ +V SS + ++ WH RLGHP+ L +
Sbjct: 523 ENNHTVSLKGTTSMCPEFSVCSSVVVDSVTWHKRLGHPAYSKIDLLSDVLNLKVKKINKE 582
Query: 467 YPPITTSPCKNCDTCHFAKQKRLSFSLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHI 526
+ P+ C C CH +KQK LSF R + +L+H+D WGP+SV +
Sbjct: 583 HSPV----CHVCHVCHLSKQKHLSFQSRQNMCSAAFDLVHIDTWGPFSVPT--------- 629
Query: 527 IVYDKSRFTWIKLMKAKSEARAALQTFILYSQRQYGSLVKIVRLDNGVEFAMTDFYSQHG 586
+ TWI L+K KS+ FI QY + +K VR DN E TD ++ HG
Sbjct: 630 -----NDATWIYLLKNKSDVLHVFPAFINMVHTQYQTKLKSVRSDNAHELKFTDLFAAHG 684
Query: 587 VQHQLSCVKTPQQNGVVERKHQHILNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKV 646
+ SC +TP+QN VVERKHQHILN+ARAL+ QSN+ LEFW V+ AV L+N LPT V
Sbjct: 685 IVAYHSCPETPEQNSVVERKHQHILNVARALLFQSNIPLEFWGDCVLTAVFLINRLPTPV 744
Query: 647 LSYSSPFIILHNLKPDIEHLKVFGSLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYL 706
L+ SP+ L N+ P E LK FG LCY+++ R KF RAR C+ LG G KGY
Sbjct: 745 LNNKSPYEKLKNIPPAYESLKTFGCLCYSSTSPKQRHKFEPRARACVFLGYPL-GYKGYK 803
Query: 707 MFDLKTREVFLSRNVIFYETIFPF-QATDKVSLSSLFPINSF 747
+ D++T V +SR+VIF+E IFPF +T K + FP+ F
Sbjct: 804 LLDIETHAVSISRHVIFHEDIFPFISSTIKDDIKDFFPLLQF 845
>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301694|pir||E84535 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1454
Score = 291 bits (745), Expect = 5e-77
Identities = 218/678 (32%), Positives = 322/678 (47%), Gaps = 70/678 (10%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDNFAAARS*ILLMEPLPTLSKICSMIIQQERHLG 157
C C +Q E ++ FL GLN+++A R I+ + LP+L ++ ++ Q
Sbjct: 210 CTCGKAMRLQQKAEQAKIVKFLAGLNESYAIVRRQIIAKKALPSLGEVYHILDQDNSQQS 269
Query: 158 IGVLHEPAVMAVHTGNTQG---------THSNSNRGRSNSQYSSSKSNV*RLCTHCGRQN 208
+ P + TQ + N+GR +C+ R
Sbjct: 270 FSNVVAPPAAFQVSEITQSPSMDPTVCYVQNGPNKGRP-------------ICSFYNRVG 316
Query: 209 HTVETCFLKHGYPSNF-PNGYKPEKAYIPTSGGESHSSSDQDEPPSSSL--ELTREYLQG 265
H E C+ KHG+P F P G EK P + + S + S+ L++E LQ
Sbjct: 317 HIAERCYKKHGFPPGFTPKGKAGEKLQKPKPLAANVAESSEVNTSLESMVGNLSKEQLQQ 376
Query: 266 LLALL--------PQSKPTA-VAQTPNLKPI---STSNLVSSHNVIANNIGMVNSQWIMD 313
+A+ P + TA +Q+ NL ST + + V + + ++ W++D
Sbjct: 377 FIAMFSSQLQNTPPSTYATASTSQSDNLGICFSPSTYSFIGILTVARHTLS--SATWVID 434
Query: 314 SGATDHIASSLSFFSSYYSIKHVPVSLPN----RAHAMKTI*GQYFFHLP*LYIMFSMYL 369
SGAT H++ S FSS + V+LP + + T+ L + + L
Sbjct: 435 SGATHHVSHDRSLFSSLDTSVLSAVNLPTGPTVKISGVGTLKLNDDILLKNVLFIPEFRL 494
Query: 370 NFLSILFLYTSLPKY*ITD*FSLIPCV*FWTRPL*R*LGELRSRM-GYTSWSFHLSLVSN 428
N +SI L + I D S ++ RM G +L L+
Sbjct: 495 NLISISSLTDDIGSRVIFDKNSC------------EIQDLIKGRMLGQGRRVANLYLLDV 542
Query: 429 HFHSVNMSTVSSSNKLPNIWHHRLGHPSQPCYISLQQMYPPITTSPCKN-----CDTCHF 483
S++++ V + +WH RLGH S L + + T+ KN C CH
Sbjct: 543 GDQSISVNAVVDIS----MWHRRLGHASLQ---RLDAISDSLGTTRHKNKGSDFCHVCHL 595
Query: 484 AKQKRLSFSLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAK 543
AKQ++LSF N + + +LLH+D+WGP+SV ++ G KYF IV D SR TW+ L+K K
Sbjct: 596 AKQRKLSFPTSNKVCKEIFDLLHIDVWGPFSVETVEGYKYFLTIVDDHSRATWMYLLKTK 655
Query: 544 SEARAALQTFILYSQRQYGSLVKIVRLDNGVEFAMTDFYSQHGVQHQLSCVKTPQQNGVV 603
SE FI + QY VK VR DN E T FY++ G+ SC +TP+QN VV
Sbjct: 656 SEVLTVFPAFIQQVENQYKVKVKAVRSDNAPELKFTSFYAEKGIVSFHSCPETPEQNSVV 715
Query: 604 ERKHQHILNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDI 663
ERKHQHILN+ARALM QS + L W V+ AV L+N P+++L +P+ IL P
Sbjct: 716 ERKHQHILNVARALMFQSQVPLSLWGDCVLTAVFLINRTPSQLLMNKTPYEILTGTAPVY 775
Query: 664 EHLKVFGSLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIF 723
E L+ FG LCY+++ R KF R+R C+ LG G KGY + DL++ VF+SRNV F
Sbjct: 776 EQLRTFGCLCYSSTSPKQRHKFQPRSRACLFLGY-PSGYKGYKLMDLESNTVFISRNVQF 834
Query: 724 YETIFPFQATDKVSLSSL 741
+E +FP A + S SSL
Sbjct: 835 HEEVFPL-AKNPGSESSL 851
>gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thaliana]
gi|25403501|pir||H86486 protein Ty1/copia-element
polyprotein [imported] - Arabidopsis thaliana
Length = 1152
Score = 291 bits (744), Expect = 7e-77
Identities = 222/674 (32%), Positives = 318/674 (46%), Gaps = 87/674 (12%)
Query: 98 CCCEVVKIF-----KQYRENDCVLCFLRGLND-NFAAARS*IL---LMEPLPTLSKICSM 148
CCC Q R+++ + FL GL+ F +R+ IL + +L I S
Sbjct: 172 CCCNRPSCTHRVRQSQRRDHERIHQFLMGLDAAKFGTSRTNILGRLSRDDNISLDSIYSE 231
Query: 149 IIQQERHLGIGVLHEPAVMAVHTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGRQN 208
II +ERHL I E V AV G ++ ++ R N+ CTHCGR N
Sbjct: 232 IIAEERHLTITRSKEERVDAVGFAVQTGVNAIASVTRVNNMGP---------CTHCGRSN 282
Query: 209 HTVETCFLKHGYPSNFPNGYKPE-------KAYIPTSGGESHSSS-----DQDEPPSSSL 256
H+ +TCF HG P + Y ++ P G H +S Q PSSS
Sbjct: 283 HSADTCFKLHGVPEWYTEKYGDTSSGRGRGRSSTPRGRGRGHGNSYKANNAQTSHPSSSA 342
Query: 257 E-------LTREYLQGLLALLPQSKPTAVAQTPNLKPISTSNLVSSHNVIANNIGMVNSQ 309
+++E + LL Q T S+ L N +
Sbjct: 343 SEFSDIPGVSKEAWSAIRNLLKQDTAT-----------SSEKLSGKTNCV---------D 382
Query: 310 WIMDSGATDHIASSLSFFSSYYSIKHVPVSLPNRAHAMKTI*GQYFF--HLP*LYIMF-- 365
+++DSGA+ H+ L + Y I H V LPN H + T G ++ +++F
Sbjct: 383 FLIDSGASHHMTGFLDLLTEIYEIPHSVVVLPNAKHTIATKKGTLILGANMKLTHVLFVP 442
Query: 366 SMYLNFLSILFLYTSLPKY*ITD*FSLIPCV*FWTRPL*R*LGELRSRM-----GYTSWS 420
+ +S+ L L + I F+ CV + + S+M ++
Sbjct: 443 DLSCTLISVARLLRELHCFAI---FTDKVCV----------IQDRTSKMLIGVGTESNGV 489
Query: 421 FHLSLVSNHFHSVNMSTVSSSNKLPNIWHHRLGHPSQPCYISLQQMYPPITT--SPCKN- 477
+HL S N+ ++ L WH RLGHPS S+ + S K
Sbjct: 490 YHLQRAEVVATSANVVKWKTNKAL---WHMRLGHPSSKVLSSVLPSLEDFDSCSSDLKTI 546
Query: 478 CDTCHFAKQKRLSFSLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWI 537
CD C AKQ R SFS +H D+WGPY AS GA YF IV D SR WI
Sbjct: 547 CDVCVRAKQTRASFSESFNKAEECFSFIHYDVWGPYKHASSCGAHYFLTIVDDHSRAVWI 606
Query: 538 KLMKAKSEARAALQTFILYSQRQYGSLVKIVRLDNGVEF-AMTDFYSQHGVQHQLSCVKT 596
LM AKSE + LQ FI + RQ+ VK VR +NG EF ++ ++++ G+ HQ+SCV T
Sbjct: 607 HLMLAKSEVASLLQQFIAMASRQFNKQVKTVRSNNGTEFMSLKSYFAERGIVHQISCVYT 666
Query: 597 PQQNGVVERKHQHILNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIIL 656
QQNG VERKH+HILN+AR+L+ Q+ L + FW +V+ A L+N PT +L +P+ IL
Sbjct: 667 HQQNGRVERKHRHILNVARSLLFQAELPISFWEESVLTAAYLINRTPTPILDGKTPYKIL 726
Query: 657 HNLKPDIEHLKVFGSLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVF 716
++ P L+VFGSLC+A T KF R RKCI +G G KG+ ++D++++ F
Sbjct: 727 YSQPPSYASLRVFGSLCFARKHTGRLDKFQERGRKCIFVGYPH-GQKGWRIYDIESQIFF 785
Query: 717 LSRNVIFYETIFPF 730
+SR+V+F E IFPF
Sbjct: 786 VSRDVVFQEDIFPF 799
>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1491
Score = 290 bits (741), Expect = 2e-76
Identities = 207/655 (31%), Positives = 320/655 (48%), Gaps = 52/655 (7%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDN-FAAARS*ILLMEPLPTLSKICSMIIQQERHL 156
C C + RE + + F+ GL+D+ F + ++ M+P P+L +I S ++++E+ L
Sbjct: 179 CTCGATLEPSKEREEEKIHQFVLGLDDSRFGGLSATLIAMDPFPSLGEIYSRVVREEQRL 238
Query: 157 GIGVLHEPAVMAVHTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGRQNHTVETCFL 216
+ E A+ Q + R S+ S +S LC+HCGR H + C+
Sbjct: 239 ASVQIREQQQSAIGFLTRQSEVTADGRTDSSIIKSRDRSV---LCSHCGRSGHEKKDCWQ 295
Query: 217 KHGYPSNFPNGYKPEKAYIPTSGGESHSSSDQDEPPSSSLELTREYLQGLLALLPQSKPT 276
G+P + E+ GG SS + S S R Q A S +
Sbjct: 296 IVGFPD-----WWTERT---NGGGRGSSSRGRGGRSSGSNNSGRGRGQVTAAHATTSNLS 347
Query: 277 AVAQ-TPNLKPISTSNLVSSHNVIANNIG--MVNSQWIMDSGATDHIASSLSFFSSYYSI 333
+ + TP+ + T + + +N ++ + M I+D+GA+ H+ LS ++ +I
Sbjct: 348 SFPEFTPDQLRVITQMIQNKNNGTSDKLSGKMKLGDVILDTGASHHMTGQLSLLTNIVTI 407
Query: 334 KHVPVSLPNRAHAMKTI*GQY----------FFHLP*L---YIMFSMYLNFLSILFLYTS 380
V + G + ++P L I S + + L L+T
Sbjct: 408 PSCSVGFADDRKTFAISMGTFKLSETVSLSNVLYVPALNCSLISVSKLVKQIKCLALFTD 467
Query: 381 LPKY*ITD*FSLIPCV*FWTRPL*R*LGELRSRMGYTSWSFHLSLVSNHFHSVNMSTVSS 440
+ D FS R L GE R + Y + + H V+++T +
Sbjct: 468 TICV-LQDRFS---------RTLIG-TGEERDGVYYLT-----DAATTTVHKVDVTTDHA 511
Query: 441 SNKLPNIWHHRLGHPSQPCYISLQQMYPPITTSPCKNCDTCHFAKQKRLSFSLRNTIYGT 500
+WH RLGHPS SL + ++CD C AKQ R F +
Sbjct: 512 ------LWHQRLGHPSFSVLSSLPLFSGSSCSVSSRSCDVCFRAKQTREVFPDSSNKSTD 565
Query: 501 VLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARAALQTFILYSQRQ 560
L+H D+WGPY V S GA YF IV D SR W L+ AKSE R+ L F+ Y+++Q
Sbjct: 566 CFSLIHCDVWGPYRVPSSCGAVYFLTIVDDFSRSVWTYLLLAKSEVRSVLTNFLAYTEKQ 625
Query: 561 YGSLVKIVRLDNGVEF-AMTDFYSQHGVQHQLSCVKTPQQNGVVERKHQHILNIARALML 619
+G VKI+R DNG EF ++ ++ + G+ HQ SCV TPQQNG VERKH+HILN++RAL+
Sbjct: 626 FGKSVKIIRSDNGTEFMCLSSYFKEQGIVHQTSCVGTPQQNGRVERKHRHILNVSRALLF 685
Query: 620 QSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLKVFGSLCYATSLT 679
Q++L ++FW AV+ A L+N P+ + + SP+ +LH KPD + L+VFGS CYA +T
Sbjct: 686 QASLPIKFWGEAVMTAAYLINRTPSSIHNGLSPYELLHGCKPDYDQLRVFGSACYAHRVT 745
Query: 680 AHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETIFPFQATD 734
+ KF R+R CI +G G KG+ ++DL T E +SR+V+F E +FP+ +
Sbjct: 746 RDKDKFGERSRLCIFVGY-PFGQKGWKVYDLSTNEFIVSRDVVFRENVFPYATNE 799
>gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301695|pir||D84481 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1413
Score = 289 bits (739), Expect = 3e-76
Identities = 207/655 (31%), Positives = 319/655 (48%), Gaps = 52/655 (7%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDN-FAAARS*ILLMEPLPTLSKICSMIIQQERHL 156
C C + RE + + F+ GL+D+ F + ++ M+P P+L +I S ++++E+ L
Sbjct: 179 CTCGATLEPSKEREEEKIHQFVLGLDDSRFGGLSATLIAMDPFPSLGEIYSRVVREEQRL 238
Query: 157 GIGVLHEPAVMAVHTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGRQNHTVETCFL 216
+ E A+ Q + R S+ S +S LC+HCGR H + C+
Sbjct: 239 ASVQIREQQQSAIGFLTRQSEVTADGRTDSSIIKSRDRSV---LCSHCGRSGHEKKDCWQ 295
Query: 217 KHGYPSNFPNGYKPEKAYIPTSGGESHSSSDQDEPPSSSLELTREYLQGLLALLPQSKPT 276
G+P + E+ GG SS + S S R Q A S +
Sbjct: 296 IVGFPD-----WWTERT---NGGGRGSSSRGRGGRSSGSNNSGRGRGQVTAAHATTSNLS 347
Query: 277 AVAQ-TPNLKPISTSNLVSSHNVIANNIG--MVNSQWIMDSGATDHIASSLSFFSSYYSI 333
+ TP+ + T + + +N ++ + M I+D+GA+ H+ LS ++ +I
Sbjct: 348 PFPEFTPDQLRVITQMIQNKNNGTSDKLSGKMKLGDVILDTGASHHMTGQLSLLTNIVTI 407
Query: 334 KHVPVSLPNRAHAMKTI*GQY----------FFHLP*L---YIMFSMYLNFLSILFLYTS 380
V + G + ++P L I S + + L L+T
Sbjct: 408 PSCSVGFADGRKTFAISMGTFKLSETVSLSNVLYVPALNCSLISVSKLVKQIKCLALFTD 467
Query: 381 LPKY*ITD*FSLIPCV*FWTRPL*R*LGELRSRMGYTSWSFHLSLVSNHFHSVNMSTVSS 440
+ D FS R L GE R + Y + + H V+++T +
Sbjct: 468 TICV-LQDRFS---------RTLIG-TGEERDGVYYLT-----DAATTTVHKVDITTDHA 511
Query: 441 SNKLPNIWHHRLGHPSQPCYISLQQMYPPITTSPCKNCDTCHFAKQKRLSFSLRNTIYGT 500
+WH RLGHPS SL + ++CD C AKQ R F +
Sbjct: 512 ------LWHQRLGHPSFSVLSSLPLFSGSSCSVSSRSCDVCFRAKQTREVFPDSSNKSTD 565
Query: 501 VLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARAALQTFILYSQRQ 560
L+H D+WGPY V S GA YF IV D SR W L+ AKSE R+ L F+ Y+++Q
Sbjct: 566 CFSLIHCDVWGPYRVPSSCGAVYFLTIVDDFSRSVWTYLLLAKSEVRSVLTNFLAYTEKQ 625
Query: 561 YGSLVKIVRLDNGVEF-AMTDFYSQHGVQHQLSCVKTPQQNGVVERKHQHILNIARALML 619
+G VKI+R DNG EF ++ ++ + G+ HQ SCV TPQQNG VERKH+HILN++RAL+
Sbjct: 626 FGKSVKIIRSDNGTEFMCLSSYFKEQGIVHQTSCVGTPQQNGRVERKHRHILNVSRALLF 685
Query: 620 QSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLKVFGSLCYATSLT 679
Q++L ++FW AV+ A L+N P+ + + SP+ +LH KPD + L+VFGS CYA +T
Sbjct: 686 QASLPIKFWGEAVMTAAYLINRTPSSIHNGLSPYELLHGCKPDYDQLRVFGSACYAHRVT 745
Query: 680 AHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETIFPFQATD 734
+ KF R+R CI +G G KG+ ++DL T E +SR+V+F E +FP+ +
Sbjct: 746 RDKDKFGERSRLCIFVGY-PFGQKGWKVYDLSTNEFIVSRDVVFRENVFPYATNE 799
>pir||E96608 probable retroelement polyprotein F25P12.89 [imported] -
Arabidopsis thaliana gi|9954746|gb|AAG09097.1| Putative
retroelement polyprotein [Arabidopsis thaliana]
Length = 1486
Score = 288 bits (738), Expect = 3e-76
Identities = 213/667 (31%), Positives = 317/667 (46%), Gaps = 68/667 (10%)
Query: 107 KQYRENDCVLCFLRGLNDN-FAAARS*ILLMEPLPTLSKICSMIIQQERHLGIGVLHEPA 165
++ RE D + FL GL+++ + A +S +L PLP+L + + + Q E + LH
Sbjct: 176 RKEREEDKLHQFLMGLDESVYGAVKSALLSRVPLPSLEEAYNALTQDEESKSLSRLHNER 235
Query: 166 VMAVHTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGRQNHTVETCFLKHGYPSNFP 225
V V S ++ S+ S N R+C++CGR H E CF GYP
Sbjct: 236 VDGV-----------SFAVQTTSRPRDSSEN--RVCSNCGRVGHLAEQCFKLIGYPPWLE 282
Query: 226 NGYKPEKAYIPTSGGESHSSSDQDEPPSSSLE--------------------LTREYLQG 265
+ + + GG S Q SS+ LT + G
Sbjct: 283 EKLRLKNTASSSRGGLSSFKGKQSHGRGSSINHVASSGMAANVVTNSSLTSPLTSDDRIG 342
Query: 266 LLALLPQSKPTAVAQTPNLKPISTSNLVSSHNVIANNIGMVNSQWIMDSGATDHIASSLS 325
L L + QT + STSN S + WI+DSGAT+H+ SL+
Sbjct: 343 LSGL--NDSQWKILQTILEERKSTSNDHQSGKYFLES-------WIIDSGATNHMTGSLA 393
Query: 326 FFSSYYSIKHVPVSLPNRAHAMKTI*GQY----FFHLP*LYIMFSMYLNFLSILFLY-TS 380
F + + V + LP+ T G L + + ++ + +S+ L T
Sbjct: 394 FLRNVCDMPPVLIKLPDGRFTTATKQGSVQLGSSLDLQDVLFVDGLHCHLISVSQLTRTR 453
Query: 381 LPKY*ITD*FSLIPCV*FWTRPL*R*LGELRSRMGYTSWSFHLSLVSNHFHSVNMSTVSS 440
+ ITD ++ R +G R G F V + +
Sbjct: 454 RCIFQITDKVCIVQ-----DRTTLMLIGAGRELNGLY-----------FFRGVETAAAVT 497
Query: 441 SNKLPN--IWHHRLGHPSQPCYISLQQMYPPITTSPCKNCDTCHFAKQKRLSFSLRNTIY 498
S LP+ +WH RLGHPS L +T K C+ C AKQ R F L +
Sbjct: 498 SKALPSSQLWHQRLGHPSSKALHLLPFSDVTSSTFDSKTCEICIQAKQTRDPFPLSSNKT 557
Query: 499 GTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARAALQTFILYSQ 558
EL+H D+WGPY SI G++YF +V D SR W+ L+ +K EA L+ FI +
Sbjct: 558 SFAFELVHCDLWGPYRTTSICGSRYFLTLVDDYSRAVWLYLLPSKQEAPKHLKNFIALVE 617
Query: 559 RQYGSLVKIVRLDNGVEF-AMTDFYSQHGVQHQLSCVKTPQQNGVVERKHQHILNIARAL 617
RQY + +K++R DNG EF ++DF++Q G+ H+ SCV TPQQNG VERKH+HILN+ARAL
Sbjct: 618 RQYTTNIKMIRSDNGSEFICLSDFFAQKGIIHETSCVGTPQQNGRVERKHRHILNVARAL 677
Query: 618 MLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLKVFGSLCYATS 677
QS L +EFWS+ + A L+N PT +L +PF +++N P ++H+++FG +CY +
Sbjct: 678 RFQSGLPIEFWSYCALTAAYLINRTPTPLLKGKTPFELIYNRPPPLQHIRIFGCICYVHN 737
Query: 678 LTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETIFPFQATDKVS 737
L KF R+ K I LG KG+ +++++T V +SR+V+F ET F F + S
Sbjct: 738 LKHGGDKFASRSNKSIFLGY-PFAKKGWRVYNIETGVVSVSRDVVFRETEFHFPISVMDS 796
Query: 738 LSSLFPI 744
SL P+
Sbjct: 797 SPSLDPV 803
>gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm,
score: 11.19) [Arabidopsis thaliana]
gi|7486705|pir||T01879 hypothetical protein F8M12.17 -
Arabidopsis thaliana
Length = 1633
Score = 276 bits (706), Expect = 2e-72
Identities = 205/673 (30%), Positives = 307/673 (45%), Gaps = 92/673 (13%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDNFAAARS*ILLMEPLPTLSKICSMIIQQERHLG 157
C C+ +++ ++ V FL GLN+++ R IL+++P+ T+ + +++ Q ER
Sbjct: 188 CECDAAVKWERLQQRSHVTKFLMGLNESYEQTRRHILMLKPIRTIEEAFNIVTQDERQKA 247
Query: 158 IGVLHEPAVMAVHTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGRQNHTVETCFLK 217
I R + + LCT+CG+ HTV+ C+
Sbjct: 248 I--------------------------RPTPKVDNQDQLKLPLCTNCGKVGHTVQKCYKI 281
Query: 218 HGYPSNFPNGYKPEKAYIPTSGGESHSSSDQDEPPSSSLELTREYLQGLLALLPQ----- 272
GYP + + I T Q L ++ + P
Sbjct: 282 IGYPPGYKAATSYRQPQIQTQPRMQMPQQSQPRMQQPIQHLISQFNAQVRVQEPAATSIY 341
Query: 273 -SKPTA-------VAQTPNLKPI---STSNLVSSHNVIANNIGMVNSQ-------WIMDS 314
S PTA +AQT I STS ++N+ N + + Q WI+DS
Sbjct: 342 TSSPTATITEHGLMAQTSTSGTIPFPSTSLKYENNNLTFQNHTLSSLQNVLSSDAWIIDS 401
Query: 315 GATDHIASSLSFFSSYYSIKHVPVSLPNRAHAMKTI*GQYFFHLP*LYIMFSMYLNFLSI 374
GA+ H+ S L+ F + V V+LPN T G ++ + I
Sbjct: 402 GASSHVCSDLTMFRELIHVSGVTVTLPNGTRVAITHTG-------------TICITSTLI 448
Query: 375 LFLYTSLPKY*ITD*FSLIP-CV*FWTRPL*R*LGELRSRMGYTSWSFHLSLVSNHFHSV 433
L +P + F+LI C TR L G+ N+ + +
Sbjct: 449 LHNVLLVPDFK----FNLISVCCLELTRGLMIGRGK----------------TYNNLYIL 488
Query: 434 NMSTVSSSNKLPNIWHHRLGHPSQPCYISLQQMYPPI--TTSPCKNCDTCHFAKQKRLSF 491
S S LP HPS P L P + +S +C AKQKRL++
Sbjct: 489 ETQRTSFSPSLPAATSR---HPSLPALQKLVSSIPSLKSVSSTASHCRISPLAKQKRLAY 545
Query: 492 SLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARAALQ 551
N + + +L+H+DIWGP+S+ S+ G +YF +V D +R TW+ +MK KSE
Sbjct: 546 VSHNNLASSPFDLIHLDIWGPFSIESVDGFRYFLTLVDDCTRTTWVYMMKNKSEVSNIFP 605
Query: 552 TFILYSQRQYGSLVKIVRLDNGVEFAMTDFYSQHGVQHQLSCVKTPQQNGVVERKHQHIL 611
F+ QY + +K +R DN E A T F + G+ HQ SC TPQQN VVERKHQH+L
Sbjct: 606 VFVKLIFTQYNAKIKAIRSDNVKELAFTKFVKEQGMIHQFSCAYTPQQNSVVERKHQHLL 665
Query: 612 NIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLKVFGS 671
NIAR+L+ QSN+ L++WS V+ A L+N LP+ +L +PF +L PD LK
Sbjct: 666 NIARSLLFQSNVPLQYWSDCVLTAAYLINRLPSPLLDNKTPFELLLKKIPDYTLLK--SC 723
Query: 672 LCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETIFPFQ 731
LCYA++ R KF RAR C+ LG G KGY + DL++ + ++RNV+F+ET FPF+
Sbjct: 724 LCYASTNVHDRNKFSPRARPCVFLGY-PSGYKGYKVLDLESHSISITRNVVFHETKFPFK 782
Query: 732 ATDKVSLS-SLFP 743
+ + S +FP
Sbjct: 783 TSKFLKESVDMFP 795
>gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301698|pir||C84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1501
Score = 276 bits (705), Expect = 2e-72
Identities = 204/671 (30%), Positives = 326/671 (48%), Gaps = 60/671 (8%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDN-FAAARS*ILLMEPLPTLSKICSMIIQQERHL 156
C C + RE + + F+ GL+++ F + ++ M+PLP+L +I S +I++E+ L
Sbjct: 184 CRCGATSEPTKEREEEKIHQFVLGLDESRFGGLCATLINMDPLPSLGEIYSRVIREEQRL 243
Query: 157 GIGVLHEPAVMAV----------HTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGR 206
+ E AV H + S S + S K V C++CGR
Sbjct: 244 ASVHVREQKEEAVGFLARREQLDHHSRVDASSSRSEHTGGSRSNSIIKGRV--TCSNCGR 301
Query: 207 QNHTVETCFLKHGYPSNFP--NGYKPE----KAYIPTSGGESHSSSDQDEPPSSSLELTR 260
H + C+ G+P + NG + + ++GG SS+ +
Sbjct: 302 TGHEKKECWQIVGFPDWWSERNGGRGSNGRGRGGRGSNGGRGQGQVMAAHATSSNSSVFP 361
Query: 261 EYLQGLLALLPQSKPTAVAQTPNLKPISTSNLVSSHNVIANNIGMVNSQWIMDSGATDHI 320
E+ + + +L Q V + N STSN S +G + I+DSGA+ H+
Sbjct: 362 EFTEEHMRVLSQ----LVKEKSNSG--STSNNNSDRLSGKTKLGDI----ILDSGASHHM 411
Query: 321 ASSLSFFSSYYSIKHVPVSLPNRAHAMKTI*GQYFFHLP*LYIMFSMYLNFLSILFLYT- 379
+LS ++ + PV + + A F L + S ++ ++LF+ +
Sbjct: 412 TGTLSSLTNVVPVPPCPVGFADGSKA---------FALSVGVLTLSNTVSLTNVLFVPSL 462
Query: 380 SLPKY*ITD*FSLIPCV*FWT--------RPL*R*LGELRSRMGYTSWSFHLSLVSN-HF 430
+ ++ C+ +T R +G R G ++L+ V+
Sbjct: 463 NCTLISVSKLLKQTQCLATFTDTLCFLQDRSSKTLIGSGEERGGV----YYLTDVTPAKI 518
Query: 431 HSVNMSTVSSSNKLPNIWHHRLGHPSQPCYISLQQMYPPITTSPCKNCDTCHFAKQKRLS 490
H+ N+ + + +WH RLGHPS SL +T +CD C AKQ R
Sbjct: 519 HTANVDSDQA------LWHQRLGHPSFSVLSSLPLFSKTSSTVTSHSCDVCFRAKQTREV 572
Query: 491 FSLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARAAL 550
F L+H D+WGPY V + GA YF IV D SR W L+ KSE R L
Sbjct: 573 FPESINKTEECFSLIHCDVWGPYRVPASCGAVYFLTIVDDYSRAVWTYLLLEKSEVRQVL 632
Query: 551 QTFILYSQRQYGSLVKIVRLDNGVEF-AMTDFYSQHGVQHQLSCVKTPQQNGVVERKHQH 609
F+ Y+++Q+G VK+VR DNG EF ++ ++ ++G+ HQ SCV TPQQNG VERKH+H
Sbjct: 633 TNFLKYAEKQFGKTVKMVRSDNGTEFMCLSSYFRENGIIHQTSCVGTPQQNGRVERKHRH 692
Query: 610 ILNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLKVF 669
ILN+ARAL+ Q++L ++FW +++ A L+N P+ +LS +P+ +LH KP L+VF
Sbjct: 693 ILNVARALLFQASLPIKFWGESILTAAYLINRTPSSILSGRTPYEVLHGSKPVYSQLRVF 752
Query: 670 GSLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETIFP 729
GS CY +T + KF R+R CI +G G KG+ ++D++ E +SR+VIF E +FP
Sbjct: 753 GSACYVHRVTRDKDKFGQRSRSCIFVGY-PFGKKGWKVYDIERNEFLVSRDVIFREEVFP 811
Query: 730 FQATDKVSLSS 740
+ + +L+S
Sbjct: 812 YAGVNSSTLAS 822
>gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301674|pir||D84639 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1156
Score = 261 bits (667), Expect = 6e-68
Identities = 137/310 (44%), Positives = 191/310 (61%), Gaps = 7/310 (2%)
Query: 433 VNMSTVSSSNKLPNIWHHRLGHPSQPCYISLQQMYPPITTSPCKNCDTCHFAKQKRLSFS 492
++ + VSS L WH RLGHPS SL + + ++CD C AKQ R F
Sbjct: 79 IHTAKVSSDQAL---WHQRLGHPSFSVLSSLPVLTSSSLSVGSRSCDVCFRAKQTREVFP 135
Query: 493 LRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARAALQT 552
+ L+H D+WGPY V S GA YF IV D SR W L+ AKSE R L
Sbjct: 136 VSTNKSIECFSLIHCDVWGPYRVPSSCGAVYFLTIVDDFSRAVWTYLLLAKSEVRTVLTN 195
Query: 553 FILYSQRQYGSLVKIVRLDNGVEF-AMTDFYSQHGVQHQLSCVKTPQQNGVVERKHQHIL 611
F++Y+++Q+G VK++R DNG EF + ++ +HG+ HQ SCV TPQQNG VERKH+HIL
Sbjct: 196 FLVYTEKQFGKSVKVLRSDNGTEFMCLASYFREHGIVHQTSCVGTPQQNGRVERKHRHIL 255
Query: 612 NIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLKVFGS 671
N+ARA++ Q++L ++FW AV+ A L+N PT + + SP+ ILHN KP+ EHL+VFGS
Sbjct: 256 NVARAILFQASLPIQFWGEAVLTAAYLINRTPTSLHNGLSPYEILHNSKPNYEHLRVFGS 315
Query: 672 LCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETIFPFQ 731
CY + + KF R+R C+ +G KG+ +FD++ +E +SR+V+F E +FP+
Sbjct: 316 ACYVHRASRDKDKFGERSRLCVFIGY-PFAQKGWKVFDMEKKEFLVSRDVVFREDVFPYA 374
Query: 732 A--TDKVSLS 739
A TD VS S
Sbjct: 375 ATNTDHVSAS 384
>gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301700|pir||G84542 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1333
Score = 248 bits (632), Expect = 7e-64
Identities = 130/307 (42%), Positives = 185/307 (59%), Gaps = 5/307 (1%)
Query: 429 HFHSVNM--STVSSSNKLPNIWHHRLGHPSQPCYISLQQMYPPITTSPC-KNCDTCHFAK 485
HF S + S K +WH R+GHP+ + + ++++ K CD CH AK
Sbjct: 371 HFRSTEIAASVTVKEEKNYELWHSRMGHPAARVVSLIPESSVSVSSTHLNKACDVCHRAK 430
Query: 486 QKRLSFSLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSE 545
Q R SF L + EL++ D+WGPY S TGA+YF I+ D SR W+ L+ KSE
Sbjct: 431 QTRNSFPLSINKTLRIFELIYCDLWGPYRTPSHTGARYFLTIIDDYSRGVWLYLLNDKSE 490
Query: 546 ARAALQTFILYSQRQYGSLVKIVRLDNGVEF-AMTDFYSQHGVQHQLSCVKTPQQNGVVE 604
A L+ F + RQ+ +K VR DNG EF +T F+ + GV H+ SCV TP++N VE
Sbjct: 491 APCHLKNFFAMTDRQFNVKIKTVRSDNGTEFLCLTKFFQEQGVIHERSCVATPERNDRVE 550
Query: 605 RKHQHILNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIE 664
RKH+H+LN+ARAL Q+NL ++FW V+ A L+N P+ VL+ S+P+ LH +P +
Sbjct: 551 RKHRHLLNVARALRFQANLPIQFWGECVLTAAYLINRTPSSVLNDSTPYERLHKKQPRFD 610
Query: 665 HLKVFGSLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFY 724
HL+VFGSLCYA + KF R+R+C+ +G G KG+ +FDL+ E F+SR+V+F
Sbjct: 611 HLRVFGSLCYAHNRNRGGDKFAERSRRCVFVGYPH-GQKGWRLFDLEQNEFFVSRDVVFS 669
Query: 725 ETIFPFQ 731
E FPF+
Sbjct: 670 ELEFPFR 676
Score = 49.7 bits (117), Expect = 4e-04
Identities = 41/152 (26%), Positives = 69/152 (44%), Gaps = 14/152 (9%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDN-FAAARS*ILLMEPLPTLSKICSMIIQQERHL 156
C C++ + ++ RE D V FL GL+D F RS ++ P+ L ++ +++ Q+E L
Sbjct: 80 CVCDLGALQEKDREEDKVHEFLSGLDDALFRTVRSSLVSRIPVQPLEEVYNIVRQEEDLL 139
Query: 157 --GIGVLHEPAVMAVHTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGRQNHTVETC 214
G VL + + + +GR + + S +C HC R H E+C
Sbjct: 140 RNGANVLDDQREVNAFAAQMR---PKLYQGRGDEKDKSM------VCKHCNRSGHASESC 190
Query: 215 FLKHGYPSNFPNGYKPEKAYIPTSGGESHSSS 246
+ GYP + G +P + T G +SS
Sbjct: 191 YAVIGYPEWW--GDRPRSRSLQTRGRGGTNSS 220
>gb|AAG50751.1| polyprotein, putative [Arabidopsis thaliana]
gi|25301686|pir||F96610 probable polyprotein T8L23.26
[imported] - Arabidopsis thaliana
Length = 1468
Score = 244 bits (622), Expect = 1e-62
Identities = 132/300 (44%), Positives = 182/300 (60%), Gaps = 4/300 (1%)
Query: 434 NMSTVSSSNKLP-NIWHHRLGHPSQPCYISLQQMYPPITTSPCKN-CDTCHFAKQKRLSF 491
N + V +S K P ++WH RLGH S L + +N CDTC AKQ R +F
Sbjct: 501 NAAAVHTSVKAPFDLWHRRLGHASDKIVNLLPRELLSSGKEILENVCDTCMRAKQTRDTF 560
Query: 492 SLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARAALQ 551
L + +L+H D+WGPY S +GA+YF IV D SR W+ LM KSE + L+
Sbjct: 561 PLSDNRSMDSFQLIHCDVWGPYRAPSYSGARYFLTIVDDYSRGVWVYLMTDKSETQKHLK 620
Query: 552 TFILYSQRQYGSLVKIVRLDNGVEF-AMTDFYSQHGVQHQLSCVKTPQQNGVVERKHQHI 610
FI +RQ+ + +KIVR DNG EF M +++ G+ H+ SCV TP QNG VERKH+HI
Sbjct: 621 DFIALVERQFDTEIKIVRSDNGTEFLCMREYFLHKGIAHETSCVGTPHQNGRVERKHRHI 680
Query: 611 LNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLKVFG 670
LNIARAL QS L ++FW ++ A L+N P+ +L SP+ +L+ P HL+VFG
Sbjct: 681 LNIARALRFQSYLPIQFWGECILSAAYLINRTPSMLLQGKSPYEMLYKTAPKYSHLRVFG 740
Query: 671 SLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETIFPF 730
SLCYA + KF R+R+C+ +G G KG+ +FDL+ ++ F+SR+VIF ET FP+
Sbjct: 741 SLCYAHNQNHKGDKFAARSRRCVFVGYPH-GQKGWRLFDLEEQKFFVSRDVIFQETEFPY 799
Score = 65.9 bits (159), Expect = 5e-09
Identities = 67/277 (24%), Positives = 111/277 (39%), Gaps = 55/277 (19%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDN-FAAARS*ILLMEPLPTLSKICSMIIQQERHL 156
C C + ++YRE+D V +L GLN+ F RS + PLP L ++ +++ Q+E +
Sbjct: 173 CICNLGTDQEKYREDDMVHQYLYGLNETKFHTIRSSLTSRVPLPGLEEVYNIVRQEEDMV 232
Query: 157 GIGVLHEPAVMAVHTGNTQGTHSNSNRGR-SNSQYSSSKSNV*RLCTHCGRQNHTVETCF 215
+E S + +NS+ +K +LCTHC R H+ E CF
Sbjct: 233 NNRSSNEERTDVTAFAVQMRPRSEVISEKFANSEKLQNK----KLCTHCNRGGHSPENCF 288
Query: 216 LKHGYPSNFP------------------------NGYKPEKAYI--------PTSGGESH 243
+ GYP + NG +P Y+ P+S +
Sbjct: 289 VLIGYPEWWGDRPRGKSNSNGSTSRGRGRFGPGFNGGQPRPTYVNVVMTGPFPSSEHVNR 348
Query: 244 SSSDQDEPPSSSLELTREYLQGLLALLPQSKPTAVAQTPNLKPISTSNLVSSHNVIANNI 303
+D D S LT E +G++ LL + + N ++H +
Sbjct: 349 VITDSDRDAVSG--LTDEQWRGVVKLLNAGR--------------SDNKSNAHETQSGTC 392
Query: 304 GMVNSQWIMDSGATDHIASSLSFFSSYYSIKHVPVSL 340
+ S WI+D+GA+ H+ +L S S+ V + L
Sbjct: 393 SLFTS-WILDTGASHHMTGNLELLSDMRSMSPVLIIL 428
>gb|AAD41979.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301676|pir||B84534 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1264
Score = 243 bits (620), Expect = 2e-62
Identities = 128/293 (43%), Positives = 171/293 (57%), Gaps = 9/293 (3%)
Query: 448 WHHRLGHPSQPCYISLQQMYPPITTSPCKN-----CDTCHFAKQKRLSFSLRNTIYGTVL 502
WH+RL H S L + + T+ KN C CH AK ++LSF +N + +
Sbjct: 417 WHNRLRHASLQ---RLDVISESLGTTKHKNKGSDYCHVCHLAKHRKLSFPSQNNVCNEIF 473
Query: 503 ELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARAALQTFILYSQRQYG 562
E+LH+DIWGP+SV ++ G +YF IV D SR TWI L+K KSE FI + QY
Sbjct: 474 EMLHIDIWGPFSVETVDGYQYFLTIVDDHSRATWIYLLKTKSEVLTIFHDFIQQVENQYK 533
Query: 563 SLVKIVRLDNGVEFAMTDFYSQHGVQHQLSCVKTPQQNGVVERKHQHILNIARALMLQSN 622
VK VR DN E T Y + G+ SC +TP+QN VVERKHQHILN+ARALM QS
Sbjct: 534 VKVKAVRSDNAPELRFTSLYQRKGIMAFHSCPETPEQNSVVERKHQHILNVARALMFQSQ 593
Query: 623 LTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLKVFGSLCYATSLTAHR 682
+ L W V+ AV L+N P+++LS +P+ IL P L+ FG LCY+++ R
Sbjct: 594 VPLFLWGECVLTAVFLINRTPSQLLSNKTPYEILSGTAPQYGQLRTFGCLCYSSTSPKQR 653
Query: 683 TKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETIFPFQATDK 735
KF R++ CI LG G KGY + DL++ +F+SRNV+F E +FP T K
Sbjct: 654 HKFQPRSKACIFLGY-SSGYKGYKLMDLESNAIFISRNVVFLEEVFPLAGTKK 705
Score = 45.8 bits (107), Expect = 0.005
Identities = 36/151 (23%), Positives = 60/151 (38%), Gaps = 23/151 (15%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDNFAAARS*ILLMEPLPTLSKICSMIIQQERHLG 157
C C +Q E ++ FL GLN+++A R ++ + LP+L+++ ++ Q G
Sbjct: 184 CTCGKALRLQQKAERAKIVKFLAGLNESYAIIRRQVIAKKILPSLAEVYHIVDQDNSQQG 243
Query: 158 IGVLHEPAVMAVHTGNTQG---------THSNSNRGRSNSQYSSSKSNV*RLCTHCGRQN 208
+ P V + T + N+GR +C+ R
Sbjct: 244 FSNVVAPPVAFQVSEVTVANIIDPTICYVQNCPNKGRP-------------MCSFYNRVG 290
Query: 209 HTVETCFLKHGYPSNF-PNGYKPEKAYIPTS 238
H E C+ KHG+P F P +K P S
Sbjct: 291 HIAERCYKKHGFPPGFTPKDKVGDKTQKPKS 321
>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
(gb|U12626). [Arabidopsis thaliana]
gi|25301690|pir||G96722 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana
Length = 1315
Score = 243 bits (620), Expect = 2e-62
Identities = 125/307 (40%), Positives = 181/307 (58%), Gaps = 11/307 (3%)
Query: 432 SVNMSTVSSSNKLPNIWHHRLGHPS----QPCYISLQQMYPPITTSPCKNCDTCHFAKQK 487
S+ +++V+S + +WH RLGHPS QP +S +P + +C CH +KQK
Sbjct: 395 SITVASVTSHD----LWHKRLGHPSVQKLQP--MSSLLSFPKQKNNTDFHCRVCHISKQK 448
Query: 488 RLSFSLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEAR 547
L F N +L+H+D WGP+SV + G +YF IV D SR TW+ L++ KS+
Sbjct: 449 HLPFVSHNNKSSRPFDLIHIDTWGPFSVQTHDGYRYFLTIVDDYSRATWVYLLRNKSDVL 508
Query: 548 AALQTFILYSQRQYGSLVKIVRLDNGVEFAMTDFYSQHGVQHQLSCVKTPQQNGVVERKH 607
+ TF+ + Q+ + +K VR DN E T FY G+ SC +TPQQN VVERKH
Sbjct: 509 TVIPTFVTMVENQFETTIKGVRSDNAPELNFTQFYHSKGIVPYHSCPETPQQNSVVERKH 568
Query: 608 QHILNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLK 667
QHILN+AR+L QS++ + +W ++ AV L+N LP +L PF +L P +H+K
Sbjct: 569 QHILNVARSLFFQSHIPISYWGDCILTAVYLINRLPAPILEDKCPFEVLTKTVPTYDHIK 628
Query: 668 VFGSLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETI 727
VFG LCYA++ R KF RA+ C +G G KGY + DL+T + +SR+V+F+E +
Sbjct: 629 VFGCLCYASTSPKDRHKFSPRAKACAFIGY-PSGFKGYKLLDLETHSIIVSRHVVFHEEL 687
Query: 728 FPFQATD 734
FPF +D
Sbjct: 688 FPFLGSD 694
Score = 77.8 bits (190), Expect = 1e-12
Identities = 57/199 (28%), Positives = 95/199 (47%), Gaps = 8/199 (4%)
Query: 110 RENDCVLCFLRGLNDNFAAARS*ILLMEPLPTLSKICSMIIQQERHLGIGVLHEPAVMAV 169
RE + V+ FL GLND + RS IL+ + LP+LS++ +MI Q E + P +
Sbjct: 127 RETNRVIDFLMGLNDCYDTVRSQILMKKTLPSLSEVFNMIDQDETQRSARISTTPGM--- 183
Query: 170 HTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGRQNHTVETCFLKHGYPSNFPNGYK 229
T + + S++ N K +C++C R H +TC+ KHGYP++F + K
Sbjct: 184 -TSSVFPVSNQSSQSALNGDTYQKKER--PVCSYCSRPGHVEDTCYKKHGYPTSFKSKQK 240
Query: 230 PEKAYIPTSGGESHSSSDQDEPPSSSLELTREYLQGLLALLPQS-KPTAVAQTPNLKPIS 288
K I ++ S + S+ +LT +Q L++ L +P + P + IS
Sbjct: 241 FVKPSI-SANAAIGSEEVVNNTSVSTGDLTTSQIQQLVSFLSSKLQPPSTPVQPEVHSIS 299
Query: 289 TSNLVSSHNVIANNIGMVN 307
S+ SS + + G V+
Sbjct: 300 VSSDPSSSSTVCPISGSVH 318
>emb|CAB79159.1| LTR retrotransposon like protein [Arabidopsis thaliana]
gi|2961349|emb|CAA18107.1| LTR retrotransposon like
protein [Arabidopsis thaliana] gi|11358464|pir||T49111
hypothetical retrovirus-related pol polyprotein
AT4g22040 - Arabidopsis thaliana
Length = 1109
Score = 243 bits (620), Expect = 2e-62
Identities = 131/300 (43%), Positives = 182/300 (60%), Gaps = 4/300 (1%)
Query: 434 NMSTVSSSNKLP-NIWHHRLGHPSQPCYISLQQMYPPITTSPCKN-CDTCHFAKQKRLSF 491
N + V +S K P ++WH RLGH S L + +N CDTC AKQ R +F
Sbjct: 277 NAAAVHTSVKAPFDLWHRRLGHASDKIVNLLPRELLSSGKEILENVCDTCMRAKQTRDTF 336
Query: 492 SLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARAALQ 551
LR+ +L+H D+WGPY S +GA+YF IV D SR W+ LM KSE + L+
Sbjct: 337 PLRDNRSMDSFQLIHCDVWGPYRTPSYSGARYFLTIVDDYSRGVWVYLMTDKSETQKHLK 396
Query: 552 TFILYSQRQYGSLVKIVRLDNGVEF-AMTDFYSQHGVQHQLSCVKTPQQNGVVERKHQHI 610
F+ +RQ+ + +K VR DNG EF M +++ G+ H+ SCV TP QNG VERKH+HI
Sbjct: 397 DFMALVERQFDTEIKTVRSDNGTEFLCMREYFLHKGIAHETSCVGTPHQNGRVERKHRHI 456
Query: 611 LNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLKVFG 670
LNIARAL QS L ++FW ++ A L+N P+ +L SP+ +L+ P HL+VFG
Sbjct: 457 LNIARALRFQSYLPIQFWGECILSAAYLINRTPSMLLQGKSPYEMLYKTAPKYSHLRVFG 516
Query: 671 SLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETIFPF 730
SLCYA + KF R+R+C+ +G G KG+ +FDL+ ++ F+SR+VIF ET FP+
Sbjct: 517 SLCYAHNQNHKGDKFAARSRRCVFVGYPH-GQKGWRLFDLEEQKFFVSRDVIFQETEFPY 575
>dbj|BAB10743.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1109
Score = 242 bits (618), Expect = 3e-62
Identities = 131/300 (43%), Positives = 182/300 (60%), Gaps = 4/300 (1%)
Query: 434 NMSTVSSSNKLP-NIWHHRLGHPSQPCYISLQQMYPPITTSPCKN-CDTCHFAKQKRLSF 491
N + V +S K P ++WH RLGH S L + +N CDTC AKQ R +F
Sbjct: 277 NAAAVHTSVKAPFDLWHRRLGHASDKIVNLLPRELLSSGKEILENVCDTCMRAKQTRDTF 336
Query: 492 SLRNTIYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARAALQ 551
L + +L+H D+WGPY S +GA+YF IV D SR W+ LM KSE + L+
Sbjct: 337 PLSDNRSMDSFQLIHCDVWGPYRTPSYSGARYFLTIVDDYSRGVWVYLMTDKSETQKHLK 396
Query: 552 TFILYSQRQYGSLVKIVRLDNGVEF-AMTDFYSQHGVQHQLSCVKTPQQNGVVERKHQHI 610
FI +RQ+ + +K VR DNG EF M +++ G+ H+ SCV TP QNG VERKH+HI
Sbjct: 397 DFIALVERQFDTEIKTVRSDNGTEFLCMREYFLHKGITHETSCVGTPHQNGRVERKHRHI 456
Query: 611 LNIARALMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLKVFG 670
LNIARAL QS L ++FW ++ A L+N P+ +L SP+ +L+ P+ HL+VFG
Sbjct: 457 LNIARALRFQSYLPIQFWGECILSAAYLINRTPSMLLQGKSPYEMLYKTAPNYSHLRVFG 516
Query: 671 SLCYATSLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETIFPF 730
SLCYA + KF R+R+C+ +G G KG+ +FDL+ ++ F+SR+VIF ET FP+
Sbjct: 517 SLCYAHNQNHKGDKFVARSRRCVFVGYPH-GQKGWRLFDLEEQKFFVSRDVIFQETEFPY 575
>emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
gi|7268152|emb|CAB78488.1| retrovirus-related like
polyprotein [Arabidopsis thaliana]
gi|7488175|pir||G71406 probable retrovirus-related
polyprotein - Arabidopsis thaliana
Length = 1489
Score = 240 bits (613), Expect = 1e-61
Identities = 152/488 (31%), Positives = 243/488 (49%), Gaps = 59/488 (12%)
Query: 265 GLLALLPQSKPTAVAQTPNLKPISTSNLVSSHNVIANNIGMVNSQWIMDSGATDHIASSL 324
G +AL S + +LK + +H + A + + WI+DSGA+ H+ S L
Sbjct: 437 GFMALTSTSGTIIPFPSTSLKYENNDLKFQNHTLSALQKFLPSDAWIIDSGASSHVCSDL 496
Query: 325 SFFSSYYSIKHVPVSLPNRAHAMKTI*GQYFFHLP*LYIMFSMYLNFLSILFLYTSLPKY 384
+ F S+ H + + H+P N +S+ L ++
Sbjct: 497 AMFRELKSVS-------GTVHITQKLILHNVLHVP------DFKFNLMSVSSLVKTIS-- 541
Query: 385 *ITD*FSLIPCV*FWTRPL*R*LGELRSRMGYTSWSFHLSLVSNHFHSVNMSTVSSSNKL 444
+ F + C+ + EL + + +L + + ST + + L
Sbjct: 542 -CSAHFYVDCCL----------IQELSQGLMIGRGRLYHNLYILETENTSPSTSTPAACL 590
Query: 445 --------PNIWHHRLGHPSQPCYISLQQMYPPITTSPCKNCDTCHFAKQKRLSFSLRNT 496
++WH RLGHPS + LQ++ KRL++ N
Sbjct: 591 FTGSVLNDGHLWHQRLGHPSS---VVLQKL--------------------KRLAYISHNN 627
Query: 497 IYGTVLELLHVDIWGPYSVASITGAKYFHIIVYDKSRFTWIKLMKAKSEARAALQTFILY 556
+ +L+H+DIWGP+S+ SI G +YF +V D +R TW+ +++ K + + FI
Sbjct: 628 LASNPFDLVHLDIWGPFSIESIEGFRYFLTVVDDCTRTTWVYMLRNKKDVSSVFPEFIKL 687
Query: 557 SQRQYGSLVKIVRLDNGVEFAMTDFYSQHGVQHQLSCVKTPQQNGVVERKHQHILNIARA 616
Q+ + +K +R DN E T+ +HG+ H SC TPQQN VVERKHQHILN+ARA
Sbjct: 688 VSTQFNAKIKAIRSDNAPELGFTEIVKEHGMLHHFSCAYTPQQNSVVERKHQHILNVARA 747
Query: 617 LMLQSNLTLEFWSFAVIHAVCLMNLLPTKVLSYSSPFIILHNLKPDIEHLKVFGSLCYAT 676
L+ QSN+ +++WS V AV L+N LP+ +L+ SP+ ++ N +PD LK FG LC+ +
Sbjct: 748 LLFQSNIPMQYWSDCVTTAVFLINRLPSPLLNNKSPYELILNKQPDYSLLKNFGCLCFVS 807
Query: 677 SLTAHRTKFPLRARKCIVLGQKEGGVKGYLMFDLKTREVFLSRNVIFYETIFPFQATDKV 736
+ RTKF RAR C+ LG G KGY + DL++ V +SRNV+F E +FPF+ ++ +
Sbjct: 808 TNAHERTKFTPRARACVFLGY-PSGYKGYKVLDLESHSVTVSRNVVFKEHVFPFKTSELL 866
Query: 737 SLS-SLFP 743
+ + +FP
Sbjct: 867 NKAVDMFP 874
Score = 74.3 bits (181), Expect = 1e-11
Identities = 42/138 (30%), Positives = 70/138 (50%), Gaps = 9/138 (6%)
Query: 98 CCCEVVKIFKQYRENDCVLCFLRGLNDNFAAARS*ILLMEPLPTLSKICSMIIQQERHLG 157
C C+ ++ ++ V FL+ LN+ F R IL+++P+PT+ + +M+ Q ER
Sbjct: 187 CECDAAVKWEHLQQRSRVTKFLKELNEGFDQTRRHILMLKPIPTIKEAFNMVTQDERQRN 246
Query: 158 IGVLHEPAVMAVHTGNTQGTHSNSNRGRSNSQYSSSKSNV*RLCTHCGRQNHTVETCFLK 217
+ L V +V NT + + N + Y++ + N +CTHCG+ HT++ C+
Sbjct: 247 VKPLTR--VDSVAFQNTSMINEDENA--YVAAYNTVRPNQKPICTHCGKVGHTIQKCYKV 302
Query: 218 HGYPSNFPNG-----YKP 230
HGYP G YKP
Sbjct: 303 HGYPPGMKTGNTGYTYKP 320
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.337 0.145 0.465
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,172,871,816
Number of Sequences: 2540612
Number of extensions: 46625337
Number of successful extensions: 204432
Number of sequences better than 10.0: 932
Number of HSP's better than 10.0 without gapping: 604
Number of HSP's successfully gapped in prelim test: 328
Number of HSP's that attempted gapping in prelim test: 201252
Number of HSP's gapped (non-prelim): 1912
length of query: 754
length of database: 863,360,394
effective HSP length: 136
effective length of query: 618
effective length of database: 517,837,162
effective search space: 320023366116
effective search space used: 320023366116
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.7 bits)
S2: 79 (35.0 bits)
Lotus: description of TM0296a.5