
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0171.6
(799 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Sequences producing significant alignments:                      Score (bits)  E Value
UniRef100_Q9FJV3 Retroelement pol polyprotein-like [Arabidopsis ... 424 e-117
UniRef100_O65468 Hypothetical protein F21P8.50 [Arabidopsis thal... 421 e-116
UniRef100_Q9FX79 Putative retroelement polyprotein [Arabidopsis ... 408 e-112
UniRef100_Q9XII7 Putative retroelement pol polyprotein [Arabidop... 396 e-108
UniRef100_Q8L700 Hypothetical protein [Arabidopsis thaliana] 392 e-107
UniRef100_O22175 Putative retroelement pol polyprotein [Arabidop... 392 e-107
UniRef100_Q9SIM3 Putative retroelement pol polyprotein [Arabidop... 390 e-107
UniRef100_O23588 Retrotransposon like protein [Arabidopsis thali... 387 e-106
UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana] 385 e-105
UniRef100_Q5XWR5 Putative retroelement pol polyprotein-like [Sol... 379 e-103
UniRef100_Q9LVQ2 Retroelement pol polyprotein-like [Arabidopsis ... 377 e-103
UniRef100_Q9ZPU4 Putative retroelement pol polyprotein [Arabidop... 375 e-102
UniRef100_Q8W153 Polyprotein [Oryza sativa] 370 e-100
UniRef100_Q7X6S0 OSJNBb0011N17.2 protein [Oryza sativa] 369 e-100
UniRef100_Q9C692 Polyprotein, putative [Arabidopsis thaliana] 335 3e-90
UniRef100_Q9MAJ8 F27F5.19 [Arabidopsis thaliana] 305 3e-81
UniRef100_Q9ZPG3 F5K24.2 protein [Arabidopsis thaliana] 304 6e-81
UniRef100_Q9SJ99 Putative retroelement pol polyprotein [Arabidop... 303 2e-80
UniRef100_O65452 LTR retrotransposon like protein [Arabidopsis t... 301 5e-80
UniRef100_Q9FL75 Retroelement pol polyprotein-like [Arabidopsis ... 301 5e-80
>UniRef100_Q9FJV3 Retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1475
Score = 424 bits (1089), Expect = e-117
Identities = 227/555 (40%), Positives = 340/555 (60%), Gaps = 38/555 (6%)
Query: 262 LPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVSSSCKASQTSSGIKYPISNYMSY 321
+ +P S S ++ ++R +RP P +L++Y + V K ++YP++ Y++Y
Sbjct: 890 IENPPSTSESAPKVSSKRESRP---PGYLQDYFCNAVPDVTK------DVRYPLNAYINY 940
Query: 322 SNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVP 381
+ LS AY +++ EP TYA+A K K W+DAM +EI ALE+ TWS+ LP P
Sbjct: 941 TQLSEEFTAYICAVNKYPEPCTYAQAKKIKEWLDAMEIEIDALESTNTWSVCSLPQGKKP 1000
Query: 382 IDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVN 441
I KWV+K+K A+G++ER+KARLVA GY Q EG+ Y+DTFSP AK+T V+ +L++A++
Sbjct: 1001 IGCKWVFKVKLNADGSLERFKARLVAKGYTQREGLDYYDTFSPVAKMTTVKTLLSVAAIK 1060
Query: 442 NWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDSG-----KVCKLHKSSYGLKQASRQW 496
W LHQLD++NAFL GDL E++YM +P G S G V KL KS YGLKQASRQW
Sbjct: 1061 EWSLHQLDISNAFLNGDLKEEIYMTLPPGYSMKQGGVLPQNPVLKLQKSLYGLKQASRQW 1120
Query: 497 YAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDN 556
Y KF+S L G+K++H+DH+LF++ G+++ LL+YVDDI++AGN + +K L
Sbjct: 1121 YLKFSSTLKKLGFKKSHADHTLFTRISGKAYIALLVYVDDIVIAGNNDENIEELKKDLAK 1180
Query: 557 AFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQ 616
AFK++DLG +KYFLGLE++ + +GIS+CQRKY ++L+ D+G+LG +P + P++PS +LSQ
Sbjct: 1181 AFKLRDLGPMKYFLGLEIARTKEGISVCQRKYTMELLEDTGLLGCRPSTIPMEPSLKLSQ 1240
Query: 617 DGGGATL*GCFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKVSE 676
H Y ++ L + + LC S + K S
Sbjct: 1241 H------------NDEHVIDNPEVYRRLVGKLMYLTITRPDITYAINRLCQFSSSPKNSH 1288
Query: 677 RESKKRVI------------FPKEFCSTIVGV**CRLGGCVDTRRSVTSYCFFIGNSLIC 724
++ ++V+ + + + G CVD+RRS + C F+G+SLI
Sbjct: 1289 LKAAQKVVHYLKGTIGLGLFYSSKSDLCLKAYTDADWGSCVDSRRSTSGICMFLGDSLIS 1348
Query: 725 WRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAA 784
W+SKKQ S SS+E+ YRA+A + E+ L LL + Q++ KP ++CD+ +A+HIA
Sbjct: 1349 WKSKKQNMASSSSAESEYRAMAMGSREIAWLVKLLAEFQVKQTKPVPLFCDSTAAIHIAN 1408
Query: 785 NPVFHERTKHLEIEC 799
N VFHERTKH+E +C
Sbjct: 1409 NAVFHERTKHIENDC 1423
>UniRef100_O65468 Hypothetical protein F21P8.50 [Arabidopsis thaliana]
Length = 1240
Score = 421 bits (1083), Expect = e-116
Identities = 246/584 (42%), Positives = 343/584 (58%), Gaps = 63/584 (10%)
Query: 237 ASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLH 296
AS+S+S D+ +PSA+ +Q +P P S HT R +P++L++Y H
Sbjct: 7 ASTSSSSIDI---MPSAN---IQNDVPEP--------SVHTSH--RRTRKPAYLQDYYCH 50
Query: 297 TVSSSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDA 356
+V+S + IS ++SY +S +H++ + ++ EP TY EA + W A
Sbjct: 51 SVASLTI---------HDISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGA 101
Query: 357 MNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGI 416
M+ EI A+E TW + LPPN PI KWVYKIK ++GT+ERYKARLVA GY Q EGI
Sbjct: 102 MDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGI 161
Query: 417 YYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSC--- 473
+ +TFSP KLT V+++LA++++ N+ LHQLD++NAFL GDL E++YMK+P G +
Sbjct: 162 DFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQG 221
Query: 474 --VDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILL 531
+ VC L KS YGLKQASRQW+ KF+ L+ G+ Q+HSDH+ F K F +L
Sbjct: 222 DSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL 281
Query: 532 IYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLD 591
+YVDDII+ N +K+ L + FK++DLG LKYFLGLE++ SA GI++CQRKY LD
Sbjct: 282 VYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALD 341
Query: 592 LVHDSGVLGSKPVSTPLDPSSRLSQDGGGATL*GCFFIQKTHRKTV-LSYYNQV*YNLCS 650
L+ ++G+LG KP S P+DPS S GG F K +R+ + Y Q+ S
Sbjct: 342 LLDETGLLGCKPSSVPMDPSVTFSAHSGGD-----FVDAKAYRRLIGRLMYLQITRLDIS 396
Query: 651 SAAKSVPLQSYCDSLCCSS*NSKV---------------SERESKKRVIFPKEFCSTIVG 695
A + S L K+ S+ E + +V F S
Sbjct: 397 FAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQS---- 452
Query: 696 V**CRLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RL 755
C DTRRS YC F+G SLI W+SKKQQ +SKSS+EA YRAL+ AT E+ L
Sbjct: 453 --------CKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWL 504
Query: 756 TYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
++LQ+ KP++++CDN +A+HIA N VFHERTKH+E +C
Sbjct: 505 AQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDC 548
>UniRef100_Q9FX79 Putative retroelement polyprotein [Arabidopsis thaliana]
Length = 1413
Score = 408 bits (1048), Expect = e-112
Identities = 232/557 (41%), Positives = 334/557 (59%), Gaps = 49/557 (8%)
Query: 261 PLPSPE-SPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVSSSCKASQTSSGIKYPISNYM 319
PLP E S S R +RP P++L++Y ++V+SS +PIS +
Sbjct: 846 PLPVQETSASNVPAEKQNSRVSRP---PAYLKDYHCNSVTSSTD---------HPISEVL 893
Query: 320 SYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNV 379
SYS+LS P+ + +++ EPHTYA+A + K W DAM +EI+ALE NGTW + LP
Sbjct: 894 SYSSLSDPYMIFINAVNKIPEPHTYAQARQIKEWCDAMGMEITALEDNGTWVVCSLPVGK 953
Query: 380 VPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALAS 439
+ KWVYKIK A+G++ERYKARLVA GY Q EG+ Y DTFSP AKLT V++++A+A+
Sbjct: 954 KAVGCKWVYKIKLNADGSLERYKARLVAKGYTQTEGLDYVDTFSPVAKLTTVKLLIAVAA 1013
Query: 440 VNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS-----CVDSGKVCKLHKSSYGLKQASR 494
W L QLD++NAFL G L E++YM +P G S VC+L KS YGLKQASR
Sbjct: 1014 AKGWSLSQLDISNAFLNGSLDEEIYMTLPPGYSPRQGDSFPPNAVCRLKKSLYGLKQASR 1073
Query: 495 QWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAAL 554
QWY KF+ L G+ Q+ DH+LF++ S+ +L+YVDDII+A + E ++ AL
Sbjct: 1074 QWYLKFSESLKALGFTQSSGDHTLFTRKSKNSYMAVLVYVDDIIIASSCDRETELLRDAL 1133
Query: 555 DNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRL 614
+ K++DLG L+YFLGLE++ + GIS+CQRKY L+L+ ++G+LG K S P++P+ +L
Sbjct: 1134 QRSSKLRDLGTLRYFLGLEIARNTDGISICQRKYTLELLAETGLLGCKSSSVPMEPNQKL 1193
Query: 615 SQDGGGATL*GCFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKV 674
SQ+ G +Y ++ L + LC + +V
Sbjct: 1194 SQEDGELI-------------DDAEHYRKLVGKLMYLTFTRPDITYAVHRLCQFTSAPRV 1240
Query: 675 SERESKKRVIFPKE-------FCSTIVGV**CRLGG--------CVDTRRSVTSYCFFIG 719
++ ++I+ + F S V + +L G C D+R+ T YC F+G
Sbjct: 1241 PHLKAVYKIIYYLKGTVGQGLFYSANVDL---KLSGFADSDFSSCSDSRKLTTGYCMFLG 1297
Query: 720 NSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSA 779
SL+ W+SKKQ+ IS SS+EA Y+A++ A E+ L +LL+DL I+ + SV+YCDN +A
Sbjct: 1298 TSLVAWKSKKQEVISMSSAEAEYKAMSMAVREMMWLRFLLEDLWIDVSEASVLYCDNTAA 1357
Query: 780 LHIAANPVFHERTKHLE 796
+HIA NPVFHERTKH+E
Sbjct: 1358 IHIANNPVFHERTKHIE 1374
>UniRef100_Q9XII7 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1454
Score = 396 bits (1018), Expect = e-108
Identities = 226/564 (40%), Positives = 323/564 (57%), Gaps = 45/564 (7%)
Query: 253 ADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVSSSCKASQTSSGIK 312
+D T LPS S +S R R P+HL +Y +T+ S K
Sbjct: 866 SDTTHSPSSLPSQISDLPPQISSQ-----RVRKPPAHLNDYHCNTMQSDHK--------- 911
Query: 313 YPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSL 372
YPIS+ +SYS +S H Y +++ P YAEA K W +A++ EI A+E TW +
Sbjct: 912 YPISSTISYSKISPSHMCYINNITKIPIPTNYAEAQDTKEWCEAVDAEIGAMEKTNTWEI 971
Query: 373 VPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVR 432
LP + KWV+ +K A+G +ERYKARLVA GY Q EG+ Y DTFSP AK+T ++
Sbjct: 972 TTLPKGKKAVGCKWVFTLKFLADGNLERYKARLVAKGYTQKEGLDYTDTFSPVAKMTTIK 1031
Query: 433 MVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS-----CVDSGKVCKLHKSSY 487
++L +++ W L QLDV+NAFL G+L E+++MK+PEG + + S V +L +S Y
Sbjct: 1032 LLLKVSASKKWFLKQLDVSNAFLNGELEEEIFMKIPEGYAERKGIVLPSNVVLRLKRSIY 1091
Query: 488 GLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEF 547
GLKQASRQW+ KF+S L++ G+K+ H DH+LF K F I+L+YVDDI++A
Sbjct: 1092 GLKQASRQWFKKFSSSLLSLGFKKTHGDHTLFLKMYDGEFVIVLVYVDDIVIASTSEAAA 1151
Query: 548 TRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTP 607
++ LD FK++DLG LKYFLGLEV+ + GIS+CQRKY L+L+ +G+L KPVS P
Sbjct: 1152 AQLTEELDQRFKLRDLGDLKYFLGLEVARTTAGISICQRKYALELLQSTGMLACKPVSVP 1211
Query: 608 LDPSSRLSQDGGGATL*GCFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCC 667
+ P+ ++ +D G + Y ++ L + + LC
Sbjct: 1212 MIPNLKMRKDDGDLI-------------EDIEQYRRIVGKLMYLTITRPDITFAVNKLCQ 1258
Query: 668 SS*NSKVSERESKKRVI------------FPKEFCSTIVGV**CRLGGCVDTRRSVTSYC 715
S + + + RV+ + T+ G C D+RRS TS+
Sbjct: 1259 FSSAPRTTHLTAAYRVLQYIKGTVGQGLFYSASSDLTLKGFADSDWASCQDSRRSTTSFT 1318
Query: 716 FFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCD 775
F+G+SLI WRSKKQ T+S+SS+EA YRALA ATCE+ L LL LQ P P ++Y D
Sbjct: 1319 MFVGDSLISWRSKKQHTVSRSSAEAEYRALALATCEMVWLFTLLVSLQASPPVP-ILYSD 1377
Query: 776 NQSALHIAANPVFHERTKHLEIEC 799
+ +A++IA NPVFHERTKH++++C
Sbjct: 1378 STAAIYIATNPVFHERTKHIKLDC 1401
>UniRef100_Q8L700 Hypothetical protein [Arabidopsis thaliana]
Length = 776
Score = 392 bits (1008), Expect = e-107
Identities = 236/596 (39%), Positives = 341/596 (56%), Gaps = 42/596 (7%)
Query: 235 VPASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYV 294
VP+SS + D S S SA T+ + +PS+ + + + R + + L+++V
Sbjct: 140 VPSSSPSRSIDRSTSDLSASDTTELLSTGESSTPSSPGLPELLGKGCREKKKSVLLKDFV 199
Query: 295 LHTVS---------------------SSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAM 333
+T S +S A S YP+S++++ S S H A+
Sbjct: 200 TNTTSKKKTASHNIHSPSQVLPSGLPTSLSADSVSGKTLYPLSDFLTNSGYSANHIAFMA 259
Query: 334 SLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRR 393
++ +EP + +A K W +AM+ EI ALEAN TW + LP I +KWVYK+K
Sbjct: 260 AILDSNEPKHFKDAILIKEWCEAMSKEIDALEANHTWDITDLPHGKKAISSKWVYKLKYN 319
Query: 394 ANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNA 453
++GT+ER+KARLV G +Q EG+ + +TF+P AKLT VR +LA+A+ +W +HQ+DV+NA
Sbjct: 320 SDGTLERHKARLVVMGNHQKEGVDFKETFAPVAKLTTVRTILAVAAAKDWEVHQMDVHNA 379
Query: 454 FLLGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAH 513
FL GDL E+VYM++P G C D KVC+L KS YGLKQA R W++K ++ L G+ Q++
Sbjct: 380 FLHGDLEEEVYMRLPPGFKCSDPSKVCRLRKSLYGLKQAPRCWFSKLSTALRNIGFTQSY 439
Query: 514 SDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLE 573
D+SLFS G + +L+YVDD+I+AGN LD R K+ L F +KDLG LKYFLGLE
Sbjct: 440 EDYSLFSLKNGDTIIHVLVYVDDLIVAGNNLDAIDRFKSQLHKCFHMKDLGKLKYFLGLE 499
Query: 574 VSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGGGA--------TL*G 625
VS G L QRKY LD+V ++G+LG KP + P+ + +L+ G L G
Sbjct: 500 VSRGPDGFCLSQRKYALDIVKETGLLGCKPSAVPIALNHKLASITGPVFTNPEQYRRLVG 559
Query: 626 CFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKVSERESKKRVIF 685
FI T + LSY + S PL ++ ++ K S + IF
Sbjct: 560 -RFIYLTITRPDLSYAVHI-----LSQFMQAPLVAHWEAALRLVRYLKGSPAQG----IF 609
Query: 686 PKEFCSTIVGV**C--RLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYR 743
+ S I+ C C TRRS+++Y ++G+S I W++KKQ T+S SS+EA YR
Sbjct: 610 LRSDSSLIINA-YCDSDYNACPLTRRSLSAYVVYLGDSPISWKTKKQDTVSYSSAEAEYR 668
Query: 744 ALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
A+A EL L LLKDL + P ++CD+++A+HIAANPVFHERTKH+E +C
Sbjct: 669 AMAYTLKELKWLKALLKDLGVHHSSPMKLHCDSEAAIHIAANPVFHERTKHIESDC 724
>UniRef100_O22175 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1496
Score = 392 bits (1008), Expect = e-107
Identities = 236/596 (39%), Positives = 341/596 (56%), Gaps = 42/596 (7%)
Query: 235 VPASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYV 294
VP+SS + D S S SA T+ + +PS+ + + + R + + L+++V
Sbjct: 860 VPSSSPSRSIDRSTSDLSASDTTELLSTGESSTPSSPGLPELLGKGCREKKKSVLLKDFV 919
Query: 295 LHTVS---------------------SSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAM 333
+T S +S A S YP+S++++ S S H A+
Sbjct: 920 TNTTSKKKTASHNIHSPSQVLPSGLPTSLSADSVSGKTLYPLSDFLTNSGYSANHIAFMA 979
Query: 334 SLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRR 393
++ +EP + +A K W +AM+ EI ALEAN TW + LP I +KWVYK+K
Sbjct: 980 AILDSNEPKHFKDAILIKEWCEAMSKEIDALEANHTWDITDLPHGKKAISSKWVYKLKYN 1039
Query: 394 ANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNA 453
++GT+ER+KARLV G +Q EG+ + +TF+P AKLT VR +LA+A+ +W +HQ+DV+NA
Sbjct: 1040 SDGTLERHKARLVVMGNHQKEGVDFKETFAPVAKLTTVRTILAVAAAKDWEVHQMDVHNA 1099
Query: 454 FLLGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAH 513
FL GDL E+VYM++P G C D KVC+L KS YGLKQA R W++K ++ L G+ Q++
Sbjct: 1100 FLHGDLEEEVYMRLPPGFKCSDPSKVCRLRKSLYGLKQAPRCWFSKLSTALRNIGFTQSY 1159
Query: 514 SDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLE 573
D+SLFS G + +L+YVDD+I+AGN LD R K+ L F +KDLG LKYFLGLE
Sbjct: 1160 EDYSLFSLKNGDTIIHVLVYVDDLIVAGNNLDAIDRFKSQLHKCFHMKDLGKLKYFLGLE 1219
Query: 574 VSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGGGA--------TL*G 625
VS G L QRKY LD+V ++G+LG KP + P+ + +L+ G L G
Sbjct: 1220 VSRGPDGFCLSQRKYALDIVKETGLLGCKPSAVPIALNHKLASITGPVFTNPEQYRRLVG 1279
Query: 626 CFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKVSERESKKRVIF 685
FI T + LSY + S PL ++ ++ K S + IF
Sbjct: 1280 -RFIYLTITRPDLSYAVHI-----LSQFMQAPLVAHWEAALRLVRYLKGSPAQG----IF 1329
Query: 686 PKEFCSTIVGV**C--RLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYR 743
+ S I+ C C TRRS+++Y ++G+S I W++KKQ T+S SS+EA YR
Sbjct: 1330 LRSDSSLIINA-YCDSDYNACPLTRRSLSAYVVYLGDSPISWKTKKQDTVSYSSAEAEYR 1388
Query: 744 ALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
A+A EL L LLKDL + P ++CD+++A+HIAANPVFHERTKH+E +C
Sbjct: 1389 AMAYTLKELKWLKALLKDLGVHHSSPMKLHCDSEAAIHIAANPVFHERTKHIESDC 1444
>UniRef100_Q9SIM3 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1461
Score = 390 bits (1002), Expect = e-107
Identities = 230/586 (39%), Positives = 344/586 (58%), Gaps = 41/586 (6%)
Query: 229 LLPETSVPASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPS 288
L P S S++ + D + P + S+ LPSP+ +T +S RR T+ P+
Sbjct: 849 LFPLASSQQSATTASDVFTPMDPLSSGNSITSHLPSPQISPSTQISK--RRITK---FPA 903
Query: 289 HLRNYVLHTVSSSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEAS 348
HL++Y + V+ +PIS+ +SYS +S H Y ++S P +Y EA
Sbjct: 904 HLQDYHCYFVNKDDS---------HPISSSLSYSQISPSHMLYINNISKIPIPQSYHEAK 954
Query: 349 KHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA* 408
K W A++ EI A+E TW + LPP + KWV+ +K A+G++ER+KAR+VA
Sbjct: 955 DSKEWCGAIDQEIGAMERTDTWEITSLPPGKKAVGCKWVFTVKFHADGSLERFKARIVAK 1014
Query: 409 GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVP 468
GY Q EG+ Y +TFSP AK+ V+++L +++ W+L+QLD++NAFL GDL E +YMK+P
Sbjct: 1015 GYTQKEGLDYTETFSPVAKMATVKLLLKVSASKKWYLNQLDISNAFLNGDLEETIYMKLP 1074
Query: 469 EGVSCVDS-----GKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQ 523
+G + + VC+L KS YGLKQASRQW+ KF++ L+ G+++ H DH+LF +
Sbjct: 1075 DGYADIKGTSLPPNVVCRLKKSIYGLKQASRQWFLKFSNSLLALGFEKQHGDHTLFVRCI 1134
Query: 524 GQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISL 583
G F +LL+YVDDI++A + AL +FK+++LG LKYFLGLEV+ +++GISL
Sbjct: 1135 GSEFIVLLVYVDDIVIASTTEQAAQSLTEALKASFKLRELGPLKYFLGLEVARTSEGISL 1194
Query: 584 CQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGG--------GATL*GCFFIQKTHRK 635
QRKY L+L+ + +L KP S P+ P+ RLS++ G L G R
Sbjct: 1195 SQRKYALELLTSADMLDCKPSSIPMTPNIRLSKNDGLLLEDKEMYRRLVGKLMYLTITRP 1254
Query: 636 TVLSYYNQV*YNLC--SSAAKSVPLQSYCDSLCCSS*NSKVSERESKKRVIFPKEFCSTI 693
+ N+ LC SSA ++ L + L + + + + + E T+
Sbjct: 1255 DITFAVNK----LCQFSSAPRTAHLAAVYKVL-------QYIKGTVGQGLFYSAEDDLTL 1303
Query: 694 VGV**CRLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL* 753
G G C D+RRS T + F+G+SLI WRSKKQ T+S+SS+EA YRALA A+CE+
Sbjct: 1304 KGYTDADWGTCPDSRRSTTGFTMFVGSSLISWRSKKQPTVSRSSAEAEYRALALASCEMA 1363
Query: 754 RLTYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
L+ LL L++ P ++Y D+ +A++IA NPVFHERTKH+EI+C
Sbjct: 1364 WLSTLLLALRVHSGVP-ILYSDSTAAVYIATNPVFHERTKHIEIDC 1408
>UniRef100_O23588 Retrotransposon like protein [Arabidopsis thaliana]
Length = 1433
Score = 387 bits (994), Expect = e-106
Identities = 212/500 (42%), Positives = 302/500 (60%), Gaps = 26/500 (5%)
Query: 315 ISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVP 374
+ ++ Y+N + P HA+ +++ P Y+EA K W DAM EI A+ TWS+V
Sbjct: 891 LQDFHCYNNTTEPFHAFINNITNAVIPQRYSEAKDFKAWCDAMKEEIGAMVRTNTWSVVS 950
Query: 375 LPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMV 434
LPPN I KWV+ IK A+G++ERYKARLVA GY Q EG+ Y +TFSP AKLT VRM+
Sbjct: 951 LPPNKKAIGCKWVFTIKHNADGSIERYKARLVAKGYTQEEGLDYEETFSPVAKLTSVRMM 1010
Query: 435 LALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS-----CVDSGKVCKLHKSSYGL 489
L LA+ W +HQLD++NAFL GDL E++YMK+P G + + +C+LHKS YGL
Sbjct: 1011 LLLAAKMKWSVHQLDISNAFLNGDLDEEIYMKIPPGYADLVGEALPPHAICRLHKSIYGL 1070
Query: 490 KQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTR 549
KQASRQWY K ++ L G++++++DH+LF K +L+YVDDI++ N D +
Sbjct: 1071 KQASRQWYLKLSNTLKGMGFQKSNADHTLFIKYANGVLMGVLVYVDDIMIVSNSDDAVAQ 1130
Query: 550 IKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLD 609
A L + FK++DLG KYFLG+E++ S KGIS+CQRKY L+L+ +G LGSKP S PLD
Sbjct: 1131 FTAELKSYFKLRDLGAAKYFLGIEIARSEKGISICQRKYILELLSTTGFLGSKPSSIPLD 1190
Query: 610 PSSRLSQDGG--------GATL*GCFFIQKTHRKTVLSYYNQV*YNLC--SSAAKSVPLQ 659
PS +L+++ G L G + R + N LC S A SV L
Sbjct: 1191 PSVKLNKEDGVPLTDSTSYRKLVGKLMYLQITRPDIAYAVN----TLCQFSHAPTSVHLS 1246
Query: 660 SYCDSLCCSS*NSKVSERESKKRVIFPKEFCSTIVGV**CRLGGCVDTRRSVTSYCFFIG 719
+ L + + + + + + + G G C D+RR V +YC FIG
Sbjct: 1247 AVHKVL-------RYLKGTVGQGLFYSADDKFDLRGYTDSDFGSCTDSRRCVAAYCMFIG 1299
Query: 720 NSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSA 779
+ L+ W+SKKQ T+S S++EA +RA++ T E+ L+ L D ++ + P+ +YCDN +A
Sbjct: 1300 DYLVSWKSKKQDTVSMSTAEAEFRAMSQGTKEMIWLSRLFDDFKVPFIPPAYLYCDNTAA 1359
Query: 780 LHIAANPVFHERTKHLEIEC 799
LHI N VFHERTK +E++C
Sbjct: 1360 LHIVNNSVFHERTKFVELDC 1379
>UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana]
Length = 1315
Score = 385 bits (990), Expect = e-105
Identities = 232/569 (40%), Positives = 329/569 (57%), Gaps = 47/569 (8%)
Query: 251 PSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVSSSCKASQTSSG 310
PS ++SV++ LPS +P+ + R +P++L++Y H+V SS
Sbjct: 722 PSDSSSSVEI-LPSA-NPTNNVPEPSVQTSHRKAKKPAYLQDYYCHSVVSSTP------- 772
Query: 311 IKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTW 370
+ I ++SY ++ P+ + L EP Y EA K + W DAM E LE TW
Sbjct: 773 --HEIRKFLSYDRINDPYLTFLACLDKTKEPSNYTEAEKLQVWRDAMGAEFDFLEGTHTW 830
Query: 371 SLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTI 430
+ LP + I +W++KIK ++G+VERYKARLVA GY Q EGI Y +TFSP AKL
Sbjct: 831 EVCSLPADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNETFSPVAKLNS 890
Query: 431 VRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS-----CVDSGKVCKLHKS 485
V+++L +A+ L QLD++NAFL GDL E++YM++P+G + + VC+L KS
Sbjct: 891 VKLLLGVAARFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYASRQGDSLPPNAVCRLKKS 950
Query: 486 SYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLD 545
YGLKQASRQWY KF+S L+ G+ Q++ DH+ F K F +L+Y+DDII+A N
Sbjct: 951 LYGLKQASRQWYLKFSSTLLGLGFIQSYCDHTCFLKISDGIFLCVLVYIDDIIIASNNDA 1010
Query: 546 EFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVS 605
+K+ + + FK++DLG LKYFLGLE+ S KGI + QRKY LDL+ ++G LG KP S
Sbjct: 1011 AVDILKSQMKSFFKLRDLGELKYFLGLEIVRSDKGIHISQRKYALDLLDETGQLGCKPSS 1070
Query: 606 TPLDPSSRLSQDGGG--------ATL*GCFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVP 657
P+DPS + D GG L G R + N++ S A +
Sbjct: 1071 IPMDPSMVFAHDSGGDFVEVGPYRRLIGRLMYLNITRPDITFAVNKL--AQFSMAPRKAH 1128
Query: 658 LQSYCDSLCCSS*N-------SKVSERESKKRVIFPKEFCSTIVGV**CRLGGCVDTRRS 710
LQ+ L S SE + K V ++ S CR D+RRS
Sbjct: 1129 LQAVYKILQYIKGTIGQGLFYSATSELQLK--VYANADYNS-------CR-----DSRRS 1174
Query: 711 VTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPS 770
+ YC F+G+SLICW+S+KQ +SKSS+EA YR+L+ AT EL LT LK+LQ+ KP+
Sbjct: 1175 TSGYCMFLGDSLICWKSRKQDVVSKSSAEAEYRSLSVATDELVWLTNFLKELQVPLSKPT 1234
Query: 771 VIYCDNQSALHIAANPVFHERTKHLEIEC 799
+++CDN++A+HIA N VFHERTKH+E +C
Sbjct: 1235 LLFCDNEAAIHIANNHVFHERTKHIESDC 1263
>UniRef100_Q5XWR5 Putative retroelement pol polyprotein-like [Solanum tuberosum]
Length = 1476
Score = 379 bits (972), Expect = e-103
Identities = 228/577 (39%), Positives = 322/577 (55%), Gaps = 27/577 (4%)
Query: 240 SNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVS 299
S+ +D P+ + +P+ SP S +VSD P R + +
Sbjct: 840 SSHTEDADAVQPAIITSEEIIPVASPPS----AVSDDHLHPPPERRRSYRTGKPPIWQKD 895
Query: 300 SSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNL 359
++ S+ YPIS+ + YS LS + Y S S+++EP Y +A+ WV AM
Sbjct: 896 FITTSTSRSNHCLYPISDNIDYSCLSSTYQCYIASSSVETEPQFYYQAANDCRWVHAMKE 955
Query: 360 EISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYF 419
EI ALE N TW +V LP I KWVYKIK +A+G +ER+KARLVA GYNQ EG+ Y
Sbjct: 956 EIQALEDNKTWEVVSLPKGKKAIGCKWVYKIKYKASGEIERFKARLVAKGYNQKEGLDYQ 1015
Query: 420 DTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDSG-- 477
+TFSP K+ +R VL LA W + Q+DV NAFL GDL E+VYM++P+G +G
Sbjct: 1016 ETFSPVVKMVTLRTVLTLAVSKGWDIQQMDVYNAFLQGDLIEEVYMQLPQGFQYDKTGDP 1075
Query: 478 KVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDI 537
KVC+L KS YGLKQASRQW K T+ L+ G++Q+H D+SL K I+LIYVDD+
Sbjct: 1076 KVCRLLKSLYGLKQASRQWNVKLTTALLAAGFQQSHLDYSLMLKRTADGIVIVLIYVDDL 1135
Query: 538 ILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSG 597
++ G+ L K L FKIKDLG L+YFLG+E + +A G+ + QRKY L+L+ D G
Sbjct: 1136 LITGSSLQLIDDAKQVLKANFKIKDLGTLRYFLGMEFARNASGMLMHQRKYALELISDLG 1195
Query: 598 VLGSKPVSTPLDPSSRLSQDGGGATL*GCFFIQKTHRKTVL---SYYNQV*YNLCSSAAK 654
+ GSKP TP++ +L+ T + + ++L + Y ++ L
Sbjct: 1196 LGGSKPSVTPVELHLKLT------TREFDLHVGSSGADSLLADPTEYQRLVGRLLYLTIT 1249
Query: 655 SVPLQSYCDSLCCSS*NSKVSERESKKRVI------------FPKEFCSTIVGV**CRLG 702
+ L KVS E+ RV+ + T+ G
Sbjct: 1250 RPDISFAVQHLSQFMHAPKVSHMEAAIRVVKYVKQAPGLGLYMAVQTADTLQAYCDADWG 1309
Query: 703 GCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDL 762
C++TR+S+T Y G++L+ W+SKKQ TIS+SS+EA YR+LAS EL LT L K+L
Sbjct: 1310 SCINTRKSITGYMIQFGSALLSWKSKKQPTISRSSAEAEYRSLASTVAELVWLTGLFKEL 1369
Query: 763 QIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
+ P +YCD+++A+ IAANPVFHERTKH++I+C
Sbjct: 1370 DMPLSLPVSLYCDSKAAIQIAANPVFHERTKHIDIDC 1406
>UniRef100_Q9LVQ2 Retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1491
Score = 377 bits (969), Expect = e-103
Identities = 232/589 (39%), Positives = 328/589 (55%), Gaps = 38/589 (6%)
Query: 236 PASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHT--RRPTRPRHQPSHLRNY 293
P S S S+ S+ +TS + SP TT + ++T R+ R Q + L++Y
Sbjct: 864 PLSPSTSVTPTQTPTNSSSSTSPSTNV-SPPQQDTTPIIENTPPRQGKRQVQQLARLKDY 922
Query: 294 VLHTVSS--------SCKASQTSSGIK----YPISNYMSYSNLSIPHHAYAMSLSLDSEP 341
+L+ S S SQ+SS I+ YP+++Y+ S H + +++ + EP
Sbjct: 923 ILYNASCTPNTPHVLSPSTSQSSSSIQGNSQYPLTDYIFDECFSAGHKVFLAAITANDEP 982
Query: 342 HTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERY 401
+ EA K K W DAM E+ ALE N TW +V LP V I ++WVYK K A+GTVERY
Sbjct: 983 KHFKEAVKVKVWNDAMYKEVDALEVNKTWDIVDLPTGKVAIGSQWVYKTKFNADGTVERY 1042
Query: 402 KARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYE 461
KARLV G NQIEG Y +TF+P K+T VR +L L + N W ++Q+DV+NAFL GDL E
Sbjct: 1043 KARLVVQGNNQIEGEDYTETFAPVVKMTTVRTLLRLVAANQWEVYQMDVHNAFLHGDLEE 1102
Query: 462 DVYMKVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSK 521
+VYMK+P G KVC+L KS YGLKQA R W+ K + L G+ Q + D+S FS
Sbjct: 1103 EVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCWFKKLSDALKRFGFIQGYEDYSFFSY 1162
Query: 522 TQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGI 581
+ +L+YVDD+I+ GN + K L F +KDLG LKYFLG+EVS GI
Sbjct: 1163 SCKGIELRVLVYVDDLIICGNDEYMVQKFKEYLGRCFSMKDLGKLKYFLGIEVSRGPDGI 1222
Query: 582 SLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGGGATL*GCFF-------IQKTHR 634
L QRKY LD++ DSG LG++P TPL+ + L+ D G F + H
Sbjct: 1223 FLSQRKYALDIISDSGTLGARPAYTPLEQNHHLASDDGPLLQDPKPFRRLVGRLLYLLHT 1282
Query: 635 KTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKVSERE----SKKRVIFPKEFC 690
+ LSY V S P +++ ++ K S + S + + + +C
Sbjct: 1283 RPELSYSVHV-----LSQFMQAPREAHLEAAMRIVRYLKGSPGQGILLSSNKDLTLEVYC 1337
Query: 691 STIVGV**CRLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATC 750
+ C TRRS+++Y +G S I W++KKQ T+S SS+EA YRA++ A
Sbjct: 1338 DS-------DFQSCPLTRRSLSAYVVLLGGSPISWKTKKQDTVSHSSAEAEYRAMSVALK 1390
Query: 751 EL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
E+ L LLK+L I P+ ++CD+++A+ IAANPVFHERTKH+E +C
Sbjct: 1391 EIKWLNKLLKELGITLAAPTRLFCDSKAAISIAANPVFHERTKHIERDC 1439
>UniRef100_Q9ZPU4 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1501
Score = 375 bits (963), Expect = e-102
Identities = 235/592 (39%), Positives = 330/592 (55%), Gaps = 41/592 (6%)
Query: 230 LPETSVPASSSNSLDDLSVS---IPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQ 286
+P+ + P+S LSVS P+ T + VP+ SP S R+ R H
Sbjct: 877 VPDDTPPSSP------LSVSPSGSPNTPTTPIVVPVASPIPVSPPK----QRKSKRATHP 926
Query: 287 PSHLRNYVL----------HTVSSSCKASQTSSGIK-YPISNYMSYSNLSIPHHAYAMSL 335
P L +YVL H + + S T G +P+++Y+S + S H AY ++
Sbjct: 927 PPKLNDYVLYNAMYTPSSIHALPADPSQSSTVPGKSLFPLTDYVSDAAFSSSHRAYLAAI 986
Query: 336 SLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRAN 395
+ + EP + EA + K W DAM E+ ALE N TW +V LPP V I ++WV+K K ++
Sbjct: 987 TDNVEPKHFKEAVQIKVWNDAMFTEVDALEINKTWDIVDLPPGKVAIGSQWVFKTKYNSD 1046
Query: 396 GTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFL 455
GTVERYKARLV G Q+EG Y +TF+P ++T VR +L + N W ++Q+DV+NAFL
Sbjct: 1047 GTVERYKARLVVQGNKQVEGEDYKETFAPVVRMTTVRTLLRNVAANQWEVYQMDVHNAFL 1106
Query: 456 LGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSD 515
GDL E+VYMK+P G KVC+L KS YGLKQA R W+ K + L+ G+ Q++ D
Sbjct: 1107 HGDLEEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCWFKKLSDSLLRFGFVQSYED 1166
Query: 516 HSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVS 575
+SLFS T+ +LIYVDD+++ GN + K L F +KDLG LKYFLG+EVS
Sbjct: 1167 YSLFSYTRNNIELRVLIYVDDLLICGNDGYMLQKFKDYLSRCFSMKDLGKLKYFLGIEVS 1226
Query: 576 HSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPL--------DPSSRLSQDGGGATL*GCF 627
+GI L QRKY LD++ DSG LGS+P TPL D LS L G
Sbjct: 1227 RGPEGIFLSQRKYALDVIADSGNLGSRPAHTPLEQNHHLASDDGPLLSDPKPYRRLVGRL 1286
Query: 628 FIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKVSERESKKRVIFPK 687
+ H + LSY V + P +++ D+ K S + ++
Sbjct: 1287 -LYLLHTRPELSYSVHVLAQFMQN-----PREAHFDAALRVVRYLKGSPGQG---ILLNA 1337
Query: 688 EFCSTIVGV**CRLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALAS 747
+ T+ C TRRS+++Y +G S I W++KKQ T+S SS+EA YRA++
Sbjct: 1338 DPDLTLEVYCDSDWQSCPLTRRSISAYVVLLGGSPISWKTKKQDTVSHSSAEAEYRAMSY 1397
Query: 748 ATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
A E+ L LLK+L IE P+ +YCD+++A+HIAANPVFHERTKH+E +C
Sbjct: 1398 ALKEIKWLRKLLKELGIEQSTPARLYCDSKAAIHIAANPVFHERTKHIESDC 1449
>UniRef100_Q8W153 Polyprotein [Oryza sativa]
Length = 1472
Score = 370 bits (949), Expect = e-100
Identities = 205/497 (41%), Positives = 295/497 (59%), Gaps = 15/497 (3%)
Query: 309 SGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANG 368
SG + I+NY+SY++LS + A+ SL+ P + EA + W AM E+ ALE N
Sbjct: 931 SGDENDIANYVSYTSLSSTYRAFVASLNSAIIPKDWKEAKQDPRWHQAMLDELEALEKNK 990
Query: 369 TWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKL 428
TW LV P ++ KWVY +K+ +G VERYKARLVA GY+Q GI Y +TF+P AK+
Sbjct: 991 TWDLVSYPNGKKVVNCKWVYAVKQNPDGKVERYKARLVAKGYSQTYGIDYDETFAPVAKM 1050
Query: 429 TIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDS-GKVCKLHKSSY 487
+ VR +++ A +W LHQLDV NAFL GDL E+VYM++P G + + + GKV +L KS Y
Sbjct: 1051 STVRTIISCAVNFDWPLHQLDVKNAFLHGDLQEEVYMEIPPGFATLQTKGKVLRLKKSLY 1110
Query: 488 GLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEF 547
GLKQ+ R W+ +F + GYKQ + DH++F G TIL +YVDD+I+ GN E
Sbjct: 1111 GLKQSPRAWFDRFRRAMCAMGYKQCNGDHTVFYHHSGDHITILAVYVDDMIITGNDCSEI 1170
Query: 548 TRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTP 607
TR+K L F++KDLG LKYFLG+E++ S +GI L QRKY LDL+ D+G+LG +P STP
Sbjct: 1171 TRLKQNLSKEFEVKDLGQLKYFLGIEIARSPRGIVLSQRKYALDLLSDTGMLGCRPASTP 1230
Query: 608 LDPSSRLSQDGGGATL*GCF------FIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSY 661
+D + +L + G + I H + ++Y + S P +
Sbjct: 1231 VDQNHKLCAESGNPVNKERYQRLVGRLIYLCHTRPDITYAVSM-----VSRYMHDPRSGH 1285
Query: 662 CDSLCCSS*NSKVSERESKKRVIFPKEFCSTIVGV**CRLGGCVDTRRSVTSYCFFIGNS 721
D++ + + K + F K + G C D RRS + YC F+G +
Sbjct: 1286 MDAVYRI---LRYLKGSPGKGLWFKKNGHLEVEGYCDAHWASCPDDRRSTSGYCVFVGGN 1342
Query: 722 LICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALH 781
L+ WRSKKQ +S+S++EA YRA++ + EL L LL +L + P ++CDN+SA+
Sbjct: 1343 LVSWRSKKQPVVSRSTAEAEYRAMSVSLSELLWLRNLLSELMLPVDTPMKLWCDNKSAIS 1402
Query: 782 IAANPVFHERTKHLEIE 798
IA NPV H+RTKH+E++
Sbjct: 1403 IANNPVQHDRTKHVELD 1419
>UniRef100_Q7X6S0 OSJNBb0011N17.2 protein [Oryza sativa]
Length = 1262
Score = 369 bits (946), Expect = e-100
Identities = 205/497 (41%), Positives = 295/497 (59%), Gaps = 15/497 (3%)
Query: 309 SGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANG 368
SG + I+NY+SY++LS + A+ SL+ P + EA + W AM E+ ALE N
Sbjct: 721 SGDENDIANYVSYTSLSSTYKAFVASLNSAIIPKDWKEAKQDPRWHQAMLDELEALEKNK 780
Query: 369 TWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKL 428
TW LV P ++ KWVY +K+ +G VERYKARLVA GY+Q GI Y +TF+P AK+
Sbjct: 781 TWDLVSYPNGKKVVNCKWVYAVKQNPDGKVERYKARLVAKGYSQTYGIDYDETFAPVAKM 840
Query: 429 TIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDS-GKVCKLHKSSY 487
+ VR +++ A +W LHQLDV NAFL GDL E+VYM++P G + + + GKV +L KS Y
Sbjct: 841 STVRTIISCAVNFDWPLHQLDVKNAFLHGDLQEEVYMEIPPGFATLQTKGKVLRLKKSLY 900
Query: 488 GLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEF 547
GLKQ+ R W+ +F + GYKQ + DH++F G TIL +YVDD+I+ GN E
Sbjct: 901 GLKQSPRAWFDRFRRAMCAMGYKQCNGDHTVFYHHSGDHITILAVYVDDMIITGNDCSEI 960
Query: 548 TRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTP 607
TR+K L F++KDLG LKYFLG+E++ S +GI L QRKY LDL+ D+G+LG +P STP
Sbjct: 961 TRLKQNLSKEFEVKDLGQLKYFLGIEIARSPRGIVLSQRKYALDLLSDTGMLGCRPASTP 1020
Query: 608 LDPSSRLSQDGGGATL*GCF------FIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSY 661
+D + +L + G + I H + ++Y + S P +
Sbjct: 1021 VDQNHKLCAESGNPVNKERYQRLVGRLIYLCHTRPDITYAVSM-----VSRYMHDPRSGH 1075
Query: 662 CDSLCCSS*NSKVSERESKKRVIFPKEFCSTIVGV**CRLGGCVDTRRSVTSYCFFIGNS 721
D++ + + K + F K + G C D RRS + YC F+G +
Sbjct: 1076 MDAVYRI---LRYLKGSPGKGLWFKKNGHLEVEGYCDADWASCPDDRRSTSGYCVFVGGN 1132
Query: 722 LICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALH 781
L+ WRSKKQ +S+S++EA YRA++ + EL L LL +L + P ++CDN+SA+
Sbjct: 1133 LVSWRSKKQPVVSRSTAEAEYRAMSVSLSELLWLRNLLSELMLPVDTPMKLWCDNKSAIS 1192
Query: 782 IAANPVFHERTKHLEIE 798
IA NPV H+RTKH+E++
Sbjct: 1193 IANNPVQHDRTKHVELD 1209
>UniRef100_Q9C692 Polyprotein, putative [Arabidopsis thaliana]
Length = 1468
Score = 335 bits (859), Expect = 3e-90
Identities = 173/390 (44%), Positives = 251/390 (64%), Gaps = 8/390 (2%)
Query: 229 LLPETSVPASSSN---SLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRH 285
++PE + +SS + SL L + S+ + +PL S TT RR +R
Sbjct: 849 IIPEINQESSSPSEFVSLSSLDPFLASSTVQTADLPLSS-----TTPAPIQLRRSSRQTQ 903
Query: 286 QPSHLRNYVLHTVSSSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYA 345
+P L+N+V +TVS + + SS YPI Y+ + H A+ +++ EP TY
Sbjct: 904 KPMKLKNFVTNTVSVESISPEASSSSLYPIEKYVDCHRFTSSHKAFLAAVTAGMEPTTYN 963
Query: 346 EASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARL 405
EA K W +AM+ EI +L N T+S+V LPP + NKWVYKIK R++G +ERYKARL
Sbjct: 964 EAMVDKAWREAMSAEIESLRVNQTFSIVNLPPGKRALGNKWVYKIKYRSDGAIERYKARL 1023
Query: 406 VA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYM 465
V G Q EG+ Y +TF+P AK++ VR+ L +A+ +WH+HQ+DV+NAFL GDL E+VYM
Sbjct: 1024 VVLGNCQKEGVDYDETFAPVAKMSTVRLFLGVAAARDWHVHQMDVHNAFLHGDLKEEVYM 1083
Query: 466 KVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQ 525
K+P+G C D KVC+LHKS YGLKQA R W++K +S L G+ Q+ SD+SLFS
Sbjct: 1084 KLPQGFQCDDPSKVCRLHKSLYGLKQAPRCWFSKLSSALKQYGFTQSLSDYSLFSYNNDG 1143
Query: 526 SFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQ 585
F +L+YVDD+I++G+ D + K+ L++ F +KDLG+LKYFLG+EVS +A+G L Q
Sbjct: 1144 IFVHVLVYVDDLIISGSCPDAVAQFKSYLESCFHMKDLGLLKYFLGIEVSRNAQGFYLSQ 1203
Query: 586 RKYCLDLVHDSGVLGSKPVSTPLDPSSRLS 615
RKY LD++ + G+LG++P + PL+ + +LS
Sbjct: 1204 RKYVLDIISEMGLLGARPSAFPLEQNHKLS 1233
Score = 92.4 bits (228), Expect = 5e-17
Identities = 46/96 (47%), Positives = 68/96 (69%)
Query: 704 CVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQ 763
C TRRS+T Y +G++ I W++KKQ T+S+SS+EA YRA+A T EL L +L DL
Sbjct: 1321 CPLTRRSLTGYFVQLGDTPISWKTKKQPTVSRSSAEAEYRAMAFLTQELMWLKRVLYDLG 1380
Query: 764 IEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
+ V+ I+ D++SA+ ++ NPV HERTKH+E++C
Sbjct: 1381 VSHVQAMRIFSDSKSAIALSVNPVQHERTKHVEVDC 1416
>UniRef100_Q9MAJ8 F27F5.19 [Arabidopsis thaliana]
Length = 1309
Score = 305 bits (782), Expect = 3e-81
Identities = 168/396 (42%), Positives = 239/396 (59%), Gaps = 23/396 (5%)
Query: 229 LLPETSVPASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPS 288
L P PA DDL + + TS+ P + S+ ++ + R + P
Sbjct: 776 LFPLLQFPAKP----DDLPL-----EQTSLSDAHPHQDVSSSKALVPFDPQSKRQKKPPK 826
Query: 289 HLRNYVLHTVSSSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEAS 348
H +++ + +S+ I YPI +Y+SYS + P HA+ +++ P Y+EA
Sbjct: 827 HFQDFHCYNNTST---------ILYPIKDYISYSYIVEPFHAFINNITNAVVPQRYSEAK 877
Query: 349 KHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA* 408
K W DAM EI A+ TWS+V LPPN I KWV+ IK A+G++ERYKARLVA
Sbjct: 878 DFKAWCDAMKEEIGAMIQTNTWSVVSLPPNKKAIGCKWVFTIKHNADGSIERYKARLVAK 937
Query: 409 GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVP 468
GY Q E + Y +TFSP AKLT VRM+L LA+ W + QLD++NAFL GDL E++YMK+P
Sbjct: 938 GYTQEESLDYEETFSPVAKLTSVRMMLLLAAKMKWSVLQLDISNAFLNGDLDEEIYMKIP 997
Query: 469 EGVS-----CVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQ 523
G + + VC+LHKS YGLKQASRQWY K ++ L G++++++DH+LF K
Sbjct: 998 PGYADLIGESLPPHAVCRLHKSIYGLKQASRQWYLKLSNTLKGMGFQKSNADHTLFIKFA 1057
Query: 524 GQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISL 583
+L+YVDDI++ N + T+ L + FK++DL KYF G+E++ SAKGIS+
Sbjct: 1058 SGVLMGVLVYVDDIMIVSNSDNAVTQFTTELKSYFKLRDLSAAKYFFGIEIARSAKGISI 1117
Query: 584 CQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGG 619
CQRKY L+L+ +G LGSKP S PLD S +L+++ G
Sbjct: 1118 CQRKYILELLSTTGFLGSKPSSIPLDTSVKLNKEDG 1153
>UniRef100_Q9ZPG3 F5K24.2 protein [Arabidopsis thaliana]
Length = 1366
Score = 304 bits (779), Expect = 6e-81
Identities = 208/606 (34%), Positives = 317/606 (51%), Gaps = 76/606 (12%)
Query: 203 LIISQKQCHGLFS-HLRMRILRALVLILLPETSVPASS-SNSLDDLSVSIPSADATSVQV 260
L IS+KQ F + +L V ++ T+VPA + + SL LS ++ + +
Sbjct: 766 LPISEKQKENRFQIYDYFNVLNLEVCPVIEPTTVPAHTHTRSLAPLSTTVTNDQFGNDM- 824
Query: 261 PLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVSSSCKASQTSSGIKYPISNYMS 320
D+T P + PS+L Y H + + S + G + +S+++S
Sbjct: 825 --------------DNTLMPRKETRAPSYLSQY--HCSNVLKEPSSSLHGTAHSLSSHLS 868
Query: 321 YSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVV 380
Y LS + + ++ + EP T+ EA+ + W+DAMN+E+ AL + T + L
Sbjct: 869 YDKLSNEYRLFCFAIIAEKEPTTFKEAALLQKWLDAMNVELDALVSTSTREICSLHDGKR 928
Query: 381 PIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASV 440
I KWV+KIK +++GT+ERYKARLVA GY Q EG+ Y DTFSP AKLT VR++LALA++
Sbjct: 929 AIGCKWVFKIKYKSDGTIERYKARLVANGYTQQEGVDYIDTFSPIAKLTSVRLILALAAI 988
Query: 441 NNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS-----CVDSGKVCKLHKSSYGLKQASRQ 495
+NW + Q+DV NAFL GD E++YM++P+G + + VC+L KS YGLKQASRQ
Sbjct: 989 HNWSISQMDVTNAFLHGDFEEEIYMQLPQGYTPRKGELLPKRPVCRLVKSLYGLKQASRQ 1048
Query: 496 WYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALD 555
W+ KF+ +L+ G+ Q+ D +LF + + +F LL+YVDDI+L N +K L
Sbjct: 1049 WFHKFSGVLIQNGFMQSLFDPTLFVRVREDTFLALLVYVDDIMLVSNKDSAVIEVKQILA 1108
Query: 556 NAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLS 615
FK+KDLG +YFLGLE++ S +GIS+ QRKY L+L+ + G LG KPV TP++ + +LS
Sbjct: 1109 KEFKLKDLGQKRYFLGLEIARSKEGISISQRKYALELLEEFGFLGCKPVPTPMELNLKLS 1168
Query: 616 QDGGGATL*GCFFIQKTHRKTVLSYYNQV*YNLCSSAAK-----SVPLQSYCDSLCCSS* 670
Q+ G L + + R L Y ++C + K S P + + L +
Sbjct: 1169 QEDGALLLDASHYRKLIGR---LVYLTVTRPDICFAVNKLNQYMSAPREPH---LMAARR 1222
Query: 671 NSKVSERESKKRVIFPKEFCSTIVGV**CRLGGCVDTRRSVTSYCFFIGNSLICWRSKKQ 730
+ + + + V +P T C ++ S+ S++ W
Sbjct: 1223 ILRYLKNDPGQGVFYPASSTLTFRAFADADWSNCPESSISI---------SIVFW----- 1268
Query: 731 QTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYC--DNQSALHIAANPVF 788
K S+EA +L+ L P I+ D++SALHIA N VF
Sbjct: 1269 ---LKLSTEA----------------WLVLSL------PDTIFVYYDDESALHIAKNSVF 1303
Query: 789 HERTKH 794
HE TK+
Sbjct: 1304 HESTKN 1309
>UniRef100_Q9SJ99 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1156
Score = 303 bits (775), Expect = 2e-80
Identities = 172/405 (42%), Positives = 238/405 (58%), Gaps = 19/405 (4%)
Query: 232 ETSVPASSSNSLDDLSVSIPSADATSV----QVPLPSPESPSTTSVSDHTRRPTRPRHQP 287
++S P + S D L+ S D S PL +SP + S R+ R Q
Sbjct: 515 DSSTPDKNLASGDTLAQIDDSPDIVSTPNRNNQPLFVVDSPFVEATSPRQRK--RQIRQS 572
Query: 288 SHLRNYVLHTVSSSC--------KASQTSSGIK-----YPISNYMSYSNLSIPHHAYAMS 334
L++YVL+ + S +SQ+SS ++ YP+S+Y+S S H A+ +
Sbjct: 573 VRLQDYVLYNATVSPINPHALPDSSSQSSSMVQGTSSLYPLSDYVSDDCFSAGHKAFLAA 632
Query: 335 LSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRA 394
++ + EP + EA + K W DAM E+ ALE N TW +V LPP V I ++WVYK K A
Sbjct: 633 ITANDEPKHFKEAVRIKVWNDAMFKEVDALEINKTWDIVDLPPGKVAIGSQWVYKTKYNA 692
Query: 395 NGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAF 454
+G++ERYKARLV G Q+EG Y +TF+P K+T VR +L L + N W ++Q+DVNNAF
Sbjct: 693 DGSIERYKARLVVQGNKQVEGEDYNETFAPVVKMTTVRTLLRLVAANQWEVYQMDVNNAF 752
Query: 455 LLGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHS 514
L GDL E+VYMK+P G KVC+L KS YGLKQA R W+ K + L+ G+ Q H
Sbjct: 753 LHGDLDEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCWFKKLSDALLRFGFVQGHE 812
Query: 515 DHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEV 574
D+S FS T+ +L+YVDD+++ GN + K L F +KDLG LKYFLG+EV
Sbjct: 813 DYSFFSYTRNGIELRVLVYVDDLLICGNDGYMLQKFKEYLGRCFSMKDLGKLKYFLGIEV 872
Query: 575 SHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGG 619
S ++GI L QRKY LD++ DSG LG +P TPL+ + L+ D G
Sbjct: 873 SRGSEGIFLSQRKYALDIITDSGNLGCRPALTPLEQNHHLATDDG 917
Score = 75.9 bits (185), Expect = 4e-12
Identities = 42/96 (43%), Positives = 56/96 (57%), Gaps = 21/96 (21%)
Query: 704 CVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQ 763
C TRRS+++Y +G S I W++KKQ T+S SS+EA YRA++ A E+ L LLK+L
Sbjct: 1001 CPKTRRSLSAYVVLLGGSPISWKTKKQDTVSHSSAEAEYRAMSVALREIKWLRKLLKEL- 1059
Query: 764 IEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
ANPVFHERTKH+E +C
Sbjct: 1060 --------------------ANPVFHERTKHIESDC 1075
>UniRef100_O65452 LTR retrotransposon like protein [Arabidopsis thaliana]
Length = 1109
Score = 301 bits (771), Expect = 5e-80
Identities = 148/304 (48%), Positives = 212/304 (69%), Gaps = 1/304 (0%)
Query: 312 KYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWS 371
++P S MS + + H A+ +++ EP TY EA K W +AM+ EI +L N T+S
Sbjct: 572 EFPYSK-MSCNRFTSSHKAFLAAVTAGMEPTTYNEAMVDKAWREAMSAEIESLRVNQTFS 630
Query: 372 LVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIV 431
+V LPP + NKWVYKIK R++G +ERYKARLV G Q EG+ Y +TF+P AK++ V
Sbjct: 631 IVNLPPGKRALGNKWVYKIKYRSDGAIERYKARLVVLGNCQKEGVDYDETFAPVAKMSTV 690
Query: 432 RMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQ 491
R+ L +A+ +WH+HQ+DV+NAFL GDL E+VYMK+P+G C D KVC+LHKS YGLKQ
Sbjct: 691 RLFLGVAAARDWHVHQMDVHNAFLHGDLKEEVYMKLPQGFQCDDPSKVCRLHKSLYGLKQ 750
Query: 492 ASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIK 551
A R W++K +S L G+ Q+ SD+SLFS F +L+YVDD+I++G+ D + K
Sbjct: 751 APRCWFSKLSSALKQYGFTQSLSDYSLFSYNNDGVFVHVLVYVDDLIISGSCPDAVAQFK 810
Query: 552 AALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPS 611
+ L++ F +KDLG+LKYFLG+EVS +A+G L QRKY LD++ + G+LG++P + PL+ +
Sbjct: 811 SYLESCFHMKDLGLLKYFLGIEVSRNAQGFYLSQRKYVLDIISEMGLLGARPSAFPLEQN 870
Query: 612 SRLS 615
+LS
Sbjct: 871 HKLS 874
Score = 92.8 bits (229), Expect = 4e-17
Identities = 47/96 (48%), Positives = 68/96 (69%)
Query: 704 CVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQ 763
C TRRS+T Y +G++ I W++KKQ TIS+SS+EA YRA+A T EL L +L DL
Sbjct: 962 CPLTRRSLTGYFVQLGDTPISWKTKKQPTISRSSAEAEYRAMAFLTQELMWLKRVLYDLG 1021
Query: 764 IEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
+ V+ I+ D++SA+ ++ NPV HERTKH+E++C
Sbjct: 1022 VSHVQAMRIFSDSKSAIALSVNPVQHERTKHVEVDC 1057
>UniRef100_Q9FL75 Retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1109
Score = 301 bits (771), Expect = 5e-80
Identities = 148/304 (48%), Positives = 212/304 (69%), Gaps = 1/304 (0%)
Query: 312 KYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWS 371
++P S MS + + H A+ +++ EP TY EA K W +AM+ EI +L N T+S
Sbjct: 572 EFPYSK-MSCNRFTSSHKAFLAAVTAGMEPTTYNEAMVDKAWREAMSAEIESLRVNQTFS 630
Query: 372 LVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIV 431
+V LPP + NKWVYKIK R++G +ERYKARLV G Q EG+ Y +TF+P AK++ V
Sbjct: 631 IVNLPPGKRALGNKWVYKIKYRSDGAIERYKARLVVLGNCQKEGVDYDETFAPVAKMSTV 690
Query: 432 RMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQ 491
R+ L +A+ +WH+HQ+DV+NAFL GDL E+VYMK+P+G C D KVC+LHKS YGLKQ
Sbjct: 691 RLFLGVAAARDWHVHQMDVHNAFLHGDLKEEVYMKLPQGFQCDDPSKVCRLHKSLYGLKQ 750
Query: 492 ASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIK 551
A R W++K +S L G+ Q+ SD+SLFS F +L+YVDD+I++G+ D + K
Sbjct: 751 APRCWFSKLSSALKQYGFTQSLSDYSLFSYNNDGVFVHVLVYVDDLIISGSCPDAVAQFK 810
Query: 552 AALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPS 611
+ L++ F +KDLG+LKYFLG+EVS +A+G L QRKY LD++ + G+LG++P + PL+ +
Sbjct: 811 SYLESCFHMKDLGLLKYFLGIEVSRNAQGFYLSQRKYVLDIISEMGLLGARPSAFPLEQN 870
Query: 612 SRLS 615
+LS
Sbjct: 871 HKLS 874
Score = 92.4 bits (228), Expect = 5e-17
Identities = 46/96 (47%), Positives = 68/96 (69%)
Query: 704 CVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQ 763
C TRRS+T Y +G++ I W++KKQ T+S+SS+EA YRA+A T EL L +L DL
Sbjct: 962 CPLTRRSLTGYFVQLGDTPISWKTKKQPTVSRSSAEAEYRAMAFLTQELMWLKRVLYDLG 1021
Query: 764 IEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
+ V+ I+ D++SA+ ++ NPV HERTKH+E++C
Sbjct: 1022 VSHVQAMRIFSDSKSAIALSVNPVQHERTKHVEVDC 1057
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.338 0.145 0.471
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,189,591,307
Number of Sequences: 2790947
Number of extensions: 46368661
Number of successful extensions: 208913
Number of sequences better than 10.0: 1462
Number of HSP's better than 10.0 without gapping: 1317
Number of HSP's successfully gapped in prelim test: 147
Number of HSP's that attempted gapping in prelim test: 204397
Number of HSP's gapped (non-prelim): 2601
length of query: 799
length of database: 848,049,833
effective HSP length: 136
effective length of query: 663
effective length of database: 468,481,041
effective search space: 310602930183
effective search space used: 310602930183
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 79 (35.0 bits)
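
Note: the footer values above can be cross-checked against the scores reported for the individual hits. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2 and E = search_space * 2^(-S')) and the usual length correction (the effective HSP length is subtracted once from the query and once per database sequence). The function names are hypothetical. The footer prints lambda and K rounded to about three significant digits, so recomputed bit scores can differ from the reported ones by a fraction of a bit.

```python
import math

# Values copied from the report footer above.
QUERY_LEN = 799
DB_LETTERS = 848_049_833
DB_SEQS = 2_790_947
EFF_HSP_LEN = 136
GAPPED_LAMBDA = 0.267   # rounded as printed in the footer
GAPPED_K = 0.0410       # rounded as printed in the footer


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Effective query length times effective database length."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    space = effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN)
    bits = bit_score(859)  # raw score of the first Q9C692 HSP
    print("effective search space:", space)
    print("bit score for raw 859: %.1f" % bits)
    print("E-value: %.1e" % expect_value(bits, space))
```

Run against the first UniRef100_Q9C692 HSP (raw score 859), this reproduces the footer's effective search space exactly and gives a bit score and E-value close to the reported 335 bits and 3e-90.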