
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144731.14 + phase: 0 /pseudo
(1343 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum] 799 0.0
emb|CAC95126.1| gag-pol polyprotein [Populus deltoides] 358 5e-97
pir||F86470 probable retroelement polyprotein [imported] - Arabi... 298 1e-78
ref|XP_476167.1| putative polyprotein [Oryza sativa (japonica cu... 284 2e-74
gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsi... 275 6e-72
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ... 271 1e-70
gb|AAD15534.1| putative retroelement pol polyprotein [Arabidopsi... 269 4e-70
emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7... 265 6e-69
gb|AAT38747.1| putative polyprotein [Solanum demissum] 262 5e-68
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha... 262 7e-68
gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis ... 259 4e-67
gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsi... 259 6e-67
gb|AAT40550.1| putative receptor kinase [Solanum demissum] 258 1e-66
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] 257 2e-66
emb|CAB81170.1| retrotransposon like protein [Arabidopsis thalia... 254 1e-65
emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|49720... 252 7e-65
ref|XP_475401.1| putative polyprotein [Oryza sativa (japonica cu... 247 2e-63
gb|AAK51235.1| polyprotein [Arabidopsis thaliana] 239 5e-61
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop... 238 1e-60
dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsi... 226 4e-57
>gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]
Length = 1212
Score = 799 bits (2064), Expect = 0.0
Identities = 382/666 (57%), Positives = 497/666 (74%), Gaps = 12/666 (1%)
Query: 4 MTSEKAKDFCVRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAPTEKAALEEWEYKDAQ 63
M S + F VRFTGKNY +WEFQF+++V G +LW ++D APT+ L EW+ KDA+
Sbjct: 1 MNSHHFESFSVRFTGKNYSSWEFQFQLFVTGKELWGYIDGSDPAPTDATKLGEWKIKDAR 60
Query: 64 IISWILSSIDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLELEIANYKQGNLYVQ 123
+++WIL SIDP ++ NLR + T + MW+YL+++YNQDN+A+RFQLE EIANY QG L+VQ
Sbjct: 61 VMTWILGSIDPLIVLNLRPYKTVKAMWDYLQKVYNQDNSARRFQLEYEIANYSQGGLFVQ 120
Query: 124 EFYSGFLNLWTEHSAIIHADVPKASLAAVQEVYNTSRRDQFLMKLRPEFEVVRGALLNRN 183
+++SGF NLW E + I++A +P SL+ +Q V+ S+RDQFLMKLR +FE +R L+NR+
Sbjct: 121 DYFSGFQNLWAEFTDIVYAKIPTESLSVIQAVHEQSKRDQFLMKLRSDFESIRSNLMNRD 180
Query: 184 PVPSLDTCVGELLREEQRLLTQGTMSHDAFISEPVPVAYAAQSRGKGRDMRQVQCFTCKQ 243
P PSLD C ELLREEQRL+TQ + V VA+AAQ +GKGRDM + QC++CK+
Sbjct: 181 PSPSLDVCFRELLREEQRLVTQNVFKKE----NDVTVAFAAQGKGKGRDMSRTQCYSCKE 236
Query: 244 FGHVARSCTAKFCKYCKQNGHVIFDCPIRPPRRTQYPTQALHATTSSAAPPTITSASDGG 303
+GH+A +C+ KF YCKQ GH+I +CP+RP R ++A + T ++S G
Sbjct: 237 YGHIASNCSKKFYNYCKQQGHIIKECPMRPQNRR------INAFQARINGSTDDNSSLGQ 290
Query: 304 SLQPEMIQQMVLAALSNMGIHGKSSNVSRPWFLDSGASNHMTGSSEYLHNLHSYHGNQQI 363
L PEM+QQM+++A S +G+ G S W +DSGASNHMT S+ L N+ Y G QI
Sbjct: 291 VLTPEMVQQMIVSAFSALGLQGNDVT-SNFWIVDSGASNHMTNSTSILKNVRKYQGPSQI 349
Query: 364 QIADGNKLSITDVGDINSDFQDVLVSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVS 423
QIA+G+ L IT VGDI F++V VSP L+++L+SVGQLVDNNC+VNFSR GCLVQ+QVS
Sbjct: 350 QIANGSNLPITKVGDITPTFKNVFVSPKLSTSLISVGQLVDNNCDVNFSRNGCLVQDQVS 409
Query: 424 GKVIAKGPKVGRLFPLQF-ISSHLSLACNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLG 482
G +IAKGPKVGRLFP+ F I LS AC + + E WH++LGHPNS VLSH+ +GLLG
Sbjct: 410 GTIIAKGPKVGRLFPIHFSIPPVLSFACTSTASKTEVWHKRLGHPNSVVLSHISNSGLLG 469
Query: 483 NKQVVCTASISCPVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVT 542
NK ASI C CKL KSKTLPFP+ RA+ CF++IHSDVWG+SPI SHAH+KYF+T
Sbjct: 470 NKNKFSVASIDCSTCKLGKSKTLPFPNFGSRATKCFDVIHSDVWGISPIISHAHFKYFMT 529
Query: 543 FIDDYSRFTWIYFLRSKSEVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQ 602
FIDDYSRFTW+YFLRSKSEVFSMFK FL Y+ETQF +K+ RS+SGGEYMS+EF+++L
Sbjct: 530 FIDDYSRFTWVYFLRSKSEVFSMFKTFLAYIETQFSTCIKLLRSDSGGEYMSYEFKKFLL 589
Query: 603 HKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEALSTVCF*LIVFL 662
KGI+SQ SCP TPQQNG+AERKNRHLLDVTR+LL+++SVP ++WVEALST + +
Sbjct: 590 DKGIVSQHSCPYTPQQNGVAERKNRHLLDVTRTLLIESSVPSKYWVEALSTAVYLINRLP 649
Query: 663 LQLLNL 668
++LNL
Sbjct: 650 SKVLNL 655
>emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]
Length = 1382
Score = 358 bits (920), Expect = 5e-97
Identities = 230/681 (33%), Positives = 354/681 (51%), Gaps = 63/681 (9%)
Query: 14 VRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAPT-----EKAALEEWEYKDAQIISWI 68
VR GKNY W + R ++KG K+W ++ P + +++ WE +A+II+WI
Sbjct: 14 VRLDGKNYSYWSYVMRNFLKGKKMWGYVSGTYVVPKNTEEGDTVSIDTWEANNAKIITWI 73
Query: 69 LSSIDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLELEIANYKQGNLYVQEFYSG 128
+ ++ + L + TA+E+W++L+R++ Q N AK++QLE +I Q N+ +QEFYS
Sbjct: 74 NNYVEHSIGTQLAKYETAKEVWDHLQRLFTQSNFAKQYQLENDIRALHQKNMSIQEFYSA 133
Query: 129 FLNLWTEHSAIIHADVPKASLAAVQEVYNTSRRDQFLMKLRPEFEVVRGALLNRNPVPSL 188
+LW + + + V + A E R QFL LR +FE +RG++L+R+P+PS+
Sbjct: 134 MTDLWDQLA--LTESVELKACGAYIERREQQRLVQFLTALRSDFEGLRGSILHRSPLPSV 191
Query: 189 DTCVGELLREEQRLLTQGTMSHDAFISEPVPVAYAAQSRGKGRDMRQV-------QCFTC 241
D+ V ELL EE RL + S +S P A S+ + +C C
Sbjct: 192 DSVVSELLAEEIRLQSY---SEKGILSASNPSVLAVPSKPFSNHQNKPYTRVGFDECSFC 248
Query: 242 KQFGHVARSCTA--------KFCKYCKQNGHVIFDCPIRPPRRTQYPTQALHATTSSAAP 293
KQ GH C K + N H R P+ + P H T + A+P
Sbjct: 249 KQKGHWKAQCPKLRQQNQAWKSGSQSQSNAH-------RSPQGYKPPH---HNTAAVASP 298
Query: 294 PTITSASDGG-------SLQPEMIQQMVLAALSNMGIHGKSSNVSRPWFLDSGASNHMTG 346
+IT + SLQP+ + + L H S W LDSGAS+HM+
Sbjct: 299 GSITDPNTLAEQFQKFLSLQPQAMSASSIGQLP----HSSSGISHSEWVLDSGASHHMSP 354
Query: 347 SSEYLHNLHSYHGNQQIQIADGNKLSITDVGDI---NSDFQDVLVSPGLASNLLSVGQLV 403
S ++ S + + ADG + + VG + + +V + P L NL S+GQ+
Sbjct: 355 DSSSFTSV-SPLSSIPVMTADGTPMPLAGVGSVVTLHLSLPNVYLIPKLKLNLASIGQIC 413
Query: 404 DN-NCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQFISSHLSLACNNV--------- 453
D+ + V FS + C VQ+ S K+I G + L+ L + + +A V
Sbjct: 414 DSGDYLVMFSGSFCCVQDLQSQKLIGTGRRENGLYILDELKVPVVVAATTVDLSFFRLSL 473
Query: 454 -LNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCPVCKLAKSKTLPFPSGAH 512
+S+ WH +LGH +S+ L L TG LGN + C S C CKLAK LPF
Sbjct: 474 SSSSFYLWHSRLGHVSSSRLRFLASTGALGNLKT-CDIS-DCSGCKLAKFSALPFNRSTS 531
Query: 513 RASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVFSMFKKFLTY 572
+S+ F++IHSDVWG SP+++ +Y+V+FIDD++R+ W+Y ++ +SE F ++ F
Sbjct: 532 VSSSPFDLIHSDVWGPSPVSTKGGSRYYVSFIDDHTRYCWVYLMKHRSEFFEIYAAFRAL 591
Query: 573 VETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKNRHLLDV 632
++TQ A +K FR + GGEY S++F + L G + Q SC +TP+QNG+AERK+RH+++
Sbjct: 592 IKTQHSAVIKCFRCDLGGEYTSNKFCQMLALDGTIHQTSCTDTPEQNGVAERKHRHIVET 651
Query: 633 TRSLLLQASVPPRFWVEALST 653
RSLLL A V FW EA+ T
Sbjct: 652 ARSLLLSAFVLSEFWGEAVLT 672
>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
gi|9989049|gb|AAG10812.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1404
Score = 298 bits (762), Expect = 1e-78
Identities = 204/673 (30%), Positives = 323/673 (47%), Gaps = 74/673 (10%)
Query: 18 GKNYPAWEFQFRMYVKGNKLWSHL-------DDVSKAPTEKAALEE--WEYKDAQIISWI 68
G NY W + + G LWSH+ +D + TE + EE W +D +++ +
Sbjct: 15 GGNYLTWSRTTKTVLCGRGLWSHVISSQAPKEDKEEEETETISPEEEKWFQEDQAVLALL 74
Query: 69 LSSIDPQMINNLRSFSTAQEMWNYLKRIY-NQDNAAKRFQLELEIANYKQGNLYVQEFYS 127
+S++ ++ TA+E+W+ LK +Y N+ N + F+++ I Q +L + +
Sbjct: 75 QNSLETSILEGYSYCETAKELWDTLKNVYGNESNLTRVFEVKKAINELSQEDLEFTKHFG 134
Query: 128 GFLNLWTEHSAIIHADVPKASLAAVQEVYNTSRRDQ-----FLMKLRPEFEVVRGALLNR 182
F +LW+E ++ + L RR+Q L+ L P + + LL
Sbjct: 135 KFRSLWSELKSLRPGTLDPKIL--------HERREQDKVFGLLLTLNPGYNDLIKHLLRS 186
Query: 183 NPVPSLDTCVGELLREEQRLLTQGTMSHDAFISEPVPVAYAAQSRGKGRDMRQVQCFTCK 242
+PSLD ++ +E+ G S I+ A + K D + + C CK
Sbjct: 187 EKLPSLDEVCSKIQKEQGSTGLFGGKSE--LITANKGEVVANKGVYKNEDRKLLTCDHCK 244
Query: 243 QFGHVARSCTAKFCKYCKQNGHVIFDCPIRPPR----RTQYPTQALHATTSSAAPPTITS 298
+ GH C + ++P + R + + + + + TS
Sbjct: 245 KKGHTKDKCW-------------LLHPHLKPAKFKDSRAHFSQETHEEQSQAGSSKGETS 291
Query: 299 ASDGGSLQPEMIQQMV--LAALSNMGIHGKSSNVSRPWFLDSGASNHMTGSSEYLHNLHS 356
S G ++ ++ ++ + +L GI S S +DSGAS+HM +S L N+
Sbjct: 292 TSFGDYVRKSDLEALIKSIVSLKESGITFSSQTSSGSIVIDSGASHHMISNSNLLDNIEP 351
Query: 357 YHGNQQIQIADGNKLSITDVGDINSDFQD--VLVSPGLASNLLSVGQLV-DNNCNVNFSR 413
G+ + IA+G+K+ I +G++ +D P SNLLSV + D NC F
Sbjct: 352 ALGH--VIIANGDKVPIEGIGNLKLFNKDSKAFFMPKFTSNLLSVKRTTRDLNCYAIFGP 409
Query: 414 AGCLVQEQVSGKVIAKGPKVGRLFPLQFIS----------SHLSLACNNVLNSYEDWHRK 463
Q+ +GKVI +G G L+ L+ +S SHL ++ N + WH +
Sbjct: 410 NDVYFQDIETGKVIGEGGSKGELYVLEDLSPNSSSCFSSKSHLGISFNTL------WHAR 463
Query: 464 LGHPNSTVLSHLFKTGLLGNKQVVCTASISCPVCKLAKSKTLPFPSGAHRASNCFEMIHS 523
LGHP++ L + + SC C L K FP CF+++HS
Sbjct: 464 LGHPHTRALKLMLPN--------ISFDHTSCEACILGKHCKSVFPKSLTIYEKCFDLVHS 515
Query: 524 DVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVFSMFKKFLTYVETQFQASVKI 583
DVW SP S + KYFVTFI++ S++TWI L SK VF F F TYV QF A +K+
Sbjct: 516 DVW-TSPCVSRDNNKYFVTFINEKSKYTWITLLPSKDRVFEAFTNFETYVTNQFNAKIKV 574
Query: 584 FRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVP 643
FR+++GGEY S +F+++L +GI+ Q SCP TPQQNG+AERKNRHL++V RS++ SVP
Sbjct: 575 FRTDNGGEYTSQKFRDHLAKRGIIHQTSCPYTPQQNGVAERKNRHLMEVARSMMFHTSVP 634
Query: 644 PRFWVEALSTVCF 656
RFW +A+ T C+
Sbjct: 635 KRFWGDAVLTACY 647
>ref|XP_476167.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48843849|gb|AAT47108.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1204
Score = 284 bits (726), Expect = 2e-74
Identities = 202/689 (29%), Positives = 326/689 (46%), Gaps = 65/689 (9%)
Query: 14 VRFTGKNYPAWEFQFRMYVKGNKLWSHL------------------------DDVSKA-- 47
V F G NY W R++++G +LW L DD KA
Sbjct: 15 VLFDGCNYSHWAQHMRLHMRGQRLWDVLSSELPCPPCPIAPTMPSLASQATDDDREKAKE 74
Query: 48 ---------PTEKAALEEWEYKDAQIISWILSSIDPQMINNLRSFSTAQEMWNYLKRIYN 98
++ A + W +DA+ + +++S++ + + + ++A MW +L Y
Sbjct: 75 QFDDAMENYQSQFALYKAWLDEDARASAILVASMEIHLTGEVVTLTSAHLMWTHLHDRYA 134
Query: 99 QDNAAKRFQLELEIANYKQGNLYVQEFYSGFLNLWTEHSAIIHADVPKASLAAVQEVYNT 158
+ A + + + +QG+ V EFY+ ++W + ++ Q +
Sbjct: 135 PTSDALYLAMVRQEQSLQQGDSTVDEFYTQLSSIWRQLDSLGPTICHTYPCCQRQRSHMD 194
Query: 159 SRR-DQFLMKLRPEFEVVRGALLNRNPVPSLDTCVGELLREEQRLLTQGTMSHDAFISEP 217
RR FL +LR E+E R LL+R+P ++ + E+ EE RL G + + +
Sbjct: 195 LRRIYDFLTRLRSEYESTRAQLLSRHPRVTIMEALTEIRSEEIRLREAGILPLPSSVLAV 254
Query: 218 VPVAYAAQSRGKGRDMRQVQCFTCKQFGHVARSCTAKF-CKYCKQNGHVIFDCPIRPPRR 276
VA +A S + + V S C YC ++GHV C +
Sbjct: 255 RTVASSASSTPAVHSTVSSSSSSARPPTTVVPSTRGHLHCTYCDKDGHVESFCFRK---- 310
Query: 277 TQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQMVLAALSNMGIHGKSSNVSRPWFL 336
+ L SS + S GGS E++ M+L L+ G +V+ P
Sbjct: 311 ----KKDLRRGNSSKGTSGSSQKSSGGSDSQEIL--MLLRRLTASAATGSVGSVALP--- 361
Query: 337 DSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGDIN-SDFQDVLVS--PGLA 393
+ + + + GSS S + ADG L+I G ++ S F VS P LA
Sbjct: 362 SAQSGSAVLGSSSSTEGSSSASVPTTVHTADGTPLAIVGRGTLSISSFSVPAVSYVPKLA 421
Query: 394 SNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIAKGPK---VGRLFPLQFI-------S 443
L+S GQL D+ C V C VQ+ +G ++ GP+ RL+ L ++ +
Sbjct: 422 MQLMSAGQLTDHGCRVILDSDSCCVQDHRTGLLVGTGPRRRDSQRLWELDWLRLPSAAPA 481
Query: 444 SHLSLACNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCPVCKLAKSK 503
S L+ ++ + S+ WH +LGH + LS L + GLLG+ + + C CKL K
Sbjct: 482 SLLASTASSTV-SFAQWHHRLGHLCGSRLSALVRRGLLGSVSGAVSLN-QCQGCKLGKQI 539
Query: 504 TLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVF 563
LP+ S + F+++HSDVWG +P S ++Y++ FIDD+SR TWIYF+ +SEV
Sbjct: 540 QLPYHSSESVSKRPFDLVHSDVWGPAPFVSKGGHRYYIIFIDDFSRHTWIYFMTHRSEVL 599
Query: 564 SMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAE 623
+++K F + T F + +++FR++S GEY+S E + +L +G LSQ SCP QNG+AE
Sbjct: 600 AIYKSFARMIRTHFDSPIRVFRADSAGEYLSRELRVFLSEQGTLSQFSCPGAHAQNGVAE 659
Query: 624 RKNRHLLDVTRSLLLQASVPPRFWVEALS 652
RK+RHLL+ R+L++ +SVPP FW E S
Sbjct: 660 RKHRHLLETARALMIASSVPPHFWAEVYS 688
>gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301693|pir||F84480 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1402
Score = 275 bits (704), Expect = 6e-72
Identities = 206/710 (29%), Positives = 327/710 (46%), Gaps = 105/710 (14%)
Query: 14 VRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAPTEKAALEE---------------WE 58
V T KNY W+ QF ++ G L + AP++ + + + W
Sbjct: 17 VTLTAKNYILWKSQFESFLDGQGLLGFVTGSIPAPSQTSVVSDIDGSTSASPNPEYYTWF 76
Query: 59 YKDAQIISWILSSIDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLELEIANYKQG 118
D + SW+L S +++ + + +T+ E+W + +N+ ++++ F+L+ + N +
Sbjct: 77 KTDRVVKSWLLGSFLEDILSVVVNCNTSHEVWISVANHFNRVSSSRLFELQRRLQNVSKR 136
Query: 119 NLYVQEFYSGFLNLWTEHSAIIHADVPKASLAAVQEVYNTSRRDQFLMKLRPEFEVVRGA 178
+ + E+ + + +++ K + A L L E+E ++
Sbjct: 137 DKSMDEYLKDLKTICDQLASVGSPVTEKMKIFAA------------LNGLGREYEPIKTT 184
Query: 179 LLNRN---PVPSLDTCVGELLREEQRL---LTQGTMS-HDAF---ISEPVPVA--YAAQS 226
+ N P PSL+ + +L + RL L + +S H AF S+ + + A +
Sbjct: 185 IENSMDALPGPSLEDVIPKLTGYDDRLQGYLEETAVSPHVAFNITTSDDSNASGYFNAYN 244
Query: 227 RGKGRDMRQVQCFTCKQFG-HVARSC-----------TAKFCKYCKQNGHVIFDCPIRPP 274
RGKG+ R F+ + G H S T+ C+ C + GH C R
Sbjct: 245 RGKGKSNRGRNSFSTRGRGFHQQISSTNSSSGSQSGGTSVVCQICGKMGHPALKCWHRFN 304
Query: 275 RRTQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQMVLAALSNMGIHGKSSNVSRPW 334
QY + + AL+ M I + W
Sbjct: 305 NSYQY--------------------------------EELPRALAAMRITDITDQHGNEW 332
Query: 335 FLDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGDI-------NSDFQDVL 387
DS A+ H+T S L YHG+ + +ADGN L IT G N DVL
Sbjct: 333 LPDSAATAHVTNSPRSLQQSQPYHGSDAVMVADGNFLPITHTGSTNLASSSGNVPLTDVL 392
Query: 388 VSPGLASNLLSVGQLV-DNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQFISSHL 446
V P + +LLSV +L D C V F G + ++ + K++ G L+ L+ S
Sbjct: 393 VCPSITKSLLSVSKLTQDYPCTVEFDSDGVRINDKATKKLLIMGSTCDGLYCLKDDSQFK 452
Query: 447 S-LACNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLG-NKQVVCTASISCPVCKLAKSKT 504
+ + S E WHR+LGHP+ VL L KT + NK T+ C C+L KS
Sbjct: 453 AFFSTRQQSASDEVWHRRLGHPHPQVLQQLVKTNSISINK----TSKSLCEACQLGKSTR 508
Query: 505 LPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVFS 564
LPF S + ++ E +H D+WG SPI S ++Y+ FID YSRF+WIY L+ KS+ ++
Sbjct: 509 LPFVSSSFTSNRPLERVHCDLWGPSPITSVQGFRYYAVFIDHYSRFSWIYPLKLKSDFYN 568
Query: 565 MFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAER 624
+F F VE Q + +F+ + GGE+++H+F ++LQ+ GI S P+TPQQNGLAER
Sbjct: 569 IFVAFHKLVENQLNHKISVFQCDGGGEFVNHKFLQHLQNHGIQQHISYPHTPQQNGLAER 628
Query: 625 KNRHLLDVTRSLLLQASVPPRFWVEALSTVCF*LIVFLLQLLNLILLSFV 674
K+RHL+++ S+L Q+ VP +FWVEA T F L+NL+ S V
Sbjct: 629 KHRHLVELGLSMLFQSKVPLKFWVEAFFTANF--------LINLLPTSAV 670
>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37534632|ref|NP_921618.1| putative pol polyprotein
[Oryza sativa (japonica cultivar-group)]
Length = 1688
Score = 271 bits (693), Expect = 1e-70
Identities = 140/342 (40%), Positives = 201/342 (57%), Gaps = 19/342 (5%)
Query: 331 SRPWFLDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGDINSD---FQDVL 387
S+PW LDSGAS HM+ +L + + A+G +T G I+S +V
Sbjct: 181 SQPWILDSGASFHMSFDDSWLTSCRLVKNGATVHTANGTLCKVTHQGSISSPQFTVPNVS 240
Query: 388 VSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQFISSHLS 447
+ P L+ NL+SVGQL D NC V F C VQ++ +G VI G + R L + S LS
Sbjct: 241 LVPKLSMNLISVGQLTDTNCFVGFDDTSCFVQDRHTGAVIGTGHRQKRSCGLYILDS-LS 299
Query: 448 LA-------------CNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISC 494
L C+ S+ WH +LGH + L+ L G+LG+ V T C
Sbjct: 300 LPSSSTNTPSVYSPMCSTACKSFPQWHHRLGHLCGSRLATLINQGVLGSVPVDTT--FVC 357
Query: 495 PVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIY 554
CKL K LP+PS R+S F+++HSDVWG SP S + Y+V F+DDYSR+TWIY
Sbjct: 358 KGCKLGKQVQLPYPSSTSRSSRPFDLVHSDVWGKSPFPSKGGHNYYVIFVDDYSRYTWIY 417
Query: 555 FLRSKSEVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPN 614
F++ +S++ S+++ F + TQF ++++IFRS+SGGEYMS+ F+E+L +G L Q SCP
Sbjct: 418 FMKHRSQLISIYQSFAQMIHTQFSSAIRIFRSDSGGEYMSNAFREFLVSQGTLPQLSCPG 477
Query: 615 TPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEALSTVCF 656
QNG+AERK+RH+++ R+LL+ + VP FW EA+ST +
Sbjct: 478 AHAQNGVAERKHRHIIETARTLLIASFVPAHFWAEAISTAVY 519
>gb|AAD15534.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25411300|pir||F84485 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1664
Score = 269 bits (688), Expect = 4e-70
Identities = 197/696 (28%), Positives = 317/696 (45%), Gaps = 108/696 (15%)
Query: 4 MTSEKAKDFCVRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAPTEKAALE-------- 55
M + KA V G NY W + + LW+H+ S+AP+E E
Sbjct: 1 MENTKALFVPVTLKGVNYLLWARTTKTTLCSRGLWAHIL-TSEAPSEATIREGMEIVHVG 59
Query: 56 --EWEYKDAQIISWILSSIDPQMINNLRSFSTAQEMWNYLKRIY-NQDNAAKRFQLELEI 112
+W +D +++ + +S++ ++ TA+E+W L ++ NQ N ++ F+++ I
Sbjct: 60 EEKWFQEDQSVLALLQNSLEASLLEAYSYCETAKELWETLFNVFGNQSNLSRVFEVKKAI 119
Query: 113 ANYKQGNLYVQEFYSGFLNLWTEHSAIIHADV-PKASLAAVQEVYNTSRRDQ-----FLM 166
+ QG++ + + F +LW E + + PK + RR+Q L+
Sbjct: 120 NDLSQGDMEFTQHFGKFRSLWAELEMLRPNTLDPKVLI---------ERREQDKVFGLLL 170
Query: 167 KLRPEFEVVRGALLNRNPVPSLDTCVGELLREEQRLLTQGTMSHDAFISEPVPVAYAAQS 226
L + + LL + +P+L+ ++ +E+ +G +D
Sbjct: 171 TLSSTYNDLIKHLLRADKLPNLEEVCSQIQKEQ---ANRGNYKYD--------------- 212
Query: 227 RGKGRDMRQVQCFTCKQFGHVARSCTAKFCKYCKQNGHVIFDC-----PIRPPRRTQYPT 281
+ A +C++CK++GH C +RP RR
Sbjct: 213 ----------------------NNKKALWCEHCKRSGHTKEKCWTLHPHLRPGRREPRAN 250
Query: 282 QALHATTSSAAPPTITSASDGG-----SLQPEMIQQMVLAAL-------SNMGIHGKSSN 329
Q + ++ GG + +++++ L AL S H SS
Sbjct: 251 QVTGENFGTQEQSGTSNQHLGGNGAAMAASSDLVRRSDLKALIKALKESSGKSYHALSS- 309
Query: 330 VSRPWFLDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGDIN--SDFQDVL 387
+P +DSGAS+HM S+ + N+ GN + IA+G+++ + VGD++
Sbjct: 310 -LKPLIIDSGASHHMISDSKLISNIEPALGN--VVIANGDRIPVKGVGDLDLFDKSSKAF 366
Query: 388 VSPGLASNLLSVGQ-LVDNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQF----- 441
P SNLLSV + D NC F Q+ + +V+ +G L+ L+
Sbjct: 367 YMPTFTSNLLSVKKATTDLNCYAIFGPNEVHFQDIETSRVLGQGVTKDGLYVLEDTKPSV 426
Query: 442 -ISSHLSLACNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCPVCKLA 500
+SSH S N + E WH +LGHP+S L L + N + C C L
Sbjct: 427 PLSSHFSSILGNA--NSESWHARLGHPHSRALKLLLPSTSFKNDE--------CEACILG 476
Query: 501 KSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKS 560
K FP + CF++IHSDVW SP S ++KYFVTFID+ S+FTW L SK
Sbjct: 477 KHCKSVFPKSSTIYEKCFDLIHSDVW-TSPCLSRENHKYFVTFIDEKSKFTWFTLLPSKD 535
Query: 561 EVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNG 620
V F F TYV + A +KI RS++ GEY SH F+++L GI+ Q SCP TPQQNG
Sbjct: 536 RVLEAFTNFQTYVTNHYDAKIKILRSDNRGEYTSHAFKQHLNKHGIIHQTSCPYTPQQNG 595
Query: 621 LAERKNRHLLDVTRSLLLQASVPPRFWVEALSTVCF 656
+AERKNRHL++V R ++ +VP FW++ + + C+
Sbjct: 596 VAERKNRHLMEVRRVMMFHTNVPKHFWIDGVVSACY 631
>emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7488558|pir||T14517
hypothetical protein 1 - wild cabbage transposon Melmoth
Length = 1131
Score = 265 bits (678), Expect = 6e-69
Identities = 184/696 (26%), Positives = 313/696 (44%), Gaps = 69/696 (9%)
Query: 14 VRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAP-TEKAALEEWEYKDAQIISWILSSI 72
++ G NY W ++ + +D P T W ++ + SW+L+S+
Sbjct: 30 LKLDGSNYDDWNAAMKIALDAKNKIGFVDGTLTRPDTSDPTFRLWSRCNSMVKSWLLNSV 89
Query: 73 DPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLELEIANYKQGNLYVQEFYSGFLNL 132
PQ+ ++ + A ++W L ++ N + F L EI + KQG++ + ++Y+ L
Sbjct: 90 SPQIYRSILRLNDAADIWRDLHGRFHMTNLPRTFNLTQEIQDLKQGSMSLSDYYTTLKTL 149
Query: 133 WTEHSAIIHADVPKA--SLAAVQEVYNTSRRDQFLMKLRPEFEVVRGALLNRNPVPSLDT 190
W ++ D P + +Q+ + ++ +FL L + ++R ++ + +PSL
Sbjct: 150 WDNLESVDEPDTPCVCGNAEKLQKKVDRAKIVKFLAGLNDSYAIIRRQIIMKKVLPSLVE 209
Query: 191 CVGELLREEQRLLTQGTMSHDAF-ISEPVP-------VAYAAQSRGKGRDMRQVQCFTCK 242
L +++ + ++ AF +SE VP + Y KGR + C C
Sbjct: 210 VYNILDQDDSQKGFSTAITPAAFNVSENVPPPMAEAGICYVQTGPNKGRPI----CSFCN 265
Query: 243 QFGHVARSCTAKF-------CKYCKQNG--------HVIFDCPIRPPRRTQYPTQALHAT 287
+ GH+A C K KY Q+ V PP Q P H
Sbjct: 266 RVGHIAERCYKKHGFPPGFVSKYKSQSSGDRLQKPKQVAAQVSFSPPNSGQSPMTMDHLV 325
Query: 288 TS-------------SAAPPTITSASDGGSLQPEMIQQ---------MVLAALSNMGIHG 325
+ S+ P +T S+ S + + +V L + H
Sbjct: 326 GNHSKEQLQQFIALFSSQLPNVTMGSNEASSSKQPMDNSGISFNPTTLVFIGLLTVSRHT 385
Query: 326 KSSNVSRPWFLDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGDINSD--- 382
++ W +DSGA++H+ ++ + + +G + I+ VG + +
Sbjct: 386 LANET---WIIDSGATHHVCHDRSMYTSI-DITTTSNVNLPNGMIVKISGVGIVQLNEHI 441
Query: 383 -FQDVLVSPGLASNLLSVGQLV-DNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQ 440
+VL P NLLS+ L D V F + C +Q+ G I +G +V L+ L
Sbjct: 442 TLHNVLYIPEFRLNLLSISSLTSDIGSQVIFDVSSCAIQDPTKGWTIGQGRRVANLYVLD 501
Query: 441 FISSHLSLACNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCPVCKLA 500
SS + + N + WH++LGHP+ T L + + LG + C VC LA
Sbjct: 502 VKSSPMKI---NAVVDISLWHKRLGHPSYTRLDKISEA--LGTTKHKNKGDAHCHVCHLA 556
Query: 501 KSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKS 560
K K L + S H + F+++H DVWG + + YKYF+T +DD+SR TWIY L+SKS
Sbjct: 557 KQKKLSYSSQNHICTASFQLLHVDVWGPFSVETLEGYKYFLTIVDDHSRATWIYLLQSKS 616
Query: 561 EVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNG 620
+V +F F+ +ETQ+ +K R ++ E F E + KGI+S SCP T +QN
Sbjct: 617 DVLHIFPTFVNQIETQYNTKIKSVRRDNAPEL---SFTELFKEKGIVSYHSCPETLEQNS 673
Query: 621 LAERKNRHLLDVTRSLLLQASVPPRFWVEALSTVCF 656
+ ERK++HLL+V R+L+ Q+ VP ++W + + T F
Sbjct: 674 VLERKHQHLLNVARALMFQSQVPLQYWGDCVLTAAF 709
>gb|AAT38747.1| putative polyprotein [Solanum demissum]
Length = 1336
Score = 262 bits (670), Expect = 5e-68
Identities = 155/407 (38%), Positives = 226/407 (55%), Gaps = 18/407 (4%)
Query: 256 CKYCKQNGHVIFDCPIRPPRRTQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQMVL 315
C+YCK+ GH+ +C Q T A+ AT+SS P T+T ++D + + + M
Sbjct: 290 CRYCKEPGHIRRNCKKLQLHNQQTQTAAVAATSSS--PSTVTISADEYARLTKYQESMPA 347
Query: 316 AALSNMGIHGKSSNVSRPWFLDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITD 375
+L+ G S+ S W +DSGA++HMTG+ ++ ++ + I DG+ +I
Sbjct: 348 PSLNESGNKCLISSSSN-WIIDSGATDHMTGNPKFFSKFQAHKVPSSVTIVDGSSYTIEG 406
Query: 376 VGDINSD----FQDVLVSPGLASNLLSVGQLVDN-NCNVNFSRAGCLVQEQVSGKVIAKG 430
G +N VL P A NL+SV +L C V+ CL Q+ ++ ++I K
Sbjct: 407 SGTVNHTSSITLSSVLGLPSHAFNLISVSKLTKELKCFVSLYPDHCLFQDLMTKQIIGKR 466
Query: 431 PKVGRLFPLQFISSHLSLACNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTA 490
L+ L + S+AC+++++ +E H +LGHP+ VL L Q
Sbjct: 467 HVSDGLYILDEWTPP-SVACSSIVSPFEA-HCRLGHPSLPVLKKLCP-------QFHNVP 517
Query: 491 SISCPVCKLAKSKTLPF-PSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSR 549
SI C C AK + P RA+ FE++HSDVWG P+ S ++YFVTF+DD+SR
Sbjct: 518 SIDCESCHFAKHHRISLSPRNNKRANFAFELVHSDVWGPCPVVSKVGFRYFVTFMDDFSR 577
Query: 550 FTWIYFLRSKSEVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQ 609
TWIYF++++SEVFS F F ++TQF ASV I RS++ E+MS FQ Y+ GIL Q
Sbjct: 578 MTWIYFMKNRSEVFSHFSNFCAEIKTQFNASVHILRSDNAREFMSASFQNYMNQYGILHQ 637
Query: 610 RSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEALSTVCF 656
SC +TP QNG+AERKNRHLL+ R LL Q VP +FW + +ST F
Sbjct: 638 SSCVDTPSQNGVAERKNRHLLETARVLLFQMKVPKQFWADTVSTASF 684
Score = 56.6 bits (135), Expect = 6e-06
Identities = 45/199 (22%), Positives = 96/199 (47%), Gaps = 17/199 (8%)
Query: 15 RFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAPTEKAALEEWEYKDAQIISWILSSIDP 74
+ G NY W + R+Y++ + HL + PT+ A + W DA++I I++SID
Sbjct: 22 KLNGSNYLDWSRKIRIYLRSVEKDDHL--IQDPPTDDAK-KAWLRDDARLILQIINSIDN 78
Query: 75 QMINNLRSFSTAQEMWNYLKRIYN-QDNAAKRFQLELEIANYKQGNLYVQEFYSGFLNLW 133
+++ + +E+ +YL+ +Y+ + N ++ +++ ++ + ++ F +
Sbjct: 79 EVVGLVNHCEFVKELMDYLEYLYSGKGNLSRIYEVSKAFYRSEKEAKSLTTYFMEFKKTY 138
Query: 134 TEHSAIIHADVPKASLAAVQEVYNTSRRDQ-----FLMKLRPEFEVVRGALLNRNPVPSL 188
E + ++ P ++ VQ+ ++R+Q FL L EFE + +L+ + + SL
Sbjct: 139 EELNVLL----PFSTDIKVQQ----AQREQMAIMSFLAGLPSEFETAKSQILSSSEITSL 190
Query: 189 DTCVGELLREEQRLLTQGT 207
++LR E Q T
Sbjct: 191 KDVFSQVLRTESTPANQQT 209
>gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana]
gi|7444456|pir||T01908 hypothetical protein T12H20.12 -
Arabidopsis thaliana
Length = 1392
Score = 262 bits (669), Expect = 7e-68
Identities = 186/689 (26%), Positives = 324/689 (46%), Gaps = 98/689 (14%)
Query: 14 VRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAP------------TEKAALE--EWEY 59
++ T NY W+ QF Y+ + L + + P +E+A E +W
Sbjct: 18 LKLTPTNYLLWKTQFESYLSSHLLLGFVTGATPRPASTIIVTKDDIQSEEANQEFLKWTR 77
Query: 60 KDAQIISWILSSIDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLELEIANYKQGN 119
D + +WI S+ + + + ++AQE+W L R +N+ + +++ L+ + +
Sbjct: 78 IDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTRKYDLQKRLGTCSKAG 137
Query: 120 LYVQEFYSGFLNLWTEHSAIIHADVPKASLAAVQEVYNTSRRDQFLMKLRPEFEVVRGAL 179
+ + S N+ + +I + + V L L E+E + +
Sbjct: 138 KTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGV------------LNGLGKEYESIATVI 185
Query: 180 ---LNRNPVPSLDTCVGELLREEQRLLTQGTMS----HDAFISEPVPVAYAAQS------ 226
L+ P P D V +L + +L T S H AF ++ +Y+++
Sbjct: 186 EHSLDVYPGPCFDDVVYKLTTFDDKLSTYTANSEVTPHLAFYTDK---SYSSRGNNNSRG 242
Query: 227 ------RGKGRDMRQVQCFTCKQFGHVARSCTAK----FCKYCKQNGHVIFDCPIRPPRR 276
RG+G + + F +QFG + + + C+ C++ GH F C R
Sbjct: 243 GRYGNFRGRGSYSSRGRGFH-QQFGSGSNNGSGNGSKPTCQICRKYGHSAFKCYTR---- 297
Query: 277 TQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQMVLAALSNMGIHGKSSNVSRPWFL 336
+ + + + A + M + ++ S W
Sbjct: 298 ----------------------------FEENYLPEDLPNAFAAMRVSDQNQASSHEWLP 329
Query: 337 DSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGDI-------NSDFQDVLVS 389
DS A+ H+T +++ L N +Y G+ + + +G+ L IT +G I +DVLV
Sbjct: 330 DSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQGTLPLEDVLVC 389
Query: 390 PGLASNLLSVGQLVDN-NCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQFISSHLSL 448
PG+ +LLSV +L D+ C+ F +++++ + +++ +G K L+ L+ +
Sbjct: 390 PGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLKDVPFQTYY 449
Query: 449 ACNNVLNSYEDWHRKLGHPNSTVLSHLFKT-GLLGNKQVVCTASISCPVCKLAKSKTLPF 507
+ + E WH++LGHPN VL HL KT ++ NK T+S C C++ K LPF
Sbjct: 450 STRQQSSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNK----TSSNMCEACQMGKVCRLPF 505
Query: 508 PSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVFSMFK 567
+ +S E IH D+WG +P+ S ++Y+V FID+YSRFTW Y L+ KS+ FS+F
Sbjct: 506 VASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSRFTWFYPLKLKSDFFSVFV 565
Query: 568 KFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKNR 627
F VE Q+Q + +F+ + GGE++S++F +L GI SCP+TPQQNG+AER++R
Sbjct: 566 LFQQLVENQYQHKIAMFQCDGGGEFVSYKFVAHLASCGIKQLISCPHTPQQNGIAERRHR 625
Query: 628 HLLDVTRSLLLQASVPPRFWVEALSTVCF 656
+L ++ SL+ + VP + WVEA T F
Sbjct: 626 YLTELGLSLMFHSKVPHKLWVEAFFTSNF 654
>gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis thaliana]
gi|25301689|pir||C96578 hypothetical protein T18A20.5
[imported] - Arabidopsis thaliana
Length = 1522
Score = 259 bits (662), Expect = 4e-67
Identities = 195/691 (28%), Positives = 315/691 (45%), Gaps = 101/691 (14%)
Query: 14 VRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAPTEKAALEE--------------WEY 59
V +NY W+ QF ++ G L + AP + ++ W
Sbjct: 17 VTLNQQNYILWKSQFESFLSGQGLLGFVTGSISAPAQTRSVTHNNVTSEEPNPEFYTWHQ 76
Query: 60 KDAQIISWILSSIDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLELEIANYKQGN 119
D + SW+L S +++ + + T+ ++W L +N+ ++++ F+L+ + ++ +
Sbjct: 77 TDQVVKSWLLGSFAEDILSVVVNCFTSHQVWLTLANHFNRVSSSRLFELQRRLQTLEKKD 136
Query: 120 LYVQEFYSGFLNLWTEHSAIIHADVPKASLAAVQEVYNTSRRDQFLMKLRPEFEVVRGAL 179
++ F ++ + A + + VP+ ++++ L L E+E ++ +
Sbjct: 137 NTMEVFLKDLKHI-CDQLASVGSPVPEK-----MKIFSA------LNGLGREYEPIKTTI 184
Query: 180 LNR---NPVPSLDTCVGELLREEQRL---LTQGTMS-HDAF-ISEPVPVAYAAQSRGKGR 231
N NP SLD +L + RL +T+ T+S H AF ++ Y +RGKGR
Sbjct: 185 ENSVDSNPSLSLDEVASKLRGYDDRLQSYVTEPTISPHVAFNVTHSDSGYYHNNNRGKGR 244
Query: 232 DM----------------RQVQCFTCKQFGHVARSCTAKFCKYCKQNGHVIFDCPIRPPR 275
+Q+ + Q G+ + C+ C + GH C R
Sbjct: 245 SNSGSGKSSFSTRGRGFHQQISPTSGSQAGN-----SGLVCQICGKAGHHALKCWHRFDN 299
Query: 276 RTQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQMVLAALSNMGIHGKSSNVSRPWF 335
Q+ + AL+ M I + + W
Sbjct: 300 SYQHEDLPM--------------------------------ALATMRITDVTDHHGHEWI 327
Query: 336 LDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGD--INSD-----FQDVLV 388
DS AS H+T + L YHG+ I +ADGN L IT G I S ++VLV
Sbjct: 328 PDSAASAHVTNNRHVLQQSQPYHGSDSIMVADGNFLPITHTGSGSIASSSGKIPLKEVLV 387
Query: 389 SPGLASNLLSVGQLV-DNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQFISSHLS 447
P + +LLSV +L D C+V F + ++ + K++ G L+ L+ +
Sbjct: 388 CPDIVKSLLSVSKLTSDYPCSVEFDADSVRINDKATKKLLVMGRNRDGLYSLEEPKLQVL 447
Query: 448 LACNNVLNSYEDWHRKLGHPNSTVLSHLF--KTGLLGNKQVVCTASISCPVCKLAKSKTL 505
+ S E WHR+LGH N+ VL L K+ ++ NK V C C L KS L
Sbjct: 448 YSTRQNSASSEVWHRRLGHANAEVLHQLASSKSIIIINKVVKTV----CEACHLGKSTRL 503
Query: 506 PFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVFSM 565
PF AS E IH D+WG SP +S ++Y+V FID YSRFTW Y L+ KS+ FS
Sbjct: 504 PFMLSTFNASRPLERIHCDLWGPSPTSSVQGFRYYVVFIDHYSRFTWFYPLKLKSDFFST 563
Query: 566 FKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERK 625
F F VE Q +KIF+ + GGE++S +F ++LQ GI SCP TPQQNG+AERK
Sbjct: 564 FVMFQKLVENQLGHKIKIFQCDGGGEFISSQFLKHLQDHGIQQNMSCPYTPQQNGMAERK 623
Query: 626 NRHLLDVTRSLLLQASVPPRFWVEALSTVCF 656
+RH++++ S++ Q+ +P ++W+E+ T F
Sbjct: 624 HRHIVELGLSMIFQSKLPLKYWLESFFTANF 654
>gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301708|pir||B84523 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1149
Score = 259 bits (661), Expect = 6e-67
Identities = 199/678 (29%), Positives = 313/678 (45%), Gaps = 91/678 (13%)
Query: 14 VRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAPT-------EKAALEEWEYKDAQIIS 66
V+ T +NY W+ QF ++ G L ++ APT + E D Q +
Sbjct: 21 VKLTDRNYILWKSQFESFLSGQGLLGFVNGAYAAPTGTVSGPQDAGVTEAIPNPDYQ--A 78
Query: 67 WILSS---IDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLELEIANYKQGNLYVQ 123
W S + +++ + T+ E+W L + +N+ ++++ F+L+ + + + ++
Sbjct: 79 WFRSDQVVMSEDILSVVVGSKTSHEVWMNLAKHFNRISSSRIFELQRRLHSLSKEGKTME 138
Query: 124 EFYSGFLNLWTEHSAIIHADVPKASLAAVQEVYNTSRRDQFLMKLRPEFEVVRGALLNRN 183
E+ + + +++ K + A+ V+ +R + P + G L +
Sbjct: 139 EYLRYLKTICDQLASVGSPVAEKMKIFAM--VHGLTR------EYEPLITSLEGTL-DAF 189
Query: 184 PVPSLDTCVGELLREEQRL---LTQGTMSHDAFISEPVPVAYAAQSRGKG-RDMRQVQCF 239
P PS + V L + RL H AF + + + +RG+G R+ R F
Sbjct: 190 PGPSYEDVVYRLKNFDDRLQGYTVTDVSPHLAFNT------FRSSNRGRGGRNNRGKGNF 243
Query: 240 TCK------QFGHVARSCTAK---FCKYCKQNGHVIFDCPIRPPRRTQYPTQALHATTSS 290
+ + QF + S +A C+ C + GH C R Q+ A A ++
Sbjct: 244 STRGRGFQQQFSSSSSSVSASEKPMCQICGKRGHYALQCWHRFDDSYQHSEAAAAAFSAL 303
Query: 291 AAPPTITSASDGGSLQPEMIQQMVLAALSNMGIHGKSSNVSRPWFLDSGASNHMTGSSEY 350
IT SD W DS A+ H+T +S
Sbjct: 304 H----ITDVSDDSG-----------------------------WVPDSAATAHITNNSSR 330
Query: 351 LHNLHSYHGNQQIQIADGNKLSITDVGDINSD-------FQDVLVSPGLASNLLSVGQLV 403
L + Y GN + +DGN L IT +G N +DVLV P +A +LLSV +L
Sbjct: 331 LQQMQPYLGNDTVMASDGNFLPITHIGSANLPSTSGNLPLKDVLVCPNIAKSLLSVSKLT 390
Query: 404 -DNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGR-LFPLQFISSHLSLACNNVLNSYEDWH 461
D C+ F G LV+++ + KV+ KG L+ L+ + + V + E WH
Sbjct: 391 KDYPCSFTFDADGVLVKDKATCKVLTKGSSTSEGLYKLENPKFQMFYSTRQVKATDEVWH 450
Query: 462 RKLGHPNSTVLSHLFKTGLLGNK---QVVCTASISCPVCKLAKSKTLPFPSGAHRASNCF 518
+LGHPN VL LL NK Q+ + S C C+L KS LPF + AS
Sbjct: 451 MRLGHPNPQVLQ------LLANKKAIQINKSTSKMCESCRLGKSSRLPFIASDFIASRPL 504
Query: 519 EMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVFSMFKKFLTYVETQFQ 578
E +H D+WG +P++S ++Y+V FID+ SRF W Y L+ KS+ S+F KF ++VE Q
Sbjct: 505 ERVHCDLWGPAPVSSIQGFQYYVIFIDNRSRFCWFYPLKHKSDFCSLFMKFQSFVENLLQ 564
Query: 579 ASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLL 638
+ F+S+ GGE+ S+ F ++LQ GI SCP+TPQQNGLAERK+R L + +L+
Sbjct: 565 TKIGTFQSDGGGEFTSNRFLQHLQESGIQHYISCPHTPQQNGLAERKHRQLTERGLTLMF 624
Query: 639 QASVPPRFWVEALSTVCF 656
Q+ P RFWVEA T F
Sbjct: 625 QSKAPQRFWVEAFFTANF 642
>gb|AAT40550.1| putative receptor kinase [Solanum demissum]
Length = 1358
Score = 258 bits (659), Expect = 1e-66
Identities = 183/673 (27%), Positives = 320/673 (47%), Gaps = 50/673 (7%)
Query: 20 NYPAWEFQFRMYVKGNKLWSHL-------DDVSKAPTEKA-ALEEWEYKDAQIISWILSS 71
NY +W ++ KG + HL D +K E A A +WE DAQ+ S + S
Sbjct: 20 NYLSWASSVELWCKGQGVQDHLTNKAYEVDVKAKTSEEDAKAKAQWEKVDAQLCSLLWRS 79
Query: 72 IDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLELEIANYKQGNLYVQEFYSGFLN 131
ID +++ R F T +W + +Y D ++ + + + N K+ + +
Sbjct: 80 IDFKLMPLFRPFQTCYTVWEKARALYTND-ISRFYDVISRLTNLKKQESDMSTYLGQVQA 138
Query: 132 LWTEHSAIIHADVPKASLAAVQEVYNTSRRDQFLMKLRPEFEVVRGALLNRNPVPSLDTC 191
+ E ++ ++ QE T L L P+ + VR +L VP++D
Sbjct: 139 VMEEFDTLMPV---TTNVEKQQEHRQTLFLVLTLAGLPPDHDSVRDQILASPTVPTIDEL 195
Query: 192 VGELLR----EEQRLLTQGTMSHDAFIS---EPVPVAYAAQSRGKGR-DMRQVQCFTCKQ 243
LLR ++++ T+ S E RG GR + +C C +
Sbjct: 196 FSRLLRLAAPPSHKVVSSPTVDSSILASQTFEKRTYQSMENRRGGGRFGKPRSKCSHCHK 255
Query: 244 FGHVARSC--------------TAKFCKYCKQNGHVIFDCPIRPPRRTQYPTQALHATTS 289
GH C ++ ++ + P+ + P+ H +
Sbjct: 256 PGHTRDICYILHGPPPSYDPIVLKEYNEFLRNRASKQTSPPVAYGAQPNQPSNNAHIAQT 315
Query: 290 SAAPPTITSASDGGSLQPEMIQQMVLAALSNMGIHGKSSNVSRPWFLDSGASNHMTGSSE 349
A+ S Q + Q ++ N S+ W +DSGAS+H++G+
Sbjct: 316 EYDEFLQYRANKQTSPQVVSVAQPDVSVAGNSFACVSQSSTLGTWVMDSGASDHISGNKS 375
Query: 350 YLHNLHSYHGNQQIQIADGNKLSITDVGDI----NSDFQDVLVSPGLASNLLSVGQLVDN 405
L ++ I +A+G + VG + VL PG NL SV +L
Sbjct: 376 LLSDIVYSQSLPAITLANGIQTKPKGVGKAKPLSSVTLDSVLYVPGSPFNLASVSRLTKA 435
Query: 406 -NCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQFISSHLSLACNNVLNSYEDWHRKL 464
+C++ F L+Q++ +G++I G + L+ +++S SLA ++ +S + H++L
Sbjct: 436 LHCSITFFDDFFLMQDRSTGQMIGTGHESQGLY---YLTSSNSLAACSITDSPDLIHKRL 492
Query: 465 GHPNSTVLSHLFKTGLLGNKQVVCTASISCPVCKLAKSKTLPFP-SGAHRASNCFEMIHS 523
GH + + L + + + +++ C C+L K F S R+ + F ++HS
Sbjct: 493 GHSSLSKLQKMVPS-------LSSLSTLDCESCQLGKHTRATFSRSTEGRSESIFSLVHS 545
Query: 524 DVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVFSMFKKFLTYVETQFQASVKI 583
D+WG S ++S ++YFV+FIDDYS+ TW++ ++ +SE+FS+FK F ++ QF S++
Sbjct: 546 DIWGPSRVSSTLGFRYFVSFIDDYSKCTWVFLMKDRSELFSIFKSFFAEIQNQFGVSIRT 605
Query: 584 FRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVP 643
FRS++ EY+S +F+E++ H+GI+ Q +CP TPQQNG+AERKNRHL++ R+LLL+++VP
Sbjct: 606 FRSDNALEYLSSQFREFMTHQGIIHQTTCPYTPQQNGVAERKNRHLIETARTLLLESNVP 665
Query: 644 PRFWVEALSTVCF 656
RFW +A+ T C+
Sbjct: 666 LRFWGDAVLTSCY 678
>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 257 bits (656), Expect = 2e-66
Identities = 207/745 (27%), Positives = 344/745 (45%), Gaps = 122/745 (16%)
Query: 14 VRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAPTEKAAL--------------EEWEY 59
++ NY W+ QF + KL ++ V P + + E+W
Sbjct: 19 LKLNDSNYLLWKTQFESLLSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEVPNPQYEDWFC 78
Query: 60 KDAQIISWILSSIDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLE--LEIANYKQ 117
D + SW+ ++ +++ ++ + +T++++W L +N+ + A+ F L L++ K
Sbjct: 79 TDQLVRSWLFGTLSEEVLGHVHNLTTSRQIWISLAENFNKSSIAREFSLRRNLQLLTKKD 138
Query: 118 GNL--YVQEFYSGFLNLWTEHSAIIHADVPKASLAAV-QEVYNTSRRDQFLMKLRPEFEV 174
+L Y ++F I D SL+++ + V + + FL L E++
Sbjct: 139 KSLSVYCRDFK-------------IICD----SLSSIGKPVEESMKIFGFLNGLGREYDP 181
Query: 175 VRGAL---LNRNPVPSLDTCVGELLREEQRLL----TQGTMSHDAFISEPVPVA---YAA 224
+ + L++ P P+ + + E+ + +L T H AF +E Y +
Sbjct: 182 ITTVIQSSLSKLPAPTFNDVISEVQGFDSKLQSYDDTVSVNPHLAFNTERSNSGAPQYNS 241
Query: 225 QSRGKGRD--MRQVQCFTCKQFG---HVARSCTA---KFCKYCKQNGHVIFDCPIRPPR- 275
SRG+GR R ++ + G H + S ++ C+ C + GH C R
Sbjct: 242 NSRGRGRSGQNRGRGGYSTRGRGFSQHQSASPSSGQRPVCQICGRIGHTAIKCYNRFDNN 301
Query: 276 -RTQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQMVLAALSNMGIHGKSSNVSRPW 334
+++ PTQA A S + W
Sbjct: 302 YQSEVPTQAFSALRVS-------------------------------------DETGKEW 324
Query: 335 FLDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGD--INSD-----FQDVL 387
+ DS A+ H+T S+ L N +Y GN + + DG L IT VG I+S +VL
Sbjct: 325 YPDSAATAHITASTSGLQNATTYEGNDAVLVGDGTYLPITHVGSTTISSSKGTIPLNEVL 384
Query: 388 VSPGLASNLLSVGQLVDNN-CNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQ---FIS 443
V P + +LLSV +L D+ C V F + + + KV++KGP+ L+ L+ F++
Sbjct: 385 VCPAIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLTTQKVVSKGPRNNGLYMLENSEFVA 444
Query: 444 SHLSLACNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCPVCK---LA 500
+ + C S E WH +LGH NS +L L L K++ S + PVC+ +
Sbjct: 445 LYSNRQC---AASMETWHHRLGHSNSKILQQL-----LTRKEIQVNKSRTSPVCEPCQMG 496
Query: 501 KSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKS 560
KS L F S RA + +H D+WG SP+ S+ +KY+ F+DD+SRF+W + LR KS
Sbjct: 497 KSTRLQFFSSDFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSRFSWFFPLRMKS 556
Query: 561 EVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNG 620
+ S+F + VE Q +K F+S+ GGE+ S++ +E+ + GI + SCP TPQQNG
Sbjct: 557 KFISVFIAYQKLVENQLGTKIKEFQSDGGGEFTSNKLKEHFREHGIHHRISCPYTPQQNG 616
Query: 621 LAERKNRHLLDVTRSLLLQASVPPRFWVEALSTVCF*LIVFLLQLLNLILLSFV--YLNF 678
+AERK+RHL+++ S+L + P +FWVEA T +L LL +L + Y
Sbjct: 617 VAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTA-----NYLSNLLPSSVLKEISPYETL 671
Query: 679 SLIIVIYILLGVCVLSTYPCLKGIS 703
V Y L V + YPCL+ ++
Sbjct: 672 FQQKVDYTPLRVFGTACYPCLRPLA 696
>emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana]
gi|4539447|emb|CAB40035.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444419|pir||T04204
hypothetical protein T4F9.150 - Arabidopsis thaliana
Length = 1515
Score = 254 bits (649), Expect = 1e-65
Identities = 177/647 (27%), Positives = 309/647 (47%), Gaps = 85/647 (13%)
Query: 42 DDVSKAPTEKAALEEWEYKDAQIISWILSSIDPQMINNLRSFSTAQEMWNYLKRIYNQDN 101
DD+ + L+ W D + +WI S+ + + + ++AQE+W L R +N+ +
Sbjct: 58 DDIQSEEANQEFLK-WTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFS 116
Query: 102 AAKRFQLELEIANYKQGNLYVQEFYSGFLNLWTEHSAIIHADVPKASLAAVQEVYNTSRR 161
+++ L+ + + + + S N+ + +I + + V
Sbjct: 117 TTRKYDLQKRLGTCSKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGV--------- 167
Query: 162 DQFLMKLRPEFEVVRGAL---LNRNPVPSLDTCVGELLREEQRLLTQGTMS----HDAFI 214
L L E+E + + L+ P P D V +L + +L T S H AF
Sbjct: 168 ---LNGLGKEYESIATVIEHSLDVYPGPCFDDVVYKLTTFDDKLSTYTANSEVTPHLAFY 224
Query: 215 SEPVPVAYAAQS------------RGKGRDMRQVQCFTCKQFGHVARSCTAK----FCKY 258
++ +Y+++ RG+G + + F +QFG + + + C+
Sbjct: 225 TDK---SYSSRGNNNSRGGRYGNFRGRGSYSSRGRGFH-QQFGSGSNNGSGNGSKPTCQI 280
Query: 259 CKQNGHVIFDCPIRPPRRTQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQMVLAAL 318
C++ GH F C R + + + + A
Sbjct: 281 CRKYGHSAFKCYTR--------------------------------FEENYLPEDLPNAF 308
Query: 319 SNMGIHGKSSNVSRPWFLDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGD 378
+ M + ++ S W DS A+ H+T +++ L N +Y G+ + + +G+ L IT +G
Sbjct: 309 AAMRVSDQNQASSHEWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGT 368
Query: 379 INSDF-------QDVLVSPGLASNLLSVGQLVDNN-CNVNFSRAGCLVQEQVSGKVIAKG 430
I + +DVLV PG+ +LLSV +L D+ C+ F +++++ + +++ +G
Sbjct: 369 IPLNISQGTLPLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQG 428
Query: 431 PKVGRLFPLQFISSHLSLACNNVLNSYEDWHRKLGHPNSTVLSHLFKT-GLLGNKQVVCT 489
K L+ L+ + + + E WH++LGHPN VL HL KT ++ NK T
Sbjct: 429 NKHKGLYVLKDVPFQTYYSTRQQSSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNK----T 484
Query: 490 ASISCPVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSR 549
+S C C++ K LPF + +S E IH D+WG +P+ S ++Y+V FID+YSR
Sbjct: 485 SSNMCEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSR 544
Query: 550 FTWIYFLRSKSEVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQ 609
FTW Y L+ KS+ FS+F F VE Q+Q + +F+ + GGE++S++F +L GI
Sbjct: 545 FTWFYPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQCDGGGEFVSYKFVAHLASCGIKQL 604
Query: 610 RSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEALSTVCF 656
SCP+TPQQNG+AER++R+L ++ SL+ + VP + WVEA T F
Sbjct: 605 ISCPHTPQQNGIAERRHRYLTELGLSLMFHSKVPHKLWVEAFFTSNF 651
>emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|4972079|emb|CAB43904.1|
putative protein [Arabidopsis thaliana]
gi|7444467|pir||T08945 hypothetical protein F25O24.20 -
Arabidopsis thaliana
Length = 1415
Score = 252 bits (643), Expect = 7e-65
Identities = 176/667 (26%), Positives = 305/667 (45%), Gaps = 89/667 (13%)
Query: 14 VRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAPTEKAALEE--------------WEY 59
++ + NY W+ QF ++ +L + + P ++ W
Sbjct: 19 LKLSTANYLLWKIQFETWLNNQRLLGFVTGANPCPNATRSIRNGDQVTEATNPDFLTWVQ 78
Query: 60 KDAQIISWILSSIDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLELEIANYKQGN 119
D +I+ W+L S+ + ++ T++E+W L + YN+ +A+++ L+ + +
Sbjct: 79 NDQKIMGWLLGSLSEDALRSVYGLHTSREVWFSLAKKYNRVSASRKSDLQRRLNPVSKNE 138
Query: 120 LYVQEFYSGFLNLWTEHSAIIHADVPKASLAAVQEVYNTSRRDQFLMKLRPEFEVVRGAL 179
+ E+ + + + +I P + V N ++ L+ +++G++
Sbjct: 139 KSMLEYLNCVKQICDQLDSI---GCPVPENEKIFGVLNGLGQEYMLVST-----MIKGSM 190
Query: 180 LNRNPVPSLDTCVGELLREEQRLLTQGTMSHDAFISEPVPVAYAAQSRGKGRDMRQVQCF 239
+ P+ S + V +L+ + +L + + + Y + RG + +
Sbjct: 191 -DTYPM-SFEDVVFKLINFDDKLQNGQSGGNRGRNN------YTTKGRGFPQQISS---- 238
Query: 240 TCKQFGHVARSCTAKFCKYCKQNGHVIFDCPIRPPRRTQYPTQALHATTSSAAPPTITSA 299
G + S T C+ C + GH + C +R + Q+ + + AA
Sbjct: 239 -----GSPSDSGTRPTCQICNKYGHSAYKCW----KRFDHAFQSEDFSKAFAA------- 282
Query: 300 SDGGSLQPEMIQQMVLAALSNMGIHGKSSNVSRPWFLDSGASNHMTGSSEYLHNLHSYHG 359
M + + SN PW DSGA++H+T S+ L + Y G
Sbjct: 283 ---------------------MRVSDQKSN---PWVTDSGATSHITNSTSQLQSAQPYSG 318
Query: 360 NQQIQIADGNKLSITDVGDI-------NSDFQDVLVSPGLASNLLSVGQLV-DNNCNVNF 411
+ + + + L IT +G N +DVLV P + +LLSV +L D C + F
Sbjct: 319 EDSVIVGNSDFLPITHIGSAVLTSNQGNLPLRDVLVCPNITKSLLSVSKLTSDYPCVIEF 378
Query: 412 SRAGCLVQEQVSGKVIAKGPKVGRLFPLQFISSHLSLACNNVLNSYEDWHRKLGHPNSTV 471
G +V+++++ +++ KG + L+ L+ + S E WH +LGHPN V
Sbjct: 379 DSDGVIVKDKLTKQLLTKGTRHNDLYLLENPKFMACYSSRQQATSDEVWHMRLGHPNQDV 438
Query: 472 LSHLFKTGLLGNKQVVC--TASISCPVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMS 529
L L + NK +V T+ C C++ K LPF S +S E +H D+WG +
Sbjct: 439 LQQLLR-----NKAIVISKTSHSLCDACQMGKICKLPFASSDFVSSRLLERVHCDLWGPA 493
Query: 530 PIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVFSMFKKFLTYVETQFQASVKIFRSNSG 589
P+ S ++Y+V FID+YSRFTW Y LR KS+ FS+F F VE Q Q + F+ + G
Sbjct: 494 PVVSSQGFRYYVIFIDNYSRFTWFYPLRLKSDFFSVFLTFQKMVENQCQQKIASFQCDGG 553
Query: 590 GEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVE 649
GE++S++F +L GI SCP TPQQNG+AERK+RH+ ++ S++ Q VP WVE
Sbjct: 554 GEFISNQFVSHLAECGIRQLISCPYTPQQNGIAERKHRHITELGSSMMFQGKVPQFLWVE 613
Query: 650 ALSTVCF 656
A T F
Sbjct: 614 AFYTSNF 620
>ref|XP_475401.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|49328070|gb|AAT58770.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1419
Score = 247 bits (631), Expect = 2e-63
Identities = 135/354 (38%), Positives = 200/354 (56%), Gaps = 31/354 (8%)
Query: 327 SSNVSRP-WFLDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGDINSD--- 382
S+NV+ W LDSGAS HMT L + S +Q ADG L ++ G + +
Sbjct: 480 SANVTDSCWILDSGASFHMTPDISQLQSC-SLTKASSVQTADGTILPVSLQGTLQTKEYT 538
Query: 383 FQDVLVSPGLASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQFI 442
DV P L+ L+SVGQL D C+V F A C V ++ +G ++ G ++ L ++
Sbjct: 539 IPDVFYVPNLSMKLISVGQLTDMKCHVVFDEAACYVLDRATGNLVGAGHRLNGPRGL-YV 597
Query: 443 SSHLSLACN-----------------------NVLNSYEDWHRKLGHPNSTVLSHLFKTG 479
HL L + ++ S+ WH +LGH + LS L + G
Sbjct: 598 LDHLHLPTSTSSGFPGNSASATSITSNSSVYSSLSASFPQWHHRLGHLCGSRLSTLVQQG 657
Query: 480 LLGNKQVVCTASISCPVCKLAKSKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKY 539
+LGN + C CKL K LP+ S R+++ F ++HSDVWG +P S ++Y
Sbjct: 658 VLGNVSI--ETDFVCKGCKLGKQVQLPYRSSMSRSTSPFALVHSDVWGPAPFHSKGGHRY 715
Query: 540 FVTFIDDYSRFTWIYFLRSKSEVFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQE 599
+V F+DD+SR+TWIYF++ +SE++ ++K F + V TQF S+K FRS+SGGEY+S+ F++
Sbjct: 716 YVIFVDDFSRYTWIYFMKHRSELYQVYKSFASMVHTQFSTSIKNFRSDSGGEYLSNAFRQ 775
Query: 600 YLQHKGILSQRSCPNTPQQNGLAERKNRHLLDVTRSLLLQASVPPRFWVEALST 653
L G L Q SCP QNG+AERK+RH+++ R+LL+ + VPP FW EA+ST
Sbjct: 776 LLSDHGTLPQLSCPGAHAQNGVAERKHRHIIETARTLLIASHVPPHFWAEAVST 829
>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
Length = 1453
Score = 239 bits (610), Expect = 5e-61
Identities = 203/751 (27%), Positives = 336/751 (44%), Gaps = 128/751 (17%)
Query: 14 VRFTGKNYPAWEFQFRMYVKGNKLWSHLDDVSKAPTEKAAL--------------EEWEY 59
++ NY W+ QF + +KL ++ P + E W
Sbjct: 19 LKLNDSNYLLWKTQFESLLSCHKLIGFVNGGITPPPRTLNVVTGDTSVDVANPQYESWFC 78
Query: 60 KDAQIISWILSSIDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQLE--LEIANYKQ 117
D I SW+ ++ +++ + + T++++W L +N+ + A+ F L L++ + K
Sbjct: 79 TDQLIRSWLFGTLSEEVLGYVHNLQTSRDIWISLAENFNKSSVAREFTLRRTLQLLSKKD 138
Query: 118 GNL--YVQEFYSGFLNLWTEHSAIIHADVPKASLAAVQEVYNTSRRDQFLMKLRPEFE-- 173
L Y +EF + V A + + V + + FL L E++
Sbjct: 139 KTLSAYCREFIA----------------VCDALSSIGKPVDESMKIFGFLNGLGREYDPI 182
Query: 174 --VVRGALLNRNPVPSLDTCVGELLREEQRLLTQGTM----SHDAFISEPVPVA--YAAQ 225
V++ +L +P P+ + E+ + +L + H AF ++ Y +
Sbjct: 183 TTVIQSSLSKISP-PTFRDVISEVKGFDVKLQSYEESVTANPHMAFNTQRSEYTDNYTSG 241
Query: 226 SRGKGR----DMRQVQCFTCKQFGHVARSCTAK------FCKYCKQNGHVIFDCPIRPPR 275
+RGKGR R ++ + G + C+ C + GH C R
Sbjct: 242 NRGKGRGGYGQNRGRSGYSTRGRGFSQHQTNSNNTGERPVCQICGRTGHTALKCYNRFDH 301
Query: 276 RTQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQMVLAALSNMGIHGKSSNVSRPWF 335
Q ++ +A SL+ S + + W
Sbjct: 302 NYQ----------------SVDTAQAFSSLRV-------------------SDSSGKEWV 326
Query: 336 LDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGD--INSD-----FQDVLV 388
DS A+ H+T S+ L Y+G+ + + DG L IT VG I+SD +VLV
Sbjct: 327 PDSAATAHVTSSTNNLQAASPYNGSDTVLVGDGAYLPITHVGSTTISSDSGTLPLNEVLV 386
Query: 389 SPGLASNLLSVGQLVDNN-CNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQ---FISS 444
P + +LLSV +L D+ C V F + + + KV++KGP+ L+ L+ F++
Sbjct: 387 CPDIQKSLLSVSKLCDDYPCGVYFDANKVCIIDINTQKVVSKGPRSNGLYVLENQEFVAF 446
Query: 445 HLSLACNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCPVCK---LAK 501
+ + C S E WH +LGH NS +L L +K++ S PVC+ + K
Sbjct: 447 YSNRQC---AASEEIWHHRLGHSNSRILQQL-----KSSKEISFNKSRMSPVCEPCQMGK 498
Query: 502 SKTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSE 561
S L F S R + IH D+WG SP+ S +KY+V F+DDYSR++W Y L++KS+
Sbjct: 499 SSKLQFFSSNSRELDLLGRIHCDLWGPSPVVSKQGFKYYVVFVDDYSRYSWFYPLKAKSD 558
Query: 562 VFSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGL 621
F++F F VE QF +K+F+S+ GGE+ S+ +++L GI + SCP TPQQNG+
Sbjct: 559 FFAVFVAFQNLVENQFNTKIKVFQSDGGGEFTSNLMKKHLTDCGIQHRISCPYTPQQNGI 618
Query: 622 AERKNRHLLDVTRSLLLQASVPPRFWVEALSTVCF*LIVFLLQLLNLILLSFVYLNFSLI 681
AERK+RH +++ S++ + P +FWVEA T F L+ +L S N S +
Sbjct: 619 AERKHRHFVELGLSMMFHSHTPLQFWVEAFFTASF---------LSNMLPSPSLGNVSPL 669
Query: 682 IVI------YILLGVCVLSTYPCLKGISLEH 706
+ Y +L V + YPCL+ + EH
Sbjct: 670 EALLKQKPNYAMLRVFGTACYPCLRPLG-EH 699
>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
(gb|U12626). [Arabidopsis thaliana]
gi|25301690|pir||G96722 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana
Length = 1315
Score = 238 bits (606), Expect = 1e-60
Identities = 170/634 (26%), Positives = 296/634 (45%), Gaps = 56/634 (8%)
Query: 35 NKLWSHLDDVSKAPTEKAALEEWEYKDAQIISWILSSIDPQMINNLRSFSTAQEMWNYLK 94
NKL + K + + W ++ + SW+L+S+ ++ ++ F TA +W L
Sbjct: 9 NKLGFVDGSIPKPDDDDPYCKIWRRCNSMVKSWLLNSVSKEIYTSILYFPTAAAIWKDLY 68
Query: 95 RIYNQDNAAKRFQLELEIANYKQGNLYVQEFYSGFLNLWTEHSAIIHADVPKASLAAVQE 154
+++ + + ++L +I + +QGNL + +++ LW E +++ L +E
Sbjct: 69 TRFHKSSLPRLYKLRQQIHSLRQGNLDLSSYHTRTQTLWEELTSLQAVPRTVEDLLIERE 128
Query: 155 VYNTSRRDQFLMKLRPEFEVVRGALLNRNPVPSLDTCVGELLREEQRLLTQGTMSHDAFI 214
T+R FLM L ++ VR +L + +PSL V ++ +++ + +
Sbjct: 129 ---TNRVIDFLMGLNDCYDTVRSQILMKKTLPSLSE-VFNMIDQDETQRSARISTTPGMT 184
Query: 215 SEPVPVAYAAQSRGKGRDMRQVQ----CFTCKQFGHVARSCTAKFCKYCKQNGHVIFDCP 270
S PV+ + D Q + C C + GHV +C K++G +
Sbjct: 185 SSVFPVSNQSSQSALNGDTYQKKERPVCSYCSRPGHVEDTC-------YKKHG---YPTS 234
Query: 271 IRPPRRTQYPTQALHATTSSAAPPTITSASDGGSLQPEMIQQMVLAALSNMGIHGKSSNV 330
+ ++ P+ + +A S TS S G L IQQ+V S + S
Sbjct: 235 FKSKQKFVKPSISANAAIGSEEVVNNTSVST-GDLTTSQIQQLVSFLSSKL---QPPSTP 290
Query: 331 SRPWFLDSGASNHMTGSSEYLHNLHSYHGNQQIQIADGNKLSITDVGDINSDFQDVLVSP 390
+P S+ + SS S H + + + DVL P
Sbjct: 291 VQPEVHSISVSSDPSSSSTVCPISGSVHLGRHLIL------------------NDVLFIP 332
Query: 391 GLASNLLSVGQLVDN-NCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQFIS------ 443
NLLSV L + C + F C++Q+ ++ G +V L+ + S
Sbjct: 333 QFKFNLLSVSSLTKSMGCRIWFDETSCVLQDATRELMVGMGKQVANLYIVDLDSLSHPGT 392
Query: 444 -SHLSLACNNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCPVCKLAKS 502
S +++A + S++ WH++LGHP+ L + + LL + C VC ++K
Sbjct: 393 DSSITVAS---VTSHDLWHKRLGHPSVQKLQPM--SSLLSFPKQKNNTDFHCRVCHISKQ 447
Query: 503 KTLPFPSGAHRASNCFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEV 562
K LPF S +++S F++IH D WG + +H Y+YF+T +DDYSR TW+Y LR+KS+V
Sbjct: 448 KHLPFVSHNNKSSRPFDLIHIDTWGPFSVQTHDGYRYFLTIVDDYSRATWVYLLRNKSDV 507
Query: 563 FSMFKKFLTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLA 622
++ F+T VE QF+ ++K RS++ E F ++ KGI+ SCP TPQQN +
Sbjct: 508 LTVIPTFVTMVENQFETTIKGVRSDNAPEL---NFTQFYHSKGIVPYHSCPETPQQNSVV 564
Query: 623 ERKNRHLLDVTRSLLLQASVPPRFWVEALSTVCF 656
ERK++H+L+V RSL Q+ +P +W + + T +
Sbjct: 565 ERKHQHILNVARSLFFQSHIPISYWGDCILTAVY 598
>dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1499
Score = 226 bits (576), Expect = 4e-57
Identities = 184/684 (26%), Positives = 307/684 (43%), Gaps = 90/684 (13%)
Query: 16 FTGKNYPAWEFQFRMYVKGNKLWSHLDD---VSKAPTEKAAL----EEWEYKDAQIISWI 68
F G++Y W+ + +K KLW +++ + +P AL ++ KD + +
Sbjct: 12 FNGESYGFWKIKMITILKTRKLWDVIENGVTSNSSPETSPALTRERDDQVMKDMMALQIL 71
Query: 69 LSSIDPQMINNLRSFSTAQEMWNYLKRIYNQDNAAKRFQL-----ELEIANYKQGNLYVQ 123
S++ + + S+A E WN L+ + + K L E E ++G +
Sbjct: 72 QSAVSDSIFPRIAPASSATEAWNALEMEFQGSSQVKMINLQTLRREYENLKMEEGET-IN 130
Query: 124 EFYSGFLNLWTEHSAIIHADVPKASLAAVQEVYNTSRRDQFLMKLRPEFEVVRGALLNRN 183
+F + +NL + +H + K+ VQ++ L+ + +F+ + G L
Sbjct: 131 DFTTKLINL--SNQLRVHGE-EKSDYQVVQKI---------LISVPQQFDSIVGVLEQTK 178
Query: 184 PVPSLDTC--VGELLREEQRL-LTQGTMSHDAFISEPVPVAYAAQSRGKGRDMR------ 234
+ +L +G L E+RL L + ++ AF E + SRG+ + +
Sbjct: 179 DLSTLSVTELIGTLKAHERRLNLREDRINEGAFNGEKLG------SRGENKQNKIRHGKT 232
Query: 235 QVQCFTCKQFGHVARSCTAKF--------------CKYCKQNGHVIFDCPIRPPRRTQYP 280
+ C CK+ H C K C C + GH+ DC +R R
Sbjct: 233 NMWCGVCKRNNHNEVDCFRKKSESISQRGGSYERRCYVCDKQGHIARDCKLRKGERAHL- 291
Query: 281 TQALHATTSSAAPPTITSASDGGSLQPEMIQQMVLAALSNMGIHGKSSNVSRPWFLDSGA 340
+I + D + E M+ +A+ I S+ W +DSG
Sbjct: 292 --------------SIEESED----EKEDECHMLFSAVEEKEI---STIGEETWLVDSGC 330
Query: 341 SNHMTGSSEYLHNLHSYHGNQQI-QIADGNKLSITDVGDINSD-------FQDVLVSPGL 392
+NHM S + H + + I +I +G K+ GDI +DVL P L
Sbjct: 331 TNHM--SKDVRHFIALDRSKKIIIRIGNGGKVVSEGKGDIRVSTNKGDHVIKDVLYVPEL 388
Query: 393 ASNLLSVGQLVDNNCNVNFSRAGCLVQEQVSGKVIAKGPKVGRLFPLQFISSHLS--LAC 450
A NLLSV Q++ N V F C++Q+ + G+ I R FP+ + S +A
Sbjct: 389 ARNLLSVSQMISNGYRVIFEDNKCVIQD-LKGRKILDIKMKDRSFPIIWKKSREETYMAF 447
Query: 451 NNVLNSYEDWHRKLGHPNSTVLSHLFKTGLLGNKQVVCTASISCPVCKLAKSKTLPFPSG 510
+ WH++ GH N + + ++ C C++ K FP
Sbjct: 448 EEKEEQTDLWHKRFGHVNYDKIETMQTLKIVEKLPKFEVIKGICAACEMGKQSRRSFPKK 507
Query: 511 AHRASN-CFEMIHSDVWGMSPIASHAHYKYFVTFIDDYSRFTWIYFLRSKSEVFSMFKKF 569
+ +N E+IHSDV G S +YF+TFIDD+SR TW+YFL++KSEV + FK F
Sbjct: 508 SQSNTNKTLELIHSDVCGPMQTESINGSRYFLTFIDDFSRMTWVYFLKNKSEVITKFKIF 567
Query: 570 LTYVETQFQASVKIFRSNSGGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAERKNRHL 629
YVE Q ++ +K R++ GGE++S EF + Q GI + + P +PQQNG+AER+NR L
Sbjct: 568 KPYVENQSESRIKRLRTDGGGEFLSREFIKLCQESGIHHEITTPYSPQQNGVAERRNRTL 627
Query: 630 LDVTRSLLLQASVPPRFWVEALST 653
+++ RS++ + + +FW EA++T
Sbjct: 628 VEMARSMIEEKKLSNKFWAEAIAT 651
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.338 0.147 0.468
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,045,495,047
Number of Sequences: 2540612
Number of extensions: 80012374
Number of successful extensions: 335816
Number of sequences better than 10.0: 4116
Number of HSP's better than 10.0 without gapping: 884
Number of HSP's successfully gapped in prelim test: 3235
Number of HSP's that attempted gapping in prelim test: 324896
Number of HSP's gapped (non-prelim): 8605
length of query: 1343
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1203
effective length of database: 507,674,714
effective search space: 610732680942
effective search space used: 610732680942
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 82 (36.2 bits)
Medicago: description of AC144731.14