
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0378b.9
(768 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value
gb|AAL66754.1| putative copia-like retrotransposon Hopscotch pol...   674   0.0
gb|AAK51235.1| polyprotein [Arabidopsis thaliana]                     625   e-177
ref|XP_507316.1| PREDICTED P0623F08.22 gene product [Oryza sativ...   621   e-176
ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (jap...   615   e-174
emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|32692...   612   e-173
gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (ja...   608   e-172
emb|CAB81170.1| retrotransposon like protein [Arabidopsis thalia...   592   e-167
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas...   588   e-166
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]         585   e-165
gb|AAD43604.1| T3P18.3 [Arabidopsis thaliana] gi|25301688|pir||H...   584   e-165
emb|CAE05956.3| OSJNBb0088C09.15 [Oryza sativa (japonica cultiva...   578   e-163
gb|AAW56918.1| putative polyprotein [Oryza sativa (japonica cult...   570   e-161
gb|AAT40550.1| putative receptor kinase [Solanum demissum]            537   e-151
emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]               535   e-150
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi...   529   e-148
pir||E96608 probable retroelement polyprotein F25P12.89 [importe...   528   e-148
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ...   526   e-147
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi...   525   e-147
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf...   515   e-144
pir||G86301 probable retroelement polyprotein [imported] - Arabi...   513   e-144
>gb|AAL66754.1| putative copia-like retrotransposon Hopscotch polyprotein [Zea mays]
Length = 1313
Score = 674 bits (1740), Expect = 0.0
Identities = 376/850 (44%), Positives = 508/850 (59%), Gaps = 105/850 (12%)
Query: 2 FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
F + GI H ++CP+ HQQNG E KHRHI E GL+LLAHA MPL FW +AF AAYLIN
Sbjct: 441 FFNKIGISHLVSCPHAHQQNGSAERKHRHIVEVGLSLLAHASMPLKFWDEAFLAAAYLIN 500
Query: 62 QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
+ P L+L TP ++L+ K+PDYS L+ FGC+C+P+ RPYN+HKL FRS C FLGYS +
Sbjct: 501 RTPTKILNLDTPFERLFHKQPDYSVLRIFGCVCWPNLRPYNSHKLQFRSKQCVFLGYSSL 560
Query: 122 HKGYNCL-TTEGKVIITRNVLFDETVFPFQFQTSKS-----------SSDIL-------- 161
HKG+ CL + G+V ++R+V FDE FPF S + S D+L
Sbjct: 561 HKGFKCLDVSTGRVYVSRDVTFDENFFPFASLHSNAGARLRSEIQLLSPDLLNPATFNSG 620
Query: 162 LDSSLPTTSVIPLLP---------------------ISNASPNVMPHANSV-MPTVAT-N 198
+DS + + + P ++ + PN P A++V +P A+ +
Sbjct: 621 VDSLIDHAADMSTDPNQISGDNSVQDQAEHTPNMDVLAPSVPNTAPEADAVHIPGSASHS 680
Query: 199 GSAPVVQTTVSDFSQPA*DSGV-VHASAQ-----------------------PTEPPVPP 234
GSAP +T + DS V H SA PT P
Sbjct: 681 GSAPAAPSTAAPSPTCRGDSRVATHPSASHDYVASVDMSRGIAGREESFLTAPTTATSPA 740
Query: 235 SS---------------------NVHSM---TTRAKT----GIHRPKAYAAST------A 260
SS V SM +TR +T G+ +PK Y T
Sbjct: 741 SSEPSGSLPATAADAQVPEPILGGVSSMDPASTRPRTRLQQGVRKPKVYTDGTIRYGCFT 800
Query: 261 LSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDG 320
S P AL +W AM EY ALM N TW LVP GR+ IGCKWV++ K +DG
Sbjct: 801 SSGEPYDLNEALGDVNWKDAMDIEYSALMKNKTWHLVPPKKGRNVIGCKWVYKIKRKADG 860
Query: 321 SINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLN 380
S++++K RLVAKG+ Q G DY++TFSPVVK TIR +LS+A+ W + Q D NAFL+
Sbjct: 861 SLDRYKARLVAKGYKQQYGIDYDDTFSPVVKHATIRIILSIAVSRGWSLCQLDVQNAFLH 920
Query: 381 GELVEEVFMQQPPGFSKGSS-HLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCD 439
G L EEV+MQQPPG+ + + VCKL KALYGLKQAPRAW+ +LS+ L GFQA+K D
Sbjct: 921 GVLEEEVYMQQPPGYEDSTKLNYVCKLDKALYGLKQAPRAWYSRLSNKLLSLGFQASKAD 980
Query: 440 TSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEAT 499
TSLF S T+++LVYVDDII+ S+ A +L++DLN F LKDLG L+YFLGIE
Sbjct: 981 TSLFFYNKGSVTIFVLVYVDDIIVASSTHKATEALLSDLNKEFALKDLGDLNYFLGIEVN 1040
Query: 500 HTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGA-LFPKPLL-YRSVV 557
+D G+IL+Q KY +DLL K M P TP+++ KLS + G+ L K + YRS+V
Sbjct: 1041 KVRD-GIILTQDKYASDLLKKVGMSDCKPISTPLSTSEKLSIHEGSPLGEKDITQYRSIV 1099
Query: 558 GALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSI 617
GAL Y+T+TRP+IAF +NKVCQF+H+P HW AVKRILRY+ GL++ R S +
Sbjct: 1100 GALQYLTLTRPDIAFSVNKVCQFLHAPTTLHWAAVKRILRYIKQCTNLGLHIHRSDSTLV 1159
Query: 618 TAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIM 677
+AF+D+DW DDR+ST G +++G NLVSW+A+KQ V+RSSTE+EY++LA E++
Sbjct: 1160 SAFSDADWAGSVDDRKSTGGFAVFLGSNLVSWSARKQPTVSRSSTESEYKALANATAELI 1219
Query: 678 WLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQVTV 737
W++ LL+E+ + + ++ ++CDNL +L ANPI H +TKH +D HFVR+RVA K + +
Sbjct: 1220 WVQILLTEISIKSPRAAKLWCDNLGAKYLSANPIFHARTKHIEVDYHFVRDRVAKKLLDI 1279
Query: 738 VNIPAPSQVS 747
+P QV+
Sbjct: 1280 EYVPTGDQVA 1289
>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
Length = 1453
Score = 625 bits (1612), Expect = e-177
Identities = 345/776 (44%), Positives = 471/776 (60%), Gaps = 38/776 (4%)
Query: 3 LTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQ 62
LT CGIQHR++CPY QQNG E KHRH E GL+++ H+H PL FW +AF TA++L N
Sbjct: 598 LTDCGIQHRISCPYTPQQNGIAERKHRHFVELGLSMMFHSHTPLQFWVEAFFTASFLSNM 657
Query: 63 LPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMH 122
LP P+L +PL+ L +KP+Y+ L+ FG CYP RP HK RS C FLGY+ +
Sbjct: 658 LPSPSLGNVSPLEALLKQKPNYAMLRVFGTACYPCLRPLGEHKFEPRSLQCVFLGYNSQY 717
Query: 123 KGYNCL-TTEGKVIITRNVLFDETVFPF----QFQTSKSSSDILL--DSSLPTT--SVIP 173
KGY CL G+V I+R+V+FDE FPF QF + S +L SS+P S+IP
Sbjct: 718 KGYRCLYPPTGRVYISRHVIFDEETFPFKQKYQFLVPQYESSLLSAWQSSIPQADQSLIP 777
Query: 174 LLPISNASPNVMPHA--------NSVMPTVATNG---------SAPVVQT-TVSDFSQPA 215
P + + P + T G S +T ++++ +
Sbjct: 778 QAEEGKIESLAKPPSIQKNTIQDTTTQPAILTEGVLNEEEEEDSFEETETESLNEETHTQ 837
Query: 216 *DSGVVHASAQPTEPPVPPSSNVHSMTTRAKTGIHRPKA-YAASTALSSV--PTSAK*AL 272
D V + + P N H MTTR+K GIH+ YA T+ SV P S AL
Sbjct: 838 NDEAEVTVEEEVQQEP----ENTHPMTTRSKAGIHKSNTRYALLTSKFSVEEPKSIDEAL 893
Query: 273 SIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAK 332
+ P W+ A+ +E + + +TW LV + +GC+WVF+TK DGS++K K RLVAK
Sbjct: 894 NHPGWNNAVNDEMRTIHMLHTWSLVQPTEDMNILGCRWVFKTKLKPDGSVDKLKARLVAK 953
Query: 333 GFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQP 392
GFHQ EG DY ETFSPVV+ TIR VL +A W I+Q D +NAFL+GEL E V+M QP
Sbjct: 954 GFHQEEGLDYLETFSPVVRTATIRLVLDVATAKGWNIKQLDVSNAFLHGELKEPVYMLQP 1013
Query: 393 PGF-SKGSSHLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSET 451
PGF + VC+L KALYGLKQAPRAWFD +S+ L FGF +K D SLF +T
Sbjct: 1014 PGFVDQEKPSYVCRLTKALYGLKQAPRAWFDTISNYLLDFGFSCSKSDPSLFTYHKNGKT 1073
Query: 452 LYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQT 511
L +L+YVDDI++TGS + L+ LN F +KDLG+ YFLG+E + +G L L QT
Sbjct: 1074 LVLLLYVDDILLTGSDHNLLQELLMSLNKRFSMKDLGAPSYFLGVEIESSPEG-LFLHQT 1132
Query: 512 KYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIA 571
Y D+L +A M + N PTP+ ++ + NS LFP+P +RS+ G L Y+TITRP+I
Sbjct: 1133 AYAKDILHQAAMSNCNSMPTPLPQHIE-NLNSD-LFPEPTYFRSLAGKLQYLTITRPDIQ 1190
Query: 572 FPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDD 631
F +N +CQ MHSP + +KRILRY+ GT GL++++ ++S+ A++DSDW +
Sbjct: 1191 FAVNFICQRMHSPTTADFGLLKRILRYVKGTIHLGLHIKKNQNLSLVAYSDSDWAGCKET 1250
Query: 632 RRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTS 691
RRST+G C +G NL+SW+AK+Q V++SSTEAEYR+L V E+ WL LL ++ V +
Sbjct: 1251 RRSTTGFCTLLGCNLISWSAKRQETVSKSSTEAEYRALTAVAQELTWLSFLLRDIGVTQT 1310
Query: 692 PSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
+L+ CDNLS V+L ANP LH ++KHF D H++RE+VA V +I A Q++
Sbjct: 1311 HPTLVKCDNLSAVYLSANPALHNRSKHFDTDYHYIREQVALGLVETKHISATLQLA 1366
>ref|XP_507316.1| PREDICTED P0623F08.22 gene product [Oryza sativa (japonica
cultivar-group)]
Length = 821
Score = 621 bits (1601), Expect = e-176
Identities = 343/792 (43%), Positives = 462/792 (58%), Gaps = 89/792 (11%)
Query: 44 MPLTFWGDAFETAAYLINQLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNT 103
MP+ FW +A AAYLIN+ P ++ + PL++L+ +KP+Y+ L+ FGC +P+ RPYN
Sbjct: 1 MPIKFWDEAVLAAAYLINRTPSKVINFACPLEQLFKEKPNYTALRIFGCAVWPNLRPYNK 60
Query: 104 HKLAFRSAPCTFLGYSPMHKGYNCLT-TEGKVIITRNVLFDETVFPFQFQTSKSSSDILL 162
HKLAFRS C FLGYS +HKG+ CL G+V ++R+V FDE++FPF S + + +
Sbjct: 61 HKLAFRSKRCVFLGYSNLHKGFKCLEIATGRVYVSRDVTFDESIFPFSELHSNAGACLRA 120
Query: 163 DSSLPTTSVIPLLP--------------------------------ISNASPNVMPHANS 190
+ SL S++P L ++N N A+
Sbjct: 121 EISLLPPSLVPHLSSLGGEQNNHVLNYPPNVTDQFGEENAEIGEEIVANGEENAAAAADE 180
Query: 191 VMPTVATNG----------------SAPVVQTTVSDFSQ----PA*DSGVVHASAQP--- 227
A G S+PV + ++ P + +V AS Q
Sbjct: 181 NAAAAANGGAQDDVHGVAYDASPEHSSPVTDDATASAAEQHGNPIQEEHLVQASPQTASS 240
Query: 228 TEPPVPPSSNVHSMTT-----------------------RAKTGIHRPKAYAASTAL--- 261
T P V S+ VH T R ++GI + K Y T
Sbjct: 241 TSPSVASSAGVHDDVTTDQSDQTDQAMPEAAVAPIRPKTRLQSGIRKEKVYTDGTVKWLN 300
Query: 262 ---SSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNS 318
S P S + A++ HW +AM EY AL+ N TW LVP GR+ I CKWV++ K +
Sbjct: 301 FTSSGEPQSLEEAVNNKHWKEAMDAEYMALIENKTWHLVPPQKGRNVIDCKWVYKVKRKA 360
Query: 319 DGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAF 378
DGS++++K RLVAKGF Q G DY +TFSPVVK TIR VLSLA+ W +RQ D NAF
Sbjct: 361 DGSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATIRIVLSLAVSRGWSLRQLDVKNAF 420
Query: 379 LNGELVEEVFMQQPPGFSKGSS-HLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAK 437
L+G L EEV+M+QPPG+ K S + VCKL KALYGLKQAPRAW+ +LS+ L+ GF +K
Sbjct: 421 LHGVLEEEVYMEQPPGYEKKSMPNYVCKLDKALYGLKQAPRAWYSRLSTKLSELGFVPSK 480
Query: 438 CDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIE 497
DTSLF ++++L+YVDDII+ S A +L+ +L+ F LKDLG LHYFLGIE
Sbjct: 481 ADTSLFFYKKGQVSIFLLIYVDDIIMASSVPDATSTLLQELSKDFALKDLGDLHYFLGIE 540
Query: 498 ATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALF--PKPLLYRS 555
KDG L+LSQ KY +DLL + M P TP+++ KLS N G L YRS
Sbjct: 541 VHKVKDG-LMLSQEKYASDLLRRVGMYECKPVSTPLSTSEKLSVNEGTLLGPQDSTQYRS 599
Query: 556 VVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSM 615
VVGAL Y+T+TRP+I+F +NKVCQF+H+P HW AVKRILRY+ T GL R S+
Sbjct: 600 VVGALQYLTLTRPDISFSINKVCQFLHAPTTTHWAAVKRILRYVKYTVDTGLKFCRNPSL 659
Query: 616 SITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITE 675
++ F+D+DW PDDRRST G +++G NLVSW+A+KQ V+RSSTEAEY++LA E
Sbjct: 660 LVSGFSDADWAGSPDDRRSTGGFAVFLGPNLVSWSARKQATVSRSSTEAEYKALANATAE 719
Query: 676 IMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQV 735
IMW+++LL EL V + ++ ++CDNL +L ANPI H +TKH +D HFVRERVA K +
Sbjct: 720 IMWVQTLLQELGVESPRAAKLWCDNLGAKYLSANPIFHARTKHIEVDFHFVRERVARKLL 779
Query: 736 TVVNIPAPSQVS 747
+ I QV+
Sbjct: 780 EIAYISTKDQVA 791
>ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1090
Score = 615 bits (1587), Expect = e-174
Identities = 341/792 (43%), Positives = 473/792 (59%), Gaps = 46/792 (5%)
Query: 2 FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
FL G Q RL+CPY QNG+ E R I + LL A MP ++W +A TA YL+N
Sbjct: 261 FLASRGTQLRLSCPYTSPQNGKAERMLRTINNSIRTLLIQASMPPSYWAEALATATYLLN 320
Query: 62 QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
+ P ++ S P Q L+ PD+S L+ FGCLCYP+ HKL+ RS C FLGY
Sbjct: 321 RRPSSSIHQSLPFQLLHRTIPDFSHLRVFGCLCYPNLSATTPHKLSPRSTACVFLGYPTS 380
Query: 122 HKGYNCLT-TEGKVIITRNVLFDETVFPFQFQTSKSSS-DILLDSSLPTTSVIPLLPISN 179
HKGY CL + ++II+R+V+FDE+ FPF +SS D LL P + P L +
Sbjct: 381 HKGYRCLDLSTHRIIISRHVVFDESQFPFAATPPAASSFDFLLQGLSPADA--PSLEVEQ 438
Query: 180 ASP-NVMPHANSVMP--------------TVAT---NGSAPVVQTTVSDFSQPA*D---- 217
P V P P TVA+ + AP+V T+ +D + P
Sbjct: 439 PRPLTVAPSTEVEQPYLPLPSRRLSAGTVTVASEAPSAGAPLVGTSSADATPPGSATRAS 498
Query: 218 ---SGVVHASAQPTEPPVPPSSNV-----------HSMTTRAKTGIHRPK---AYAASTA 260
S H + VPPSS+ HSM TR+++G RP Y A+ A
Sbjct: 499 TIVSPFRHVYTRRPVTTVPPSSSTAVTNAVAAPQPHSMVTRSQSGSLRPVDRLTYTATQA 558
Query: 261 LSS-VPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSD 319
+S VP + AL+ P+W AM +EY+ L++N TW LV + KW+F+ K++SD
Sbjct: 559 AASPVPANYHSALADPNWRAAMADEYKELVDNGTWRLVSRPPRANIATGKWIFKHKFHSD 618
Query: 320 GSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFL 379
GS+ ++K R V +G+ Q G DY+ETFSPVVK TIR VLS+A WPI Q D NAFL
Sbjct: 619 GSLARYKARWVVRGYSQQHGIDYDETFSPVVKLATIRVVLSIAASRAWPIHQLDVKNAFL 678
Query: 380 NGELVEEVFMQQPPGFSKGSS-HLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKC 438
+G L E V+ QQP GF ++ VC L+K+LYGLKQAPRAW+ + ++ + + GF +
Sbjct: 679 HGHLKETVYCQQPSGFVDPTAPDAVCLLQKSLYGLKQAPRAWYQRFATYIRQMGFMPSAS 738
Query: 439 DTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEA 498
DTSLF Y+L+YVDDII+T S++T + L A L++ F + DLG LH+FLGI
Sbjct: 739 DTSLFVYKDGDRIAYLLLYVDDIILTASTTTLLQQLTARLHSEFAMTDLGDLHFFLGISV 798
Query: 499 THTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALFPKPLLYRSVVG 558
+ DG L LSQ +Y DLL +A M + + TP+ + KLSA G P YRS+ G
Sbjct: 799 KRSPDG-LFLSQRQYAVDLLQRAGMAECHSTSTPVDTHAKLSATDGLPVADPSAYRSIAG 857
Query: 559 ALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSIT 618
AL Y+T+TRP++A+ + +VC FMH P EPH VKRILRY+ G+ + GL++ S+T
Sbjct: 858 ALQYLTLTRPDLAYAVQQVCLFMHDPREPHLALVKRILRYVKGSLSIGLHIGSGPIQSLT 917
Query: 619 AFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIMW 678
A++D+DW P+ RRSTSG C+Y+G NLVSW++K+QT V+RSS EAEYR++A + E W
Sbjct: 918 AYSDADWAGCPNSRRSTSGYCVYLGDNLVSWSSKRQTTVSRSSAEAEYRAVAHAVAECCW 977
Query: 679 LRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQVTVV 738
LR LL EL VP + ++++YCDN+S V++ ANP+ H +TKH +D+HFVRE+VA QV V+
Sbjct: 978 LRQLLQELHVPIASATIVYCDNVSAVYMTANPVHHRRTKHIEIDIHFVREKVALGQVRVL 1037
Query: 739 NIPAPSQVSSTL 750
+P+ Q + +
Sbjct: 1038 YVPSSHQFADIM 1049
>emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|3269282|emb|CAA19715.1|
putative protein [Arabidopsis thaliana]
gi|7444417|pir||T05745 hypothetical protein M4I22.20 -
Arabidopsis thaliana
Length = 1318
Score = 612 bits (1578), Expect = e-173
Identities = 347/789 (43%), Positives = 460/789 (57%), Gaps = 54/789 (6%)
Query: 7 GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
GIQ +L+CP+ QQNG E KHRH+ E GL++L +H+P FW +AF TA +LIN LP
Sbjct: 414 GIQQQLSCPHTPQQNGLAERKHRHLVELGLSMLFQSHVPHKFWVEAFFTANFLINLLPTS 473
Query: 67 TLDLS-TPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGY 125
L S +P +KLYDKKPDY+ L+ FG C+P R Y +K S C FLGY+ +KGY
Sbjct: 474 ALKESISPYEKLYDKKPDYTSLRSFGSACFPTLRDYAENKFNPCSLKCVFLGYNEKYKGY 533
Query: 126 NCL-TTEGKVIITRNVLFDETVFPFQF----------------------------QTSKS 156
CL G++ I+R+V+FDE+V+PF TS S
Sbjct: 534 RCLYPPTGRLYISRHVIFDESVYPFSHTYKHLHPQPRTPLLAAWLRSSDSPAPSTSTSPS 593
Query: 157 SSDILLDSS----LP--TTSVIP-LLPISNASPNVMPHANSV----MPTVATNGSAPVVQ 205
S L S+ LP T ++P L+PIS+ S HA+++ P + +
Sbjct: 594 SRSPLFTSADFPPLPQRKTPLLPTLVPISSVS-----HASNITTQQSPDFDSERTTDFDS 648
Query: 206 TTVSD---FSQPA*DSGVVHASAQPTEPPVPPSSNVHSMTTRAKTGIHRPK---AYAAST 259
++ D SQ DS A S+NVH M TRAK GI +P + +
Sbjct: 649 ASIGDSSHSSQAGSDSEETIQQASVNVHQTHASTNVHPMVTRAKVGISKPNPRYVFLSHK 708
Query: 260 ALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSD 319
P + AL P W AM EE TW LVP +G KWVFRTK ++D
Sbjct: 709 VSYPEPKTVTAALKHPGWTGAMTEEIGNCSETQTWSLVPYKSDMHVLGSKWVFRTKLHAD 768
Query: 320 GSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFL 379
G++NK K R+VAKGF Q EG DY ET+SPVV+ T+R VL LA W I+Q D NAFL
Sbjct: 769 GTLNKLKARIVAKGFLQEEGIDYLETYSPVVRTPTVRLVLHLATALNWDIKQMDVKNAFL 828
Query: 380 NGELVEEVFMQQPPGFSKGSS-HLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKC 438
+G+L E V+M QP GF S VC L K++YGLKQ+PRAWFDK S+ L FGF +K
Sbjct: 829 HGDLKETVYMTQPAGFVDPSKPDHVCLLHKSIYGLKQSPRAWFDKFSTFLLEFGFFCSKS 888
Query: 439 DTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEA 498
D SLF + + +L+YVDD++ITG+SS + SL+A LN F + D+G LHYFLGI+
Sbjct: 889 DPSLFIYAHNNNLILLLLYVDDMVITGNSSQTLTSLLAALNKEFRMTDMGQLHYFLGIQ- 947
Query: 499 THTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALFPKPLLYRSVVG 558
+ GL +SQ KY DLLI A M+ P PTP+ L + LF P +RS+ G
Sbjct: 948 VQRQQNGLFMSQQKYAEDLLIAASMEHCTPLPTPLPVQLDRVPHQEELFSDPTYFRSIAG 1007
Query: 559 ALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSIT 618
L Y+T+TRP+I F +N VCQ MH P + +KRILRY+ GT T G+ R S +
Sbjct: 1008 KLQYLTLTRPDIQFAVNFVCQKMHQPTISDFHLLKRILRYIKGTITMGISYSRDSPTLLQ 1067
Query: 619 AFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIMW 678
A++DSDWG+ RRS G+C ++G NLVSW++KK V+RSSTEAEY+SL+ +EI+W
Sbjct: 1068 AYSDSDWGNCKQTRRSVGGLCTFMGTNLVSWSSKKHPTVSRSSTEAEYKSLSDAASEILW 1127
Query: 679 LRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQVTVV 738
L +LL ELR+P + ++CDNLS V+L ANP H +TKHF +D HFVRERVA K + V
Sbjct: 1128 LSTLLRELRIPLPDTPELFCDNLSAVYLTANPAFHARTKHFDIDFHFVRERVALKALVVK 1187
Query: 739 NIPAPSQVS 747
+IP Q++
Sbjct: 1188 HIPGSEQIA 1196
>gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|37530764|ref|NP_919684.1| putative
copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|20042923|gb|AAM08751.1| Putative
copia-type polyprotein [Oryza sativa (japonica
cultivar-group)]
Length = 1803
Score = 608 bits (1569), Expect = e-172
Identities = 335/774 (43%), Positives = 460/774 (59%), Gaps = 44/774 (5%)
Query: 11 RLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIPTLDL 70
RL+CPY QQNG+ E R I + +L H+ PL+FW +A +TA +LIN+ P
Sbjct: 645 RLSCPYSSQQNGKAERILRTINDCVRTMLVHSAAPLSFWAEALQTAMHLINRRPCRATGS 704
Query: 71 STPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGYNCLTT 130
P Q L P Y L+ FGCLCYP+T HKL+ RS C F+GY H+GY C
Sbjct: 705 LKPYQLLLGAPPTYDHLRVFGCLCYPNTIATAPHKLSPRSLACVFIGYPADHRGYRCYDM 764
Query: 131 EGKVIIT-RNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNASPNVMPHAN 189
+ + T R+V F E VFPF+ D+ P S P P + ++
Sbjct: 765 VSRRVFTSRHVTFVEDVFPFR------------DAPSPRPSAPP--PPDHGDDTIV---- 806
Query: 190 SVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTE----PPVPPSSNV------- 238
++P A + PV D + P + +SA P PP P +S+
Sbjct: 807 -LLPAPAQHVVTPVGTAPAHDAASPPSPASSTPSSAAPAHDVAPPPSPETSSPASASPPR 865
Query: 239 HSMTTRAKTGIHRPK---AYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWE 295
H+MTTRA+ GI +P A A++ LS P+S + AL P+W AMQ E+ AL+ N TW
Sbjct: 866 HAMTTRARAGISKPNPRYAMTATSTLSPTPSSVRVALRDPNWRAAMQAEFDALLANRTWT 925
Query: 296 LVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTI 355
LVP G I KWVF+TK ++DGS++K+K R V +GF+Q G D+ ETFSPVVKP TI
Sbjct: 926 LVPRPPGARIITGKWVFKTKLHADGSLDKYKARWVVRGFNQRPGVDFGETFSPVVKPATI 985
Query: 356 RTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFSKGSSHL-VCKLKKALYGLK 414
RTVL+L QWP Q D +NAFL+G L E V QQP GF + VC L ++LYGL+
Sbjct: 986 RTVLTLISSKQWPAHQLDVSNAFLHGHLQERVLCQQPTGFEDAARPADVCLLSRSLYGLR 1045
Query: 415 QAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSL 474
QAPRAWF + + GF ++ D SLF S+T Y+L+YVDD+I++ SSS+ + +
Sbjct: 1046 QAPRAWFKRFADHATSLGFVQSRADPSLFVLRRGSDTAYLLLYVDDMILSASSSSLLQRI 1105
Query: 475 IADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMT 534
I L A F +KD+G L YFLGIE T DG +LSQ+KY D+L +A M + TP
Sbjct: 1106 IDRLQAEFKVKDMGPLKYFLGIEVQRTADG-FVLSQSKYATDVLERAGMANCKAVATPAD 1164
Query: 535 SGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKR 594
+ KLS++ G LF YRS+ GAL Y+T+TRP+IA+ + +VC MH+P E H +KR
Sbjct: 1165 AKPKLSSDEGPLFQDSSWYRSIAGALQYLTLTRPDIAYAVQQVCLHMHAPREAHVTLLKR 1224
Query: 595 ILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQ 654
ILRY+ GTA GL+LR +S ++TAF+D+DW PD RRSTSG CI++G +L+SW++K+Q
Sbjct: 1225 ILRYIKGTAAFGLHLRASTSPTLTAFSDADWAGCPDTRRSTSGFCIFLGDSLISWSSKRQ 1284
Query: 655 TVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHT 714
T V+RSS EAEYR +A + E WLR LL EL +++ YCDN+S V++ NP+ H
Sbjct: 1285 TTVSRSSAEAEYRGVANAVAECTWLRQLLGELHCRVPQATIAYCDNISSVYMSKNPVHHK 1344
Query: 715 KTKHFALDLHFVRERVADKQVTVVNIPAPSQ--------VSSTLFHCFKTKLRV 760
+TKH LD+HFVRE+VA ++ V+ IP+ Q + S++F+ F+ L V
Sbjct: 1345 RTKHIELDIHFVREKVALGELRVLPIPSAHQFADVFTKGLPSSMFNEFRASLCV 1398
>emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana]
gi|4539447|emb|CAB40035.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444419|pir||T04204
hypothetical protein T4F9.150 - Arabidopsis thaliana
Length = 1515
Score = 592 bits (1527), Expect = e-167
Identities = 337/808 (41%), Positives = 474/808 (57%), Gaps = 66/808 (8%)
Query: 3 LTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQ 62
L CGI+ ++CP+ QQNG E +HR++TE GL+L+ H+ +P W +AF T+ +L N
Sbjct: 596 LASCGIKQLISCPHTPQQNGIAERRHRYLTELGLSLMFHSKVPHKLWVEAFFTSNFLSNL 655
Query: 63 LPIPTL-DLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
LP TL D +P + L+ P Y+ L+ FG CYP+ RPY +K +S C FLGY+
Sbjct: 656 LPSSTLSDNKSPYEMLHGTPPVYTALRVFGSACYPYLRPYAKNKFDPKSLLCVFLGYNNK 715
Query: 122 HKGYNCL-TTEGKVIITRNVLFDETVFPF-----QFQT----------SKSSSDILLDSS 165
+KGY CL GKV I R+VLFDE FP+ QFQT K S L
Sbjct: 716 YKGYRCLHPPTGKVYICRHVLFDERKFPYSDIYSQFQTISGSPLFTAWQKGFSSTALSRE 775
Query: 166 LPTTSV----IPLLPISNA-----SPNVMPHANS------------VMPTVATNGSAPVV 204
P+T+V P +S++ +PN+ A + V P+ T+ S P
Sbjct: 776 TPSTNVEDIIFPSATVSSSVPTGCAPNIAETATAPDVDVAAAHDMVVPPSPITSTSLPTQ 835
Query: 205 -QTTVSDFSQPA*DSGVVHASAQPTEP---------PVPPSSNV-----------HSMTT 243
+ + SD + + DS +SA + PP +V H M T
Sbjct: 836 PEESTSDQNHYSTDSETAISSAMTPQSINVSLFEDSDFPPLQSVISSTTAAPETSHPMIT 895
Query: 244 RAKTGIHRPKAYAASTALSS---VPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLL 300
RAK+GI +P A ++ S P S K AL W AM EE + +TW+LVP
Sbjct: 896 RAKSGITKPNPKYALFSVKSNYPEPKSVKEALKDEGWTNAMGEEMGTMHETDTWDLVPPE 955
Query: 301 HGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLS 360
+GCKWVF+TK NSDGS+++ K RLVA+G+ Q EG DY ET+SPVV+ T+R++L
Sbjct: 956 MVDRLLGCKWVFKTKLNSDGSLDRLKARLVARGYEQEEGVDYVETYSPVVRSATVRSILH 1015
Query: 361 LALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFSKGSS-HLVCKLKKALYGLKQAPRA 419
+A +++W ++Q D NAFL+ EL E VFM QPPGF S VCKLKKA+Y LKQAPRA
Sbjct: 1016 VATINKWSLKQLDVKNAFLHDELKETVFMTQPPGFEDPSRPDYVCKLKKAIYDLKQAPRA 1075
Query: 420 WFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLN 479
WFDK SS L ++GF + D SLF + +++L+YVDD+I+TG++ + L+ L+
Sbjct: 1076 WFDKFSSYLLKYGFICSFSDPSLFVYLKGRDVMFLLLYVDDMILTGNNDVLLQQLLNILS 1135
Query: 480 AVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKL 539
F +KD+G+LHYFLGI+A H + GL LSQ KY +DLL+ A M + PTP+ L L
Sbjct: 1136 TEFRMKDMGALHYFLGIQA-HYHNDGLFLSQEKYTSDLLVNAGMSDCSSMPTPLQ--LDL 1192
Query: 540 SANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYL 599
+ FP+P +R + G L Y+T+TRP+I F +N VCQ MH+P + +KRIL YL
Sbjct: 1193 LQGNNKPFPEPTYFRRLAGKLQYLTLTRPDIQFAVNFVCQKMHAPTMSDFHLLKRILHYL 1252
Query: 600 HGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVAR 659
GT T G+ L + + ++DSDW D RRST G C ++G N++SW+AK+ V++
Sbjct: 1253 KGTMTMGINLSSNTDSVLRCYSDSDWAGCKDTRRSTGGFCTFLGYNIISWSAKRHPTVSK 1312
Query: 660 SSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHF 719
SSTEAEYR+L+ +E+ W+ LL E+ +P +YCDNLS V+L ANP LH+++KHF
Sbjct: 1313 SSTEAEYRTLSFAASEVSWIGFLLQEIGLPQQQIPEMYCDNLSAVYLSANPALHSRSKHF 1372
Query: 720 ALDLHFVRERVADKQVTVVNIPAPSQVS 747
+D ++VRERVA +TV +IPA Q++
Sbjct: 1373 QVDYYYVRERVALGALTVKHIPASQQLA 1400
>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
Arabidopsis thaliana BAC gb|AF080119 and is a member of
the reverse transcriptase family PF|00078
gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
Arabidopsis thaliana
Length = 1415
Score = 588 bits (1517), Expect = e-166
Identities = 327/754 (43%), Positives = 450/754 (59%), Gaps = 17/754 (2%)
Query: 3 LTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQ 62
L++ GI HR++CPY QQNG E KHRH+ E GL++L H+H P FW ++F TA Y+IN+
Sbjct: 595 LSEHGIHHRISCPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQKFWVESFFTANYIINR 654
Query: 63 LPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMH 122
LP L +P + L+ +KPDYS L+ FG CYP RP +K RS C FLGY+ +
Sbjct: 655 LPSSVLKNLSPYEALFGEKPDYSSLRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYNSQY 714
Query: 123 KGYNCL-TTEGKVIITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNAS 181
KGY C GKV I+RNV+F+E+ PF+ + ++ S P IS S
Sbjct: 715 KGYRCFYPPTGKVYISRNVIFNESELPFK----EKYQSLVPQYSTPLLQAWQHNKISEIS 770
Query: 182 PNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGV---VHASAQPTEPPVPPSSNV 238
P P + V ++D + + G V+ A+ N
Sbjct: 771 VPAAPVQLFSKPIDLNTYAGSQVTEQLTDPEPTSNNEGSDEEVNPVAEEIAANQEQVINS 830
Query: 239 HSMTTRAKTGIHRPKA-YAASTALSSV--PTSAK*ALSIPHWHQAMQEEYQALMNNNTWE 295
H+MTTR+K GI +P YA T+ + P + A+ P W++A+ EE + +TW
Sbjct: 831 HAMTTRSKAGIQKPNTRYALITSRMNTAEPKTLASAMKHPGWNEAVHEEINRVHMLHTWS 890
Query: 296 LVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTI 355
LVP + + KWVF+TK + DGSI+K K RLVAKGF Q EG DY ETFSPVV+ TI
Sbjct: 891 LVPPTDDMNILSSKWVFKTKLHPDGSIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATI 950
Query: 356 RTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGF--SKGSSHLVCKLKKALYGL 413
R VL ++ WPI+Q D +NAFL+GEL E VFM QP GF + +H VC+L KA+YGL
Sbjct: 951 RLVLDVSTSKGWPIKQLDVSNAFLHGELQEPVFMYQPSGFIDPQKPTH-VCRLTKAIYGL 1009
Query: 414 KQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFS 473
KQAPRAWFD S+ L +GF +K D SLF + LY+L+YVDDI++TGS + +
Sbjct: 1010 KQAPRAWFDTFSNFLLDYGFVCSKSDPSLFVCHQDGKILYLLLYVDDILLTGSDQSLLED 1069
Query: 474 LIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPM 533
L+ L F +KDLG YFLGI+ +G L L QT Y D+L +A M NP PTP+
Sbjct: 1070 LLQALKNRFSMKDLGPPRYFLGIQIEDYANG-LFLHQTAYATDILQQAGMSDCNPMPTPL 1128
Query: 534 TSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVK 593
+L + LF +P +RS+ G L Y+TITRP+I F +N +CQ MHSP + +K
Sbjct: 1129 PQ--QLDNLNSELFAEPTYFRSLAGKLQYLTITRPDIQFAVNFICQRMHSPTTSDFGLLK 1186
Query: 594 RILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKK 653
RILRY+ GT GL ++R S+++++A++DSD + RRST+G CI +G NL+SW+AK+
Sbjct: 1187 RILRYIKGTIGMGLPIKRNSTLTLSAYSDSDHAGCKNTRRSTTGFCILLGSNLISWSAKR 1246
Query: 654 QTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILH 713
Q V+ SSTEAEYR+L EI W+ LL +L +P + +YCDNLS V+L ANP LH
Sbjct: 1247 QPTVSNSSTEAEYRALTYAAREITWISFLLRDLGIPQYLPTQVYCDNLSAVYLSANPALH 1306
Query: 714 TKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
++KHF D H++RE+VA + +I A Q++
Sbjct: 1307 NRSKHFDTDYHYIREQVALGLIETQHISATFQLA 1340
>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 585 bits (1508), Expect = e-165
Identities = 328/761 (43%), Positives = 446/761 (58%), Gaps = 23/761 (3%)
Query: 7 GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
GI HR++CPY QQNG E KHRH+ E GL++L H+H PL FW +AF TA YL N LP
Sbjct: 601 GIHHRISCPYTPQQNGVAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLSNLLPSS 660
Query: 67 TLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGYN 126
L +P + L+ +K DY+ L+ FG CYP RP +K RS C FLGY +KGY
Sbjct: 661 VLKEISPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYR 720
Query: 127 CL-TTEGKVIITRNVLFDETVFPFQ---------FQTSKSSSDILLDSSLPTTSVIPLLP 176
CL GKV I+R+V+FDE FPF+ +QT+ + D + P+ L P
Sbjct: 721 CLYPPTGKVYISRHVIFDEAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTPPSVPSSQLQP 780
Query: 177 ISNASPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*D------SGVVHASAQPTEP 230
++ + N M T + V T SD + D + V++ +
Sbjct: 781 LARQMTPMATSENQPMMNYETEEAVNVNMETSSDEETESNDEFDHEVAPVLNDQNEDNAL 840
Query: 231 PVPPSSNVHSMTTRAKTGIHRPKA-YAASTALSSV--PTSAK*ALSIPHWHQAMQEEYQA 287
N+H M TR+K GI +P YA + SS P + A+ P W+ A+ +E
Sbjct: 841 GQGSLENLHPMITRSKDGIQKPNPRYALIVSKSSFDEPKTITTAMKHPSWNAAVMDEIDR 900
Query: 288 LMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFS 347
+ NTW LVP + + KWVF+TK DG+I+K K RLVAKGF Q EG DY ETFS
Sbjct: 901 IHMLNTWSLVPATEDMNILTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFS 960
Query: 348 PVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGF-SKGSSHLVCKL 406
PVV+ TIR VL A ++WP++Q D +NAFL+GEL E VFM QP GF + VC+L
Sbjct: 961 PVVRTATIRLVLDTATANEWPLKQLDVSNAFLHGELQEPVFMFQPSGFVDPNKPNHVCRL 1020
Query: 407 KKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGS 466
KALYGLKQAPRAWFD S+ L FGF+ + D SLF ++L +L+YVDDI++TGS
Sbjct: 1021 TKALYGLKQAPRAWFDTFSNFLLDFGFECSTSDPSLFVCHQNGQSLILLLYVDDILLTGS 1080
Query: 467 SSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSI 526
+ L+ LN F +KDLG YFLGIE + + GL L Q Y +D+L +A M
Sbjct: 1081 DQLLMDKLLQALNNRFSMKDLGPPRYFLGIEI-ESYNNGLFLHQHAYASDILHQAGMTEC 1139
Query: 527 NPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLE 586
NP PTP+ L+ NS F +P +RS+ G L Y+TITRP+I + +N +CQ MH+P
Sbjct: 1140 NPMPTPLPQHLE-DLNSEP-FEEPTYFRSLAGKLQYLTITRPDIQYAVNFICQRMHAPTN 1197
Query: 587 PHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNL 646
+ +KRILRY+ GT GL +R+ + ++ F DSD+ D RRST+G CI +G L
Sbjct: 1198 SDFGLLKRILRYVKGTINMGLPIRKHHNPVLSGFCDSDYAGCKDTRRSTTGFCILLGSTL 1257
Query: 647 VSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHL 706
+SW+AK+Q ++ SSTEAEYR+L+ EI W+ SLL +L + + ++CDNLS V+L
Sbjct: 1258 ISWSAKRQPTISHSSTEAEYRALSDTAREITWISSLLRDLGISQHQPTRVFCDNLSAVYL 1317
Query: 707 VANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
ANP LH ++KHF D H++RERVA + +IPA Q++
Sbjct: 1318 SANPALHKRSKHFDKDFHYIRERVALGLIETQHIPATIQLA 1358
>gb|AAD43604.1| T3P18.3 [Arabidopsis thaliana] gi|25301688|pir||H96650 protein
T3P18.3 [imported] - Arabidopsis thaliana
Length = 1309
Score = 584 bits (1506), Expect = e-165
Identities = 328/761 (43%), Positives = 446/761 (58%), Gaps = 23/761 (3%)
Query: 7 GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
GI HR++CPY QQNG E KHRH+ E GL++L H+H PL FW +AF TA YL N LP
Sbjct: 444 GIHHRISCPYTPQQNGVAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLSNLLPSS 503
Query: 67 TLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGYN 126
L +P + L+ +K DY+ L+ FG CYP RP +K RS C FLGY +KGY
Sbjct: 504 VLKEISPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYR 563
Query: 127 CL-TTEGKVIITRNVLFDETVFPFQ---------FQTSKSSSDILLDSSLPTTSVIPLLP 176
CL GKV I+R+V+FDE FPF+ +QT+ + D + P+ L P
Sbjct: 564 CLYPPTGKVYISRHVIFDEAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTPPSVPSSQLQP 623
Query: 177 ISNASPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*D------SGVVHASAQPTEP 230
++ + N M T + V T SD + D + V++ +
Sbjct: 624 LARQVTPMATSENQPMMNYETEEAVNVNMETSSDEETESNDEFDHEVAPVLNDQNEDNAL 683
Query: 231 PVPPSSNVHSMTTRAKTGIHRPKA-YAASTALSSV--PTSAK*ALSIPHWHQAMQEEYQA 287
N+H M TR+K GI +P YA + SS P + A+ P W+ A+ +E
Sbjct: 684 GQGSLENLHPMITRSKDGIQKPNPRYALIVSKSSFDEPKTITTAMKHPGWNAAVMDEIDR 743
Query: 288 LMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFS 347
+ NTW LVP + + KWVF+TK DG+I+K K RLVAKGF Q EG DY ETFS
Sbjct: 744 IHMLNTWSLVPATEDMNILTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFS 803
Query: 348 PVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGF-SKGSSHLVCKL 406
PVV+ TIR VL A ++WP++Q D +NAFL+GEL E VFM QP GF + VC+L
Sbjct: 804 PVVRTATIRLVLDTATANEWPLKQLDVSNAFLHGELQEPVFMFQPSGFVDPNKPNHVCRL 863
Query: 407 KKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGS 466
KALYGLKQAPRAWFD S+ L FGF+ + D SLF ++L +L+YVDDI++TGS
Sbjct: 864 TKALYGLKQAPRAWFDTFSNFLLDFGFECSTSDPSLFVCHQNGQSLILLLYVDDILLTGS 923
Query: 467 SSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSI 526
+ L+ LN F +KDLG YFLGIE + + GL L Q Y +D+L +A M
Sbjct: 924 DQLLMDKLLQALNNRFSMKDLGPPRYFLGIEI-ESYNNGLFLHQHAYASDILHQAGMTEC 982
Query: 527 NPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLE 586
NP PTP+ L+ NS F +P +RS+ G L Y+TITRP+I + +N +CQ MH+P
Sbjct: 983 NPMPTPLPQHLE-DLNSEP-FEEPTYFRSLAGKLQYLTITRPDIQYAVNFICQRMHAPTN 1040
Query: 587 PHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNL 646
+ +KRILRY+ GT GL +R+ + ++ F DSD+ D RRST+G CI +G L
Sbjct: 1041 SDFGLLKRILRYVKGTINMGLPIRKHHNPVLSGFCDSDYAGCKDTRRSTTGFCILLGSTL 1100
Query: 647 VSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHL 706
+SW+AK+Q ++ SSTEAEYR+L+ EI W+ SLL +L + + ++CDNLS V+L
Sbjct: 1101 ISWSAKRQPTISHSSTEAEYRALSDTAREITWISSLLRDLGISQHQPTRVFCDNLSAVYL 1160
Query: 707 VANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
ANP LH ++KHF D H++RERVA + +IPA Q++
Sbjct: 1161 SANPALHKRSKHFDKDFHYIRERVALGLIETQHIPATIQLA 1201
>emb|CAE05956.3| OSJNBb0088C09.15 [Oryza sativa (japonica cultivar-group)]
gi|32487794|emb|CAE05417.1| OSJNBa0035I04.5 [Oryza sativa
(japonica cultivar-group)]
Length = 1246
Score = 578 bits (1490), Expect = e-163
Identities = 324/760 (42%), Positives = 445/760 (57%), Gaps = 21/760 (2%)
Query: 2 FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
FL G RL+CPY QNG+ E R + + LL A MP ++W + TA YL+N
Sbjct: 458 FLAGRGSLLRLSCPYTSPQNGKAERMIRTLNNSIRTLLLQASMPPSYWAEGLATATYLLN 517
Query: 62 QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
+ P +++ S P Q L+ K P+YS L+ FGCLCYP+ HKLA SA C FLGY
Sbjct: 518 RRPSSSVNNSIPFQLLHRKIPNYSMLRVFGCLCYPNLSATAAHKLAPYSAACVFLGYPSS 577
Query: 122 HKGYNCLT-TEGKVIITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNA 180
HKGY CL + ++II+ +V+FDET FPF +SS L P SVI
Sbjct: 578 HKGYCCLNISTRRIIISCHVIFDETQFPFSGDPVDASSLDFLLQDAPAPSVIAPSLAGVE 637
Query: 181 SPNVMPHA------NSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASA--------Q 226
P+ +PHA +PT A + + V + D G H + Q
Sbjct: 638 QPH-LPHAPFPVNVEQRLPTGAPSTKDEHLPYYVQPAAHCGQDDGKFHTAGCHVSIFLKQ 696
Query: 227 PTEPPVPPSSNVHSMTTRAKTGIHRPKAYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQ 286
T P+ +V T+R I P S+ + P AM EE++
Sbjct: 697 GTIVTSYPAVHVFFTTSRHGGAIPVPLPTTTGADDSAPSHRCGCTHTSP---AAMAEEFK 753
Query: 287 ALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETF 346
AL++N TW LVP G + + KW+F+ K++SDGS+ +HK R V G+ Q G DY+ETF
Sbjct: 754 ALIDNGTWRLVPRPPGANVVTGKWIFKHKFHSDGSLARHKARWVVHGYSQQHGIDYDETF 813
Query: 347 SPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGF-SKGSSHLVCK 405
SPVVKP+TIR VLS+A WPI Q D NAFL+G L E V+ QQ GF + VC
Sbjct: 814 SPVVKPSTIRVVLSIATSRSWPIHQLDVKNAFLHGTLDETVYCQQSSGFIDPAAPDAVCL 873
Query: 406 LKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITG 465
L+++LYGLKQAPRAW+ + ++ + + GF + DTSLF Y+L+YVDDII+
Sbjct: 874 LQRSLYGLKQAPRAWYQRFATYIRQLGFTPSASDTSLFILRDGDRLAYLLLYVDDIILMA 933
Query: 466 SSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDS 525
SS+ + + A L+ F + DLG LH+FLGI T D L LSQ +Y DLL +A M
Sbjct: 934 SSAELLCHITARLHTEFAMTDLGDLHFFLGISVRRTPDD-LFLSQRQYAVDLLQRAGMSE 992
Query: 526 INPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPL 585
+P+ TP+ + KLSA+ GA P YRS+ AL Y+T+TRPEIA+ + +VC+FMH P
Sbjct: 993 YHPTATPVDARAKLSASEGAPVADPTEYRSLADALQYLTLTRPEIAYAVQQVCRFMHDPH 1052
Query: 586 EPHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGN 645
EPH VKRI+RY+ G+ + GL++ S+TA++D+DW PD RRSTSG CI++G N
Sbjct: 1053 EPHLALVKRIMRYIKGSLSAGLHIGTGPVGSLTAYSDADWAGCPDSRRSTSGYCIFLGDN 1112
Query: 646 LVSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVH 705
LVSW++K+QT V+RSS EAEYR++A I E LR LL EL P S +++++CDN+S +
Sbjct: 1113 LVSWSSKRQTTVSRSSAEAEYRAVAHAIAECCCLRQLLQELHAPISTTTVVFCDNVSAAY 1172
Query: 706 LVANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQ 745
+ ANP+ H TKH +D+HFVRE+VA QV V+++P+ Q
Sbjct: 1173 MTANPVHHRCTKHIEIDIHFVREKVALGQVRVLHVPSTHQ 1212
>gb|AAW56918.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1018
Score = 570 bits (1468), Expect = e-161
Identities = 317/705 (44%), Positives = 432/705 (60%), Gaps = 42/705 (5%)
Query: 40 AHAHMPLTFWGDAFETAAYLINQLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTR 99
AHA MPL FW +AF TA+YLIN+ P ++ TPL++L+ + P+YSFL+ FGC C+P+ R
Sbjct: 338 AHASMPLKFWDEAFYTASYLINRTPSKVINFETPLERLFHQPPNYSFLRVFGCACWPNLR 397
Query: 100 PYNTHKLAFRSAPCTFLGYSPMHKGYNCLTTE-GKVIITRNVLFDETVFPFQFQTSKSSS 158
PYN HKL FR +HKG+ CL G+V I+R+V FDE VFPF S + +
Sbjct: 398 PYNKHKLQFRH----------LHKGFKCLDVPTGRVYISRDVTFDENVFPFAQLHSNAGA 447
Query: 159 DILLDSSLPTTSVIPLLPISNASPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DS 218
+ + SL +S++P+ S ++ NSV ATN S D D
Sbjct: 448 RLRNEISLLPSSLLPIT-YSGGEQSITSMFNSVPN--ATNLSDAGSAENGGDL-----DV 499
Query: 219 GVVHASAQPTEPPVPPSSNVHSMTTRAKTGIHRPKAYAASTALSSVPTSAK*ALS----- 273
V H T S ++ SM + + + + S + ++PT A A S
Sbjct: 500 SVTHDGVHTTHGDAL-SEDLGSMPQESLGSVSQAPLHHVSGSPPTMPTPAPSAPSPRQSP 558
Query: 274 --------IPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKH 325
HW +AM EYQALM N TW LVP GR+ I CKWV++ K +DGS++++
Sbjct: 559 NQPDEAFATNHWREAMDAEYQALMKNQTWHLVPPQQGRNIIDCKWVYKVKRKADGSLDRY 618
Query: 326 KPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVE 385
K RLVAKGF Q G DY +TFSPVVK TTIRT+LS+ + W +RQ D NAFL+G L E
Sbjct: 619 KARLVAKGFKQRYGIDYEDTFSPVVKATTIRTILSITVSRGWSLRQLDVQNAFLHGILEE 678
Query: 386 EVFMQQPPGF--SKGSSHLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLF 443
EV+M+Q PG+ SK +H +CKL KA+YGLKQAPRAW+ +LS+ L GF+ +K DTSLF
Sbjct: 679 EVYMKQTPGYENSKFPNH-ICKLDKAIYGLKQAPRAWYSRLSTKLQALGFKPSKVDTSLF 737
Query: 444 CKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKD 503
T+++L+YVDDII+ S+ +A +L+ DL F LKDL LHYFLGIE T T+D
Sbjct: 738 FYSKGDVTVFVLIYVDDIIVASSTPSATSALLRDLTKEFALKDLRELHYFLGIEVTRTED 797
Query: 504 GGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALF-PKPLL-YRSVVGALH 561
G++L+Q KY D+L + M P TP+ + KLS N G L PK YRS+VGAL
Sbjct: 798 -GILLTQAKYAYDVLRRVGMQDCKPVNTPLLTSEKLSVNEGDLLGPKDATEYRSIVGALQ 856
Query: 562 YVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFA 621
Y+T+TR +I+F +NKVCQ P HW AVKRILRYL T + GL + + ++ +++F+
Sbjct: 857 YLTLTRLDISFSVNKVCQ---CPTTVHWAAVKRILRYLKHTVSCGLKINKSPTLLVSSFS 913
Query: 622 DSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRS 681
D+DW S DRRST G +++G NLVSW+A+KQ V+RSSTEAE ++LA TEIMWL++
Sbjct: 914 DADWASCLADRRSTGGFTVFLGSNLVSWSARKQATVSRSSTEAECKALADATTEIMWLQT 973
Query: 682 LLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFV 726
LL E+ + ++ DN+ +L ANP+ H +TKH +D HFV
Sbjct: 974 LLCEIGINAPKVVKMWFDNIGAKYLSANPVFHARTKHIEVDYHFV 1018
>gb|AAT40550.1| putative receptor kinase [Solanum demissum]
Length = 1358
Score = 537 bits (1383), Expect = e-151
Identities = 297/751 (39%), Positives = 426/751 (56%), Gaps = 47/751 (6%)
Query: 2 FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
F+T GI H+ TCPY QQNG E K+RH+ ET LL +++PL FWGDA T+ YLIN
Sbjct: 622 FMTHQGIIHQTTCPYTPQQNGVAERKNRHLIETARTLLLESNVPLRFWGDAVLTSCYLIN 681
Query: 62 QLPIPTLDLSTPLQKLYDKKPDYSFL-KCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSP 120
++P ++ P L+ + Y + FG C+ H KLA R+ C FLGYS
Sbjct: 682 RMPSSSIQNQVPHSILFPQSHLYPIPPRVFGSTCFVHNLAPGKDKLAPRALKCVFLGYSR 741
Query: 121 MHKGYNCLTTE-GKVIITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISN 179
+ KGY C + + + +++ +V F E+ + TS + D+ + +LPI
Sbjct: 742 VQKGYRCYSHDLHRYLMSADVTFFESQ---PYYTSSNHPDVSM-----------VLPI-- 785
Query: 180 ASPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTEPPVPPSSNVH 239
P V+P V TV + ++PVV + + H +PT + P + H
Sbjct: 786 --PQVLPVPTFVESTVTS--TSPVVVPPLLTY----------HRRPRPT---LVPDDSCH 828
Query: 240 SMTTRAKTGIHRPKAYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPL 299
+ + P A ALS W QAM +E AL + TWELV L
Sbjct: 829 APDPAPTADLPPPSQPLALQKGE--------ALSHSGWRQAMVDEMSALHKSGTWELVSL 880
Query: 300 LHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVL 359
G+ +GC+WV+ K DG +++ K RLVAKG+ Q G DY++TF+PV K ++R L
Sbjct: 881 PAGKSTVGCRWVYAVKIGPDGQVDRLKARLVAKGYTQIFGLDYSDTFAPVAKIASVRLFL 940
Query: 360 SLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGF-SKG-SSHLVCKLKKALYGLKQAP 417
S+A + WP+ Q D NAFL+G+L EEV+M+QPPGF ++G SS LVC+L+++LYGLKQ+P
Sbjct: 941 SMAAVRHWPLHQLDIKNAFLHGDLEEEVYMEQPPGFVAQGESSSLVCRLRRSLYGLKQSP 1000
Query: 418 RAWFDKLSSTLARFGFQAAKCDTSLFCKFTK-SETLYILVYVDDIIITGSSSTAIFSLIA 476
RAWF K S+ + FG + D S+F + + S +Y++VYVDDI+ITG+ I L
Sbjct: 1001 RAWFGKFSTVIQEFGMTRSGADHSVFYRHSAPSRCIYLVVYVDDIVITGNDQDGITDLKQ 1060
Query: 477 DLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSG 536
L F KDLG L YFLGIE ++ G +++SQ KY D+L + M P TPM
Sbjct: 1061 HLFKHFQTKDLGRLKYFLGIEVAQSRSG-IVISQRKYALDILEETGMMGCRPVDTPMDPN 1119
Query: 537 LKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRIL 596
+KL G P YR +VG L+Y+T+TRP+I+FP++ V QFM SP + HW+AV RIL
Sbjct: 1120 VKLLPGQGEPLSNPERYRRLVGKLNYLTVTRPDISFPVSVVSQFMTSPCDSHWEAVVRIL 1179
Query: 597 RYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTV 656
RY+ GL I + D+DW P DRRSTSG C+ +GGNLVSW +KKQ V
Sbjct: 1180 RYIKSAPGKGLLFEDQGHEHIIGYTDADWAGSPSDRRSTSGYCVLVGGNLVSWKSKKQNV 1239
Query: 657 VARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKT 716
VARSS E+EYR++A E++W++ LL EL+ + CDN + +H+ +NP+ H +T
Sbjct: 1240 VARSSAESEYRAMATATCELVWIKQLLGELKFGKVDKMELVCDNQAALHIASNPVFHERT 1299
Query: 717 KHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
KH +D HFVRE++ + + + Q++
Sbjct: 1300 KHIEIDCHFVREKILSGDIVTKFVKSNDQLA 1330
>emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]
Length = 1382
Score = 535 bits (1379), Expect = e-150
Identities = 307/753 (40%), Positives = 442/753 (57%), Gaps = 33/753 (4%)
Query: 7 GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
G H+ +C +QNG E KHRHI ET +LL A + FWG+A TA LIN +P
Sbjct: 624 GTIHQTSCTDTPEQNGVAERKHRHIVETARSLLLSAFVLSEFWGEAVLTAVSLINTIPSS 683
Query: 67 TLDLSTPLQKLYDKKPDYSFLKCFGC---LCYPHTRPYNTHKLAFRSAPCTFLGYSPMHK 123
+P +KLY PDYS + FGC + +PH +KL+ RSA C FLGY K
Sbjct: 684 HSSGLSPFEKLYGHVPDYSSFRVFGCTYFVLHPHVE---RNKLSSRSAICVFLGYGEGKK 740
Query: 124 GYNCLTT-EGKVIITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNASP 182
GY C K+ ++ +V+F E + F ++ S L + +I + P S S
Sbjct: 741 GYRCFDPITQKLYVSHHVVFLEHIPFFSIPSTTHS--------LTKSDLIHIDPFSEDSG 792
Query: 183 N-VMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTEPPVPPSSNVHSM 241
N P+ S+ T+ SA T +S + + S AS++ +PP S +
Sbjct: 793 NDTSPYVRSI----CTHNSAGT-GTLLSGTPEASFSSTAPQASSEIVDPPPRQSIRIRKS 847
Query: 242 TTRAKTGIHRPKAYAAS--TALSSV-----PTSAK*ALSIPHWHQAMQEEYQALMNNNTW 294
T K Y++S + L+ + P+S K A+ P QAM EE AL +TW
Sbjct: 848 T---KLPDFAYSCYSSSFTSFLAYIHCLFEPSSYKEAILDPLGQQAMDEELSALHKTDTW 904
Query: 295 ELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTT 354
+LVPL G+ +GC+WV++ K NSDGSI ++K RLVAKG+ Q G DY ETF+P+ K TT
Sbjct: 905 DLVPLPPGKSVVGCRWVYKIKTNSDGSIERYKARLVAKGYSQQYGMDYEETFAPIAKMTT 964
Query: 355 IRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFSKGSSHLVCKLKKALYGLK 414
IRT++++A + QW I Q D NAFLNG+L EEV+M PPG S S + VCKLKKALYGLK
Sbjct: 965 IRTLIAVASIRQWHISQLDVKNAFLNGDLQEEVYMAPPPGISHDSGY-VCKLKKALYGLK 1023
Query: 415 QAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSL 474
QAPRAWF+K S ++ GF ++ D++LF K T + + + +YVDD+IITG I L
Sbjct: 1024 QAPRAWFEKFSIVISSLGFVSSSHDSALFIKCTDAGRIILSLYVDDMIITGDDIDGISVL 1083
Query: 475 IADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMT 534
+L F +KDLG L YFLGIE ++ G L LSQ+KY+ ++L +A++ TP+
Sbjct: 1084 KTELARRFEMKDLGYLRYFLGIEVAYSPRGYL-LSQSKYVANILERARLTDNKTVDTPIE 1142
Query: 535 SGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKR 594
+ S++ G P LYR++VG+L Y+TIT P+IA+ ++ V QF+ SP HW AV R
Sbjct: 1143 VNARYSSSDGLPLIDPTLYRTIVGSLVYLTITHPDIAYAVHVVSQFVASPTTIHWAAVLR 1202
Query: 595 ILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQ 654
ILRYL GT L L SS+ + A++D+D GSDP DR+S +G CI++G +L+SW +KKQ
Sbjct: 1203 ILRYLRGTVFQSLLLSSTSSLELRAYSDADHGSDPTDRKSVTGFCIFLGDSLISWKSKKQ 1262
Query: 655 TVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHT 714
++V++SSTEAEY ++A EI+W R LL+++ + S + +YCDN S + + N + H
Sbjct: 1263 SIVSQSSTEAEYCAMASTTKEIVWSRWLLADMGISFSHLTPMYCDNQSSIQIAHNSVFHE 1322
Query: 715 KTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
+TKH +D H R + + + +P+ Q++
Sbjct: 1323 RTKHIEIDCHLTRHHLKHGTIALPFVPSSLQIA 1355
>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301701|pir||E84589 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1461
Score = 529 bits (1363), Expect = e-148
Identities = 295/752 (39%), Positives = 431/752 (57%), Gaps = 33/752 (4%)
Query: 7 GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
GI +CP +QN VE KH+HI AL+ ++M L +WGD TA +LIN+ P
Sbjct: 709 GIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSNMSLPYWGDCVLTAVFLINRTPSA 768
Query: 67 TLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGYN 126
L TP + L K PDYS LK FGCLCY T HK RS C FLGY KGY
Sbjct: 769 LLSNKTPFEVLTGKLPDYSQLKTFGCLCYSSTSSKQRHKFLPRSRACVFLGYPFGFKGYK 828
Query: 127 CLTTEGKVI-ITRNVLFDETVFPFQF--QTSKSSSDILLDSSLPTTSVIPLLPISNASPN 183
L E V+ I+RNV F E +FP Q++ ++SD+ P+ P+S+ +
Sbjct: 829 LLDLESNVVHISRNVEFHEELFPLASSQQSATTASDVFT----------PMDPLSSGNS- 877
Query: 184 VMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTEPPVPPSSNVHSMTT 243
S +P+ + S + + ++ F D + + P SS +S +
Sbjct: 878 ----ITSHLPSPQISPSTQISKRRITKFPAHLQDYHCYFVNKDDSHPI--SSSLSYSQIS 931
Query: 244 RAKTGIHRPKAYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGR 303
+ Y + + +P S A W A+ +E A+ +TWE+ L G+
Sbjct: 932 PSHM------LYINNISKIPIPQSYHEAKDSKEWCGAIDQEIGAMERTDTWEITSLPPGK 985
Query: 304 DAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLAL 363
A+GCKWVF K+++DGS+ + K R+VAKG+ Q EG DY ETFSPV K T++ +L ++
Sbjct: 986 KAVGCKWVFTVKFHADGSLERFKARIVAKGYTQKEGLDYTETFSPVAKMATVKLLLKVSA 1045
Query: 364 MHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFS--KGSS---HLVCKLKKALYGLKQAPR 418
+W + Q D +NAFLNG+L E ++M+ P G++ KG+S ++VC+LKK++YGLKQA R
Sbjct: 1046 SKKWYLNQLDISNAFLNGDLEETIYMKLPDGYADIKGTSLPPNVVCRLKKSIYGLKQASR 1105
Query: 419 AWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADL 478
WF K S++L GF+ D +LF + SE + +LVYVDDI+I ++ A SL L
Sbjct: 1106 QWFLKFSNSLLALGFEKQHGDHTLFVRCIGSEFIVLLVYVDDIVIASTTEQAAQSLTEAL 1165
Query: 479 NAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLK 538
A F L++LG L YFLG+E T +G + LSQ KY +LL A M PS PMT ++
Sbjct: 1166 KASFKLRELGPLKYFLGLEVARTSEG-ISLSQRKYALELLTSADMLDCKPSSIPMTPNIR 1224
Query: 539 LSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRY 598
LS N G L +YR +VG L Y+TITRP+I F +NK+CQF +P H AV ++L+Y
Sbjct: 1225 LSKNDGLLLEDKEMYRRLVGKLMYLTITRPDITFAVNKLCQFSSAPRTAHLAAVYKVLQY 1284
Query: 599 LHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVA 658
+ GT GL+ +++ + D+DWG+ PD RRST+G +++G +L+SW +KKQ V+
Sbjct: 1285 IKGTVGQGLFYSAEDDLTLKGYTDADWGTCPDSRRSTTGFTMFVGSSLISWRSKKQPTVS 1344
Query: 659 RSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKH 718
RSS EAEYR+LAL E+ WL +LL LRV S ++Y D+ + V++ NP+ H +TKH
Sbjct: 1345 RSSAEAEYRALALASCEMAWLSTLLLALRV-HSGVPILYSDSTAAVYIATNPVFHERTKH 1403
Query: 719 FALDLHFVRERVADKQVTVVNIPAPSQVSSTL 750
+D H VRE++ + Q+ ++++ QV+ L
Sbjct: 1404 IEIDCHTVREKLDNGQLKLLHVKTKDQVADIL 1435
>pir||E96608 probable retroelement polyprotein F25P12.89 [imported] - Arabidopsis
thaliana gi|9954746|gb|AAG09097.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1486
Score = 528 bits (1360), Expect = e-148
Identities = 305/826 (36%), Positives = 440/826 (52%), Gaps = 82/826 (9%)
Query: 2 FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
F Q GI H +C QQNGRVE KHRHI AL + +P+ FW TAAYLIN
Sbjct: 641 FFAQKGIIHETSCVGTPQQNGRVERKHRHILNVARALRFQSGLPIEFWSYCALTAAYLIN 700
Query: 62 QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
+ P P L TP + +Y++ P ++ FGC+CY H + K A RS FLGY
Sbjct: 701 RTPTPLLKGKTPFELIYNRPPPLQHIRIFGCICYVHNLKHGGDKFASRSNKSIFLGYPFA 760
Query: 122 HKGYNCLTTE-GKVIITRNVLFDETVFPFQFQTSKSSSD---ILLDSS------------ 165
KG+ E G V ++R+V+F ET F F SS +L+DSS
Sbjct: 761 KKGWRVYNIETGVVSVSRDVVFRETEFHFPISVMDSSPSLDPVLVDSSELEEISMTPPVT 820
Query: 166 -----------LPTTSVIPLLPISNASPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQP 214
P++ V P P+S +SP V P ++ V P +T SA + T+ D +
Sbjct: 821 PSSPATPSSPVTPSSPVTPSSPVSPSSP-VTP-SSPVTPVSSTTTSAAI--DTIEDITTD 876
Query: 215 A*DSGVV--------HASAQPTEPPVPPSSNVHSMTTRAK-------------------- 246
DS + S TE P SS VH + +
Sbjct: 877 LEDSTSMDFFPDDEDEFSPTATESPASSSSPVHPPAVQLELLGKGHRPKRPPVKLADYVT 936
Query: 247 TGIHRP----------------------KAYAASTALSSVPTSAK*ALSIPHWHQAMQEE 284
T +H+P +AY + + P + A+ HW A+ E
Sbjct: 937 TLLHQPFPSATPYPLDNYISSSRFSDNYQAYILAITSGNEPRNYNEAMLDDHWKGAVSHE 996
Query: 285 YQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNE 344
+L N TW + L G+ A+GCKWVFR KY SDG++ +HK RLV G +QTEG DY E
Sbjct: 997 IGSLENLGTWTVEDLPPGKKALGCKWVFRLKYKSDGTLERHKARLVVLGNNQTEGLDYTE 1056
Query: 345 TFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFSKGSSHLVC 404
TF+PV K T+R L + W + Q D +NAFL+G+L EEV+MQ PPGF G VC
Sbjct: 1057 TFAPVAKMVTVRAFLQQVVSLDWEVHQMDVHNAFLHGDLDEEVYMQFPPGFRTGDKTKVC 1116
Query: 405 KLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIIT 464
+L+K+LYGLKQAPR WF KL+S L +GF D SLF L++LVYVDD+IIT
Sbjct: 1117 RLRKSLYGLKQAPRCWFAKLTSALKNYGFIQDISDYSLFIFHKNGVRLHVLVYVDDLIIT 1176
Query: 465 GSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMD 524
G++ I L++ F +KDLG L YFLGIE + + G+ L Q KY D++ + +
Sbjct: 1177 GTTIAVITEFKHYLSSCFYMKDLGILRYFLGIEVARSPE-GIYLCQRKYALDIITETGLL 1235
Query: 525 SINPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSP 584
+ P+ P+ KL+ +G PL YR +VG + Y+ TRPE+++ ++ + QFMH+P
Sbjct: 1236 GVKPASFPLDQNHKLAFATGETIDDPLRYRRLVGRIIYLATTRPELSYVIHILSQFMHNP 1295
Query: 585 LEPHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGG 644
HW+A R++RYL + G+ LR + + ++A+ DSD+G+ P RS +G I +GG
Sbjct: 1296 KPAHWEAALRVVRYLKSSPGQGILLRANTPLVLSAWCDSDFGACPHSDRSLTGWFIQLGG 1355
Query: 645 NLVSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVV 704
+ +SW +KQ VV+RSS EAEYR++A ++EI+W+R LL L +P + + ++ D+LS +
Sbjct: 1356 SPLSWKTQKQNVVSRSSAEAEYRAMAETVSEIIWIRELLPALGIPCTAPTTLHSDSLSAI 1415
Query: 705 HLVANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVSSTL 750
L ANP+ H +TKH D+HF+R+ + + + ++ SQ++ L
Sbjct: 1416 SLAANPVYHARTKHVRRDVHFIRDELVNGTIATKHVSTTSQLADIL 1461
>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37534632|ref|NP_921618.1| putative pol polyprotein
[Oryza sativa (japonica cultivar-group)]
Length = 1688
Score = 526 bits (1355), Expect = e-147
Identities = 303/779 (38%), Positives = 427/779 (53%), Gaps = 35/779 (4%)
Query: 2 FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
FL G +L+CP H QNG E KHRHI ET LL + +P FW +A TA YLIN
Sbjct: 463 FLVSQGTLPQLSCPGAHAQNGVAERKHRHIIETARTLLIASFVPAHFWAEAISTAVYLIN 522
Query: 62 QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
P +L +P + L+ P Y L+ FGC CY P KL +S C FLGYS
Sbjct: 523 MQPSSSLQGRSPGEVLFGSPPRYDHLRVFGCTCYVLLAPRERTKLTAQSVECVFLGYSLE 582
Query: 122 HKGYNCLTTEGKVI-ITRNVLFDETVFPFQFQTSKSSSD-----------ILLDSSLPTT 169
HKGY C + I I+R+V FDE F T++ SS I SLP++
Sbjct: 583 HKGYRCYDPSARRIRISRDVTFDENKPFFYSSTNQPSSPENSISFLYLPPIPSPESLPSS 642
Query: 170 SVIPL---LPISNASPNVMP------HANSVMPT---VATNGSAPVVQTTVSDFSQPA*D 217
+ P +P S SP +P + V P + + S P V +T++ + P
Sbjct: 643 PITPSPSPIPPSVPSPTYVPPPPPSPSPSPVSPPPSHIPASSSPPHVPSTITLDTFPFHY 702
Query: 218 SG--VVHASAQPTEPP-------VPPSSNVHSMTTRAKTGIHRPKAYAASTALSSVPTSA 268
S + +QP++P V SS RA+ + P + P++
Sbjct: 703 SRRPKIPNESQPSQPTLEDPTCSVDDSSPAPRYNLRARDALRAPNRDDFVVGVVFEPSTY 762
Query: 269 K*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPR 328
+ A+ +PHW AM EE AL NTW++VPL I CKWV++ K SDG + ++K R
Sbjct: 763 QEAIVLPHWKLAMSEELAALERTNTWDVVPLPSHAVPITCKWVYKVKTKSDGQVERYKAR 822
Query: 329 LVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVF 388
LVA+GF Q G DY+ETF+PV TT+RT++++A W I Q D NAFL+G+L EEV+
Sbjct: 823 LVARGFQQAHGRDYDETFAPVAHMTTVRTLIAVAATRSWTISQMDVKNAFLHGDLHEEVY 882
Query: 389 MQQPPGFSKGSSHLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTK 448
M PPG H V +L++ALYGLKQAPRAWF + SS + GF + D +LF +
Sbjct: 883 MHPPPGVEAPPGH-VFRLRRALYGLKQAPRAWFARFSSVVLAAGFSPSDHDPALFIHTSS 941
Query: 449 SETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLIL 508
+L+YVDD++ITG I + L+ F + DLG L YFLGIE T T DG L
Sbjct: 942 RGRTLLLLYVDDMLITGDDLEYIAFVKGKLSEQFMMSDLGPLSYFLGIEVTSTVDG-YYL 1000
Query: 509 SQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRP 568
SQ +YI DLL ++ + + TPM ++L + G P YR +VG+L Y+T+TRP
Sbjct: 1001 SQHRYIEDLLAQSGLTDSRTTTTPMELHVRLRSTDGTPLDDPSRYRHLVGSLVYLTVTRP 1060
Query: 569 EIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSD 628
+IA+ ++ + QF+ +P+ H+ + R+LRYL GT T L+ S + + AF+DS W SD
Sbjct: 1061 DIAYAVHILSQFVSAPISVHYGHLLRVLRYLRGTTTQCLFYAASSPLQLRAFSDSTWASD 1120
Query: 629 PDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRV 688
P DRRS +G CI++G +L++W +KKQT V+RSSTEAE R+LA +EI+WLR LL++ V
Sbjct: 1121 PIDRRSVTGYCIFLGTSLLTWKSKKQTAVSRSSTEAELRALATTTSEIVWLRWLLADFGV 1180
Query: 689 PTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
+ + CDN + + +PI H TKH +D F R + + +P+ QV+
Sbjct: 1181 SCDVPTPLLCDNTGAIQIANDPIKHELTKHIGVDASFTRSHCQQSTIALHYVPSELQVA 1239
>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301694|pir||E84535 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1454
Score = 525 bits (1351), Expect = e-147
Identities = 294/774 (37%), Positives = 426/774 (54%), Gaps = 33/774 (4%)
Query: 2 FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
F + GI +CP +QN VE KH+HI AL+ + +PL+ WGD TA +LIN
Sbjct: 693 FYAEKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSQVPLSLWGDCVLTAVFLIN 752
Query: 62 QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
+ P L TP + L P Y L+ FGCLCY T P HK RS C FLGY
Sbjct: 753 RTPSQLLMNKTPYEILTGTAPVYEQLRTFGCLCYSSTSPKQRHKFQPRSRACLFLGYPSG 812
Query: 122 HKGYNCLTTEGK-VIITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNA 180
+KGY + E V I+RNV F E VFP SS L +P +S I + +
Sbjct: 813 YKGYKLMDLESNTVFISRNVQFHEEVFPLAKNPGSESSLKLFTPMVPVSSGI--ISDTTH 870
Query: 181 SPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTEPPVPPSSNVHS 240
SP+ +P S +P ++ ++D+ H + ++ P SS +
Sbjct: 871 SPSSLPSQISDLPPQISSQRVRKPPAHLNDY----------HCNTMQSDHKYPISSTISY 920
Query: 241 MTTRAKTGIHRPKAYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLL 300
Y + +PT+ A W +A+ E A+ NTWE+ L
Sbjct: 921 SKISPSH-----MCYINNITKIPIPTNYAEAQDTKEWCEAVDAEIGAMEKTNTWEITTLP 975
Query: 301 HGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLS 360
G+ A+GCKWVF K+ +DG++ ++K RLVAKG+ Q EG DY +TFSPV K TTI+ +L
Sbjct: 976 KGKKAVGCKWVFTLKFLADGNLERYKARLVAKGYTQKEGLDYTDTFSPVAKMTTIKLLLK 1035
Query: 361 LALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFSKGS-----SHLVCKLKKALYGLKQ 415
++ +W ++Q D +NAFLNGEL EE+FM+ P G+++ S++V +LK+++YGLKQ
Sbjct: 1036 VSASKKWFLKQLDVSNAFLNGELEEEIFMKIPEGYAERKGIVLPSNVVLRLKRSIYGLKQ 1095
Query: 416 APRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLI 475
A R WF K SS+L GF+ D +LF K E + +LVYVDDI+I +S A L
Sbjct: 1096 ASRQWFKKFSSSLLSLGFKKTHGDHTLFLKMYDGEFVIVLVYVDDIVIASTSEAAAAQLT 1155
Query: 476 ADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTS 535
+L+ F L+DLG L YFLG+E T G+ + Q KY +LL M + P PM
Sbjct: 1156 EELDQRFKLRDLGDLKYFLGLEVARTT-AGISICQRKYALELLQSTGMLACKPVSVPMIP 1214
Query: 536 GLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRI 595
LK+ + G L YR +VG L Y+TITRP+I F +NK+CQF +P H A R+
Sbjct: 1215 NLKMRKDDGDLIEDIEQYRRIVGKLMYLTITRPDITFAVNKLCQFSSAPRTTHLTAAYRV 1274
Query: 596 LRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQT 655
L+Y+ GT GL+ S +++ FADSDW S D RRST+ +++G +L+SW +KKQ
Sbjct: 1275 LQYIKGTVGQGLFYSASSDLTLKGFADSDWASCQDSRRSTTSFTMFVGDSLISWRSKKQH 1334
Query: 656 VVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTK 715
V+RSS EAEYR+LAL E++WL +LL L+ + P ++Y D+ + +++ NP+ H +
Sbjct: 1335 TVSRSSAEAEYRALALATCEMVWLFTLLVSLQA-SPPVPILYSDSTAAIYIATNPVFHER 1393
Query: 716 TKHFALDLHFVRERVADKQVTVVNIPAPSQVSSTL--------FHCFKTKLRVL 761
TKH LD H VRER+ + ++ ++++ QV+ L F K+K+ +L
Sbjct: 1394 TKHIKLDCHTVRERLDNGELKLLHVRTEDQVADILTKPLFPYQFEHLKSKMSIL 1447
>gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm, score:
11.19) [Arabidopsis thaliana] gi|7486705|pir||T01879
hypothetical protein F8M12.17 - Arabidopsis thaliana
Length = 1633
Score = 515 bits (1327), Expect = e-144
Identities = 298/797 (37%), Positives = 436/797 (54%), Gaps = 67/797 (8%)
Query: 2 FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
F+ + G+ H+ +C Y QQN VE KH+H+ +LL +++PL +W D TAAYLIN
Sbjct: 635 FVKEQGMIHQFSCAYTPQQNSVVERKHQHLLNIARSLLFQSNVPLQYWSDCVLTAAYLIN 694
Query: 62 QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
+LP P LD TP + L K PDY+ LK CLCY T ++ +K + R+ PC FLGY
Sbjct: 695 RLPSPLLDNKTPFELLLKKIPDYTLLK--SCLCYASTNVHDRNKFSPRARPCVFLGYPSG 752
Query: 122 HKGYNCLTTEGKVI-ITRNVLFDETVFPFQFQTS-KSSSDILLDSSLPTTSVIPLLPISN 179
+KGY L E I ITRNV+F ET FPF+ K S D+ +S LP + + +
Sbjct: 753 YKGYKVLDLESHSISITRNVVFHETKFPFKTSKFLKESVDMFPNSILPLPAPLHFVESMP 812
Query: 180 ASPNVMPHAN--SVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHAS-AQPTEPPVPPS- 235
++ N S + ++ S P + +TV+ + A D A+P P+
Sbjct: 813 LDDDLRADDNNASTSNSASSASSIPPLPSTVNTQNTDALDIDTNSVPIARPKRNAKAPAY 872
Query: 236 -SNVH--------SMTTRAKTGIHRPKA----------YAASTALS-------------- 262
S H S++ T I P + Y STA+S
Sbjct: 873 LSEYHCNSVPFLSSLSPTTSTSIETPSSSIPPKKITTPYPMSTAISYDKLTPLFHSYICA 932
Query: 263 ----SVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNS 318
+ P + A+ W +A EE AL N TW + L G++ +GCKWVF KYN
Sbjct: 933 YNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWIVESLTEGKNVVGCKWVFTIKYNP 992
Query: 319 DGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAF 378
DGSI ++K RLVA+GF Q EG DY ETFSPV K +++ +L LA W + Q D +NAF
Sbjct: 993 DGSIERYKARLVAQGFTQQEGIDYMETFSPVAKFGSVKLLLGLAAATGWSLTQMDVSNAF 1052
Query: 379 LNGELVEEVFMQQPPGFSKGS-----SHLVCKLKKALYGLKQAPRAWFDKLSSTLARFGF 433
L+GEL EE++M P G++ + S VC+L K+LYGLKQA R W+ +LSS F
Sbjct: 1053 LHGELDEEIYMSLPQGYTPPTGISLPSKPVCRLLKSLYGLKQASRQWYKRLSSVFLGANF 1112
Query: 434 QAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYF 493
+ D ++F K + + + +LVYVDD++I + S+A+ +L L + F +KDLG +F
Sbjct: 1113 IQSPADNTMFVKVSCTSIIVVLVYVDDLMIASNDSSAVENLKELLRSEFKIKDLGPARFF 1172
Query: 494 LGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALFPKPLLY 553
LG+E + +G + + Q KY +LL + PS PM L L+ G L P Y
Sbjct: 1173 LGLEIARSSEG-ISVCQRKYAQNLLEDVGLSGCKPSSIPMDPNLHLTKEMGTLLPNATSY 1231
Query: 554 RSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPS 613
R +VG L Y+ ITRP+I F ++ + QF+ +P + H QA ++LRYL G
Sbjct: 1232 RELVGRLLYLCITRPDITFAVHTLSQFLSAPTDIHMQAAHKVLRYLKGNPGQ-------- 1283
Query: 614 SMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVI 673
D+DWG+ D RRS +G CIY+G +L++W +KKQ+VV+RSSTE+EYRSLA
Sbjct: 1284 --------DADWGTCKDSRRSVTGFCIYLGTSLITWKSKKQSVVSRSSTESEYRSLAQAT 1335
Query: 674 TEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADK 733
EI+WL+ LL +L V + + ++CDN S +HL NP+ H +TKH +D H VR+++
Sbjct: 1336 CEIIWLQQLLKDLHVTMTCPAKLFCDNKSALHLATNPVFHERTKHIEIDCHTVRDQIKAG 1395
Query: 734 QVTVVNIPAPSQVSSTL 750
++ +++P +Q++ L
Sbjct: 1396 KLKTLHVPTGNQLADIL 1412
>pir||G86301 probable retroelement polyprotein [imported] - Arabidopsis thaliana
gi|9989054|gb|AAG10817.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1413
Score = 513 bits (1322), Expect = e-144
Identities = 284/755 (37%), Positives = 419/755 (54%), Gaps = 41/755 (5%)
Query: 7 GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
GI +CP +QN VE KH+HI ALL + +PL++WGD TA ++IN+ P P
Sbjct: 674 GIVAYHSCPETPEQNSVVERKHQHILNVARALLFQSQIPLSYWGDCILTAVFIINRTPSP 733
Query: 67 TLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGYN 126
+ T + L K PDY+ LK FGCLCY T P HK R+ C FLGY +KGY
Sbjct: 734 VISNKTLFEMLTKKVPDYTHLKSFGCLCYASTSPKQRHKFEDRARTCAFLGYPSGYKGYK 793
Query: 127 CLTTEGKVI-ITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNASPNVM 185
L E I I+RNV+F E +FPF+ + +++ + P + +
Sbjct: 794 LLDLESHTIFISRNVVFYEDLFPFKTKPAENEESSVF---------FPHIYVDRNDS--- 841
Query: 186 PHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTEPPVPPS--------SN 237
H + +P T+ S + S S+P H ++ + P S S+
Sbjct: 842 -HPSQPLPVQETSASNVPAEKQNSRVSRPPAYLKDYHCNSVTSSTDHPISEVLSYSSLSD 900
Query: 238 VHSMTTRAKTGIHRPKAYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELV 297
+ + A I P YA A I W AM E AL +N TW +
Sbjct: 901 PYMIFINAVNKIPEPHTYAQ-------------ARQIKEWCDAMGMEITALEDNGTWVVC 947
Query: 298 PLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRT 357
L G+ A+GCKWV++ K N+DGS+ ++K RLVAKG+ QTEG DY +TFSPV K TT++
Sbjct: 948 SLPVGKKAVGCKWVYKIKLNADGSLERYKARLVAKGYTQTEGLDYVDTFSPVAKLTTVKL 1007
Query: 358 VLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFS--KGSS---HLVCKLKKALYG 412
++++A W + Q D +NAFLNG L EE++M PPG+S +G S + VC+LKK+LYG
Sbjct: 1008 LIAVAAAKGWSLSQLDISNAFLNGSLDEEIYMTLPPGYSPRQGDSFPPNAVCRLKKSLYG 1067
Query: 413 LKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIF 472
LKQA R W+ K S +L GF + D +LF + +K+ + +LVYVDDIII S
Sbjct: 1068 LKQASRQWYLKFSESLKALGFTQSSGDHTLFTRKSKNSYMAVLVYVDDIIIASSCDRETE 1127
Query: 473 SLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTP 532
L L L+DLG+L YFLG+E DG + + Q KY +LL + + S P
Sbjct: 1128 LLRDALQRSSKLRDLGTLRYFLGLEIARNTDG-ISICQRKYTLELLAETGLLGCKSSSVP 1186
Query: 533 MTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAV 592
M KLS G L YR +VG L Y+T TRP+I + ++++CQF +P PH +AV
Sbjct: 1187 MEPNQKLSQEDGELIDDAEHYRKLVGKLMYLTFTRPDITYAVHRLCQFTSAPRVPHLKAV 1246
Query: 593 KRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAK 652
+I+ YL GT GL+ + ++ FADSD+ S D R+ T+G C+++G +LV+W +K
Sbjct: 1247 YKIIYYLKGTVGQGLFYSANVDLKLSGFADSDFSSCSDSRKLTTGYCMFLGTSLVAWKSK 1306
Query: 653 KQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPIL 712
KQ V++ SS EAEY+++++ + E+MWLR LL +L + S +S++YCDN + +H+ NP+
Sbjct: 1307 KQEVISMSSAEAEYKAMSMAVREMMWLRFLLEDLWIDVSEASVLYCDNTAAIHIANNPVF 1366
Query: 713 HTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
H +TKH D H +RE++ + +++ +Q++
Sbjct: 1367 HERTKHIERDYHHIREKIILGLIRTLHVRTENQLA 1401
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.322 0.135 0.416
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,317,730,480
Number of Sequences: 2540612
Number of extensions: 55900775
Number of successful extensions: 138493
Number of sequences better than 10.0: 1769
Number of HSP's better than 10.0 without gapping: 1605
Number of HSP's successfully gapped in prelim test: 170
Number of HSP's that attempted gapping in prelim test: 131928
Number of HSP's gapped (non-prelim): 3051
length of query: 768
length of database: 863,360,394
effective HSP length: 136
effective length of query: 632
effective length of database: 517,837,162
effective search space: 327273086384
effective search space used: 327273086384
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 79 (35.0 bits)
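Note on the statistics above: the footer values can be cross-checked against each other using the standard Karlin-Altschul relations. The sketch below is an illustration only, not BLAST's own code. It assumes the usual formulas: effective lengths are obtained by subtracting the effective HSP length, bit score = (lambda * raw - ln K) / ln 2, and E = search space * 2^(-bits). All function and constant names are hypothetical. The inputs are copied from this report, and because Lambda and K are printed rounded, the results only approximate the reported figures.

```python
import math

# Values copied from the report footer above.
QUERY_LEN = 768
DB_LETTERS = 863_360_394
DB_SEQS = 2_540_612
EFF_HSP_LEN = 136
LAMBDA_UNGAPPED, K_UNGAPPED = 0.322, 0.135   # first "Lambda K H" block
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410      # "Gapped" block


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # Each database sequence loses hsp_len letters to edge effects.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw - math.log(k)) / math.log(2)


def drop_off_bits(raw, lam):
    """Convert an X drop-off value to bits (no K term is involved)."""
    return lam * raw / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
)
print("effective query length:", eff_query)
print("effective database length:", eff_db)
print("effective search space:", space)

# Hit CAE05956.3 is reported as "Score = 578 bits (1490), Expect = e-163".
bits = bit_score(1490, LAMBDA_GAPPED, K_GAPPED)
print("approximate bits for raw score 1490:", round(bits, 1))
print("approximate E-value:", expect(bits, space))

# Thresholds: X1 and S1 use the ungapped parameters, the rest the gapped ones.
print("X1:", round(drop_off_bits(16, LAMBDA_UNGAPPED), 1))
print("X2:", round(drop_off_bits(38, LAMBDA_GAPPED), 1))
print("X3:", round(drop_off_bits(64, LAMBDA_GAPPED), 1))
print("S1:", round(bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
print("S2:", round(bit_score(79, LAMBDA_GAPPED, K_GAPPED), 1))
```

The first three printed values reproduce the "effective length of query", "effective length of database" and "effective search space" lines exactly, since they are integer arithmetic. The bit scores and the E-value depend on the rounded Lambda and K, so they should be read as agreeing with the report to within about one bit.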