Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0033b.8
         (1310 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop...   712  0.0
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi...   700  0.0
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf...   699  0.0
emb|CAB10526.1| retrotransposon like protein [Arabidopsis thalia...   689  0.0
emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis...   682  0.0
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi...   681  0.0
pir||G86301 probable retroelement polyprotein [imported] - Arabi...   681  0.0
gb|AAU89728.1| putative retroelement pol polyprotein-like [Solan...   635  e-180
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi...   627  e-178
gb|AAG50751.1| polyprotein, putative [Arabidopsis thaliana] gi|2...   616  e-174
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...   606  e-171
emb|CAB81200.1| putative retrotransposon polyprotein [Arabidopsi...   600  e-169
emb|CAD41085.2| OSJNBb0011N17.2 [Oryza sativa (japonica cultivar...   582  e-164
gb|AAT40550.1| putative receptor kinase [Solanum demissum]            582  e-164
pir||F86470 probable retroelement polyprotein [imported] - Arabi...   557  e-156
gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (ja...   536  e-150
emb|CAB79271.1| putative protein [Arabidopsis thaliana] gi|30212...   535  e-150
ref|NP_194047.2| protein kinase family protein [Arabidopsis thal...   535  e-150
dbj|BAB11447.1| polyprotein-like [Arabidopsis thaliana]               527  e-147
gb|AAC98469.1| putative retroelement pol polyprotein [Arabidopsi...   524  e-147

>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
            (gb|U12626). [Arabidopsis thaliana]
            gi|25301690|pir||G96722 hypothetical protein F20P5.25
            [imported] - Arabidopsis thaliana
          Length = 1315

 Score =  712 bits (1838), Expect = 0.0
 Identities = 363/772 (47%), Positives = 508/772 (65%), Gaps = 28/772 (3%)

Query: 548  HHLC*N-PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLDN 606
            +H C   PQQN++VERKHQH+++VAR+L FQ+H+P+++W   +  AVYLINRLP P+L++
Sbjct: 551  YHSCPETPQQNSVVERKHQHILNVARSLFFQSHIPISYWGDCILTAVYLINRLPAPILED 610

Query: 607  KCPFQILYNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYDL 666
            KCPF++L  TVP   ++K+FG LC+ ST    R KF PRA  C F+G+    KG+   DL
Sbjct: 611  KCPFEVLTKTVPTYDHIKVFGCLCYASTSPKDRHKFSPRAKACAFIGYPSGFKGYKLLDL 670

Query: 667  KSRVISISRNVTFHEHNPFKPLDSDLTPTQA--FP--TTLPPI---FDDDIPVPATVQAQ 719
            ++  I +SR+V FHE   F  L SDL+  +   FP     PP+     D +    +  + 
Sbjct: 671  ETHSIIVSRHVVFHEEL-FPFLGSDLSQEEQNFFPDLNPTPPMQRQSSDHVNPSDSSSSV 729

Query: 720  E--PPVNNQNQVIAP--RTSQRIRKPPSYLQDYHCTLLSSDSVPITSSSTGINYPLSKVL 775
            E  P  N  N V  P  +TS R  K P+YLQDY+C  + S S P         + + K L
Sbjct: 730  EILPSANPTNNVPEPSVQTSHRKAKKPAYLQDYYCHSVVS-STP---------HEIRKFL 779

Query: 776  SYDHLNPKYQSFVMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDS 835
            SYD +N  Y +F+  +  + EP+ ++EA K + WR AM  E + LE   TW +   P D 
Sbjct: 780  SYDRINDPYLTFLACLDKTKEPSNYTEAEKLQVWRDAMGAEFDFLEGTHTWEVCSLPADK 839

Query: 836  KPIGCKWVYKVKYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCA 895
            + IGC+W++K+KY  DGS+ERYKARLV +GYTQ EG+D+ +TFSPVAK+ +++++L + A
Sbjct: 840  RCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNETFSPVAKLNSVKLLLGVAA 899

Query: 896  SQ*WHLHQLDVDNAFLHASLDEQIYMTIPQGLVCNKA-----NQVCLLQKSLYGLKQASR 950
                 L QLD+ NAFL+  LDE+IYM +PQG    +      N VC L+KSLYGLKQASR
Sbjct: 900  RFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYASRQGDSLPPNAVCRLKKSLYGLKQASR 959

Query: 951  QWFNTLSASLKKLGYKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSL 1010
            QW+   S++L  LG+ QS  DHT ++K S G F  +L+Y DD+++A N+   + ++KS +
Sbjct: 960  QWYLKFSSTLLGLGFIQSYCDHTCFLKISDGIFLCVLVYIDDIIIASNNDAAVDILKSQM 1019

Query: 1011 HDQFRIKDLGEAKYFLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKL 1070
               F+++DLGE KYFLGLEI RS  GI ++QRKYAL L+ ++G L  KP S PMD S   
Sbjct: 1020 KSFFKLRDLGELKYFLGLEIVRSDKGIHISQRKYALDLLDETGQLGCKPSSIPMDPSMVF 1079

Query: 1071 GTNIGTPLTDIGSYRRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYL 1130
              + G    ++G YRRL+GRL+YLN TRPDITFAVN+L+QF  AP   H QA +++L+Y+
Sbjct: 1080 AHDSGGDFVEVGPYRRLIGRLMYLNITRPDITFAVNKLAQFSMAPRKAHLQAVYKILQYI 1139

Query: 1131 KGSPGSGLFYPASSSTTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASR 1190
            KG+ G GLFY A+S   L  ++++D+  C D+R+S +GYC+FLGDSLI W+S+KQ   S+
Sbjct: 1140 KGTIGQGLFYSATSELQLKVYANADYNSCRDSRRSTSGYCMFLGDSLICWKSRKQDVVSK 1199

Query: 1191 SSCESEYRAMATTVCEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHI 1250
            SS E+EYR+++    E+ WL   L++L  P   PT +F DN++A+HIA+N  +HERTKHI
Sbjct: 1200 SSAEAEYRSLSVATDELVWLTNFLKELQVPLSKPTLLFCDNEAAIHIANNHVFHERTKHI 1259

Query: 1251 ELDCHIVREKLQQGLVHLLPISTTLQTADVFTKSLTPAPFKTCISKLGMKDI 1302
            E DCH VRE+L +GL  L  I+T LQ AD FTK L P+ F   ISK+G+ +I
Sbjct: 1260 ESDCHSVRERLLKGLFELYHINTELQIADPFTKPLYPSHFHRLISKMGLLNI 1311


>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301694|pir||E84535 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1454

 Score =  700 bits (1806), Expect = 0.0
 Identities = 363/781 (46%), Positives = 502/781 (63%), Gaps = 26/781 (3%)

Query: 531  PGISHASFLFF*GHHSSHHLC*NPQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVA 590
            P +   SF    G  S H     P+QN++VERKHQH+++VARAL+FQ+ +P++ W   V 
Sbjct: 686  PELKFTSFYAEKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSQVPLSLWGDCVL 745

Query: 591  HAVYLINRLPTPVLDNKCPFQILYNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCI 650
             AV+LINR P+ +L NK P++IL  T P    L+ FG LC+ ST    R KF PR+  C+
Sbjct: 746  TAVFLINRTPSQLLMNKTPYEILTGTAPVYEQLRTFGCLCYSSTSPKQRHKFQPRSRACL 805

Query: 651  FLGFKPDTKGFVTYDLKSRVISISRNVTFHEHN-PFKPLDSDLTPTQAFPTTLPP---IF 706
            FLG+    KG+   DL+S  + ISRNV FHE   P        +  + F   +P    I 
Sbjct: 806  FLGYPSGYKGYKLMDLESNTVFISRNVQFHEEVFPLAKNPGSESSLKLFTPMVPVSSGII 865

Query: 707  DDDIPVPATVQAQEPPVNNQNQVIAPRTSQRIRKPPSYLQDYHCTLLSSDSVPITSSSTG 766
             D    P+++ +Q   +  Q       +SQR+RKPP++L DYHC  + SD          
Sbjct: 866  SDTTHSPSSLPSQISDLPPQI------SSQRVRKPPAHLNDYHCNTMQSDH--------- 910

Query: 767  INYPLSKVLSYDHLNPKYQSFVMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTW 826
              YP+S  +SY  ++P +  ++ NI+    PT ++EA   + W +A+D EI A+E+  TW
Sbjct: 911  -KYPISSTISYSKISPSHMCYINNITKIPIPTNYAEAQDTKEWCEAVDAEIGAMEKTNTW 969

Query: 827  ILVDKPPDSKPIGCKWVYKVKYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTT 886
             +   P   K +GCKWV+ +K+  DG++ERYKARLV KGYTQ EG+D+ DTFSPVAKMTT
Sbjct: 970  EITTLPKGKKAVGCKWVFTLKFLADGNLERYKARLVAKGYTQKEGLDYTDTFSPVAKMTT 1029

Query: 887  LRVILALCASQ*WHLHQLDVDNAFLHASLDEQIYMTIPQGLVCNK-----ANQVCLLQKS 941
            ++++L + AS+ W L QLDV NAFL+  L+E+I+M IP+G    K     +N V  L++S
Sbjct: 1030 IKLLLKVSASKKWFLKQLDVSNAFLNGELEEEIFMKIPEGYAERKGIVLPSNVVLRLKRS 1089

Query: 942  LYGLKQASRQWFNTLSASLKKLGYKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMH 1001
            +YGLKQASRQWF   S+SL  LG+K+++ DHTL++K   G F  +L+Y DD+++A     
Sbjct: 1090 IYGLKQASRQWFKKFSSSLLSLGFKKTHGDHTLFLKMYDGEFVIVLVYVDDIVIASTSEA 1149

Query: 1002 EIQLVKSSLHDQFRIKDLGEAKYFLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVS 1061
                +   L  +F+++DLG+ KYFLGLE+AR+T+GI + QRKYAL+L+  +G LA KPVS
Sbjct: 1150 AAAQLTEELDQRFKLRDLGDLKYFLGLEVARTTAGISICQRKYALELLQSTGMLACKPVS 1209

Query: 1062 TPMDNSQKLGTNIGTPLTDIGSYRRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQ 1121
             PM  + K+  + G  + DI  YRR+VG+L+YL  TRPDITFAVN+L QF SAP   H  
Sbjct: 1210 VPMIPNLKMRKDDGDLIEDIEQYRRIVGKLMYLTITRPDITFAVNKLCQFSSAPRTTHLT 1269

Query: 1122 AAHRVLKYLKGSPGSGLFYPASSSTTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWR 1181
            AA+RVL+Y+KG+ G GLFY ASS  TL  F+DSDWA C D+R+S T + +F+GDSLISWR
Sbjct: 1270 AAYRVLQYIKGTVGQGLFYSASSDLTLKGFADSDWASCQDSRRSTTSFTMFVGDSLISWR 1329

Query: 1182 SKKQSTASRSSCESEYRAMATTVCEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNP 1241
            SKKQ T SRSS E+EYRA+A   CE+ WL  LL  L      P  ++ D+ +A++IA NP
Sbjct: 1330 SKKQHTVSRSSAEAEYRALALATCEMVWLFTLLVSLQASPPVPI-LYSDSTAAIYIATNP 1388

Query: 1242 SYHERTKHIELDCHIVREKLQQGLVHLLPISTTLQTADVFTKSLTPAPFKTCISKLGMKD 1301
             +HERTKHI+LDCH VRE+L  G + LL + T  Q AD+ TK L P  F+   SK+ + +
Sbjct: 1389 VFHERTKHIKLDCHTVRERLDNGELKLLHVRTEDQVADILTKPLFPYQFEHLKSKMSILN 1448

Query: 1302 I 1302
            I
Sbjct: 1449 I 1449


>gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm, score:
            11.19) [Arabidopsis thaliana] gi|7486705|pir||T01879
            hypothetical protein F8M12.17 - Arabidopsis thaliana
          Length = 1633

 Score =  699 bits (1804), Expect = 0.0
 Identities = 370/801 (46%), Positives = 500/801 (62%), Gaps = 80/801 (9%)

Query: 554  PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLDNKCPFQIL 613
            PQQN++VERKHQH++++AR+LLFQ+++P+ +WS  V  A YLINRLP+P+LDNK PF++L
Sbjct: 651  PQQNSVVERKHQHLLNIARSLLFQSNVPLQYWSDCVLTAAYLINRLPSPLLDNKTPFELL 710

Query: 614  YNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYDLKSRVISI 673
               +PD T LK    LC+ ST    R KF PRA  C+FLG+    KG+   DL+S  ISI
Sbjct: 711  LKKIPDYTLLK--SCLCYASTNVHDRNKFSPRARPCVFLGYPSGYKGYKVLDLESHSISI 768

Query: 674  SRNVTFHEHN-PFKPLDSDLTPTQAFPTTLPPI-----------FDDDI----------- 710
            +RNV FHE   PFK           FP ++ P+            DDD+           
Sbjct: 769  TRNVVFHETKFPFKTSKFLKESVDMFPNSILPLPAPLHFVESMPLDDDLRADDNNASTSN 828

Query: 711  ---------PVPATVQAQEPPVNNQNQVIAP-RTSQRIRKPPSYLQDYHCTLLSSDSVPI 760
                     P+P+TV  Q     + +    P    +R  K P+YL +YHC     +SVP 
Sbjct: 829  SASSASSIPPLPSTVNTQNTDALDIDTNSVPIARPKRNAKAPAYLSEYHC-----NSVPF 883

Query: 761  TSS-----STGIN--------------YPLSKVLSYDHLNPKYQSFVMNISSSLEPTRFS 801
             SS     ST I               YP+S  +SYD L P + S++   +   EP  F+
Sbjct: 884  LSSLSPTTSTSIETPSSSIPPKKITTPYPMSTAISYDKLTPLFHSYICAYNVETEPKAFT 943

Query: 802  EAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPIGCKWVYKVKYKQDGSIERYKARL 861
            +A+K E W +A ++E+ ALE+N+TWI+         +GCKWV+ +KY  DGSIERYKARL
Sbjct: 944  QAMKSEKWTRAANEELHALEQNKTWIVESLTEGKNVVGCKWVFTIKYNPDGSIERYKARL 1003

Query: 862  VVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ*WHLHQLDVDNAFLHASLDEQIYM 921
            V +G+TQ EG+D+ +TFSPVAK  +++++L L A+  W L Q+DV NAFLH  LDE+IYM
Sbjct: 1004 VAQGFTQQEGIDYMETFSPVAKFGSVKLLLGLAAATGWSLTQMDVSNAFLHGELDEEIYM 1063

Query: 922  TIPQGL-----VCNKANQVCLLQKSLYGLKQASRQWFNTLSASLKKLGYKQSNADHTLYI 976
            ++PQG      +   +  VC L KSLYGLKQASRQW+  LS+      + QS AD+T+++
Sbjct: 1064 SLPQGYTPPTGISLPSKPVCRLLKSLYGLKQASRQWYKRLSSVFLGANFIQSPADNTMFV 1123

Query: 977  KASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKYFLGLEIARSTSG 1036
            K S  S   +L+Y DD+++A ND   ++ +K  L  +F+IKDLG A++FLGLEIARS+ G
Sbjct: 1124 KVSCTSIIVVLVYVDDLMIASNDSSAVENLKELLRSEFKIKDLGPARFFLGLEIARSSEG 1183

Query: 1037 IVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIGTPLTDIGSYRRLVGRLLYLNT 1096
            I + QRKYA  L+ D G    KP S PMD +  L   +GT L +  SYR LVGRLLYL  
Sbjct: 1184 ISVCQRKYAQNLLEDVGLSGCKPSSIPMDPNLHLTKEMGTLLPNATSYRELVGRLLYLCI 1243

Query: 1097 TRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPGSGLFYPASSSTTLTAFSDSDW 1156
            TRPDITFAV+ LSQFLSAPTDIH QAAH+VL+YLKG+PG                 D+DW
Sbjct: 1244 TRPDITFAVHTLSQFLSAPTDIHMQAAHKVLRYLKGNPG----------------QDADW 1287

Query: 1157 AGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCESEYRAMATTVCEIQWLHYLLQD 1216
              C D+R+S+TG+C++LG SLI+W+SKKQS  SRSS ESEYR++A   CEI WL  LL+D
Sbjct: 1288 GTCKDSRRSVTGFCIYLGTSLITWKSKKQSVVSRSSTESEYRSLAQATCEIIWLQQLLKD 1347

Query: 1217 LNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCHIVREKLQQGLVHLLPISTTLQ 1276
            L+     P  +F DN+SA+H+A NP +HERTKHIE+DCH VR++++ G +  L + T  Q
Sbjct: 1348 LHVTMTCPAKLFCDNKSALHLATNPVFHERTKHIEIDCHTVRDQIKAGKLKTLHVPTGNQ 1407

Query: 1277 TADVFTKSLTPAPFKTCISKL 1297
             AD+ TK L P PF + + ++
Sbjct: 1408 LADILTKPLHPGPFHSLLKRI 1428


>emb|CAB10526.1| retrotransposon like protein [Arabidopsis thaliana]
            gi|7268497|emb|CAB78748.1| retrotransposon like protein
            [Arabidopsis thaliana] gi|7444421|pir||A71444 probable
            LTR retrotransposon - Arabidopsis thaliana
          Length = 1433

 Score =  689 bits (1778), Expect = 0.0
 Identities = 350/771 (45%), Positives = 489/771 (63%), Gaps = 37/771 (4%)

Query: 547  SHHLC*N-PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLD 605
            ++H C   P+QN++VERKHQH+++VARALLFQ+++P+ FW   V  AV+LINRLPTPVL+
Sbjct: 687  AYHSCPETPEQNSVVERKHQHILNVARALLFQSNIPLEFWGDCVLTAVFLINRLPTPVLN 746

Query: 606  NKCPFQILYNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYD 665
            NK P++ L N  P   +LK FG LC+ ST    R KF+PRA  C+FLG+    KG+   D
Sbjct: 747  NKSPYEKLKNIPPAYESLKTFGCLCYSSTSPKQRHKFEPRARACVFLGYPLGYKGYKLLD 806

Query: 666  LKSRVISISRNVTFHEHN-PFKPLDSDLTPTQAFPTTLPPIFDDDIPVPATVQAQEPPVN 724
            +++  +SISR+V FHE   PF            FP    P   DD+P+  T      P  
Sbjct: 807  IETHAVSISRHVIFHEDIFPFISSTIKDDIKDFFPLLQFPARTDDLPLEQTSIIDTHPHQ 866

Query: 725  N--QNQVIAP--RTSQRIRKPPSYLQDYHCTLLSSDSVPITSSSTGINYPLSKVLSYDHL 780
            +   ++ + P    S+R +KPP +LQD+HC                          Y++ 
Sbjct: 867  DVSSSKALVPFDPLSKRQKKPPKHLQDFHC--------------------------YNNT 900

Query: 781  NPKYQSFVMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPIGC 840
               + +F+ NI++++ P R+SEA   + W  AM +EI A+ R  TW +V  PP+ K IGC
Sbjct: 901  TEPFHAFINNITNAVIPQRYSEAKDFKAWCDAMKEEIGAMVRTNTWSVVSLPPNKKAIGC 960

Query: 841  KWVYKVKYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ*WH 900
            KWV+ +K+  DGSIERYKARLV KGYTQ EG+D+++TFSPVAK+T++R++L L A   W 
Sbjct: 961  KWVFTIKHNADGSIERYKARLVAKGYTQEEGLDYEETFSPVAKLTSVRMMLLLAAKMKWS 1020

Query: 901  LHQLDVDNAFLHASLDEQIYMTIPQGLV-----CNKANQVCLLQKSLYGLKQASRQWFNT 955
            +HQLD+ NAFL+  LDE+IYM IP G           + +C L KS+YGLKQASRQW+  
Sbjct: 1021 VHQLDISNAFLNGDLDEEIYMKIPPGYADLVGEALPPHAICRLHKSIYGLKQASRQWYLK 1080

Query: 956  LSASLKKLGYKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFR 1015
            LS +LK +G+++SNADHTL+IK ++G    +L+Y DD+++  N    +    + L   F+
Sbjct: 1081 LSNTLKGMGFQKSNADHTLFIKYANGVLMGVLVYVDDIMIVSNSDDAVAQFTAELKSYFK 1140

Query: 1016 IKDLGEAKYFLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIG 1075
            ++DLG AKYFLG+EIARS  GI + QRKY L+L+S +G L SKP S P+D S KL    G
Sbjct: 1141 LRDLGAAKYFLGIEIARSEKGISICQRKYILELLSTTGFLGSKPSSIPLDPSVKLNKEDG 1200

Query: 1076 TPLTDIGSYRRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPG 1135
             PLTD  SYR+LVG+L+YL  TRPDI +AVN L QF  APT +H  A H+VL+YLKG+ G
Sbjct: 1201 VPLTDSTSYRKLVGKLMYLQITRPDIAYAVNTLCQFSHAPTSVHLSAVHKVLRYLKGTVG 1260

Query: 1136 SGLFYPASSSTTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCES 1195
             GLFY A     L  ++DSD+  C D+R+ +  YC+F+GD L+SW+SKKQ T S S+ E+
Sbjct: 1261 QGLFYSADDKFDLRGYTDSDFGSCTDSRRCVAAYCMFIGDYLVSWKSKKQDTVSMSTAEA 1320

Query: 1196 EYRAMATTVCEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCH 1255
            E+RAM+    E+ WL  L  D   P + P  ++ DN +A+HI +N  +HERTK +ELDC+
Sbjct: 1321 EFRAMSQGTKEMIWLSRLFDDFKVPFIPPAYLYCDNTAALHIVNNSVFHERTKFVELDCY 1380

Query: 1256 IVREKLQQGLVHLLPISTTLQTADVFTKSLTPAPFKTCISKLGMKDIHLPV 1306
              RE ++ G +  + + T  Q AD  TK++ PA F   I K+G+ +I  P+
Sbjct: 1381 KTREAVESGFLKTMFVETGEQVADPLTKAIHPAQFHKLIGKMGVCNIFAPL 1431


>emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
            gi|7268152|emb|CAB78488.1| retrovirus-related like
            polyprotein [Arabidopsis thaliana] gi|7488175|pir||G71406
            probable retrovirus-related polyprotein - Arabidopsis
            thaliana
          Length = 1489

 Score =  682 bits (1760), Expect = 0.0
 Identities = 362/821 (44%), Positives = 499/821 (60%), Gaps = 112/821 (13%)

Query: 544  HHSSHHL--C*NPQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPT 601
            H   HH      PQQN++VERKHQH+++VARALLFQ+++P+ +WS  V  AV+LINRLP+
Sbjct: 716  HGMLHHFSCAYTPQQNSVVERKHQHILNVARALLFQSNIPMQYWSDCVTTAVFLINRLPS 775

Query: 602  PVLDNKCPFQILYNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGF 661
            P+L+NK P++++ N  PD + LK FG LCFVST    R KF PRA  C+FLG+    KG+
Sbjct: 776  PLLNNKSPYELILNKQPDYSLLKNFGCLCFVSTNAHERTKFTPRARACVFLGYPSGYKGY 835

Query: 662  VTYDLKSRVISISRNVTFHEHN-PFKPLDS-----DLTPTQAFPTTLP-------PIFDD 708
               DL+S  +++SRNV F EH  PFK  +      D+ P    P   P       P+ D+
Sbjct: 836  KVLDLESHSVTVSRNVVFKEHVFPFKTSELLNKAVDMFPNSILPLPAPLHFVETMPLIDE 895

Query: 709  DIPVPATVQAQE----------------PPVNN------QNQVIAPRTSQRIRKPPSYLQ 746
            D  +P T  ++                 PP +N       +  +    S+R  + PSYL 
Sbjct: 896  DSLIPTTTDSRTADNHASSSSSALPSIIPPSSNTETQDIDSNAVPITRSKRTTRAPSYLS 955

Query: 747  DYHCTLLSSDS-VPITSSSTGIN----------------YPLSKVLSYDHLNPKYQSFVM 789
            +YHC+L+ S S +P T SS  I+                YP+S V+SYD   P  QS++ 
Sbjct: 956  EYHCSLVPSISTLPPTDSSIPIHPLPEIFTASSPKKTTPYPISTVVSYDKYTPLCQSYIF 1015

Query: 790  NISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPIGCKWVYKVKYK 849
              ++  EP  FS+A+K E W +   +E++A+E N+TW +   PPD   +GCKWV+ +KY 
Sbjct: 1016 AYNTETEPKTFSQAMKSEKWIRVAVEELQAMELNKTWSVESLPPDKNVVGCKWVFTIKYN 1075

Query: 850  QDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ*WHLHQLDVDNA 909
             DG++ERYKARLV +G+TQ EG+DF DTFSPVAK+T+ +++L L A   W L Q+DV +A
Sbjct: 1076 PDGTVERYKARLVAQGFTQQEGIDFLDTFSPVAKLTSAKMMLGLAAITGWTLTQMDVSDA 1135

Query: 910  FLHASLDEQIYMTIPQGLVCNKA-----NQVCLLQKSLYGLKQASRQWFNTLSASLKKLG 964
            FLH  LDE+I+M++PQG           N VC L KS+YGLKQASRQW+           
Sbjct: 1136 FLHGDLDEEIFMSLPQGYTPPAGTILPPNPVCRLLKSIYGLKQASRQWYKR--------- 1186

Query: 965  YKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKY 1024
                              F A L+Y DD+++A N+  E++ +K+ L  +F+IKDLG A++
Sbjct: 1187 ------------------FVAALVYIDDIMIASNNDAEVENLKALLRSEFKIKDLGPARF 1228

Query: 1025 FLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIGTPLTDIGSY 1084
            FLGL                          L  KP S PMD +  L  ++GTPL +  +Y
Sbjct: 1229 FLGL--------------------------LGCKPSSIPMDPTLHLVRDMGTPLPNPTAY 1262

Query: 1085 RRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPGSGLFYPASS 1144
            R+L+GRLLYL  TRPDIT+AV+QLSQF+SAP+DIH QAAH+VL+Y+K +PG GL Y A  
Sbjct: 1263 RKLIGRLLYLTITRPDITYAVHQLSQFISAPSDIHLQAAHKVLRYIKANPGQGLMYSADY 1322

Query: 1145 STTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCESEYRAMATTV 1204
               L  FSD+DWA C DTR+SI+G+C++LG SLISW+SKKQ+ ASRSS ESEYR+MA   
Sbjct: 1323 EICLNGFSDADWAACKDTRRSISGFCIYLGTSLISWKSKKQAVASRSSTESEYRSMAQAT 1382

Query: 1205 CEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCHIVREKLQQG 1264
            CEI WL  LL+DL+ P   P  +F DN+SA+H + NP +HERTKHIE+DCH VR++++ G
Sbjct: 1383 CEIIWLQQLLKDLHIPLTCPAKLFCDNKSALHSSLNPVFHERTKHIEIDCHTVRDQIKAG 1442

Query: 1265 LVHLLPISTTLQTADVFTKSLTPAPFKTCISKLGMKDIHLP 1305
             +  L + T  Q AD+ TK+L P PF   + ++ +  + LP
Sbjct: 1443 NLKALHVPTENQHADILTKALHPGPFHHLLRQMSLSSLFLP 1483


>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301701|pir||E84589 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1461

 Score =  681 bits (1758), Expect = 0.0
 Identities = 354/766 (46%), Positives = 492/766 (64%), Gaps = 31/766 (4%)

Query: 547  SHHLC*N-PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLD 605
            S H C   P+QN++VERKHQH+++VARAL+FQ+++ + +W   V  AV+LINR P+ +L 
Sbjct: 712  SFHSCPETPEQNSVVERKHQHILNVARALMFQSNMSLPYWGDCVLTAVFLINRTPSALLS 771

Query: 606  NKCPFQILYNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYD 665
            NK PF++L   +PD + LK FG LC+ ST +  R KF PR+  C+FLG+    KG+   D
Sbjct: 772  NKTPFEVLTGKLPDYSQLKTFGCLCYSSTSSKQRHKFLPRSRACVFLGYPFGFKGYKLLD 831

Query: 666  LKSRVISISRNVTFHEHNPFKPLDSDLTPTQAFPTTLPPIFDDDIPVPA----TVQAQEP 721
            L+S V+ ISRNV FHE     PL S    +Q   TT   +F    P+ +    T     P
Sbjct: 832  LESNVVHISRNVEFHEE--LFPLAS----SQQSATTASDVFTPMDPLSSGNSITSHLPSP 885

Query: 722  PVNNQNQVIAPRTSQRIRKPPSYLQDYHCTLLSSDSVPITSSSTGINYPLSKVLSYDHLN 781
             ++   Q+    + +RI K P++LQDYHC  ++ D           ++P+S  LSY  ++
Sbjct: 886  QISPSTQI----SKRRITKFPAHLQDYHCYFVNKDD----------SHPISSSLSYSQIS 931

Query: 782  PKYQSFVMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPIGCK 841
            P +  ++ NIS    P  + EA   + W  A+DQEI A+ER  TW +   PP  K +GCK
Sbjct: 932  PSHMLYINNISKIPIPQSYHEAKDSKEWCGAIDQEIGAMERTDTWEITSLPPGKKAVGCK 991

Query: 842  WVYKVKYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ*WHL 901
            WV+ VK+  DGS+ER+KAR+V KGYTQ EG+D+ +TFSPVAKM T++++L + AS+ W+L
Sbjct: 992  WVFTVKFHADGSLERFKARIVAKGYTQKEGLDYTETFSPVAKMATVKLLLKVSASKKWYL 1051

Query: 902  HQLDVDNAFLHASLDEQIYMTIPQGLVCNKA-----NQVCLLQKSLYGLKQASRQWFNTL 956
            +QLD+ NAFL+  L+E IYM +P G    K      N VC L+KS+YGLKQASRQWF   
Sbjct: 1052 NQLDISNAFLNGDLEETIYMKLPDGYADIKGTSLPPNVVCRLKKSIYGLKQASRQWFLKF 1111

Query: 957  SASLKKLGYKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFRI 1016
            S SL  LG+++ + DHTL+++     F  LL+Y DD+++A       Q +  +L   F++
Sbjct: 1112 SNSLLALGFEKQHGDHTLFVRCIGSEFIVLLVYVDDIVIASTTEQAAQSLTEALKASFKL 1171

Query: 1017 KDLGEAKYFLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIGT 1076
            ++LG  KYFLGLE+AR++ GI L+QRKYAL+L++ +  L  KP S PM  + +L  N G 
Sbjct: 1172 RELGPLKYFLGLEVARTSEGISLSQRKYALELLTSADMLDCKPSSIPMTPNIRLSKNDGL 1231

Query: 1077 PLTDIGSYRRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPGS 1136
             L D   YRRLVG+L+YL  TRPDITFAVN+L QF SAP   H  A ++VL+Y+KG+ G 
Sbjct: 1232 LLEDKEMYRRLVGKLMYLTITRPDITFAVNKLCQFSSAPRTAHLAAVYKVLQYIKGTVGQ 1291

Query: 1137 GLFYPASSSTTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCESE 1196
            GLFY A    TL  ++D+DW  C D+R+S TG+ +F+G SLISWRSKKQ T SRSS E+E
Sbjct: 1292 GLFYSAEDDLTLKGYTDADWGTCPDSRRSTTGFTMFVGSSLISWRSKKQPTVSRSSAEAE 1351

Query: 1197 YRAMATTVCEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCHI 1256
            YRA+A   CE+ WL  LL  L      P  ++ D+ +A++IA NP +HERTKHIE+DCH 
Sbjct: 1352 YRALALASCEMAWLSTLLLALRVHSGVPI-LYSDSTAAVYIATNPVFHERTKHIEIDCHT 1410

Query: 1257 VREKLQQGLVHLLPISTTLQTADVFTKSLTPAPFKTCISKLGMKDI 1302
            VREKL  G + LL + T  Q AD+ TK L P  F   +SK+ +++I
Sbjct: 1411 VREKLDNGQLKLLHVKTKDQVADILTKPLFPYQFAHLLSKMSIQNI 1456


>pir||G86301 probable retroelement polyprotein [imported] - Arabidopsis thaliana
            gi|9989054|gb|AAG10817.1| Putative retroelement
            polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score =  681 bits (1756), Expect = 0.0
 Identities = 349/747 (46%), Positives = 496/747 (65%), Gaps = 25/747 (3%)

Query: 547  SHHLC*N-PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLD 605
            ++H C   P+QN++VERKHQH+++VARALLFQ+ +P+++W   +  AV++INR P+PV+ 
Sbjct: 677  AYHSCPETPEQNSVVERKHQHILNVARALLFQSQIPLSYWGDCILTAVFIINRTPSPVIS 736

Query: 606  NKCPFQILYNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYD 665
            NK  F++L   VPD T+LK FG LC+ ST    R KF+ RA  C FLG+    KG+   D
Sbjct: 737  NKTLFEMLTKKVPDYTHLKSFGCLCYASTSPKQRHKFEDRARTCAFLGYPSGYKGYKLLD 796

Query: 666  LKSRVISISRNVTFHEHN-PFKPLDSDLTPTQAFPTTLPPIFDD--DIPVPATVQAQEPP 722
            L+S  I ISRNV F+E   PFK   ++   +  F    P I+ D  D      +  QE  
Sbjct: 797  LESHTIFISRNVVFYEDLFPFKTKPAENEESSVF---FPHIYVDRNDSHPSQPLPVQETS 853

Query: 723  VNNQNQVIAPRTSQRIRKPPSYLQDYHCTLLSSDSVPITSSSTGINYPLSKVLSYDHLNP 782
             +N   V A + + R+ +PP+YL+DYHC  ++S +          ++P+S+VLSY  L+ 
Sbjct: 854  ASN---VPAEKQNSRVSRPPAYLKDYHCNSVTSST----------DHPISEVLSYSSLSD 900

Query: 783  KYQSFVMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPIGCKW 842
             Y  F+  ++   EP  +++A + + W  AM  EI ALE N TW++   P   K +GCKW
Sbjct: 901  PYMIFINAVNKIPEPHTYAQARQIKEWCDAMGMEITALEDNGTWVVCSLPVGKKAVGCKW 960

Query: 843  VYKVKYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ*WHLH 902
            VYK+K   DGS+ERYKARLV KGYTQ EG+D+ DTFSPVAK+TT+++++A+ A++ W L 
Sbjct: 961  VYKIKLNADGSLERYKARLVAKGYTQTEGLDYVDTFSPVAKLTTVKLLIAVAAAKGWSLS 1020

Query: 903  QLDVDNAFLHASLDEQIYMTIPQGLVCNKA-----NQVCLLQKSLYGLKQASRQWFNTLS 957
            QLD+ NAFL+ SLDE+IYMT+P G    +      N VC L+KSLYGLKQASRQW+   S
Sbjct: 1021 QLDISNAFLNGSLDEEIYMTLPPGYSPRQGDSFPPNAVCRLKKSLYGLKQASRQWYLKFS 1080

Query: 958  ASLKKLGYKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFRIK 1017
             SLK LG+ QS+ DHTL+ + S  S+ A+L+Y DD+++A +   E +L++ +L    +++
Sbjct: 1081 ESLKALGFTQSSGDHTLFTRKSKNSYMAVLVYVDDIIIASSCDRETELLRDALQRSSKLR 1140

Query: 1018 DLGEAKYFLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIGTP 1077
            DLG  +YFLGLEIAR+T GI + QRKY L+L++++G L  K  S PM+ +QKL    G  
Sbjct: 1141 DLGTLRYFLGLEIARNTDGISICQRKYTLELLAETGLLGCKSSSVPMEPNQKLSQEDGEL 1200

Query: 1078 LTDIGSYRRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPGSG 1137
            + D   YR+LVG+L+YL  TRPDIT+AV++L QF SAP   H +A ++++ YLKG+ G G
Sbjct: 1201 IDDAEHYRKLVGKLMYLTFTRPDITYAVHRLCQFTSAPRVPHLKAVYKIIYYLKGTVGQG 1260

Query: 1138 LFYPASSSTTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCESEY 1197
            LFY A+    L+ F+DSD++ C D+RK  TGYC+FLG SL++W+SKKQ   S SS E+EY
Sbjct: 1261 LFYSANVDLKLSGFADSDFSSCSDSRKLTTGYCMFLGTSLVAWKSKKQEVISMSSAEAEY 1320

Query: 1198 RAMATTVCEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCHIV 1257
            +AM+  V E+ WL +LL+DL       + ++ DN +A+HIA+NP +HERTKHIE D H +
Sbjct: 1321 KAMSMAVREMMWLRFLLEDLWIDVSEASVLYCDNTAAIHIANNPVFHERTKHIERDYHHI 1380

Query: 1258 REKLQQGLVHLLPISTTLQTADVFTKS 1284
            REK+  GL+  L + T  Q AD+  KS
Sbjct: 1381 REKIILGLIRTLHVRTENQLADIPYKS 1407


>gb|AAU89728.1| putative retroelement pol polyprotein-like [Solanum tuberosum]
          Length = 1476

 Score =  635 bits (1637), Expect = e-180
 Identities = 345/779 (44%), Positives = 484/779 (61%), Gaps = 52/779 (6%)

Query: 554  PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLDNKCPFQIL 613
            PQQN +VER+H+H++  ARAL FQ HLP+ FW   V  AV++INR+P+ VL NK PF+++
Sbjct: 704  PQQNGVVERRHKHILETARALRFQGHLPIRFWGECVLSAVHIINRIPSSVLHNKSPFELM 763

Query: 614  YNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYDLKSRVISI 673
            Y   PD++ +++ G LC  + L +   +                 KG+  YDL+ +   +
Sbjct: 764  YKRSPDLSYMRVIGCLCHATNLVNTSTQ-----------------KGYKLYDLEHQHFFV 806

Query: 674  SRNVTFHEHN-PFK-PLDSDLTPTQAFPTTLP---PIFDDDIPVPATVQAQE------PP 722
            SR++ F+E   PF+ P  +D   T  F  + P      D D   PA + ++E      PP
Sbjct: 807  SRDMVFNEAVFPFQSPALADPHDTPVFLASPPCSSHTEDADAVQPAIITSEEIIPVASPP 866

Query: 723  VNNQNQVIAP----RTSQRIRKPPSYLQDYHCTLLSSDSVPITSSSTGINYPLSKVLSYD 778
                +  + P    R S R  KPP + +D+  T         TS S    YP+S  + Y 
Sbjct: 867  SAVSDDHLHPPPERRRSYRTGKPPIWQKDFITTS--------TSRSNHCLYPISDNIDYS 918

Query: 779  HLNPKYQSFVMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPI 838
             L+  YQ ++ + S   EP  + +A     W  AM +EI+ALE N+TW +V  P   K I
Sbjct: 919  CLSSTYQCYIASSSVETEPQFYYQAANDCRWVHAMKEEIQALEDNKTWEVVSLPKGKKAI 978

Query: 839  GCKWVYKVKYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ* 898
            GCKWVYK+KYK  G IER+KARLV KGY Q EG+D+Q+TFSPV KM TLR +L L  S+ 
Sbjct: 979  GCKWVYKIKYKASGEIERFKARLVAKGYNQKEGLDYQETFSPVVKMVTLRTVLTLAVSKG 1038

Query: 899  WHLHQLDVDNAFLHASLDEQIYMTIPQGLVCNKAN--QVCLLQKSLYGLKQASRQWFNTL 956
            W + Q+DV NAFL   L E++YM +PQG   +K    +VC L KSLYGLKQASRQW   L
Sbjct: 1039 WDIQQMDVYNAFLQGDLIEEVYMQLPQGFQYDKTGDPKVCRLLKSLYGLKQASRQWNVKL 1098

Query: 957  SASLKKLGYKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFRI 1016
            + +L   G++QS+ D++L +K ++     +L+Y DD+L+ G+ +  I   K  L   F+I
Sbjct: 1099 TTALLAAGFQQSHLDYSLMLKRTADGIVIVLIYVDDLLITGSSLQLIDDAKQVLKANFKI 1158

Query: 1017 KDLGEAKYFLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGT---- 1072
            KDLG  +YFLG+E AR+ SG++++QRKYAL+LISD G   SKP  TP++   KL T    
Sbjct: 1159 KDLGTLRYFLGMEFARNASGMLMHQRKYALELISDLGLGGSKPSVTPVELHLKLTTREFD 1218

Query: 1073 -NIGTP-----LTDIGSYRRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRV 1126
             ++G+      L D   Y+RLVGRLLYL  TRPDI+FAV  LSQF+ AP   H +AA RV
Sbjct: 1219 LHVGSSGADSLLADPTEYQRLVGRLLYLTITRPDISFAVQHLSQFMHAPKVSHMEAAIRV 1278

Query: 1127 LKYLKGSPGSGLFYPASSSTTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQS 1186
            +KY+K +PG GL+    ++ TL A+ D+DW  CI+TRKSITGY +  G +L+SW+SKKQ 
Sbjct: 1279 VKYVKQAPGLGLYMAVQTADTLQAYCDADWGSCINTRKSITGYMIQFGSALLSWKSKKQP 1338

Query: 1187 TASRSSCESEYRAMATTVCEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHER 1246
            T SRSS E+EYR++A+TV E+ WL  L ++L+ P   P S++ D+++A+ IA NP +HER
Sbjct: 1339 TISRSSAEAEYRSLASTVAELVWLTGLFKELDMPLSLPVSLYCDSKAAIQIAANPVFHER 1398

Query: 1247 TKHIELDCHIVREKLQQGLVHLLPISTTLQTADVFTKSLTPAPFKTCISKLGMKDIHLP 1305
            TKHI++DCH +REK+Q GLV +  + T  Q AD+ TK L+ A     +SKLG+K+I +P
Sbjct: 1399 TKHIDIDCHFIREKVQAGLVMIHYLPTQEQPADILTKGLSSAQHSYLVSKLGLKNIFIP 1457


>gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301698|pir||C84512 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1501

 Score =  627 bits (1617), Expect = e-178
 Identities = 334/821 (40%), Positives = 492/821 (59%), Gaps = 69/821 (8%)

Query: 554  PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLDNKCPFQIL 613
            PQQN  VERKH+H+++VARALLFQA LP+ FW  ++  A YLINR P+ +L  + P+++L
Sbjct: 680  PQQNGRVERKHRHILNVARALLFQASLPIKFWGESILTAAYLINRTPSSILSGRTPYEVL 739

Query: 614  YNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYDLKSRVISI 673
            + + P  + L++FG+ C+V  +T  + KF  R+  CIF+G+    KG+  YD++     +
Sbjct: 740  HGSKPVYSQLRVFGSACYVHRVTRDKDKFGQRSRSCIFVGYPFGKKGWKVYDIERNEFLV 799

Query: 674  SRNVTFHEHN-PFKPLDSDLTPTQAFPTT-------LPP--------------------- 704
            SR+V F E   P+  ++S    + + PT        +PP                     
Sbjct: 800  SRDVIFREEVFPYAGVNSSTLASTSLPTVSEDDDWAIPPLEVRGSIDSVETERVVCTTDE 859

Query: 705  ------IFDDDIP------------VPATVQAQEPPVNNQNQVIAP------------RT 734
                  + D +IP             P +V     P      ++ P            R 
Sbjct: 860  VVLDTSVSDSEIPNQEFVPDDTPPSSPLSVSPSGSPNTPTTPIVVPVASPIPVSPPKQRK 919

Query: 735  SQRIRKPPSYLQDYHC-----TLLSSDSVPITSSSTGIN-----YPLSKVLSYDHLNPKY 784
            S+R   PP  L DY       T  S  ++P   S +        +PL+  +S    +  +
Sbjct: 920  SKRATHPPPKLNDYVLYNAMYTPSSIHALPADPSQSSTVPGKSLFPLTDYVSDAAFSSSH 979

Query: 785  QSFVMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPIGCKWVY 844
            ++++  I+ ++EP  F EAV+ + W  AM  E++ALE N+TW +VD PP    IG +WV+
Sbjct: 980  RAYLAAITDNVEPKHFKEAVQIKVWNDAMFTEVDALEINKTWDIVDLPPGKVAIGSQWVF 1039

Query: 845  KVKYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ*WHLHQL 904
            K KY  DG++ERYKARLVV+G  QVEG D+++TF+PV +MTT+R +L   A+  W ++Q+
Sbjct: 1040 KTKYNSDGTVERYKARLVVQGNKQVEGEDYKETFAPVVRMTTVRTLLRNVAANQWEVYQM 1099

Query: 905  DVDNAFLHASLDEQIYMTIPQGLVCNKANQVCLLQKSLYGLKQASRQWFNTLSASLKKLG 964
            DV NAFLH  L+E++YM +P G   +  ++VC L+KSLYGLKQA R WF  LS SL + G
Sbjct: 1100 DVHNAFLHGDLEEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCWFKKLSDSLLRFG 1159

Query: 965  YKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKY 1024
            + QS  D++L+    +     +L+Y DD+L+ GND + +Q  K  L   F +KDLG+ KY
Sbjct: 1160 FVQSYEDYSLFSYTRNNIELRVLIYVDDLLICGNDGYMLQKFKDYLSRCFSMKDLGKLKY 1219

Query: 1025 FLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIGTPLTDIGSY 1084
            FLG+E++R   GI L+QRKYAL +I+DSG+L S+P  TP++ +  L ++ G  L+D   Y
Sbjct: 1220 FLGIEVSRGPEGIFLSQRKYALDVIADSGNLGSRPAHTPLEQNHHLASDDGPLLSDPKPY 1279

Query: 1085 RRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPGSGLFYPASS 1144
            RRLVGRLLYL  TRP+++++V+ L+QF+  P + H  AA RV++YLKGSPG G+   A  
Sbjct: 1280 RRLVGRLLYLLHTRPELSYSVHVLAQFMQNPREAHFDAALRVVRYLKGSPGQGILLNADP 1339

Query: 1145 STTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCESEYRAMATTV 1204
              TL  + DSDW  C  TR+SI+ Y + LG S ISW++KKQ T S SS E+EYRAM+  +
Sbjct: 1340 DLTLEVYCDSDWQSCPLTRRSISAYVVLLGGSPISWKTKKQDTVSHSSAEAEYRAMSYAL 1399

Query: 1205 CEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCHIVREKLQQG 1264
             EI+WL  LL++L   Q  P  ++ D+++A+HIA NP +HERTKHIE DCH VR+ ++ G
Sbjct: 1400 KEIKWLRKLLKELGIEQSTPARLYCDSKAAIHIAANPVFHERTKHIESDCHSVRDAVRDG 1459

Query: 1265 LVHLLPISTTLQTADVFTKSLTPAPFKTCISKLGMKDIHLP 1305
            ++    + TT Q ADVFTK+L    F   +SKLG++++H P
Sbjct: 1460 IITTQHVRTTEQLADVFTKALGRNQFLYLMSKLGVQNLHTP 1500


>gb|AAG50751.1| polyprotein, putative [Arabidopsis thaliana] gi|25301686|pir||F96610
            probable polyprotein T8L23.26 [imported] - Arabidopsis
            thaliana
          Length = 1468

 Score =  616 bits (1588), Expect = e-174
 Identities = 313/802 (39%), Positives = 490/802 (61%), Gaps = 51/802 (6%)

Query: 554  PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLDNKCPFQIL 613
            P QN  VERKH+H++++ARAL FQ++LP+ FW   +  A YLINR P+ +L  K P+++L
Sbjct: 667  PHQNGRVERKHRHILNIARALRFQSYLPIQFWGECILSAAYLINRTPSMLLQGKSPYEML 726

Query: 614  YNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYDLKSRVISI 673
            Y T P  ++L++FG+LC+         KF  R+ +C+F+G+    KG+  +DL+ +   +
Sbjct: 727  YKTAPKYSHLRVFGSLCYAHNQNHKGDKFAARSRRCVFVGYPHGQKGWRLFDLEEQKFFV 786

Query: 674  SRNVTFHEHN-PFKPL----------------------------------DSDLTPTQAF 698
            SR+V F E   P+  +                                  ++ + P  A 
Sbjct: 787  SRDVIFQETEFPYSKMSCNEEDERVLVDCVGPPFIEEAIGPRTIIGRNIGEATVGPNVAT 846

Query: 699  PTTLPPIFD--------------DDIPVPATVQAQEPPVNNQNQV-IAPRTSQRIRKPPS 743
               +P I                D     +TVQ  + P+++     I  R S R  + P 
Sbjct: 847  GPIIPEINQESSSPSEFVSLSSLDPFLASSTVQTADLPLSSTTPAPIQLRRSSRQTQKPM 906

Query: 744  YLQDYHCTLLSSDSVPITSSSTGINYPLSKVLSYDHLNPKYQSFVMNISSSLEPTRFSEA 803
             L+++    +S +S+   +SS+ + YP+ K +        +++F+  +++ +EPT ++EA
Sbjct: 907  KLKNFVTNTVSVESISPEASSSSL-YPIEKYVDCHRFTSSHKAFLAAVTAGMEPTTYNEA 965

Query: 804  VKHECWRKAMDQEIEALERNQTWILVDKPPDSKPIGCKWVYKVKYKQDGSIERYKARLVV 863
            +  + WR+AM  EIE+L  NQT+ +V+ PP  + +G KWVYK+KY+ DG+IERYKARLVV
Sbjct: 966  MVDKAWREAMSAEIESLRVNQTFSIVNLPPGKRALGNKWVYKIKYRSDGAIERYKARLVV 1025

Query: 864  KGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ*WHLHQLDVDNAFLHASLDEQIYMTI 923
             G  Q EGVD+ +TF+PVAKM+T+R+ L + A++ WH+HQ+DV NAFLH  L E++YM +
Sbjct: 1026 LGNCQKEGVDYDETFAPVAKMSTVRLFLGVAAARDWHVHQMDVHNAFLHGDLKEEVYMKL 1085

Query: 924  PQGLVCNKANQVCLLQKSLYGLKQASRQWFNTLSASLKKLGYKQSNADHTLYIKASSGSF 983
            PQG  C+  ++VC L KSLYGLKQA R WF+ LS++LK+ G+ QS +D++L+   + G F
Sbjct: 1086 PQGFQCDDPSKVCRLHKSLYGLKQAPRCWFSKLSSALKQYGFTQSLSDYSLFSYNNDGIF 1145

Query: 984  TALLLYADDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKYFLGLEIARSTSGIVLNQRK 1043
              +L+Y DD++++G+    +   KS L   F +KDLG  KYFLG+E++R+  G  L+QRK
Sbjct: 1146 VHVLVYVDDLIISGSCPDAVAQFKSYLESCFHMKDLGLLKYFLGIEVSRNAQGFYLSQRK 1205

Query: 1044 YALQLISDSGHLASKPVSTPMDNSQKLGTNIGTPLTDIGSYRRLVGRLLYLNTTRPDITF 1103
            Y L +IS+ G L ++P + P++ + KL  +    L+D   YRRLVGRL+YL  TRP++++
Sbjct: 1206 YVLDIISEMGLLGARPSAFPLEQNHKLSLSTSPLLSDSSRYRRLVGRLIYLVVTRPELSY 1265

Query: 1104 AVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPGSGLFYPASSSTTLTAFSDSDWAGCIDTR 1163
            +V+ L+QF+  P   H  AA RV++YLK +PG G+   ++S+  +  + DSD+A C  TR
Sbjct: 1266 SVHTLAQFMQNPRQDHWNAAIRVVRYLKSNPGQGILLSSTSTLQINGWCDSDYAACPLTR 1325

Query: 1164 KSITGYCLFLGDSLISWRSKKQSTASRSSCESEYRAMATTVCEIQWLHYLLQDLNQPQLA 1223
            +S+TGY + LGD+ ISW++KKQ T SRSS E+EYRAMA    E+ WL  +L DL    + 
Sbjct: 1326 RSLTGYFVQLGDTPISWKTKKQPTVSRSSAEAEYRAMAFLTQELMWLKRVLYDLGVSHVQ 1385

Query: 1224 PTSMFRDNQSAMHIAHNPSYHERTKHIELDCHIVREKLQQGLVHLLPISTTLQTADVFTK 1283
               +F D++SA+ ++ NP  HERTKH+E+DCH +R+ +  G++    + +  Q AD+ TK
Sbjct: 1386 AMRIFSDSKSAIALSVNPVQHERTKHVEVDCHFIRDAILDGIIATSFVPSHKQLADILTK 1445

Query: 1284 SLTPAPFKTCISKLGMKDIHLP 1305
            +L     +  + KLG+ D+H P
Sbjct: 1446 ALGEKEVRYFLRKLGILDVHAP 1467


>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1491

 Score =  606 bits (1563), Expect = e-171
 Identities = 331/828 (39%), Positives = 490/828 (58%), Gaps = 76/828 (9%)

Query: 554  PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLDNKCPFQIL 613
            PQQN  VERKH+H+++V+RALLFQA LP+ FW  AV  A YLINR P+ + +   P+++L
Sbjct: 663  PQQNGRVERKHRHILNVSRALLFQASLPIKFWGEAVMTAAYLINRTPSSIHNGLSPYELL 722

Query: 614  YNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYDLK------ 667
            +   PD   L++FG+ C+   +T  + KF  R+  CIF+G+    KG+  YDL       
Sbjct: 723  HGCKPDYDQLRVFGSACYAHRVTRDKDKFGERSRLCIFVGYPFGQKGWKVYDLSTNEFIV 782

Query: 668  SRVISISRNVTFHEHN---------------------PFKPLD----------------- 689
            SR +    NV  +  N                     PF  L+                 
Sbjct: 783  SRDVVFRENVFPYATNEGDTIYTPPVTCPITYDEDWLPFTTLEDRGSDENSLSDPPVCVT 842

Query: 690  --SDLTPTQAFPTTLPPIFDDDIPVPATVQAQEPPVNNQNQV------------------ 729
              S+       P +LP   DD +    +V   + P N+ +                    
Sbjct: 843  DVSESDTEHDTPQSLPTPVDDPLSPSTSVTPTQTPTNSSSSTSPSTNVSPPQQDTTPIIE 902

Query: 730  -IAPRTSQRIRKPPSYLQDY-----HCT-----LLSSDSVPITSSSTGIN-YPLSKVLSY 777
               PR  +R  +  + L+DY      CT     +LS  +   +SS  G + YPL+  +  
Sbjct: 903  NTPPRQGKRQVQQLARLKDYILYNASCTPNTPHVLSPSTSQSSSSIQGNSQYPLTDYIFD 962

Query: 778  DHLNPKYQSFVMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKP 837
            +  +  ++ F+  I+++ EP  F EAVK + W  AM +E++ALE N+TW +VD P     
Sbjct: 963  ECFSAGHKVFLAAITANDEPKHFKEAVKVKVWNDAMYKEVDALEVNKTWDIVDLPTGKVA 1022

Query: 838  IGCKWVYKVKYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ 897
            IG +WVYK K+  DG++ERYKARLVV+G  Q+EG D+ +TF+PV KMTT+R +L L A+ 
Sbjct: 1023 IGSQWVYKTKFNADGTVERYKARLVVQGNNQIEGEDYTETFAPVVKMTTVRTLLRLVAAN 1082

Query: 898  *WHLHQLDVDNAFLHASLDEQIYMTIPQGLVCNKANQVCLLQKSLYGLKQASRQWFNTLS 957
             W ++Q+DV NAFLH  L+E++YM +P G   +  ++VC L+KSLYGLKQA R WF  LS
Sbjct: 1083 QWEVYQMDVHNAFLHGDLEEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCWFKKLS 1142

Query: 958  ASLKKLGYKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFRIK 1017
             +LK+ G+ Q   D++ +  +  G    +L+Y DD+++ GND + +Q  K  L   F +K
Sbjct: 1143 DALKRFGFIQGYEDYSFFSYSCKGIELRVLVYVDDLIICGNDEYMVQKFKEYLGRCFSMK 1202

Query: 1018 DLGEAKYFLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIGTP 1077
            DLG+ KYFLG+E++R   GI L+QRKYAL +ISDSG L ++P  TP++ +  L ++ G  
Sbjct: 1203 DLGKLKYFLGIEVSRGPDGIFLSQRKYALDIISDSGTLGARPAYTPLEQNHHLASDDGPL 1262

Query: 1078 LTDIGSYRRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPGSG 1137
            L D   +RRLVGRLLYL  TRP+++++V+ LSQF+ AP + H +AA R+++YLKGSPG G
Sbjct: 1263 LQDPKPFRRLVGRLLYLLHTRPELSYSVHVLSQFMQAPREAHLEAAMRIVRYLKGSPGQG 1322

Query: 1138 LFYPASSSTTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCESEY 1197
            +   ++   TL  + DSD+  C  TR+S++ Y + LG S ISW++KKQ T S SS E+EY
Sbjct: 1323 ILLSSNKDLTLEVYCDSDFQSCPLTRRSLSAYVVLLGGSPISWKTKKQDTVSHSSAEAEY 1382

Query: 1198 RAMATTVCEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCHIV 1257
            RAM+  + EI+WL+ LL++L     APT +F D+++A+ IA NP +HERTKHIE DCH V
Sbjct: 1383 RAMSVALKEIKWLNKLLKELGITLAAPTRLFCDSKAAISIAANPVFHERTKHIERDCHSV 1442

Query: 1258 REKLQQGLVHLLPISTTLQTADVFTKSLTPAPFKTCISKLGMKDIHLP 1305
            R+ ++ G++    + T+ Q AD+FTK+L    F   +SKLG++++H P
Sbjct: 1443 RDAVRDGIITTHHVRTSEQLADIFTKALGRNQFIYLMSKLGIQNLHTP 1490


>emb|CAB81200.1| putative retrotransposon polyprotein [Arabidopsis thaliana]
            gi|4539373|emb|CAB40067.1| putative retrotransposon
            polyprotein [Arabidopsis thaliana] gi|7486142|pir||T04294
            hypothetical protein F25I24.200 - Arabidopsis thaliana
          Length = 1203

 Score =  600 bits (1547), Expect = e-169
 Identities = 321/686 (46%), Positives = 427/686 (61%), Gaps = 62/686 (9%)

Query: 659  KGFVTYDLKSRVISISRNVTFHEHN-PFKPLDSDLTPTQAFPTTLPPI-----------F 706
            KG+   DL+S  ISI+RNV FHE   PFK           FP ++ P+            
Sbjct: 340  KGYKVLDLESHSISITRNVVFHETKFPFKTSKFLKESVDMFPNSILPLPAPLHFVESMPL 399

Query: 707  DDDI--------------------PVPATVQAQEPPVNNQNQVIAP-RTSQRIRKPPSYL 745
            DDD+                    P+P+TV  Q     + +    P    +R  K P+YL
Sbjct: 400  DDDLRADDNNASTSNSASSASSIPPLPSTVNTQNTDALDIDTNSVPIARPKRNAKAPAYL 459

Query: 746  QDYHCTLLSSDSVPITSS-----STGIN--------------YPLSKVLSYDHLNPKYQS 786
             +YHC     +SVP  SS     ST I               YP+S  +SYD L P + S
Sbjct: 460  SEYHC-----NSVPFLSSLSPTTSTSIETPSSSIPPKKITTPYPMSTAISYDKLTPLFHS 514

Query: 787  FVMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPIGCKWVYKV 846
            ++   +   EP  F++A+K E W +A ++E+ ALE+N+TWI+         +GCKWV+ +
Sbjct: 515  YICAYNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWIVESLTEGKNVVGCKWVFTI 574

Query: 847  KYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ*WHLHQLDV 906
            KY  DGSIERYKARLV +G+TQ EG+D+ +TFSPVAK  +++++L L A+  W L Q+DV
Sbjct: 575  KYNPDGSIERYKARLVAQGFTQQEGIDYMETFSPVAKFGSVKLLLGLAAATGWSLTQMDV 634

Query: 907  DNAFLHASLDEQIYMTIPQGL-----VCNKANQVCLLQKSLYGLKQASRQWFNTLSASLK 961
             NAFLH  LDE+IYM++PQG      +   +  VC L KSLYGLKQASRQW+  LS+   
Sbjct: 635  SNAFLHGELDEEIYMSLPQGYTPPTGISLPSKPVCRLLKSLYGLKQASRQWYKRLSSVFL 694

Query: 962  KLGYKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFRIKDLGE 1021
               + QS AD+T+++K S  S   +L+Y DD+++A ND   ++ +K  L  +F+IKDLG 
Sbjct: 695  GANFIQSPADNTMFVKVSCTSIIVVLVYVDDLMIASNDSSAVENLKELLRSEFKIKDLGP 754

Query: 1022 AKYFLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIGTPLTDI 1081
            A++FLGLEIARS+ GI + QRKYA  L+ D G    KP S PMD +  L   +GT L + 
Sbjct: 755  ARFFLGLEIARSSEGISVCQRKYAQNLLEDVGLSGCKPSSIPMDPNLHLTKEMGTLLPNA 814

Query: 1082 GSYRRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPGSGLFYP 1141
             SYR LVGRLLYL  TRPDITFAV+ LSQFLSAPTDIH QAAH+VL+YLKG+PG GL Y 
Sbjct: 815  TSYRELVGRLLYLCITRPDITFAVHTLSQFLSAPTDIHMQAAHKVLRYLKGNPGQGLMYS 874

Query: 1142 ASSSTTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCESEYRAMA 1201
            ASS   L  FSD+DW  C D+R+S+TG+C++LG SLI+W+SKKQS  SRSS ESEYR++A
Sbjct: 875  ASSELCLNGFSDADWGTCKDSRRSVTGFCIYLGTSLITWKSKKQSVVSRSSTESEYRSLA 934

Query: 1202 TTVCEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCHIVREKL 1261
               CEI WL  LL+DL+     P  +F DN+SA+H+A NP +HERTKHIE+DCH VR+++
Sbjct: 935  QATCEIIWLQQLLKDLHVTMTCPAKLFCDNKSALHLATNPVFHERTKHIEIDCHTVRDQI 994

Query: 1262 QQGLVHLLPISTTLQTADVFTKSLTP 1287
            + G +  L + T  Q AD+ TK L P
Sbjct: 995  KAGKLKTLHVPTGNQLADILTKPLHP 1020


>emb|CAD41085.2| OSJNBb0011N17.2 [Oryza sativa (japonica cultivar-group)]
            gi|50925209|ref|XP_472906.1| OSJNBb0011N17.2 [Oryza
            sativa (japonica cultivar-group)]
          Length = 1262

 Score =  582 bits (1499), Expect = e-164
 Identities = 318/819 (38%), Positives = 471/819 (56%), Gaps = 73/819 (8%)

Query: 556  QNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLDNKCPFQILYN 615
            +N + ERK++H++ +AR+L++  ++P   WS AV  A YLINR P+ +L  K P+++++ 
Sbjct: 447  ENGVAERKNRHLLEIARSLMYTMNVPKFLWSEAVMTAAYLINRTPSRILGMKTPYEMIFG 506

Query: 616  TVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGF-------------- 661
                +   ++FG  CFV        K DPRA KCIF+G+    KG+              
Sbjct: 507  KNEFVVPPRVFGCTCFVRDHRPSIGKLDPRAVKCIFIGYSSSQKGYKCWSPSERRTFVSM 566

Query: 662  -VTY-----------DLKSRVISISRNVTFHEHNPFKPLDSDLTPTQAFPTTLPPIFDDD 709
             VT+           D+ S  + +  ++T  +H+  K  + ++   +    +   I   +
Sbjct: 567  DVTFRESVPFYGEKTDISSLFVDLD-DLTRGDHDQQK--EGEILGLKENEQSKGKIVVGE 623

Query: 710  IPV----PATVQAQEPPVNNQNQVIAPR-----TSQRIRKPPSYLQDYHCTLLSSDS--- 757
            IP     P   Q    P   +N  +  R     T+Q++        D     +SS+S   
Sbjct: 624  IPCAIGDPVQEQEWRKPHEEENLQVYTRRMRLPTTQQVEVDDQVSDDLTHVQVSSESGGE 683

Query: 758  ----------VPIT--------------------SSSTGINYPLSKVLSYDHLNPKYQSF 787
                      +PI                        +G    ++  +SY  L+  Y++F
Sbjct: 684  QIEIREEESNLPIAIRKGMRSNAGKPPQRYGFEIGDESGDENDIANYVSYTSLSSTYKAF 743

Query: 788  VMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPIGCKWVYKVK 847
            V +++S++ P  + EA +   W +AM  E+EALE+N+TW LV  P   K + CKWVY VK
Sbjct: 744  VASLNSAIIPKDWKEAKQDPRWHQAMLDELEALEKNKTWDLVSYPNGKKVVNCKWVYAVK 803

Query: 848  YKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ*WHLHQLDVD 907
               DG +ERYKARLV KGY+Q  G+D+ +TF+PVAKM+T+R I++   +  W LHQLDV 
Sbjct: 804  QNPDGKVERYKARLVAKGYSQTYGIDYDETFAPVAKMSTVRTIISCAVNFDWPLHQLDVK 863

Query: 908  NAFLHASLDEQIYMTIPQGLVCNKAN-QVCLLQKSLYGLKQASRQWFNTLSASLKKLGYK 966
            NAFLH  L E++YM IP G    +   +V  L+KSLYGLKQ+ R WF+    ++  +GYK
Sbjct: 864  NAFLHGDLQEEVYMEIPPGFATLQTKGKVLRLKKSLYGLKQSPRAWFDRFRRAMCAMGYK 923

Query: 967  QSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKYFL 1026
            Q N DHT++   S    T L +Y DD+++ GND  EI  +K +L  +F +KDLG+ KYFL
Sbjct: 924  QCNGDHTVFYHHSGDHITILAVYVDDMIITGNDCSEITRLKQNLSKEFEVKDLGQLKYFL 983

Query: 1027 GLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIGTPLTDIGSYRR 1086
            G+EIARS  GIVL+QRKYAL L+SD+G L  +P STP+D + KL    G P+     Y+R
Sbjct: 984  GIEIARSPRGIVLSQRKYALDLLSDTGMLGCRPASTPVDQNHKLCAESGNPVNK-ERYQR 1042

Query: 1087 LVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPGSGLFYPASSST 1146
            LVGRL+YL  TRPDIT+AV+ +S+++  P   H  A +R+L+YLKGSPG GL++  +   
Sbjct: 1043 LVGRLIYLCHTRPDITYAVSMVSRYMHDPRSGHMDAVYRILRYLKGSPGKGLWFKKNGHL 1102

Query: 1147 TLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCESEYRAMATTVCE 1206
             +  + D+DWA C D R+S +GYC+F+G +L+SWRSKKQ   SRS+ E+EYRAM+ ++ E
Sbjct: 1103 EVEGYCDADWASCPDDRRSTSGYCVFVGGNLVSWRSKKQPVVSRSTAEAEYRAMSVSLSE 1162

Query: 1207 IQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCHIVREKLQQGLV 1266
            + WL  LL +L  P   P  ++ DN+SA+ IA+NP  H+RTKH+ELD   ++EKL +G++
Sbjct: 1163 LLWLRNLLSELMLPVDTPMKLWCDNKSAISIANNPVQHDRTKHVELDRFFIKEKLDEGVL 1222

Query: 1267 HLLPISTTLQTADVFTKSLTPAPFKTCISKLGMKDIHLP 1305
             L  + +  Q AD FTK L      +   K+GM DI+ P
Sbjct: 1223 ELEFVMSGGQVADCFTKGLGVKECNSSCDKMGMIDIYHP 1261


>gb|AAT40550.1| putative receptor kinase [Solanum demissum]
          Length = 1358

 Score =  582 bits (1499), Expect = e-164
 Identities = 311/761 (40%), Positives = 453/761 (58%), Gaps = 50/761 (6%)

Query: 554  PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLDNKCPFQIL 613
            PQQN + ERK++H+I  AR LL ++++P+ FW  AV  + YLINR+P+  + N+ P  IL
Sbjct: 638  PQQNGVAERKNRHLIETARTLLLESNVPLRFWGDAVLTSCYLINRMPSSSIQNQVPHSIL 697

Query: 614  YNT-----VPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYDLKS 668
            +       +P     ++FG+ CFV  L   + K  PRA KC+FLG+    KG+  Y    
Sbjct: 698  FPQSHLYPIPP----RVFGSTCFVHNLAPGKDKLAPRALKCVFLGYSRVQKGYRCYSHDL 753

Query: 669  RVISISRNVTFHEHNPFKPLDSDLTPTQAFPTTLPPIFDDDIPVPATVQAQEPPVNNQNQ 728
                +S +VTF E  P+    S   P  +    +P +    +PVP  V   E  V + + 
Sbjct: 754  HRYLMSADVTFFESQPY--YTSSNHPDVSMVLPIPQV----LPVPTFV---ESTVTSTSP 804

Query: 729  VIAPRTSQRIRKP-PSYLQDYHCTLLSSDSVPITSSSTGINYPLSKVLSYDHLNPKYQSF 787
            V+ P      R+P P+ + D  C   + D  P                    L P  Q  
Sbjct: 805  VVVPPLLTYHRRPRPTLVPDDSCH--APDPAPTAD-----------------LPPPSQPL 845

Query: 788  VMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPIGCKWVYKVK 847
             +         +  EA+ H  WR+AM  E+ AL ++ TW LV  P     +GC+WVY VK
Sbjct: 846  AL---------QKGEALSHSGWRQAMVDEMSALHKSGTWELVSLPAGKSTVGCRWVYAVK 896

Query: 848  YKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ*WHLHQLDVD 907
               DG ++R KARLV KGYTQ+ G+D+ DTF+PVAK+ ++R+ L++ A + W LHQLD+ 
Sbjct: 897  IGPDGQVDRLKARLVAKGYTQIFGLDYSDTFAPVAKIASVRLFLSMAAVRHWPLHQLDIK 956

Query: 908  NAFLHASLDEQIYMTIPQGLVCN--KANQVCLLQKSLYGLKQASRQWFNTLSASLKKLGY 965
            NAFLH  L+E++YM  P G V     ++ VC L++SLYGLKQ+ R WF   S  +++ G 
Sbjct: 957  NAFLHGDLEEEVYMEQPPGFVAQGESSSLVCRLRRSLYGLKQSPRAWFGKFSTVIQEFGM 1016

Query: 966  KQSNADHTLYIKASSGSFTA-LLLYADDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKY 1024
             +S ADH+++ + S+ S    L++Y DD+++ GND   I  +K  L   F+ KDLG  KY
Sbjct: 1017 TRSGADHSVFYRHSAPSRCIYLVVYVDDIVITGNDQDGITDLKQHLFKHFQTKDLGRLKY 1076

Query: 1025 FLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIGTPLTDIGSY 1084
            FLG+E+A+S SGIV++QRKYAL ++ ++G +  +PV TPMD + KL    G PL++   Y
Sbjct: 1077 FLGIEVAQSRSGIVISQRKYALDILEETGMMGCRPVDTPMDPNVKLLPGQGEPLSNPERY 1136

Query: 1085 RRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPGSGLFYPASS 1144
            RRLVG+L YL  TRPDI+F V+ +SQF+++P D H +A  R+L+Y+K +PG GL +    
Sbjct: 1137 RRLVGKLNYLTVTRPDISFPVSVVSQFMTSPCDSHWEAVVRILRYIKSAPGKGLLFEDQG 1196

Query: 1145 STTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCESEYRAMATTV 1204
               +  ++D+DWAG    R+S +GYC+ +G +L+SW+SKKQ+  +RSS ESEYRAMAT  
Sbjct: 1197 HEHIIGYTDADWAGSPSDRRSTSGYCVLVGGNLVSWKSKKQNVVARSSAESEYRAMATAT 1256

Query: 1205 CEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCHIVREKLQQG 1264
            CE+ W+  LL +L   ++    +  DNQ+A+HIA NP +HERTKHIE+DCH VREK+  G
Sbjct: 1257 CELVWIKQLLGELKFGKVDKMELVCDNQAALHIASNPVFHERTKHIEIDCHFVREKILSG 1316

Query: 1265 LVHLLPISTTLQTADVFTKSLTPAPFKTCISKLGMKDIHLP 1305
             +    + +  Q AD+FTKSLT        +KLG  D++ P
Sbjct: 1317 DIVTKFVKSNDQLADIFTKSLTCPRINYICNKLGTYDLYAP 1357


>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
            gi|9989049|gb|AAG10812.1| Putative retroelement
            polyprotein [Arabidopsis thaliana]
          Length = 1404

 Score =  557 bits (1435), Expect = e-156
 Identities = 308/795 (38%), Positives = 456/795 (56%), Gaps = 46/795 (5%)

Query: 554  PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLDNKCPFQIL 613
            PQQN + ERK++H++ VAR+++F   +P  FW  AV  A YLINR PT VL +  PF++L
Sbjct: 607  PQQNGVAERKNRHLMEVARSMMFHTSVPKRFWGDAVLTACYLINRTPTKVLSDLSPFEVL 666

Query: 614  YNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYDLKSRVISI 673
             NT P I +L++FG +CFV      R K D ++TKC+FLG+    KG+  +D       I
Sbjct: 667  NNTKPFIDHLRVFGCVCFVLIPGEQRSKLDAKSTKCMFLGYSTTQKGYKCFDPTKNRTFI 726

Query: 674  SRNVTFHEHNPFKPLDS-----DLTPTQAFPTTLPPIFDDDIPVPATVQAQEPP------ 722
            SR+V F E+  +          DLT + +          D +   +T   Q  P      
Sbjct: 727  SRDVKFLENQDYNNKKDWENLKDLTHSTSDRVETLKFLLDHLGNDSTSTTQHQPEMTQDQ 786

Query: 723  --VNNQNQVIAPRTSQR---IRKPPSYLQDY--HCTLLSSDSV-------------PITS 762
              +N +N+ ++ +  +    +++ P   Q++  H   +  DS              P+  
Sbjct: 787  EDLNQENEEVSLQHQENLTHVQEDPPNTQEHSEHVQEIQDDSSEDEEPTQVLPPPPPLRR 846

Query: 763  S-----------STGINYPLSKVLSYDHLNPKYQSFVMNISSSLEPTRFSEAVKHECWRK 811
            S           S  + +P     S   +   +Q+F+  IS    P  + EA++ + WR 
Sbjct: 847  STRIRRKKEFFNSNAVAHPFQATCSLALVPLDHQAFLSKISEHWIPQTYEEAMEVKEWRD 906

Query: 812  AMDQEIEALERNQTWILVDKPPDSKPIGCKWVYKVKYKQDGSIERYKARLVVKGYTQVEG 871
            A+  EI A++RN TW   D P   K +  +WV+ +KYK +G IERYK RLV +G+TQ  G
Sbjct: 907  AIADEINAMKRNHTWDEDDLPKGKKTVSSRWVFTIKYKSNGDIERYKTRLVARGFTQTYG 966

Query: 872  VDFQDTFSPVAKMTTLRVILALCASQ*WHLHQLDVDNAFLHASLDEQIYMTIPQGLVCN- 930
             D+ +TF+PVAK+ T+RV+LAL  +  W L Q+DV NAFL   L++ +YMT P GL    
Sbjct: 967  SDYMETFAPVAKLHTVRVVLALATNLSWGLWQMDVKNAFLQGELEDDVYMTPPPGLEDTI 1026

Query: 931  KANQVCLLQKSLYGLKQASRQWFNTLSASLKKLGYKQSNADHTLYIKASSGSFTALLLYA 990
              ++V  L+K++YGLKQ+ R W++ LS +LK  G+K+S +DHTL+   S      +L+Y 
Sbjct: 1027 PCDKVLRLRKAIYGLKQSPRAWYHKLSRTLKDHGFKKSESDHTLFTLQSPQGIVVVLIYV 1086

Query: 991  DDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKYFLGLEIARSTSGIVLNQRKYALQLIS 1050
            DD+++ G++   I   K+ L   F IKDLGE KYFLG+E+ RS +G+ L+QRKY L L++
Sbjct: 1087 DDLIITGDNKDGIDSTKTFLKSCFDIKDLGELKYFLGIEVCRSNAGLFLSQRKYTLDLLN 1146

Query: 1051 DSGHLASKPVSTPMDNSQKL---GTNIGTPLTDIGSYRRLVGRLLYLNTTRPDITFAVNQ 1107
            ++G + +KP  TP+++  K+   G        D   YR+LVG+L+YL  TRPDI FAVNQ
Sbjct: 1147 ETGFMDAKPARTPLEDGYKVNRKGEKEDEKFGDAPLYRKLVGKLIYLTNTRPDICFAVNQ 1206

Query: 1108 LSQFLSAPTDIHEQAAHRVLKYLKGSPGSGLFYPASSSTTLTAFSDSDWAGCIDTRKSIT 1167
            +SQ +  P   H     R+L+YLKGS G G++   +SST +  + D+D+AG    R+S T
Sbjct: 1207 VSQHMKVPMVYHWNMVERILRYLKGSSGQGIWMGKNSSTEIVGYCDADYAGDRGDRRSKT 1266

Query: 1168 GYCLFLGDSLISWRSKKQSTASRSSCESEYRAMATTVCEIQWLHYLLQDLNQPQLAPTSM 1227
            GYC F+G +L +W++KKQ   S SS ESEYRAM     E+ WL  LL+DL   Q  P +M
Sbjct: 1267 GYCTFIGGNLATWKTKKQKVVSCSSAESEYRAMRKLTNELTWLKALLKDLGIEQHMPITM 1326

Query: 1228 FRDNQSAMHIAHNPSYHERTKHIELDCHIVREKLQQGLVHLLPISTTLQTADVFTKSLTP 1287
              DN++A++IA N  +HERTKHIE+DCH VREK+ +G+       +  Q AD+FTK+ + 
Sbjct: 1327 HCDNKAAIYIASNSVFHERTKHIEVDCHKVREKIIEGVTLPCYTRSEDQLADIFTKAASL 1386

Query: 1288 APFKTCISKLGMKDI 1302
                    KLG+ D+
Sbjct: 1387 KVCNFIHGKLGLVDL 1401


>gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|37530764|ref|NP_919684.1| putative
            copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|20042923|gb|AAM08751.1| Putative
            copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)]
          Length = 1803

 Score =  536 bits (1380), Expect = e-150
 Identities = 303/755 (40%), Positives = 427/755 (56%), Gaps = 37/755 (4%)

Query: 555  QQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLDNKCPFQILY 614
            QQN   ER  + +    R +L  +  P++FW+ A+  A++LINR P     +  P+Q+L 
Sbjct: 653  QQNGKAERILRTINDCVRTMLVHSAAPLSFWAEALQTAMHLINRRPCRATGSLKPYQLLL 712

Query: 615  NTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYDLKSRVISIS 674
               P   +L++FG LC+ +T+ +   K  PR+  C+F+G+  D +G+  YD+ SR +  S
Sbjct: 713  GAPPTYDHLRVFGCLCYPNTIATAPHKLSPRSLACVFIGYPADHRGYRCYDMVSRRVFTS 772

Query: 675  RNVTFHEHN-PFKPLDSDLTPTQAFPTTLPPIFDDD----IPVPA-----------TVQA 718
            R+VTF E   PF+   S   P  + P   PP   DD    +P PA              A
Sbjct: 773  RHVTFVEDVFPFRDAPS---PRPSAPP--PPDHGDDTIVLLPAPAQHVVTPVGTAPAHDA 827

Query: 719  QEPPVNNQNQVIAPRTSQRIRKPPSYLQDYHCTLLSSDSVPITSSSTGINYPLSKVLSYD 778
              PP    +   +   +  +  PPS       +         T +  GI+ P        
Sbjct: 828  ASPPSPASSTPSSAAPAHDVAPPPSPETSSPASASPPRHAMTTRARAGISKP-------- 879

Query: 779  HLNPKYQSFVMNISSSLEPTRFSE--AVKHECWRKAMDQEIEALERNQTWILVDKPPDSK 836
              NP+Y    M  +S+L PT  S   A++   WR AM  E +AL  N+TW LV +PP ++
Sbjct: 880  --NPRY---AMTATSTLSPTPSSVRVALRDPNWRAAMQAEFDALLANRTWTLVPRPPGAR 934

Query: 837  PIGCKWVYKVKYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCAS 896
             I  KWV+K K   DGS+++YKAR VV+G+ Q  GVDF +TFSPV K  T+R +L L +S
Sbjct: 935  IITGKWVFKTKLHADGSLDKYKARWVVRGFNQRPGVDFGETFSPVVKPATIRTVLTLISS 994

Query: 897  Q*WHLHQLDVDNAFLHASLDEQIYMTIPQGLV-CNKANQVCLLQKSLYGLKQASRQWFNT 955
            + W  HQLDV NAFLH  L E++    P G     +   VCLL +SLYGL+QA R WF  
Sbjct: 995  KQWPAHQLDVSNAFLHGHLQERVLCQQPTGFEDAARPADVCLLSRSLYGLRQAPRAWFKR 1054

Query: 956  LSASLKKLGYKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFR 1015
             +     LG+ QS AD +L++         LLLY DD++L+ +    +Q +   L  +F+
Sbjct: 1055 FADHATSLGFVQSRADPSLFVLRRGSDTAYLLLYVDDMILSASSSSLLQRIIDRLQAEFK 1114

Query: 1016 IKDLGEAKYFLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIG 1075
            +KD+G  KYFLG+E+ R+  G VL+Q KYA  ++  +G    K V+TP D   KL ++ G
Sbjct: 1115 VKDMGPLKYFLGIEVQRTADGFVLSQSKYATDVLERAGMANCKAVATPADAKPKLSSDEG 1174

Query: 1076 TPLTDIGSYRRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPG 1135
                D   YR + G L YL  TRPDI +AV Q+   + AP + H     R+L+Y+KG+  
Sbjct: 1175 PLFQDSSWYRSIAGALQYLTLTRPDIAYAVQQVCLHMHAPREAHVTLLKRILRYIKGTAA 1234

Query: 1136 SGLFYPASSSTTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCES 1195
             GL   AS+S TLTAFSD+DWAGC DTR+S +G+C+FLGDSLISW SK+Q+T SRSS E+
Sbjct: 1235 FGLHLRASTSPTLTAFSDADWAGCPDTRRSTSGFCIFLGDSLISWSSKRQTTVSRSSAEA 1294

Query: 1196 EYRAMATTVCEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCH 1255
            EYR +A  V E  WL  LL +L+      T  + DN S+++++ NP +H+RTKHIELD H
Sbjct: 1295 EYRGVANAVAECTWLRQLLGELHCRVPQATIAYCDNISSVYMSKNPVHHKRTKHIELDIH 1354

Query: 1256 IVREKLQQGLVHLLPISTTLQTADVFTKSLTPAPF 1290
             VREK+  G + +LPI +  Q ADVFTK L  + F
Sbjct: 1355 FVREKVALGELRVLPIPSAHQFADVFTKGLPSSMF 1389


>emb|CAB79271.1| putative protein [Arabidopsis thaliana] gi|3021268|emb|CAA18463.1|
            putative protein [Arabidopsis thaliana]
            gi|7485945|pir||T04833 hypothetical protein F21P8.50 -
            Arabidopsis thaliana
          Length = 1240

 Score =  535 bits (1379), Expect = e-150
 Identities = 276/577 (47%), Positives = 377/577 (64%), Gaps = 24/577 (4%)

Query: 721  PPVNNQNQVIAP--RTSQRIRKPPSYLQDYHCTLLSSDSVPITSSSTGINYPLSKVLSYD 778
            P  N QN V  P   TS R  + P+YLQDY+C  ++S ++          + +S+ LSY+
Sbjct: 18   PSANIQNDVPEPSVHTSHRRTRKPAYLQDYYCHSVASLTI----------HDISQFLSYE 67

Query: 779  HLNPKYQSFVMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPI 838
             ++P Y SF++ I+ + EP+ ++EA +   W  AMD EI A+E   TW +   PP+ KPI
Sbjct: 68   KVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPI 127

Query: 839  GCKWVYKVKYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ* 898
            GCKWVYK+KY  DG+IERYKARLV KGYTQ EG+DF +TFSPV K+T++++ILA+ A   
Sbjct: 128  GCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYN 187

Query: 899  WHLHQLDVDNAFLHASLDEQIYMTIPQGLVCNKA-----NQVCLLQKSLYGLKQASRQWF 953
            + LHQLD+ NAFL+  LDE+IYM +P G    +      N VC L+KS+YGLKQASRQWF
Sbjct: 188  FTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWF 247

Query: 954  NTLSASLKKLGYKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQ 1013
               S +L   G+ QS++DHT ++K ++  F  +L+Y DD+++  N+   +  +KS L   
Sbjct: 248  LKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSC 307

Query: 1014 FRIKDLGEAKYFLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTN 1073
            F+++DLG  KYFLGLEIARS +GI + QRKYAL L+ ++G L  KP S PMD S     +
Sbjct: 308  FKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAH 367

Query: 1074 IGTPLTDIGSYRRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGS 1133
             G    D  +YRRL+GRL+YL  TR DI+FAVN+LSQF  AP   H+QA  ++L Y+KG+
Sbjct: 368  SGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGT 427

Query: 1134 PGSGLFYPASSSTTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSC 1193
             G GLFY + +   L  FSD+ +  C DTR+S  GYC+FLG SLISW+SKKQ   S+SS 
Sbjct: 428  VGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSA 487

Query: 1194 ESEYRAMATTVCEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELD 1253
            E+EYRA++    E+ WL    ++L  P   PT +F DN +A+HIA N  +HERTKHIE D
Sbjct: 488  EAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESD 547

Query: 1254 CHIVREKLQQGLVHLLPISTTLQT---ADVFTKSLTP 1287
            CH VRE+     V+   +S + Q     D FT+ L+P
Sbjct: 548  CHSVRER----SVYQATLSYSFQAYDEQDGFTEYLSP 580


>ref|NP_194047.2| protein kinase family protein [Arabidopsis thaliana]
          Length = 1262

 Score =  535 bits (1379), Expect = e-150
 Identities = 276/577 (47%), Positives = 377/577 (64%), Gaps = 24/577 (4%)

Query: 721  PPVNNQNQVIAP--RTSQRIRKPPSYLQDYHCTLLSSDSVPITSSSTGINYPLSKVLSYD 778
            P  N QN V  P   TS R  + P+YLQDY+C  ++S ++          + +S+ LSY+
Sbjct: 18   PSANIQNDVPEPSVHTSHRRTRKPAYLQDYYCHSVASLTI----------HDISQFLSYE 67

Query: 779  HLNPKYQSFVMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPI 838
             ++P Y SF++ I+ + EP+ ++EA +   W  AMD EI A+E   TW +   PP+ KPI
Sbjct: 68   KVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPI 127

Query: 839  GCKWVYKVKYKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ* 898
            GCKWVYK+KY  DG+IERYKARLV KGYTQ EG+DF +TFSPV K+T++++ILA+ A   
Sbjct: 128  GCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYN 187

Query: 899  WHLHQLDVDNAFLHASLDEQIYMTIPQGLVCNKA-----NQVCLLQKSLYGLKQASRQWF 953
            + LHQLD+ NAFL+  LDE+IYM +P G    +      N VC L+KS+YGLKQASRQWF
Sbjct: 188  FTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWF 247

Query: 954  NTLSASLKKLGYKQSNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQ 1013
               S +L   G+ QS++DHT ++K ++  F  +L+Y DD+++  N+   +  +KS L   
Sbjct: 248  LKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSC 307

Query: 1014 FRIKDLGEAKYFLGLEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTN 1073
            F+++DLG  KYFLGLEIARS +GI + QRKYAL L+ ++G L  KP S PMD S     +
Sbjct: 308  FKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAH 367

Query: 1074 IGTPLTDIGSYRRLVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGS 1133
             G    D  +YRRL+GRL+YL  TR DI+FAVN+LSQF  AP   H+QA  ++L Y+KG+
Sbjct: 368  SGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGT 427

Query: 1134 PGSGLFYPASSSTTLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSC 1193
             G GLFY + +   L  FSD+ +  C DTR+S  GYC+FLG SLISW+SKKQ   S+SS 
Sbjct: 428  VGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSA 487

Query: 1194 ESEYRAMATTVCEIQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELD 1253
            E+EYRA++    E+ WL    ++L  P   PT +F DN +A+HIA N  +HERTKHIE D
Sbjct: 488  EAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESD 547

Query: 1254 CHIVREKLQQGLVHLLPISTTLQT---ADVFTKSLTP 1287
            CH VRE+     V+   +S + Q     D FT+ L+P
Sbjct: 548  CHSVRER----SVYQATLSYSFQAYDEQDGFTEYLSP 580


>dbj|BAB11447.1| polyprotein-like [Arabidopsis thaliana]
          Length = 509

 Score =  527 bits (1357), Expect = e-147
 Identities = 255/498 (51%), Positives = 352/498 (70%), Gaps = 4/498 (0%)

Query: 812  AMDQEIEALERNQTWILVDKPPDSKPIGCKWVYKVKYKQDGSIERYKARLVVKGYTQVEG 871
            AM+ E+  +E N+TW +V  PP+   +GCKWVY ++Y  DGSIERYKARLV KG+TQ EG
Sbjct: 2    AMNVELGVMELNKTWSVVSLPPNKNVVGCKWVYTIEYNADGSIERYKARLVAKGFTQQEG 61

Query: 872  VDFQDTFSPVAKMTTLRVILALCASQ*WHLHQLDVDNAFLHASLDEQIYMTIPQGLVCNK 931
            VD+ DTFSPVAK+ +++++L L A + W   Q+DV NAFLH+ L+E+IYM++ QG   + 
Sbjct: 62   VDYFDTFSPVAKLASVKLVLGLVARKGWSTTQMDVTNAFLHSDLEEEIYMSLAQGYTPSS 121

Query: 932  A----NQVCLLQKSLYGLKQASRQWFNTLSASLKKLGYKQSNADHTLYIKASSGSFTALL 987
                 N VC L KS+YGLKQASRQW+  LS +L   G++QS  D+TL++K +S +  A+L
Sbjct: 122  GSLPPNPVCRLHKSIYGLKQASRQWYKCLSQTLLDDGFQQSYVDNTLFVKITSTAIVAML 181

Query: 988  LYADDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKYFLGLEIARSTSGIVLNQRKYALQ 1047
            +Y DD+L+  N+   +  VKS L  +++IKDLG AK+FLGLEIAR++ GI + QRKY L 
Sbjct: 182  IYVDDILIVSNNDEVVCAVKSVLAARYKIKDLGPAKFFLGLEIARNSDGISICQRKYCLD 241

Query: 1048 LISDSGHLASKPVSTPMDNSQKLGTNIGTPLTDIGSYRRLVGRLLYLNTTRPDITFAVNQ 1107
            L+++SG L  KP S PMD    L  ++GT L D   YR L+GRLLYL  TRPDITFAV+ 
Sbjct: 242  LLANSGLLGCKPKSVPMDPKVVLTKDLGTLLEDGRPYRELIGRLLYLCVTRPDITFAVHN 301

Query: 1108 LSQFLSAPTDIHEQAAHRVLKYLKGSPGSGLFYPASSSTTLTAFSDSDWAGCIDTRKSIT 1167
            LSQFLS PT++H  AAH+VLKYLK +PG GLF  A +   L  F+D+DW  C+D+R+S++
Sbjct: 302  LSQFLSCPTNVHLHAAHQVLKYLKNNPGQGLFSSAGTELYLNGFADADWGTCLDSRRSVS 361

Query: 1168 GYCLFLGDSLISWRSKKQSTASRSSCESEYRAMATTVCEIQWLHYLLQDLNQPQLAPTSM 1227
            G C+FLG SLI+W+SKKQ  AS SS E+EYR+MA    E+ WL  +L+DL+        +
Sbjct: 362  GVCVFLGTSLITWKSKKQEVASGSSTEAEYRSMAVATKELLWLAQMLKDLHVEMEFQVKL 421

Query: 1228 FRDNQSAMHIAHNPSYHERTKHIELDCHIVREKLQQGLVHLLPISTTLQTADVFTKSLTP 1287
            F DN+SAMHIA+N  +HERTKH+E+DCH  R++++ G + +L + T  Q AD+ TK+L P
Sbjct: 422  FCDNKSAMHIANNSVFHERTKHVEIDCHTTRDRVKNGFLKVLHVDTENQLADILTKALQP 481

Query: 1288 APFKTCISKLGMKDIHLP 1305
             PF++ + +L +  + LP
Sbjct: 482  GPFRSILGRLSVSSLFLP 499


>gb|AAC98469.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25411253|pir||A84480 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1102

 Score =  524 bits (1350), Expect = e-147
 Identities = 308/759 (40%), Positives = 434/759 (56%), Gaps = 67/759 (8%)

Query: 554  PQQNAIVERKHQHVISVARALLFQAHLPVTFWSYAVAHAVYLINRLPTPVLDNKCPFQIL 613
            PQQNA VERKH+H+++VAR  LFQ + P                  P+PVL  K P+++L
Sbjct: 403  PQQNARVERKHRHILNVARTCLFQGNFPT-----------------PSPVLKGKTPYEVL 445

Query: 614  YNTVPDITNLKIFGTLCFVSTLTSHRKKFDPRATKCIFLGFKPDTKGFVTYDLKSRVISI 673
            +   P    L+ FG LC+       + KF  R+ KCIF+G+  +T    T+D      SI
Sbjct: 446  FGKQPSYDMLRTFGCLCYAHIRPRDKDKFASRSRKCIFIGYPHETATPNTHD------SI 499

Query: 674  SRNVTFHEHNPFKPLDSDLTPTQAFPTTLPPIFDDDIPVPATVQAQEPPVNNQNQVIAPR 733
                T  + N   P    +TP    P +   I    I V          +N  +   +P 
Sbjct: 500  DPTSTSSDENNTPP--EPVTPQAEQPHSPSSISSPHI-VHNKGSVHSRHLNEDHDSSSPG 556

Query: 734  TSQRIRK------PPSYLQDYHCTLLSSDSVPITSSSTGINYPLSKVLSYDHLNPKYQSF 787
              + + K      PP YL+DY    + S   P TSS    +  +S  +S +H+     +F
Sbjct: 557  LPELLGKGHRPKHPPVYLKDYVAHKVHSS--PHTSSPGLSDSNVSPTVSANHI-----AF 609

Query: 788  VMNISSSLEPTRFSEAVKHECWRKAMDQEIEALERNQTWILVDKPPDSKPIGCKWVYKVK 847
            +  I  S E   F + V  + W  AM +EIEALE N TW + D P   K I  KWVYK+K
Sbjct: 610  MAAILDSNEQNHFKDDVLIKEWCDAMQKEIEALEANHTWDVTDLPHGKKAISSKWVYKLK 669

Query: 848  YKQDGSIERYKARLVVKGYTQVEGVDFQDTFSPVAKMTTLRVILALCASQ*WHLHQLDVD 907
            +  DG++ER+KARLVV G  Q EG+DF++TF+PVAKMTT+R++LA+ A++ W + Q+DV 
Sbjct: 670  FNSDGTLERHKARLVVMGNHQKEGIDFKETFAPVAKMTTVRLLLAVAAAKDWDVFQMDVH 729

Query: 908  NAFLHASLDEQIYMTIPQGLVCNKANQVCLLQKSLYGLKQASRQWFNTLSASLKKLGYKQ 967
            NAFLH  L+                      Q+SLYGLKQA R WF  LS +L+KLG+ Q
Sbjct: 730  NAFLHGDLE----------------------QESLYGLKQAPRCWFAKLSTALRKLGFTQ 767

Query: 968  SNADHTLYIKASSGSFTALLLYADDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKYFLG 1027
            S  D++L+     G+    L+Y DD ++ GN++  I   K  LH  F +KDLG+ KYFLG
Sbjct: 768  SYEDYSLFSLNRDGTVIHFLVYVDDFIIVGNNLKAIDHFKEHLHKCFHMKDLGKLKYFLG 827

Query: 1028 LEIARSTSGIVLNQRKYALQLISDSGHLASKPVSTPMDNSQKLGTNIGTPLTDI-GSYRR 1086
            LE++R   G  L+Q+KYAL +I+++G L  KP + PM+   KLG+ I +P+ D    YRR
Sbjct: 828  LEVSRGADGFCLSQQKYALDIINEAGLLGYKPSAVPMELHHKLGS-ISSPVFDNPAQYRR 886

Query: 1087 LVGRLLYLNTTRPDITFAVNQLSQFLSAPTDIHEQAAHRVLKYLKGSPGSGLFYPASSST 1146
            LV R +YL  TRPD+++AV+ LSQF+  P + H  A  R+++YLKGSP  G+   +  + 
Sbjct: 887  LVDRFIYLTITRPDLSYAVHILSQFMQTPLEAHWHATLRLVRYLKGSPDQGILLRSDRAL 946

Query: 1147 TLTAFSDSDWAGCIDTRKSITGYCLFLGDSLISWRSKKQSTASRSSCESEYRAMATTVCE 1206
            +LTA+ DSD+  C  TR+S++ Y L+LGD+ ISW++KKQ T S SS E+EYRAMA T+ E
Sbjct: 947  SLTAYCDSDYNPCPRTRRSLSAYVLYLGDTPISWKTKKQDTVSSSSAEAEYRAMAYTLKE 1006

Query: 1207 IQWLHYLLQDLNQPQLAPTSMFRDNQSAMHIAHNPSYHERTKHIELDCHIVREKLQQGLV 1266
            I+WL  L+  L      P  +F D+Q+A+HIA NP +HERTKHIE DCH VR+ +   ++
Sbjct: 1007 IKWLKALMTTLGVDHTQPILLFCDSQAAIHIAANPVFHERTKHIEKDCHQVRDAVTDKVI 1066

Query: 1267 HLLPISTTLQTADVFTKSLTPAPFKTCISKLGMKDIHLP 1305
                ISTT    D+ TK+L    F+  +S LG  +  LP
Sbjct: 1067 STPHISTT----DLLTKALPRPTFERLLSTLGTCNYDLP 1101


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.339    0.148    0.479 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,992,783,214
Number of Sequences: 2540612
Number of extensions: 78229593
Number of successful extensions: 265427
Number of sequences better than 10.0: 1714
Number of HSP's better than 10.0 without gapping: 1628
Number of HSP's successfully gapped in prelim test: 86
Number of HSP's that attempted gapping in prelim test: 259389
Number of HSP's gapped (non-prelim): 2870
length of query: 1310
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1170
effective length of database: 507,674,714
effective search space: 593979415380
effective search space used: 593979415380
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 81 (35.8 bits)


Lotus: description of TM0033b.8