Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0039.4
         (1435 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

TC86737 weakly similar to GP|6683624|dbj|BAA89272.1 Pol {Alterna...   314  1e-85
BG644693 weakly similar to GP|18767374|g Putative 22 kDa kafirin...   239  7e-63
BG586326 similar to PIR|G84493|G8 probable retroelement pol poly...   223  4e-58
TC77595 weakly similar to PIR|T18350|T18350 probable pol polypro...   171  2e-42
BF003873 similar to GP|14715222|em putative polyprotein {Cicer a...   151  2e-36
TC93153 similar to GP|14715220|emb|CAC44106. gag polyprotein {Ci...   140  4e-33
BG587145 similar to PIR|H86337|H8 protein F5M15.26 [imported] - ...    89  2e-29
BG644740 similar to PIR|A84460|A84 probable retroelement pol pol...    87  5e-24
AL366725                                                               85  2e-16
BG454871 weakly similar to GP|10140673|g putative gag-pol polypr...    75  3e-16
BG644699 similar to PIR|T07863|T078 probable polyprotein - pinea...    66  1e-10
BG587101 similar to GP|6691191|gb F7F22.15 {Arabidopsis thaliana...    61  3e-09
BG452991 PIR|A25875|A25 histone H4 - Tetrahymena thermophila, pa...    52  1e-07
BG587176 weakly similar to PIR|G84493|G84 probable retroelement ...    47  4e-05
BG586308 weakly similar to PIR|F84528|F8 probable retroelement p...    45  3e-04
CA860311 weakly similar to GP|7289872|gb|A CG17427 gene product ...    42  0.001
CB893783 weakly similar to GP|22830935|dbj hypothetical protein~...    42  0.002
AJ497987 weakly similar to GP|9927273|dbj Similar to Arabidopsis...    41  0.004
TC87382 similar to EGAD|146423|156195 vitellogenin {Anolis pulch...    38  0.032
TC87383 similar to GP|19168656|emb|CAD26175. DNA-DIRECTED RNA PO...    37  0.072

>TC86737 weakly similar to GP|6683624|dbj|BAA89272.1 Pol {Alternaria
           alternata}, partial (21%)
          Length = 1540

 Score =  314 bits (805), Expect = 1e-85
 Identities = 175/395 (44%), Positives = 247/395 (62%), Gaps = 14/395 (3%)
 Frame = +1

Query: 476 VVREFPEVF-PEDMTELPPEREV-EFAIDVIP----GTTPISAAP-YRISPLELAELQKQ 528
           V+ EFP++F PE   ++P  R + + AI +IP       P+   P Y +S  EL  L+K 
Sbjct: 343 VLEEFPDLFNPEKAYQVPASRGLLDHAIPLIPDKDGNDPPLPWGPLYGMSRQELLVLKKT 522

Query: 529 VEELLSKGFIRPSVSPWGAPVLLVKKKDGSMRLCVDYRQLNKVTIKNRYPLPRIDDLMDQ 588
           +E+LL KGFI+ S S  GAPVL V+K  G +R CVDYR LN +T K+RYPLP I + + +
Sbjct: 523 LEDLLDKGFIKASGSAAGAPVLFVRKPGGGIRFCVDYRALNAITKKDRYPLPLISETLRR 702

Query: 589 LKGARVFSKIDLRSGYHQIRVKSDDVQKTAFRTRYGHYEYLVMPFGVTNAPAIFMDYMNR 648
           + GAR F+K+D+ + +H++R+K +D +KTAFRTRYG +E++V PFG+T APA F  Y+N+
Sbjct: 703 VAGARWFTKLDVVAAFHKMRIKDEDQEKTAFRTRYGLFEWIVCPFGLTGAPATFQRYINK 882

Query: 649 IFHPYLDKFVIVFIDDILIYSK-SKEEHVEHMQVVLKVLKDRKLYAKLSKCEFWLEQVQF 707
             H +LD FV  +IDD+LIY+  SK++H   ++ VL+ L D  L     KCEF +  V++
Sbjct: 883 TLHEFLDDFVTAYIDDVLIYTTGSKKDHEAQVRRVLRRLADAGLSLDPKKCEFSVTTVKY 1062

Query: 708 LGHVVSE-DGIAVDPAKVEAVNSWKVPETVTGVRSFLGLAGYYRRFIEGFSKIATPLTQL 766
           +G +++   G++ DP K+ A+  W  P +V G RSFLG   YY+ FI G+S+I  PLT+L
Sbjct: 1063VGFILTAGKGVSCDPLKLAAIRDWLPPGSVKGARSFLGFCNYYKDFIPGYSEITEPLTRL 1242

Query: 767 TKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCDASKSGLGCVLMQE--- 823
           T+KD PF W  + E +F  LK    + PVL + DP     V  D S   LG VL QE   
Sbjct: 1243TRKDFPFRWGAEQEAAFTKLKRLFAEEPVLRMFDPEAVTTVETDCSGFALGGVLTQEDGT 1422

Query: 824 --RKVIAYASQQLRPHEQNYPTHDMELAAVVFALK 856
                +A+ SQ+L P E NYP HD EL AV   L+
Sbjct: 1423GAAHPVAFHSQRLSPAEYNYPIHDKELLAVWACLR 1527


>BG644693 weakly similar to GP|18767374|g Putative 22 kDa kafirin cluster;
           Ty3-Gypsy type {Oryza sativa}, partial (15%)
          Length = 716

 Score =  239 bits (609), Expect = 7e-63
 Identities = 124/230 (53%), Positives = 162/230 (69%)
 Frame = +2

Query: 491 LPPEREVEFAIDVIPGTTPISAAPYRISPLELAELQKQVEELLSKGFIRPSVSPWGAPVL 550
           +PPE +++F ID++P   PI    YRI+PL+L  L+ Q+++LL KGFI+PS+ P G  VL
Sbjct: 17  VPPEWKIDFGIDLLPNMNPI*IPSYRINPLKLKVLKLQLKDLLEKGFIQPSIYP*GVVVL 196

Query: 551 LVKKKDGSMRLCVDYRQLNKVTIKNRYPLPRIDDLMDQLKGARVFSKIDLRSGYHQIRVK 610
            +KKKDG +R+ +DY QLN V IK +YPLP ID+L D L+G++ F KIDLR G HQ RV 
Sbjct: 197 FLKKKDGFLRMSIDYPQLNNVNIKIKYPLPLIDELFDNLQGSKWFFKIDLRLG*HQHRVI 376

Query: 611 SDDVQKTAFRTRYGHYEYLVMPFGVTNAPAIFMDYMNRIFHPYLDKFVIVFIDDILIYSK 670
            +DV KTAFR RYGHYE LVM FG TN P  FM+ MNR+F  YLD  VIVF +DILIYSK
Sbjct: 377 GEDVPKTAFRIRYGHYEILVMSFG*TNPPMAFMELMNRVFQDYLDSLVIVFSNDILIYSK 556

Query: 671 SKEEHVEHMQVVLKVLKDRKLYAKLSKCEFWLEQVQFLGHVVSEDGIAVD 720
           ++ EH  H+++ LKVLKD  L  ++S     +E   F  HV+S +G+ VD
Sbjct: 557 NENEHENHLRLALKVLKDIGL-CQISYV*ILVEVGFFSLHVISGEGLKVD 703


>BG586326 similar to PIR|G84493|G8 probable retroelement pol polyprotein
            [imported] - Arabidopsis thaliana, partial (13%)
          Length = 736

 Score =  223 bits (568), Expect = 4e-58
 Identities = 122/247 (49%), Positives = 157/247 (63%), Gaps = 2/247 (0%)
 Frame = +2

Query: 791  TKAPVLTLPDPSKDYDVYCDASKSGLGCVLMQERKVIAYASQQLRPHEQNYPTHDMELAA 850
            T AP+L LP+    Y VY DAS +GLGCVL Q  KVIAYAS+QLR HE NYPTHD+E+AA
Sbjct: 8    TSAPILVLPELIT-YVVYTDASITGLGCVLTQHEKVIAYASRQLRKHEGNYPTHDLEMAA 184

Query: 851  VVFALKIWRHYLYGVKFTIYSDHQSLKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPGKA 910
            VVFALKIWR YLYG K  I++DH+SLKY+F Q  LN+RQRRW+EF+ DYD  + Y+PGKA
Sbjct: 185  VVFALKIWRSYLYGAKVQIHTDHKSLKYIFTQPELNLRQRRWMEFVADYDLDITYYPGKA 364

Query: 911  NVVADALSRKSLHAARLMIEETELIEKFRDMNLIMETLPQGTRLGTLTLTN--EFIEEVK 968
            N+VADALSR+ +  +    E  +L    R + L    L + T    L   N  +    ++
Sbjct: 365  NLVADALSRRRVDVSAER-EADDLDGMVRALRL--NVLTKATESLGLEAVNQADLFTRIR 535

Query: 969  KEQARDENLQKEAHGRDSMSRPDFLKGPDGLWRYQGRLCVPEGGELRQKILEEGHKSDFS 1028
              Q +DENLQK A       R ++    DG     GR+ VP    L+++I+ E HKS FS
Sbjct: 536  LAQGQDENLQKVAQN----DRTEYQTAKDGTILVNGRISVPNDRSLKEEIMSEAHKSRFS 703

Query: 1029 IHPGTTK 1035
            +HPG  +
Sbjct: 704  VHPGAPR 724


>TC77595 weakly similar to PIR|T18350|T18350 probable pol polyprotein - rice
            blast fungus gypsy retroelement (fragment), partial (14%)
          Length = 1708

 Score =  171 bits (432), Expect = 2e-42
 Identities = 126/451 (27%), Positives = 213/451 (46%), Gaps = 23/451 (5%)
 Frame = +2

Query: 1002 YQGRLCVPEG-------GELRQKILEEGHKSDFSIHPGTTKMYQDLKKMFWWPGMKKDIM 1054
            ++GR+ VP          ELR K+++E H S  + HPG     + + + F+WPG  + + 
Sbjct: 110  FRGRIWVPGSDDEESPLNELRTKLVQESHDSTAAGHPGRNGTLEIVSRKFFWPGQSQTVR 289

Query: 1055 KKVTSCLTCQKVKGEHQKPSGSLQPLSIPEWKWEGISMDFVSGLPRTT-TGHDAIWVIVD 1113
            + V +C  C  +    Q   G L+PL +P      +SMDF++ LP T   G   +WVIVD
Sbjct: 290  RFVRNCDVCGGIHIWRQAKRGFLKPLPVPNRLHSDLSMDFITSLPPTRGRGSQYLWVIVD 469

Query: 1114 RLTKSAHFIAVNMTFPSEKLARIYVKEIVRLHGVPANIVSDRDPRFVSKFWGSLHEALGT 1173
            RL+KS     ++ T  +E  A+ ++    R HG+P +IVSDR   +V +FW       G 
Sbjct: 470  RLSKSVTLEEMD-TMEAEACAQRFLSCHYRFHGMPQSIVSDRGSNWVGRFWREFCRLTGV 646

Query: 1174 RLSLSSAYHPQSDGQSERTIQTLEDMLRACVLDYKGSWEDFLPLAEFSYNNSYHSSLGMA 1233
               LS++YHPQ+DG +ER  Q ++ +LRA V   + +W D LP  + +  N ++SS+G  
Sbjct: 647  TQLLSTSYHPQTDGGTERWNQEIQAVLRAYVCWSQDNWGDLLPTVQLALRNRHNSSIGAT 826

Query: 1234 PFEALYGRRCKTPLCWLSGEDKITLGPE-----LLQEMTEKVRSIREKLRIAQDRQKSYY 1288
            PF   +G     P+  +     +    E     L++ M +    I+ ++  AQ R ++  
Sbjct: 827  PFFVEHGYHV-DPIPTVEDTGGVVSEGEAAAQLLVKRMKDVTGFIQAEIVAAQQRSEASA 1003

Query: 1289 DKRHKPLE-FQEGDHVFLRVTPITGVGRS-----IHSKKLTPKYLGPYQILDRIGAVAYR 1342
            +KR  P + +Q GD V+L V+       S     +H K    +++ P+ +   +    Y 
Sbjct: 1004 NKRRCPADRYQVGDKVWLNVSNYKSPRPSKKLDWLHHKYEVTRFVTPHVVELNVPGTVY- 1180

Query: 1343 IALPPSLSNLHDVFHISQLRKYLPD---SSHVIEPDNIEL-EENLTYPTQPVKILERREK 1398
                         FH+  LR+   D      V++P    + +++     +  +IL  R  
Sbjct: 1181 -----------PKFHVDLLRRAASDPLPGQEVVDPQPPPIVDDDGEVEWEVEEILAARWH 1327

Query: 1399 QLRKRTVPLVKLAWSDDNQDATWELEESARK 1429
            Q+ +       + W     DATWE  ++ R+
Sbjct: 1328 QVGRGRRRQALVKWK-GFVDATWEAADAIRE 1417


>BF003873 similar to GP|14715222|em putative polyprotein {Cicer arietinum},
            partial (82%)
          Length = 559

 Score =  151 bits (381), Expect = 2e-36
 Identities = 69/125 (55%), Positives = 94/125 (75%), Gaps = 1/125 (0%)
 Frame = +2

Query: 1312 GVGRSIHSKKLTPKYLGPYQILDRIGAVAYRIALPPSLSNLHDVFHISQLRKYLPDSSHV 1371
            GVGR++ SKKLT +++GPYQI +R+G VAYR+ LPP L NLHDVFH+SQLRKY+PD SHV
Sbjct: 2    GVGRALKSKKLTVRFIGPYQISERVGTVAYRVGLPPHLLNLHDVFHVSQLRKYVPDPSHV 181

Query: 1372 IEPDNIELEENLTYPTQPVKILERREKQLRKRTVPLVKLAWSDDN-QDATWELEESARKR 1430
            I+ D++++ +NLT  T PV+I +R+ K LR + +PLV++ W   N +  TWELE    + 
Sbjct: 182  IQSDDVQVRDNLTVETLPVRIDDRKVKTLRGKEIPLVRVVWDRANGESLTWELESKMVES 361

Query: 1431 YPSLF 1435
            YP LF
Sbjct: 362  YPELF 376


>TC93153 similar to GP|14715220|emb|CAC44106. gag polyprotein {Cicer
           arietinum}, partial (8%)
          Length = 516

 Score =  140 bits (353), Expect = 4e-33
 Identities = 65/155 (41%), Positives = 102/155 (64%), Gaps = 2/155 (1%)
 Frame = +2

Query: 14  ARAEQQNQPAE--DDVYKGIDKFLKRNPPLFDGGYDPEGANRWLRKIEQIYESLPTSEDR 71
           A+A QQ    +   D  + ++ FL+ +PP F G Y P+GA +WL++IE+I+  +   E +
Sbjct: 50  AQAVQQLPKVDTGSDGTRMLETFLRNHPPTFKGRYAPDGA*KWLKEIERIFRVMQCFETQ 229

Query: 72  MIAYASYLFHEEARNWWVHAKSRITPPDGVLTWSIFKEAFLEKYFPADVKGKKETEFLEL 131
            + + +++  EEA +WW+     +   D V+TW++F++ FL +YFP DV+GKKE EFLEL
Sbjct: 230 KVQFGTHMLAEEADDWWISLLPVLEQDDAVVTWAMFRKEFLGRYFPEDVRGKKEIEFLEL 409

Query: 132 KQGEMFVGQYAARFEELSQFHPYYGTTADDASKCI 166
           KQG+M V +YAA+F EL+ F+P+Y     + SKCI
Sbjct: 410 KQGDMSVTEYAAKFVELATFYPHYSAETAEFSKCI 514


>BG587145 similar to PIR|H86337|H8 protein F5M15.26 [imported] - Arabidopsis
           thaliana, partial (13%)
          Length = 763

 Score = 88.6 bits (218), Expect(2) = 2e-29
 Identities = 46/133 (34%), Positives = 76/133 (56%)
 Frame = +2

Query: 612 DDVQKTAFRTRYGHYEYLVMPFGVTNAPAIFMDYMNRIFHPYLDKFVIVFIDDILIYSKS 671
           DD++KTAF T  G Y Y VMPFG+ NA + +   +NR+F   L   + V+IDD+L+ S  
Sbjct: 11  DDLEKTAFITDRGTYCYKVMPFGLKNAGSTYQRLVNRMFADKLGNTMEVYIDDMLVKSLR 190

Query: 672 KEEHVEHMQVVLKVLKDRKLYAKLSKCEFWLEQVQFLGHVVSEDGIAVDPAKVEAVNSWK 731
             +H+ H++   K L +  +    +KC F +   +FLG++V++ GI V+P ++ A+    
Sbjct: 191 ATDHLNHLKE*FKTLDEYIMKLNPAKCTFGVTSGEFLGYIVTQQGIEVNPKQITAILDLP 370

Query: 732 VPETVTGVRSFLG 744
            P+    V+   G
Sbjct: 371 SPKNSREVQRLTG 409



 Score = 60.5 bits (145), Expect(2) = 2e-29
 Identities = 36/106 (33%), Positives = 55/106 (50%), Gaps = 4/106 (3%)
 Frame = +3

Query: 751 RFIEGFSKIATPLTQLTKKDHPFVWTEKCE*SFQTLKERLTKAPVLTLPDPSKDYDVYCD 810
           RFI   +    P  +L   +  FVW EKCE +F+ LK+ LT  PVL+ P+      +Y  
Sbjct: 429 RFISRSTDKCLPFYKLLCGNKRFVWDEKCEEAFEQLKQYLTTPPVLSKPEAGDTLSLYIA 608

Query: 811 ASKSGLGCVLMQ----ERKVIAYASQQLRPHEQNYPTHDMELAAVV 852
            S + +  VL++    E+K I Y S+++   E  YPT +    AV+
Sbjct: 609 ISSTAVSSVLIREDRGEQKPIFYTSKRMTDPETRYPTLEKMAFAVI 746


>BG644740 similar to PIR|A84460|A84 probable retroelement pol polyprotein
           [imported] - Arabidopsis thaliana, partial (4%)
          Length = 754

 Score = 87.0 bits (214), Expect(2) = 5e-24
 Identities = 41/69 (59%), Positives = 52/69 (74%)
 Frame = -1

Query: 535 KGFIRPSVSPWGAPVLLVKKKDGSMRLCVDYRQLNKVTIKNRYPLPRIDDLMDQLKGARV 594
           K F +PS+SP GA +L V+KKDG  R+C+DYRQ NKVT KN+YPLPRID+L D+++    
Sbjct: 274 KRFQQPSISP*GAALLFVRKKDGYFRMCIDYRQFNKVTTKNKYPLPRIDNLFDKIQEDCY 95

Query: 595 FSKIDLRSG 603
           F  IDLR G
Sbjct: 94  F*NIDLRLG 68



 Score = 43.9 bits (102), Expect(2) = 5e-24
 Identities = 25/69 (36%), Positives = 40/69 (57%)
 Frame = -2

Query: 469 PAMEDIPVVREFPEVFPEDMTELPPEREVEFAIDVIPGTTPISAAPYRISPLELAELQKQ 528
           P  E + V++ F  VFP++   +P ERE+ F ID++  T  IS  P  +   EL +L+  
Sbjct: 471 PLFEVVLVLKGFS*VFPDNFPVIPLEREIFFCIDLLLDTQLISNPP*LMDRTELKKLKI* 292

Query: 529 VEELLSKGF 537
           +++ L KGF
Sbjct: 291 LKDSLEKGF 265


>AL366725 
          Length = 485

 Score = 85.1 bits (209), Expect = 2e-16
 Identities = 45/120 (37%), Positives = 69/120 (57%)
 Frame = +2

Query: 150 QFHPYYGTTADDASKCIRFECGLRPDIRAAIGHQQIRTFTVLVEKCRIFEENDRARREYY 209
           +F+P+Y     + SKCI+FE GLRPDI+ AIG+QQ+R F  LV  CRI+EE+ +A  +  
Sbjct: 2   KFYPHYAAETAEFSKCIKFENGLRPDIKRAIGYQQLRVFPDLVNTCRIYEEDTKAHDKVV 181

Query: 210 KSSKFNKTSKRREERKKPYSPRNYKPELQNRNYGGARPTNPNSHVTCYRCGKEGHKSWSC 269
              K    +K +  R KPYS    K + +  +    +  +  + + C+  G++GHKS  C
Sbjct: 182 NERK----TKGQ*SRPKPYSAPADKGKQRMVDDRRPKKKDAPAEIVCFNYGEKGHKSNVC 349


>BG454871 weakly similar to GP|10140673|g putative gag-pol polyprotein {Oryza
            sativa (japonica cultivar-group)}, partial (7%)
          Length = 674

 Score = 75.1 bits (183), Expect(2) = 3e-16
 Identities = 48/120 (40%), Positives = 64/120 (53%), Gaps = 3/120 (2%)
 Frame = +2

Query: 1161 SKFWGSLHEALGTRLSLSSAYHPQSDGQSERTIQTLEDMLRACVLDYKGSWEDFLPLAEF 1220
            S FW  L +  GT L++SSAYHP SDGQSE   +  E  LR  +      W    P AE+
Sbjct: 32   SNFWKQLFKLHGTILTMSSAYHP*SDGQSEALNKGXEMYLRCLMFTDPLKWSKAFPWAEY 211

Query: 1221 SYNNSYHSSLGMAPFEALYGRRCKTPL-CWLSGEDKITLGPELLQ--EMTEKVRSIREKL 1277
             YN SY+ S  M PF+ALYGR     +    S +D   L  +L Q  E+  +++SI  +L
Sbjct: 212  WYNTSYNISAAMTPFKALYGRDLSMLIRSKGSSKDTADLQSQLAQREELLSQLQSISTRL 391



 Score = 29.6 bits (65), Expect(2) = 3e-16
 Identities = 12/30 (40%), Positives = 18/30 (60%)
 Frame = +1

Query: 1280 AQDRQKSYYDKRHKPLEFQEGDHVFLRVTP 1309
            AQ   K   DK+ +  EFQ G+HV +++ P
Sbjct: 388  AQQTMKHQADKKRRHFEFQLGEHVLVKLQP 477


>BG644699 similar to PIR|T07863|T078 probable polyprotein - pineapple
            retrotransposon dea1 (fragment), partial (5%)
          Length = 231

 Score = 65.9 bits (159), Expect = 1e-10
 Identities = 31/73 (42%), Positives = 48/73 (65%), Gaps = 1/73 (1%)
 Frame = +2

Query: 1301 DHVFLRVTPIT-GVGRSIHSKKLTPKYLGPYQILDRIGAVAYRIALPPSLSNLHDVFHIS 1359
            + V L+V P   G  R     KL+ +Y+GP++++ RIG VAY +ALPP LS +H VFH+S
Sbjct: 2    EQVLLKVLPTERGDCRFGKRGKLSLRYIGPFEVIKRIGEVAYELALPPGLSGVHPVFHVS 181

Query: 1360 QLRKYLPDSSHVI 1372
              ++Y  D +++I
Sbjct: 182  MFKRYHGDGNYII 220


>BG587101 similar to GP|6691191|gb F7F22.15 {Arabidopsis thaliana}, partial
            (10%)
          Length = 624

 Score = 61.2 bits (147), Expect = 3e-09
 Identities = 45/177 (25%), Positives = 87/177 (48%), Gaps = 4/177 (2%)
 Frame = +2

Query: 1018 ILEEGHKSDFSIHPGTTKMYQDLKKM-FWWPGMKKDIMKKVTSCLTCQK---VKGEHQKP 1073
            IL   H S+++ H   +K    +++  FWWP M KD    ++ C  CQ+   +   ++ P
Sbjct: 104  ILFHCHGSNYAGHFAVSKTVSKIQQAGFWWPTMFKDAHSFISKCDPCQRQGNIS*RNEMP 283

Query: 1074 SGSLQPLSIPEWKWEGISMDFVSGLPRTTTGHDAIWVIVDRLTKSAHFIAVNMTFPSEKL 1133
               +  + +    ++   +DF+   P ++  +  I V VD ++K    IA + T  +  +
Sbjct: 284  QNFILEVEV----FDVWGIDFMGPFP-SSYNNKYILVAVDYVSKWVEAIA-SPTNDATVV 445

Query: 1134 ARIYVKEIVRLHGVPANIVSDRDPRFVSKFWGSLHEALGTRLSLSSAYHPQSDGQSE 1190
             +++   I    GVP  ++SD    F++K +  L +  G R  +++AYHPQ   +S+
Sbjct: 446  VKMFKSVIFPRFGVPRVVISDGGSHFINKVFEKLLKKNGVRHKVATAYHPQKAERSK 616


>BG452991 PIR|A25875|A25 histone H4 - Tetrahymena thermophila, partial (33%)
          Length = 560

 Score = 52.0 bits (123), Expect(2) = 1e-07
 Identities = 24/78 (30%), Positives = 42/78 (53%)
 Frame = +3

Query: 929  IEETELIEKFRDMNLIMETLPQGTRLGTLTLTNEFIEEVKKEQARDENLQKEAHGRDSMS 988
            +E    +E+FRD++L+ E  PQ  +LG L + NEF++ +K+ Q  D  L     G +   
Sbjct: 51   LESWSCLEQFRDLSLVCEVSPQSVKLGMLKINNEFLDSIKEAQKVDVKLVDLMFGNNQTE 230

Query: 989  RPDFLKGPDGLWRYQGRL 1006
              DF     G+ +++ R+
Sbjct: 231  DGDFKVDDQGVLQFRDRI 284



 Score = 23.5 bits (49), Expect(2) = 1e-07
 Identities = 10/12 (83%), Positives = 11/12 (91%)
 Frame = +1

Query: 912 VVADALSRKSLH 923
           VVAD LSRK+LH
Sbjct: 1   VVADVLSRKTLH 36


>BG587176 weakly similar to PIR|G84493|G84 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana, partial
            (1%)
          Length = 729

 Score = 47.4 bits (111), Expect = 4e-05
 Identities = 21/54 (38%), Positives = 32/54 (58%), Gaps = 1/54 (1%)
 Frame = -1

Query: 1383 LTYPTQPVKILERREKQLRKRTVPLVKLAWS-DDNQDATWELEESARKRYPSLF 1435
            L   T+PV+IL+R EK +RK+ + +VK+ W     ++ TWE E   +  YP  F
Sbjct: 717  LDLETRPVRILDRMEKAMRKKPIQMVKIVWDCSGREEITWETEARMKADYPEWF 556


>BG586308 weakly similar to PIR|F84528|F8 probable retroelement pol polyprotein
            [imported] - Arabidopsis thaliana, partial (7%)
          Length = 686

 Score = 44.7 bits (104), Expect = 3e-04
 Identities = 25/71 (35%), Positives = 39/71 (54%)
 Frame = -2

Query: 1145 HGVPANIVSDRDPRFVSKFWGSLHEALGTRLSLSSAYHPQSDGQSERTIQTLEDMLRACV 1204
            HG+P  IV+D    F+S  +    E    RL+ +S  +PQS+GQ+E + + + D L+  +
Sbjct: 685  HGLPYEIVTDNGSHFISNKFREFCERWRIRLNTASPRYPQSNGQAEASNKIIIDGLKKRL 506

Query: 1205 LDYKGSWEDFL 1215
               KG W D L
Sbjct: 505  DLKKGCWADEL 473


>CA860311 weakly similar to GP|7289872|gb|A CG17427 gene product {Drosophila
           melanogaster}, partial (20%)
          Length = 192

 Score = 42.4 bits (98), Expect = 0.001
 Identities = 23/60 (38%), Positives = 36/60 (59%), Gaps = 5/60 (8%)
 Frame = +1

Query: 815 GLGCVLMQERKV-----IAYASQQLRPHEQNYPTHDMELAAVVFALKIWRHYLYGVKFTI 869
           G+G  L Q+ +      IAYAS+ L   E+NY   + E  A ++A++ +RHYL+G KF +
Sbjct: 13  GIGAGLSQKDEENHEHPIAYASRLLTAAERNYTVVERECLAAIWAIRNFRHYLHGPKFEL 192


>CB893783 weakly similar to GP|22830935|dbj hypothetical protein~similar to
           gag-pol polyprotein {Oryza sativa (japonica
           cultivar-group)}, partial (8%)
          Length = 853

 Score = 41.6 bits (96), Expect = 0.002
 Identities = 49/205 (23%), Positives = 83/205 (39%), Gaps = 33/205 (16%)
 Frame = +2

Query: 335 LCDISLVVLYDSGATHSFISHERAKSLKLVITQLPYDLVVTTPTKESAVTSSVCKKCPLV 394
           +CD    V+ DSG+  + +S+   + L+L     P+   +    K + V  S C      
Sbjct: 257 VCD----VIIDSGSCENVVSNYMVEKLELPTKDHPHRYKLQWLKKGNEVRVSKCCLVSFS 424

Query: 395 IEDREYITNLVC--LPLEGLDIILGMNWLSINNVLLDCRLRVPIFLQ------------- 439
           I  ++Y  N+ C  + ++   ++LG  W    + L D       F++             
Sbjct: 425 I-GQKYKDNVWCDVISMDACHMLLGRPWQYDRHALYDGHANTYTFVKYGVKIKLVPLPPN 601

Query: 440 -----KYKEKHTASLPEKEPSAY------------LILFSSEGTKRPAMEDIPVVREFPE 482
                K   K   SL  KEP               L+  + E T +  +E + V  +F +
Sbjct: 602 AFDEGKKDFKPIVSLVSKEPFKVTTKDIQDMSLILLVKSNEESTIQKEVEHLLV--DFTD 775

Query: 483 VFPEDMTE-LPPEREVEFAIDVIPG 506
           V P ++   LPP R+++ AID IPG
Sbjct: 776 VVPSEIPSGLPPMRDIQHAIDFIPG 850


>AJ497987 weakly similar to GP|9927273|dbj Similar to Arabidopsis thaliana
           chromosome II BAC F26H6; putative retroelement pol
           polyprotein, partial (1%)
          Length = 636

 Score = 40.8 bits (94), Expect = 0.004
 Identities = 23/78 (29%), Positives = 39/78 (49%), Gaps = 1/78 (1%)
 Frame = -2

Query: 850 AVVFALKIWRHYLYGVKFTIYSDHQSLKYLFDQKTLNMRQRRWVEFLEDYDFKLQYHPG- 908
           A+ +A K  RHY+      + S    +KY+F++  L  R  RW   L +YD + +     
Sbjct: 623 ALAWAAKRLRHYMINHTTWLVSKMDPIKYIFEKPALTGRIARWQMLLSEYDIEYRSQKAI 444

Query: 909 KANVVADALSRKSLHAAR 926
           K +++AD L+ + L   R
Sbjct: 443 KGSILADHLAHQPLEDYR 390


>TC87382 similar to EGAD|146423|156195 vitellogenin {Anolis pulchellus},
           partial (7%)
          Length = 2304

 Score = 37.7 bits (86), Expect = 0.032
 Identities = 44/192 (22%), Positives = 80/192 (40%), Gaps = 15/192 (7%)
 Frame = +2

Query: 35  LKRNPPLFDGGYDPEGANRWLRKIEQIYESLPTSEDRMIAYASYLFHEEARNWWVHAKSR 94
           +K + P F+G    +    WL+ IE+++E     E++ +   +    + A  WW + K R
Sbjct: 614 IKVDIPDFEGNLQLDDFLDWLQTIERVFEYKEVPEEQKVKIVAAKLKKHALIWWENLKRR 793

Query: 95  --ITPPDGVLTWSIFKEAFLEKYFP-----ADVKGK---KETEFLELKQGEMFVGQYAAR 144
                   + TW   ++    KY P     A+   K   K++ +  L   +  +  +   
Sbjct: 794 RKREGKSKIKTWDKMRQKLTRKYLPPHYYQANFTQK*LPKKSSYQPLSPTKNHIDYHKPL 973

Query: 145 FEE-LSQFHPYYGTTAD---DASKCIRFECGLRPDIRAAIGHQQIRTFTVLVEKC-RIFE 199
             + +S F P   T  +   +  KC  F C     I     +Q++  FT++ E+   IFE
Sbjct: 974 IHQPISSFRPQRNTIKERNTNIPKC--FICQGYGHIALDCVNQKV--FTIVNEEINNIFE 1141

Query: 200 ENDRARREYYKS 211
           E    R + Y+S
Sbjct: 1142EE---REDVYES 1168


>TC87383 similar to GP|19168656|emb|CAD26175. DNA-DIRECTED RNA POLYMERASE II
           {Encephalitozoon cuniculi}, partial (0%)
          Length = 1247

 Score = 36.6 bits (83), Expect = 0.072
 Identities = 19/85 (22%), Positives = 38/85 (44%), Gaps = 2/85 (2%)
 Frame = -2

Query: 35  LKRNPPLFDGGYDPEGANRWLRKIEQIYESLPTSEDRMIAYASYLFHEEARNWWVHAKSR 94
           +K + P F+G   P+    WL+ +E++++     E++ +   +    + A  WW + K R
Sbjct: 739 IK*DIPDFEGNLQPDDLLDWLQIMERLFKYKEVLEEQKVKIVAAKLKKLASIWWENVKRR 560

Query: 95  --ITPPDGVLTWSIFKEAFLEKYFP 117
                   + TW   ++    KY P
Sbjct: 559 RKREGKSKIKTWEKMRQKLTRKYLP 485


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.319    0.136    0.411 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 44,440,311
Number of Sequences: 36976
Number of extensions: 630017
Number of successful extensions: 3134
Number of sequences better than 10.0: 57
Number of HSP's better than 10.0 without gapping: 3054
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3124
length of query: 1435
length of database: 9,014,727
effective HSP length: 108
effective length of query: 1327
effective length of database: 5,021,319
effective search space: 6663290313
effective search space used: 6663290313
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)
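
Note on reading the statistics above: the sketch below is an illustrative check, not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (lambda * raw - ln K) / ln 2 and E = search space * 2^-bits) and that the "length of database" figure is the nucleotide letter count divided by 3, which is consistent with the numbers reported. All variable and function names are hypothetical. Hits reported with Expect(2) use sum statistics over several HSPs, so this single-HSP formula does not apply to them.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 1435             # length of query
DB_LETTERS_NT = 27_044_181   # Number of letters in database (nucleotides)
DB_SEQS = 36_976             # Number of sequences in database
HSP_LEN = 108                # effective HSP length
GAPPED_LAMBDA = 0.267        # Gapped Lambda
GAPPED_K = 0.0410            # Gapped K

# Assumption: for TBLASTN the database length is counted in codons.
db_len = DB_LETTERS_NT // 3                     # 9,014,727 in the report
eff_query_len = QUERY_LEN - HSP_LEN             # 1327 in the report
eff_db_len = db_len - DB_SEQS * HSP_LEN         # 5,021,319 in the report
search_space = eff_query_len * eff_db_len       # 6663290313 in the report


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to bits with the gapped parameters."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_value(bits: float) -> float:
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


# First hit, TC86737: the report lists Score = 314 bits (805), Expect = 1e-85.
top_bits = bit_score(805)
top_evalue = expect_value(top_bits)
print(f"search space = {search_space}")
print(f"TC86737: {top_bits:.1f} bits, E = {top_evalue:.1e}")
```

The recomputed values can differ from the printed ones in the last digit because the report rounds Lambda and K to three significant figures.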

