Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC148227.2 + phase: 0 /pseudo
         (1075 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAA36615.1| unnamed protein product [Solanum tuberosum] gi|4...   196  5e-48
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi...   148  8e-34
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi...   140  2e-31
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...   140  3e-31
pir||F86470 probable retroelement polyprotein [imported] - Arabi...   134  2e-29
gb|AAT40550.1| putative receptor kinase [Solanum demissum]            132  5e-29
ref|NP_918613.1| polyprotein [Oryza sativa (japonica cultivar-gr...   130  2e-28
dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis t...   127  2e-27
gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi...   121  1e-25
gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thalia...   121  1e-25
gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsi...   120  2e-25
dbj|BAD99220.1| polypeptide with an integrase domain [Petunia x ...   119  5e-25
gb|AAT38747.1| putative polyprotein [Solanum demissum]                117  2e-24
pir||E96608 probable retroelement polyprotein F25P12.89 [importe...   110  3e-22
gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsi...   108  1e-21
gb|AAU89730.1| putative polyprotein [Solanum tuberosum]               100  3e-19
emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis...    99  8e-19
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ...    95  1e-17
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf...    86  5e-15
gb|AAO26691.1| gag-pol polyprotein [Vitis vinifera]                    82  1e-13

>emb|CAA36615.1| unnamed protein product [Solanum tuberosum] gi|421954|pir||S25786
           hypothetical protein 3 - potato transposon Tst1
          Length = 675

 Score =  196 bits (497), Expect = 5e-48
 Identities = 164/532 (30%), Positives = 246/532 (45%), Gaps = 85/532 (15%)

Query: 1   CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
           CNL+S  KL     C+  F+   C FQ+ +SGK I SA+++GGLY+L D      QL  I
Sbjct: 32  CNLVSFRKLTRSLNCRVIFYSDLCEFQEKVSGKMIGSARESGGLYFL-DNGNNSLQLNPI 90

Query: 61  SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQH-R 119
             F  S FV N    VMLWH  LGHPSF YL+ + P+LF   + S FQCE  E +K H  
Sbjct: 91  --FLNSTFVLNK---VMLWHYGLGHPSFYYLRHLLPQLFRNKNPSLFQCEFCEMAKHHVD 145

Query: 120 SSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*DK 179
           +SFP Q Y+ SK F++IHSDVWGP+RI+   T  G    +  I D              +
Sbjct: 146 TSFPSQRYQASKPFTMIHSDVWGPSRIS---TMFGKRWFVTFIDDH------------TR 190

Query: 180 LSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETF------------------FFMENG 221
           LS + L   +  +  +++    T  + + TQF E                    FF + G
Sbjct: 191 LSWVFLLKGKSEVKNVFE----TFHVMVETQFNEKIKIFRSDNGREFFNEQLGSFFRKTG 246

Query: 222 IVQQSTCVSSPQQNGITERKNRHLLEMARALLFFH*SSKILMG*G------CINCCTLNK 275
           +V QS+C  +PQQNGI ERKNRHLLE  RAL+F     + L G         IN      
Sbjct: 247 VVHQSSCPDTPQQNGIAERKNRHLLEATRALMFTSKVPQHLWGEALLTATYLINRMPSRP 306

Query: 276 LYVISCFKP*DSSRNLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVG 335
           L   + FK    S    ++ +  P         + +H          +   RA K +FVG
Sbjct: 307 LEFKTPFKVFRESFPSSRLTTDLPLRVFGCTTFVHVH-------NRSKLEPRAKKCIFVG 359

Query: 336 YSPTRKGYKCLDLNSKRFLVTMDVTFF*K*TFFFRTIIFKGGNQMKIHLIFFEDLILFEN 395
           Y+P++KGYKC D ++++ +VTMD+TFF    +F   +      Q + HL      ++   
Sbjct: 360 YAPSQKGYKCYDPHARKIIVTMDLTFFESQLYFTTHL------QGEYHLGEDSFFVILRK 413

Query: 396 MFMSHSSRPFVSKENAP-----DNVSEHTPSMSEDVTKLVATNQNSNNDSLEPNDNQELI 450
           + +    R  + + N       +++++  P   +D + L+   Q    + + P+++    
Sbjct: 414 LDIK-QMRSLILQINTDVRDVGEDINKCDPRDDKDQSDLMIKTQKFKPEPVAPSND---- 468

Query: 451 QMSLHEHPYNETERKFGEVEGTWKGIIYGRRNHDKVVEDLIPQHSHESEPRE 502
                       + K G  E   +  +Y RRN  +       QH  +S P++
Sbjct: 469 ------------KNKNGNREQKTEMQVYSRRNRTQEKRTEDSQHCQKSVPQD 508


>gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301698|pir||C84512 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1501

 Score =  148 bits (374), Expect = 8e-34
 Identities = 113/365 (30%), Positives = 165/365 (44%), Gaps = 27/365 (7%)

Query: 1   CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
           C L+SVSKL+   +C   F DT C  QD  S   I S ++ GG+YYL D           
Sbjct: 464 CTLISVSKLLKQTQCLATFTDTLCFLQDRSSKTLIGSGEERGGVYYLTDVTPA------- 516

Query: 61  SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRS 120
                    +N   D  LWH RLGHPSF  L  +          +S  C++   +KQ R 
Sbjct: 517 -----KIHTANVDSDQALWHQRLGHPSFSVLSSLPLFSKTSSTVTSHSCDVCFRAKQTRE 571

Query: 121 SFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*DKL 180
            FP    K  + FS+IH DVWGP R+   P   G    L I+ D+   ++        ++
Sbjct: 572 VFPESINKTEECFSLIHCDVWGPYRV---PASCGAVYFLTIVDDYSRAVWTYLLLEKSEV 628

Query: 181 SRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGITER 240
            ++L N  + +     K  ++    N       + +F ENGI+ Q++CV +PQQNG  ER
Sbjct: 629 RQVLTNFLKYAEKQFGKTVKMVRSDNGTEFMCLSSYFRENGIIHQTSCVGTPQQNGRVER 688

Query: 241 KNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLKILSKCPY 300
           K+RH+L +ARALLF         G   +    L      S      S R   ++L    +
Sbjct: 689 KHRHILNVARALLFQASLPIKFWGESILTAAYLINRTPSSIL----SGRTPYEVL----H 740

Query: 301 FSRFAFKNIRMH----YFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLNSKRFLVT 356
            S+  +  +R+     Y    T+   +   R+   +FVGY   +KG+K  D+    FLV+
Sbjct: 741 GSKPVYSQLRVFGSACYVHRVTRDKDKFGQRSRSCIFVGYPFGKKGWKVYDIERNEFLVS 800

Query: 357 MDVTF 361
            DV F
Sbjct: 801 RDVIF 805


>gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301695|pir||D84481 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1413

 Score =  140 bits (353), Expect = 2e-31
 Identities = 114/372 (30%), Positives = 166/372 (43%), Gaps = 41/372 (11%)

Query: 1   CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
           C+L+SVSKL+   +C   F DT C+ QD  S   I + ++  G+YYL D   T      I
Sbjct: 447 CSLISVSKLVKQIKCLALFTDTICVLQDRFSRTLIGTGEERDGVYYLTDAATTTVHKVDI 506

Query: 61  SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHD--FSSFQCEIYEFSKQH 118
           ++            D  LWH RLGHPSF  L  +   LF G     SS  C++   +KQ 
Sbjct: 507 TT------------DHALWHQRLGHPSFSVLSSL--PLFSGSSCSVSSRSCDVCFRAKQT 552

Query: 119 RSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*D 178
           R  FP  + K +  FS+IH DVWGP R+   P+  G    L I+ DF   ++        
Sbjct: 553 REVFPDSSNKSTDCFSLIHCDVWGPYRV---PSSCGAVYFLTIVDDFSRSVWTYLLLAKS 609

Query: 179 KLSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGIT 238
           ++  +L N    +     K  ++    N       + +F E GIV Q++CV +PQQNG  
Sbjct: 610 EVRSVLTNFLAYTEKQFGKSVKIIRSDNGTEFMCLSSYFKEQGIVHQTSCVGTPQQNGRV 669

Query: 239 ERKNRHLLEMARALLFFH*SSKILMG*GCINCCTL---------NKLYVISCFKP*DSSR 289
           ERK+RH+L ++RALLF         G   +    L         N L             
Sbjct: 670 ERKHRHILNVSRALLFQASLPIKFWGEAVMTAAYLINRTPSSIHNGLSPYELLHGCKPDY 729

Query: 290 NLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLN 349
           + L++     Y  R              T+   +   R+   +FVGY   +KG+K  DL+
Sbjct: 730 DQLRVFGSACYAHRV-------------TRDKDKFGERSRLCIFVGYPFGQKGWKVYDLS 776

Query: 350 SKRFLVTMDVTF 361
           +  F+V+ DV F
Sbjct: 777 TNEFIVSRDVVF 788


>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1491

 Score =  140 bits (352), Expect = 3e-31
 Identities = 113/372 (30%), Positives = 166/372 (44%), Gaps = 41/372 (11%)

Query: 1   CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
           C+L+SVSKL+   +C   F DT C+ QD  S   I + ++  G+YYL D   T      +
Sbjct: 447 CSLISVSKLVKQIKCLALFTDTICVLQDRFSRTLIGTGEERDGVYYLTDAATTTVHKVDV 506

Query: 61  SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHD--FSSFQCEIYEFSKQH 118
           ++            D  LWH RLGHPSF  L  +   LF G     SS  C++   +KQ 
Sbjct: 507 TT------------DHALWHQRLGHPSFSVLSSL--PLFSGSSCSVSSRSCDVCFRAKQT 552

Query: 119 RSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*D 178
           R  FP  + K +  FS+IH DVWGP R+   P+  G    L I+ DF   ++        
Sbjct: 553 REVFPDSSNKSTDCFSLIHCDVWGPYRV---PSSCGAVYFLTIVDDFSRSVWTYLLLAKS 609

Query: 179 KLSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGIT 238
           ++  +L N    +     K  ++    N       + +F E GIV Q++CV +PQQNG  
Sbjct: 610 EVRSVLTNFLAYTEKQFGKSVKIIRSDNGTEFMCLSSYFKEQGIVHQTSCVGTPQQNGRV 669

Query: 239 ERKNRHLLEMARALLFFH*SSKILMG*GCINCCTL---------NKLYVISCFKP*DSSR 289
           ERK+RH+L ++RALLF         G   +    L         N L             
Sbjct: 670 ERKHRHILNVSRALLFQASLPIKFWGEAVMTAAYLINRTPSSIHNGLSPYELLHGCKPDY 729

Query: 290 NLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLN 349
           + L++     Y  R              T+   +   R+   +FVGY   +KG+K  DL+
Sbjct: 730 DQLRVFGSACYAHRV-------------TRDKDKFGERSRLCIFVGYPFGQKGWKVYDLS 776

Query: 350 SKRFLVTMDVTF 361
           +  F+V+ DV F
Sbjct: 777 TNEFIVSRDVVF 788


>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
           gi|9989049|gb|AAG10812.1| Putative retroelement
           polyprotein [Arabidopsis thaliana]
          Length = 1404

 Score =  134 bits (336), Expect = 2e-29
 Identities = 133/518 (25%), Positives = 205/518 (38%), Gaps = 82/518 (15%)

Query: 2   NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
           NLLSV +   D  C   F      FQD  +GK I      G LY LED          +S
Sbjct: 390 NLLSVKRTTRDLNCYAIFGPNDVYFQDIETGKVIGEGGSKGELYVLED----------LS 439

Query: 62  SFSESFFVSNNKDDVM---LWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQH 118
             S S F S +   +    LWH RLGHP  + LK++ P + F H      CE     K  
Sbjct: 440 PNSSSCFSSKSHLGISFNTLWHARLGHPHTRALKLMLPNISFDHT----SCEACILGKHC 495

Query: 119 RSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*D 178
           +S FP       K F ++HSDVW    ++    +D     +  I +   Y +       D
Sbjct: 496 KSVFPKSLTIYEKCFDLVHSDVWTSPCVS----RDNNKYFVTFINEKSKYTWITLLPSKD 551

Query: 179 KLSRILLN*CRLSLIPLYKFSELTMELNILT-----QF*ETFF---FMENGIVQQSTCVS 230
           ++     N         Y  ++   ++ +       ++    F     + GI+ Q++C  
Sbjct: 552 RVFEAFTN------FETYVTNQFNAKIKVFRTDNGGEYTSQKFRDHLAKRGIIHQTSCPY 605

Query: 231 SPQQNGITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTL---NKLYVISCFKP*DS 287
           +PQQNG+ ERKNRHL+E+AR+++F     K   G   +  C L       V+S   P + 
Sbjct: 606 TPQQNGVAERKNRHLMEVARSMMFHTSVPKRFWGDAVLTACYLINRTPTKVLSDLSPFEV 665

Query: 288 SRNLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLD 347
             N    +     F    F  I      P  Q ++    +++K +F+GYS T+KGYKC D
Sbjct: 666 LNNTKPFIDHLRVFGCVCFVLI------PGEQRSKLD-AKSTKCMFLGYSTTQKGYKCFD 718

Query: 348 LNSKRFLVTMDVTFF*K*TFFFRTIIFKGGNQMKIHLIFFEDLILFENMFMSHSSRPFVS 407
               R  ++ DV F     +  +    K    +K       D +      + H       
Sbjct: 719 PTKNRTFISRDVKFLENQDYNNK----KDWENLKDLTHSTSDRVETLKFLLDHLG----- 769

Query: 408 KENAPDNVSEHTPSMSEDVTKLVATNQNSNNDSLEPNDNQELIQMSLHEHPYNETERKFG 467
             N   + ++H P M++D   L   NQ +   SL+  +N   +Q    E P N  E    
Sbjct: 770 --NDSTSTTQHQPEMTQDQEDL---NQENEEVSLQHQENLTHVQ----EDPPNTQE---- 816

Query: 468 EVEGTWKGIIYGRRNHDKVVEDLIPQHSHESEPRENQP 505
                          H + V+++    S + EP +  P
Sbjct: 817 ---------------HSEHVQEIQDDSSEDEEPTQVLP 839


>gb|AAT40550.1| putative receptor kinase [Solanum demissum]
          Length = 1358

 Score =  132 bits (333), Expect = 5e-29
 Identities = 115/378 (30%), Positives = 163/378 (42%), Gaps = 54/378 (14%)

Query: 2   NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
           NL SVS+L     C   FFD   L QD  +G+ I +  ++ GLYYL              
Sbjct: 425 NLASVSRLTKALHCSITFFDDFFLMQDRSTGQMIGTGHESQGLYYLTS------------ 472

Query: 62  SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSS 121
             S S    +  D   L H RLGH S   L+ + P L      S+  CE  +  K  R++
Sbjct: 473 --SNSLAACSITDSPDLIHKRLGHSSLSKLQKMVPSL---SSLSTLDCESCQLGKHTRAT 527

Query: 122 FPVQTYKPSK-LFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*DKL 180
           F   T   S+ +FS++HSD+WGP+R++S     G    +  I D+             K 
Sbjct: 528 FSRSTEGRSESIFSLVHSDIWGPSRVSSTL---GFRYFVSFIDDY------------SKC 572

Query: 181 SRILLN*CRLSLIPLYK--FSELTMELNIL--------------TQF*ETFFFMENGIVQ 224
           + + L   R  L  ++K  F+E+  +  +               +QF E  F    GI+ 
Sbjct: 573 TWVFLMKDRSELFSIFKSFFAEIQNQFGVSIRTFRSDNALEYLSSQFRE--FMTHQGIIH 630

Query: 225 QSTCVSSPQQNGITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP 284
           Q+TC  +PQQNG+ ERKNRHL+E AR LL          G   +  C L      S  + 
Sbjct: 631 QTTCPYTPQQNGVAERKNRHLIETARTLLLESNVPLRFWGDAVLTSCYLINRMPSSSIQN 690

Query: 285 *DSSRNLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYK 344
                 L       P   R       +H   P      +   RA K VF+GYS  +KGY+
Sbjct: 691 QVPHSILFPQSHLYPIPPRVFGSTCFVHNLAP---GKDKLAPRALKCVFLGYSRVQKGYR 747

Query: 345 CLDLNSKRFLVTMDVTFF 362
           C   +  R+L++ DVTFF
Sbjct: 748 CYSHDLHRYLMSADVTFF 765


>ref|NP_918613.1| polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1554

 Score =  130 bits (328), Expect = 2e-28
 Identities = 112/371 (30%), Positives = 177/371 (47%), Gaps = 38/371 (10%)

Query: 2   NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
           NLLSVS  I   +C   F +  CLFQ+  +G+ I +  +  GL+Y+  E     +LG  +
Sbjct: 469 NLLSVSSAIDQLKCIVVFDENSCLFQEKWTGRRIGTGVRRDGLWYINHE-----ELGLAA 523

Query: 62  SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSS 121
                  V + + ++ L H +LGHPSF+ L  ++P LF   D     C+  E  K  RS+
Sbjct: 524 ------VVGDVEKEISLLHCQLGHPSFEILSKLYPDLFSRVDKHRLVCDACELGKHTRST 577

Query: 122 FPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMII--LDFIGYIY*KKNQR*DK 179
           +     +  +LF +IHSDVWGP  + S     G    +  I     + +IY  K++   +
Sbjct: 578 YVGIGLRNCELFILIHSDVWGPCPVTSV---SGFKWFVTFIDCHTRMTWIYMLKHK--SE 632

Query: 180 LSRILLN*CRL------SLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQ 233
           + R   +  +L      + + + +    T  +N   +F    +  + GI+ Q+TC  +P 
Sbjct: 633 VLRCFQDFHKLVTTQFDAKVKIIRTDNGTEYIN--NEF--VSYVSDEGIIHQTTCPGTPP 688

Query: 234 QNGITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTL-NKL--YVISCFKP*DSSRN 290
           QNG+ ERKNRHLLE+AR+L+F     K L     +    L N++   ++    P +    
Sbjct: 689 QNGVAERKNRHLLEVARSLMFQMNVPKYLWSEAVMTAAYLINRMPSRILGMKSPAELLLG 748

Query: 291 LLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLNS 350
             +       F    F  +R H       +  +    A K VFVGY+ ++KGYKC D   
Sbjct: 749 KREFKVPPKVFGCVCF--VRDH-----RPSVGKLDPHAVKCVFVGYASSQKGYKCWDPIG 801

Query: 351 KRFLVTMDVTF 361
           +R  V+MDVTF
Sbjct: 802 RRLFVSMDVTF 812


>dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1098

 Score =  127 bits (320), Expect = 2e-27
 Identities = 115/374 (30%), Positives = 162/374 (42%), Gaps = 43/374 (11%)

Query: 1   CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
           C L+SV+KL+    C   F DT C  QD  +   I + ++  G+YY    L      G  
Sbjct: 438 CTLISVAKLLKHTGCVAIFTDTLCFLQDRFTRTLIGAGEEREGVYYFTGVLAARVNKG-- 495

Query: 61  SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQ----CEIYEFSK 116
             F ES           LWH RLGHPS   L + FP+  F    S  +    C+I   +K
Sbjct: 496 --FKES-------SSATLWHHRLGHPSTGVL-LSFPE--FASSSSDLEIIKSCDICYRAK 543

Query: 117 QHRSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR 176
           Q R  F     K +  F +IH DVWGP R    P   G    L I+ DF   ++      
Sbjct: 544 QAREVFSPSLNKTTVCFELIHCDVWGPYRT---PASCGSVYFLTIVDDFSRSVWTFLMAE 600

Query: 177 *DKLSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNG 236
             ++SR++ N C +S     K  +     N         FF E GI+ Q++CV + QQNG
Sbjct: 601 KSEVSRLIRNFCAMSERQFCKSIKTVHSDNGTEFMCLKSFFQEQGIIHQTSCVDTRQQNG 660

Query: 237 ITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLKIL- 295
             ERK+RH+L +AR  LF     +   G   +    L              +R   KIL 
Sbjct: 661 RVERKHRHILNVARTCLFQSHLPRKFRGESILTAIHL-------------INRTPTKILH 707

Query: 296 SKCPY----FSRFAFKNIR----MHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLD 347
            K PY     SR ++  +R    + Y     +   +   R+ + VFVGY   +KG++  D
Sbjct: 708 GKSPYEVLFGSRPSYSALRTFGCLCYAHYRARDKDKFSERSRRCVFVGYPYGKKGWRLYD 767

Query: 348 LNSKRFLVTMDVTF 361
           L   +F V+ DV F
Sbjct: 768 LEKNKFFVSRDVVF 781


>gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|7444418|pir||T00499 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1496

 Score =  121 bits (304), Expect = 1e-25
 Identities = 106/368 (28%), Positives = 160/368 (42%), Gaps = 30/368 (8%)

Query: 1   CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
           C L+SVSKL+        F DT C  QD      I + ++  G+YY      TG    ++
Sbjct: 434 CTLISVSKLLKQTSSIAIFTDTFCFLQDRFLRTLIGAGEEREGVYYF-----TGVLAPRV 488

Query: 61  SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQ-CEIYEFSKQHR 119
              S  F +S +     LWH RLGHPS   L  +         F     C+    SKQ R
Sbjct: 489 HKASSDFAISGD-----LWHRRLGHPSTSVLLSLPECNRSSQGFDKIDSCDTCFRSKQTR 543

Query: 120 SSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*DK 179
             FP+   K  + FS+IH DVWGP R    P+  G    L ++ D+   ++        +
Sbjct: 544 EVFPISNNKTMECFSLIHGDVWGPYRT---PSTTGAVYFLTLVDDYSRSVWTYLMSSKTE 600

Query: 180 LSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGITE 239
           +S+++ N C +S     K  +     N       T +F  +GI+ Q++CV +PQQNG  E
Sbjct: 601 VSQLIKNFCAMSERQFGKQVKAFRTDNGTEFMCLTPYFQTHGILHQTSCVDTPQQNGRVE 660

Query: 240 RKNRHLLEMARALLF------FH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLK 293
           RK+RH+L +ARA LF            IL     IN      L   + ++     R    
Sbjct: 661 RKHRHILNVARACLFQGNLPVKFWGESILTATHLINRTPSAVLKGKTPYELLFGERPSYD 720

Query: 294 ILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLNSKRF 353
           +L     F    + +IR        +   +  +R+ K VF+GY   +K ++  DL + + 
Sbjct: 721 MLRS---FGCLCYAHIR-------PRNKDKFTSRSRKCVFIGYPHGKKAWRVYDLETGKI 770

Query: 354 LVTMDVTF 361
             + DV F
Sbjct: 771 FASRDVRF 778


>gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thaliana]
           gi|25403501|pir||H86486 protein Ty1/copia-element
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1152

 Score =  121 bits (303), Expect = 1e-25
 Identities = 114/374 (30%), Positives = 170/374 (44%), Gaps = 40/374 (10%)

Query: 1   CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
           C L+SV++L+ +  C   F D  C+ QD  S   I    ++ G+Y+L+          + 
Sbjct: 446 CTLISVARLLRELHCFAIFTDKVCVIQDRTSKMLIGVGTESNGVYHLQ----------RA 495

Query: 61  SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQ------CEIYEF 114
              + S  V   K +  LWH+RLGHPS K L  V P L    DF S        C++   
Sbjct: 496 EVVATSANVVKWKTNKALWHMRLGHPSSKVLSSVLPSL---EDFDSCSSDLKTICDVCVR 552

Query: 115 SKQHRSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKN 174
           +KQ R+SF     K  + FS IH DVWGP +   + +  G    L I+ D    ++    
Sbjct: 553 AKQTRASFSESFNKAEECFSFIHYDVWGPYK---HASSCGAHYFLTIVDDHSRAVWIHLM 609

Query: 175 QR*DKLSRILLN*CRLSLIPLYKFSELTMELNILTQF*E-TFFFMENGIVQQSTCVSSPQ 233
               +++ +L     ++     K    T+  N  T+F     +F E GIV Q +CV + Q
Sbjct: 610 LAKSEVASLLQQFIAMASRQFNK-QVKTVRSNNGTEFMSLKSYFAERGIVHQISCVYTHQ 668

Query: 234 QNGITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVIS-CFKP*DSSRNLL 292
           QNG  ERK+RH+L +AR+LLF     +  +         L   Y+I+    P    +   
Sbjct: 669 QNGRVERKHRHILNVARSLLF-----QAELPISFWEESVLTAAYLINRTPTPILDGKTPY 723

Query: 293 KILSKCP--YFSRFAFKNI---RMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLD 347
           KIL   P  Y S   F ++   R H     T    +   R  K +FVGY   +KG++  D
Sbjct: 724 KILYSQPPSYASLRVFGSLCFARKH-----TGRLDKFQERGRKCIFVGYPHGQKGWRIYD 778

Query: 348 LNSKRFLVTMDVTF 361
           + S+ F V+ DV F
Sbjct: 779 IESQIFFVSRDVVF 792


>gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301674|pir||D84639 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1156

 Score =  120 bits (302), Expect = 2e-25
 Identities = 104/344 (30%), Positives = 157/344 (45%), Gaps = 39/344 (11%)

Query: 28  DSISGKTIVSAKKNGGLYYLEDELETGHQLGQISSFSESFFVSNNKDDVMLWHLRLGHPS 87
           D  S   I S ++  G+YYL D         ++SS            D  LWH RLGHPS
Sbjct: 52  DRFSRTLIGSGEERDGVYYLTDVATAKIHTAKVSS------------DQALWHQRLGHPS 99

Query: 88  FKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSSFPVQTYKPSKLFSIIHSDVWGPNRIN 147
           F  L  +           S  C++   +KQ R  FPV T K  + FS+IH DVWGP R+ 
Sbjct: 100 FSVLSSLPVLTSSSLSVGSRSCDVCFRAKQTREVFPVSTNKSIECFSLIHCDVWGPYRV- 158

Query: 148 SYPTKDGLSPLLMIILDFIGYIY*KKNQR*DKLSRILLN*CRLSLIPLYKFSELTMELNI 207
             P+  G    L I+ DF   ++        ++  +L N        +Y   +    + +
Sbjct: 159 --PSSCGAVYFLTIVDDFSRAVWTYLLLAKSEVRTVLTN------FLVYTEKQFGKSVKV 210

Query: 208 LTQF*ETFF------FMENGIVQQSTCVSSPQQNGITERKNRHLLEMARALLFFH*SSKI 261
           L     T F      F E+GIV Q++CV +PQQNG  ERK+RH+L +ARA+LF     + 
Sbjct: 211 LRSDNGTEFMCLASYFREHGIVHQTSCVGTPQQNGRVERKHRHILNVARAILF-----QA 265

Query: 262 LMG*GCINCCTLNKLYVISCFKP*DSSRNLLKILSKCPYFSRFAFKNIRMH----YFCP* 317
            +         L   Y+I+  +   S  N L    +  + S+  ++++R+     Y    
Sbjct: 266 SLPIQFWGEAVLTAAYLIN--RTPTSLHNGLSPY-EILHNSKPNYEHLRVFGSACYVHRA 322

Query: 318 TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLNSKRFLVTMDVTF 361
           ++   +   R+   VF+GY   +KG+K  D+  K FLV+ DV F
Sbjct: 323 SRDKDKFGERSRLCVFIGYPFAQKGWKVFDMEKKEFLVSRDVVF 366


>dbj|BAD99220.1| polypeptide with an integrase domain [Petunia x hybrida]
          Length = 492

 Score =  119 bits (298), Expect = 5e-25
 Identities = 94/305 (30%), Positives = 141/305 (45%), Gaps = 30/305 (9%)

Query: 71  NNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSSFPVQTYKPS 130
           ++ D+  LWH RLGH  F  +K +        +  +F C+I   ++Q +  FP  T K  
Sbjct: 8   DSMDESKLWHFRLGHLPFHAMKTIKTLPVTVDNKQTFPCDICPMARQSKPPFPSSTIKSK 67

Query: 131 KLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIG----YIY*KKNQR*DKLSRILLN 186
           + F +IH D WGP  +   PT  G    L I+ DF      Y+   K+     L + LL+
Sbjct: 68  QCFELIHIDTWGPYNV---PTYKGERYFLTIVDDFSRATWTYLLTTKSNAFATL-KSLLS 123

Query: 187 *CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGITERKNRHLL 246
                     K         + +    + F    GI+ Q+TCV +PQQNG+ ERK+RHLL
Sbjct: 124 LIERQFSSKVKIIRSDNAYELGSGVIPSEFLASLGIIHQTTCVGTPQQNGVVERKHRHLL 183

Query: 247 EMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLKILS-KCPYFSRFA 305
           E  RALL+     K   G      C L   Y+I+ F          K+L+ KCPY   F 
Sbjct: 184 ETCRALLYQSHLPKKFWG-----DCLLTATYLINRFPS--------KVLNGKCPYQVLFG 230

Query: 306 ----FKNIR----MHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLNSKRFLVTM 357
               + +++    + +    T+   +   RA   VF+GY   +KGYK L+L + + +V+ 
Sbjct: 231 SLPDYSHLKSFGSLCFVSTLTRHRDKLMPRAIPGVFLGYPFAQKGYKVLNLQTSQVIVSR 290

Query: 358 DVTFF 362
           DV FF
Sbjct: 291 DVKFF 295


>gb|AAT38747.1| putative polyprotein [Solanum demissum]
          Length = 1336

 Score =  117 bits (293), Expect = 2e-24
 Identities = 88/259 (33%), Positives = 125/259 (47%), Gaps = 28/259 (10%)

Query: 2   NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
           NL+SVSKL  + +C    +  HCLFQD ++ + I     + GLY L++          I 
Sbjct: 430 NLISVSKLTKELKCFVSLYPDHCLFQDLMTKQIIGKRHVSDGLYILDEWTPPSVACSSIV 489

Query: 62  SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSS 121
           S  E+             H RLGHPS   LK + P+    H+  S  CE   F+K HR S
Sbjct: 490 SPFEA-------------HCRLGHPSLPVLKKLCPQF---HNVPSIDCESCHFAKHHRIS 533

Query: 122 F-PVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDF--IGYIY*KKNQR*D 178
             P    + +  F ++HSDVWGP  + S   K G    +  + DF  + +IY  KN+   
Sbjct: 534 LSPRNNKRANFAFELVHSDVWGPCPVVS---KVGFRYFVTFMDDFSRMTWIYFMKNR--S 588

Query: 179 KLSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFF---FMENGIVQQSTCVSSPQQN 235
           ++     N C   +   +  S   +  +   +F    F     + GI+ QS+CV +P QN
Sbjct: 589 EVFSHFSNFC-AEIKTQFNASVHILRSDNAREFMSASFQNYMNQYGILHQSSCVDTPSQN 647

Query: 236 GITERKNRHLLEMARALLF 254
           G+ ERKNRHLLE AR LLF
Sbjct: 648 GVAERKNRHLLETARVLLF 666



 Score = 39.3 bits (90), Expect = 0.72
 Identities = 18/42 (42%), Positives = 27/42 (63%)

Query: 327 RASKRVFVGYSPTRKGYKCLDLNSKRFLVTMDVTFF*K*TFF 368
           +A K VF+GYS  +KGY+C      R++V++DV F    +FF
Sbjct: 736 KALKCVFLGYSRLQKGYRCYSPTLNRYMVSIDVVFSESISFF 777


>pir||E96608 probable retroelement polyprotein F25P12.89 [imported] -
           Arabidopsis thaliana gi|9954746|gb|AAG09097.1| Putative
           retroelement polyprotein [Arabidopsis thaliana]
          Length = 1486

 Score =  110 bits (275), Expect = 3e-22
 Identities = 105/374 (28%), Positives = 167/374 (44%), Gaps = 45/374 (12%)

Query: 1   CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
           C+L+SVS+L   +RC     D  C+ QD  +   I + ++  GLY+    +ET   +   
Sbjct: 441 CHLISVSQLTRTRRCIFQITDKVCIVQDRTTLMLIGAGRELNGLYFFRG-VETAAAV--- 496

Query: 61  SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRS 120
                    S       LWH RLGHPS K L ++         F S  CEI   +KQ R 
Sbjct: 497 --------TSKALPSSQLWHQRLGHPSSKALHLLPFSDVTSSTFDSKTCEICIQAKQTRD 548

Query: 121 SFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*DKL 180
            FP+ + K S  F ++H D+WGP R  S     G    L ++ D+   ++        + 
Sbjct: 549 PFPLSSNKTSFAFELVHCDLWGPYRTTSI---CGSRYFLTLVDDYSRAVWLYLLPSKQEA 605

Query: 181 SRILLN*CRLSLIPLYKFSELTMELNILTQF*ETF-----FFMENGIVQQSTCVSSPQQN 235
            + L N      I L +    T    I +     F     FF + GI+ +++CV +PQQN
Sbjct: 606 PKHLKN-----FIALVERQYTTNIKMIRSDNGSEFICLSDFFAQKGIIHETSCVGTPQQN 660

Query: 236 GITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLKIL 295
           G  ERK+RH+L +ARAL F     +  +     + C L   Y+I+      +   LLK  
Sbjct: 661 GRVERKHRHILNVARALRF-----QSGLPIEFWSYCALTAAYLIN-----RTPTPLLK-- 708

Query: 296 SKCPYFSRF----AFKNIRMH----YFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLD 347
            K P+   +      ++IR+     Y         +  +R++K +F+GY   +KG++  +
Sbjct: 709 GKTPFELIYNRPPPLQHIRIFGCICYVHNLKHGGDKFASRSNKSIFLGYPFAKKGWRVYN 768

Query: 348 LNSKRFLVTMDVTF 361
           + +    V+ DV F
Sbjct: 769 IETGVVSVSRDVVF 782


>gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301700|pir||G84542 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1333

 Score =  108 bits (270), Expect = 1e-21
 Identities = 98/375 (26%), Positives = 164/375 (43%), Gaps = 48/375 (12%)

Query: 2   NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
           +L+S+ +L+ + RC     D   + QD  S   + + ++ GG ++              +
Sbjct: 327 DLISIGQLMDENRCVLQMSDRFLVVQDRTSRMVMGAGRRVGGTFHFRS-----------T 375

Query: 62  SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIV-FPKLFFGHDFSSFQCEIYEFSKQHRS 120
             + S  V   K+   LWH R+GHP+ + + ++    +       +  C++   +KQ R+
Sbjct: 376 EIAASVTVKEEKN-YELWHSRMGHPAARVVSLIPESSVSVSSTHLNKACDVCHRAKQTRN 434

Query: 121 SFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIG----YIY*KKNQR 176
           SFP+   K  ++F +I+ D+WGP R    P+  G    L II D+      Y+   K++ 
Sbjct: 435 SFPLSINKTLRIFELIYCDLWGPYRT---PSHTGARYFLTIIDDYSRGVWLYLLNDKSEA 491

Query: 177 *DKLSRILLN*CRLSLIPLYKF-SELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQN 235
              L        R   + +    S+   E   LT+     FF E G++ + +CV++P++N
Sbjct: 492 PCHLKNFFAMTDRQFNVKIKTVRSDNGTEFLCLTK-----FFQEQGVIHERSCVATPERN 546

Query: 236 GITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLKIL 295
              ERK+RHLL +ARAL F         G      C L   Y+I        +R    +L
Sbjct: 547 DRVERKHRHLLNVARALRFQANLPIQFWG-----ECVLTAAYLI--------NRTPSSVL 593

Query: 296 SKCPYFSRFAFKNIRMHY------FCP*TQTNR---QT*TRASKRVFVGYSPTRKGYKCL 346
           +    + R   K  R  +       C     NR   +   R+ + VFVGY   +KG++  
Sbjct: 594 NDSTPYERLHKKQPRFDHLRVFGSLCYAHNRNRGGDKFAERSRRCVFVGYPHGQKGWRLF 653

Query: 347 DLNSKRFLVTMDVTF 361
           DL    F V+ DV F
Sbjct: 654 DLEQNEFFVSRDVVF 668


>gb|AAU89730.1| putative polyprotein [Solanum tuberosum]
          Length = 1280

 Score =  100 bits (249), Expect = 3e-19
 Identities = 68/194 (35%), Positives = 101/194 (52%), Gaps = 20/194 (10%)

Query: 70  SNNKDDVMLWHLRLGHPSFKYLK----IVFPKLFFGHDFSSFQCEIYEFSKQHRSSFPVQ 125
           +N   D+ LWH+RLGH  F  +K    I FP +      S + C +   ++Q+R  FPV 
Sbjct: 467 ANVVSDIALWHVRLGHLPFSAMKNLDFISFPSV------SPYICPVCPKARQNRLPFPVS 520

Query: 126 TYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIG----YIY*KKNQR*DKLS 181
           + K  ++F +IH D WGP   +   T DG +  L I+ DF      +I   K+     L 
Sbjct: 521 SIKSKRIFELIHIDTWGPFNTS---THDGYNYFLTIVDDFSRGTWTFILKTKSNAFPVLK 577

Query: 182 RILLN*CRLSLIPLYKF-SELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGITER 240
             L    R   + + +  S+  +EL   +Q  ET F    GI+ + +CV++PQQNG+ ER
Sbjct: 578 DFLAMVERQFELKVQRIRSDNALELGRGSQ--ETMFLHSQGILHERSCVATPQQNGVVER 635

Query: 241 KNRHLLEMARALLF 254
           K++HLLE AR L F
Sbjct: 636 KHKHLLEAARGLFF 649


>emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
           gi|7268152|emb|CAB78488.1| retrovirus-related like
           polyprotein [Arabidopsis thaliana]
           gi|7488175|pir||G71406 probable retrovirus-related
           polyprotein - Arabidopsis thaliana
          Length = 1489

 Score = 99.0 bits (245), Expect = 8e-19
 Identities = 104/364 (28%), Positives = 158/364 (42%), Gaps = 43/364 (11%)

Query: 2   NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
           NL+SVS L+    C  HF+   CL Q+   G  I      G LY+    LET +     S
Sbjct: 529 NLMSVSSLVKTISCSAHFYVDCCLIQELSQGLMI----GRGRLYHNLYILETENTSPSTS 584

Query: 62  SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSS 121
           + +   F  +  +D  LWH RLGHPS     +V  KL                    R +
Sbjct: 585 TPAACLFTGSVLNDGHLWHQRLGHPS----SVVLQKL-------------------KRLA 621

Query: 122 FPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDF--IGYIY*KKNQR*DK 179
           +       S  F ++H D+WGP  I S    +G    L ++ D     ++Y  +N++   
Sbjct: 622 YISHNNLASNPFDLVHLDIWGPFSIESI---EGFRYFLTVVDDCTRTTWVYMLRNKK--D 676

Query: 180 LSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGITE 239
           +S +     +L +   +      +  +   +   T    E+G++   +C  +PQQN + E
Sbjct: 677 VSSVFPEFIKL-VSTQFNAKIKAIRSDNAPELGFTEIVKEHGMLHHFSCAYTPQQNSVVE 735

Query: 240 RKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLK-ILSKC 298
           RK++H+L +ARALLF    S I M     +C T     +     P  ++++  + IL+K 
Sbjct: 736 RKHQHILNVARALLF---QSNIPMQ-YWSDCVTTAVFLINRLPSPLLNNKSPYELILNKQ 791

Query: 299 PYFSRFAFKNIRMHYFCP*TQTNRQT*T-RASKRVFVGYSPTRKGYKCLDLNSKRFLVTM 357
           P +S    KN     F       R   T RA   VF+GY    KGYK LDL S    V+ 
Sbjct: 792 PDYS--LLKNFGCLCFVSTNAHERTKFTPRARACVFLGYPSGYKGYKVLDLESHSVTVSR 849

Query: 358 DVTF 361
           +V F
Sbjct: 850 NVVF 853


>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|37534632|ref|NP_921618.1| putative pol polyprotein
           [Oryza sativa (japonica cultivar-group)]
          Length = 1688

 Score = 94.7 bits (234), Expect = 1e-17
 Identities = 101/394 (25%), Positives = 160/394 (39%), Gaps = 55/394 (13%)

Query: 2   NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSA---KKNGGLYYLEDELETGHQLG 58
           NL+SV +L  D  C   F DT C  QD  +G  I +    K++ GLY L+          
Sbjct: 248 NLISVGQLT-DTNCFVGFDDTSCFVQDRHTGAVIGTGHRQKRSCGLYILDSLSLPSSSTN 306

Query: 59  QISSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHD--FSSFQCEIYEFSK 116
             S +S     S        WH RLGH     L  +  +   G     ++F C+  +  K
Sbjct: 307 TPSVYSP--MCSTACKSFPQWHHRLGHLCGSRLATLINQGVLGSVPVDTTFVCKGCKLGK 364

Query: 117 QHRSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGY--IY*KKN 174
           Q +  +P  T + S+ F ++HSDVWG    + +P+K G +  ++ + D+  Y  IY  K+
Sbjct: 365 QVQLPYPSSTSRSSRPFDLVHSDVWGK---SPFPSKGGHNYYVIFVDDYSRYTWIYFMKH 421

Query: 175 QR*DKLSRILLN*CRLSLIPLYK-FSELTMELNILTQF*ETF------------------ 215
                         R  LI +Y+ F+++     I TQF                      
Sbjct: 422 --------------RSQLISIYQSFAQM-----IHTQFSSAIRIFRSDSGGEYMSNAFRE 462

Query: 216 FFMENGIVQQSTCVSSPQQNGITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNK 275
           F +  G + Q +C  +  QNG+ ERK+RH++E AR LL                   L  
Sbjct: 463 FLVSQGTLPQLSCPGAHAQNGVAERKHRHIIETARTLLIASFVPAHFWAEAISTAVYLIN 522

Query: 276 LYVISCFKP*DSSRNLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVG 335
           +   S  +       L       P +          +      +  + T  ++ + VF+G
Sbjct: 523 MQPSSSLQGRSPGEVL---FGSPPRYDHLRVFGCTCYVLLAPRERTKLT-AQSVECVFLG 578

Query: 336 YSPTRKGYKCLDLNSKRFLVTMDVTFF*K*TFFF 369
           YS   KGY+C D +++R  ++ DVTF     FF+
Sbjct: 579 YSLEHKGYRCYDPSARRIRISRDVTFDENKPFFY 612


>gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm,
           score: 11.19) [Arabidopsis thaliana]
           gi|7486705|pir||T01879 hypothetical protein F8M12.17 -
           Arabidopsis thaliana
          Length = 1633

 Score = 86.3 bits (212), Expect = 5e-15
 Identities = 115/431 (26%), Positives = 178/431 (40%), Gaps = 63/431 (14%)

Query: 32  GKTIVSAKKNGGLYYLEDELETGHQLGQISSFSESFFVSNNKDDVMLWHLRLGHPSFKYL 91
           G  I   K    LY LE          Q +SFS S   + ++           HPS   L
Sbjct: 473 GLMIGRGKTYNNLYILET---------QRTSFSPSLPAATSR-----------HPSLPAL 512

Query: 92  KIVFPKLFFGHDFSSF--QCEIYEFSKQHRSSFPVQTYKPSKLFSIIHSDVWGPNRINSY 149
           + +   +      SS    C I   +KQ R ++       S  F +IH D+WGP  I S 
Sbjct: 513 QKLVSSIPSLKSVSSTASHCRISPLAKQKRLAYVSHNNLASSPFDLIHLDIWGPFSIESV 572

Query: 150 PTKDGLSPLLMIILDFIG--YIY*KKNQR*DKLSRILLN*CRLSLIPLYKFSELTMELNI 207
              DG    L ++ D     ++Y  KN+   ++S I     +L +   Y      +  + 
Sbjct: 573 ---DGFRYFLTLVDDCTRTTWVYMMKNK--SEVSNIFPVFVKL-IFTQYNAKIKAIRSDN 626

Query: 208 LTQF*ETFFFMENGIVQQSTCVSSPQQNGITERKNRHLLEMARALLFFH*SSKILMG*GC 267
           + +   T F  E G++ Q +C  +PQQN + ERK++HLL +AR+LLF    S + +    
Sbjct: 627 VKELAFTKFVKEQGMIHQFSCAYTPQQNSVVERKHQHLLNIARSLLF---QSNVPLQ--Y 681

Query: 268 INCCTLNKLYVISCFKP*--DSSRNLLKILSKCPYFSR------FAFKNIR-MHYFCP*T 318
            + C L   Y+I+       D+      +L K P ++       +A  N+   + F P  
Sbjct: 682 WSDCVLTAAYLINRLPSPLLDNKTPFELLLKKIPDYTLLKSCLCYASTNVHDRNKFSP-- 739

Query: 319 QTNRQT*TRASKRVFVGYSPTRKGYKCLDLNSKRFLVTMDVTFF*K*TFFFRTIIFKGGN 378
                   RA   VF+GY    KGYK LDL S    +T +V F  +  F F+T  F    
Sbjct: 740 --------RARPCVFLGYPSGYKGYKVLDLESHSISITRNVVFH-ETKFPFKTSKF---- 786

Query: 379 QMKIHLIFFEDLIL-FENMFMSHSSRPFVSKENAPDN--VSEHTPSMSEDVTKLVATNQN 435
            +K  +  F + IL          S P      A DN   + ++ S +  +  L +T   
Sbjct: 787 -LKESVDMFPNSILPLPAPLHFVESMPLDDDLRADDNNASTSNSASSASSIPPLPSTVNT 845

Query: 436 SNNDSLEPNDN 446
            N D+L+ + N
Sbjct: 846 QNTDALDIDTN 856


>gb|AAO26691.1| gag-pol polyprotein [Vitis vinifera]
          Length = 450

 Score = 81.6 bits (200), Expect = 1e-13
 Identities = 51/148 (34%), Positives = 71/148 (47%), Gaps = 17/148 (11%)

Query: 2   NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
           NL+SVSKL  +  C   FF  HC+FQD ++ +T      + GLY L++ +         +
Sbjct: 316 NLISVSKLTKNLNCSVSFFPDHCVFQDLMTKRTFGKGHVSDGLYILDEWVPRPVACVSTA 375

Query: 62  SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSS 121
           S  E+             H RLGHPS   LK + P+        S  C+   F+K HRSS
Sbjct: 376 SPVEA-------------HCRLGHPSLPVLKKLCPQF---DTLPSLDCKSCHFAKHHRSS 419

Query: 122 F-PVQTYKPSKLFSIIHSDVWGPNRINS 148
             P    +   LF ++HSDVWGP  + S
Sbjct: 420 LGPRLNKRAESLFELVHSDVWGPCPVTS 447


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.355    0.158    0.554 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,528,345,794
Number of Sequences: 2540612
Number of extensions: 57345355
Number of successful extensions: 313861
Number of sequences better than 10.0: 533
Number of HSP's better than 10.0 without gapping: 440
Number of HSP's successfully gapped in prelim test: 93
Number of HSP's that attempted gapping in prelim test: 312252
Number of HSP's gapped (non-prelim): 1308
length of query: 1075
length of database: 863,360,394
effective HSP length: 139
effective length of query: 936
effective length of database: 510,215,326
effective search space: 477561545136
effective search space used: 477561545136
T: 11
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.6 bits)
S2: 81 (35.8 bits)


Medicago: description of AC148227.2