Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC141114.8 - phase: 0 
         (239 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAT40504.1| putative polyprotein [Solanum demissum]                282  6e-75
gb|AAT40500.1| putative reverse transcriptase [Solanum demissum]      171  2e-41
gb|AAM47598.1| NBS/LRR resistance protein-like protein [Capsicum...   130  2e-29
emb|CAF90588.1| unnamed protein product [Tetraodon nigroviridis]      104  2e-21
gb|AAK14815.1| polyprotein [Schistosoma japonicum]                     85  2e-15
gb|AAM93546.1| polyprotein [Schistosoma japonicum]                     85  2e-15
gb|AAT39320.1| hypothetical protein PGEC400L14.17 [Solanum demis...    82  1e-14
gb|AAF36061.1| Hypothetical protein Y76B12C.5 [Caenorhabditis el...    82  2e-14
gb|AAB00700.1| Hypothetical protein C34D4.5 [Caenorhabditis eleg...    81  3e-14
gb|AAC24982.2| reverse transcriptase [synthetic construct]             81  3e-14
gi|67625701 TPA: endonuclease-reverse transcriptase [Schistosoma...    80  4e-14
gb|AAK18958.1| Hypothetical protein F56C9.2 [Caenorhabditis eleg...    78  2e-13
gb|AAF36059.1| Hypothetical protein Y75D11A.4 [Caenorhabditis el...    77  3e-13
ref|XP_556470.1| ENSANGP00000028171 [Anopheles gambiae str. PEST...    74  3e-12
ref|XP_552671.1| ENSANGP00000005174 [Anopheles gambiae str. PEST...    72  1e-11
gb|AAC70880.1| Hypothetical protein F21E9.5 [Caenorhabditis eleg...    70  7e-11
gb|AAF60814.1| Hypothetical protein Y58G8A.2 [Caenorhabditis ele...    68  2e-10
ref|NP_493497.1| predicted CDS, reverse transcriptase family mem...    68  2e-10
pir||T20517 hypothetical protein F02E9.8 - Caenorhabditis elegans      66  9e-10
gb|AAF36001.1| Hypothetical protein Y71F9AL.3 [Caenorhabditis el...    61  3e-08

>gb|AAT40504.1| putative polyprotein [Solanum demissum]
          Length = 832

 Score =  282 bits (721), Expect = 6e-75
 Identities = 133/228 (58%), Positives = 166/228 (72%)

Query: 2   YEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLDVLTEHIQELAPRCMLFADDV 61
           Y    T VRT  G +E FP+ IGLHQGS LSP+LF LV+D LT  IQE  P CMLFADD+
Sbjct: 370 YYTAKTRVRTVGGDSEHFPVEIGLHQGSVLSPFLFALVMDELTRSIQETVPWCMLFADDI 429

Query: 62  VLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHIIPQ 121
           VL+ E+R+ VN RLE WRQ LE+ GFRLSR+KTEY+   FS     + +EV++   +IP+
Sbjct: 430 VLIDETRDRVNARLEVWRQTLESKGFRLSRTKTEYLGCKFSDGLDETDVEVRLAAQVIPK 489

Query: 122 VTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIRPA 181
              F+YLG+ +Q  G+I+ DV+HR+ A W+KWR ASGVLCDKK+  KLKGKFYR  +RPA
Sbjct: 490 KESFRYLGAVIQGSGDIDDDVTHRVGAAWMKWRLASGVLCDKKISPKLKGKFYRVVVRPA 549

Query: 182 LLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIREGRG 229
           LLYG ECW VK+ H +++ V EMRMLRWM G TR D+IRN+ IRE  G
Sbjct: 550 LLYGAECWPVKNAHVHKMHVAEMRMLRWMCGHTRSDKIRNEVIREKVG 597


>gb|AAT40500.1| putative reverse transcriptase [Solanum demissum]
          Length = 213

 Score =  171 bits (433), Expect = 2e-41
 Identities = 77/135 (57%), Positives = 99/135 (73%)

Query: 95  EYMEWNFSGRRSRSTLEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWR 154
           EY+   FS     + +EV++   +IP+   FKYLG+ +Q  G+I+ DV+HR+ A W+KWR
Sbjct: 7   EYLGCKFSDVLDETDVEVRLAAQVIPKKESFKYLGAVIQGSGDIDDDVTHRVGAAWMKWR 66

Query: 155 RASGVLCDKKVPLKLKGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKT 214
            ASGVLCDKK+PLKLKGKFYR  +RPALLYG ECW VK+ H +++ V EMRMLRWM G T
Sbjct: 67  LASGVLCDKKIPLKLKGKFYRVVVRPALLYGAECWPVKNAHVHKMHVAEMRMLRWMCGHT 126

Query: 215 RQDRIRNDTIREGRG 229
           R D+IRN+ IRE  G
Sbjct: 127 RSDKIRNEVIREKVG 141


>gb|AAM47598.1| NBS/LRR resistance protein-like protein [Capsicum annuum]
          Length = 122

 Score =  130 bits (328), Expect = 2e-29
 Identities = 61/110 (55%), Positives = 83/110 (75%)

Query: 52  PRCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLE 111
           P CMLFADDVVL+ E+R  VN +LE WRQ LE+ GFR+SR+KTEY+E  F+  R  + + 
Sbjct: 10  PWCMLFADDVVLIDETRGGVNDKLELWRQTLESKGFRVSRTKTEYVECKFNDVRRENEVV 69

Query: 112 VKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLC 161
           V++    + +  +FKYLGS +Q++GEI+ DVSHRI AGW+KW+ ASGV+C
Sbjct: 70  VRLEAQEVKKRDKFKYLGSVIQSNGEIDEDVSHRIGAGWMKWKLASGVMC 119


>emb|CAF90588.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 183

 Score =  104 bits (259), Expect = 2e-21
 Identities = 65/175 (37%), Positives = 99/175 (56%), Gaps = 17/175 (9%)

Query: 33  PYLFTLVLDVLTEHIQELAPRCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRS 92
           P L  +V+D LT+ +++ +P   +FA D+V+   SRE+V  +LE  R ALE+       S
Sbjct: 19  PLLVAMVMDRLTDEVRQESPWTTMFAGDIVMC--SREQVEEKLEERRFALES-------S 69

Query: 93  KTEYMEWNFSGRRSRSTLEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLK 152
           KTE  E + SG       E+K G+ +       K LGS VQ  GE   +V  R+QAGW  
Sbjct: 70  KTEN-ERDLSGSVRLQGEEIKKGEDL-------KNLGSTVQTSGECGKEVKKRVQAGWNW 121

Query: 153 WRRASGVLCDKKVPLKLKGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRML 207
           W + SGV+CD+ V  K+K K  +T  RPA+++G E   ++ + E ++ + EM+ L
Sbjct: 122 WGKVSGVMCDRGVSAKIKRKVDKTVARPAIIFGLETVPLRKRQEAELELAEMKAL 176


>gb|AAK14815.1| polyprotein [Schistosoma japonicum]
          Length = 1091

 Score = 85.1 bits (209), Expect = 2e-15
 Identities = 63/250 (25%), Positives = 111/250 (44%), Gaps = 20/250 (8%)

Query: 1   MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLDVLTEHIQ--------ELAP 52
           +Y   +  VR     +     + G+ Q   LSP+LF  ++D+L E           +L P
Sbjct: 744 LYSNTTCRVRAYGRLSSELTTSSGVRQACPLSPFLFNFIIDILLELTLSSSDFPGVDLFP 803

Query: 53  RCML----FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYM--EWNFSGRRS 106
              L    +ADD+VL+ E  +++   L T    +   G R S SK + +  +W       
Sbjct: 804 GDKLTDLEYADDIVLLSEDADKMQDFLTTLNMNVSMLGMRFSPSKCKMLLQDW------L 857

Query: 107 RSTLEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVP 166
            S  ++ +G   I  V RF YLGS +  +G +  ++S RI      +     +   + + 
Sbjct: 858 NSAPKLVIGRETIECVNRFTYLGSLISPNGLVSDEISARIHKARSAFANLRHLWRRRDIR 917

Query: 167 LKLKGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIRE 226
           L  KG+ Y  A+R  L YG E W ++ +   ++ V + R LR ++     +R+ N  +R 
Sbjct: 918 LMTKGRVYCAAVRSVLPYGCETWPLRVEDIRRILVFDHRCLRNIARVCWDNRVSNAWVRN 977

Query: 227 GRGGIHSRKV 236
              G + + +
Sbjct: 978 RVLGKYGKSI 987


>gb|AAM93546.1| polyprotein [Schistosoma japonicum]
          Length = 976

 Score = 85.1 bits (209), Expect = 2e-15
 Identities = 63/250 (25%), Positives = 111/250 (44%), Gaps = 20/250 (8%)

Query: 1   MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLDVLTEHIQ--------ELAP 52
           +Y   +  VR     +     + G+ Q   LSP+LF  ++D+L E           +L P
Sbjct: 629 LYSNTTCRVRAYGRLSSELTTSSGVRQACPLSPFLFNFIIDILLELTLSSSDFPGVDLFP 688

Query: 53  RCML----FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYM--EWNFSGRRS 106
              L    +ADD+VL+ E  +++   L T    +   G R S SK + +  +W       
Sbjct: 689 GDKLTDLEYADDIVLLSEDADKMQDFLTTLNMNVSMLGMRFSPSKCKMLLQDW------L 742

Query: 107 RSTLEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVP 166
            S  ++ +G   I  V RF YLGS +  +G +  ++S RI      +     +   + + 
Sbjct: 743 NSAPKLVIGRETIECVNRFTYLGSLISPNGLVSDEISARIHKARSAFANLRHLWRRRDIR 802

Query: 167 LKLKGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIRE 226
           L  KG+ Y  A+R  L YG E W ++ +   ++ V + R LR ++     +R+ N  +R 
Sbjct: 803 LMTKGRVYCAAVRSVLPYGCETWPLRVEDIRRILVFDHRCLRNIARVCWDNRVSNAWVRN 862

Query: 227 GRGGIHSRKV 236
              G + + +
Sbjct: 863 RVLGKYGKSI 872


>gb|AAT39320.1| hypothetical protein PGEC400L14.17 [Solanum demissum]
          Length = 139

 Score = 82.4 bits (202), Expect = 1e-14
 Identities = 50/166 (30%), Positives = 75/166 (45%), Gaps = 49/166 (29%)

Query: 59  DDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHI 118
           +D+VL+ E+R  VN R E WR  L++ G  LS++KTEYME  FS    ++  +V++   +
Sbjct: 4   NDIVLINETRGGVNDRQEIWRSTLDSKGLTLSKTKTEYMECKFSVASEKANRKVRIDTQL 63

Query: 119 IPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAI 178
           IP+   FK +                                                 I
Sbjct: 64  IPKKGSFKVI-------------------------------------------------I 74

Query: 179 RPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTI 224
           R  +LY  EC +V++ +  Q+ V EMRM RWM  +TR+D+I N  I
Sbjct: 75  RQTMLYEVECLSVQNSYVQQMKVAEMRMFRWMCRQTRKDKIGNKDI 120


>gb|AAF36061.1| Hypothetical protein Y76B12C.5 [Caenorhabditis elegans]
           gi|17544228|ref|NP_500154.1| predicted CDS, reverse
           transcriptase family member (4C744) [Caenorhabditis
           elegans]
          Length = 938

 Score = 81.6 bits (200), Expect = 2e-14
 Identities = 67/200 (33%), Positives = 100/200 (49%), Gaps = 19/200 (9%)

Query: 1   MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLD-VLTEHIQELAP------- 52
           + EG    +   D   +V   T G+ QG + SP LF+  L  +LT+   ELA        
Sbjct: 696 LMEGGQAEITVHDKKLKVNLCT-GIRQGDSASPALFSAALQAILTDCDNELAGVGISVEG 754

Query: 53  ---RCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRST 109
              R + FADDVVL+  + EEV  RLE   +    YG ++++SKT  ++  F     RS 
Sbjct: 755 RHIRRLEFADDVVLICSTPEEVQERLEILDRISSNYGLKINQSKTVLLKNKF----CRSQ 810

Query: 110 LEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKL 169
             +  G  IIP V   +YLG ++   G I+ ++S RI+AGW        VL  + +P K 
Sbjct: 811 DILFNGSPIIP-VPGCRYLGRWIDISGSIDEEISRRIRAGWGALVGIKEVL--RIMPNKE 867

Query: 170 KGKFYRTAIRPALLYGTECW 189
           +   ++  + PALLY +E W
Sbjct: 868 RIILFKQNVLPALLYASETW 887


>gb|AAB00700.1| Hypothetical protein C34D4.5 [Caenorhabditis elegans]
           gi|17539028|ref|NP_501121.1| predicted CDS, reverse
           transcriptase family member (4H911) [Caenorhabditis
           elegans] gi|7497012|pir||T29286 hypothetical protein
           C34D4.5 - Caenorhabditis elegans
          Length = 624

 Score = 80.9 bits (198), Expect = 3e-14
 Identities = 66/200 (33%), Positives = 98/200 (49%), Gaps = 19/200 (9%)

Query: 1   MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLD-VLTEHIQELAP------- 52
           M EG    +   D   +V  +  G+ QG + SP LF+  L  +LT+   E A        
Sbjct: 296 MMEGGQAEISVHDKKLKV-NLRTGVRQGDSASPALFSAALQAILTDCDNEFAGVGIKVEG 354

Query: 53  ---RCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRST 109
              R + FADDVVL+  + EEV  RLE   +    YG ++++SKT  ++  F     RS 
Sbjct: 355 RHIRRLEFADDVVLICSTPEEVQERLEILDRISSIYGLKINQSKTVLLKNKF----CRSQ 410

Query: 110 LEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKL 169
                G  IIP V   +YLG ++   G I+ ++S RI+AGW        VL  + +P K 
Sbjct: 411 DVFFNGSPIIP-VPGCRYLGRWIDISGSIDEEISRRIRAGWGALVGIKEVL--RIMPNKE 467

Query: 170 KGKFYRTAIRPALLYGTECW 189
           +   ++  + PALLY +E W
Sbjct: 468 RIILFKQNVLPALLYASETW 487


>gb|AAC24982.2| reverse transcriptase [synthetic construct]
          Length = 1016

 Score = 80.9 bits (198), Expect = 3e-14
 Identities = 59/237 (24%), Positives = 110/237 (45%), Gaps = 16/237 (6%)

Query: 1   MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLF---------TLVLDVLTEHIQELA 51
           +Y   S  VR  +  + +F  + G+ QG  +SP+LF         T ++DV    +  L 
Sbjct: 669 LYTNTSGRVRAYNHLSPLFHSSSGVRQGCPISPFLFNFAIDDILETALMDVSNGGVDMLP 728

Query: 52  PRCML---FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRS 108
              +L   +ADD+VL+ ++ + +   L     ++  YG   + SK + +  ++       
Sbjct: 729 GERLLDLEYADDIVLLCDNAQGMQSALNQLAISVRRYGMCFAPSKCKVLLQDWQDSHPVL 788

Query: 109 TLEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLK 168
           TL+   G+ I   V +F YLGS++   G +  ++  RI      +     +   + V L 
Sbjct: 789 TLD---GEQI-EVVEKFVYLGSYISAGGGVSDEIDARIMKARAAYANLGHLWRLRDVSLA 844

Query: 169 LKGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIR 225
           +KG+ Y  ++R  LLY  E W ++ +   ++SV + R LR ++    Q  + N  +R
Sbjct: 845 VKGRIYNASVRAVLLYACETWPLRVEDVRRLSVFDHRCLRRIADIQWQHHVSNAEVR 901


>gi|67625701 TPA: endonuclease-reverse transcriptase [Schistosoma mansoni]
          Length = 992

 Score = 80.5 bits (197), Expect = 4e-14
 Identities = 60/249 (24%), Positives = 110/249 (44%), Gaps = 16/249 (6%)

Query: 2   YEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLDVLTE-------HIQELAPRC 54
           Y+G++  +      T+ F +  G+ QG  LSP+LF LV+D + +       H  +   R 
Sbjct: 666 YDGLNCQIVHGGQLTDSFEVKTGVRQGCLLSPFLFLLVIDWIMKTSTSGGMHGIQWTGRM 725

Query: 55  ML----FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTL 110
            L    FADD+ L+ ++++++  +  +   A  A G  +++ K++ + +N     +  T 
Sbjct: 726 QLDDLDFADDLALLSQTQQQMQEKTTSVAAASAAVGLNINKGKSKTLRYN-----TICTN 780

Query: 111 EVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLK 170
            + +    +  V  F YLGS +   G  +ADV  RI      + +   +   K++    K
Sbjct: 781 PITLDGEALEDVEIFTYLGSIIDEHGGSDADVRARIGKARAAYLQLKNIWSSKQLSTNTK 840

Query: 171 GKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIREGRGG 230
            + + T ++  LLYG E W        ++ V     LR +      D I N  + E    
Sbjct: 841 VRIFNTNVKTVLLYGAETWRTTKAIIQKIQVFINSCLRKILRIRWPDTISNKLLWETTNQ 900

Query: 231 IHSRKVGRK 239
           I + +  RK
Sbjct: 901 IPAEEEIRK 909


>gb|AAK18958.1| Hypothetical protein F56C9.2 [Caenorhabditis elegans]
           gi|17553662|ref|NP_498615.1| predicted CDS, reverse
           transcriptase family member (3I419) [Caenorhabditis
           elegans] gi|7504398|pir||T16474 hypothetical protein
           F56C9.2 - Caenorhabditis elegans
          Length = 772

 Score = 78.2 bits (191), Expect = 2e-13
 Identities = 71/231 (30%), Positives = 109/231 (46%), Gaps = 26/231 (11%)

Query: 1   MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLD-VLTEHIQELAP------- 52
           M +G    +   D   +V   T G+ QG + SP LF+  L  +LT+   E A        
Sbjct: 444 MMDGGQAEITVHDKKLKVNLCT-GVRQGDSASPALFSAALQAILTDCDNEFAGVGINVEG 502

Query: 53  ---RCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRST 109
              R + FADDVVL+  +  EV  RLE   +    YG ++++SKT  ++  F     RS 
Sbjct: 503 RHIRRLEFADDVVLICSTPGEVQERLEILDRISSNYGLKINQSKTVLLKNKF----CRSQ 558

Query: 110 LEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKL 169
             +  G  IIP V   +YLG ++   G I+ ++S RI+AGW        VL  + +P K 
Sbjct: 559 DVLFNGSPIIP-VPGCRYLGRWIDISGSIDEEISRRIRAGWGALVGIKEVL--RIMPNKE 615

Query: 170 KGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIR 220
           +   ++  + PALLY +E W   +        + +R+ R +SG      IR
Sbjct: 616 RIILFKQNVLPALLYASETWTCNAG-------STLRLKRTVSGLIDAAEIR 659


>gb|AAF36059.1| Hypothetical protein Y75D11A.4 [Caenorhabditis elegans]
           gi|17570529|ref|NP_508323.1| predicted CDS, reverse
           transcriptase family member (XC378) [Caenorhabditis
           elegans]
          Length = 480

 Score = 77.4 bits (189), Expect = 3e-13
 Identities = 61/177 (34%), Positives = 90/177 (50%), Gaps = 18/177 (10%)

Query: 24  GLHQGSTLSPYLFTLVLD-VLTEHIQELAP----------RCMLFADDVVLVGESREEVN 72
           G+ QG + SP LF+  L  +LT+   ELA           R + FADDVVL+  + EEV 
Sbjct: 174 GVRQGDSASPALFSAALQAILTDCDNELAGVGISVEGRHIRRLEFADDVVLICSTPEEVQ 233

Query: 73  GRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHIIPQVTRFKYLGSFV 132
            RLE   +    YG ++ +SKT  ++  F     RS   +  G  IIP V   +YLG ++
Sbjct: 234 ERLEILDRISSYYGLKIDQSKTVLLKNKF----CRSQDVLFNGSPIIP-VPGCRYLGRWI 288

Query: 133 QNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIRPALLYGTECW 189
              G I+ ++S RI+AGW        VL  + +P K     ++  + PALLY ++ W
Sbjct: 289 DISGSIDEEISRRIRAGWGALVGIKEVL--RIMPNKENIILFKQNVLPALLYASKTW 343


>ref|XP_556470.1| ENSANGP00000028171 [Anopheles gambiae str. PEST]
           gi|55238617|gb|EAL39934.1| ENSANGP00000028171 [Anopheles
           gambiae str. PEST]
          Length = 777

 Score = 73.9 bits (180), Expect = 3e-12
 Identities = 61/223 (27%), Positives = 95/223 (42%), Gaps = 18/223 (8%)

Query: 5   VSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLD--------VLTEHIQELAPRCML 56
           V+  VR     +  F  T GL QG  L+  LF L L+          T  I   + + + 
Sbjct: 515 VTCQVRVDGKLSGPFATTKGLRQGDGLACLLFNLALERAIRDSRVETTGTIFYKSTQILA 574

Query: 57  FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYM-------EWNFSGRRSRST 109
           +ADD+ ++G     V    +   QA E  G +++ +KT+ M         N    R R  
Sbjct: 575 YADDIDIIGLRLSYVAEAYQGIEQAAENLGLQINEAKTKLMVATSADLPINNPNLRRR-- 632

Query: 110 LEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKL 169
            +V++G+     V  F YLGS V ND  +E ++  R+ A    +         K +  + 
Sbjct: 633 -DVQIGERTFEVVPEFTYLGSKVSNDNSMEVELRARMLAANRSFYSLKKQFTSKNLSRRT 691

Query: 170 KGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSG 212
           K   Y T I P L Y +E W +    E  ++  E +MLR + G
Sbjct: 692 KLGLYSTYIVPVLTYASETWTLSKSDEALLAAFERKMLRRILG 734


>ref|XP_552671.1| ENSANGP00000005174 [Anopheles gambiae str. PEST]
           gi|55235101|gb|EAL38938.1| ENSANGP00000005174 [Anopheles
           gambiae str. PEST]
          Length = 329

 Score = 72.0 bits (175), Expect = 1e-11
 Identities = 56/216 (25%), Positives = 93/216 (42%), Gaps = 12/216 (5%)

Query: 5   VSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLD--------VLTEHIQELAPRCML 56
           V+  VR     +  F  T GL QG  L+  LF L L+          T  I   + + + 
Sbjct: 18  VTCQVRVDGKLSGPFATTKGLRQGDGLACLLFNLALERAIRDSRVETTGTIFYKSTQILA 77

Query: 57  FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSG----RRSRSTLEV 112
           +ADD+ ++      V    +   QA E+ G +++ +KT+ M    +G     ++    +V
Sbjct: 78  YADDIDIIDLRLSYVAEAYQGIEQAAESLGLQINEAKTKLMVATSAGLPINNQNLRRRDV 137

Query: 113 KVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGK 172
           ++G+     V  F  LGS V ND  +EA++  R+ A    +         K +  + K  
Sbjct: 138 QIGERTFEVVPEFTCLGSKVSNDNSMEAELRARMLAANRSFYSLKKQFTSKNLSRRTKLG 197

Query: 173 FYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLR 208
            Y T I P   Y +E W +    E  ++  E +MLR
Sbjct: 198 LYSTYIVPVFTYASETWTLSKSDETLLAAFERKMLR 233


>gb|AAC70880.1| Hypothetical protein F21E9.5 [Caenorhabditis elegans]
           gi|17567237|ref|NP_508248.1| predicted CDS, reverse
           transcriptase family member (XB968) [Caenorhabditis
           elegans] gi|7499579|pir||T31973 hypothetical protein
           F21E9.5 - Caenorhabditis elegans
          Length = 864

 Score = 69.7 bits (169), Expect = 7e-11
 Identities = 52/225 (23%), Positives = 95/225 (42%), Gaps = 19/225 (8%)

Query: 20  PITIGLHQGSTLSPYLFTLVLDVL----------------TEHIQELAPRC--MLFADDV 61
           P+T G+ QG  +SP LF+  L+ +                TE I+        + FADD+
Sbjct: 517 PVTRGVRQGDPISPNLFSACLEHVFRQLNWKHFKGDERYETEGIRVNGQNLTNLRFADDI 576

Query: 62  VLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHIIPQ 121
           VLV  +    +  L    +   + G +++  KT+ +   F+ +R          + II  
Sbjct: 577 VLVAHNPRTASQMLTELVEKCSSVGLKINTGKTKVLRNRFAYKRKVEIRCPNTTNIIIDD 636

Query: 122 VTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIRPA 181
           V  + YLG  + +   +  ++  R +A W  +            P +L+   + + + PA
Sbjct: 637 VNEYIYLGRQINDSNNLLPELHRRRRAAWAAFTNIKSTTDQITCP-RLRANLFDSTVLPA 695

Query: 182 LLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIRE 226
           L YG+E W    +   +V VT   + R + G T  ++ + +  RE
Sbjct: 696 LTYGSEAWTFTKELAERVRVTHAALERKLVGLTLTEQRKRNIHRE 740


>gb|AAF60814.1| Hypothetical protein Y58G8A.2 [Caenorhabditis elegans]
           gi|17566042|ref|NP_503166.1| predicted CDS, reverse
           transcriptase family member (5A739) [Caenorhabditis
           elegans]
          Length = 769

 Score = 68.2 bits (165), Expect = 2e-10
 Identities = 53/229 (23%), Positives = 95/229 (41%), Gaps = 23/229 (10%)

Query: 20  PITIGLHQGSTLSPYLFTLVLDVL----------------TEHIQELAPRC--MLFADDV 61
           P+  G+ QG  +SP LF+  L+ +                TE I+        + FADD+
Sbjct: 464 PVIRGVRQGDPISPNLFSACLEHVFRQLNWKHFKGDERYETEGIRVNGQNLTNLRFADDI 523

Query: 62  VLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHIIPQ 121
           VLV  +    +  L    +   + G +++  KT+ +   F+ +R          + II  
Sbjct: 524 VLVAHNPRTASQMLTELVEKCSSVGLKINTGKTKVLRNRFAYKRKVEIRCPNTTNIIIDD 583

Query: 122 VTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIRPA 181
           V  + YLG  + +   +  ++  R +A W  +            P +L+   + + + PA
Sbjct: 584 VNEYIYLGRQINDSNNLLPELHRRRRAAWAAFTNIKSTTDQITCP-RLRANLFDSTVLPA 642

Query: 182 LLYGTECWAVKSQHENQVSVTEMRMLRWMSGKT----RQDRIRNDTIRE 226
           L YG+E W    +   +V VT   + R + G T    R+  I  + +RE
Sbjct: 643 LTYGSEAWTFTKELAERVRVTHAALERKLVGLTLTEQRERNIHREEVRE 691


>ref|NP_493497.1| predicted CDS, reverse transcriptase family member (1O881)
           [Caenorhabditis elegans]
          Length = 835

 Score = 68.2 bits (165), Expect = 2e-10
 Identities = 49/200 (24%), Positives = 86/200 (42%), Gaps = 19/200 (9%)

Query: 20  PITIGLHQGSTLSPYLFTLVLDVL----------------TEHIQELAPRC--MLFADDV 61
           P+T G+ QG  +SP LF+  L+ +                TE I+        + FADD+
Sbjct: 583 PVTRGVRQGDPISPNLFSACLEHVFRQLNWKHFKGDERYETEGIRVNGQNLTNLRFADDI 642

Query: 62  VLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHIIPQ 121
           VLV  +   V+  L    +   + G +++  KT+ +   FS +R          + II  
Sbjct: 643 VLVAHNPRTVSQMLTELVEKCSSVGLKINTGKTKVLRNRFSYKRKVEIRCPNTTNIIIDD 702

Query: 122 VTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIRPA 181
           V  + YLG  + +   + A++  R +A W  +      +     P +L+   + + + PA
Sbjct: 703 VNEYIYLGRQINDSNNLLAELHRRRRAAWAAFTNIKSTMDQITCP-RLRANLFDSTVLPA 761

Query: 182 LLYGTECWAVKSQHENQVSV 201
           L YG+E W    +   +V V
Sbjct: 762 LTYGSEAWTFTKELAERVRV 781


>pir||T20517 hypothetical protein F02E9.8 - Caenorhabditis elegans
          Length = 341

 Score = 65.9 bits (159), Expect = 9e-10
 Identities = 48/137 (35%), Positives = 72/137 (52%), Gaps = 7/137 (5%)

Query: 53  RCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEV 112
           R + FADDVVL   +  EV  RLE   +    YG ++++SKT  ++  F     RS   +
Sbjct: 75  RRLEFADDVVLTCSTPGEVQERLEILDRISSNYGLKINQSKTVLLKNKF----CRSQDVL 130

Query: 113 KVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGK 172
             G  IIP V   +YLG ++   G I+ ++S RI+AGW        VL  + +P K +  
Sbjct: 131 FNGSPIIP-VPGCRYLGRWIDISGSIDEEISRRIRAGWGALVGIKEVL--RIMPNKERII 187

Query: 173 FYRTAIRPALLYGTECW 189
            ++  + PALLY +E W
Sbjct: 188 LFKQNVLPALLYASETW 204


>gb|AAF36001.1| Hypothetical protein Y71F9AL.3 [Caenorhabditis elegans]
           gi|17510457|ref|NP_491073.1| reverse transcriptase
           family member (1D477) [Caenorhabditis elegans]
          Length = 423

 Score = 60.8 bits (146), Expect = 3e-08
 Identities = 48/227 (21%), Positives = 86/227 (37%), Gaps = 22/227 (9%)

Query: 20  PITIGLHQGSTLSPYLFTLVLDVLTEHIQELA--------------------PRCMLFAD 59
           P+T G+ QG  +SP LF+  L+ +   +  +                     P  + FAD
Sbjct: 100 PVTKGVRQGDPISPNLFSACLEHVFRKLSWIELKGEAEDYDTIPGMRVNGRNPTNLRFAD 159

Query: 60  DVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHII 119
           D+VL+       +  L+   Q     G  ++  KT+ +   F+   S+           +
Sbjct: 160 DIVLIANHPNTASKMLQELVQKCSEVGLEINTGKTKVLRNRFADP-SKVYFGSPSPTTQL 218

Query: 120 PQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIR 179
             V  + YLG  +     +  ++  R +A W  +        D     K++   + + + 
Sbjct: 219 DDVDEYIYLGRQINAQNNLMPEIHRRRRAAWAAFNGIKNT-ADSITDKKIRANLFDSIVL 277

Query: 180 PALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIRE 226
           PAL YG+E W        +V +T   + R + G T   +   D  RE
Sbjct: 278 PALTYGSEAWTFTKALSERVRITHASLERRLVGITLTQQRERDLHRE 324


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.321    0.136    0.410 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 398,883,470
Number of Sequences: 2540612
Number of extensions: 15840254
Number of successful extensions: 33864
Number of sequences better than 10.0: 1211
Number of HSP's better than 10.0 without gapping: 274
Number of HSP's successfully gapped in prelim test: 937
Number of HSP's that attempted gapping in prelim test: 33068
Number of HSP's gapped (non-prelim): 1255
length of query: 239
length of database: 863,360,394
effective HSP length: 124
effective length of query: 115
effective length of database: 548,324,506
effective search space: 63057318190
effective search space used: 63057318190
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)


Medicago: description of AC141114.8