
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC141114.8 - phase: 0
(239 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAT40504.1| putative polyprotein [Solanum demissum] 282 6e-75
gb|AAT40500.1| putative reverse transcriptase [Solanum demissum] 171 2e-41
gb|AAM47598.1| NBS/LRR resistance protein-like protein [Capsicum... 130 2e-29
emb|CAF90588.1| unnamed protein product [Tetraodon nigroviridis] 104 2e-21
gb|AAK14815.1| polyprotein [Schistosoma japonicum] 85 2e-15
gb|AAM93546.1| polyprotein [Schistosoma japonicum] 85 2e-15
gb|AAT39320.1| hypothetical protein PGEC400L14.17 [Solanum demis... 82 1e-14
gb|AAF36061.1| Hypothetical protein Y76B12C.5 [Caenorhabditis el... 82 2e-14
gb|AAB00700.1| Hypothetical protein C34D4.5 [Caenorhabditis eleg... 81 3e-14
gb|AAC24982.2| reverse transcriptase [synthetic construct] 81 3e-14
gi|67625701 TPA: endonuclease-reverse transcriptase [Schistosoma... 80 4e-14
gb|AAK18958.1| Hypothetical protein F56C9.2 [Caenorhabditis eleg... 78 2e-13
gb|AAF36059.1| Hypothetical protein Y75D11A.4 [Caenorhabditis el... 77 3e-13
ref|XP_556470.1| ENSANGP00000028171 [Anopheles gambiae str. PEST... 74 3e-12
ref|XP_552671.1| ENSANGP00000005174 [Anopheles gambiae str. PEST... 72 1e-11
gb|AAC70880.1| Hypothetical protein F21E9.5 [Caenorhabditis eleg... 70 7e-11
gb|AAF60814.1| Hypothetical protein Y58G8A.2 [Caenorhabditis ele... 68 2e-10
ref|NP_493497.1| predicted CDS, reverse transcriptase family mem... 68 2e-10
pir||T20517 hypothetical protein F02E9.8 - Caenorhabditis elegans 66 9e-10
gb|AAF36001.1| Hypothetical protein Y71F9AL.3 [Caenorhabditis el... 61 3e-08
>gb|AAT40504.1| putative polyprotein [Solanum demissum]
Length = 832
Score = 282 bits (721), Expect = 6e-75
Identities = 133/228 (58%), Positives = 166/228 (72%)
Query: 2 YEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLDVLTEHIQELAPRCMLFADDV 61
Y T VRT G +E FP+ IGLHQGS LSP+LF LV+D LT IQE P CMLFADD+
Sbjct: 370 YYTAKTRVRTVGGDSEHFPVEIGLHQGSVLSPFLFALVMDELTRSIQETVPWCMLFADDI 429
Query: 62 VLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHIIPQ 121
VL+ E+R+ VN RLE WRQ LE+ GFRLSR+KTEY+ FS + +EV++ +IP+
Sbjct: 430 VLIDETRDRVNARLEVWRQTLESKGFRLSRTKTEYLGCKFSDGLDETDVEVRLAAQVIPK 489
Query: 122 VTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIRPA 181
F+YLG+ +Q G+I+ DV+HR+ A W+KWR ASGVLCDKK+ KLKGKFYR +RPA
Sbjct: 490 KESFRYLGAVIQGSGDIDDDVTHRVGAAWMKWRLASGVLCDKKISPKLKGKFYRVVVRPA 549
Query: 182 LLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIREGRG 229
LLYG ECW VK+ H +++ V EMRMLRWM G TR D+IRN+ IRE G
Sbjct: 550 LLYGAECWPVKNAHVHKMHVAEMRMLRWMCGHTRSDKIRNEVIREKVG 597
>gb|AAT40500.1| putative reverse transcriptase [Solanum demissum]
Length = 213
Score = 171 bits (433), Expect = 2e-41
Identities = 77/135 (57%), Positives = 99/135 (73%)
Query: 95 EYMEWNFSGRRSRSTLEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWR 154
EY+ FS + +EV++ +IP+ FKYLG+ +Q G+I+ DV+HR+ A W+KWR
Sbjct: 7 EYLGCKFSDVLDETDVEVRLAAQVIPKKESFKYLGAVIQGSGDIDDDVTHRVGAAWMKWR 66
Query: 155 RASGVLCDKKVPLKLKGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKT 214
ASGVLCDKK+PLKLKGKFYR +RPALLYG ECW VK+ H +++ V EMRMLRWM G T
Sbjct: 67 LASGVLCDKKIPLKLKGKFYRVVVRPALLYGAECWPVKNAHVHKMHVAEMRMLRWMCGHT 126
Query: 215 RQDRIRNDTIREGRG 229
R D+IRN+ IRE G
Sbjct: 127 RSDKIRNEVIREKVG 141
>gb|AAM47598.1| NBS/LRR resistance protein-like protein [Capsicum annuum]
Length = 122
Score = 130 bits (328), Expect = 2e-29
Identities = 61/110 (55%), Positives = 83/110 (75%)
Query: 52 PRCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLE 111
P CMLFADDVVL+ E+R VN +LE WRQ LE+ GFR+SR+KTEY+E F+ R + +
Sbjct: 10 PWCMLFADDVVLIDETRGGVNDKLELWRQTLESKGFRVSRTKTEYVECKFNDVRRENEVV 69
Query: 112 VKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLC 161
V++ + + +FKYLGS +Q++GEI+ DVSHRI AGW+KW+ ASGV+C
Sbjct: 70 VRLEAQEVKKRDKFKYLGSVIQSNGEIDEDVSHRIGAGWMKWKLASGVMC 119
>emb|CAF90588.1| unnamed protein product [Tetraodon nigroviridis]
Length = 183
Score = 104 bits (259), Expect = 2e-21
Identities = 65/175 (37%), Positives = 99/175 (56%), Gaps = 17/175 (9%)
Query: 33 PYLFTLVLDVLTEHIQELAPRCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRS 92
P L +V+D LT+ +++ +P +FA D+V+ SRE+V +LE R ALE+ S
Sbjct: 19 PLLVAMVMDRLTDEVRQESPWTTMFAGDIVMC--SREQVEEKLEERRFALES-------S 69
Query: 93 KTEYMEWNFSGRRSRSTLEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLK 152
KTE E + SG E+K G+ + K LGS VQ GE +V R+QAGW
Sbjct: 70 KTEN-ERDLSGSVRLQGEEIKKGEDL-------KNLGSTVQTSGECGKEVKKRVQAGWNW 121
Query: 153 WRRASGVLCDKKVPLKLKGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRML 207
W + SGV+CD+ V K+K K +T RPA+++G E ++ + E ++ + EM+ L
Sbjct: 122 WGKVSGVMCDRGVSAKIKRKVDKTVARPAIIFGLETVPLRKRQEAELELAEMKAL 176
>gb|AAK14815.1| polyprotein [Schistosoma japonicum]
Length = 1091
Score = 85.1 bits (209), Expect = 2e-15
Identities = 63/250 (25%), Positives = 111/250 (44%), Gaps = 20/250 (8%)
Query: 1 MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLDVLTEHIQ--------ELAP 52
+Y + VR + + G+ Q LSP+LF ++D+L E +L P
Sbjct: 744 LYSNTTCRVRAYGRLSSELTTSSGVRQACPLSPFLFNFIIDILLELTLSSSDFPGVDLFP 803
Query: 53 RCML----FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYM--EWNFSGRRS 106
L +ADD+VL+ E +++ L T + G R S SK + + +W
Sbjct: 804 GDKLTDLEYADDIVLLSEDADKMQDFLTTLNMNVSMLGMRFSPSKCKMLLQDW------L 857
Query: 107 RSTLEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVP 166
S ++ +G I V RF YLGS + +G + ++S RI + + + +
Sbjct: 858 NSAPKLVIGRETIECVNRFTYLGSLISPNGLVSDEISARIHKARSAFANLRHLWRRRDIR 917
Query: 167 LKLKGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIRE 226
L KG+ Y A+R L YG E W ++ + ++ V + R LR ++ +R+ N +R
Sbjct: 918 LMTKGRVYCAAVRSVLPYGCETWPLRVEDIRRILVFDHRCLRNIARVCWDNRVSNAWVRN 977
Query: 227 GRGGIHSRKV 236
G + + +
Sbjct: 978 RVLGKYGKSI 987
>gb|AAM93546.1| polyprotein [Schistosoma japonicum]
Length = 976
Score = 85.1 bits (209), Expect = 2e-15
Identities = 63/250 (25%), Positives = 111/250 (44%), Gaps = 20/250 (8%)
Query: 1 MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLDVLTEHIQ--------ELAP 52
+Y + VR + + G+ Q LSP+LF ++D+L E +L P
Sbjct: 629 LYSNTTCRVRAYGRLSSELTTSSGVRQACPLSPFLFNFIIDILLELTLSSSDFPGVDLFP 688
Query: 53 RCML----FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYM--EWNFSGRRS 106
L +ADD+VL+ E +++ L T + G R S SK + + +W
Sbjct: 689 GDKLTDLEYADDIVLLSEDADKMQDFLTTLNMNVSMLGMRFSPSKCKMLLQDW------L 742
Query: 107 RSTLEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVP 166
S ++ +G I V RF YLGS + +G + ++S RI + + + +
Sbjct: 743 NSAPKLVIGRETIECVNRFTYLGSLISPNGLVSDEISARIHKARSAFANLRHLWRRRDIR 802
Query: 167 LKLKGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIRE 226
L KG+ Y A+R L YG E W ++ + ++ V + R LR ++ +R+ N +R
Sbjct: 803 LMTKGRVYCAAVRSVLPYGCETWPLRVEDIRRILVFDHRCLRNIARVCWDNRVSNAWVRN 862
Query: 227 GRGGIHSRKV 236
G + + +
Sbjct: 863 RVLGKYGKSI 872
>gb|AAT39320.1| hypothetical protein PGEC400L14.17 [Solanum demissum]
Length = 139
Score = 82.4 bits (202), Expect = 1e-14
Identities = 50/166 (30%), Positives = 75/166 (45%), Gaps = 49/166 (29%)
Query: 59 DDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHI 118
+D+VL+ E+R VN R E WR L++ G LS++KTEYME FS ++ +V++ +
Sbjct: 4 NDIVLINETRGGVNDRQEIWRSTLDSKGLTLSKTKTEYMECKFSVASEKANRKVRIDTQL 63
Query: 119 IPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAI 178
IP+ FK + I
Sbjct: 64 IPKKGSFKVI-------------------------------------------------I 74
Query: 179 RPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTI 224
R +LY EC +V++ + Q+ V EMRM RWM +TR+D+I N I
Sbjct: 75 RQTMLYEVECLSVQNSYVQQMKVAEMRMFRWMCRQTRKDKIGNKDI 120
>gb|AAF36061.1| Hypothetical protein Y76B12C.5 [Caenorhabditis elegans]
gi|17544228|ref|NP_500154.1| predicted CDS, reverse
transcriptase family member (4C744) [Caenorhabditis
elegans]
Length = 938
Score = 81.6 bits (200), Expect = 2e-14
Identities = 67/200 (33%), Positives = 100/200 (49%), Gaps = 19/200 (9%)
Query: 1 MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLD-VLTEHIQELAP------- 52
+ EG + D +V T G+ QG + SP LF+ L +LT+ ELA
Sbjct: 696 LMEGGQAEITVHDKKLKVNLCT-GIRQGDSASPALFSAALQAILTDCDNELAGVGISVEG 754
Query: 53 ---RCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRST 109
R + FADDVVL+ + EEV RLE + YG ++++SKT ++ F RS
Sbjct: 755 RHIRRLEFADDVVLICSTPEEVQERLEILDRISSNYGLKINQSKTVLLKNKF----CRSQ 810
Query: 110 LEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKL 169
+ G IIP V +YLG ++ G I+ ++S RI+AGW VL + +P K
Sbjct: 811 DILFNGSPIIP-VPGCRYLGRWIDISGSIDEEISRRIRAGWGALVGIKEVL--RIMPNKE 867
Query: 170 KGKFYRTAIRPALLYGTECW 189
+ ++ + PALLY +E W
Sbjct: 868 RIILFKQNVLPALLYASETW 887
>gb|AAB00700.1| Hypothetical protein C34D4.5 [Caenorhabditis elegans]
gi|17539028|ref|NP_501121.1| predicted CDS, reverse
transcriptase family member (4H911) [Caenorhabditis
elegans] gi|7497012|pir||T29286 hypothetical protein
C34D4.5 - Caenorhabditis elegans
Length = 624
Score = 80.9 bits (198), Expect = 3e-14
Identities = 66/200 (33%), Positives = 98/200 (49%), Gaps = 19/200 (9%)
Query: 1 MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLD-VLTEHIQELAP------- 52
M EG + D +V + G+ QG + SP LF+ L +LT+ E A
Sbjct: 296 MMEGGQAEISVHDKKLKV-NLRTGVRQGDSASPALFSAALQAILTDCDNEFAGVGIKVEG 354
Query: 53 ---RCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRST 109
R + FADDVVL+ + EEV RLE + YG ++++SKT ++ F RS
Sbjct: 355 RHIRRLEFADDVVLICSTPEEVQERLEILDRISSIYGLKINQSKTVLLKNKF----CRSQ 410
Query: 110 LEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKL 169
G IIP V +YLG ++ G I+ ++S RI+AGW VL + +P K
Sbjct: 411 DVFFNGSPIIP-VPGCRYLGRWIDISGSIDEEISRRIRAGWGALVGIKEVL--RIMPNKE 467
Query: 170 KGKFYRTAIRPALLYGTECW 189
+ ++ + PALLY +E W
Sbjct: 468 RIILFKQNVLPALLYASETW 487
>gb|AAC24982.2| reverse transcriptase [synthetic construct]
Length = 1016
Score = 80.9 bits (198), Expect = 3e-14
Identities = 59/237 (24%), Positives = 110/237 (45%), Gaps = 16/237 (6%)
Query: 1 MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLF---------TLVLDVLTEHIQELA 51
+Y S VR + + +F + G+ QG +SP+LF T ++DV + L
Sbjct: 669 LYTNTSGRVRAYNHLSPLFHSSSGVRQGCPISPFLFNFAIDDILETALMDVSNGGVDMLP 728
Query: 52 PRCML---FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRS 108
+L +ADD+VL+ ++ + + L ++ YG + SK + + ++
Sbjct: 729 GERLLDLEYADDIVLLCDNAQGMQSALNQLAISVRRYGMCFAPSKCKVLLQDWQDSHPVL 788
Query: 109 TLEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLK 168
TL+ G+ I V +F YLGS++ G + ++ RI + + + V L
Sbjct: 789 TLD---GEQI-EVVEKFVYLGSYISAGGGVSDEIDARIMKARAAYANLGHLWRLRDVSLA 844
Query: 169 LKGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIR 225
+KG+ Y ++R LLY E W ++ + ++SV + R LR ++ Q + N +R
Sbjct: 845 VKGRIYNASVRAVLLYACETWPLRVEDVRRLSVFDHRCLRRIADIQWQHHVSNAEVR 901
>gi|67625701 TPA: endonuclease-reverse transcriptase [Schistosoma mansoni]
Length = 992
Score = 80.5 bits (197), Expect = 4e-14
Identities = 60/249 (24%), Positives = 110/249 (44%), Gaps = 16/249 (6%)
Query: 2 YEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLDVLTE-------HIQELAPRC 54
Y+G++ + T+ F + G+ QG LSP+LF LV+D + + H + R
Sbjct: 666 YDGLNCQIVHGGQLTDSFEVKTGVRQGCLLSPFLFLLVIDWIMKTSTSGGMHGIQWTGRM 725
Query: 55 ML----FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTL 110
L FADD+ L+ ++++++ + + A A G +++ K++ + +N + T
Sbjct: 726 QLDDLDFADDLALLSQTQQQMQEKTTSVAAASAAVGLNINKGKSKTLRYN-----TICTN 780
Query: 111 EVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLK 170
+ + + V F YLGS + G +ADV RI + + + K++ K
Sbjct: 781 PITLDGEALEDVEIFTYLGSIIDEHGGSDADVRARIGKARAAYLQLKNIWSSKQLSTNTK 840
Query: 171 GKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIREGRGG 230
+ + T ++ LLYG E W ++ V LR + D I N + E
Sbjct: 841 VRIFNTNVKTVLLYGAETWRTTKAIIQKIQVFINSCLRKILRIRWPDTISNKLLWETTNQ 900
Query: 231 IHSRKVGRK 239
I + + RK
Sbjct: 901 IPAEEEIRK 909
>gb|AAK18958.1| Hypothetical protein F56C9.2 [Caenorhabditis elegans]
gi|17553662|ref|NP_498615.1| predicted CDS, reverse
transcriptase family member (3I419) [Caenorhabditis
elegans] gi|7504398|pir||T16474 hypothetical protein
F56C9.2 - Caenorhabditis elegans
Length = 772
Score = 78.2 bits (191), Expect = 2e-13
Identities = 71/231 (30%), Positives = 109/231 (46%), Gaps = 26/231 (11%)
Query: 1 MYEGVSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLD-VLTEHIQELAP------- 52
M +G + D +V T G+ QG + SP LF+ L +LT+ E A
Sbjct: 444 MMDGGQAEITVHDKKLKVNLCT-GVRQGDSASPALFSAALQAILTDCDNEFAGVGINVEG 502
Query: 53 ---RCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRST 109
R + FADDVVL+ + EV RLE + YG ++++SKT ++ F RS
Sbjct: 503 RHIRRLEFADDVVLICSTPGEVQERLEILDRISSNYGLKINQSKTVLLKNKF----CRSQ 558
Query: 110 LEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKL 169
+ G IIP V +YLG ++ G I+ ++S RI+AGW VL + +P K
Sbjct: 559 DVLFNGSPIIP-VPGCRYLGRWIDISGSIDEEISRRIRAGWGALVGIKEVL--RIMPNKE 615
Query: 170 KGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIR 220
+ ++ + PALLY +E W + + +R+ R +SG IR
Sbjct: 616 RIILFKQNVLPALLYASETWTCNAG-------STLRLKRTVSGLIDAAEIR 659
>gb|AAF36059.1| Hypothetical protein Y75D11A.4 [Caenorhabditis elegans]
gi|17570529|ref|NP_508323.1| predicted CDS, reverse
transcriptase family member (XC378) [Caenorhabditis
elegans]
Length = 480
Score = 77.4 bits (189), Expect = 3e-13
Identities = 61/177 (34%), Positives = 90/177 (50%), Gaps = 18/177 (10%)
Query: 24 GLHQGSTLSPYLFTLVLD-VLTEHIQELAP----------RCMLFADDVVLVGESREEVN 72
G+ QG + SP LF+ L +LT+ ELA R + FADDVVL+ + EEV
Sbjct: 174 GVRQGDSASPALFSAALQAILTDCDNELAGVGISVEGRHIRRLEFADDVVLICSTPEEVQ 233
Query: 73 GRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHIIPQVTRFKYLGSFV 132
RLE + YG ++ +SKT ++ F RS + G IIP V +YLG ++
Sbjct: 234 ERLEILDRISSYYGLKIDQSKTVLLKNKF----CRSQDVLFNGSPIIP-VPGCRYLGRWI 288
Query: 133 QNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIRPALLYGTECW 189
G I+ ++S RI+AGW VL + +P K ++ + PALLY ++ W
Sbjct: 289 DISGSIDEEISRRIRAGWGALVGIKEVL--RIMPNKENIILFKQNVLPALLYASKTW 343
>ref|XP_556470.1| ENSANGP00000028171 [Anopheles gambiae str. PEST]
gi|55238617|gb|EAL39934.1| ENSANGP00000028171 [Anopheles
gambiae str. PEST]
Length = 777
Score = 73.9 bits (180), Expect = 3e-12
Identities = 61/223 (27%), Positives = 95/223 (42%), Gaps = 18/223 (8%)
Query: 5 VSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLD--------VLTEHIQELAPRCML 56
V+ VR + F T GL QG L+ LF L L+ T I + + +
Sbjct: 515 VTCQVRVDGKLSGPFATTKGLRQGDGLACLLFNLALERAIRDSRVETTGTIFYKSTQILA 574
Query: 57 FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYM-------EWNFSGRRSRST 109
+ADD+ ++G V + QA E G +++ +KT+ M N R R
Sbjct: 575 YADDIDIIGLRLSYVAEAYQGIEQAAENLGLQINEAKTKLMVATSADLPINNPNLRRR-- 632
Query: 110 LEVKVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKL 169
+V++G+ V F YLGS V ND +E ++ R+ A + K + +
Sbjct: 633 -DVQIGERTFEVVPEFTYLGSKVSNDNSMEVELRARMLAANRSFYSLKKQFTSKNLSRRT 691
Query: 170 KGKFYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLRWMSG 212
K Y T I P L Y +E W + E ++ E +MLR + G
Sbjct: 692 KLGLYSTYIVPVLTYASETWTLSKSDEALLAAFERKMLRRILG 734
>ref|XP_552671.1| ENSANGP00000005174 [Anopheles gambiae str. PEST]
gi|55235101|gb|EAL38938.1| ENSANGP00000005174 [Anopheles
gambiae str. PEST]
Length = 329
Score = 72.0 bits (175), Expect = 1e-11
Identities = 56/216 (25%), Positives = 93/216 (42%), Gaps = 12/216 (5%)
Query: 5 VSTSVRTQDGTTEVFPITIGLHQGSTLSPYLFTLVLD--------VLTEHIQELAPRCML 56
V+ VR + F T GL QG L+ LF L L+ T I + + +
Sbjct: 18 VTCQVRVDGKLSGPFATTKGLRQGDGLACLLFNLALERAIRDSRVETTGTIFYKSTQILA 77
Query: 57 FADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSG----RRSRSTLEV 112
+ADD+ ++ V + QA E+ G +++ +KT+ M +G ++ +V
Sbjct: 78 YADDIDIIDLRLSYVAEAYQGIEQAAESLGLQINEAKTKLMVATSAGLPINNQNLRRRDV 137
Query: 113 KVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGK 172
++G+ V F LGS V ND +EA++ R+ A + K + + K
Sbjct: 138 QIGERTFEVVPEFTCLGSKVSNDNSMEAELRARMLAANRSFYSLKKQFTSKNLSRRTKLG 197
Query: 173 FYRTAIRPALLYGTECWAVKSQHENQVSVTEMRMLR 208
Y T I P Y +E W + E ++ E +MLR
Sbjct: 198 LYSTYIVPVFTYASETWTLSKSDETLLAAFERKMLR 233
>gb|AAC70880.1| Hypothetical protein F21E9.5 [Caenorhabditis elegans]
gi|17567237|ref|NP_508248.1| predicted CDS, reverse
transcriptase family member (XB968) [Caenorhabditis
elegans] gi|7499579|pir||T31973 hypothetical protein
F21E9.5 - Caenorhabditis elegans
Length = 864
Score = 69.7 bits (169), Expect = 7e-11
Identities = 52/225 (23%), Positives = 95/225 (42%), Gaps = 19/225 (8%)
Query: 20 PITIGLHQGSTLSPYLFTLVLDVL----------------TEHIQELAPRC--MLFADDV 61
P+T G+ QG +SP LF+ L+ + TE I+ + FADD+
Sbjct: 517 PVTRGVRQGDPISPNLFSACLEHVFRQLNWKHFKGDERYETEGIRVNGQNLTNLRFADDI 576
Query: 62 VLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHIIPQ 121
VLV + + L + + G +++ KT+ + F+ +R + II
Sbjct: 577 VLVAHNPRTASQMLTELVEKCSSVGLKINTGKTKVLRNRFAYKRKVEIRCPNTTNIIIDD 636
Query: 122 VTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIRPA 181
V + YLG + + + ++ R +A W + P +L+ + + + PA
Sbjct: 637 VNEYIYLGRQINDSNNLLPELHRRRRAAWAAFTNIKSTTDQITCP-RLRANLFDSTVLPA 695
Query: 182 LLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIRE 226
L YG+E W + +V VT + R + G T ++ + + RE
Sbjct: 696 LTYGSEAWTFTKELAERVRVTHAALERKLVGLTLTEQRKRNIHRE 740
>gb|AAF60814.1| Hypothetical protein Y58G8A.2 [Caenorhabditis elegans]
gi|17566042|ref|NP_503166.1| predicted CDS, reverse
transcriptase family member (5A739) [Caenorhabditis
elegans]
Length = 769
Score = 68.2 bits (165), Expect = 2e-10
Identities = 53/229 (23%), Positives = 95/229 (41%), Gaps = 23/229 (10%)
Query: 20 PITIGLHQGSTLSPYLFTLVLDVL----------------TEHIQELAPRC--MLFADDV 61
P+ G+ QG +SP LF+ L+ + TE I+ + FADD+
Sbjct: 464 PVIRGVRQGDPISPNLFSACLEHVFRQLNWKHFKGDERYETEGIRVNGQNLTNLRFADDI 523
Query: 62 VLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHIIPQ 121
VLV + + L + + G +++ KT+ + F+ +R + II
Sbjct: 524 VLVAHNPRTASQMLTELVEKCSSVGLKINTGKTKVLRNRFAYKRKVEIRCPNTTNIIIDD 583
Query: 122 VTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIRPA 181
V + YLG + + + ++ R +A W + P +L+ + + + PA
Sbjct: 584 VNEYIYLGRQINDSNNLLPELHRRRRAAWAAFTNIKSTTDQITCP-RLRANLFDSTVLPA 642
Query: 182 LLYGTECWAVKSQHENQVSVTEMRMLRWMSGKT----RQDRIRNDTIRE 226
L YG+E W + +V VT + R + G T R+ I + +RE
Sbjct: 643 LTYGSEAWTFTKELAERVRVTHAALERKLVGLTLTEQRERNIHREEVRE 691
>ref|NP_493497.1| predicted CDS, reverse transcriptase family member (1O881)
[Caenorhabditis elegans]
Length = 835
Score = 68.2 bits (165), Expect = 2e-10
Identities = 49/200 (24%), Positives = 86/200 (42%), Gaps = 19/200 (9%)
Query: 20 PITIGLHQGSTLSPYLFTLVLDVL----------------TEHIQELAPRC--MLFADDV 61
P+T G+ QG +SP LF+ L+ + TE I+ + FADD+
Sbjct: 583 PVTRGVRQGDPISPNLFSACLEHVFRQLNWKHFKGDERYETEGIRVNGQNLTNLRFADDI 642
Query: 62 VLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHIIPQ 121
VLV + V+ L + + G +++ KT+ + FS +R + II
Sbjct: 643 VLVAHNPRTVSQMLTELVEKCSSVGLKINTGKTKVLRNRFSYKRKVEIRCPNTTNIIIDD 702
Query: 122 VTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIRPA 181
V + YLG + + + A++ R +A W + + P +L+ + + + PA
Sbjct: 703 VNEYIYLGRQINDSNNLLAELHRRRRAAWAAFTNIKSTMDQITCP-RLRANLFDSTVLPA 761
Query: 182 LLYGTECWAVKSQHENQVSV 201
L YG+E W + +V V
Sbjct: 762 LTYGSEAWTFTKELAERVRV 781
>pir||T20517 hypothetical protein F02E9.8 - Caenorhabditis elegans
Length = 341
Score = 65.9 bits (159), Expect = 9e-10
Identities = 48/137 (35%), Positives = 72/137 (52%), Gaps = 7/137 (5%)
Query: 53 RCMLFADDVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEV 112
R + FADDVVL + EV RLE + YG ++++SKT ++ F RS +
Sbjct: 75 RRLEFADDVVLTCSTPGEVQERLEILDRISSNYGLKINQSKTVLLKNKF----CRSQDVL 130
Query: 113 KVGDHIIPQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGK 172
G IIP V +YLG ++ G I+ ++S RI+AGW VL + +P K +
Sbjct: 131 FNGSPIIP-VPGCRYLGRWIDISGSIDEEISRRIRAGWGALVGIKEVL--RIMPNKERII 187
Query: 173 FYRTAIRPALLYGTECW 189
++ + PALLY +E W
Sbjct: 188 LFKQNVLPALLYASETW 204
>gb|AAF36001.1| Hypothetical protein Y71F9AL.3 [Caenorhabditis elegans]
gi|17510457|ref|NP_491073.1| reverse transcriptase
family member (1D477) [Caenorhabditis elegans]
Length = 423
Score = 60.8 bits (146), Expect = 3e-08
Identities = 48/227 (21%), Positives = 86/227 (37%), Gaps = 22/227 (9%)
Query: 20 PITIGLHQGSTLSPYLFTLVLDVLTEHIQELA--------------------PRCMLFAD 59
P+T G+ QG +SP LF+ L+ + + + P + FAD
Sbjct: 100 PVTKGVRQGDPISPNLFSACLEHVFRKLSWIELKGEAEDYDTIPGMRVNGRNPTNLRFAD 159
Query: 60 DVVLVGESREEVNGRLETWRQALEAYGFRLSRSKTEYMEWNFSGRRSRSTLEVKVGDHII 119
D+VL+ + L+ Q G ++ KT+ + F+ S+ +
Sbjct: 160 DIVLIANHPNTASKMLQELVQKCSEVGLEINTGKTKVLRNRFADP-SKVYFGSPSPTTQL 218
Query: 120 PQVTRFKYLGSFVQNDGEIEADVSHRIQAGWLKWRRASGVLCDKKVPLKLKGKFYRTAIR 179
V + YLG + + ++ R +A W + D K++ + + +
Sbjct: 219 DDVDEYIYLGRQINAQNNLMPEIHRRRRAAWAAFNGIKNT-ADSITDKKIRANLFDSIVL 277
Query: 180 PALLYGTECWAVKSQHENQVSVTEMRMLRWMSGKTRQDRIRNDTIRE 226
PAL YG+E W +V +T + R + G T + D RE
Sbjct: 278 PALTYGSEAWTFTKALSERVRITHASLERRLVGITLTQQRERDLHRE 324
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.321 0.136 0.410
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 398,883,470
Number of Sequences: 2540612
Number of extensions: 15840254
Number of successful extensions: 33864
Number of sequences better than 10.0: 1211
Number of HSP's better than 10.0 without gapping: 274
Number of HSP's successfully gapped in prelim test: 937
Number of HSP's that attempted gapping in prelim test: 33068
Number of HSP's gapped (non-prelim): 1255
length of query: 239
length of database: 863,360,394
effective HSP length: 124
effective length of query: 115
effective length of database: 548,324,506
effective search space: 63057318190
effective search space used: 63057318190
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)
Medicago: description of AC141114.8