Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC002669A_C01 KMC002669A_c01
(512 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like ... 79 2e-14
ref|NP_179822.1| putative non-LTR retroelement reverse transcrip... 77 2e-13
ref|NP_189754.1| non-LTR reverse transcriptase, putative; protei... 74 8e-13
dbj|BAB09192.1| non-LTR retroelement reverse transcriptase-like ... 74 1e-12
ref|NP_680730.1| similar to non-LTR reverse transcriptase, putat... 74 1e-12
>dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
thaliana]
Length = 676
Score = 79.3 bits (194), Expect = 2e-14
Identities = 50/157 (31%), Positives = 76/157 (47%)
Frame = -2
Query: 505 GEYRMDSDGAFKHDDDRMGMGGIVRDAHGAWISGFYAGSLGGDALRAEIAALKHGLTLLW 326
G M++DGA + + GG++RD HG+W+ GF A AE+ + +GL + W
Sbjct: 514 GWVTMNTDGASHGNPGQATAGGVIRDEHGSWLVGFALNIGVCSAPLAELWGVYYGLVVAW 573
Query: 325 NAHVRRATCEVDCLDIVEALENDRYQFHALASELLDIRLLLDRDWTVTLAYVPREANAAA 146
RR EVD +V L++ H LA + + +DW V + +V REAN A
Sbjct: 574 ERGWRRVRLEVDSALVVGFLQSGIGDSHPLAFLVRLCHGFISKDWIVRITHVYREANRLA 633
Query: 145 DCLAGLGASLLCPLTCLESPPQELQPILARDLLAL*F 35
D LA +L L+S P+ + IL D++ F
Sbjct: 634 DGLANYAFTLPFGFLLLDSCPEHVSSILLEDVMGTSF 670
>ref|NP_179822.1| putative non-LTR retroelement reverse transcriptase; protein id:
At2g22350.1 [Arabidopsis thaliana]
gi|25412100|pir||F84611 hypothetical protein At2g22350
[imported] - Arabidopsis thaliana
gi|4544460|gb|AAD22368.1| putative non-LTR retroelement
reverse transcriptase [Arabidopsis thaliana]
Length = 321
Score = 76.6 bits (187), Expect = 2e-13
Identities = 52/152 (34%), Positives = 73/152 (47%)
Frame = -2
Query: 505 GEYRMDSDGAFKHDDDRMGMGGIVRDAHGAWISGFYAGSLGGDALRAEIAALKHGLTLLW 326
G ++++DGA + + GG++RD +GAWI GF A AE+ + +GL + W
Sbjct: 159 GWVKLNTDGASRGNPGFATAGGVLRDHNGAWIGGFAVNIGVCSAPLAELWGVYYGLFIAW 218
Query: 325 NAHVRRATCEVDCLDIVEALENDRYQFHALASELLDIRLLLDRDWTVTLAYVPREANAAA 146
RR EVD +V L H L+ L L + W V +++V REAN A
Sbjct: 219 GRGARRVELEVDSKMVVGFLTTGIADSHPLSFLLRLCYDFLSKGWIVRISHVYREANRLA 278
Query: 145 DCLAGLGASLLCPLTCLESPPQELQPILARDL 50
D LA SL L LES P + IL D+
Sbjct: 279 DGLANYAFSLSLGLHLLESRPDVVSSILLDDV 310
>ref|NP_189754.1| non-LTR reverse transcriptase, putative; protein id: At3g32110.1
[Arabidopsis thaliana]
Length = 1911
Score = 74.3 bits (181), Expect = 8e-13
Identities = 47/153 (30%), Positives = 76/153 (48%)
Frame = -2
Query: 511 VSGEYRMDSDGAFKHDDDRMGMGGIVRDAHGAWISGFYAGSLGGDALRAEIAALKHGLTL 332
+ G Y++++DGA + + GG++R++ GAW GF A AE+ + +GL +
Sbjct: 1747 MEGWYKINTDGASRGNPGLASAGGVLRNSAGAWCGGFAVNIGRCSAPLAELWGVYYGLYM 1806
Query: 331 LWNAHVRRATCEVDCLDIVEALENDRYQFHALASELLDIRLLLDRDWTVTLAYVPREANA 152
W + EVD +V L+ + H L+ + L +DWTV +++V REAN+
Sbjct: 1807 AWAKQLTHLELEVDSEVVVGFLKTGIGETHPLSFLVRLCHNFLSKDWTVRISHVYREANS 1866
Query: 151 AADCLAGLGASLLCPLTCLESPPQELQPILARD 53
AD LA SL L + P L +L+ D
Sbjct: 1867 LADGLANHAFSLSLGLHVFDEIPISLVMLLSED 1899
>dbj|BAB09192.1| non-LTR retroelement reverse transcriptase-like protein
[Arabidopsis thaliana]
Length = 308
Score = 73.6 bits (179), Expect = 1e-12
Identities = 48/153 (31%), Positives = 67/153 (43%)
Frame = -2
Query: 505 GEYRMDSDGAFKHDDDRMGMGGIVRDAHGAWISGFYAGSLGGDALRAEIAALKHGLTLLW 326
G ++++DGA + + GG++RD GAW GF A AE+ + +GL W
Sbjct: 146 GWVKVNTDGASRGNPGLASAGGVLRDCEGAWCGGFSLNIGRCSAQHAELWGVYYGLYFAW 205
Query: 325 NAHVRRATCEVDCLDIVEALENDRYQFHALASELLDIRLLLDRDWTVTLAYVPREANAAA 146
V R EVD IV L+ H L+ + L +DW V + YV REAN A
Sbjct: 206 EKKVPRVELEVDSEAIVGFLKTGISDSHPLSFLVRLCHNFLQKDWLVRIVYVYREANCLA 265
Query: 145 DCLAGLGASLLCPLTCLESPPQELQPILARDLL 47
D LA L + P + +L D L
Sbjct: 266 DGLANYTILLSLGFHSFDFVPDAMSSLLKEDTL 298
>ref|NP_680730.1| similar to non-LTR reverse transcriptase, putative; protein id:
At4g20725.1 [Arabidopsis thaliana]
Length = 851
Score = 73.6 bits (179), Expect = 1e-12
Identities = 51/154 (33%), Positives = 72/154 (46%)
Frame = -2
Query: 508 SGEYRMDSDGAFKHDDDRMGMGGIVRDAHGAWISGFYAGSLGGDALRAEIAALKHGLTLL 329
+G Y++++DGA + + GG++RD G W GF A AE+ + +GL L
Sbjct: 688 TGWYKVNTDGASRGNPGLATAGGVIRDGAGNWCGGFALNIGRCSAPLAELWGVYYGLYLA 747
Query: 328 WNAHVRRATCEVDCLDIVEALENDRYQFHALASELLDIRLLLDRDWTVTLAYVPREANAA 149
W + R EVD +V L+ H L+ + LL +DW V + V REAN
Sbjct: 748 WTKALTRVELEVDSELVVGFLKTGIGDQHPLSFLVRLCHGLLSKDWIVRITRVYREANRL 807
Query: 148 ADCLAGLGASLLCPLTCLESPPQELQPILARDLL 47
AD LA SL L P +L+ IL D L
Sbjct: 808 ADGLANYAFSLPLGFHSLIDVPDDLEVILHEDSL 841
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 480,314,115
Number of Sequences: 1393205
Number of extensions: 11252189
Number of successful extensions: 39190
Number of sequences better than 10.0: 134
Number of HSP's better than 10.0 without gapping: 36256
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 38725
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 16232377112
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)