
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148819.5 - phase: 0
(83 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q7XLX8 OSJNBa0042I15.12 protein [Oryza sativa] 50 1e-05
UniRef100_P10978 Retrovirus-related Pol polyprotein from transpo... 46 3e-04
UniRef100_Q9AU17 Polyprotein-like [Lycopersicon chilense] 45 6e-04
UniRef100_Q9M1F5 Copia-like polyprotein [Arabidopsis thaliana] 42 0.005
UniRef100_Q7XTM9 OSJNBa0033G05.13 protein [Oryza sativa] 42 0.005
UniRef100_Q8LNW7 Putative polyprotein [Oryza sativa] 41 0.008
UniRef100_Q6L4V3 Putative polyprotein [Oryza sativa] 40 0.014
UniRef100_Q6L4X8 Putative polyprotein [Oryza sativa] 39 0.025
UniRef100_Q6BCY1 Gag-Pol [Ipomoea batatas] 38 0.055
UniRef100_Q9SH77 Putative retroelement pol polyprotein [Arabidop... 37 0.16
UniRef100_O81903 Putative transposable element [Arabidopsis thal... 36 0.21
UniRef100_O96968 Yokozuna protein [Bombyx mori] 36 0.27
UniRef100_Q9FFM0 Copia-like retrotransposable element [Arabidops... 35 0.36
UniRef100_Q9SZY0 Putative retrotransposon [Arabidopsis thaliana] 35 0.46
UniRef100_Q9SJT2 Putative retroelement pol polyprotein [Arabidop... 33 1.8
UniRef100_Q9SJU6 Putative retroelement pol polyprotein [Arabidop... 32 3.0
UniRef100_Q9LJP6 Non-LTR retroelement reverse transcriptase-like... 32 3.0
UniRef100_Q8GYM7 Putative non-LTR reverse transcriptase [Arabido... 32 3.0
UniRef100_Q9SHR5 F28L22.3 protein [Arabidopsis thaliana] 32 5.1
>UniRef100_Q7XLX8 OSJNBa0042I15.12 protein [Oryza sativa]
Length = 432
Score = 50.1 bits (118), Expect = 1e-05
Identities = 26/80 (32%), Positives = 49/80 (60%), Gaps = 2/80 (2%)
Query: 4 KWDIEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILFSDTMSQEDKTKMADNARSFFV 63
K+D+ KF G +FGLW+ +++ +L QQ+ +KA KG + M +D +M A +
Sbjct: 43 KFDLVKFDGFGNFGLWQTRVKDLLAQQEVSKALKG--VKPAKMEDDDCEEMQLQAAATIR 100
Query: 64 LCLRDKVLREVVKDTTIKVM 83
CL D+++ V+++T+ K++
Sbjct: 101 RCLSDQIMYHVMEETSPKII 120
>UniRef100_P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94
[Contains: Protease (EC 3.4.23.-); Reverse
transcriptase (EC 2.7.7.49); Endonuclease] [Nicotiana
tabacum]
Length = 1328
Score = 45.8 bits (107), Expect = 3e-04
Identities = 25/80 (31%), Positives = 40/80 (49%)
Query: 2 GTKWDIEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILFSDTMSQEDKTKMADNARSF 61
G K+++ KF GD+ F W+ ++ +LIQQ K DTM ED + + A S
Sbjct: 3 GVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASA 62
Query: 62 FVLCLRDKVLREVVKDTTIK 81
L L D V+ ++ + T +
Sbjct: 63 IRLHLSDDVVNNIIDEDTAR 82
>UniRef100_Q9AU17 Polyprotein-like [Lycopersicon chilense]
Length = 1328
Score = 44.7 bits (104), Expect = 6e-04
Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 1/77 (1%)
Query: 2 GTKWDIEKFIGDSD-FGLWKVKIEVVLIQQKCAKARKGKILFSDTMSQEDKTKMADNARS 60
G K+++ KF GD F +W+ +++ +LIQQ KA GK ++M ED ++ + A S
Sbjct: 3 GVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAAS 62
Query: 61 FFVLCLRDKVLREVVKD 77
L L D V+ +V +
Sbjct: 63 AIRLHLTDDVVNNIVDE 79
>UniRef100_Q9M1F5 Copia-like polyprotein [Arabidopsis thaliana]
Length = 1363
Score = 41.6 bits (96), Expect = 0.005
Identities = 28/97 (28%), Positives = 47/97 (47%), Gaps = 15/97 (15%)
Query: 2 GTKWDIEKFIGDSDFGLWKVK---------IEVVLIQQKCAKARKGKILFSDTMSQEDKT 52
G + ++EKF G D+ +WK K + VL + + ++ SD +E++
Sbjct: 3 GARIEVEKFDGRGDYTMWKEKLLAHIDMLGLSAVLRESETPMGKERDSEKSDEDEKEERE 62
Query: 53 KM------ADNARSFFVLCLRDKVLREVVKDTTIKVM 83
KM ARS VL + D+VLR++ K+T+ M
Sbjct: 63 KMEAFEEKKRKARSTIVLSVSDRVLRKIKKETSAAAM 99
>UniRef100_Q7XTM9 OSJNBa0033G05.13 protein [Oryza sativa]
Length = 1181
Score = 41.6 bits (96), Expect = 0.005
Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 1/76 (1%)
Query: 4 KWDIEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILFSDTMSQEDKTKMADNARSFFV 63
K+D+ D+ F LW+VK+ VL QQ+ A G + S ++K K A S+
Sbjct: 5 KYDLPLLDRDTRFSLWQVKMRAVLAQQELDDALSGFDKRTQDWSNDEK-KRDRKAMSYIH 63
Query: 64 LCLRDKVLREVVKDTT 79
L L + +L+EV+K+ T
Sbjct: 64 LHLSNNILQEVLKEET 79
>UniRef100_Q8LNW7 Putative polyprotein [Oryza sativa]
Length = 1280
Score = 40.8 bits (94), Expect = 0.008
Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 4 KWDIEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILFSDTMSQEDKTKMADNARSFFV 63
K+D+ D+ F LW+VK+ VL QQ A G + S ++K K A S+
Sbjct: 40 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEK-KKDRKAMSYIH 98
Query: 64 LCLRDKVLREVVKDTT 79
L L + +L+EV+K+ T
Sbjct: 99 LHLSNNILQEVLKEET 114
>UniRef100_Q6L4V3 Putative polyprotein [Oryza sativa]
Length = 1243
Score = 40.0 bits (92), Expect = 0.014
Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 4 KWDIEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILFSDTMSQEDKTKMADNARSFFV 63
K+D+ D+ F LW+VK+ VL QQ A G + S ++K K A S+
Sbjct: 5 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEK-KRDRKAISYIH 63
Query: 64 LCLRDKVLREVVKDTT 79
L L + +L+EV+K+ T
Sbjct: 64 LHLSNNILQEVLKEET 79
>UniRef100_Q6L4X8 Putative polyprotein [Oryza sativa]
Length = 1211
Score = 39.3 bits (90), Expect = 0.025
Identities = 25/76 (32%), Positives = 39/76 (50%), Gaps = 1/76 (1%)
Query: 4 KWDIEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILFSDTMSQEDKTKMADNARSFFV 63
K+D+ D+ F LW+VK+ VL QQ A G + S ++K K S+
Sbjct: 5 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEK-KRDRKTMSYIH 63
Query: 64 LCLRDKVLREVVKDTT 79
L L + +L+EV+K+ T
Sbjct: 64 LHLSNNILQEVLKEET 79
>UniRef100_Q6BCY1 Gag-Pol [Ipomoea batatas]
Length = 1298
Score = 38.1 bits (87), Expect = 0.055
Identities = 23/79 (29%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 1 MGTKWDIEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILFSDTMSQEDKTKMADNARS 60
M K++IEKF G +F LWK+K++ +L + C A + + D + ++M ++A +
Sbjct: 1 MAAKFEIEKFNG-KNFSLWKLKVKAILRKDNCLAAISERPV--DFTDDKKWSEMNEDAMA 57
Query: 61 FFVLCLRDKVLREVVKDTT 79
L + D VL + + T
Sbjct: 58 DLYLSIADGVLSSIEEKKT 76
>UniRef100_Q9SH77 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1356
Score = 36.6 bits (83), Expect = 0.16
Identities = 24/93 (25%), Positives = 44/93 (46%), Gaps = 15/93 (16%)
Query: 6 DIEKFIGDSDFGLWKVKIEVVL--------IQQKCAKARKGKILFSDTMSQEDKTKMAD- 56
++EKF G D+ +WK K+ + +++ + K +L E+K + +
Sbjct: 7 EVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEKLEKFEA 66
Query: 57 ------NARSFFVLCLRDKVLREVVKDTTIKVM 83
ARS VL + D+VLR++ K++T M
Sbjct: 67 LEEKKKKARSAIVLSVTDRVLRKIKKESTAAAM 99
>UniRef100_O81903 Putative transposable element [Arabidopsis thaliana]
Length = 1308
Score = 36.2 bits (82), Expect = 0.21
Identities = 26/98 (26%), Positives = 45/98 (45%), Gaps = 24/98 (24%)
Query: 1 MGTKWDIEKFIGDSDFGLWKVKIE-------------------VVLIQQKCAKARKGKIL 41
M TK +I+ F GD DF LWK++IE +L+ + K + +
Sbjct: 3 MSTKVEIKTFNGDRDFSLWKIRIEAQLGVLGLKPALSDFTLTKTILVVKSEKKESESEDD 62
Query: 42 FSDTMSQED-----KTKMADNARSFFVLCLRDKVLREV 74
+D+ E+ K + +D A++F + + D VL +V
Sbjct: 63 ETDSKKTEEVPDPIKFEQSDQAKNFIINHITDTVLLKV 100
>UniRef100_O96968 Yokozuna protein [Bombyx mori]
Length = 1316
Score = 35.8 bits (81), Expect = 0.27
Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 7 IEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILFSDTMSQEDKTKMADNARSFFVLCL 66
+ +F G ++F W +I +L +++C +A + I S++ K K A ARS V CL
Sbjct: 10 LPEFTG-TNFSTWIFRISCILEEKECKEAIEDGITEEILKSEKFKKKDA-KARSLIVNCL 67
Query: 67 RDKVLREVVKDTTIKVM 83
DK L V T+ K M
Sbjct: 68 SDKHLEYVRSATSAKEM 84
>UniRef100_Q9FFM0 Copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1342
Score = 35.4 bits (80), Expect = 0.36
Identities = 30/94 (31%), Positives = 47/94 (49%), Gaps = 20/94 (21%)
Query: 6 DIEKFIGDSDFGLWKVKI----EVV-----LIQQKCAKARKGKILFSDTMSQEDKT---K 53
++EKF GD D+ LWK K+ E++ L +++ A SD +Q+ +T K
Sbjct: 7 EVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDPETATSK 66
Query: 54 MAD--------NARSFFVLCLRDKVLREVVKDTT 79
+ D ARS +L L + VLR+V+K T
Sbjct: 67 LEDKILKEKRGKARSTIILSLGNNVLRKVIKQKT 100
>UniRef100_Q9SZY0 Putative retrotransposon [Arabidopsis thaliana]
Length = 1230
Score = 35.0 bits (79), Expect = 0.46
Identities = 26/91 (28%), Positives = 41/91 (44%), Gaps = 13/91 (14%)
Query: 6 DIEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILFSDTMSQEDKTKMAD--------- 56
++EKF G D+ LWK K+ + A + SD + E++ K ++
Sbjct: 7 EMEKFDGHGDYTLWKEKLMAHMDLLGLTVALRETQSVSDPLESEEEGKESEKGDKEALME 66
Query: 57 ----NARSFFVLCLRDKVLREVVKDTTIKVM 83
ARS VL + D+VLR+ K+ T M
Sbjct: 67 EKRQKARSTIVLSVSDQVLRKSKKEKTAPSM 97
>UniRef100_Q9SJT2 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1333
Score = 33.1 bits (74), Expect = 1.8
Identities = 24/98 (24%), Positives = 43/98 (43%), Gaps = 25/98 (25%)
Query: 6 DIEKFIGDSDFGLWKVK--------------------IEVVLIQQKCAKARKGKILFSDT 45
++EKF G D+ +WK K +E V Q + K ++L +
Sbjct: 7 EVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLRREL 66
Query: 46 MSQEDKTKMADNARSFFVLCLRDKVLREVVKDTTIKVM 83
+ ++ + ARS VL + D+VLR++ K+ + M
Sbjct: 67 LEEKRR-----KARSAIVLSVTDRVLRKIKKEQSAAAM 99
>UniRef100_Q9SJU6 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 838
Score = 32.3 bits (72), Expect = 3.0
Identities = 26/90 (28%), Positives = 42/90 (45%), Gaps = 20/90 (22%)
Query: 6 DIEKFIGDSDFGLWKVKI--------------EVVLIQQKCAKARKGKILFSDTMSQEDK 51
++EK G+ D+ LWK K+ E I+++ + A +L EDK
Sbjct: 7 EVEKLDGEGDYVLWKEKLLAHIELLGLLEGLEEDEAIEEEESTAETDSLL----TKTEDK 62
Query: 52 T--KMADNARSFFVLCLRDKVLREVVKDTT 79
+ ARS +L L + VLR+V+K+ T
Sbjct: 63 VLKEKRGKARSTVILSLGNHVLRKVIKEKT 92
>UniRef100_Q9LJP6 Non-LTR retroelement reverse transcriptase-like protein
[Arabidopsis thaliana]
Length = 637
Score = 32.3 bits (72), Expect = 3.0
Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 14/88 (15%)
Query: 6 DIEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILF----SDTMSQEDKTKMAD----- 56
D+EKF G D+ WKVK L + K ++ ++ S E + K +
Sbjct: 7 DLEKFDGFGDYISWKVKFLDHLKDLDLLEGLKEEVCLEKGSGNSGSDEGRVKDLEECKIL 66
Query: 57 -----NARSFFVLCLRDKVLREVVKDTT 79
AR+F ++ + D VLR+++K+ T
Sbjct: 67 EEKKRKARAFIIVSVGDHVLRKIMKERT 94
>UniRef100_Q8GYM7 Putative non-LTR reverse transcriptase [Arabidopsis thaliana]
Length = 155
Score = 32.3 bits (72), Expect = 3.0
Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 14/88 (15%)
Query: 6 DIEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILF----SDTMSQEDKTKMAD----- 56
D+EKF G D+ WKVK L + K ++ ++ S E + K +
Sbjct: 7 DLEKFDGFGDYISWKVKFLDHLKDLDLLEGLKEEVCLEKGSGNSGSDEGRVKDLEECKIL 66
Query: 57 -----NARSFFVLCLRDKVLREVVKDTT 79
AR+F ++ + D VLR+++K+ T
Sbjct: 67 EEKKRKARAFIIVSVGDHVLRKIMKERT 94
>UniRef100_Q9SHR5 F28L22.3 protein [Arabidopsis thaliana]
Length = 1356
Score = 31.6 bits (70), Expect = 5.1
Identities = 18/58 (31%), Positives = 29/58 (49%), Gaps = 10/58 (17%)
Query: 3 TKWDIEKFIGDSDFGLWKVKIEVVLIQQKCAKARKGKILFSDTMSQEDKTKMADNARS 60
T+ +I+ F GD DF LWK++I+ A+ G + DT++ TK +S
Sbjct: 6 TRVEIKVFNGDRDFSLWKIRIQ----------AQLGVLGLKDTLTDFSLTKTVPLTKS 53
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.323 0.137 0.400
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 122,665,918
Number of Sequences: 2790947
Number of extensions: 3540852
Number of successful extensions: 9990
Number of sequences better than 10.0: 19
Number of HSP's better than 10.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 9975
Number of HSP's gapped (non-prelim): 19
length of query: 83
length of database: 848,049,833
effective HSP length: 59
effective length of query: 24
effective length of database: 683,383,960
effective search space: 16401215040
effective search space used: 16401215040
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 68 (30.8 bits)
Medicago: description of AC148819.5