
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC135101.6 + phase: 0 /pseudo
(1281 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q69F89 Gag-pol polyprotein [Phaseolus vulgaris] 160 2e-37
UniRef100_Q9SEL2 Gag-pol polyprotein [Vitis vinifera] 151 1e-34
UniRef100_Q6L3Y5 Putative gag-pol polyprotein [Solanum demissum] 87 4e-15
UniRef100_Q6L3X6 Putative polyprotein [Solanum demissum] 83 6e-14
UniRef100_Q7XHD8 Putative copia-type polyprotein [Oryza sativa] 72 1e-10
UniRef100_Q7Y141 Putative polyprotein [Oryza sativa] 72 1e-10
UniRef100_Q8W3A4 Putative gag-pol polyprotein [Oryza sativa] 72 1e-10
UniRef100_Q9ZQE9 Putative retroelement pol polyprotein [Arabidop... 67 3e-09
UniRef100_Q6I5H9 Putative polyprotein [Oryza sativa] 66 6e-09
UniRef100_Q8L515 P0018C10.19 protein [Oryza sativa] 65 1e-08
UniRef100_Q9FH39 Copia-type polyprotein [Arabidopsis thaliana] 63 5e-08
UniRef100_Q9C7Y1 Copia-type polyprotein, putative; 28768-32772 [... 63 5e-08
UniRef100_Q9M1C6 Hypothetical protein T2O9.150 [Arabidopsis thal... 62 1e-07
UniRef100_Q6L3N8 Putative gag-pol polyprotein [Solanum demissum] 60 3e-07
UniRef100_Q84VH6 Gag-pol polyprotein [Glycine max] 60 4e-07
UniRef100_Q9C536 Copia-type polyprotein, putative [Arabidopsis t... 59 7e-07
UniRef100_Q9SXB2 T28P6.8 protein [Arabidopsis thaliana] 59 7e-07
UniRef100_Q9M2D1 Copia-type polyprotein [Arabidopsis thaliana] 59 7e-07
UniRef100_Q9M197 Copia-type reverse transcriptase-like protein [... 59 7e-07
UniRef100_Q9C739 Copia-type polyprotein, putative [Arabidopsis t... 59 7e-07
>UniRef100_Q69F89 Gag-pol polyprotein [Phaseolus vulgaris]
Length = 529
Score = 160 bits (405), Expect = 2e-37
Identities = 71/110 (64%), Positives = 93/110 (84%)
Query: 4 VAPPLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKA 63
+APP+FDG++Y++WAV++E YLEALD+WEA+EEDYE+ LP NPT+AQ+K K++K K++
Sbjct: 1 MAPPVFDGDSYQMWAVRMETYLEALDLWEAVEEDYEIQSLPENPTVAQIKSQKDKKMKRS 60
Query: 64 KAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNLM 113
KAK+CLFA V VFTRIM+LK+ K IWDYLK EY GDERIR +Q LNL+
Sbjct: 61 KAKACLFAAVSPMVFTRIMSLKSAKEIWDYLKAEYEGDERIRGMQALNLI 110
>UniRef100_Q9SEL2 Gag-pol polyprotein [Vitis vinifera]
Length = 581
Score = 151 bits (381), Expect = 1e-34
Identities = 66/110 (60%), Positives = 92/110 (83%)
Query: 4 VAPPLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKA 63
VAP + DG+NYE WAV++ +L+ALDVWEA+EE+YEVPPL +PT+AQ+K HKER+T+KA
Sbjct: 8 VAPSVLDGDNYETWAVRMTVHLQALDVWEAVEENYEVPPLGADPTVAQMKLHKERRTRKA 67
Query: 64 KAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNLM 113
KAK+CLFA V ++F +IM + + IW+YLKEEY GDERI+++Q++NL+
Sbjct: 68 KAKACLFAAVSPSIFIKIMKIDSAAEIWEYLKEEYKGDERIKNMQVMNLI 117
>UniRef100_Q6L3Y5 Putative gag-pol polyprotein [Solanum demissum]
Length = 1133
Score = 86.7 bits (213), Expect = 4e-15
Identities = 40/92 (43%), Positives = 61/92 (65%)
Query: 21 IEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAKSCLFAGVLETVFTR 80
++ YL+ L +W+ +EED PL NPT+ Q++ H+E K KA S + A V +++FTR
Sbjct: 1 MKTYLKGLGLWKYVEEDNAPEPLRANPTLQQIRSHEEEIAKGPKALSFIHAAVSDSIFTR 60
Query: 81 IMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
IMT ++ K WD LK+E+ G +R R +Q+LNL
Sbjct: 61 IMTCESAKESWDMLKQEFQGSDRTRQMQILNL 92
>UniRef100_Q6L3X6 Putative polyprotein [Solanum demissum]
Length = 1758
Score = 82.8 bits (203), Expect = 6e-14
Identities = 40/70 (57%), Positives = 50/70 (71%)
Query: 43 LPNNPTMAQLKYHKERKTKKAKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDE 102
LP NPT+AQ+K + E KTKK+KAKS + V ++VF RIM KT K WD LKEEY G +
Sbjct: 22 LPENPTLAQIKSNNEEKTKKSKAKSLMQNAVADSVFYRIMACKTAKEAWDRLKEEYQGSD 81
Query: 103 RIRSIQMLNL 112
R R +Q+LNL
Sbjct: 82 RTRQMQVLNL 91
>UniRef100_Q7XHD8 Putative copia-type polyprotein [Oryza sativa]
Length = 1350
Score = 72.0 bits (175), Expect = 1e-10
Identities = 34/106 (32%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+F G NY++W++K+ L + +W+ +E Y+ T Q K E + AKA
Sbjct: 7 PVFAGENYDIWSIKMRTLLLSQGLWDIVENGYQEYSAGETLTAEQKKSLAEDRMSDAKAL 66
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
+ GV E++F RI+ K K WD LKEE+ G +++ ++++ L
Sbjct: 67 FLIQQGVAESLFPRIIGAKKSKEAWDKLKEEFQGSQKVLAVKLQTL 112
>UniRef100_Q7Y141 Putative polyprotein [Oryza sativa]
Length = 1335
Score = 72.0 bits (175), Expect = 1e-10
Identities = 34/106 (32%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+F G NY++W++K+ L + +W+ +E Y+ T Q K E + AKA
Sbjct: 7 PVFAGENYDIWSIKMRTLLLSQGLWDIVENGYQEYSAGETLTAEQKKSLAEDRMSDAKAL 66
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
+ GV E++F RI+ K K WD LKEE+ G +++ ++++ L
Sbjct: 67 FLIQQGVAESLFPRIIGAKKSKEAWDKLKEEFQGSQKVLAVKLQTL 112
>UniRef100_Q8W3A4 Putative gag-pol polyprotein [Oryza sativa]
Length = 1167
Score = 72.0 bits (175), Expect = 1e-10
Identities = 34/106 (32%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+F G NY++W++K+ L + +W+ +E Y+ T Q K E + AKA
Sbjct: 7 PVFAGENYDIWSIKMRTLLLSQGLWDIVENGYQEYSAGETLTAEQKKSLAEDRMSDAKAL 66
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
+ GV E++F RI+ K K WD LKEE+ G +++ ++++ L
Sbjct: 67 FLIQQGVAESLFPRIIGAKKSKEAWDKLKEEFQGSQKVLAVKLQTL 112
>UniRef100_Q9ZQE9 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1347
Score = 67.4 bits (163), Expect = 3e-09
Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 3/109 (2%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLP--NNPTMAQLK-YHKERKTKKA 63
P+FDG Y+ W++K+ +W +EE V P+ P A+ K +E T
Sbjct: 10 PIFDGEKYDFWSIKMATIFRTRKLWSVVEEGVPVEPVQAEETPETARAKTLREEAVTNDT 69
Query: 64 KAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
A L V + +F+RI + K WD LK+EY G ++R +++ +L
Sbjct: 70 MALQILQTAVTDQIFSRIAAASSSKEAWDVLKDEYQGSPQVRLVKLQSL 118
>UniRef100_Q6I5H9 Putative polyprotein [Oryza sativa]
Length = 1136
Score = 66.2 bits (160), Expect = 6e-09
Identities = 32/106 (30%), Positives = 56/106 (52%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+F NY++W++K+ L + +W+ +E Y+ T Q K E + AKA
Sbjct: 7 PVFARENYDIWSIKMRTLLLSQGLWDIVENGYQEYSAGETLTAEQKKSLVEDRMSDAKAL 66
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
+ GV E++F RI+ K K WD LKEE+ +++ ++++ L
Sbjct: 67 FLIQQGVAESLFPRIIGAKKSKEAWDKLKEEFQASQKVLAVKLQTL 112
>UniRef100_Q8L515 P0018C10.19 protein [Oryza sativa]
Length = 278
Score = 65.1 bits (157), Expect = 1e-08
Identities = 36/108 (33%), Positives = 54/108 (49%), Gaps = 15/108 (13%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ NY WA+K+EA +EA +W+AIE P +K K+ A+
Sbjct: 31 PVLTSTNYSTWAIKMEANMEAQGIWDAIE--------PAADEAVDIKKDKQ-------AR 75
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNLMS 114
+CLF V E V +I KT K +W+ LK + G ER++ ++ L S
Sbjct: 76 ACLFGAVPEDVLQQIAKKKTAKEVWESLKTRFLGVERVKKARVQTLKS 123
>UniRef100_Q9FH39 Copia-type polyprotein [Arabidopsis thaliana]
Length = 1334
Score = 63.2 bits (152), Expect = 5e-08
Identities = 35/106 (33%), Positives = 58/106 (54%), Gaps = 1/106 (0%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P FDG+ YE WA+ +E + + + W+ IE P T AQ E+ K K K
Sbjct: 10 PKFDGD-YEHWAMLMENLIRSKEWWDIIETGIPRPERNVILTGAQRTELAEKTVKDHKVK 68
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
+ LFA + +T+ I+ +T K +W+ +K +Y G++R++S Q+ L
Sbjct: 69 NYLFASIDKTILKTILQKETSKDLWESMKRKYQGNDRVQSAQLQRL 114
>UniRef100_Q9C7Y1 Copia-type polyprotein, putative; 28768-32772 [Arabidopsis
thaliana]
Length = 1334
Score = 63.2 bits (152), Expect = 5e-08
Identities = 35/106 (33%), Positives = 58/106 (54%), Gaps = 1/106 (0%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P FDG+ YE WA+ +E + + + W+ IE P T AQ E+ K K K
Sbjct: 10 PKFDGD-YEHWAMLMENLIRSKEWWDIIETGIPRPERNVILTGAQRTELAEKTVKDHKVK 68
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
+ LFA + +T+ I+ +T K +W+ +K +Y G++R++S Q+ L
Sbjct: 69 NYLFASIDKTILKTILQKETSKDLWESMKRKYQGNDRVQSAQLQRL 114
>UniRef100_Q9M1C6 Hypothetical protein T2O9.150 [Arabidopsis thaliana]
Length = 1339
Score = 62.0 bits (149), Expect = 1e-07
Identities = 33/110 (30%), Positives = 57/110 (51%), Gaps = 2/110 (1%)
Query: 1 FSKVAPPLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTM-AQLKYHKERK 59
F + A P FDG Y+ W++ +E +L + ++W +EE + P AQ +E K
Sbjct: 7 FVQPAIPRFDGY-YDFWSMTMENFLRSRELWRLVEEGIPAIVVGTTPVSEAQRSAVEEAK 65
Query: 60 TKKAKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQM 109
K K K+ LF + + I+ T KAIW+ +K++Y G +++ Q+
Sbjct: 66 LKDLKVKNFLFQAIDREILETILDKSTSKAIWESMKKKYQGSTKVKRAQL 115
>UniRef100_Q6L3N8 Putative gag-pol polyprotein [Solanum demissum]
Length = 1333
Score = 60.5 bits (145), Expect = 3e-07
Identities = 29/106 (27%), Positives = 62/106 (58%), Gaps = 9/106 (8%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+F G NY+ W++K++ ++ ++W+ +E +P N Q++ H++R +K A
Sbjct: 15 PIFRGENYQFWSLKMKTLFKSQELWDIVETG--IPEGNAN----QMREHRKRDSK---AL 65
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
+ + + +F RI ++T K W+ LK+EY GD+++ ++++ L
Sbjct: 66 FTIQQALDDEIFPRISAVETSKQAWEILKQEYFGDDKVITVKLQTL 111
>UniRef100_Q84VH6 Gag-pol polyprotein [Glycine max]
Length = 1577
Score = 60.1 bits (144), Expect = 4e-07
Identities = 35/111 (31%), Positives = 55/111 (49%), Gaps = 11/111 (9%)
Query: 6 PPLFDGNNYELWAVKIEAYLEALD--VWEAIEEDYEVPPL------PNN---PTMAQLKY 54
PP+ DG NYE W ++ A+L++LD W+A+ + +E P + P N P K
Sbjct: 13 PPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNELKPEEDWTKE 72
Query: 55 HKERKTKKAKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIR 105
E +KA + LF GV + +F I T K W+ LK + G +++
Sbjct: 73 EDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVK 123
>UniRef100_Q9C536 Copia-type polyprotein, putative [Arabidopsis thaliana]
Length = 1320
Score = 59.3 bits (142), Expect = 7e-07
Identities = 25/106 (23%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ +NY+ W+++++A L A DVWE +E+ + P + + Q ++ + + KA
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLRDSRKRDKKAL 70
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
++ G+ E F +++ + K W+ L+ Y G ++++ +++ L
Sbjct: 71 CLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTL 116
>UniRef100_Q9SXB2 T28P6.8 protein [Arabidopsis thaliana]
Length = 1352
Score = 59.3 bits (142), Expect = 7e-07
Identities = 25/106 (23%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ +NY+ W+++++A L A DVWE +E+ + P + + Q ++ + + KA
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLRDSRKRDKKAL 70
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
++ G+ E F +++ + K W+ L+ Y G ++++ +++ L
Sbjct: 71 CLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTL 116
>UniRef100_Q9M2D1 Copia-type polyprotein [Arabidopsis thaliana]
Length = 1352
Score = 59.3 bits (142), Expect = 7e-07
Identities = 25/106 (23%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ +NY+ W+++++A L A DVWE +E+ + P + + Q ++ + + KA
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLRDSRKRDKKAL 70
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
++ G+ E F +++ + K W+ L+ Y G ++++ +++ L
Sbjct: 71 CLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTL 116
>UniRef100_Q9M197 Copia-type reverse transcriptase-like protein [Arabidopsis
thaliana]
Length = 1272
Score = 59.3 bits (142), Expect = 7e-07
Identities = 25/106 (23%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ +NY+ W+++++A L A DVWE +E+ + P + + Q ++ + + KA
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLRDSRKRDKKAL 70
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
++ G+ E F +++ + K W+ L+ Y G ++++ +++ L
Sbjct: 71 CLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTL 116
>UniRef100_Q9C739 Copia-type polyprotein, putative [Arabidopsis thaliana]
Length = 1352
Score = 59.3 bits (142), Expect = 7e-07
Identities = 25/106 (23%), Positives = 58/106 (54%)
Query: 7 PLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKKAKAK 66
P+ +NY+ W+++++A L A DVWE +E+ + P + + Q ++ + + KA
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLRDSRKRDKKAL 70
Query: 67 SCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
++ G+ E F +++ + K W+ L+ Y G ++++ +++ L
Sbjct: 71 CLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTL 116
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.361 0.159 0.550
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,628,378,383
Number of Sequences: 2790947
Number of extensions: 54059672
Number of successful extensions: 217892
Number of sequences better than 10.0: 165
Number of HSP's better than 10.0 without gapping: 37
Number of HSP's successfully gapped in prelim test: 128
Number of HSP's that attempted gapping in prelim test: 217665
Number of HSP's gapped (non-prelim): 277
length of query: 1281
length of database: 848,049,833
effective HSP length: 139
effective length of query: 1142
effective length of database: 460,108,200
effective search space: 525443564400
effective search space used: 525443564400
T: 11
A: 40
X1: 14 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.9 bits)
S2: 81 (35.8 bits)
Medicago: description of AC135101.6