
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146752.15 + phase: 0 /pseudo
(178 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
gb|AAM91276.1| putative protein [Arabidopsis thaliana] gi|182530...  113  3e-24
emb|CAB80269.1| putative protein [Arabidopsis thaliana] gi|33675...  108  9e-23
sp|P48917|NU4M_PICCA NADH-ubiquinone oxidoreductase chain 4 (NAD...   33  2.7
ref|YP_033043.1| ABC transporter, permease protein [Bartonella h...   33  3.6
ref|NP_345798.1| hypothetical protein SP1340 [Streptococcus pneu...   33  3.6
ref|NP_692345.1| hypothetical protein OB1424 [Oceanobacillus ihe...   33  4.7
ref|NP_850243.1| F-box family protein [Arabidopsis thaliana]          32  6.1
ref|NP_220854.1| AMPG PROTEIN (ampG1) [Rickettsia prowazekii str...   32  6.1
gb|EAL19902.1| hypothetical protein CNBG0450 [Cryptococcus neofo...   32  6.1
gb|AAW44786.1| hexose transport-related protein, putative [Crypt...   32  6.1
emb|CAH98581.1| conserved hypothetical protein [Plasmodium berghei]   32  6.1
ref|YP_180552.1| putative integral membrane protein [Ehrlichia r...   32  7.9
emb|CAI28168.1| Conserved hypothetical protein [Ehrlichia rumina...   32  7.9
gb|AAM44904.1| unknown protein [Arabidopsis thaliana] gi|1433472...   32  7.9
gb|AAC34348.1| Unknown protein [Arabidopsis thaliana] gi|7487004...   32  7.9
ref|NP_377771.1| hypothetical anaerobic dimethyl sulfoxide reduc...   32  7.9
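Note: the one-line hit summaries above can be read mechanically. The sketch below is a minimal illustration, not part of the BLAST output; the function name parse_hit_line is hypothetical. It assumes that the last two whitespace-separated fields of each summary line are the bit score and the E value, and that the first field is the sequence identifier.

```python
def parse_hit_line(line):
    """Split one hit-summary line into (sequence id, bit score, E value).

    Assumption: the last two whitespace-separated fields are the bit score
    and the E value, as in the summary table of this report.
    """
    description, score, evalue = line.rsplit(None, 2)
    # Older BLAST releases may print very small E values as "e-100",
    # with no leading digit; prefix a 1 so float() accepts the string.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    seq_id = description.split()[0]
    return seq_id, float(score), float(evalue)


hit = parse_hit_line(
    "gb|AAM91276.1| putative protein [Arabidopsis thaliana] gi|182530... 113 3e-24"
)
print(hit)
```

Splitting from the right keeps the parse independent of how the description is padded or truncated.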
>gb|AAM91276.1| putative protein [Arabidopsis thaliana] gi|18253011|gb|AAL62432.1|
putative protein [Arabidopsis thaliana]
gi|22329188|ref|NP_195278.2|
phosphatidylinositolglycan-related [Arabidopsis
thaliana]
Length = 195
Score = 113 bits (282), Expect = 3e-24
Identities = 69/181 (38%), Positives = 103/181 (56%), Gaps = 16/181 (8%)
Query: 5 RYNYVHDQ--KYPSEDVDIHHIVLRSNGGKYF---FVYASALIVLACGIYLYLLEEKSIS 59
RY Y+H+ K E +DIHH+++ + G + + L+ LA +Y L ++
Sbjct: 10 RYTYIHESGSKSTREAIDIHHVIINGSSGTGYARRWGLGFFLVFLASSMYFLLGKDNPAR 69
Query: 60 LVYYSLLFDIFLVKLLLRLPVIKVLKLILVILTKLSFSCRVCCDYASFWSTA*----DSL 115
+ + L FLV L R V K +IL +F ++ Y S + + D +
Sbjct: 70 TLSWGCLLSGFLVMLHSRKFVKKESVIILP-----TFGIQLETQYLSGKTVSRFIPIDKI 124
Query: 116 YEPVLLECVTPVTCYWTLSLIVREESEMVLVYKNLRPPVKMLVHVWKALCAA--TDNKGE 173
+PVL+ECVTP+TCYW+LSL +R E ++ LV+K LRPP+KMLV +WKALCAA TD++ E
Sbjct: 125 LKPVLVECVTPITCYWSLSLFLRGEEQLTLVFKELRPPLKMLVPIWKALCAAIGTDHQSE 184
Query: 174 T 174
T
Sbjct: 185 T 185
>emb|CAB80269.1| putative protein [Arabidopsis thaliana] gi|3367571|emb|CAA20023.1|
putative protein [Arabidopsis thaliana]
gi|7486656|pir||T04658 hypothetical protein F8D20.40 -
Arabidopsis thaliana
Length = 204
Score = 108 bits (269), Expect = 9e-23
Identities = 67/190 (35%), Positives = 105/190 (55%), Gaps = 25/190 (13%)
Query: 5 RYNYVHDQ--KYPSEDVDIHHIVLRSNGGK---------YFFVYASALIVLACGIY---L 50
RY Y+H+ K E +DIHH+++ + G +F V+ ++ + G +
Sbjct: 10 RYTYIHESGSKSTREAIDIHHVIINGSSGTGYARRWGLGFFLVFLASSMYFLLGKLKGII 69
Query: 51 YLLEEKSISLVYYSLLFDIFLVKLLLRLPVIKVLKLILVILTKLSFSCRVCCDYASFWST 110
++ ++ + + L FLV L R V K +IL +F ++ Y S +
Sbjct: 70 HMKQDNPARTLSWGCLLSGFLVMLHSRKFVKKESVIILP-----TFGIQLETQYLSGKTV 124
Query: 111 A*----DSLYEPVLLECVTPVTCYWTLSLIVREESEMVLVYKNLRPPVKMLVHVWKALCA 166
+ D + +PVL+ECVTP+TCYW+LSL +R E ++ LV+K LRPP+KMLV +WKALCA
Sbjct: 125 SRFIPIDKILKPVLVECVTPITCYWSLSLFLRGEEQLTLVFKELRPPLKMLVPIWKALCA 184
Query: 167 A--TDNKGET 174
A TD++ ET
Sbjct: 185 AIGTDHQSET 194
>sp|P48917|NU4M_PICCA NADH-ubiquinone oxidoreductase chain 4 (NADH dehydrogenase subunit
4) gi|11466061|ref|NP_038220.1| NADH dehydrogenase
subunit 4 [Pichia canadensis]
gi|11036349|dbj|BAA06575.2| NADH dehydrogenase subunit 4
[Pichia canadensis]
Length = 511
Score = 33.5 bits (75), Expect = 2.7
Identities = 28/154 (18%), Positives = 66/154 (42%), Gaps = 29/154 (18%)
Query: 8 YVHDQKYPSEDVDIHHIVLRSNGGKYFFVY---ASALIVLACGIYLYLLEEKSISLVY-- 62
Y + K + ++ ++I R Y F+Y +S ++L+ GIY+Y++ + +Y
Sbjct: 161 YNGNNKIIIKGINNNNITPREKAAYYIFIYTLFSSLFMLLSIGIYIYMINNIDYNNIYNI 220
Query: 63 ------YSLLFDIFLVKLLLRLPVIKVLKLILVILTKLSFSCRVCCDYASFWSTA*DSLY 116
S++F ++ +L++ PV V + ++ + S +
Sbjct: 221 ILSIDLQSIIFIGLIIGILVKTPVFPVHTWLPLVHAESPISGSI---------------- 264
Query: 117 EPVLLECVTPVTCYWTLSLIVREESEMVLVYKNL 150
+L + + Y + LI+ S+++++Y L
Sbjct: 265 --ILAGIIIKLAIYAIIRLILINLSDVIIIYNPL 296
>ref|YP_033043.1| ABC transporter, permease protein [Bartonella henselae str.
Houston-1] gi|49237807|emb|CAF27002.1| ABC transporter,
permease protein [Bartonella henselae str. Houston-1]
Length = 293
Score = 33.1 bits (74), Expect = 3.6
Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 14/96 (14%)
Query: 41 LIVLACGIYLYLLEEKSISLVYYSLLFDIFLVKLLLRLPVIKVLKLILVILTKLSFSCRV 100
LI LA G+ + L++ +I L++ LLF L I L+L+ L C +
Sbjct: 105 LIALAAGVIIVSLKDSTIDLLH--LLFGSVLA--------IDTQALLLITTIMLITVCNL 154
Query: 101 CCDYASFWSTA*DSLYEPVLLECVTPVTCYWTLSLI 136
C FW ++P+ + ++P++ Y +SL+
Sbjct: 155 CI----FWRALVVESFDPLFFKSLSPLSKYIHVSLL 186
>ref|NP_345798.1| hypothetical protein SP1340 [Streptococcus pneumoniae TIGR4]
gi|15903244|ref|NP_358794.1| hypothetical protein
spr1201 [Streptococcus pneumoniae R6]
gi|15458835|gb|AAL00005.1| Hypothetical protein
[Streptococcus pneumoniae R6] gi|14972823|gb|AAK75438.1|
hypothetical protein [Streptococcus pneumoniae TIGR4]
gi|25508898|pir||H98021 hypothetical protein spr1201
[imported] - Streptococcus pneumoniae (strain R6)
gi|25389054|pir||E95155 hypothetical protein SP1340
[imported] - Streptococcus pneumoniae (strain TIGR4)
Length = 439
Score = 33.1 bits (74), Expect = 3.6
Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 11/71 (15%)
Query: 30 GGKYFFVYASALIVLACGIYLYLLEEKSISLVYYSL----LFDIFLVKLLLRLPVIKVLK 85
G F + ++LI +LL K ++L+YYSL + D+ L+ L+L ++ ++
Sbjct: 366 GEAIFLTFVTSLIA-------FLLNIKIMALIYYSLEDILIDDMNLLGLILPNFIVSIIL 418
Query: 86 LILVILTKLSF 96
IL+ +TK S+
Sbjct: 419 FILIFITKSSY 429
>ref|NP_692345.1| hypothetical protein OB1424 [Oceanobacillus iheyensis HTE831]
gi|22777106|dbj|BAC13380.1| hypothetical conserved
protein [Oceanobacillus iheyensis HTE831]
Length = 95
Score = 32.7 bits (73), Expect = 4.7
Identities = 21/79 (26%), Positives = 43/79 (53%), Gaps = 8/79 (10%)
Query: 21 IHHIVLRSNG-GKYFFVYASALIVLACGIYLYLLEEK----SISLVYYSLLFDIFLVKL- 74
+H +L S G G F+++ +VL+ Y +K +VY L+F +F++ +
Sbjct: 4 VHEFILDSFGVGNIFWIFYVINLVLSAVAYKLGFAKKLPIAKNIIVYLLLMFGVFILTIF 63
Query: 75 --LLRLPVIKVLKLILVIL 91
++R+P ++ L +I+V+L
Sbjct: 64 STVMRMPTVECLLVIIVVL 82
>ref|NP_850243.1| F-box family protein [Arabidopsis thaliana]
Length = 163
Score = 32.3 bits (72), Expect = 6.1
Identities = 13/30 (43%), Positives = 21/30 (69%)
Query: 18 DVDIHHIVLRSNGGKYFFVYASALIVLACG 47
DVD+HHI + +NGG+ +Y A+++L G
Sbjct: 103 DVDLHHIGIAANGGQKEAIYMYAMLLLCRG 132
>ref|NP_220854.1| AMPG PROTEIN (ampG1) [Rickettsia prowazekii str. Madrid E]
gi|3861030|emb|CAA14930.1| AMPG PROTEIN (ampG1)
[Rickettsia prowazekii] gi|7467579|pir||H71706 ampg
protein (ampG1) RP475 - Rickettsia prowazekii
Length = 452
Score = 32.3 bits (72), Expect = 6.1
Identities = 32/107 (29%), Positives = 50/107 (45%), Gaps = 14/107 (13%)
Query: 3 DTRYNYVHDQKYPSEDVDIHHIVLRSNGGKYFFVYASA-LIVLACGIYLYLLEEKSISLV 61
+T Y+YV + Y D+++ + N YF + SA L+ + G Y + I+L
Sbjct: 212 NTSYSYVA-RCYAIADMNLKNKFFIKNYFNYFKNFISAYLLKIFSGFYFH---RNDINLA 267
Query: 62 YYSLLFDIFLVKLLLRLPVIKVLKLILVILTKLSFS-------CRVC 101
YY +L IFLV L RLP + +I L L ++ C+ C
Sbjct: 268 YYIILILIFLV--LYRLPDNLINVMINPFLLHLGYNAFEIASVCKFC 312
>gb|EAL19902.1| hypothetical protein CNBG0450 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 516
Score = 32.3 bits (72), Expect = 6.1
Identities = 17/45 (37%), Positives = 25/45 (54%)
Query: 24 IVLRSNGGKYFFVYASALIVLACGIYLYLLEEKSISLVYYSLLFD 68
I L S G KY+ VY +I+ +Y Y++E K +L +L FD
Sbjct: 440 IALDSIGYKYYAVYMPLVIIQWFLVYFYMVETKGYTLEEIALAFD 484
>gb|AAW44786.1| hexose transport-related protein, putative [Cryptococcus neoformans
var. neoformans JEC21] gi|57228328|gb|AAW44785.1| hexose
transport-related protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
gi|58269874|ref|XP_572093.1| hexose transport-related
protein, putative [Cryptococcus neoformans var.
neoformans JEC21] gi|58269872|ref|XP_572092.1| hexose
transport-related protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 516
Score = 32.3 bits (72), Expect = 6.1
Identities = 17/45 (37%), Positives = 25/45 (54%)
Query: 24 IVLRSNGGKYFFVYASALIVLACGIYLYLLEEKSISLVYYSLLFD 68
I L S G KY+ VY +I+ +Y Y++E K +L +L FD
Sbjct: 440 IALDSIGYKYYAVYMPLVIIQWFLVYFYMVETKGYTLEEIALAFD 484
>emb|CAH98581.1| conserved hypothetical protein [Plasmodium berghei]
Length = 244
Score = 32.3 bits (72), Expect = 6.1
Identities = 22/94 (23%), Positives = 47/94 (49%), Gaps = 5/94 (5%)
Query: 4 TRYNYVHDQKYPSE--DVDIHHIVL-RSNGGKYFFVYASALIVLACGIYLYLLEEKSISL 60
++ N + ++K S+ D +I HI+L R G F + L + C LY+ E+ I+
Sbjct: 90 SKKNIIENKKNLSDLIDANIEHIILYRMQFGILFILLL--LFLYTCLAILYIYHEQIITF 147
Query: 61 VYYSLLFDIFLVKLLLRLPVIKVLKLILVILTKL 94
Y ++ +L+K+ + + +L+++ L +
Sbjct: 148 FYSNIEMRNYLIKVFFIITIESLLEVLAAFLNNI 181
>ref|YP_180552.1| putative integral membrane protein [Ehrlichia ruminantium str.
Welgevonden] gi|58418014|emb|CAI27218.1| Conserved
hypothetical protein [Ehrlichia ruminantium str.
Welgevonden] gi|57161495|emb|CAH58421.1| putative
integral membrane protein [Ehrlichia ruminantium str.
Welgevonden] gi|58579388|ref|YP_197600.1| hypothetical
protein ERWE_CDS_07240 [Ehrlichia ruminantium str.
Welgevonden]
Length = 881
Score = 32.0 bits (71), Expect = 7.9
Identities = 16/50 (32%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 34 FFVYASALIVLACGIYLYLLEEKSISLVYYSLLFDIFLVKLLLRLPVIKV 83
FF+ S +++L+C +Y+ L+E I +Y + DIFL K + + +I +
Sbjct: 370 FFISVSCIVILSCAVYV-LIEGVDIYGIYIPGIKDIFLDKSIYCISLILI 418
>emb|CAI28168.1| Conserved hypothetical protein [Ehrlichia ruminantium str. Gardel]
gi|58617443|ref|YP_196642.1| hypothetical protein
ERGA_CDS_07160 [Ehrlichia ruminantium str. Gardel]
Length = 881
Score = 32.0 bits (71), Expect = 7.9
Identities = 16/50 (32%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 34 FFVYASALIVLACGIYLYLLEEKSISLVYYSLLFDIFLVKLLLRLPVIKV 83
FF+ S +++L+C +Y+ L+E I +Y + DIFL K + + +I +
Sbjct: 370 FFISVSCIVILSCAVYV-LIEGVDIYGIYIPGIKDIFLDKSIYCISLILI 418
>gb|AAM44904.1| unknown protein [Arabidopsis thaliana] gi|14334724|gb|AAK59540.1|
unknown protein [Arabidopsis thaliana]
gi|18411404|ref|NP_565152.1| expressed protein
[Arabidopsis thaliana]
Length = 484
Score = 32.0 bits (71), Expect = 7.9
Identities = 26/89 (29%), Positives = 45/89 (50%), Gaps = 6/89 (6%)
Query: 23 HIVLRSNGGKYF---FVYASALIVLACGIYLYLLEEKSISLVYYSLLFDIFLVKLLLRLP 79
H L + G+Y + AS +V+A + +YL+ E S Y FL+ L+L +P
Sbjct: 28 HPNLGVDSGQYLTWPILSASVFVVIAILLPMYLIFEHLAS--YNQPEEQKFLIGLILMVP 85
Query: 80 VIKVLKLILVILTKLSFSCRVCCD-YASF 107
V V + ++ ++ +F+C V D Y +F
Sbjct: 86 VYAVESFLSLVNSEAAFNCEVIRDCYEAF 114
>gb|AAC34348.1| Unknown protein [Arabidopsis thaliana] gi|7487004|pir||T00451
hypothetical protein T14N5.8 - Arabidopsis thaliana
Length = 500
Score = 32.0 bits (71), Expect = 7.9
Identities = 26/89 (29%), Positives = 45/89 (50%), Gaps = 6/89 (6%)
Query: 23 HIVLRSNGGKYF---FVYASALIVLACGIYLYLLEEKSISLVYYSLLFDIFLVKLLLRLP 79
H L + G+Y + AS +V+A + +YL+ E S Y FL+ L+L +P
Sbjct: 28 HPNLGVDSGQYLTWPILSASVFVVIAILLPMYLIFEHLAS--YNQPEEQKFLIGLILMVP 85
Query: 80 VIKVLKLILVILTKLSFSCRVCCD-YASF 107
V V + ++ ++ +F+C V D Y +F
Sbjct: 86 VYAVESFLSLVNSEAAFNCEVIRDCYEAF 114
>ref|NP_377771.1| hypothetical anaerobic dimethyl sulfoxide reductase [Sulfolobus
tokodaii str. 7] gi|15622890|dbj|BAB66880.1| 391aa long
hypothetical anaerobic dimethyl sulfoxide reductase
[Sulfolobus tokodaii str. 7]
Length = 391
Score = 32.0 bits (71), Expect = 7.9
Identities = 26/107 (24%), Positives = 50/107 (46%), Gaps = 13/107 (12%)
Query: 54 EEKSISLVYYSLLFDIFLVKLLLRLPVIKVLKLILVIL---------TKLSFSCRVCCDY 104
EE+ I L+ +++L ++ L L+ R+P ++ LIL+ + + + RV +
Sbjct: 179 EERYIELLLFTILSELSLGILITRIPFYSLISLILLAIGLVPSIFHVNRRERAYRVIMNL 238
Query: 105 ASFWSTA----*DSLYEPVLLECVTPVTCYWTLSLIVREESEMVLVY 147
S W + + +LLE VTP Y ++ LI +++Y
Sbjct: 239 KSSWLSREVFFGGLSFLSLLLELVTPSLYYLSVILISLSVFSSIMIY 285
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.331 0.144 0.456
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 293,973,202
Number of Sequences: 2540612
Number of extensions: 10546665
Number of successful extensions: 30175
Number of sequences better than 10.0: 16
Number of HSP's better than 10.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 30162
Number of HSP's gapped (non-prelim): 18
length of query: 178
length of database: 863,360,394
effective HSP length: 119
effective length of query: 59
effective length of database: 561,027,566
effective search space: 33100626394
effective search space used: 33100626394
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 71 (32.0 bits)
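Note: the footer values above are related by the standard Karlin-Altschul formulas, bit score S' = (Lambda*S - ln K) / ln 2 and E = m*n * 2^(-S'), where m*n is the effective search space. The sketch below is a minimal check of that arithmetic and is not part of the BLAST output. The function names are hypothetical, and the constants are copied from the gapped Lambda/K line and the effective-length lines of this report. Because the report prints Lambda and K rounded, the recomputed values only approximately match the reported ones.

```python
import math

# Constants copied from the report footer (gapped Lambda and K).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_QUERY_LENGTH = 59
EFFECTIVE_DB_LENGTH = 561_027_566


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


# Effective search space = effective query length * effective database length.
search_space = EFFECTIVE_QUERY_LENGTH * EFFECTIVE_DB_LENGTH

print(search_space)                      # compare with "effective search space"
print(bit_score(282))                    # top hit, reported as 113 bits
print(expect_value(282, search_space))   # top hit, reported as 3e-24
print(bit_score(71))                     # S2 cutoff, reported as 32.0 bits
```

The X1 and S1 bit values in the footer appear to use the ungapped Lambda and K (0.331 and 0.144) instead; passing those as lam and k reproduces them to within rounding.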