
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147538.1 + phase: 0
(138 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAL47365.1| unknown protein [Arabidopsis thaliana] gi|1839628... 85 4e-16
ref|XP_465920.1| unknown protein [Oryza sativa (japonica cultiva... 79 2e-14
emb|CAG88154.1| unnamed protein product [Debaryomyces hansenii C... 35 0.51
gb|AAL30450.1| phosphatidylinositol phosphate kinase 6 [Dictyost... 35 0.51
ref|XP_643612.1| hypothetical protein DDB0185056 [Dictyostelium ... 35 0.51
emb|CAA86373.1| NO330 [Saccharomyces cerevisiae] gi|1302440|emb|... 33 1.1
emb|CAA91071.1| SPAC22F3.06c [Schizosaccharomyces pombe] gi|1911... 32 2.5
gb|AAG50995.1| hypothetical protein; 36992-33466 [Arabidopsis th... 32 2.5
emb|CAH98657.1| conserved hypothetical protein [Plasmodium berghei] 32 3.3
emb|CAI74862.1| smc protein, putative [Theileria annulata] 32 4.3
ref|YP_001691.1| hypothetical protein LIC11739 [Leptospira inter... 32 4.3
ref|NP_712367.1| hypothetical protein LA2186 [Leptospira interro... 32 4.3
dbj|BAB75488.1| alr3789 [Nostoc sp. PCC 7120] gi|25533211|pir||A... 32 4.3
emb|CAH02602.1| unnamed protein product [Kluyveromyces lactis NR... 31 5.6
ref|XP_598263.1| PREDICTED: similar to Ubiquitin carboxyl-termin... 31 5.6
gb|EAK99620.1| hypothetical protein CaO19.9971 [Candida albicans... 31 5.6
dbj|BAB71816.1| chaperone protein CaMsi3p [Candida albicans] gi|... 31 5.6
dbj|BAB06205.1| signal recognition particle (docking protein) [B... 31 5.6
emb|CAF96253.1| unnamed protein product [Tetraodon nigroviridis] 31 5.6
gb|AAP83181.1| zinc transporter [Danio rerio] gi|41053557|ref|NP... 31 7.4
>gb|AAL47365.1| unknown protein [Arabidopsis thaliana] gi|18396284|ref|NP_564275.1|
expressed protein [Arabidopsis thaliana]
gi|15450914|gb|AAK96728.1| Unknown protein [Arabidopsis
thaliana] gi|9802527|gb|AAF99729.1| F17L21.9
[Arabidopsis thaliana]
Length = 200
Score = 84.7 bits (208), Expect = 4e-16
Identities = 55/141 (39%), Positives = 77/141 (54%), Gaps = 19/141 (13%)
Query: 3 KGGETVKQSEASSPLKQ-VDSAIEKTKGE---TADGDKDEIIKELKKVKKQNFVTHCLLS 58
KG E V E ++ DS+ K K E + +EI K++KKVKKQN +TH LLS
Sbjct: 54 KGKEDVITGEEEEKEEESADSSPSKGKSEGQRQLEESIEEIAKDIKKVKKQNKITHVLLS 113
Query: 59 AMIVLTVAWQLSEVSLVMRVKDGMNHPFRSVGNMFKEMVK-----------DVSGSNGQD 107
A I+LT+ WQLSE S++ +KD ++HP RS+G M M K S SN Q+
Sbjct: 114 ATIILTLVWQLSEYSMIFMLKDRISHPVRSIGGMLNGMFKGKLRPIKNQLAGTSNSNDQN 173
Query: 108 AAGDKENNESTSLPSLKIPDM 128
G N S + P L++P++
Sbjct: 174 NHG----NGSHTGPQLQVPEL 190
>ref|XP_465920.1| unknown protein [Oryza sativa (japonica cultivar-group)]
gi|48716971|dbj|BAD23664.1| unknown protein [Oryza
sativa (japonica cultivar-group)]
Length = 211
Score = 79.0 bits (193), Expect = 2e-14
Identities = 41/124 (33%), Positives = 71/124 (57%), Gaps = 18/124 (14%)
Query: 19 QVDSAIEKTKGETADGDKD--------EIIKELKKVKKQNFVTHCLLSAMIVLTVAWQLS 70
Q D+A+E+ K + + + ++ K+L K+K+QN +TH LL +IV+T WQ +
Sbjct: 85 QNDAALEQVKPNCSHPNPEISKEVKLGDVAKDLNKIKRQNMITHILLGTVIVMTAVWQFN 144
Query: 71 EVSLVMRVKDGMNHPFRSVGNMFKEMVKDVSGSNGQDAAGDKENNESTSLPSLKIPDMTN 130
EVS ++ VK +++PF+S+G++ K +K G K E+ LP + +PD+T
Sbjct: 145 EVSFLLAVKRKLSNPFKSLGDLIKSSLK----------GGGKPMIEAPPLPPVGVPDVTR 194
Query: 131 VDVP 134
D+P
Sbjct: 195 NDLP 198
>emb|CAG88154.1| unnamed protein product [Debaryomyces hansenii CBS767]
gi|50422671|ref|XP_459912.1| unnamed protein product
[Debaryomyces hansenii]
Length = 645
Score = 34.7 bits (78), Expect = 0.51
Identities = 26/98 (26%), Positives = 43/98 (43%), Gaps = 16/98 (16%)
Query: 55 CLLSAMIVLTVAWQLSEVSLVMR------------VKDGMNHPFRSVGNMFKEMVKDVSG 102
C+L+A I TV + L LV++ ++DG+N + + + + K S
Sbjct: 117 CILTAAIGGTVDFALDSTQLVLKSVNETIISVSEDIQDGLNGLLKVINTLLSGVDKVKSF 176
Query: 103 SNGQDAAG----DKENNESTSLPSLKIPDMTNVDVPNF 136
QD DK N +L ++KIP+ D+ NF
Sbjct: 177 FTNQDTDSTEYVDKVNLSIKALQNIKIPNSVLTDIDNF 214
>gb|AAL30450.1| phosphatidylinositol phosphate kinase 6 [Dictyostelium discoideum]
Length = 1589
Score = 34.7 bits (78), Expect = 0.51
Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 5/119 (4%)
Query: 6 ETVKQSEASSPLKQVDSAIEKTKGETADGDKDEIIKELKK--VKKQNFVTHCLLSAMIVL 63
+ +KQ E + K+V++ + K + + +++E IKE +K K NFV ++
Sbjct: 958 QEIKQKEDENQ-KEVENQKQDLKNQKEEKEEEEEIKEQRKFETKNDNFVQVNTKKSVRGS 1016
Query: 64 TVAWQLSEVSLVMRVKDGMNHPFRSVGNMFKEMVKDVSGSNGQDAAGDKENNESTSLPS 122
L + + NH RS+ V D + +N + +K NNEST+ P+
Sbjct: 1017 IFFAPLDSHEKINNISSIGNH--RSLIRNSSSSVLDNNSNNSNNNNNNKNNNESTATPN 1073
>ref|XP_643612.1| hypothetical protein DDB0185056 [Dictyostelium discoideum]
gi|28829686|gb|AAO52202.1| similar to Dictyostelium
discoideum (Slime mold). Phosphatidylinositol phosphate
kinase 6 (Fragment) gi|60471612|gb|EAL69568.1|
hypothetical protein DDB0185056 [Dictyostelium
discoideum]
Length = 1589
Score = 34.7 bits (78), Expect = 0.51
Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 5/119 (4%)
Query: 6 ETVKQSEASSPLKQVDSAIEKTKGETADGDKDEIIKELKK--VKKQNFVTHCLLSAMIVL 63
+ +KQ E + K+V++ + K + + +++E IKE +K K NFV ++
Sbjct: 958 QEIKQKEDENQ-KEVENQKQDLKNQKEEKEEEEEIKEQRKFETKNDNFVQVNTKKSVRGS 1016
Query: 64 TVAWQLSEVSLVMRVKDGMNHPFRSVGNMFKEMVKDVSGSNGQDAAGDKENNESTSLPS 122
L + + NH RS+ V D + +N + +K NNEST+ P+
Sbjct: 1017 IFFAPLDSHEKINNISSIGNH--RSLIRNSSSSVLDNNSNNSNNNNNNKNNNESTATPN 1073
>emb|CAA86373.1| NO330 [Saccharomyces cerevisiae] gi|1302440|emb|CAA96256.1| unnamed
protein product [Saccharomyces cerevisiae]
gi|6324005|ref|NP_014074.1| Protein required for
efficient mating, member of a family of eukaryotic
proteins that contain a domain homologous to Sac1p;
Fig4p [Saccharomyces cerevisiae]
gi|1176581|sp|P42837|FIG4_YEAST Polyphosphoinositide
phosphatase (Phosphatidylinositol 3,5-bisphosphate
5-phosphatase) (Factor induced gene 4)
Length = 879
Score = 33.5 bits (75), Expect = 1.1
Identities = 35/131 (26%), Positives = 59/131 (44%), Gaps = 7/131 (5%)
Query: 9 KQSEASSPLKQVDSA--IEKTKG---ETADGDKDEIIK-ELKKVKKQNFVTHCLLSAMIV 62
KQS ++ LK + S+ +EK +G ET D D+D K EL+ + + +
Sbjct: 701 KQSWLNNKLKMITSSKSLEKAEGRVVETTDLDRDTSPKQELELYEHYLHIISDRSQKLEE 760
Query: 63 LTVAWQLSEVSLVMRVKDGMNHPFRSV-GNMFKEMVKDVSGSNGQDAAGDKENNESTSLP 121
++ S+ + + + P R V G ++ +D + D GD EN+E T+
Sbjct: 761 KMNSFSYSKYPIFISHESSEIPPMRKVIGEPLVDIAEDFTDVYDDDDDGDDENDEMTTEA 820
Query: 122 SLKIPDMTNVD 132
L PD +VD
Sbjct: 821 LLIAPDHVSVD 831
>emb|CAA91071.1| SPAC22F3.06c [Schizosaccharomyces pombe]
gi|19113947|ref|NP_593035.1| hypothetical protein
SPAC22F3.06c [Schizosaccharomyces pombe 972h-]
gi|2130315|pir||S62421 endopeptidase La homolog (EC
3.4.21.-) PIM1 precursor, mitochondrial - fission yeast
(Schizosaccharomyces pombe)
gi|1170811|sp|Q09769|LONM_SCHPO Putative Lon protease
homolog, mitochondrial precursor
Length = 1067
Score = 32.3 bits (72), Expect = 2.5
Identities = 20/61 (32%), Positives = 34/61 (54%), Gaps = 9/61 (14%)
Query: 73 SLVMRVKDGMNHPFRSVGNMFKEMVKDVSGSNGQDAAGDKENNESTSLPSLKIPDMTNVD 132
S+V + D +N +S G K+ S + +DAA N E++S+P LK+PD N++
Sbjct: 781 SIVKEIDDELNSKEKSTGKSGKK----TSPQSSEDAA----NKEASSVP-LKVPDKVNIE 831
Query: 133 V 133
+
Sbjct: 832 I 832
>gb|AAG50995.1| hypothetical protein; 36992-33466 [Arabidopsis thaliana]
gi|15230765|ref|NP_187318.1| fip1 motif-containing
protein [Arabidopsis thaliana]
Length = 980
Score = 32.3 bits (72), Expect = 2.5
Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 2 SKGGETVKQSEASSPLKQVDSAIEKTKGETADGDKD--EIIKELKKVKKQN 50
S GG+T++Q + P K + + EK DK+ EII+E+K V+ N
Sbjct: 875 SNGGDTIEQQDVQKPRKLMGKSEEKAMQNRDINDKEEGEIIEEVKGVEIDN 925
>emb|CAH98657.1| conserved hypothetical protein [Plasmodium berghei]
Length = 725
Score = 32.0 bits (71), Expect = 3.3
Identities = 14/39 (35%), Positives = 22/39 (55%)
Query: 97 VKDVSGSNGQDAAGDKENNESTSLPSLKIPDMTNVDVPN 135
VKDV +N ++ +K NNE + KI D+ ++D N
Sbjct: 510 VKDVENNNTEEKKEEKTNNEKDKIKEKKINDLVDIDKIN 548
>emb|CAI74862.1| smc protein, putative [Theileria annulata]
Length = 1348
Score = 31.6 bits (70), Expect = 4.3
Identities = 22/64 (34%), Positives = 29/64 (44%), Gaps = 16/64 (25%)
Query: 9 KQSEASSPLKQVDSAIEKTKGETADGDKDEIIKELKK----------------VKKQNFV 52
K S S LK ++S+++ K D D + IK+LKK VKKQN V
Sbjct: 794 KISSDESYLKDLESSLKSLKSTIQDQDNENQIKQLKKSLRETQNRKESTYEELVKKQNEV 853
Query: 53 THCL 56
CL
Sbjct: 854 DKCL 857
>ref|YP_001691.1| hypothetical protein LIC11739 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|45600845|gb|AAS70328.1| conserved hypothetical
protein [Leptospira interrogans serovar Copenhageni str.
Fiocruz L1-130]
Length = 1934
Score = 31.6 bits (70), Expect = 4.3
Identities = 27/85 (31%), Positives = 41/85 (47%), Gaps = 6/85 (7%)
Query: 49 QNFVTHCLLSAMIVLTVAWQLSEVSLVMRVKD--GMNHPFRSVGNMFKEMVKDVSGSNGQ 106
+N V H +L L A +++ + LV R+K G P ++ F ++V SGSN +
Sbjct: 695 ENVVFHPILDR--TLFRAGEIASIKLVSRIKKSFGFEIPSQNEYPAFAKLVH--SGSNKE 750
Query: 107 DAAGDKENNESTSLPSLKIPDMTNV 131
+ K N E TS KIP N+
Sbjct: 751 YSISLKWNGEGTSSFQFKIPKEANL 775
>ref|NP_712367.1| hypothetical protein LA2186 [Leptospira interrogans serovar Lai
str. 56601] gi|24195909|gb|AAN49385.1| conserved
hypothetical protein [Leptospira interrogans serovar lai
str. 56601]
Length = 1934
Score = 31.6 bits (70), Expect = 4.3
Identities = 27/85 (31%), Positives = 41/85 (47%), Gaps = 6/85 (7%)
Query: 49 QNFVTHCLLSAMIVLTVAWQLSEVSLVMRVKD--GMNHPFRSVGNMFKEMVKDVSGSNGQ 106
+N V H +L L A +++ + LV R+K G P ++ F ++V SGSN +
Sbjct: 695 ENVVFHPILDR--TLFRAGEIASIKLVSRIKKSFGFEIPSQNEYPAFAKLVH--SGSNKE 750
Query: 107 DAAGDKENNESTSLPSLKIPDMTNV 131
+ K N E TS KIP N+
Sbjct: 751 YSISLKWNGEGTSSFQFKIPKEANL 775
>dbj|BAB75488.1| alr3789 [Nostoc sp. PCC 7120] gi|25533211|pir||AF2279 hypothetical
protein alr3789 [imported] - Nostoc sp. (strain PCC
7120) gi|17231281|ref|NP_487829.1| hypothetical protein
alr3789 [Nostoc sp. PCC 7120]
Length = 766
Score = 31.6 bits (70), Expect = 4.3
Identities = 14/52 (26%), Positives = 30/52 (56%)
Query: 21 DSAIEKTKGETADGDKDEIIKELKKVKKQNFVTHCLLSAMIVLTVAWQLSEV 72
+ AI +TK ++ D D +++E+++ +++ HC M+ + A LSE+
Sbjct: 74 EEAISETKLQSQGIDLDALVREVRQKRQEKIQDHCSTLQMLDIAQAIPLSEI 125
>emb|CAH02602.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
gi|50302161|ref|XP_451014.1| unnamed protein product
[Kluyveromyces lactis]
Length = 297
Score = 31.2 bits (69), Expect = 5.6
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 22 SAIEKTKGETADGDKDEIIKELKKVKKQ 49
+A+ +TKGE GD E +K +KK+K +
Sbjct: 140 AAMIRTKGEAGTGDVSEAVKHIKKIKSE 167
>ref|XP_598263.1| PREDICTED: similar to Ubiquitin carboxyl-terminal hydrolase 35
(Ubiquitin thiolesterase 35) (Ubiquitin-specific
processing protease 35) (Deubiquitinating enzyme 35),
partial [Bos taurus]
Length = 248
Score = 31.2 bits (69), Expect = 5.6
Identities = 14/46 (30%), Positives = 29/46 (62%)
Query: 5 GETVKQSEASSPLKQVDSAIEKTKGETADGDKDEIIKELKKVKKQN 50
GE+ ++ EA+ ++ + EK K E + ++E +KE +KVK+++
Sbjct: 168 GESTEEEEAAGEEEEEEEEEEKVKAEVEEKVEEEKVKEEEKVKEKS 213
>gb|EAK99620.1| hypothetical protein CaO19.9971 [Candida albicans SC5314]
gi|46440224|gb|EAK99532.1| hypothetical protein
CaO19.2435 [Candida albicans SC5314]
Length = 701
Score = 31.2 bits (69), Expect = 5.6
Identities = 26/104 (25%), Positives = 45/104 (43%), Gaps = 3/104 (2%)
Query: 20 VDSAIEKTKGETADGDKDEIIKELKKVKKQNFVTHCLLSAMIVLTVAWQLSEVSLVMRVK 79
V IE +GE D D + +E+KK+ K+N +T SA + E M ++
Sbjct: 504 VKELIEPAEGEEVDEDAEPQYREVKKLVKKNDLTIECESAALPDAELQAFIEKEASMVME 563
Query: 80 DGMNHPFRSVGNMFKEMVKDVSG---SNGQDAAGDKENNESTSL 120
D + N +E + ++ G +D A D+E + + L
Sbjct: 564 DKLVFDTEERKNQLEEYIYELRGKLDEEYKDFASDQEKEKLSGL 607
>dbj|BAB71816.1| chaperone protein CaMsi3p [Candida albicans]
gi|57338491|sp|Q96VB9|HSP7F_CANAL Heat shock protein
homolog SSE1 (Chaperone protein MSI3)
Length = 702
Score = 31.2 bits (69), Expect = 5.6
Identities = 26/104 (25%), Positives = 45/104 (43%), Gaps = 3/104 (2%)
Query: 20 VDSAIEKTKGETADGDKDEIIKELKKVKKQNFVTHCLLSAMIVLTVAWQLSEVSLVMRVK 79
V IE +GE D D + +E+KK+ K+N +T SA + E M ++
Sbjct: 505 VKELIEPAEGEEVDEDAEPQYREVKKLVKKNDLTIECESAALPDAELQAFIEKEASMVME 564
Query: 80 DGMNHPFRSVGNMFKEMVKDVSG---SNGQDAAGDKENNESTSL 120
D + N +E + ++ G +D A D+E + + L
Sbjct: 565 DKLVFDTEERKNQLEEYIYELRGKLDEEYKDFASDQEKEKLSGL 608
>dbj|BAB06205.1| signal recognition particle (docking protein) [Bacillus halodurans
C-125] gi|15615049|ref|NP_243352.1| signal recognition
particle (docking protein) [Bacillus halodurans C-125]
gi|25315141|pir||F83960 signal recognition particle
(docking protein) ftsY [imported] - Bacillus halodurans
(strain C-125)
Length = 330
Score = 31.2 bits (69), Expect = 5.6
Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 10/55 (18%)
Query: 4 GGETVKQSEASSPLKQVDSAIEKTKGETAD----------GDKDEIIKELKKVKK 48
G + +KQSE S P + AI+ K AD +K ++KEL+KVK+
Sbjct: 174 GVDVIKQSEGSDPAAVMFDAIQAAKSREADILICDTAGRLQNKVNLMKELEKVKR 228
>emb|CAF96253.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2446
Score = 31.2 bits (69), Expect = 5.6
Identities = 29/104 (27%), Positives = 44/104 (41%), Gaps = 5/104 (4%)
Query: 5 GETVKQSEASSPLKQVDSAIEKTKGETADGDKDEIIKELKKVKKQNFVTHCLLSAMIVLT 64
G Q + L + S + K G +A G I L K K Q V HCLLS+ I
Sbjct: 1521 GNRAMQLTSVEVLTLLFSELAKVTGGSAKGFASFISDVLSKCKVQKVVLHCLLSS-IFSA 1579
Query: 65 VAWQLSEVSLV--MRVKDGMNHPFRSVGNMFKEMVKDVSGSNGQ 106
W V+ V V++G++ S+ N+ ++ + S Q
Sbjct: 1580 QKWHDQRVAGVNAASVEEGLSED--SIINLSEDQIDSCSAVQSQ 1621
>gb|AAP83181.1| zinc transporter [Danio rerio] gi|41053557|ref|NP_571006.1| solute
carrier family 39 (zinc transporter), member 7 [Danio
rerio]
Length = 444
Score = 30.8 bits (68), Expect = 7.4
Identities = 16/46 (34%), Positives = 23/46 (49%)
Query: 3 KGGETVKQSEASSPLKQVDSAIEKTKGETADGDKDEIIKELKKVKK 48
KGG + S + S K DS E K G+KD+++ + K KK
Sbjct: 243 KGGHSHSHSHSPSAPKSKDSDEEDDKKGQKKGEKDKVVSQQKPTKK 288
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.308 0.127 0.343
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 225,066,061
Number of Sequences: 2540612
Number of extensions: 8149539
Number of successful extensions: 22143
Number of sequences better than 10.0: 42
Number of HSP's better than 10.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 22124
Number of HSP's gapped (non-prelim): 43
length of query: 138
length of database: 863,360,394
effective HSP length: 114
effective length of query: 24
effective length of database: 573,730,626
effective search space: 13769535024
effective search space used: 13769535024
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 67 (30.4 bits)
Medicago: description of AC147538.1