
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0022.14
(334 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAA73364.1| Pge1 protein [Lotus corniculatus var. japonicus] 60 1e-07
gb|EAA70801.1| hypothetical protein FG04147.1 [Gibberella zeae P... 37 0.83
gb|AAF56538.1| CG14540-PA [Drosophila melanogaster] gi|24650208|... 37 0.83
emb|CAA90971.1| Hypothetical protein C09G9.1 [Caenorhabditis ele... 36 1.4
ref|NP_013513.1| Possible U3 snoRNP protein involved in maturati... 36 1.4
ref|YP_236049.1| cointegrate resolution protein T [Pseudomonas s... 36 1.8
emb|CAG88269.1| unnamed protein product [Debaryomyces hansenii C... 35 2.4
gb|EAL03518.1| hypothetical protein CaO19.12424 [Candida albican... 35 3.1
gb|EAL03396.1| hypothetical protein CaO19.4959 [Candida albicans... 35 3.1
dbj|BAB14821.1| unnamed protein product [Homo sapiens] 35 4.1
emb|CAG62752.1| unnamed protein product [Candida glabrata CBS138... 35 4.1
gb|AAN17675.1| MLL5 [Homo sapiens] gi|33636768|ref|NP_891847.1| ... 35 4.1
gb|AAM74947.1| MLL5 [Homo sapiens] gi|23503327|ref|NP_061152.2| ... 35 4.1
gb|AAN76325.1| myeloid/lymphoid or mixed-lineage leukemia 5 [Hom... 35 4.1
gb|EAA52603.1| hypothetical protein MG05295.4 [Magnaporthe grise... 35 4.1
gb|AAN71720.1| putative gag protein [Danio rerio] 34 5.4
dbj|BAE01311.1| unnamed protein product [Macaca fascicularis] 33 9.1
gb|AAH48022.1| LOC398577 protein [Xenopus laevis] 33 9.1
>emb|CAA73364.1| Pge1 protein [Lotus corniculatus var. japonicus]
Length = 210
Score = 59.7 bits (143), Expect = 1e-07
Identities = 23/50 (46%), Positives = 37/50 (74%)
Query: 44 KVFASRDEILEWARNLGKQHGFIIVITRSDNGGLKRKTFMILGCERCDKY 93
+ FAS ++++WAR +GK++G+++++ RSD G KRK + LGCER KY
Sbjct: 155 EAFASHTDLIDWARCVGKENGYVVIVIRSDYGSAKRKPLITLGCERGGKY 204
>gb|EAA70801.1| hypothetical protein FG04147.1 [Gibberella zeae PH-1]
gi|46116610|ref|XP_384323.1| hypothetical protein
FG04147.1 [Gibberella zeae PH-1]
Length = 207
Score = 37.0 bits (84), Expect = 0.83
Identities = 22/74 (29%), Positives = 37/74 (49%), Gaps = 2/74 (2%)
Query: 45 VFASRDEILEWARNLGKQHGFIIVITRSDNGGLKRKTFMILGCERCDKYVPYKEVLKHQS 104
+F + D+++ + + K G+ IV R+ N + T L C+R V Y K ++
Sbjct: 16 IFRTFDDLMASVQRVAKDQGYGIVKLRASNYRDGKPTRYDLVCDRGG--VKYNSTAKKRN 73
Query: 105 TGTKKCYCPFRLRA 118
T+K CPFR +A
Sbjct: 74 PSTRKIDCPFRAKA 87
>gb|AAF56538.1| CG14540-PA [Drosophila melanogaster] gi|24650208|ref|NP_651451.1|
CG14540-PA [Drosophila melanogaster]
Length = 613
Score = 37.0 bits (84), Expect = 0.83
Identities = 29/99 (29%), Positives = 42/99 (42%), Gaps = 20/99 (20%)
Query: 163 YPLDA---FSSKQLMTPDQSHSFGGVTLVTKKEIKRKN--SYDSTPTLILSRGAQSVSPQ 217
YP+D +S K+LM P Q + ++R N S+ + L GA PQ
Sbjct: 178 YPMDDNDDYSDKELMVPQQGNM---------GNMRRHNPHSHAHQMQMQLQSGAMQHPPQ 228
Query: 218 HSFRQVSQTVDTVLHNPPRHLESSSPRRVNHREPVRHGH 256
H Q Q HN HL+ +R+ H +P +H H
Sbjct: 229 HQLHQQQQMTT---HN---HLKHQQQQRLQHSQPHQHQH 261
>emb|CAA90971.1| Hypothetical protein C09G9.1 [Caenorhabditis elegans]
gi|17538606|ref|NP_501536.1| putative protein of
bilaterial origin (46.1 kD) (4J628) [Caenorhabditis
elegans] gi|7495776|pir||T19149 hypothetical protein
C09G9.1 - Caenorhabditis elegans
Length = 416
Score = 36.2 bits (82), Expect = 1.4
Identities = 44/193 (22%), Positives = 68/193 (34%), Gaps = 37/193 (19%)
Query: 77 LKRKTFMILGCERCDKYVPYKEVLKHQSTGTKKCYCPFRLRARGTKSSTNMFTKQPISLV 136
LK T +L + Y +L H+S+ L R T S ++ +
Sbjct: 137 LKSLTATVLSGYLLSRAAKYMGILDHRSSRGSVSPARSVLSRRQTASRSSHAAEV----- 191
Query: 137 ITLDSPPPMG---STVHAITSTSFFDFFYYPLDAFSSKQLMTPDQSHSFGGVTLVTKKEI 193
+PPP T ++ STS++ LD SS QL TP QSH G +T
Sbjct: 192 ----TPPPASYAPETPRSVASTSYYSPSELHLDTTSSVQLETPPQSHESGSFMPMTPS-- 245
Query: 194 KRKNSYDSTPTLILSRGAQSVSPQHSFRQVSQTVDTV--LHNPPRHLESSSPRRVNHREP 251
+ +HS + Q ++ H R+L S + +NH P
Sbjct: 246 ---------------------ATEHSSSEFGQDLEDAGDRHENYRYLHSYALNNLNHSRP 284
Query: 252 VRHGHLTMGVVRQ 264
+ L + Q
Sbjct: 285 KKETILRRTTMNQ 297
>ref|NP_013513.1| Possible U3 snoRNP protein involved in maturation of pre-18S rRNA,
based on computational analysis of large-scale
protein-protein interaction data; Utp21p [Saccharomyces
cerevisiae] gi|625119|gb|AAB82361.1| Ylr409cp
[Saccharomyces cerevisiae]
gi|15214400|sp|Q06078|YL09_YEAST Hypothetical 104.8 kDa
Trp-Asp repeats containing protein in RPL31B-VIP1
intergenic region gi|1084649|pir||S55965 probable
membrane protein YLR409c - yeast (Saccharomyces
cerevisiae)
Length = 939
Score = 36.2 bits (82), Expect = 1.4
Identities = 29/102 (28%), Positives = 41/102 (39%), Gaps = 12/102 (11%)
Query: 150 HAITSTSFFDFFYYPLDAFSSKQLMTPDQSHSFGGVTLVTKKEIKRKNSYDSTPTLILSR 209
H TS D +Y LD S ++ S+GGVT T + P ++ S
Sbjct: 263 HLSVGTSSGDLIFYDLDRRSRIHVLKNIHRESYGGVTQAT--------FLNGQPIIVTSG 314
Query: 210 GAQSVSPQHSFRQVSQTVDTVLHNPPRHLESSSPRRVNHREP 251
G S+ +SQ V+ PPR+L S R H +P
Sbjct: 315 GDNSLKEYVFDPSLSQGSGDVVVQPPRYLRS----RGGHSQP 352
>ref|YP_236049.1| cointegrate resolution protein T [Pseudomonas syringae pv. syringae
B728a] gi|63256915|gb|AAY38011.1| cointegrate resolution
protein T [Pseudomonas syringae pv. syringae B728a]
Length = 336
Score = 35.8 bits (81), Expect = 1.8
Identities = 20/70 (28%), Positives = 35/70 (49%), Gaps = 1/70 (1%)
Query: 209 RGAQSVSPQHS-FRQVSQTVDTVLHNPPRHLESSSPRRVNHREPVRHGHLTMGVVRQSTH 267
R Q V +H+ Q +Q ++T LH+ + S + + RE + H + R+
Sbjct: 134 RTLQDVQTEHARLLQANQDLETRLHDKDGQIHSLEEKHQHAREALEHYRNAIREQREQEQ 193
Query: 268 RRREGQLQRI 277
RR EGQ+Q++
Sbjct: 194 RRHEGQVQQL 203
>emb|CAG88269.1| unnamed protein product [Debaryomyces hansenii CBS767]
gi|50422877|ref|XP_460016.1| unnamed protein product
[Debaryomyces hansenii]
Length = 481
Score = 35.4 bits (80), Expect = 2.4
Identities = 33/135 (24%), Positives = 59/135 (43%), Gaps = 14/135 (10%)
Query: 7 SEVDKPEEKPLTGKLVRVEDVQDEPLAVD---YTQSFTTDKVFASRDEILEWARNLGKQH 63
SE + +P+ +++ V + + + T+ F ++V +RD++ E+ + + +
Sbjct: 121 SEYNIQIPEPIIDSKLKIYPVTENTITAEGNLITRPFP-EQVLHNRDDLNEFIQEFARDN 179
Query: 64 GFIIVITRSDN---------GGLKRKTFMILGCERCDKYVPYKEVLKHQSTGTKKCYCPF 114
GF +VI S+ GG R+ G E + +T TKK CPF
Sbjct: 180 GFGVVIAHSNKKAIYYTCELGGRYRQKKSKKGMEDARHLEVDNGYILDPNTKTKKLRCPF 239
Query: 115 RLRARGTKSSTNMFT 129
+ A K ST M+T
Sbjct: 240 SMTAT-YKKSTGMWT 253
>gb|EAL03518.1| hypothetical protein CaO19.12424 [Candida albicans SC5314]
Length = 573
Score = 35.0 bits (79), Expect = 3.1
Identities = 31/101 (30%), Positives = 44/101 (42%), Gaps = 21/101 (20%)
Query: 43 DKVFASRDEILEWARNLGKQHGFIIVITRSDNGGLKRKTFMILGCERCDKYVPYK----E 98
++VF SRDE+ E+ + +GF +VI S+ K + CE +Y K +
Sbjct: 208 EQVFNSRDELNEFIAEFARDNGFGVVIAHSN------KKAIYYTCELGGRYRHKKNKKID 261
Query: 99 VLKHQSTG----------TKKCYCPFRLRARGTKSSTNMFT 129
V K G TKK CPF + A K S N +T
Sbjct: 262 VTKQIDVGDGYMLDPDTKTKKLKCPFAMTA-SYKKSANAWT 301
>gb|EAL03396.1| hypothetical protein CaO19.4959 [Candida albicans SC5314]
Length = 573
Score = 35.0 bits (79), Expect = 3.1
Identities = 31/101 (30%), Positives = 44/101 (42%), Gaps = 21/101 (20%)
Query: 43 DKVFASRDEILEWARNLGKQHGFIIVITRSDNGGLKRKTFMILGCERCDKYVPYK----E 98
++VF SRDE+ E+ + +GF +VI S+ K + CE +Y K +
Sbjct: 208 EQVFNSRDELNEFIAEFARDNGFGVVIAHSN------KKAIYYTCELGGRYRHKKNKKID 261
Query: 99 VLKHQSTG----------TKKCYCPFRLRARGTKSSTNMFT 129
V K G TKK CPF + A K S N +T
Sbjct: 262 VTKQIDVGDGYMLDPDTKTKKLKCPFAMTA-SYKKSANAWT 301
>dbj|BAB14821.1| unnamed protein product [Homo sapiens]
Length = 589
Score = 34.7 bits (78), Expect = 4.1
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 15/128 (11%)
Query: 138 TLDSPPPM---GSTVHAITSTSFFDFFYYPLDAFSSKQLMTPDQSHSFGGVTLVTKKEIK 194
T SPP M GS I + + F P + S T S + GV L K E++
Sbjct: 402 TCKSPPKMSKPGSPGSVIPAQAHGKIFTKPDPQWDS----TVSASEAENGVHL--KTELQ 455
Query: 195 RKNSYDSTPTLILSRGAQSVSPQHSFRQVSQ---TVDTVLHNPPR-HLESSSPRRVNHRE 250
+K ++ L + Q+ ++S Q+SQ +V T LH PP HLE+ P+
Sbjct: 456 QKQLSNNNQALSKNHPPQT-HVRNSSEQLSQKLPSVPTKLHCPPSPHLENP-PKSSTPHT 513
Query: 251 PVRHGHLT 258
PV+HG+L+
Sbjct: 514 PVQHGYLS 521
>emb|CAG62752.1| unnamed protein product [Candida glabrata CBS138]
gi|50294726|ref|XP_449774.1| unnamed protein product
[Candida glabrata]
Length = 936
Score = 34.7 bits (78), Expect = 4.1
Identities = 27/102 (26%), Positives = 43/102 (41%), Gaps = 12/102 (11%)
Query: 150 HAITSTSFFDFFYYPLDAFSSKQLMTPDQSHSFGGVTLVTKKEIKRKNSYDSTPTLILSR 209
H +S D +Y L+ + L+ S FGGVT R + + P ++ S
Sbjct: 261 HICVGSSKGDILFYDLNRRARIHLLKNVHSEEFGGVT--------RASFLNGQPIIVTSG 312
Query: 210 GAQSVSPQHSFRQVSQTVDTVLHNPPRHLESSSPRRVNHREP 251
G S+ +SQ+ + ++ PPR L S R H +P
Sbjct: 313 GDNSLKEYVFDPSLSQSDEDMVVQPPRFLRS----RGGHSQP 350
>gb|AAN17675.1| MLL5 [Homo sapiens] gi|33636768|ref|NP_891847.1| myeloid/lymphoid or
mixed-lineage leukemia 5 [Homo sapiens]
Length = 1858
Score = 34.7 bits (78), Expect = 4.1
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 15/128 (11%)
Query: 138 TLDSPPPM---GSTVHAITSTSFFDFFYYPLDAFSSKQLMTPDQSHSFGGVTLVTKKEIK 194
T SPP M GS I + + F P + S T S + GV L K E++
Sbjct: 1347 TCKSPPKMSKPGSPGSVIPAQAHGKIFTKPDPQWDS----TVSASEAENGVHL--KTELQ 1400
Query: 195 RKNSYDSTPTLILSRGAQSVSPQHSFRQVSQ---TVDTVLHNPPR-HLESSSPRRVNHRE 250
+K ++ L + Q+ ++S Q+SQ +V T LH PP HLE+ P+
Sbjct: 1401 QKQLSNNNQALSKNHPPQT-HVRNSSEQLSQKLPSVPTKLHCPPSPHLENP-PKSSTPHT 1458
Query: 251 PVRHGHLT 258
PV+HG+L+
Sbjct: 1459 PVQHGYLS 1466
>gb|AAM74947.1| MLL5 [Homo sapiens] gi|23503327|ref|NP_061152.2| myeloid/lymphoid or
mixed-lineage leukemia 5 [Homo sapiens]
Length = 1858
Score = 34.7 bits (78), Expect = 4.1
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 15/128 (11%)
Query: 138 TLDSPPPM---GSTVHAITSTSFFDFFYYPLDAFSSKQLMTPDQSHSFGGVTLVTKKEIK 194
T SPP M GS I + + F P + S T S + GV L K E++
Sbjct: 1347 TCKSPPKMSKPGSPGSVIPAQAHGKIFTKPDPQWDS----TVSASEAENGVHL--KTELQ 1400
Query: 195 RKNSYDSTPTLILSRGAQSVSPQHSFRQVSQ---TVDTVLHNPPR-HLESSSPRRVNHRE 250
+K ++ L + Q+ ++S Q+SQ +V T LH PP HLE+ P+
Sbjct: 1401 QKQLSNNNQALSKNHPPQT-HVRNSSEQLSQKLPSVPTKLHCPPSPHLENP-PKSSTPHT 1458
Query: 251 PVRHGHLT 258
PV+HG+L+
Sbjct: 1459 PVQHGYLS 1466
>gb|AAN76325.1| myeloid/lymphoid or mixed-lineage leukemia 5 [Homo sapiens]
Length = 1778
Score = 34.7 bits (78), Expect = 4.1
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 15/128 (11%)
Query: 138 TLDSPPPM---GSTVHAITSTSFFDFFYYPLDAFSSKQLMTPDQSHSFGGVTLVTKKEIK 194
T SPP M GS I + + F P + S T S + GV L K E++
Sbjct: 1267 TCKSPPKMSKPGSPGSVIPAQAHGKIFTKPDPQWDS----TVSASEAENGVHL--KTELQ 1320
Query: 195 RKNSYDSTPTLILSRGAQSVSPQHSFRQVSQ---TVDTVLHNPPR-HLESSSPRRVNHRE 250
+K ++ L + Q+ ++S Q+SQ +V T LH PP HLE+ P+
Sbjct: 1321 QKQLSNNNQALSKNHPPQT-HVRNSSEQLSQKLPSVPTKLHCPPSPHLENP-PKSSTPHT 1378
Query: 251 PVRHGHLT 258
PV+HG+L+
Sbjct: 1379 PVQHGYLS 1386
>gb|EAA52603.1| hypothetical protein MG05295.4 [Magnaporthe grisea 70-15]
gi|39939890|ref|XP_359482.1| hypothetical protein
MG05295.4 [Magnaporthe grisea 70-15]
Length = 466
Score = 34.7 bits (78), Expect = 4.1
Identities = 20/74 (27%), Positives = 37/74 (49%), Gaps = 2/74 (2%)
Query: 45 VFASRDEILEWARNLGKQHGFIIVITRSDNGGLKRKTFMILGCERCDKYVPYKEVLKHQS 104
++ S +++L + K+ G+ +V R+ N + T L C+R V Y K ++
Sbjct: 248 IYRSFEDLLSAVQQFSKEQGYGVVKLRASNYRDGKPTRYDLVCDRGG--VKYSSTAKKRN 305
Query: 105 TGTKKCYCPFRLRA 118
T+K CP+R +A
Sbjct: 306 PSTRKVDCPWRAKA 319
>gb|AAN71720.1| putative gag protein [Danio rerio]
Length = 612
Score = 34.3 bits (77), Expect = 5.4
Identities = 38/166 (22%), Positives = 61/166 (35%), Gaps = 5/166 (3%)
Query: 163 YPLDAFSSKQLMTPDQSHSFGGVTLVTKKEIKRKNSYDS--TPTLILSRGAQSVSPQHSF 220
YP + S +L P QS + T K+ R + +S TP + ++ +
Sbjct: 161 YPQHSTSQTELSHPGQSKKATQKSSTTSKKASRPHIQESRTTPAISFQHTETPLANISTI 220
Query: 221 RQVSQTVDTVLHNPPRHLESSSPRRVNHREPVRHGHLTMGVVRQSTHRRREGQLQRIILT 280
+Q T L PP SS+P H + H H + R QL +
Sbjct: 221 SHANQPFTTSLTWPPAPHSSSTPSPPLHTTAISHSHSQPPIPNLP---RTSTQLIHTTSS 277
Query: 281 SINHTRQLSQYACCMNEMPVDKMHSGGMGINGSALTPATVGPFRPA 326
SI++ + LS + P + S + S+ T A P P+
Sbjct: 278 SIHNAQPLSNPFTLSSIPPYNPPPSLHQALTHSSSTDAAQHPTVPS 323
>dbj|BAE01311.1| unnamed protein product [Macaca fascicularis]
Length = 358
Score = 33.5 bits (75), Expect = 9.1
Identities = 27/113 (23%), Positives = 55/113 (47%), Gaps = 12/113 (10%)
Query: 166 DAFSSKQLMTPDQSHSFGGVTL-VTKKEIKRKNSYDSTPTLILSRGAQSVSPQHSF-RQV 223
D F K +TP Q H G + T EI+++NS ++ A S SP+ + R
Sbjct: 236 DEFFKKCKVTPSQEHLNGPLPEPFTNGEIQKENSREALAE------AASESPRPTLVRSA 289
Query: 224 SQTVDTVLHN---PPRHLESSSPRRVNHREPVRHGHLTMGVVRQSTHRRREGQ 273
S L++ PP+ +S++P + +P+ ++++ + ++ H++R +
Sbjct: 290 SSDTSEELNSQDSPPKQ-DSTAPSSTSSSDPILDFNISLAMAKERAHQKRSSK 341
>gb|AAH48022.1| LOC398577 protein [Xenopus laevis]
Length = 936
Score = 33.5 bits (75), Expect = 9.1
Identities = 19/63 (30%), Positives = 29/63 (45%)
Query: 191 KEIKRKNSYDSTPTLILSRGAQSVSPQHSFRQVSQTVDTVLHNPPRHLESSSPRRVNHRE 250
KE++R+ Y +T TL S +SP+ +FR +T + H + P R
Sbjct: 863 KELRRQTCYPATSTLRESLSPSRLSPERTFRPSERTYQSPERTFRSHERTFRPPERTSRS 922
Query: 251 PVR 253
PVR
Sbjct: 923 PVR 925
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.319 0.133 0.394
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 570,603,402
Number of Sequences: 2540612
Number of extensions: 22757702
Number of successful extensions: 48774
Number of sequences better than 10.0: 18
Number of HSP's better than 10.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 48766
Number of HSP's gapped (non-prelim): 19
length of query: 334
length of database: 863,360,394
effective HSP length: 128
effective length of query: 206
effective length of database: 538,162,058
effective search space: 110861383948
effective search space used: 110861383948
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 75 (33.5 bits)
Lotus: description of TM0022.14