
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0098b.3
(287 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_573059.1| CG8184-PB [Drosophila melanogaster] gi|22832284... 42 0.026
ref|XP_646413.1| hypothetical protein DDB0201906 [Dictyostelium ... 37 0.49
gb|AAS64790.2| CG33558-PA [Drosophila melanogaster] gi|62471665|... 37 0.84
ref|XP_641917.1| hypothetical protein DDB0206172 [Dictyostelium ... 36 1.4
gb|AAM68311.1| CG4527-PA, isoform A [Drosophila melanogaster] gi... 35 1.9
emb|CAB61005.2| Hypothetical protein F15D3.1a [Caenorhabditis el... 35 1.9
emb|CAG84520.1| unnamed protein product [Debaryomyces hansenii C... 35 1.9
ref|XP_467701.1| hypothetical protein [Oryza sativa (japonica cu... 35 1.9
ref|NP_917283.1| OSJNBb0032K15.13 [Oryza sativa (japonica cultiv... 35 1.9
emb|CAE12059.1| polo kinase kinase 1 [Drosophila melanogaster] 35 1.9
emb|CAA10033.1| DYS-1 protein [Caenorhabditis elegans] 35 1.9
gb|AAS64767.1| CG4527-PD, isoform D [Drosophila melanogaster] gi... 35 1.9
gb|AAX52684.1| CG4527-PE, isoform E [Drosophila melanogaster] gi... 35 1.9
dbj|BAD68705.1| hypothetical protein [Oryza sativa (japonica cul... 35 1.9
pir||JC7783 RAD 23B protein - channel catfish 35 2.5
emb|CAB03885.1| Hypothetical protein C14A6.8 [Caenorhabditis ele... 35 3.2
gb|AAC24823.1| synapsin s-syn-long [Loligo pealei] 35 3.2
emb|CAF93411.1| unnamed protein product [Tetraodon nigroviridis] 35 3.2
gb|AAW41500.1| hypothetical protein CNB01350 [Cryptococcus neofo... 34 4.2
ref|NP_149268.1| Arsenate reductase, arsC, protein-tyrosine-phos... 34 4.2
>ref|NP_573059.1| CG8184-PB [Drosophila melanogaster] gi|22832284|gb|AAF48495.2|
CG8184-PB [Drosophila melanogaster]
Length = 5146
Score = 41.6 bits (96), Expect = 0.026
Identities = 30/92 (32%), Positives = 44/92 (47%), Gaps = 4/92 (4%)
Query: 90 HLLEALAMPHPLYDVLNKPTINDMASQVDKALLSSSSSSLSLSHPPPPPPSFTIATIVVV 149
H++EAL L + + N AS +L ++ S S + PPPPPPS + I V
Sbjct: 1485 HVIEALRTNASLEEATDYLLNNPEAS----SLSTTGGQSSSSAPPPPPPPSASTMDIDVD 1540
Query: 150 AIATGSITASNSTVVITNTHCHHHPPLMPLFI 181
A G T S S+ +++ + H LMP I
Sbjct: 1541 VPADGESTQSKSSTSPNSSYDYKHLKLMPSLI 1572
>ref|XP_646413.1| hypothetical protein DDB0201906 [Dictyostelium discoideum]
gi|60474759|gb|EAL72696.1| hypothetical protein
DDB0201906 [Dictyostelium discoideum]
Length = 1007
Score = 37.4 bits (85), Expect = 0.49
Identities = 21/61 (34%), Positives = 31/61 (50%), Gaps = 2/61 (3%)
Query: 110 INDMASQVDKALLSSSSSSLSLSH--PPPPPPSFTIATIVVVAIATGSITASNSTVVITN 167
IN + ++SSSS S+ H PPPPPPS T T + + + T + +T V+
Sbjct: 41 INHSNPSIPTQFITSSSSLDSIPHTPPPPPPPSSTTTTTSLNTLGAPTTTTAPATTVVPP 100
Query: 168 T 168
T
Sbjct: 101 T 101
>gb|AAS64790.2| CG33558-PA [Drosophila melanogaster] gi|62471665|ref|NP_001014500.1|
CG33558-PA [Drosophila melanogaster]
Length = 1998
Score = 36.6 bits (83), Expect = 0.84
Identities = 24/95 (25%), Positives = 42/95 (43%), Gaps = 5/95 (5%)
Query: 86 TTVIHLLEALAMPHPLYDVLNKPTINDMASQVDKALLSSSSSSLSLSHPPPPPPSFTIAT 145
TT + L + +L+K + +S +++S S S + PP PP+ +A
Sbjct: 1613 TTSFNALPHFPLSSSTSSLLSKVSSFSNSSSASPPTTAATSGSASSHYQPPQPPNAAVAN 1672
Query: 146 IVVVAIATGSIT-----ASNSTVVITNTHCHHHPP 175
+AI + S T A + + ++H HH PP
Sbjct: 1673 SKDMAIYSSSFTKNPAAAQSPNMRQAHSHQHHQPP 1707
>ref|XP_641917.1| hypothetical protein DDB0206172 [Dictyostelium discoideum]
gi|60469998|gb|EAL67979.1| hypothetical protein
DDB0206172 [Dictyostelium discoideum]
Length = 783
Score = 35.8 bits (81), Expect = 1.4
Identities = 26/89 (29%), Positives = 43/89 (48%), Gaps = 6/89 (6%)
Query: 96 AMPHPLYDVLNKPTINDMASQVDKALLSSSSSSLSLSHPPPP----PPSFTIATIVVVAI 151
++P V+ K T + + ++ SSSSSS + S PPP PPS + + I+
Sbjct: 552 SLPARPQSVMFKNTPLSVNTSSSQSSSSSSSSSFASSASPPPTPTKPPSSSSSPIITTTS 611
Query: 152 ATGSITASNSTVVITNTHCHHH--PPLMP 178
+ +++T V TN + H PP +P
Sbjct: 612 PNSNTNINSNTSVNTNINPRHSVLPPSLP 640
>gb|AAM68311.1| CG4527-PA, isoform A [Drosophila melanogaster]
gi|24762616|ref|NP_611908.2| CG4527-PA, isoform A
[Drosophila melanogaster]
Length = 1300
Score = 35.4 bits (80), Expect = 1.9
Identities = 32/119 (26%), Positives = 50/119 (41%), Gaps = 22/119 (18%)
Query: 82 ESDSTTVIHLLEALAMPHPLYDVLNKPTINDMASQVDKALLSSSSSSLSLS--------- 132
ESD + +A P PL KP +D + K +S ++ +
Sbjct: 465 ESDKKHFVKKGKAPPPPSPLGLANAKPAASDSQTSPKKLATPEPTSPVTTAIEVAIGQEA 524
Query: 133 -HPPPPPPSFTIATIV-VVAIATGSITASNSTVVITNTHC-----------HHHPPLMP 178
P P PPS T ++IV V ++A+ S + S S V++++ HHH PL P
Sbjct: 525 MEPKPQPPSPTASSIVSVQSVASSSSSGSVSNAVLSSSTSLITINSDPPTPHHHQPLPP 583
>emb|CAB61005.2| Hypothetical protein F15D3.1a [Caenorhabditis elegans]
gi|14530423|emb|CAB61012.2| Hypothetical protein F15D3.1a
[Caenorhabditis elegans] gi|55584033|sp|Q9TW65|DMD_CAEEL
Dystrophin-1 gi|17506447|ref|NP_492946.1| DYStrophin
related (417.4 kD) (dys-1) [Caenorhabditis elegans]
Length = 3674
Score = 35.4 bits (80), Expect = 1.9
Identities = 17/65 (26%), Positives = 37/65 (56%), Gaps = 3/65 (4%)
Query: 214 KVSLDDVVRWSVPFLMVERTMVQGLTDREGKPEQEFIGRNKFVKEKVQEKSEDRIEVDIS 273
K+ LD+VVRW M E+ Q + +G ++ GR +++QE+ +D ++++++
Sbjct: 2181 KLELDEVVRWCE---MAEKEAAQNVNSLDGDGLEKLDGRLAQFTKELQERKDDMVQLEMA 2237
Query: 274 YSTIV 278
+ I+
Sbjct: 2238 KNMII 2242
>emb|CAG84520.1| unnamed protein product [Debaryomyces hansenii CBS767]
gi|50405847|ref|XP_456564.1| unnamed protein product
[Debaryomyces hansenii]
Length = 1152
Score = 35.4 bits (80), Expect = 1.9
Identities = 24/85 (28%), Positives = 37/85 (43%), Gaps = 15/85 (17%)
Query: 114 ASQVDKALLSSSSSSLSLSHPPPPPPSFTIATIV----------VVAIATGSITASNSTV 163
+ + LL S ++S S S PPPPPPS + V +++ + S T S +
Sbjct: 1016 SKSISNPLLRSPTASTSASTPPPPPPSRKVGNPVKPPIGFSSTPLISSRSNSATPSRGSP 1075
Query: 164 VIT-----NTHCHHHPPLMPLFITE 183
+ T N HP L P+ T+
Sbjct: 1076 ISTPSTTGNEQNQEHPKLNPIVPTK 1100
>ref|XP_467701.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
gi|46390300|dbj|BAD15749.1| hypothetical protein [Oryza
sativa (japonica cultivar-group)]
gi|46390566|dbj|BAD16052.1| hypothetical protein [Oryza
sativa (japonica cultivar-group)]
Length = 207
Score = 35.4 bits (80), Expect = 1.9
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 16/78 (20%)
Query: 114 ASQVDKALLSSSSSSLS-------------LSHPPPPPPSFTIATIVVVAIATGSITASN 160
+S A L+S++SS+S H PPPPP+ T A + + S ++S+
Sbjct: 42 SSSSTTARLTSATSSVSRHRHPQPRRRPSGCCHCPPPPPAPTSA---ASSSLSSSSSSSS 98
Query: 161 STVVITNTHCHHHPPLMP 178
S+ V H HHHPP P
Sbjct: 99 SSSVRHCRHRHHHPPPPP 116
>ref|NP_917283.1| OSJNBb0032K15.13 [Oryza sativa (japonica cultivar-group)]
Length = 313
Score = 35.4 bits (80), Expect = 1.9
Identities = 18/51 (35%), Positives = 24/51 (46%), Gaps = 7/51 (13%)
Query: 128 SLSLSHPPPPPPSFTIATIVVVAIATGSITASNSTVVITNTHCHHHPPLMP 178
S LS PPPPPP ++ T + A A S +A + CH H +P
Sbjct: 205 SRPLSQPPPPPPPSSLPTAIPTATAAASFSA-------PDRRCHRHCSRLP 248
>emb|CAE12059.1| polo kinase kinase 1 [Drosophila melanogaster]
Length = 1342
Score = 35.4 bits (80), Expect = 1.9
Identities = 32/119 (26%), Positives = 50/119 (41%), Gaps = 22/119 (18%)
Query: 82 ESDSTTVIHLLEALAMPHPLYDVLNKPTINDMASQVDKALLSSSSSSLSLS--------- 132
ESD + +A P PL KP +D + K +S ++ +
Sbjct: 465 ESDKKHFVKKGKAPPPPSPLGLANAKPAASDSQTSPKKLATPEPTSPVTTAIEVAIGQEA 524
Query: 133 -HPPPPPPSFTIATIV-VVAIATGSITASNSTVVITNTHC-----------HHHPPLMP 178
P P PPS T ++IV V ++A+ S + S S V++++ HHH PL P
Sbjct: 525 MEPKPQPPSPTASSIVSVQSVASSSSSGSVSNAVLSSSTSLITINSDPPTPHHHQPLPP 583
>emb|CAA10033.1| DYS-1 protein [Caenorhabditis elegans]
Length = 3674
Score = 35.4 bits (80), Expect = 1.9
Identities = 17/65 (26%), Positives = 37/65 (56%), Gaps = 3/65 (4%)
Query: 214 KVSLDDVVRWSVPFLMVERTMVQGLTDREGKPEQEFIGRNKFVKEKVQEKSEDRIEVDIS 273
K+ LD+VVRW M E+ Q + +G ++ GR +++QE+ +D ++++++
Sbjct: 2181 KLELDEVVRWCE---MAEKEAAQNVNSLDGDGLEKLDGRLAQFTKELQERKDDMVQLEMA 2237
Query: 274 YSTIV 278
+ I+
Sbjct: 2238 KNMII 2242
>gb|AAS64767.1| CG4527-PD, isoform D [Drosophila melanogaster]
gi|45445395|gb|AAS64766.1| CG4527-PC, isoform C
[Drosophila melanogaster] gi|21626741|gb|AAF47198.2|
CG4527-PB, isoform B [Drosophila melanogaster]
gi|45552821|ref|NP_995936.1| CG4527-PC, isoform C
[Drosophila melanogaster] gi|45552819|ref|NP_995935.1|
CG4527-PD, isoform D [Drosophila melanogaster]
gi|24762614|ref|NP_726441.1| CG4527-PB, isoform B
[Drosophila melanogaster]
Length = 1703
Score = 35.4 bits (80), Expect = 1.9
Identities = 32/119 (26%), Positives = 50/119 (41%), Gaps = 22/119 (18%)
Query: 82 ESDSTTVIHLLEALAMPHPLYDVLNKPTINDMASQVDKALLSSSSSSLSLS--------- 132
ESD + +A P PL KP +D + K +S ++ +
Sbjct: 465 ESDKKHFVKKGKAPPPPSPLGLANAKPAASDSQTSPKKLATPEPTSPVTTAIEVAIGQEA 524
Query: 133 -HPPPPPPSFTIATIV-VVAIATGSITASNSTVVITNTHC-----------HHHPPLMP 178
P P PPS T ++IV V ++A+ S + S S V++++ HHH PL P
Sbjct: 525 MEPKPQPPSPTASSIVSVQSVASSSSSGSVSNAVLSSSTSLITINSDPPTPHHHQPLPP 583
>gb|AAX52684.1| CG4527-PE, isoform E [Drosophila melanogaster]
gi|62471776|ref|NP_001014549.1| CG4527-PE, isoform E
[Drosophila melanogaster]
Length = 1342
Score = 35.4 bits (80), Expect = 1.9
Identities = 32/119 (26%), Positives = 50/119 (41%), Gaps = 22/119 (18%)
Query: 82 ESDSTTVIHLLEALAMPHPLYDVLNKPTINDMASQVDKALLSSSSSSLSLS--------- 132
ESD + +A P PL KP +D + K +S ++ +
Sbjct: 465 ESDKKHFVKKGKAPPPPSPLGLANAKPAASDSQTSPKKLATPEPTSPVTTAIEVAIGQEA 524
Query: 133 -HPPPPPPSFTIATIV-VVAIATGSITASNSTVVITNTHC-----------HHHPPLMP 178
P P PPS T ++IV V ++A+ S + S S V++++ HHH PL P
Sbjct: 525 MEPKPQPPSPTASSIVSVQSVASSSSSGSVSNAVLSSSTSLITINSDPPTPHHHQPLPP 583
>dbj|BAD68705.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
Length = 342
Score = 35.4 bits (80), Expect = 1.9
Identities = 18/51 (35%), Positives = 24/51 (46%), Gaps = 7/51 (13%)
Query: 128 SLSLSHPPPPPPSFTIATIVVVAIATGSITASNSTVVITNTHCHHHPPLMP 178
S LS PPPPPP ++ T + A A S +A + CH H +P
Sbjct: 220 SRPLSQPPPPPPPSSLPTAIPTATAAASFSA-------PDRRCHRHCSRLP 263
>pir||JC7783 RAD 23B protein - channel catfish
Length = 385
Score = 35.0 bits (79), Expect = 2.5
Identities = 24/83 (28%), Positives = 35/83 (41%)
Query: 107 KPTINDMASQVDKALLSSSSSSLSLSHPPPPPPSFTIATIVVVAIATGSITASNSTVVIT 166
KP A+Q SSSSS+ S + P PP + + AT T + + S S+V+
Sbjct: 76 KPKAATAAAQSSTTAASSSSSTSSTTTPTVPPVAASAATTTTTTTTTTTDSTSESSVIEE 135
Query: 167 NTHCHHHPPLMPLFITEVTHTNI 189
P P +T+ NI
Sbjct: 136 KAAEEKPPSSTPASSGSLTNVNI 158
>emb|CAB03885.1| Hypothetical protein C14A6.8 [Caenorhabditis elegans]
gi|17557930|ref|NP_507551.1| predicted CDS, putative
protein (5S648) [Caenorhabditis elegans]
gi|7495970|pir||T19263 hypothetical protein C14A6.8 -
Caenorhabditis elegans
Length = 129
Score = 34.7 bits (78), Expect = 3.2
Identities = 16/64 (25%), Positives = 32/64 (50%)
Query: 97 MPHPLYDVLNKPTINDMASQVDKALLSSSSSSLSLSHPPPPPPSFTIATIVVVAIATGSI 156
M P +D N +D + A+L ++ ++ + PPPPPP T+A +++ +
Sbjct: 1 MGDPFFDPTNNLLKSDGNEYQNLAILDPNAPAVGGNEPPPPPPPLTVAQDALISDNGAKV 60
Query: 157 TASN 160
T+ +
Sbjct: 61 TSGD 64
>gb|AAC24823.1| synapsin s-syn-long [Loligo pealei]
Length = 503
Score = 34.7 bits (78), Expect = 3.2
Identities = 31/125 (24%), Positives = 53/125 (41%), Gaps = 8/125 (6%)
Query: 18 SGLWMKMQSSNLMSVDDIIRYSFSEWVASFSSFFGGSCSLYVELLAKKHG--LLLDINLG 75
SG W K + + M + + WV S FGG + VE L K G ++++N
Sbjct: 309 SGNW-KANTGSAMLEQIQMNEKYKLWVDECSQLFGGLDIVAVEALQGKDGREYIIEVNDS 367
Query: 76 YRWVICESDSTTVIHLLEALAMPHPLYDVLNKPTINDMASQVDKALLSSSSSSLSLSHPP 135
++ E+ + E + +Y KP N M+ + + S++ S + PP
Sbjct: 368 SMALLGETQEEDRRLIAEMVLQKMHMYC---KP--NTMSQAMSSGTIQSAADSTATPPPP 422
Query: 136 PPPPS 140
PP P+
Sbjct: 423 PPRPA 427
>emb|CAF93411.1| unnamed protein product [Tetraodon nigroviridis]
Length = 580
Score = 34.7 bits (78), Expect = 3.2
Identities = 23/68 (33%), Positives = 31/68 (44%), Gaps = 5/68 (7%)
Query: 77 RWVICESDSTTVIHLLEALAMPHPLYDVLNKPTINDMASQVDKALLSSSS-----SSLSL 131
RW ++HLLE L + P +LN + + + V +L SS S L
Sbjct: 55 RWGSRNGRVRDLLHLLEGLELLRPRDLILNGQSSPPLRTPVTSSLRPDSSPFEGVSCLKP 114
Query: 132 SHPPPPPP 139
S PPPPPP
Sbjct: 115 SPPPPPPP 122
>gb|AAW41500.1| hypothetical protein CNB01350 [Cryptococcus neoformans var.
neoformans JEC21] gi|58262794|ref|XP_568807.1|
hypothetical protein CNB01350 [Cryptococcus neoformans
var. neoformans JEC21]
Length = 858
Score = 34.3 bits (77), Expect = 4.2
Identities = 21/43 (48%), Positives = 25/43 (57%), Gaps = 2/43 (4%)
Query: 105 LNKPTINDM--ASQVDKALLSSSSSSLSLSHPPPPPPSFTIAT 145
LNKP+ +S + LLS SS S SLS PPP PP+ AT
Sbjct: 781 LNKPSSRSPRESSILSHPLLSGSSQSSSLSPPPPTPPNTQRAT 823
>ref|NP_149268.1| Arsenate reductase, arsC, protein-tyrosine-phosphatase family
enzyme [Clostridium acetobutylicum ATCC 824]
gi|14994420|gb|AAK76850.1| Arsenate reductase, arsC,
protein-tyrosine-phosphatase family enzyme [Clostridium
acetobutylicum ATCC 824]
Length = 136
Score = 34.3 bits (77), Expect = 4.2
Identities = 17/47 (36%), Positives = 29/47 (61%)
Query: 226 PFLMVERTMVQGLTDREGKPEQEFIGRNKFVKEKVQEKSEDRIEVDI 272
PF++ + T GL D GK ++EFI K ++EKV++ ++ I +I
Sbjct: 88 PFVLSKHTEDWGLDDPSGKSDEEFIRTAKTIEEKVKDLAKRIINKEI 134
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.319 0.134 0.395
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 492,513,520
Number of Sequences: 2540612
Number of extensions: 20516493
Number of successful extensions: 116177
Number of sequences better than 10.0: 64
Number of HSP's better than 10.0 without gapping: 17
Number of HSP's successfully gapped in prelim test: 52
Number of HSP's that attempted gapping in prelim test: 115983
Number of HSP's gapped (non-prelim): 161
length of query: 287
length of database: 863,360,394
effective HSP length: 127
effective length of query: 160
effective length of database: 540,702,670
effective search space: 86512427200
effective search space used: 86512427200
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 74 (33.1 bits)
Lotus: description of TM0098b.3