
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0043b.1
(188 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q61ZC4 Hypothetical protein CBG03168 [Caenorhabditis b... 38 0.12
UniRef100_UPI000021A304 UPI000021A304 UniRef100 entry 35 1.3
UniRef100_Q8SQR0 Putative NUCLEAR PROTEIN OF THE SMC FAMILY [Enc... 34 1.7
UniRef100_Q74GN6 Hypothetical protein [Geobacter sulfurreducens] 33 2.9
UniRef100_Q9ASF5 Putative transcription factor [Oryza sativa] 33 3.8
UniRef100_Q6FN20 Similar to sp|Q04437 Saccharomyces cerevisiae Y... 33 3.8
UniRef100_Q92817 Envoplakin [Homo sapiens] 33 3.8
UniRef100_UPI0000305489 UPI0000305489 UniRef100 entry 33 4.9
UniRef100_UPI000027994B UPI000027994B UniRef100 entry 33 4.9
UniRef100_Q26471 Blackjack [Schistocerca americana] 33 4.9
UniRef100_Q5T7P4 Involucrin [Homo sapiens] 33 4.9
UniRef100_P07476 Involucrin [Homo sapiens] 33 4.9
UniRef100_UPI0000499955 UPI0000499955 UniRef100 entry 32 6.4
UniRef100_UPI00002E2A5E UPI00002E2A5E UniRef100 entry 32 6.4
UniRef100_UPI000029D0C7 UPI000029D0C7 UniRef100 entry 32 6.4
UniRef100_Q9YI03 Nuclear oncoprotein [Brachydanio rerio] 32 6.4
UniRef100_Q8F2R5 Hypothetical protein [Leptospira interrogans] 32 6.4
UniRef100_Q72SS4 Hypothetical protein [Leptospira interrogans] 32 6.4
UniRef100_Q8T2A1 Similar to Staphylococcus epidermidis. Putative... 32 6.4
UniRef100_Q8PSG1 ATP-dependent protease La [Methanosarcina mazei] 32 6.4
>UniRef100_Q61ZC4 Hypothetical protein CBG03168 [Caenorhabditis briggsae]
Length = 2898
Score = 38.1 bits (87), Expect = 0.12
Identities = 24/73 (32%), Positives = 39/73 (52%), Gaps = 3/73 (4%)
Query: 111 QEALKSEIDAWYVQIFCQEKSKEELLKEIRKKKEPKHPPAPQGPQPSQMELLMASMQRLE 170
QEA + E + W + +EK E + +E R+K PPAP PQ +++ L +S + E
Sbjct: 265 QEAAEKEWNQWKEYLQLEEKEPENVTEE-RQKTPEAEPPAP--PQTTRVLLASSSDEDQE 321
Query: 171 TTHGEMREENRAQ 183
GE++ E +Q
Sbjct: 322 KNGGELKSEEVSQ 334
>UniRef100_UPI000021A304 UPI000021A304 UniRef100 entry
Length = 1783
Score = 34.7 bits (78), Expect = 1.3
Identities = 20/63 (31%), Positives = 33/63 (51%), Gaps = 2/63 (3%)
Query: 128 QEKSKEELLKEIRKKKE-PKHPPAPQGPQPSQMEL-LMASMQRLETTHGEMREENRAQYR 185
+EK KE+ + +++KKE P PP P+ P+P M + ++ L H +E +R
Sbjct: 1023 KEKEKEKEKEPVKEKKEAPAPPPVPEIPKPRTMPCDICRQLEPLGDQHITCKECRMTVHR 1082
Query: 186 ACY 188
CY
Sbjct: 1083 NCY 1085
>UniRef100_Q8SQR0 Putative NUCLEAR PROTEIN OF THE SMC FAMILY [Encephalitozoon
cuniculi]
Length = 1025
Score = 34.3 bits (77), Expect = 1.7
Identities = 22/91 (24%), Positives = 47/91 (51%), Gaps = 4/91 (4%)
Query: 91 LPSIITDLCKFHN-VPILSGHQEALKSEIDAWYVQIFCQEKSKEELLKEIRKKKEPKHPP 149
L ++ +L H +L +E+LK++ + + E+ K+E+L + + + PP
Sbjct: 291 LVKVVEELRDIHQETEMLGVDEESLKNKREKRMTNL---ERLKKEILDLQDEVSKVEIPP 347
Query: 150 APQGPQPSQMELLMASMQRLETTHGEMREEN 180
P+GP +++E+L M L G+++ E+
Sbjct: 348 QPRGPDEARIEVLEEKMSGLMRARGKIQHES 378
>UniRef100_Q74GN6 Hypothetical protein [Geobacter sulfurreducens]
Length = 320
Score = 33.5 bits (75), Expect = 2.9
Identities = 20/84 (23%), Positives = 42/84 (49%), Gaps = 2/84 (2%)
Query: 82 IQRDAPVMILPSIITDLCKFHNVPILSGHQEALKSE--IDAWYVQIFCQEKSKEELLKEI 139
+ D V+ S +TD+ + VP++S + +S + AW + ++ +++ EI
Sbjct: 205 VSTDNTVVSAMSALTDVAMKNKVPVMSADPSSAESHPVLAAWGFDYYKMGRATGKMVAEI 264
Query: 140 RKKKEPKHPPAPQGPQPSQMELLM 163
K K+P+ P + S ++LL+
Sbjct: 265 LKGKKPEQLPTRFMTKASDVDLLV 288
>UniRef100_Q9ASF5 Putative transcription factor [Oryza sativa]
Length = 307
Score = 33.1 bits (74), Expect = 3.8
Identities = 26/101 (25%), Positives = 43/101 (41%), Gaps = 4/101 (3%)
Query: 13 NEEEELTGGIPRRHMTQLSQVWTSLLLRNIWPSLHVTSLPVTKENLLYSILEGHSVDIAA 72
N ++ +P R ++ W S + + + S V SL + +L Y+IL+G IAA
Sbjct: 87 NRWAQIAKHLPGRTDNEVKNFWNSTIKKKLI-SQAVGSLHPSSADLYYNILDGAGQSIAA 145
Query: 73 VMSNAIADA---IQRDAPVMILPSIITDLCKFHNVPILSGH 110
NA+ +A + + P + S H L GH
Sbjct: 146 ASLNAVENAAHGVTQSPPSSVYNSAAWASFSSHQPIFLPGH 186
>UniRef100_Q6FN20 Similar to sp|Q04437 Saccharomyces cerevisiae YMR106c [Candida
glabrata]
Length = 615
Score = 33.1 bits (74), Expect = 3.8
Identities = 35/142 (24%), Positives = 54/142 (37%), Gaps = 20/142 (14%)
Query: 38 LLRNIWPSLHVTSLPVTKENLLYSILEGHSVDIAAVMSNAIADAIQRDAPVMI------- 90
L+ S + LP +E Y E D V+ D + P+M+
Sbjct: 466 LMAGFVESRSMDDLPRVEEKKYYEFYEDSVRDTTLVLPMKSKDETRERDPLMVPAMGLHR 525
Query: 91 --------LPSIITDLCKFHNVPILSGHQEALKSEIDAWYVQIFCQEKSKEELL--KEIR 140
L ++ + + + P L H LK++I V F + + LL K++
Sbjct: 526 QLQVMLEYLHQVVINKSEKFDPPELQDH---LKAKITPHIVSDFPELEELVNLLELKKVE 582
Query: 141 KKKEPKHPPAPQGPQPSQMELL 162
K + PPAP P PS ELL
Sbjct: 583 PKSKEFKPPAPAEPVPSLEELL 604
>UniRef100_Q92817 Envoplakin [Homo sapiens]
Length = 2033
Score = 33.1 bits (74), Expect = 3.8
Identities = 27/97 (27%), Positives = 40/97 (40%), Gaps = 22/97 (22%)
Query: 107 LSGHQEALKSEIDAWYVQIFCQEK----------------SKEELLKEIRKKKEPKHPPA 150
+ HQEALK E + CQE S + L ++ + K+ PA
Sbjct: 304 IQAHQEALKMEWQNFLNLCICQETQLQHVEDYRRFQEEADSVSQTLAKLNSNLDAKYSPA 363
Query: 151 PQGPQPSQMELLM---ASMQRLETTH---GEMREENR 181
P GP + ELL A +RL T G+++ +R
Sbjct: 364 PGGPPGAPTELLQQLEAEEKRLAVTERATGDLQRRSR 400
>UniRef100_UPI0000305489 UPI0000305489 UniRef100 entry
Length = 482
Score = 32.7 bits (73), Expect = 4.9
Identities = 29/124 (23%), Positives = 54/124 (43%), Gaps = 14/124 (11%)
Query: 64 EGHSVDIAAVMSNAIADA-IQRDAPVMILPSIITDLCKFHNVPILSGHQEALKSEIDAWY 122
E H +IA + + AIA+A Q +A + + + QE L+ E++
Sbjct: 197 EAHEKEIAEIKAQAIAEAKRQAEAEAQTIRDKLQ-----QEIDAAKAAQEQLRKEVE--- 248
Query: 123 VQIFCQEKSKEELLKEIRKKKEPKHPPAPQGPQPSQMELLMASMQRLETTHGEMREENRA 182
+ K +E L++I ++++ A Q ++E A+ ++LE E R A
Sbjct: 249 -----EIKLAKEKLEQIEQERKDAEAKAAQEQLQRELEEAKAAKEQLEREEAEERARVEA 303
Query: 183 QYRA 186
+ RA
Sbjct: 304 EERA 307
>UniRef100_UPI000027994B UPI000027994B UniRef100 entry
Length = 184
Score = 32.7 bits (73), Expect = 4.9
Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 6/89 (6%)
Query: 88 VMILPSIITDLCKFHNVPILSGHQEALKSEIDAWYVQIFCQEKSKEELLKEIRKKKEPKH 147
V+ + I L + +N P L +E K ++D QI +K + ++ K+I KK+ K
Sbjct: 96 VVAIGVIFMTLKENYNEPQLVKSKEIEKKQVDK-KTQISKDQKKQLQIKKKIEKKESTKK 154
Query: 148 PPAPQGP-----QPSQMELLMASMQRLET 171
AP+ P PSQ++ L+ L+T
Sbjct: 155 AEAPKTPPIKEKDPSQVKELVLPNLNLKT 183
>UniRef100_Q26471 Blackjack [Schistocerca americana]
Length = 1547
Score = 32.7 bits (73), Expect = 4.9
Identities = 24/101 (23%), Positives = 51/101 (49%), Gaps = 3/101 (2%)
Query: 49 TSLPVTKE-NLLYSILEGHSVDIAAVMSNA--IADAIQRDAPVMILPSIITDLCKFHNVP 105
T +P E N I+E ++++ +++ I+ I ++P+ +L ++ T+L +
Sbjct: 440 TDVPKAGEANENVDIVEQKPTEVSSDLASGAPISQIISSESPIELLDTVKTELTELDKDT 499
Query: 106 ILSGHQEALKSEIDAWYVQIFCQEKSKEELLKEIRKKKEPK 146
+ S H+ AL +E D+ + E+ +E L K + +K K
Sbjct: 500 LSSEHERALLAEADSELSESRSCEQLEEYLSKSSQNEKSEK 540
>UniRef100_Q5T7P4 Involucrin [Homo sapiens]
Length = 585
Score = 32.7 bits (73), Expect = 4.9
Identities = 30/99 (30%), Positives = 44/99 (44%), Gaps = 7/99 (7%)
Query: 83 QRDAPVMILPSIITDLCKFHNVPILSGHQEALKSEIDAWYVQIFCQEKSKEELLKEIRKK 142
Q PV + P++ +L K P+ + HQE +K + CQ K EL E+ K
Sbjct: 4 QHTLPVTLSPALSQELLKTVPPPV-NTHQEQMKQPTP---LPPPCQ-KVPVELPVEVPSK 58
Query: 143 KEPKHPPAPQG--PQPSQMELLMASMQRLETTHGEMREE 179
+E KH A +G Q + + Q L+ H E EE
Sbjct: 59 QEEKHMTAVKGLPEQECEQQQKEPQEQELQQQHWEQHEE 97
>UniRef100_P07476 Involucrin [Homo sapiens]
Length = 585
Score = 32.7 bits (73), Expect = 4.9
Identities = 30/99 (30%), Positives = 44/99 (44%), Gaps = 7/99 (7%)
Query: 83 QRDAPVMILPSIITDLCKFHNVPILSGHQEALKSEIDAWYVQIFCQEKSKEELLKEIRKK 142
Q PV + P++ +L K P+ + HQE +K + CQ K EL E+ K
Sbjct: 4 QHTLPVTLSPALSQELLKTVPPPV-NTHQEQMKQPTP---LPPPCQ-KVPVELPVEVPSK 58
Query: 143 KEPKHPPAPQG--PQPSQMELLMASMQRLETTHGEMREE 179
+E KH A +G Q + + Q L+ H E EE
Sbjct: 59 QEEKHMTAVKGLPEQECEQQQKEPQEQELQQQHWEQHEE 97
>UniRef100_UPI0000499955 UPI0000499955 UniRef100 entry
Length = 1166
Score = 32.3 bits (72), Expect = 6.4
Identities = 16/52 (30%), Positives = 29/52 (55%)
Query: 91 LPSIITDLCKFHNVPILSGHQEALKSEIDAWYVQIFCQEKSKEELLKEIRKK 142
L I D + P L GH +A++ I++ V+I C E+++ +L +I K+
Sbjct: 456 LAHYIVDWVQCQKYPYLFGHNQAMQEWIESHLVKINCNEENEYQLSIQIEKE 507
>UniRef100_UPI00002E2A5E UPI00002E2A5E UniRef100 entry
Length = 244
Score = 32.3 bits (72), Expect = 6.4
Identities = 29/124 (23%), Positives = 54/124 (43%), Gaps = 14/124 (11%)
Query: 64 EGHSVDIAAVMSNAIADA-IQRDAPVMILPSIITDLCKFHNVPILSGHQEALKSEIDAWY 122
E H +IA + + AIA+A Q +A + + + QE L+ E++
Sbjct: 99 EAHEKEIAEIKAQAIAEAKRQAEAEAQTIRDKLQ-----QEIDAAKAAQEQLRKEVE--- 150
Query: 123 VQIFCQEKSKEELLKEIRKKKEPKHPPAPQGPQPSQMELLMASMQRLETTHGEMREENRA 182
+ K +E L++I ++++ A Q ++E A+ ++LE E R A
Sbjct: 151 -----EIKLAKEKLEQIEQERKDVEAKAAQEQLQRELEEAKAAKEQLEREEAEERARVEA 205
Query: 183 QYRA 186
+ RA
Sbjct: 206 EERA 209
>UniRef100_UPI000029D0C7 UPI000029D0C7 UniRef100 entry
Length = 288
Score = 32.3 bits (72), Expect = 6.4
Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 50 SLPVTKENLLYSILEG--HSVDIAAVMSNAIADAIQRDAPVMILPSIITDLCKFHNVPIL 107
++PV K + L G +DI A+ + + +A D V +I D+ KF N +
Sbjct: 159 TIPVKKGEPISEKLAGLLTKLDIKAIEAGIVLNAALEDGLVYQEQEMIIDVEKFRN-DLA 217
Query: 108 SGHQEALKSEIDAWYV 123
HQEA+ I+A Y+
Sbjct: 218 QAHQEAVSLSIEAAYI 233
>UniRef100_Q9YI03 Nuclear oncoprotein [Brachydanio rerio]
Length = 727
Score = 32.3 bits (72), Expect = 6.4
Identities = 33/119 (27%), Positives = 54/119 (44%), Gaps = 14/119 (11%)
Query: 75 SNAIADAIQRDAPVMILPSIITDLCKFHNVPILSGHQ----EALKSEIDAWYVQI---FC 127
S++ D PV LP+++T P+LSG + L+SE++ +
Sbjct: 491 SSSAKDLSSPSGPVATLPAVVTPAAGPPEGPLLSGGEGHAMNNLESELETLRQALDNGLD 550
Query: 128 QEKSKEELLKEI---RKKKEPKHPPAPQGPQPSQMELLMASMQRLETTHGEMREENRAQ 183
++SKE+ L EI R K+E K A Q + Q EL + + E ++RE A+
Sbjct: 551 SKESKEKFLHEIVKMRVKQEEKLGSALQAKRSLQQELEFLRVAKKE----KLREATEAK 605
>UniRef100_Q8F2R5 Hypothetical protein [Leptospira interrogans]
Length = 269
Score = 32.3 bits (72), Expect = 6.4
Identities = 23/98 (23%), Positives = 44/98 (44%), Gaps = 6/98 (6%)
Query: 47 HVTSLPVTKENLLYSILEGHSVDIAAVMSNAIADAIQRDAPVMILPSIITDLCKFHNVPI 106
H + +E +L +L+ + + SN+I + + D PV+ P D + PI
Sbjct: 61 HKEIMDQAEEEILEELLQNGDIQTSTDSSNSIEEDLNEDVPVLEPPIFSMDSTNEPDFPI 120
Query: 107 LSGHQEALKSEIDAWYVQIFCQ-EKSKEELLKEIRKKK 143
S K E+ ++++ + + KS L+K +R K
Sbjct: 121 PSE-----KGELSLFFLKFYGKGSKSHSRLVKVLRLSK 153
>UniRef100_Q72SS4 Hypothetical protein [Leptospira interrogans]
Length = 269
Score = 32.3 bits (72), Expect = 6.4
Identities = 23/98 (23%), Positives = 44/98 (44%), Gaps = 6/98 (6%)
Query: 47 HVTSLPVTKENLLYSILEGHSVDIAAVMSNAIADAIQRDAPVMILPSIITDLCKFHNVPI 106
H + +E +L +L+ + + SN+I + + D PV+ P D + PI
Sbjct: 61 HKEIMDQAEEEILEELLQNGDIQTSTDSSNSIEEDLNEDVPVLEPPIFSMDSTNEPDFPI 120
Query: 107 LSGHQEALKSEIDAWYVQIFCQ-EKSKEELLKEIRKKK 143
S K E+ ++++ + + KS L+K +R K
Sbjct: 121 PSE-----KGELSLFFLKFYGKGSKSHSRLVKVLRLSK 153
>UniRef100_Q8T2A1 Similar to Staphylococcus epidermidis. Putative cell-surface adhesin
SdrF [Dictyostelium discoideum]
Length = 3763
Score = 32.3 bits (72), Expect = 6.4
Identities = 17/57 (29%), Positives = 29/57 (50%)
Query: 65 GHSVDIAAVMSNAIADAIQRDAPVMILPSIITDLCKFHNVPILSGHQEALKSEIDAW 121
G +V + +N+IA ++ + +L +++T F NVPIL G+Q S W
Sbjct: 2848 GGAVIVQLYRNNSIATDLRGNPIPSVLTNVLTGEYLFDNVPILEGYQVMFSSPPPGW 2904
>UniRef100_Q8PSG1 ATP-dependent protease La [Methanosarcina mazei]
Length = 795
Score = 32.3 bits (72), Expect = 6.4
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 14/106 (13%)
Query: 51 LPVTKENLLYSILEGHSVD-----IAAVMSNAIADAIQRDAPVMILPSIITDLCKFHNVP 105
L + K++LL SILE H +D I I D R+A V L + +F +
Sbjct: 502 LAIAKDHLLPSILEDHGLDADKLQIEDEALKVIIDKYTREAGVRGLKKQLAKTARFVSEK 561
Query: 106 ILSGHQEALKSEIDAWYVQIFCQEKSKEELLKEIRKKKEPKHPPAP 151
I+SG ++D YV + KE L KEI +++E + P
Sbjct: 562 IVSG-------KVDLPYV--VKADMLKEILGKEIIRQEEARKENVP 598
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.317 0.132 0.390
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 304,509,415
Number of Sequences: 2790947
Number of extensions: 11582915
Number of successful extensions: 50266
Number of sequences better than 10.0: 26
Number of HSP's better than 10.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 50251
Number of HSP's gapped (non-prelim): 30
length of query: 188
length of database: 848,049,833
effective HSP length: 120
effective length of query: 68
effective length of database: 513,136,193
effective search space: 34893261124
effective search space used: 34893261124
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 71 (32.0 bits)
Lotus: description of TM0043b.1