
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0520.11
(117 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q8GX16 Hypothetical protein [Arabidopsis thaliana] 69 3e-11
UniRef100_Q9FIP8 Arabidopsis thaliana genomic DNA, chromosome 5,... 54 8e-07
UniRef100_Q6ZFJ8 Hypothetical protein OJ1435_F07.27 [Oryza sativa] 36 0.23
UniRef100_UPI000021BABE UPI000021BABE UniRef100 entry 35 0.31
UniRef100_UPI0000467324 UPI0000467324 UniRef100 entry 35 0.40
UniRef100_Q6IJE9 HDC15037 [Drosophila melanogaster] 35 0.40
UniRef100_Q730Z1 Hypothetical protein [Bacillus cereus] 34 0.68
UniRef100_Q5ZI98 Hypothetical protein [Gallus gallus] 34 0.89
UniRef100_UPI000042D4C1 UPI000042D4C1 UniRef100 entry 33 2.0
UniRef100_Q6HDW3 Hypothetical protein [Bacillus thuringiensis] 33 2.0
UniRef100_Q6HTI2 Hypothetical protein [Bacillus anthracis] 33 2.0
UniRef100_Q7Z010 Fibroin heavy chain [Plodia interpunctella] 33 2.0
UniRef100_UPI0000430365 UPI0000430365 UniRef100 entry 32 2.6
UniRef100_UPI0000318D25 UPI0000318D25 UniRef100 entry 32 2.6
UniRef100_UPI000028277D UPI000028277D UniRef100 entry 32 2.6
UniRef100_Q7KSW9 CG33324-PA [Drosophila melanogaster] 32 2.6
UniRef100_Q6IKM3 HDC12113 [Drosophila melanogaster] 32 2.6
UniRef100_Q6H3Y9 Atency associated nuclear antigen-like [Oryza s... 32 3.4
UniRef100_UPI00002AC90B UPI00002AC90B UniRef100 entry 32 4.4
UniRef100_Q8F932 Hypothetical protein [Leptospira interrogans] 32 4.4
>UniRef100_Q8GX16 Hypothetical protein [Arabidopsis thaliana]
Length = 106
Score = 68.6 bits (166), Expect = 3e-11
Identities = 43/112 (38%), Positives = 61/112 (54%), Gaps = 15/112 (13%)
Query: 1 MSLNCLTCSQNLQRTDSFGEFFTEKEYKEVCKKVDRNWSGNLIASSSSSSSTSQCDLPKG 60
MSLNCL C LQRTDS + + K D ++ N S+ ++ LP
Sbjct: 1 MSLNCLAC-HILQRTDSDRDMGSRK---------DSSFKENFATSAFEKMVRNRSSLPV- 49
Query: 61 QGGNVAKIKAEHRRVHSTGNIPYPGSSQPKLVRSSGMRRDWSFENLAENQDQ 112
V ++ HRR++S + Y +PKLVRSSG+RRDWSFE+L +++DQ
Sbjct: 50 ----VRRVNKGHRRLYSADIMVYGELDEPKLVRSSGIRRDWSFEDLKKHKDQ 97
>UniRef100_Q9FIP8 Arabidopsis thaliana genomic DNA, chromosome 5, P1 clone:MZA15
[Arabidopsis thaliana]
Length = 133
Score = 53.9 bits (128), Expect = 8e-07
Identities = 48/131 (36%), Positives = 61/131 (45%), Gaps = 33/131 (25%)
Query: 1 MSLNCLTCSQNLQRTDS------FGEFFTEKEYKEVCKKV-------DRNWSGNLIASSS 47
MSLNCL+C Q L RTDS G E V K RNWSGNL
Sbjct: 1 MSLNCLSC-QALPRTDSNKDVDLSGPGPPRVEINNVLGKTCCVNPIGGRNWSGNL----- 54
Query: 48 SSSSTSQCDLPKGQGGNVAKIKAEHRRVH----------STGNIPYPGSSQPKLVRSSGM 97
S + P G ++A + +++H S N+P QPKLVRS+G+
Sbjct: 55 SPRIYEKIGRP---GSSLAHKMKKVKKIHHVRLSGPVGSSPSNVP-TRPEQPKLVRSTGV 110
Query: 98 RRDWSFENLAE 108
RR+WSFENL +
Sbjct: 111 RRNWSFENLRD 121
>UniRef100_Q6ZFJ8 Hypothetical protein OJ1435_F07.27 [Oryza sativa]
Length = 156
Score = 35.8 bits (81), Expect = 0.23
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 85 GSSQPKLVRSSGMRRDWSFENLAEN 109
G P+L RS G+RRDWSFE+L N
Sbjct: 128 GGPPPRLRRSGGVRRDWSFEDLRAN 152
>UniRef100_UPI000021BABE UPI000021BABE UniRef100 entry
Length = 231
Score = 35.4 bits (80), Expect = 0.31
Identities = 21/61 (34%), Positives = 29/61 (47%), Gaps = 1/61 (1%)
Query: 18 FGEFFTEKEYKEVCKKV-DRNWSGNLIASSSSSSSTSQCDLPKGQGGNVAKIKAEHRRVH 76
F FF K Y + CK + W+GNLIAS + + +G N AKI A + +
Sbjct: 167 FWIFFRYKRYGKSCKNTRNGTWTGNLIASFKGVFRQKEMPFKRNRGINEAKISAPFAQTN 226
Query: 77 S 77
S
Sbjct: 227 S 227
>UniRef100_UPI0000467324 UPI0000467324 UniRef100 entry
Length = 143
Score = 35.0 bits (79), Expect = 0.40
Identities = 24/89 (26%), Positives = 41/89 (45%)
Query: 24 EKEYKEVCKKVDRNWSGNLIASSSSSSSTSQCDLPKGQGGNVAKIKAEHRRVHSTGNIPY 83
+KEYK+ KK + S + +SS SSSS+ K + K K +HR S+ + +
Sbjct: 28 KKEYKDKHKKKSKKDSSSSSSSSYSSSSSESDTKKKKKKKEKKKRKKKHRESTSSSDDTH 87
Query: 84 PGSSQPKLVRSSGMRRDWSFENLAENQDQ 112
S V+ +R S ++ + D+
Sbjct: 88 SSSESSSHVKKKNKKRSLSSDDSDHDNDK 116
>UniRef100_Q6IJE9 HDC15037 [Drosophila melanogaster]
Length = 148
Score = 35.0 bits (79), Expect = 0.40
Identities = 26/81 (32%), Positives = 35/81 (43%), Gaps = 10/81 (12%)
Query: 41 NLIASSSSSSSTSQCDL-----PKGQGGNVAKIKAEHRRVHSTGNIPYPGSSQPKLVRSS 95
N A S SSS C L P G GG ++ HR H+T + P G +P L +S
Sbjct: 51 NTTAHRSCSSSAPLCVLCVIPFPSGTGGLA---ESSHRAAHATPSYPPTGDLRPPLPMTS 107
Query: 96 GMRRDWSFENLAENQDQSVSC 116
W + A+ Q Q + C
Sbjct: 108 STTGHW--KQPAKKQKQQLPC 126
>UniRef100_Q730Z1 Hypothetical protein [Bacillus cereus]
Length = 133
Score = 34.3 bits (77), Expect = 0.68
Identities = 18/64 (28%), Positives = 31/64 (48%)
Query: 15 TDSFGEFFTEKEYKEVCKKVDRNWSGNLIASSSSSSSTSQCDLPKGQGGNVAKIKAEHRR 74
T+S G ++ YK K+ R +A S+S S KG+ GN++ +K + ++
Sbjct: 55 TNSSGSANSQSSYKRAAKQSSRKHGKQNVAPLSNSFFKSNASNDKGKKGNLSSLKRKRKQ 114
Query: 75 VHST 78
H T
Sbjct: 115 SHLT 118
>UniRef100_Q5ZI98 Hypothetical protein [Gallus gallus]
Length = 788
Score = 33.9 bits (76), Expect = 0.89
Identities = 32/110 (29%), Positives = 45/110 (40%), Gaps = 20/110 (18%)
Query: 25 KEYKEVCKKVDRNWSGNLIASSSSSSST---SQCDLPKGQGGNVAKIKAEHRRVHSTGNI 81
KE K + KK R+ S + +SSSSSSST S L + K K +HR +
Sbjct: 463 KEEKRLKKKRKRSTSSSSSSSSSSSSSTDSSSDASLSSSSSSDHKKRKKKHRNRSESSRS 522
Query: 82 PYPGSSQ-----------------PKLVRSSGMRRDWSFENLAENQDQSV 114
SS+ P +S + +++ ENL E QD V
Sbjct: 523 SKKRSSRASSHYKDQIRKEEWYSPPADTSASFLNQNFEMENLLERQDSLV 572
>UniRef100_UPI000042D4C1 UPI000042D4C1 UniRef100 entry
Length = 895
Score = 32.7 bits (73), Expect = 2.0
Identities = 21/72 (29%), Positives = 33/72 (45%), Gaps = 5/72 (6%)
Query: 24 EKEYKEVCKKVDRNWSGNLIASSSSSSSTSQCDLPKGQGGNVAKIKAEHRRVHST---GN 80
E+E + + RN+ G+ +S D+P+G+G + I + H T GN
Sbjct: 33 ERETHPELQGITRNYLGS--DPNSGHVRDQSLDMPEGEGYGMTDIPHNSSKSHLTPADGN 90
Query: 81 IPYPGSSQPKLV 92
PY G+S P V
Sbjct: 91 APYTGASTPNRV 102
>UniRef100_Q6HDW3 Hypothetical protein [Bacillus thuringiensis]
Length = 133
Score = 32.7 bits (73), Expect = 2.0
Identities = 18/64 (28%), Positives = 30/64 (46%)
Query: 15 TDSFGEFFTEKEYKEVCKKVDRNWSGNLIASSSSSSSTSQCDLPKGQGGNVAKIKAEHRR 74
T+S G ++ YK K+ R +A S+S S KG+ GN + +K + ++
Sbjct: 55 TNSSGSANSQSSYKRAAKQSSRKHGKQNVAPLSNSFFKSNTSNDKGKKGNPSSLKRKRKQ 114
Query: 75 VHST 78
H T
Sbjct: 115 SHLT 118
>UniRef100_Q6HTI2 Hypothetical protein [Bacillus anthracis]
Length = 133
Score = 32.7 bits (73), Expect = 2.0
Identities = 18/64 (28%), Positives = 30/64 (46%)
Query: 15 TDSFGEFFTEKEYKEVCKKVDRNWSGNLIASSSSSSSTSQCDLPKGQGGNVAKIKAEHRR 74
T+S G ++ YK K+ R +A S+S S KG+ GN + +K + ++
Sbjct: 55 TNSSGSANSQSSYKRAAKQSSRKHGKQNVAPLSNSFFKSNTSNDKGKKGNPSSLKRKRKQ 114
Query: 75 VHST 78
H T
Sbjct: 115 PHLT 118
>UniRef100_Q7Z010 Fibroin heavy chain [Plodia interpunctella]
Length = 509
Score = 32.7 bits (73), Expect = 2.0
Identities = 21/72 (29%), Positives = 35/72 (48%), Gaps = 1/72 (1%)
Query: 41 NLIASSSSSSSTSQCDLPKGQG-GNVAKIKAEHRRVHSTGNIPYPGSSQPKLVRSSGMRR 99
+L+A ++ S S D G GNV ++K H+ G++P S + KLVR+ +
Sbjct: 24 SLLAGNAREVSESTTDNFTTDGNGNVTEVKTTHKEYRRHGDVPNNISGEDKLVRTFVIET 83
Query: 100 DWSFENLAENQD 111
D S + +D
Sbjct: 84 DASGNEVIYEED 95
>UniRef100_UPI0000430365 UPI0000430365 UniRef100 entry
Length = 4157
Score = 32.3 bits (72), Expect = 2.6
Identities = 29/103 (28%), Positives = 44/103 (42%), Gaps = 18/103 (17%)
Query: 1 MSLNCLTCSQNLQRT------------DSFGEFFTEKEYKEVCKKVDRNWSGNLIASSSS 48
++LN + S NL R+ DSFGEFF EKE + D N + + +
Sbjct: 4056 LALNSIGSSDNLSRSRLSKSTTKEIFIDSFGEFFQEKEIPSGLE--DYNTDAGTVTTDTY 4113
Query: 49 SSS---TSQCDLPKGQGGNV-AKIKAEHRRVHSTGNIPYPGSS 87
S S T+ D G + +++ E RR ++PY S
Sbjct: 4114 SDSDIDTNSIDAYSGDSTDTDNRVQDEIRRGFKLEDLPYDNIS 4156
>UniRef100_UPI0000318D25 UPI0000318D25 UniRef100 entry
Length = 790
Score = 32.3 bits (72), Expect = 2.6
Identities = 24/83 (28%), Positives = 38/83 (44%), Gaps = 4/83 (4%)
Query: 33 KVDRNWSGNLIASSSSSSSTSQCDLPKGQGGNVAKIKAEHRRV-HSTGNIPYPGSSQPKL 91
K R S L SS + + G A+ A+ + H++ NIP P S+P+L
Sbjct: 328 KTKRTGSLMLAVSSPKAIPKQSLKMTPKAGSKTAQSPADGPKTGHNSSNIPAPTGSEPRL 387
Query: 92 VR---SSGMRRDWSFENLAENQD 111
VR S R S ++L+++ D
Sbjct: 388 VRPKLGSSSLRSSSQDSLSQSSD 410
>UniRef100_UPI000028277D UPI000028277D UniRef100 entry
Length = 143
Score = 32.3 bits (72), Expect = 2.6
Identities = 19/80 (23%), Positives = 36/80 (44%), Gaps = 8/80 (10%)
Query: 37 NWSGNLIASSSSSSSTSQCDLPKG--QGGNV------AKIKAEHRRVHSTGNIPYPGSSQ 88
+W LI +S S CD KG + N+ ++ + VH +IP+ G +
Sbjct: 43 SWDSRLIDLKKASESKDYCDDIKGILENNNLQLTELSTHLQGQLVAVHPAYDIPFDGFAD 102
Query: 89 PKLVRSSGMRRDWSFENLAE 108
K+ + R++W+ + L +
Sbjct: 103 EKVRGNPSKRKEWAIDQLMQ 122
>UniRef100_Q7KSW9 CG33324-PA [Drosophila melanogaster]
Length = 360
Score = 32.3 bits (72), Expect = 2.6
Identities = 17/43 (39%), Positives = 22/43 (50%)
Query: 47 SSSSSTSQCDLPKGQGGNVAKIKAEHRRVHSTGNIPYPGSSQP 89
S+SSS+SQ +LP G GG+ H H+ I G S P
Sbjct: 88 STSSSSSQLELPIGSGGSSCNSLYNHHAQHNNIGIGGSGGSHP 130
>UniRef100_Q6IKM3 HDC12113 [Drosophila melanogaster]
Length = 347
Score = 32.3 bits (72), Expect = 2.6
Identities = 17/43 (39%), Positives = 22/43 (50%)
Query: 47 SSSSSTSQCDLPKGQGGNVAKIKAEHRRVHSTGNIPYPGSSQP 89
S+SSS+SQ +LP G GG+ H H+ I G S P
Sbjct: 75 STSSSSSQLELPIGSGGSSCNSLYNHHAQHNNIGIGGSGGSHP 117
>UniRef100_Q6H3Y9 Atency associated nuclear antigen-like [Oryza sativa]
Length = 795
Score = 32.0 bits (71), Expect = 3.4
Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 3/65 (4%)
Query: 32 KKVDRNWSGNLIASSSSSSSTSQCDLPKGQGGNVAKIKAEHRRVHSTGNIPYPGSSQPKL 91
+K++R S + S S +T CD+ +G G + A R+V G+ S P+
Sbjct: 602 RKLNRRQSAH---SDSEEDTTFVCDVKEGSGSRRVQEGAPRRQVKKEGSNKKKDGSTPQC 658
Query: 92 VRSSG 96
VR++G
Sbjct: 659 VRNNG 663
>UniRef100_UPI00002AC90B UPI00002AC90B UniRef100 entry
Length = 288
Score = 31.6 bits (70), Expect = 4.4
Identities = 17/40 (42%), Positives = 26/40 (64%)
Query: 20 EFFTEKEYKEVCKKVDRNWSGNLIASSSSSSSTSQCDLPK 59
E F K+ K+ KKV++ + G+ +SSSSSSS+S + K
Sbjct: 19 EPFKWKKVKKAVKKVEKTFFGSSSSSSSSSSSSSSSSVVK 58
>UniRef100_Q8F932 Hypothetical protein [Leptospira interrogans]
Length = 553
Score = 31.6 bits (70), Expect = 4.4
Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 24 EKEYKEVCKKVDRNWSGNLIASSSSSSSTSQCDLPKGQGGN 64
EKE KEV KK + + N +SSSSSSS+S+ D + G+
Sbjct: 300 EKE-KEVVKKDEERKNNNSSSSSSSSSSSSKSDSSSSKSGS 339
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.311 0.125 0.374
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 201,508,719
Number of Sequences: 2790947
Number of extensions: 7429393
Number of successful extensions: 19469
Number of sequences better than 10.0: 46
Number of HSP's better than 10.0 without gapping: 19
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 19438
Number of HSP's gapped (non-prelim): 54
length of query: 117
length of database: 848,049,833
effective HSP length: 93
effective length of query: 24
effective length of database: 588,491,762
effective search space: 14123802288
effective search space used: 14123802288
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 67 (30.4 bits)
Lotus: description of TM0520.11