
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0010.21
(161 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits), E Value
YADR_ECOLI (P37026) Hypothetical protein yadR 90 2e-18
YI57_AQUAE (O67709) Protein AQ_1857 88 1e-17
YADR_HAEIN (P45344) Protein HI1723 88 1e-17
YG67_XYLFT (P64343) Hypothetical protein PD1667 84 2e-16
Y405_XYLFA (P64342) Hypothetical protein Xf0405 84 2e-16
Y684_HAEDU (Q9X4A0) Hypothetical protein HD0684 83 3e-16
Y205_BUCAP (O51930) Hypothetical protein BUsg205 80 2e-15
Y211_BUCAI (P57307) Hypothetical protein BU211 80 3e-15
YM04_MYCTU (Q10393) Protein Rv2204c/MT2260/Mb2227c 78 8e-15
Y063_RICPR (Q9ZE83) Hypothetical protein RP063 74 1e-13
YUTM_BACSU (O32113) Hypothetical protein yutM 73 3e-13
YNIU_RHOSH (Q01195) Hypothetical 10.8 kDa protein in nifU 5'regi... 71 1e-12
YFHF_HAEIN (P44672) Hypothetical protein HI0376 70 3e-12
Y4VC_RHISN (Q53211) Hypothetical 11.0 kDa protein y4vC 70 3e-12
Y193_BUCBP (Q89AQ6) Hypothetical protein bbp193 70 3e-12
HESB_ANASP (P18501) Protein hesB 69 5e-12
YH55_BRAJA (P37029) Hypothetical protein blr1755 68 8e-12
HEB1_ANAVA (P46051) Protein hesB, heterocyst 68 8e-12
YFHF_ECOLI (P36539) Protein yfhF 68 1e-11
YNIU_AZOVI (Q44540) Hypothetical 11.0 kDa protein in nifU 5'regi... 67 1e-11
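
The summary lines above share one layout: entry name, accession in parentheses, free-text description, bit score, E-value. A minimal Python sketch for reading one such line follows. The `Hit` type and the `parse_hit_line` helper are hypothetical names introduced here, not part of BLAST. The handling of E-values written without a leading digit (for example `e-100`) is an assumption about older BLAST output, since no such value occurs in this report.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the 'significant alignments' summary (hypothetical type)."""
    entry: str
    accession: str
    description: str
    bits: float
    evalue: float


def parse_hit_line(line: str) -> Hit:
    """Split a summary line into its five fields.

    The last two whitespace-separated tokens are the bit score and the
    E-value; everything before them is 'ENTRY (ACCESSION) description'.
    """
    head, bits, evalue = line.rsplit(None, 2)
    entry, rest = head.split(None, 1)
    accession, _, description = rest.partition(") ")
    # Assumption: an E-value printed as 'e-100' means '1e-100'.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return Hit(entry, accession.lstrip("("), description.strip(),
               float(bits), float(evalue))


if __name__ == "__main__":
    print(parse_hit_line(
        "YADR_ECOLI (P37026) Hypothetical protein yadR 90 2e-18"))
```

Descriptions that BLAST truncated with `...` are kept exactly as printed; the sketch does not try to restore them.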
>YADR_ECOLI (P37026) Hypothetical protein yadR
Length = 114
Score = 90.1 bits (222), Expect = 2e-18
Identities = 47/109 (43%), Positives = 65/109 (59%), Gaps = 2/109 (1%)
Query: 51 PVHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREG 110
P+ T+ ++K L A E K LR+ + GGCSGFQY F DD++N D E++G
Sbjct: 8 PLEFTDAAANKVKSLIADEDNPNLK-LRVYITGGGCSGFQYGFTFDDQVNEGDMTIEKQG 66
Query: 111 IKLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSFMV 159
+ LVVD +S ++ G +VDY E L S F+VT NP+A C C SSF +
Sbjct: 67 VGLVVDPMSLQYLVGGSVDYTEGLEGSRFIVT-NPNAKSTCGCGSSFSI 114
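
The score line of this alignment gives both a bit score (90.1) and a raw score (222). As a rough check, the sketch below converts raw scores to bits with the usual Karlin-Altschul relation S' = (Lambda * S - ln K) / ln 2 and estimates E as search space * 2^(-S'). It assumes the gapped Lambda and K and the effective search space printed in the statistics at the end of this report. Those printed values are rounded, so the results are approximate, and the function names are illustrative only.

```python
import math

# Values copied from the "Gapped" statistics and search-space lines
# at the end of this report (rounded as printed there).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 2_603_385_180


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(raw_score: int) -> float:
    """Estimate the E-value for a raw score over the whole search space."""
    return EFFECTIVE_SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    for raw in (222, 216, 163):
        print(raw, round(bit_score(raw), 1), f"{expect(raw):.0e}")
```

For the raw scores 222, 216 and 163 this gives bit scores that round to the 90.1, 87.8 and 67.4 printed in the report, which suggests the gapped parameters are the ones in use for these alignments.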
>YI57_AQUAE (O67709) Protein AQ_1857
Length = 116
Score = 87.8 bits (216), Expect = 1e-17
Identities = 43/104 (41%), Positives = 62/104 (59%), Gaps = 2/104 (1%)
Query: 54 MTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREGIKL 113
+T+ + +K++ A E+ LR+ V GGCSGFQYA DD + D VFE +G+K+
Sbjct: 12 VTDKAVEEIKKV-AQENNIENPILRIRVVPGGCSGFQYAMGFDDTVEEGDHVFEYDGVKV 70
Query: 114 VVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSF 157
V+D S +V GA +DYV + + F + NP+A G C C SSF
Sbjct: 71 VIDPFSMPYVNGAELDYVVDFMGGGFTI-RNPNATGSCGCGSSF 113
>YADR_HAEIN (P45344) Protein HI1723
Length = 114
Score = 87.8 bits (216), Expect = 1e-17
Identities = 45/109 (41%), Positives = 64/109 (58%), Gaps = 2/109 (1%)
Query: 51 PVHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREG 110
P+ T+ ++K L + E T K LR+ + GGCSGFQY F D+++N D E+ G
Sbjct: 8 PLTFTDAAANKVKSLISEEENTDLK-LRVYITGGGCSGFQYGFTFDEKVNDGDLTIEKSG 66
Query: 111 IKLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSFMV 159
++LV+D +S ++ G TVDY E L S F V NP+A C C SSF +
Sbjct: 67 VQLVIDPMSLQYLIGGTVDYTEGLEGSRFTV-NNPNATSTCGCGSSFSI 114
>YG67_XYLFT (P64343) Hypothetical protein PD1667
Length = 128
Score = 83.6 bits (205), Expect = 2e-16
Identities = 44/109 (40%), Positives = 64/109 (58%), Gaps = 2/109 (1%)
Query: 51 PVHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREG 110
P++ T +++EL E+ A LR+ ++ GGCSGFQY F+ D+ DD E +G
Sbjct: 22 PLNFTMAAAAKVRELIQEEN-NADLALRVYIQGGGCSGFQYGFEFDENRADDDLALETDG 80
Query: 111 IKLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSFMV 159
+ L+VD +S ++ GA VDY E L + FV+ NP+A C C SSF V
Sbjct: 81 VVLLVDPLSLQYLLGAEVDYTESLTGAKFVI-RNPNAKTTCGCGSSFSV 128
>Y405_XYLFA (P64342) Hypothetical protein Xf0405
Length = 128
Score = 83.6 bits (205), Expect = 2e-16
Identities = 44/109 (40%), Positives = 64/109 (58%), Gaps = 2/109 (1%)
Query: 51 PVHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREG 110
P++ T +++EL E+ A LR+ ++ GGCSGFQY F+ D+ DD E +G
Sbjct: 22 PLNFTMAAAAKVRELIQEEN-NADLALRVYIQGGGCSGFQYGFEFDENRADDDLALETDG 80
Query: 111 IKLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSFMV 159
+ L+VD +S ++ GA VDY E L + FV+ NP+A C C SSF V
Sbjct: 81 VVLLVDPLSLQYLLGAEVDYTESLTGAKFVI-RNPNAKTTCGCGSSFSV 128
>Y684_HAEDU (Q9X4A0) Hypothetical protein HD0684
Length = 114
Score = 82.8 bits (203), Expect = 3e-16
Identities = 44/109 (40%), Positives = 63/109 (57%), Gaps = 2/109 (1%)
Query: 51 PVHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREG 110
P+ T+ +++K L E + LR+ + GGCSGFQY F DD+IN D E +
Sbjct: 8 PLTFTDAAAKKVKSLIEGEDNPNLR-LRVYITGGGCSGFQYGFTFDDKINEGDLTIENQN 66
Query: 111 IKLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSFMV 159
+ L+VD +S ++ G +VDY E L S FVV +NP+A C C SSF +
Sbjct: 67 VGLIVDPMSLQYLIGGSVDYTEGLDGSRFVV-QNPNASSTCGCGSSFSI 114
>Y205_BUCAP (O51930) Hypothetical protein BUsg205
Length = 114
Score = 80.1 bits (196), Expect = 2e-15
Identities = 41/111 (36%), Positives = 65/111 (57%), Gaps = 2/111 (1%)
Query: 49 VEPVHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFER 108
++ + T + +++K + E LR+ + GGCSGFQY F D++IN DD + ++
Sbjct: 6 IKYIEFTNSAAKKIKSI-IKEKKNKNVKLRIYIIGGGCSGFQYQFIFDEKINEDDILVKK 64
Query: 109 EGIKLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSFMV 159
I LV+D IS ++ G T+DY+E L S F+V+ NP+A C C SF +
Sbjct: 65 LNICLVIDPISLQYLHGGTIDYLENLEGSKFIVS-NPNAKNTCGCGLSFSI 114
>Y211_BUCAI (P57307) Hypothetical protein BU211
Length = 114
Score = 79.7 bits (195), Expect = 3e-15
Identities = 42/108 (38%), Positives = 61/108 (55%), Gaps = 2/108 (1%)
Query: 52 VHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREGI 111
+ TE ++++K L E K LR+ + GGCSGFQY F D IN DD + + +
Sbjct: 9 LQFTEKAIKKIKNLIEIEKNHDLK-LRIYINGGGCSGFQYQFIFDTSINEDDIIITQSEV 67
Query: 112 KLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSFMV 159
L++D IS ++ G +DY+E L S F+V NP+A C C SSF +
Sbjct: 68 SLIIDPISLQYLYGGQIDYLENLEGSKFIV-YNPNAKNTCGCGSSFSI 114
>YM04_MYCTU (Q10393) Protein Rv2204c/MT2260/Mb2227c
Length = 118
Score = 78.2 bits (191), Expect = 8e-15
Identities = 40/118 (33%), Positives = 66/118 (55%), Gaps = 2/118 (1%)
Query: 40 TTSSSSSSHVEPVHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRI 99
T + S+ V +TE + K L E LR++V+ GGC+G +Y DDR
Sbjct: 2 TVQNEPSAKTHGVILTEAAAAKAKSLLDQEGRD-DLALRIAVQPGGCAGLRYNLFFDDRT 60
Query: 100 NSDDRVFEREGIKLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSF 157
D+ E G++L+VD +S +V+GA++D+V+ + + F + +NP+A G C+C SF
Sbjct: 61 LDGDQTAEFGGVRLIVDRMSAPYVEGASIDFVDTIEKQGFTI-DNPNATGSCACGDSF 117
>Y063_RICPR (Q9ZE83) Hypothetical protein RP063
Length = 110
Score = 74.3 bits (181), Expect = 1e-13
Identities = 43/110 (39%), Positives = 62/110 (56%), Gaps = 4/110 (3%)
Query: 52 VHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDL--DDRINSDDRVFERE 109
+ +T+ R+ EL E LR+SV++GGCSG Y ++L D I DD VF R
Sbjct: 3 ITITDRAFERIYELIELEK-DKNLVLRVSVDSGGCSGLMYNYELVSKDNIEKDDYVFTRH 61
Query: 110 GIKLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSFMV 159
+++D+IS F+ T+D++EEL S F V+ NP A C C +SF V
Sbjct: 62 NATIIIDSISQKFMLNCTLDFIEELGSSYFNVS-NPQAKAKCGCGNSFSV 110
>YUTM_BACSU (O32113) Hypothetical protein yutM
Length = 120
Score = 72.8 bits (177), Expect = 3e-13
Identities = 36/107 (33%), Positives = 57/107 (52%), Gaps = 2/107 (1%)
Query: 51 PVHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREG 110
PV +TE +K++ E LR+ V+ GGCSG Y + + D VF++ G
Sbjct: 4 PVTITEAAALHIKDM-MKEHEEENAFLRVGVKGGGCSGLSYGMGFEHEKSESDSVFDQHG 62
Query: 111 IKLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSF 157
I ++VD S D + G +DY + ++ F + +NP+A+ C C SSF
Sbjct: 63 ITVLVDKESLDIMNGTVIDYKQSMLGGGFTI-DNPNAIASCGCGSSF 108
>YNIU_RHOSH (Q01195) Hypothetical 10.8 kDa protein in nifU 5'region
(ORF 1)
Length = 106
Score = 70.9 bits (172), Expect = 1e-12
Identities = 38/92 (41%), Positives = 55/92 (59%), Gaps = 2/92 (2%)
Query: 67 ASESATAPKT-LRLSVETGGCSGFQYAFDLDDRINSDDRVFEREGIKLVVDNISYDFVKG 125
A E A P LRL V++GGC+G +Y L+ DD V E EG+++++D S ++ G
Sbjct: 15 AIEGAGQPVAGLRLMVQSGGCAGLKYGMSLELTEAPDDLVVEAEGLRVLIDPQSGTYLNG 74
Query: 126 ATVDYVEELIRSAFVVTENPSAVGGCSCKSSF 157
T+D+V L + FV +NP+A GGC C SF
Sbjct: 75 VTIDFVTSLEGTGFVF-DNPNAKGGCGCGKSF 105
>YFHF_HAEIN (P44672) Hypothetical protein HI0376
Length = 107
Score = 69.7 bits (169), Expect = 3e-12
Identities = 34/83 (40%), Positives = 49/83 (58%), Gaps = 1/83 (1%)
Query: 77 LRLSVETGGCSGFQYAFDLDDRINSDDRVFEREGIKLVVDNISYDFVKGATVDYVEELIR 136
LRL V+T GCSG Y + D +NS+D+VFE+ G+ ++VD S ++ G +DYV+E +
Sbjct: 26 LRLGVKTSGCSGLAYVLEFVDVLNSEDQVFEQYGVNIIVDPKSLVYLNGIELDYVKEGLN 85
Query: 137 SAFVVTENPSAVGGCSCKSSFMV 159
F NP+ C C SF V
Sbjct: 86 EGFKY-NNPNVKESCGCGESFHV 107
>Y4VC_RHISN (Q53211) Hypothetical 11.0 kDa protein y4vC
Length = 106
Score = 69.7 bits (169), Expect = 3e-12
Identities = 38/106 (35%), Positives = 57/106 (52%), Gaps = 2/106 (1%)
Query: 52 VHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREGI 111
+ +T++ + +K S++ LR+ VE GGCSGF+Y LD D V E G+
Sbjct: 2 ITLTDSAIAAIK-FALSQTCEPADGLRIKVEAGGCSGFKYHLGLDSESRDGDAVIEAGGV 60
Query: 112 KLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSF 157
K+ VD+ S V G TVD+ + SA + +NP+A C+C SF
Sbjct: 61 KVYVDSASQPHVSGMTVDFTTG-VDSAGFIFDNPNARENCACGKSF 105
>Y193_BUCBP (Q89AQ6) Hypothetical protein bbp193
Length = 115
Score = 69.7 bits (169), Expect = 3e-12
Identities = 35/109 (32%), Positives = 59/109 (54%), Gaps = 2/109 (1%)
Query: 51 PVHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREG 110
P+ +++ ++++K + SE R+ + GGCSGFQY F D N +D +
Sbjct: 9 PLSFSKSAIKKIKTI-ISEKNIPNLKFRVYIAGGGCSGFQYKFKFDKNKNKNDTIVNIFN 67
Query: 111 IKLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSFMV 159
+++D IS +++G +DY+E L S F++ NP A C C SSF +
Sbjct: 68 NIIIIDPISLQYLRGGQIDYIENLEGSKFIIL-NPKAKHTCGCGSSFSI 115
>HESB_ANASP (P18501) Protein hesB
Length = 123
Score = 68.9 bits (167), Expect = 5e-12
Identities = 36/101 (35%), Positives = 55/101 (53%), Gaps = 1/101 (0%)
Query: 59 LRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREGIKLVVDNI 118
LR A ++ K +R+SV+ GGCSG++Y D+ + DD V ++ + + VD
Sbjct: 13 LRAFLRGSAKDANETTKGIRVSVKDGGCSGYEYLMDVTSQPQPDDLVTQQGSVLVYVDAK 72
Query: 119 SYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSFMV 159
S ++G +D+VE L+ S F T NP+A C C SF V
Sbjct: 73 SAPLLEGIVIDFVEGLVESGFKFT-NPNATSTCGCGKSFKV 112
>YH55_BRAJA (P37029) Hypothetical protein blr1755
Length = 106
Score = 68.2 bits (165), Expect = 8e-12
Identities = 36/106 (33%), Positives = 59/106 (54%), Gaps = 2/106 (1%)
Query: 52 VHMTENCLRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREGI 111
+++T++ + +K +S A LR+ +E GGC+GF+Y + D DD V + +G+
Sbjct: 2 INLTDSAVNAIKSAISSSERRAGG-LRVMIEAGGCNGFKYKMGIADEPKPDDTVIDCDGL 60
Query: 112 KLVVDNISYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSF 157
K+ VD+ S + + G T+D+V S F NP+A CSC SF
Sbjct: 61 KVFVDSKSREHLAGTTIDFVLAPESSGFTF-HNPNAATNCSCGKSF 105
>HEB1_ANAVA (P46051) Protein hesB, heterocyst
Length = 123
Score = 68.2 bits (165), Expect = 8e-12
Identities = 35/99 (35%), Positives = 54/99 (54%), Gaps = 1/99 (1%)
Query: 59 LRRMKELEASESATAPKTLRLSVETGGCSGFQYAFDLDDRINSDDRVFEREGIKLVVDNI 118
LR A ++ K +R+SV+ GGCSG++Y D+ + DD V ++ + + VD
Sbjct: 13 LRAFLRGSAKDANETTKGIRISVKDGGCSGYEYLMDVTSQPQPDDLVSQQGSVLVYVDAK 72
Query: 119 SYDFVKGATVDYVEELIRSAFVVTENPSAVGGCSCKSSF 157
S ++G +D+VE L+ S F T NP+A C C SF
Sbjct: 73 SAPLLEGIVIDFVEGLVESGFKFT-NPNATSTCGCGKSF 110
>YFHF_ECOLI (P36539) Protein yfhF
Length = 107
Score = 67.8 bits (164), Expect = 1e-11
Identities = 35/83 (42%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 77 LRLSVETGGCSGFQYAFDLDDRINSDDRVFEREGIKLVVDNISYDFVKGATVDYVEELIR 136
LRL V T GCSG Y + D +D VFE +G+K+VVD S F+ G +D+V+E +
Sbjct: 26 LRLGVRTSGCSGMAYVLEFVDEPTPEDIVFEDKGVKVVVDGKSLQFLDGTQLDFVKEGLN 85
Query: 137 SAFVVTENPSAVGGCSCKSSFMV 159
F T NP+ C C SF V
Sbjct: 86 EGFKFT-NPNVKDECGCGESFHV 107
>YNIU_AZOVI (Q44540) Hypothetical 11.0 kDa protein in nifU 5'region
(ORF6)
Length = 107
Score = 67.4 bits (163), Expect = 1e-11
Identities = 32/81 (39%), Positives = 49/81 (59%), Gaps = 1/81 (1%)
Query: 77 LRLSVETGGCSGFQYAFDLDDRINSDDRVFEREGIKLVVDNISYDFVKGATVDYVEELIR 136
LR+ VE GGCSG +Y+ L++ DD++ + +GI L++D+ S + G T+D+VE +
Sbjct: 26 LRIRVEGGGCSGLKYSLKLEEAGAEDDQLVDCDGITLLIDSASAPLLDGVTMDFVESMEG 85
Query: 137 SAFVVTENPSAVGGCSCKSSF 157
S F NP+A C C SF
Sbjct: 86 SGFTFV-NPNATNSCGCGKSF 105
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.314 0.126 0.343
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,944,207
Number of Sequences: 164201
Number of extensions: 575301
Number of successful extensions: 4165
Number of sequences better than 10.0: 87
Number of HSP's better than 10.0 without gapping: 63
Number of HSP's successfully gapped in prelim test: 25
Number of HSP's that attempted gapping in prelim test: 3534
Number of HSP's gapped (non-prelim): 286
length of query: 161
length of database: 59,974,054
effective HSP length: 101
effective length of query: 60
effective length of database: 43,389,753
effective search space: 2603385180
effective search space used: 2603385180
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (28.1 bits)
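
The length figures in the statistics above are consistent with a simple edge correction: the effective HSP length is subtracted once from the query and once per database sequence, and the search space is the product of the two effective lengths. The sketch below reproduces that arithmetic from the numbers printed in this report. The function name is hypothetical, and a real BLAST implementation may clamp very short effective lengths, which this sketch ignores.

```python
from typing import Tuple

# Figures copied from the statistics block of this report.
QUERY_LENGTH = 161
DB_LETTERS = 59_974_054
DB_SEQUENCES = 164_201
EFFECTIVE_HSP_LENGTH = 101


def effective_search_space(query_len: int, db_letters: int,
                           db_seqs: int, hsp_len: int) -> Tuple[int, int, int]:
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # One HSP length is removed for every sequence in the database.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


if __name__ == "__main__":
    print(effective_search_space(QUERY_LENGTH, DB_LETTERS,
                                 DB_SEQUENCES, EFFECTIVE_HSP_LENGTH))
```

With the values above the result is (60, 43389753, 2603385180), matching the effective query length, effective database length and effective search space reported.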
Lotus: description of TM0010.21