
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0309.4
(116 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q9LU97 Similarity to transcription-associated zinc rib... 140 7e-33
UniRef100_Q9AUR4 Putative RNA polymerase I subunit [Oryza sativa] 134 4e-31
UniRef100_Q9LNT2 T20H2.16 protein [Arabidopsis thaliana] 109 1e-23
UniRef100_Q5TM50 Zinc ribbon domain containing, 1 [Macaca mulatta] 82 3e-15
UniRef100_UPI000036D5F7 UPI000036D5F7 UniRef100 entry 81 5e-15
UniRef100_Q9P1U0 Transcription-associated zinc ribbon protein [H... 81 5e-15
UniRef100_Q6MFY5 Zinc ribbon domain containing, 1 [Rattus norveg... 79 2e-14
UniRef100_Q8GW94 Hypothetical protein [Arabidopsis thaliana] 78 5e-14
UniRef100_Q791N7 Nuclear RNA polymerase I small specific subunit... 77 9e-14
UniRef100_O94703 DNA-directed RNA polymerase I 13.1 kDa polypept... 76 2e-13
UniRef100_Q753A7 DNA-directed RNA polymerase [Ashbya gossypii] 73 2e-12
UniRef100_UPI00002BAA34 UPI00002BAA34 UniRef100 entry 72 4e-12
UniRef100_UPI000042BEF1 UPI000042BEF1 UniRef100 entry 70 1e-11
UniRef100_Q61W07 Hypothetical protein CBG04618 [Caenorhabditis b... 70 1e-11
UniRef100_Q6BKA5 DNA-directed RNA polymerase [Debaryomyces hanse... 70 1e-11
UniRef100_Q7PR45 ENSANGP00000015033 [Anopheles gambiae str. PEST] 69 3e-11
UniRef100_UPI00003C229A UPI00003C229A UniRef100 entry 68 4e-11
UniRef100_O17587 Hypothetical protein C15H11.8 [Caenorhabditis e... 68 4e-11
UniRef100_Q8SRA1 DNA-DIRECTED RNA POLYMERASE I SUBUNIT M [Enceph... 68 4e-11
UniRef100_P32529 DNA-directed RNA polymerase I 13.7 kDa polypept... 65 3e-10
>UniRef100_Q9LU97 Similarity to transcription-associated zinc ribbon protein
[Arabidopsis thaliana]
Length = 119
Score = 140 bits (353), Expect = 7e-33
Identities = 67/118 (56%), Positives = 81/118 (67%), Gaps = 3/118 (2%)
Query: 1 MAYSRERDFLFCHYCGTMLTVPSVNYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELG 60
M SRE +FLFC+ CGTML + S YAECP CK N KDI KEI+YT+SAE+IRRELG
Sbjct: 1 MEKSRESEFLFCNLCGTMLVLKSTKYAECPHCKTTRNAKDIIDKEIAYTVSAEDIRRELG 60
Query: 61 IETIEEQ---KVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQE 115
I E+ + +L K+ K CEKC + E + TRQ RSADEGQTT+Y C C H+F E
Sbjct: 61 ISLFGEKTQAEAELPKIKKACEKCQHPELVYTTRQTRSADEGQTTYYTCPNCAHRFTE 118
>UniRef100_Q9AUR4 Putative RNA polymerase I subunit [Oryza sativa]
Length = 126
Score = 134 bits (338), Expect = 4e-31
Identities = 66/125 (52%), Positives = 84/125 (66%), Gaps = 9/125 (7%)
Query: 1 MAYSRERDFLFCHYCGTMLTVPSVNYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELG 60
MA+ + RDFLFC CGT+L SV A CPLC + KDIEGKE YT++AE+IRREL
Sbjct: 1 MAFWQARDFLFCGVCGTLLKFDSVRSASCPLCGFKRKAKDIEGKETRYTVTAEDIRRELK 60
Query: 61 IE-------TIEEQK--VQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGH 111
++ T++E+ V+ + VNK CEKC N E +YT+Q+RSADEGQT FY C C H
Sbjct: 61 LDPYVILETTLKEEDVIVERATVNKECEKCKNPELQYYTKQLRSADEGQTVFYKCANCRH 120
Query: 112 QFQEN 116
+F EN
Sbjct: 121 EFNEN 125
>UniRef100_Q9LNT2 T20H2.16 protein [Arabidopsis thaliana]
Length = 122
Score = 109 bits (273), Expect = 1e-23
Identities = 53/115 (46%), Positives = 66/115 (57%), Gaps = 20/115 (17%)
Query: 1 MAYSRERDFLFCHYCGTMLTVPSVNYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELG 60
M SRE D LFC+ CGTML + S YAECPLC+ N K+I K ++YT++ E
Sbjct: 27 MEKSRESDILFCNLCGTMLVLKSTKYAECPLCETTRNGKEIIDKNLAYTVTTE------- 79
Query: 61 IETIEEQKVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQE 115
+ K CEKC + E + TRQ RSADEGQTT+Y C CGH+F E
Sbjct: 80 -------------IKKACEKCQHPELVYTTRQTRSADEGQTTYYTCPNCGHRFTE 121
>UniRef100_Q5TM50 Zinc ribbon domain containing, 1 [Macaca mulatta]
Length = 126
Score = 82.0 bits (201), Expect = 3e-15
Identities = 40/114 (35%), Positives = 63/114 (55%), Gaps = 1/114 (0%)
Query: 4 SRERDFLFCHYCGTMLTVPSV-NYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELGIE 62
S + D FC CG++L +P + C C N++D EGK + ++ ++ + +
Sbjct: 12 SFQSDLDFCSDCGSVLPLPGAQDTVTCTRCGFNINVRDFEGKVVKTSVVFHQLGTAMPMS 71
Query: 63 TIEEQKVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
E + Q V++ C +CG+ ++TRQMRSADEGQT FY CT C Q +E+
Sbjct: 72 VEEGPECQGPVVDRRCPRCGHEGMAYHTRQMRSADEGQTVFYTCTNCKFQEKED 125
>UniRef100_UPI000036D5F7 UPI000036D5F7 UniRef100 entry
Length = 163
Score = 81.3 bits (199), Expect = 5e-15
Identities = 40/114 (35%), Positives = 63/114 (55%), Gaps = 1/114 (0%)
Query: 4 SRERDFLFCHYCGTMLTVPSV-NYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELGIE 62
S + D FC CG++L +P + C C N++D EGK + ++ ++ + +
Sbjct: 49 SFQSDLDFCSDCGSVLPLPGAQDTVTCIRCGFNINVRDFEGKVVKTSVVFHQLGTAMPMS 108
Query: 63 TIEEQKVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
E + Q V++ C +CG+ ++TRQMRSADEGQT FY CT C Q +E+
Sbjct: 109 VEEGPECQGPVVDRRCPRCGHEGMAYHTRQMRSADEGQTVFYTCTNCKFQEKED 162
>UniRef100_Q9P1U0 Transcription-associated zinc ribbon protein [Homo sapiens]
Length = 126
Score = 81.3 bits (199), Expect = 5e-15
Identities = 40/114 (35%), Positives = 63/114 (55%), Gaps = 1/114 (0%)
Query: 4 SRERDFLFCHYCGTMLTVPSV-NYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELGIE 62
S + D FC CG++L +P + C C N++D EGK + ++ ++ + +
Sbjct: 12 SFQSDLDFCSDCGSVLPLPGAQDTVTCIRCGFNINVRDFEGKVVKTSVVFHQLGTAMPMS 71
Query: 63 TIEEQKVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
E + Q V++ C +CG+ ++TRQMRSADEGQT FY CT C Q +E+
Sbjct: 72 VEEGPECQGPVVDRRCPRCGHEGMAYHTRQMRSADEGQTVFYTCTNCKFQEKED 125
>UniRef100_Q6MFY5 Zinc ribbon domain containing, 1 [Rattus norvegicus]
Length = 123
Score = 79.0 bits (193), Expect = 2e-14
Identities = 39/112 (34%), Positives = 62/112 (54%), Gaps = 1/112 (0%)
Query: 6 ERDFLFCHYCGTMLTVPSV-NYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELGIETI 64
+ D FC CG++L +P V + CP C +++D GK + ++ ++ + +
Sbjct: 11 QSDLDFCPDCGSVLPLPGVQDTVICPRCGFSIDVRDFGGKVVKTSVVFNKLGTVIPMSVD 70
Query: 65 EEQKVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
E + Q V++ C +CG+ +YTRQMRSADEGQT FY C C Q +E+
Sbjct: 71 EGPESQGPVVDRRCSRCGHEGMAYYTRQMRSADEGQTVFYTCINCKFQEKED 122
>UniRef100_Q8GW94 Hypothetical protein [Arabidopsis thaliana]
Length = 63
Score = 77.8 bits (190), Expect = 5e-14
Identities = 35/61 (57%), Positives = 44/61 (71%)
Query: 1 MAYSRERDFLFCHYCGTMLTVPSVNYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELG 60
M SRE D LFC+ CGTML + S YAECPLC+ N K+I K ++YT++ E+IRRELG
Sbjct: 1 MEKSRESDILFCNLCGTMLVLKSTKYAECPLCETTRNGKEIIDKNLAYTVTTEDIRRELG 60
Query: 61 I 61
I
Sbjct: 61 I 61
>UniRef100_Q791N7 Nuclear RNA polymerase I small specific subunit Rpa12 [Mus
musculus]
Length = 123
Score = 77.0 bits (188), Expect = 9e-14
Identities = 36/112 (32%), Positives = 63/112 (56%), Gaps = 1/112 (0%)
Query: 6 ERDFLFCHYCGTMLTVPSV-NYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELGIETI 64
+ D FC CG++L +P + + C C +++D EGK + ++ ++ + +
Sbjct: 11 QSDLDFCPDCGSVLPLPGIQDTVICSRCGFSIDVRDCEGKVVKTSVVFNKLGATIPLSVD 70
Query: 65 EEQKVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
E ++Q +++ C +CG+ ++TRQMRSADEGQT FY C C Q +E+
Sbjct: 71 EGPELQGPVIDRRCPRCGHEGMAYHTRQMRSADEGQTVFYTCINCKFQEKED 122
>UniRef100_O94703 DNA-directed RNA polymerase I 13.1 kDa polypeptide
[Schizosaccharomyces pombe]
Length = 119
Score = 76.3 bits (186), Expect = 2e-13
Identities = 37/111 (33%), Positives = 54/111 (48%), Gaps = 4/111 (3%)
Query: 10 LFCHYCGTMLTVPSVNYAECPLCKNRSNIKDIEGKEISYTISAEEIRREL----GIETIE 65
+FC CG +L + + C C++ + + SA L I +E
Sbjct: 8 IFCSECGNLLESTTAQWTTCDQCQSVYPSEQFANLVVETKSSASAFPSALKLKHSIVQVE 67
Query: 66 EQKVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
QK + + + + C KCGN TF+T Q+RSADEG T FY C RC ++F N
Sbjct: 68 SQKEEAATIEEKCPKCGNDHMTFHTLQLRSADEGSTVFYECPRCAYKFSTN 118
>UniRef100_Q753A7 DNA-directed RNA polymerase [Ashbya gossypii]
Length = 125
Score = 72.8 bits (177), Expect = 2e-12
Identities = 39/117 (33%), Positives = 64/117 (54%), Gaps = 10/117 (8%)
Query: 10 LFCHYCGTMLTVPSV---NYAECPLCKNR------SNIKDIEGKEISYTISAEEIRRELG 60
+FC YCG +L PS ++ C LC + SN+K + SA +R +
Sbjct: 8 IFCVYCGNLLDNPSAVAGDHVACALCDAQYDKATFSNLKVVTATADDAFPSALRAKRSVV 67
Query: 61 IETIEEQKVQL-SKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
T+ + +++ + + + C +CG+ E ++T Q+RSADEG T FY CT CG++F+ N
Sbjct: 68 KTTLRKGELEDGATIREKCPQCGHDEMQYHTLQLRSADEGATVFYTCTSCGYRFRTN 124
>UniRef100_UPI00002BAA34 UPI00002BAA34 UniRef100 entry
Length = 147
Score = 71.6 bits (174), Expect = 4e-12
Identities = 33/110 (30%), Positives = 55/110 (50%), Gaps = 1/110 (0%)
Query: 8 DFLFCHYCGTMLTVPSV-NYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELGIETIEE 66
D FC CG +L P + + CP C + + +G++I T+ + + E+
Sbjct: 19 DLNFCPECGNILPPPGLQDTVRCPRCSFSIPVAEFDGQQIRSTVVLNPAEKSAAVVEEED 78
Query: 67 QKVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
+Q +++ C +C ++TRQMRSADEGQT F+ C C + E+
Sbjct: 79 SDLQGPVIDRRCVRCNKEGMVYHTRQMRSADEGQTVFFTCVHCRRRTLES 128
>UniRef100_UPI000042BEF1 UPI000042BEF1 UniRef100 entry
Length = 123
Score = 70.1 bits (170), Expect = 1e-11
Identities = 35/115 (30%), Positives = 64/115 (55%), Gaps = 8/115 (6%)
Query: 10 LFCHYCGTML-TVPSVNYAECPLCKNR------SNIKDI-EGKEISYTISAEEIRRELGI 61
+FC+YCG +L + S + +C +C +N+K + + + ++ + R +
Sbjct: 8 IFCNYCGNLLDSHSSSSEIKCTVCSAAYPKSKFANLKVVTKSSDDAFPSKLKSARSVVKT 67
Query: 62 ETIEEQKVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
+++ + + + + C KCGN E ++T Q+RSADEG T FY CT CG++F+ N
Sbjct: 68 SLKKDELDEGATIKEKCPKCGNEEMQYHTLQLRSADEGATVFYTCTNCGYRFRTN 122
>UniRef100_Q61W07 Hypothetical protein CBG04618 [Caenorhabditis briggsae]
Length = 119
Score = 70.1 bits (170), Expect = 1e-11
Identities = 37/101 (36%), Positives = 55/101 (53%), Gaps = 3/101 (2%)
Query: 11 FCHYCGTMLTVP--SVNYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELGIETIEEQK 68
FC YCG +L +P + + C +C R N+K+ + +S E R + IE +
Sbjct: 12 FCGYCGAILELPPQAPSTVTCKVCSTRWNVKERVDQVVSRVEKIYE-RTVADTDGIENDE 70
Query: 69 VQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRC 109
+ V+ C KCG+ +A++ T Q RSADEGQT FY C +C
Sbjct: 71 SADAVVDHICTKCGHTKASYSTMQTRSADEGQTVFYTCLKC 111
>UniRef100_Q6BKA5 DNA-directed RNA polymerase [Debaryomyces hansenii]
Length = 123
Score = 69.7 bits (169), Expect = 1e-11
Identities = 39/115 (33%), Positives = 63/115 (53%), Gaps = 8/115 (6%)
Query: 10 LFCHYCGTML-TVPSVNYAECPLC------KNRSNIKDIEGKEISYTISAEEIRRELGIE 62
+FC CG +L TV S + C LC N +N+K I SA +++R +
Sbjct: 8 IFCTDCGNLLDTVGSKSTLNCKLCHKSYPTSNFANLKVITQSSEDAFPSALKMKRSVVKT 67
Query: 63 TIEEQKVQL-SKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
+++ +++ + + + C KCG E ++ Q+RSADEG T FY CT CG++F+ N
Sbjct: 68 SLKNDELEEGATIKEKCPKCGTEEMQYHVLQLRSADEGATVFYTCTGCGYRFRTN 122
>UniRef100_Q7PR45 ENSANGP00000015033 [Anopheles gambiae str. PEST]
Length = 114
Score = 68.9 bits (167), Expect = 3e-11
Identities = 36/109 (33%), Positives = 57/109 (52%), Gaps = 3/109 (2%)
Query: 11 FCHYCGTMLT-VPSVNYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELG--IETIEEQ 67
FC CG++L + + N C C++ + E YTI + + E +
Sbjct: 5 FCPDCGSILPPLKNSNRVSCYGCQSEFDAAAFGTMETEYTIHFNSYANKKSDQADRAEGE 64
Query: 68 KVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
+ + VN+ C KCGN + ++ T Q+RSADEGQT F+ CT+C ++ EN
Sbjct: 65 EAEGPIVNRQCPKCGNDQMSYATLQLRSADEGQTVFFTCTKCKYKMSEN 113
>UniRef100_UPI00003C229A UPI00003C229A UniRef100 entry
Length = 147
Score = 68.2 bits (165), Expect = 4e-11
Identities = 40/135 (29%), Positives = 61/135 (44%), Gaps = 28/135 (20%)
Query: 10 LFCHYCGTMLTVPS----VNYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELGIETI- 64
LFC CG++L VP + A C +N D+ + + S ++ L I T
Sbjct: 12 LFCPNCGSLLDVPGDEDMIKCAPCGAVQNAKGSADLLSTLLISSRSNNQVYDNLSIVTRS 71
Query: 65 -----------------------EEQKVQLSKVNKTCEKCGNGEATFYTRQMRSADEGQT 101
+++K Q + + + C CGN E F+T Q+RSADEG T
Sbjct: 72 HPSAFPSALRQKRQLVNTAAALGDDKKPQEATIKEKCPGCGNDEMNFHTLQLRSADEGTT 131
Query: 102 TFYACTRCGHQFQEN 116
FY C +CG++F +N
Sbjct: 132 VFYDCPKCGYKFSQN 146
>UniRef100_O17587 Hypothetical protein C15H11.8 [Caenorhabditis elegans]
Length = 119
Score = 68.2 bits (165), Expect = 4e-11
Identities = 37/101 (36%), Positives = 54/101 (52%), Gaps = 3/101 (2%)
Query: 11 FCHYCGTMLTVPSVNYA--ECPLCKNRSNIKDIEGKEISYTISAEEIRRELGIETIEEQK 68
FC YCG +L +P+ A C +C R +K+ + +S E R + IE +
Sbjct: 12 FCGYCGAILELPAQAPATVSCKVCSTRWAVKERVDQVVSRVEKIYE-RTVADTDGIENDE 70
Query: 69 VQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRC 109
+ V+ C KCG+ +A++ T Q RSADEGQT FY C +C
Sbjct: 71 SADAVVDHICTKCGHSKASYSTMQTRSADEGQTVFYTCLKC 111
>UniRef100_Q8SRA1 DNA-DIRECTED RNA POLYMERASE I SUBUNIT M [Encephalitozoon cuniculi]
Length = 99
Score = 68.2 bits (165), Expect = 4e-11
Identities = 39/104 (37%), Positives = 62/104 (59%), Gaps = 11/104 (10%)
Query: 10 LFCHYCGTMLTVPS-VNYAECPLCKNRSNIKDIEGKEISYTISAEEIRRELGIETIEEQK 68
+FC CGT++ V S + ECP CK +++ I+ T + +++R+L IE ++
Sbjct: 1 MFC-ICGTLVYVRSDSSRVECPRCKRENSVAMIKP-----TYTEVKVQRDLHIEAVD--- 51
Query: 69 VQLSKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQ 112
VQ +K+ C CG E + T Q+RS DEGQT FY+C +CG++
Sbjct: 52 VQGAKIKHRCPACGAEEMMYNTAQLRSTDEGQTVFYSC-KCGYR 94
>UniRef100_P32529 DNA-directed RNA polymerase I 13.7 kDa polypeptide [Saccharomyces
cerevisiae]
Length = 125
Score = 65.5 bits (158), Expect = 3e-10
Identities = 36/117 (30%), Positives = 63/117 (53%), Gaps = 10/117 (8%)
Query: 10 LFCHYCGTMLTVPSV---NYAECPLCK------NRSNIKDIEGKEISYTISAEEIRRELG 60
+FC CG +L P+ + EC CK SN+K + S+ ++ +
Sbjct: 8 IFCLDCGDLLENPNAVLGSNVECSQCKAIYPKSQFSNLKVVTTTADDAFPSSLRAKKSVV 67
Query: 61 IETIEEQKVQL-SKVNKTCEKCGNGEATFYTRQMRSADEGQTTFYACTRCGHQFQEN 116
++++ +++ + + + C +CGN E ++T Q+RSADEG T FY CT CG++F+ N
Sbjct: 68 KTSLKKNELKDGATIKEKCPQCGNEEMNYHTLQLRSADEGATVFYTCTSCGYKFRTN 124
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.319 0.133 0.400
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 179,873,128
Number of Sequences: 2790947
Number of extensions: 6134764
Number of successful extensions: 27749
Number of sequences better than 10.0: 374
Number of HSP's better than 10.0 without gapping: 320
Number of HSP's successfully gapped in prelim test: 54
Number of HSP's that attempted gapping in prelim test: 27119
Number of HSP's gapped (non-prelim): 643
length of query: 116
length of database: 848,049,833
effective HSP length: 92
effective length of query: 24
effective length of database: 591,282,709
effective search space: 14190785016
effective search space used: 14190785016
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 67 (30.4 bits)
Lotus: description of TM0309.4