
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC130806.14 - phase: 0
(394 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III 71 4e-12
GIS2_YEAST (P53849) Zinc-finger protein GIS2 67 1e-10
ZCH3_HUMAN (Q9NUD5) Zinc finger CCHC domain containing protein 3 65 4e-10
CNBP_RAT (P62634) Cellular nucleic acid binding protein (CNBP) (... 64 5e-10
CNBP_MOUSE (P53996) Cellular nucleic acid binding protein (CNBP)... 64 5e-10
CNBP_HUMAN (P62633) Cellular nucleic acid binding protein (CNBP)... 64 5e-10
CNBP_CHICK (O42395) Cellular nucleic acid binding protein (CNBP) 64 5e-10
HEXP_LEIMA (Q04832) DNA-binding protein HEXBP (Hexamer-binding p... 61 4e-09
POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic pro... 60 7e-09
BYR3_SCHPO (P36627) Cellular nucleic acid binding protein homolog 59 2e-08
POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic pro... 59 3e-08
GLH4_CAEEL (O76743) ATP-dependent RNA helicase glh-4 (EC 3.6.1.-... 55 3e-07
GLH1_CAEEL (P34689) ATP-dependent RNA helicase glh-1 (EC 3.6.1.-... 53 2e-06
YL92_SCHPO (Q9HFF2) Hypothetical protein C683.02c in chromosome I 52 3e-06
POL4_DROME (P10394) Retrovirus-related Pol polyprotein from tran... 52 3e-06
POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic pro... 51 4e-06
GAG_VILV2 (P23425) Gag polyprotein [Contains: Core protein p16; ... 50 1e-05
GAG_VILV1 (P23424) Gag polyprotein [Contains: Core protein p16; ... 50 1e-05
GAG_VILV (P03352) Gag polyprotein [Contains: Core protein p16; C... 50 1e-05
GAG_SIVAI (Q02843) Gag polyprotein [Contains: Core protein p17; ... 49 2e-05
>YL52_CAEEL (P34431) Hypothetical protein F44E2.2 in chromosome III
Length = 2186
Score = 71.2 bits (173), Expect = 4e-12
Identities = 34/103 (33%), Positives = 59/103 (57%)
Query: 288 VVNEFPEVFSDEIPDVPPEREVEFSIDLVPRTKPVSMAPYRMSASELAELKNQLEDLLDK 347
V+ +F +VF+ ++ E I+L +P+ P + + E++ ++ +L++
Sbjct: 909 VIEQFQDVFAISDDELGRNSGTECVIELKEGAEPIRQKPRPIPLALKPEIRKMIQKMLNQ 968
Query: 348 KFVRASVFPWGAPVLLAKKKDGSMRLCIAYPPLNKVPIKNRYP 390
K +R S PW +PV+L KKKDGS+R+CI Y +NKV N +P
Sbjct: 969 KVIRESKSPWSSPVVLVKKKDGSIRMCIDYRKVNKVVKNNAHP 1011
Score = 42.0 bits (97), Expect = 0.003
Identities = 29/97 (29%), Positives = 44/97 (44%), Gaps = 8/97 (8%)
Query: 42 PKGRDTPAEIVCFKCGEKGHKSNVCTKDEK-----KCFRCGQKGHVLADC-KRGDIVCYN 95
P+ P++ C C ++G C+K K KC C Q G +A C K + C+
Sbjct: 535 PQKHQNPSDR-CSDCQQRGWHMFWCSKKSKDNASQKCDECQQSGWHMASCFKLKNRACFR 593
Query: 96 CNGEGHISSQCTQPKKVRTGGKVFALAGMQTTNEDRL 132
CN GHI+ C + K T K +A ++T R+
Sbjct: 594 CNEMGHIAWNCPK-KNENTSEKEAPVAKVETIEGVRM 629
>GIS2_YEAST (P53849) Zinc-finger protein GIS2
Length = 153
Score = 66.6 bits (161), Expect = 1e-10
Identities = 29/69 (42%), Positives = 40/69 (57%), Gaps = 5/69 (7%)
Query: 42 PKGRDTPAEIVCFKCGEKGHKSNVCTKDEK----KCFRCGQKGHVLADCKRGDIVCYNCN 97
PK +++ C+KCG H + C K++ KC+ CGQ GH+ DC+ D +CYNCN
Sbjct: 83 PKKTSRFSKVSCYKCGGPNHMAKDCMKEDGISGLKCYTCGQAGHMSRDCQN-DRLCYNCN 141
Query: 98 GEGHISSQC 106
GHIS C
Sbjct: 142 ETGHISKDC 150
Score = 66.2 bits (160), Expect = 1e-10
Identities = 30/77 (38%), Positives = 42/77 (53%), Gaps = 6/77 (7%)
Query: 46 DTPAEIVCFKCGEKGHKSNVCTKDE----KKCFRCGQKGHVLADCKRGDIVCYNCNGEGH 101
D +E +C+ C + GH CT K+C+ CG+ GHV ++C C+NCN GH
Sbjct: 18 DCDSERLCYNCNKPGHVQTDCTMPRTVEFKQCYNCGETGHVRSECTVQR--CFNCNQTGH 75
Query: 102 ISSQCTQPKKVRTGGKV 118
IS +C +PKK KV
Sbjct: 76 ISRECPEPKKTSRFSKV 92
Score = 51.2 bits (121), Expect = 4e-06
Identities = 19/43 (44%), Positives = 28/43 (64%), Gaps = 1/43 (2%)
Query: 70 EKKCFRCGQKGHVLADCKRGDIVCYNCNGEGHISSQCTQPKKV 112
+K C+ CG+ GH+ DC + +CYNCN GH+ + CT P+ V
Sbjct: 3 QKACYVCGKIGHLAEDCD-SERLCYNCNKPGHVQTDCTMPRTV 44
Score = 41.2 bits (95), Expect = 0.005
Identities = 14/36 (38%), Positives = 21/36 (57%), Gaps = 1/36 (2%)
Query: 53 CFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADCKR 88
C+ CG+ GH S C D + C+ C + GH+ DC +
Sbjct: 118 CYTCGQAGHMSRDCQND-RLCYNCNETGHISKDCPK 152
>ZCH3_HUMAN (Q9NUD5) Zinc finger CCHC domain containing protein 3
Length = 404
Score = 64.7 bits (156), Expect = 4e-10
Identities = 29/71 (40%), Positives = 42/71 (58%), Gaps = 3/71 (4%)
Query: 53 CFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADCKRGDIVCYNCNGEGHISSQCTQPKKV 112
CFKCG + H S CT+D +CFRCG++GH+ C++G IVC C GH +QC +
Sbjct: 336 CFKCGSRTHMSGSCTQD--RCFRCGEEGHLSPYCRKG-IVCNLCGKRGHAFAQCPKAVHN 392
Query: 113 RTGGKVFALAG 123
++ +AG
Sbjct: 393 SVAAQLTGVAG 403
Score = 38.1 bits (87), Expect = 0.039
Identities = 14/36 (38%), Positives = 20/36 (54%), Gaps = 2/36 (5%)
Query: 71 KKCFRCGQKGHVLADCKRGDIVCYNCNGEGHISSQC 106
K CF+CG + H+ C + C+ C EGH+S C
Sbjct: 334 KTCFKCGSRTHMSGSCTQDR--CFRCGEEGHLSPYC 367
>CNBP_RAT (P62634) Cellular nucleic acid binding protein (CNBP)
(Zinc finger protein 9)
Length = 177
Score = 64.3 bits (155), Expect = 5e-10
Identities = 28/63 (44%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 46 DTPAEIVCFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADC-KRGDIVCYNCNGEGHISS 104
D E C+ CGE GH CTK KC+RCG+ GHV +C K ++ CY C GH++
Sbjct: 112 DHADEQKCYSCGEFGHIQKDCTK--VKCYRCGETGHVAINCSKTSEVNCYRCGESGHLAR 169
Query: 105 QCT 107
+CT
Sbjct: 170 ECT 172
Score = 63.5 bits (153), Expect = 9e-10
Identities = 29/84 (34%), Positives = 45/84 (53%), Gaps = 6/84 (7%)
Query: 30 GKGKQRLNDERRPKGRDTPAEIVCFKCGEKGHKSNVCTK-DEKKCFRCGQKGHVLADCKR 88
G+G D + PK E C+ CG+ GH + C DE+KC+ CG+ GH+ DC +
Sbjct: 78 GRGGHIAKDCKEPKRE---REQCCYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCTK 134
Query: 89 GDIVCYNCNGEGHISSQCTQPKKV 112
+ CY C GH++ C++ +V
Sbjct: 135 --VKCYRCGETGHVAINCSKTSEV 156
Score = 62.8 bits (151), Expect = 1e-09
Identities = 23/59 (38%), Positives = 33/59 (54%), Gaps = 4/59 (6%)
Query: 52 VCFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADCK----RGDIVCYNCNGEGHISSQC 106
+C++CGE GH + C E C+ CG+ GH+ DCK + CYNC GH++ C
Sbjct: 53 ICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDC 111
Score = 58.5 bits (140), Expect = 3e-08
Identities = 29/89 (32%), Positives = 37/89 (40%), Gaps = 28/89 (31%)
Query: 53 CFKCGEKGHKSNVC------------------TKDE----------KKCFRCGQKGHVLA 84
CFKCG GH + C T D C+RCG+ GH+
Sbjct: 6 CFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGHLAK 65
Query: 85 DCKRGDIVCYNCNGEGHISSQCTQPKKVR 113
DC + CYNC GHI+ C +PK+ R
Sbjct: 66 DCDLQEDACYNCGRGGHIAKDCKEPKRER 94
>CNBP_MOUSE (P53996) Cellular nucleic acid binding protein (CNBP)
(Zinc finger protein 9)
Length = 178
Score = 64.3 bits (155), Expect = 5e-10
Identities = 28/63 (44%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 46 DTPAEIVCFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADC-KRGDIVCYNCNGEGHISS 104
D E C+ CGE GH CTK KC+RCG+ GHV +C K ++ CY C GH++
Sbjct: 113 DHADEQKCYSCGEFGHIQKDCTK--VKCYRCGETGHVAINCSKTSEVNCYRCGESGHLAR 170
Query: 105 QCT 107
+CT
Sbjct: 171 ECT 173
Score = 63.5 bits (153), Expect = 9e-10
Identities = 29/84 (34%), Positives = 45/84 (53%), Gaps = 6/84 (7%)
Query: 30 GKGKQRLNDERRPKGRDTPAEIVCFKCGEKGHKSNVCTK-DEKKCFRCGQKGHVLADCKR 88
G+G D + PK E C+ CG+ GH + C DE+KC+ CG+ GH+ DC +
Sbjct: 79 GRGGHIAKDCKEPKRE---REQCCYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCTK 135
Query: 89 GDIVCYNCNGEGHISSQCTQPKKV 112
+ CY C GH++ C++ +V
Sbjct: 136 --VKCYRCGETGHVAINCSKTSEV 157
Score = 59.7 bits (143), Expect = 1e-08
Identities = 22/60 (36%), Positives = 36/60 (59%), Gaps = 5/60 (8%)
Query: 52 VCFKCGEKGHKSNVCT-KDEKKCFRCGQKGHVLADCK----RGDIVCYNCNGEGHISSQC 106
+C++CGE GH + C ++++ C+ CG+ GH+ DCK + CYNC GH++ C
Sbjct: 53 ICYRCGESGHLAKDCDLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDC 112
Score = 58.5 bits (140), Expect = 3e-08
Identities = 27/78 (34%), Positives = 39/78 (49%), Gaps = 5/78 (6%)
Query: 43 KGRDTPAEIVCFKCGEKGHKSNVCTKDEKK----CFRCGQKGHVLADCKRGDIV-CYNCN 97
K D + C+ CG GH + C + +++ C+ CG+ GH+ DC D CY+C
Sbjct: 65 KDCDLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCG 124
Query: 98 GEGHISSQCTQPKKVRTG 115
GHI CT+ K R G
Sbjct: 125 EFGHIQKDCTKVKCYRCG 142
Score = 56.6 bits (135), Expect = 1e-07
Identities = 30/90 (33%), Positives = 38/90 (41%), Gaps = 29/90 (32%)
Query: 53 CFKCGEKGHKSNVC------------------TKDE----------KKCFRCGQKGHVLA 84
CFKCG GH + C T D C+RCG+ GH+
Sbjct: 6 CFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGHLAK 65
Query: 85 DCK-RGDIVCYNCNGEGHISSQCTQPKKVR 113
DC + D CYNC GHI+ C +PK+ R
Sbjct: 66 DCDLQEDEACYNCGRGGHIAKDCKEPKRER 95
>CNBP_HUMAN (P62633) Cellular nucleic acid binding protein (CNBP)
(Zinc finger protein 9)
Length = 177
Score = 64.3 bits (155), Expect = 5e-10
Identities = 28/63 (44%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 46 DTPAEIVCFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADC-KRGDIVCYNCNGEGHISS 104
D E C+ CGE GH CTK KC+RCG+ GHV +C K ++ CY C GH++
Sbjct: 112 DHADEQKCYSCGEFGHIQKDCTK--VKCYRCGETGHVAINCSKTSEVNCYRCGESGHLAR 169
Query: 105 QCT 107
+CT
Sbjct: 170 ECT 172
Score = 63.5 bits (153), Expect = 9e-10
Identities = 29/84 (34%), Positives = 45/84 (53%), Gaps = 6/84 (7%)
Query: 30 GKGKQRLNDERRPKGRDTPAEIVCFKCGEKGHKSNVCTK-DEKKCFRCGQKGHVLADCKR 88
G+G D + PK E C+ CG+ GH + C DE+KC+ CG+ GH+ DC +
Sbjct: 78 GRGGHIAKDCKEPKRE---REQCCYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCTK 134
Query: 89 GDIVCYNCNGEGHISSQCTQPKKV 112
+ CY C GH++ C++ +V
Sbjct: 135 --VKCYRCGETGHVAINCSKTSEV 156
Score = 62.8 bits (151), Expect = 1e-09
Identities = 23/59 (38%), Positives = 33/59 (54%), Gaps = 4/59 (6%)
Query: 52 VCFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADCK----RGDIVCYNCNGEGHISSQC 106
+C++CGE GH + C E C+ CG+ GH+ DCK + CYNC GH++ C
Sbjct: 53 ICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDC 111
Score = 58.5 bits (140), Expect = 3e-08
Identities = 29/89 (32%), Positives = 37/89 (40%), Gaps = 28/89 (31%)
Query: 53 CFKCGEKGHKSNVC------------------TKDE----------KKCFRCGQKGHVLA 84
CFKCG GH + C T D C+RCG+ GH+
Sbjct: 6 CFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGHLAK 65
Query: 85 DCKRGDIVCYNCNGEGHISSQCTQPKKVR 113
DC + CYNC GHI+ C +PK+ R
Sbjct: 66 DCDLQEDACYNCGRGGHIAKDCKEPKRER 94
>CNBP_CHICK (O42395) Cellular nucleic acid binding protein (CNBP)
Length = 172
Score = 64.3 bits (155), Expect = 5e-10
Identities = 28/63 (44%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 46 DTPAEIVCFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADC-KRGDIVCYNCNGEGHISS 104
D E C+ CGE GH CTK KC+RCG+ GHV +C K ++ CY C GH++
Sbjct: 107 DHADEQKCYSCGEFGHIQKDCTK--VKCYRCGETGHVAINCSKTSEVNCYRCGESGHLAR 164
Query: 105 QCT 107
+CT
Sbjct: 165 ECT 167
Score = 63.5 bits (153), Expect = 9e-10
Identities = 29/84 (34%), Positives = 45/84 (53%), Gaps = 6/84 (7%)
Query: 30 GKGKQRLNDERRPKGRDTPAEIVCFKCGEKGHKSNVCTK-DEKKCFRCGQKGHVLADCKR 88
G+G D + PK E C+ CG+ GH + C DE+KC+ CG+ GH+ DC +
Sbjct: 73 GRGGHIAKDCKEPKRE---REQCCYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCTK 129
Query: 89 GDIVCYNCNGEGHISSQCTQPKKV 112
+ CY C GH++ C++ +V
Sbjct: 130 --VKCYRCGETGHVAINCSKTSEV 151
Score = 61.2 bits (147), Expect = 4e-09
Identities = 23/60 (38%), Positives = 36/60 (59%), Gaps = 5/60 (8%)
Query: 52 VCFKCGEKGHKSNVCT-KDEKKCFRCGQKGHVLADCK----RGDIVCYNCNGEGHISSQC 106
+C++CGE GH + C +++K C+ CG+ GH+ DCK + CYNC GH++ C
Sbjct: 47 ICYRCGESGHLAKDCDLQEDKACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDC 106
Score = 58.5 bits (140), Expect = 3e-08
Identities = 27/78 (34%), Positives = 39/78 (49%), Gaps = 5/78 (6%)
Query: 43 KGRDTPAEIVCFKCGEKGHKSNVCTKDEKK----CFRCGQKGHVLADCKRGDIV-CYNCN 97
K D + C+ CG GH + C + +++ C+ CG+ GH+ DC D CY+C
Sbjct: 59 KDCDLQEDKACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCG 118
Query: 98 GEGHISSQCTQPKKVRTG 115
GHI CT+ K R G
Sbjct: 119 EFGHIQKDCTKVKCYRCG 136
Score = 57.8 bits (138), Expect = 5e-08
Identities = 28/84 (33%), Positives = 37/84 (43%), Gaps = 23/84 (27%)
Query: 53 CFKCGEKGHKSNVCTKDEKK----------------------CFRCGQKGHVLADCK-RG 89
CFKCG GH + C + C+RCG+ GH+ DC +
Sbjct: 6 CFKCGRTGHWARECPTGIGRGRGMRSRGRAGFQFMSSSLPDICYRCGESGHLAKDCDLQE 65
Query: 90 DIVCYNCNGEGHISSQCTQPKKVR 113
D CYNC GHI+ C +PK+ R
Sbjct: 66 DKACYNCGRGGHIAKDCKEPKRER 89
>HEXP_LEIMA (Q04832) DNA-binding protein HEXBP (Hexamer-binding
protein)
Length = 271
Score = 61.2 bits (147), Expect = 4e-09
Identities = 28/78 (35%), Positives = 36/78 (45%), Gaps = 14/78 (17%)
Query: 53 CFKCGEKGHKSNVCTKDEKK-------CFRCGQKGHVLADCKRG-------DIVCYNCNG 98
CF+CGE+GH S C + + CFRCG+ GH+ DC CY C
Sbjct: 45 CFRCGEEGHMSRECPNEARSGAAGAMTCFRCGEAGHMSRDCPNSAKPGAAKGFECYKCGQ 104
Query: 99 EGHISSQCTQPKKVRTGG 116
EGH+S C + GG
Sbjct: 105 EGHLSRDCPSSQGGSRGG 122
Score = 58.9 bits (141), Expect = 2e-08
Identities = 25/70 (35%), Positives = 36/70 (50%), Gaps = 14/70 (20%)
Query: 53 CFKCGEKGHKSNVCTKDE--------KKCFRCGQKGHVLADCKR------GDIVCYNCNG 98
C+KCG+ GH S C + +KC++CG+ GH+ +C GD CY C
Sbjct: 170 CYKCGDAGHISRDCPNGQGGYSGAGDRKCYKCGESGHMSRECPSAGSTGSGDRACYKCGK 229
Query: 99 EGHISSQCTQ 108
GHIS +C +
Sbjct: 230 PGHISRECPE 239
Score = 57.4 bits (137), Expect = 6e-08
Identities = 25/71 (35%), Positives = 33/71 (46%), Gaps = 17/71 (23%)
Query: 53 CFKCGEKGHKSNVCTK------DEKKCFRCGQKGHVLADCKR-----------GDIVCYN 95
C+KCGE GH S C ++ C++CG+ GH+ +C GD CY
Sbjct: 198 CYKCGESGHMSRECPSAGSTGSGDRACYKCGKPGHISRECPEAGGSYGGSRGGGDRTCYK 257
Query: 96 CNGEGHISSQC 106
C GHIS C
Sbjct: 258 CGEAGHISRDC 268
Score = 56.6 bits (135), Expect = 1e-07
Identities = 30/86 (34%), Positives = 37/86 (42%), Gaps = 30/86 (34%)
Query: 51 IVCFKCGEKGHKSNVCTKDEK-------KCFRCGQKGHVLADC-------------KR-- 88
+ CF+CGE GH S C K +C++CGQ+GH+ DC KR
Sbjct: 70 MTCFRCGEAGHMSRDCPNSAKPGAAKGFECYKCGQEGHLSRDCPSSQGGSRGGYGQKRGR 129
Query: 89 --------GDIVCYNCNGEGHISSQC 106
GD CY C GHIS C
Sbjct: 130 SGAQGGYSGDRTCYKCGDAGHISRDC 155
Score = 56.2 bits (134), Expect = 1e-07
Identities = 25/81 (30%), Positives = 36/81 (43%), Gaps = 16/81 (19%)
Query: 53 CFKCGEKGHKSNVCTKDE--------KKCFRCGQKGHVLADCKR--------GDIVCYNC 96
C+KCG+ GH S C + + C++CG GH+ DC GD CY C
Sbjct: 142 CYKCGDAGHISRDCPNGQGGYSGAGDRTCYKCGDAGHISRDCPNGQGGYSGAGDRKCYKC 201
Query: 97 NGEGHISSQCTQPKKVRTGGK 117
GH+S +C +G +
Sbjct: 202 GESGHMSRECPSAGSTGSGDR 222
Score = 55.5 bits (132), Expect = 2e-07
Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 17/88 (19%)
Query: 38 DERRPKGRDTPAEIVCFKCGEKGHKSNVCTKDEKK-------CFRCGQKGHVLADCKR-- 88
D +RP+ T + C CG++GH + C + + K CFRCG++GH+ +C
Sbjct: 6 DVKRPR---TESSTSCRNCGKEGHYARECPEADSKGDERSTTCFRCGEEGHMSRECPNEA 62
Query: 89 -----GDIVCYNCNGEGHISSQCTQPKK 111
G + C+ C GH+S C K
Sbjct: 63 RSGAAGAMTCFRCGEAGHMSRDCPNSAK 90
Score = 52.4 bits (124), Expect = 2e-06
Identities = 25/85 (29%), Positives = 33/85 (38%), Gaps = 31/85 (36%)
Query: 53 CFKCGEKGHKSNVCTKDE-----------------------KKCFRCGQKGHVLADCKR- 88
C+KCG++GH S C + + C++CG GH+ DC
Sbjct: 99 CYKCGQEGHLSRDCPSSQGGSRGGYGQKRGRSGAQGGYSGDRTCYKCGDAGHISRDCPNG 158
Query: 89 -------GDIVCYNCNGEGHISSQC 106
GD CY C GHIS C
Sbjct: 159 QGGYSGAGDRTCYKCGDAGHISRDC 183
Score = 41.6 bits (96), Expect = 0.004
Identities = 18/56 (32%), Positives = 27/56 (48%), Gaps = 9/56 (16%)
Query: 67 TKDEKKCFRCGQKGHVLADCKRGD-------IVCYNCNGEGHISSQCTQPKKVRTG 115
T+ C CG++GH +C D C+ C EGH+S +C P + R+G
Sbjct: 12 TESSTSCRNCGKEGHYARECPEADSKGDERSTTCFRCGEEGHMSREC--PNEARSG 65
>POL_CAMVC (P03555) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 679
Score = 60.5 bits (145), Expect = 7e-09
Identities = 73/307 (23%), Positives = 124/307 (39%), Gaps = 45/307 (14%)
Query: 124 MQTTNEDRL-IRGTCFFNS---IPLIAIIDTGATHCFIALECAYKLGLIVSDMKGEMVVE 179
M TN + + I+G +F I L +DTGA+ C + + + ++ ++V+
Sbjct: 16 MNVTNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWVNAERP--IMVK 73
Query: 180 TPAKGSVTTSLVCLICPISMFGRDFEVDLVCLPLTGMDVIFGMNWLEYNR---------- 229
S+T S VC + + G F++ V +G+D I G N+ +
Sbjct: 74 IADGSSITISKVCKDIDLIIAGEIFKIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVI 133
Query: 230 --------VHINCFNKTVHF------------SSAEEESGVEILSTKEMKQLERDGILMY 269
VHI + V S ++ V I + K LE IL
Sbjct: 134 FTKNKSYPVHITKLTRAVRVGIEGFLESMKKRSKTQQPEPVNISTNKIENPLEEIAILSE 193
Query: 270 SLMACLSLENQAVID-KLPVVNEFPEVFSDEIPDVPPERE--VEFSIDLVPRTKPVSMAP 326
LS E + ++ + E E E P P + + ++ SI L +K + + P
Sbjct: 194 GRR--LSEEKLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKP 251
Query: 327 YRMSASELAELKNQLEDLLDKKFVRASVFPWGAPVLL----AKKKDGSMRLCIAYPPLNK 382
+ S + E Q+++LLD K ++ S P AP L A+K+ G R+ + Y +NK
Sbjct: 252 MKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNK 311
Query: 383 VPIKNRY 389
I + Y
Sbjct: 312 ATIGDAY 318
>BYR3_SCHPO (P36627) Cellular nucleic acid binding protein homolog
Length = 179
Score = 58.9 bits (141), Expect = 2e-08
Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 7/75 (9%)
Query: 39 ERRPKGRDTPAEIVCFKCGEKGHKSNVCT--KDEKKCFRCGQKGHVLADC-----KRGDI 91
E + R+ +C+ C + GHK++ CT + EK C+ CG GH++ DC R
Sbjct: 24 ENGHQARECTKGSICYNCNQTGHKASECTEPQQEKTCYACGTAGHLVRDCPSSPNPRQGA 83
Query: 92 VCYNCNGEGHISSQC 106
CY C GHI+ C
Sbjct: 84 ECYKCGRVGHIARDC 98
Score = 57.0 bits (136), Expect = 8e-08
Identities = 21/48 (43%), Positives = 33/48 (68%), Gaps = 1/48 (2%)
Query: 67 TKDEKKCFRCGQKGHVLADCKRGDIVCYNCNGEGHISSQCTQPKKVRT 114
T+ +C+ CG+ GH +C +G I CYNCN GH +S+CT+P++ +T
Sbjct: 13 TRPGPRCYNCGENGHQARECTKGSI-CYNCNQTGHKASECTEPQQEKT 59
Score = 56.6 bits (135), Expect = 1e-07
Identities = 24/59 (40%), Positives = 34/59 (56%), Gaps = 3/59 (5%)
Query: 53 CFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADCKRGDI--VCYNCNGEGHISSQCTQP 109
C+ CG GH++ CT K C+ CG+ GH +C++ +CY CN GHI+ CT P
Sbjct: 118 CYACGSYGHQARDCTMGVK-CYSCGKIGHRSFECQQASDGQLCYKCNQPGHIAVNCTSP 175
Score = 55.1 bits (131), Expect = 3e-07
Identities = 23/69 (33%), Positives = 34/69 (48%), Gaps = 3/69 (4%)
Query: 53 CFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADC--KRGDIVCYNCNGEGHISSQCTQPK 110
C+ CGE GH++ CTK C+ C Q GH ++C + + CY C GH+ C
Sbjct: 19 CYNCGENGHQARECTKG-SICYNCNQTGHKASECTEPQQEKTCYACGTAGHLVRDCPSSP 77
Query: 111 KVRTGGKVF 119
R G + +
Sbjct: 78 NPRQGAECY 86
Score = 43.1 bits (100), Expect = 0.001
Identities = 16/46 (34%), Positives = 26/46 (55%), Gaps = 2/46 (4%)
Query: 43 KGRDTPAEIVCFKCGEKGHKSNVC--TKDEKKCFRCGQKGHVLADC 86
+ RD + C+ CG+ GH+S C D + C++C Q GH+ +C
Sbjct: 127 QARDCTMGVKCYSCGKIGHRSFECQQASDGQLCYKCNQPGHIAVNC 172
>POL_CAMVD (P03556) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 674
Score = 58.5 bits (140), Expect = 3e-08
Identities = 70/304 (23%), Positives = 128/304 (42%), Gaps = 35/304 (11%)
Query: 124 MQTTNEDRL-IRGTCFFNS---IPLIAIIDTGATHCFIALECAYKLGLIVSDMKGEMVVE 179
M TN + + I+G +F I L +DTGA+ C + + I ++ ++V+
Sbjct: 18 MNITNPNSIYIKGRLYFKGYKKIELHCFVDTGASLCIASKFVIPEEHWINAERP--IMVK 75
Query: 180 TPAKGSVTTSLVCLICPISMFGRDFEVDLVCLPLTGMDVIFGMNWLEYNRVHINCFNKT- 238
S+T + VC + + G F + V +G+D I G N+ + I ++
Sbjct: 76 IADGSSITINKVCRDIDLIIAGEIFHIPTVYQQESGIDFIIGNNFCQLYEPFIQFTDRVI 135
Query: 239 --------VHFSSAEE------ESGVEILSTKEMKQLERDGILMYSLMACLSLENQAVID 284
VH + E +E + + Q + + +A LS + +
Sbjct: 136 FTKDRTYPVHIAKLTRAVRVGTEGFLESMKKRSKTQQPEPVNISTNKIAILSEGRRLSEE 195
Query: 285 KLPV-------VNEFPEVFSDEIPDVPPERE--VEFSIDLVPRTKPVSMAPYRMSASELA 335
KL + + E E E P P + + ++ SI L +K + + P + S +
Sbjct: 196 KLFITQQRMQKIEELLEKVCSENPLDPNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDRE 255
Query: 336 ELKNQLEDLLDKKFVRASVFPWGAPVLL----AKKKDGSMRLCIAYPPLNKVPIKNRY-P 390
E Q+++LLD K ++ S P AP L A+K+ G R+ + Y +NK + + Y P
Sbjct: 256 EFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNP 315
Query: 391 SSKN 394
+K+
Sbjct: 316 PNKD 319
>GLH4_CAEEL (O76743) ATP-dependent RNA helicase glh-4 (EC 3.6.1.-)
(Germline helicase-4)
Length = 1156
Score = 55.1 bits (131), Expect = 3e-07
Identities = 31/95 (32%), Positives = 42/95 (43%), Gaps = 14/95 (14%)
Query: 25 YSAPAGKGKQRLNDERRPKGRDTPAEIVCFKCGEKGHKSNVCTKDEK---KCFRCGQKGH 81
+ P+ + + RP+G C CGE+GH S C K + C C Q GH
Sbjct: 552 FGEPSDNQRGNWDGGERPRG--------CHNCGEEGHISKECDKPKVPRFPCRNCEQLGH 603
Query: 82 VLADCKRGDI---VCYNCNGEGHISSQCTQPKKVR 113
+DC + + C NC EGH + C QPK R
Sbjct: 604 FASDCDQPRVPRGPCRNCGIEGHFAVDCDQPKVPR 638
Score = 41.2 bits (95), Expect = 0.005
Identities = 23/71 (32%), Positives = 34/71 (47%), Gaps = 10/71 (14%)
Query: 53 CFKCGEKGHKSNVCTKDEKK---CFRCGQKGHVLADCKRGDI------VCYNCNGEGHIS 103
C CG +GH + C + + C CGQ+GH DC+ + C C EGH
Sbjct: 618 CRNCGIEGHFAVDCDQPKVPRGPCRNCGQEGHFAKDCQNERVRMEPTEPCRRCAEEGHWG 677
Query: 104 SQC-TQPKKVR 113
+C T+PK ++
Sbjct: 678 YECPTRPKDLQ 688
>GLH1_CAEEL (P34689) ATP-dependent RNA helicase glh-1 (EC 3.6.1.-)
(Germline helicase-1)
Length = 763
Score = 52.8 bits (125), Expect = 2e-06
Identities = 30/113 (26%), Positives = 41/113 (35%), Gaps = 39/113 (34%)
Query: 42 PKGRDTPAEIVCFKCGEKGHKSNVCTKDEK------------------------------ 71
P+ R VC+ C + GH S CT++ K
Sbjct: 174 PEPRKEREPRVCYNCQQPGHTSRECTEERKPREGRTGGFGGGAGFGNNGGNDGFGGDGGF 233
Query: 72 ---------KCFRCGQKGHVLADCKRGDIVCYNCNGEGHISSQCTQPKKVRTG 115
KCF C +GH A+C C+NC +GH S++C P K R G
Sbjct: 234 GGGEERGPMKCFNCKGEGHRSAECPEPPRGCFNCGEQGHRSNECPNPAKPREG 286
Score = 51.6 bits (122), Expect = 3e-06
Identities = 28/68 (41%), Positives = 35/68 (51%), Gaps = 11/68 (16%)
Query: 57 GEKGHKSNVCTKDEKKCFRCGQKGHVLADC-----KRGDIVCYNCNGEGHISSQCTQ--- 108
GE GH + CF C Q GH +DC +R VCYNC GH S +CT+
Sbjct: 147 GEGGHGGG---ERNNNCFNCQQPGHRSSDCPEPRKEREPRVCYNCQQPGHTSRECTEERK 203
Query: 109 PKKVRTGG 116
P++ RTGG
Sbjct: 204 PREGRTGG 211
Score = 50.1 bits (118), Expect = 1e-05
Identities = 26/103 (25%), Positives = 40/103 (38%), Gaps = 44/103 (42%)
Query: 53 CFKCGEKGHKSNVCTKDEKK-----CFRCGQKGHVLADC--------------------- 86
CF C + GH+S+ C + K+ C+ C Q GH +C
Sbjct: 160 CFNCQQPGHRSSDCPEPRKEREPRVCYNCQQPGHTSRECTEERKPREGRTGGFGGGAGFG 219
Query: 87 ------------------KRGDIVCYNCNGEGHISSQCTQPKK 111
+RG + C+NC GEGH S++C +P +
Sbjct: 220 NNGGNDGFGGDGGFGGGEERGPMKCFNCKGEGHRSAECPEPPR 262
Score = 42.0 bits (97), Expect = 0.003
Identities = 18/57 (31%), Positives = 27/57 (46%)
Query: 44 GRDTPAEIVCFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADCKRGDIVCYNCNGEG 100
G + + CF C +GH+S C + + CF CG++GH +C GEG
Sbjct: 235 GGEERGPMKCFNCKGEGHRSAECPEPPRGCFNCGEQGHRSNECPNPAKPREGVEGEG 291
>YL92_SCHPO (Q9HFF2) Hypothetical protein C683.02c in chromosome I
Length = 218
Score = 51.6 bits (122), Expect = 3e-06
Identities = 30/101 (29%), Positives = 45/101 (43%), Gaps = 14/101 (13%)
Query: 42 PKGRDTPAEIVCFKCGEKGHKSNVCTKDE----KKCFRCGQKGHVLADCKRGDI------ 91
P+ +D + +CF+CG K H N C+K KCF C + GH+ C++
Sbjct: 93 PEAKDNVS--ICFRCGSKEHSLNACSKKGPLKFAKCFICHENGHLSGQCEQNPKGLYPKG 150
Query: 92 -VCYNCNGEGHISSQCTQPKKVRTG-GKVFALAGMQTTNED 130
C C+ H++ C Q K G V +AG +ED
Sbjct: 151 GCCKFCSSVHHLAKDCDQVNKDDVSFGHVVGVAGTTGADED 191
Score = 37.7 bits (86), Expect = 0.051
Identities = 23/83 (27%), Positives = 37/83 (43%), Gaps = 13/83 (15%)
Query: 29 AGKGKQRLNDERRPKGRDTPAEIVCFKCGEKGHKSNVCTKDEKKCFRCGQKGHVLADC-- 86
A G + DER+ K R + + N +D K CF C Q+GH++ DC
Sbjct: 45 ASFGSSKRYDERQKKKRSEYRRL---------RRINQRNRD-KFCFACRQQGHIVQDCPE 94
Query: 87 -KRGDIVCYNCNGEGHISSQCTQ 108
K +C+ C + H + C++
Sbjct: 95 AKDNVSICFRCGSKEHSLNACSK 117
>POL4_DROME (P10394) Retrovirus-related Pol polyprotein from
transposon 412 [Contains: Protease (EC 3.4.23.-);
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1237
Score = 51.6 bits (122), Expect = 3e-06
Identities = 43/168 (25%), Positives = 76/168 (44%), Gaps = 29/168 (17%)
Query: 250 VEILSTKEMKQLERDGILMYSLMACLSL-------ENQAVIDKLPVVNEFPEVFSDEIPD 302
V IL+T + QL L Y ++ ++ N+ V+ +L FPE+F ++ +
Sbjct: 224 VRILNTTDSDQLVNMDTLKYEPLSNYNVVQANSEHRNKTVLSQLK--KNFPELFKSQLEN 281
Query: 303 VPPEREVEFSIDLVPRT--------------KPVSMAPYRMSASELAELKNQLEDLLDKK 348
+ E F+++ P T +PV YR S++ E++ Q++ L+ K
Sbjct: 282 ICSEYIDIFALESEPITVNNLYKQQLRLKDDEPVYTKNYRSPHSQVEEIQAQVQKLIKDK 341
Query: 349 FVRASVFPWGAPVLLAKKKDG------SMRLCIAYPPLNKVPIKNRYP 390
V SV + +P+LL KK RL I Y +NK + +++P
Sbjct: 342 IVEPSVSQYNSPLLLVPKKSSPNSDKKKWRLVIDYRQINKKLLADKFP 389
>POL_SOCMV (P15629) Enzymatic polyprotein [Contains: Aspartic
protease (EC 3.4.23.-); Endonuclease; Reverse
transcriptase (EC 2.7.7.49)]
Length = 692
Score = 51.2 bits (121), Expect = 4e-06
Identities = 55/254 (21%), Positives = 109/254 (42%), Gaps = 19/254 (7%)
Query: 145 IAIIDTGATHCFIALECAYKLGLIVSDMKGEMVVETPAKGSVTTSLVCLICPISMFGRDF 204
+A IDTGAT CF + + ++ E+++ +K + ++ + I ++F
Sbjct: 32 LAYIDTGATLCFGKRKISNNWEILKQPK--EIIIADKSKHYIREAISNVFLKIE--NKEF 87
Query: 205 EVDLVCLPLTGMDVIFGMNWLEYNRVHINCFNKT-VHFSSAEEESGVEILSTKEMKQLER 263
+ ++ L +G+D+I G N+L+ + I + + + +++STK + + E
Sbjct: 88 LIPIIYLHDSGLDLIIGNNFLKLYQPFIQRLETIELRWKNLNNPKESQMISTKILTKNEV 147
Query: 264 DGILMYSLMACLSLENQAVIDKLPVVNEFPEVFSDEIPDVPPERE---VEFSI-DLVPRT 319
+ + CL + + + + EV S+ D + +E + D +
Sbjct: 148 LKLSFEKIHICL----EKYLFFKTIEEQLEEVCSEHPLDETKNKNGLLIEIRLKDPLQEI 203
Query: 320 KPVSMAPYRMSASELAELKNQLEDLLDKKFVRASVFPWGAPVLLAKK----KDGSMRLCI 375
+ PY + ++ E K + EDLL K +R S P AP + K G R+ I
Sbjct: 204 NVTNRIPYTIR--DVQEFKEECEDLLKKGLIRESQSPHSAPAFYVENHNEIKRGKRRMVI 261
Query: 376 AYPPLNKVPIKNRY 389
Y +N+ I + Y
Sbjct: 262 NYKKMNEATIGDSY 275
>GAG_VILV2 (P23425) Gag polyprotein [Contains: Core protein p16;
Core protein p25; Core protein p14]
Length = 442
Score = 49.7 bits (117), Expect = 1e-05
Identities = 20/55 (36%), Positives = 32/55 (57%), Gaps = 6/55 (10%)
Query: 57 GEKGHKSNVCTKDEKKCFRCGQKGHVLADCKRGDIVCYNCNGEGHISSQCTQPKK 111
G+ GHK +KC+ CG+ GH+ C++G I+C++C GH+ C Q K+
Sbjct: 376 GKAGHKGV-----NQKCYNCGKPGHLARQCRQG-IICHHCGKRGHMQKDCRQKKQ 424
>GAG_VILV1 (P23424) Gag polyprotein [Contains: Core protein p16;
Core protein p25; Core protein p14]
Length = 442
Score = 49.7 bits (117), Expect = 1e-05
Identities = 20/55 (36%), Positives = 32/55 (57%), Gaps = 6/55 (10%)
Query: 57 GEKGHKSNVCTKDEKKCFRCGQKGHVLADCKRGDIVCYNCNGEGHISSQCTQPKK 111
G+ GHK +KC+ CG+ GH+ C++G I+C++C GH+ C Q K+
Sbjct: 376 GKAGHKGV-----NQKCYNCGKPGHLARQCRQG-IICHHCGKRGHMQKDCRQKKQ 424
>GAG_VILV (P03352) Gag polyprotein [Contains: Core protein p16; Core
protein p25; Core protein p14]
Length = 442
Score = 49.7 bits (117), Expect = 1e-05
Identities = 20/55 (36%), Positives = 32/55 (57%), Gaps = 6/55 (10%)
Query: 57 GEKGHKSNVCTKDEKKCFRCGQKGHVLADCKRGDIVCYNCNGEGHISSQCTQPKK 111
G+ GHK +KC+ CG+ GH+ C++G I+C++C GH+ C Q K+
Sbjct: 376 GKAGHKGV-----NQKCYNCGKPGHLARQCRQG-IICHHCGKRGHMQKDCRQKKQ 424
>GAG_SIVAI (Q02843) Gag polyprotein [Contains: Core protein p17;
Core protein p24; Core protein p15]
Length = 513
Score = 48.9 bits (115), Expect = 2e-05
Identities = 23/61 (37%), Positives = 31/61 (50%), Gaps = 4/61 (6%)
Query: 42 PKGRDTPAEIVCFKCGEKGHKSNVCTKDEK-KCFRCGQKGHVLADCKRGDIVCYNCNGEG 100
P+ + + CF CG+ GH C + KCF+CG+ GH+ DCK G N G G
Sbjct: 381 PQKKGPRGPLKCFNCGKFGHMQRECKAPRQIKCFKCGKIGHMAKDCKNGQA---NFLGYG 437
Query: 101 H 101
H
Sbjct: 438 H 438
Score = 38.9 bits (89), Expect = 0.023
Identities = 15/36 (41%), Positives = 21/36 (57%), Gaps = 1/36 (2%)
Query: 72 KCFRCGQKGHVLADCKRG-DIVCYNCNGEGHISSQC 106
KCF CG+ GH+ +CK I C+ C GH++ C
Sbjct: 391 KCFNCGKFGHMQRECKAPRQIKCFKCGKIGHMAKDC 426
Score = 31.6 bits (70), Expect = 3.7
Identities = 9/26 (34%), Positives = 17/26 (64%)
Query: 88 RGDIVCYNCNGEGHISSQCTQPKKVR 113
RG + C+NC GH+ +C P++++
Sbjct: 387 RGPLKCFNCGKFGHMQRECKAPRQIK 412
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.320 0.137 0.414
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 48,298,708
Number of Sequences: 164201
Number of extensions: 2146445
Number of successful extensions: 6227
Number of sequences better than 10.0: 165
Number of HSP's better than 10.0 without gapping: 102
Number of HSP's successfully gapped in prelim test: 64
Number of HSP's that attempted gapping in prelim test: 5587
Number of HSP's gapped (non-prelim): 421
length of query: 394
length of database: 59,974,054
effective HSP length: 112
effective length of query: 282
effective length of database: 41,583,542
effective search space: 11726558844
effective search space used: 11726558844
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 67 (30.4 bits)
Medicago: description of AC130806.14