
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC135795.5 + phase: 0
(471 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
gb|AAM90909.1| p44/SSL1-like protein [Arabidopsis thaliana] gi|3... 577 e-163
emb|CAE04735.1| OSJNBa0043L24.23 [Oryza sativa (japonica cultiva... 562 e-159
ref|XP_394997.2| PREDICTED: similar to ENSANGP00000016260 [Apis ... 296 9e-79
ref|XP_535266.1| PREDICTED: similar to transcription factor BTF2... 294 5e-78
ref|NP_001506.1| general transcription factor IIH, polypeptide 2... 290 7e-77
gb|AAH64557.1| General transcription factor IIH, polypeptide 2, ... 290 7e-77
ref|XP_645146.1| general transcription factor IIH component [Dic... 290 7e-77
gb|AAH16231.1| Gtf2h2 protein [Mus musculus] 287 6e-76
sp|Q9JIB4|TF2H2_MOUSE TFIIH basal transcription factor complex p... 287 6e-76
ref|XP_215466.2| PREDICTED: similar to general transcription fac... 286 1e-75
ref|NP_071294.2| general transcription factor II H, polypeptide ... 283 8e-75
gb|AAH71091.1| MGC81060 protein [Xenopus laevis] 282 1e-74
ref|NP_001011081.1| hypothetical protein LOC496493 [Xenopus trop... 282 2e-74
gb|AAH45397.1| General transcription factor IIH, polypeptide 2 [... 281 3e-74
emb|CAA20673.1| SPCC1682.07 [Schizosaccharomyces pombe] gi|19075... 281 4e-74
dbj|BAA31745.1| SSL1 [Schizosaccharomyces pombe] 281 4e-74
ref|XP_424965.1| PREDICTED: similar to TFIIH basal transcription... 280 5e-74
emb|CAF92597.1| unnamed protein product [Tetraodon nigroviridis] 280 7e-74
ref|XP_314533.2| ENSANGP00000016260 [Anopheles gambiae str. PEST... 276 1e-72
gb|AAF51879.2| CG11115-PA [Drosophila melanogaster] gi|21356299|... 271 4e-71
>gb|AAM90909.1| p44/SSL1-like protein [Arabidopsis thaliana]
gi|30679101|ref|NP_683275.2| basic transcription factor
2, 44kD subunit-related [Arabidopsis thaliana]
gi|4056421|gb|AAC97995.1| Similar to gb|Z30094 basic
transcripion factor 2, 44 kD subunit from Homo sapiens.
EST gb|W43325 comes from this gene. [Arabidopsis
thaliana] gi|25347846|pir||E86184 hypothetical protein
[imported] - Arabidopsis thaliana
Length = 421
Score = 577 bits (1488), Expect = e-163
Identities = 273/360 (75%), Positives = 315/360 (86%), Gaps = 1/360 (0%)
Query: 14 EDDDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASN 73
E ++ED+ + +G+ WERAY +DRSWE LQEDESGLLRPID +AI+HAQYRRRLR L++
Sbjct: 11 EREEEDDEDAEGIGEWERAYVDDRSWEELQEDESGLLRPIDNSAIYHAQYRRRLRMLSAA 70
Query: 74 AATARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLV 133
AA RIQKGLIRYLYIV+D S+AA+E DFRPSRMA++AK VE FIREFFDQNPLS +GLV
Sbjct: 71 AAGTRIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAIMAKHVEAFIREFFDQNPLSQIGLV 130
Query: 134 TTKDGVANCLTDLGGSPESHIKALMGKLECSGDASLQNALELVHSNLNQIPSYGHREVLI 193
+ K+GVA+ LTDLGGSPE+HIKALMGKLE GD+SLQNALELVH +LNQ+PSYGHREVLI
Sbjct: 131 SIKNGVAHTLTDLGGSPETHIKALMGKLEALGDSSLQNALELVHEHLNQVPSYGHREVLI 190
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
LYSAL TCDPGD+METIQKCKKSK+RCSVIGL+AEMFICKHLCQETGG YSVA+DE H K
Sbjct: 191 LYSALCTCDPGDIMETIQKCKKSKLRCSVIGLSAEMFICKHLCQETGGLYSVAVDEVHLK 250
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRAAEGSVAICTCHEEAKTGGGYTCPRCKVRVC 313
+L+LEH+PPPPAIAE+A ANLIKMGFPQRAAEGS+AIC+CH+E K G GY CPRCK RVC
Sbjct: 251 DLLLEHAPPPPAIAEFAIANLIKMGFPQRAAEGSMAICSCHKEVKIGAGYMCPRCKARVC 310
Query: 314 ELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEI-SPSSQNDPNHSFPNTCFGCQQSLL 372
+LPTEC +CGLTL+SSPHLARSYHHLFPI PF E+ + SS ND +CFGCQQSL+
Sbjct: 311 DLPTECTICGLTLVSSPHLARSYHHLFPIAPFDEVPALSSLNDNRRKLGKSCFGCQQSLI 370
>emb|CAE04735.1| OSJNBa0043L24.23 [Oryza sativa (japonica cultivar-group)]
gi|50926354|ref|XP_473124.1| OSJNBa0043L24.23 [Oryza
sativa (japonica cultivar-group)]
Length = 432
Score = 562 bits (1448), Expect = e-159
Identities = 268/373 (71%), Positives = 316/373 (83%), Gaps = 5/373 (1%)
Query: 14 EDDDEDEANDDG----LEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRA 69
EDDD++E + G LEAWERAY +DRSWE+LQEDESGLLRPIDT + HAQYRRRL
Sbjct: 25 EDDDDEEEEESGEGRVLEAWERAYADDRSWEALQEDESGLLRPIDTKTLVHAQYRRRLLL 84
Query: 70 LASNAATARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSH 129
++ +A ARIQKGLIRYLYIV+DLS+AASE D+RPSRMAV+AK E+FIREFFDQNPLSH
Sbjct: 85 RSAASAAARIQKGLIRYLYIVIDLSRAASEMDYRPSRMAVVAKYAEVFIREFFDQNPLSH 144
Query: 130 VGLVTTKDGVANCLTDLGGSPESHIKALMGKLECSGDASLQNALELVHSNLNQIPSYGHR 189
VG+VT KDG+++ LT++GGSPES IKALMGKLECSG+ SLQNALELVH L+Q+PSYGH+
Sbjct: 145 VGIVTMKDGISHRLTEIGGSPESQIKALMGKLECSGEPSLQNALELVHGYLDQVPSYGHK 204
Query: 190 EVLILYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDE 249
EVL LYSAL+TCDPGD+METI KCKKSKIRCSVIGLAAE+FICK+LC+ETGG+Y+VALDE
Sbjct: 205 EVLFLYSALNTCDPGDIMETIAKCKKSKIRCSVIGLAAEIFICKYLCEETGGSYTVALDE 264
Query: 250 SHFKELILEHSPPPPAIAEYATANLIKMGFPQRAAEGSVAICTCHEEAKTGG-GYTCPRC 308
SHFKEL+LEH+PPPPAIAEYA ANLIKMGFPQR AE ++IC+CH++ K+G GY CPRC
Sbjct: 265 SHFKELLLEHAPPPPAIAEYAAANLIKMGFPQRGAEDLISICSCHKKIKSGAEGYICPRC 324
Query: 309 KVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPNTCFGCQ 368
KV VCELPTECR CGLTL+SSPHLARSYHHLFP+ PF E+S N C+GCQ
Sbjct: 325 KVNVCELPTECRTCGLTLVSSPHLARSYHHLFPVQPFDEVSSVHPNRLGQKGGQKCYGCQ 384
Query: 369 QSLLVLEQETRLN 381
QS + + ++ L+
Sbjct: 385 QSFINPDSQSSLH 397
>ref|XP_394997.2| PREDICTED: similar to ENSANGP00000016260 [Apis mellifera]
Length = 405
Score = 296 bits (758), Expect = 9e-79
Identities = 154/366 (42%), Positives = 231/366 (63%), Gaps = 27/366 (7%)
Query: 17 DEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAAT 76
DE+E + WE Y +++WE+++ED+ GLL IH+ + +R++
Sbjct: 3 DEEEEKE---YRWETGY--EKTWEAIKEDDHGLLEASVADIIHNVKRKRQMEKKIG---- 53
Query: 77 ARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTK 136
AR+ G++R+LY+++D S++ S +D +P+R K +E FI EFF QNP+S +G++ T+
Sbjct: 54 ARL--GMMRHLYVILDASESMSIQDLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITR 111
Query: 137 DGVANCLTDLGGSPESHIKAL--MGKLECSGDASLQNALELVHSNLNQIPSYGHREVLIL 194
+ A +++L G+ + HIK + M ++ +G+ SLQN++EL +L +PS+ +E+LI+
Sbjct: 112 NKRAEKVSELTGNSKKHIKEVQSMQQITPAGEPSLQNSIELALKSLRLLPSHASKEILII 171
Query: 195 YSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFKE 254
AL+TCDPGD+ ETI+ K +RCSVIGLAAE++ICK + TGG + VALD+ H+KE
Sbjct: 172 VGALTTCDPGDINETIKNMKLDSVRCSVIGLAAELYICKRMATATGGEHGVALDDKHYKE 231
Query: 255 LILEHSPPPPAIAEYATANLIKMGFPQRAAEGS-----VAICTCHEEAK------TGGGY 303
+ H PPPA A A L+KMGFP A S +A+C CH E+ GY
Sbjct: 232 QLNMHIDPPPA-ATRLDAALVKMGFPHHALHSSTNDSAMAVCMCHAESSDESVKLLSTGY 290
Query: 304 TCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPNT 363
CP+C + CELP ECR CGLTL+S+PHLARSYH+LFP+ PF E+ + H P+
Sbjct: 291 LCPQCLSKHCELPVECRACGLTLVSAPHLARSYHYLFPVEPFKEVEYRKEVTFEH--PSI 348
Query: 364 CFGCQQ 369
C+GCQ+
Sbjct: 349 CYGCQK 354
>ref|XP_535266.1| PREDICTED: similar to transcription factor BTF2 chain p44 - human
[Canis familiaris]
Length = 406
Score = 294 bits (752), Expect = 5e-78
Identities = 154/369 (41%), Positives = 226/369 (60%), Gaps = 31/369 (8%)
Query: 16 DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
D+E E + WE Y +R+WE L+EDESG L+ + A+ +R
Sbjct: 2 DEEPERT----KRWEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------ 49
Query: 76 TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
+++ G++R+LY+VVD S+ ++D +P+R+ K +E F+ E+FDQNP+S +G++ T
Sbjct: 50 HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109
Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
K A LT+L G+P HI +L ++ C G+ SL N+L + L +P + REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
++S+L+TCDP ++ + I+ K +KIR SVIGL+AE+ +C L +ETGGTY V LDESH+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDESHYK 229
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA--------AEGSVAICTCHEEAKTG---GG 302
EL+ H PPPA + + +LI+MGFPQ A+ S ++ + G GG
Sbjct: 230 ELLTHHVSPPPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGG 288
Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
Y CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+ PF EI H+
Sbjct: 289 YFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDPFQEIPLE-----EHNGER 343
Query: 363 TCFGCQQSL 371
C+GCQ L
Sbjct: 344 FCYGCQGEL 352
>ref|NP_001506.1| general transcription factor IIH, polypeptide 2, 44kD subunit [Homo
sapiens] gi|17380326|sp|Q13888|TF2H2_HUMAN TFIIH basal
transcription factor complex p44 subunit (Basic
transcription factor 2 44 kDa subunit) (BTF2-p44)
(General transcription factor IIH polypeptide 2)
gi|5531809|gb|AAD44479.1| basic transcription factor 2
[Homo sapiens] gi|496609|emb|CAA82910.1| basic
transcripion factor 2, 44 kD subunit [Homo sapiens]
Length = 395
Score = 290 bits (742), Expect = 7e-77
Identities = 153/369 (41%), Positives = 222/369 (59%), Gaps = 31/369 (8%)
Query: 16 DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
D+E E + WE Y +R+WE L+EDESG L+ + A+ +R
Sbjct: 2 DEEPERT----KRWEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------ 49
Query: 76 TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
+++ G++R+LY+VVD S+ ++D +P+R+ K +E F+ E+FDQNP+S +G++ T
Sbjct: 50 HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109
Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
K A LT+L G+P HI +L ++ C G+ SL N+L + L +P + REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSIAMQTLKHMPGHTSREVLI 169
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
++S+L+TCDP ++ + I+ K +KIR SVIGL+AE+ +C L +ETGGTY V LDESH+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDESHYK 229
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH-----EEAKTGGG 302
EL+ H PPPA + + +LI+MGFPQ + + H E T GG
Sbjct: 230 ELLTHHVSPPPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGG 288
Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
Y CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+ F EI N
Sbjct: 289 YFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLEEYNGERF---- 344
Query: 363 TCFGCQQSL 371
C+GCQ L
Sbjct: 345 -CYGCQGEL 352
>gb|AAH64557.1| General transcription factor IIH, polypeptide 2, 44kD subunit [Homo
sapiens] gi|40674450|gb|AAH65021.1| GTF2H2 protein [Homo
sapiens]
Length = 395
Score = 290 bits (742), Expect = 7e-77
Identities = 153/369 (41%), Positives = 222/369 (59%), Gaps = 31/369 (8%)
Query: 16 DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
D+E E + WE Y +R+WE L+EDESG L+ + A+ +R
Sbjct: 2 DEEPERT----KRWEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------ 49
Query: 76 TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
+++ G++R+LY+VVD S+ ++D +P+R+ K +E F+ E+FDQNP+S +G++ T
Sbjct: 50 HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109
Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
K A LT+L G+P HI +L ++ C G+ SL N+L + L +P + REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKEAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
++S+L+TCDP ++ + I+ K +KIR SVIGL+AE+ +C L +ETGGTY V LDESH+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDESHYK 229
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH-----EEAKTGGG 302
EL+ H PPPA + + +LI+MGFPQ + + H E T GG
Sbjct: 230 ELLTHHLSPPPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGG 288
Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
Y CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+ F EI N
Sbjct: 289 YFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLEEYNGERF---- 344
Query: 363 TCFGCQQSL 371
C+GCQ L
Sbjct: 345 -CYGCQGEL 352
>ref|XP_645146.1| general transcription factor IIH component [Dictyostelium
discoideum] gi|42733664|gb|AAO50835.2| similar to Homo
sapiens (Human). TFIIH basal transcription factor
complex p44 subunit (Basic transcription factor 2 44 kDa
subunit) (BTF2-p44) (General transcription factor IIH
polypeptide 2) [Dictyostelium discoideum]
gi|60473387|gb|EAL71333.1| TFIIH subunit [Dictyostelium
discoideum]
Length = 461
Score = 290 bits (742), Expect = 7e-77
Identities = 155/378 (41%), Positives = 229/378 (60%), Gaps = 19/378 (5%)
Query: 2 NNIAGKPLNGDLEDDDEDEAN------DDGLEAWERAYTEDRSWESLQEDESGLLRPIDT 55
NN K N L DD++ A+ +DG ++ +++W ++ EDE GL RP +
Sbjct: 8 NNAQNKRTNRSLYDDEDGPAHVLQTNDEDGTNKYKWENRFEKTWLTIDEDEHGL-RPSNQ 66
Query: 56 TAIHHAQYRRRLRALASNAATA---RIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAK 112
RRL+ + + R+++G+ R+L +++DLSK S +D +PSR V+ +
Sbjct: 67 E--ERNTRNRRLKNKDRDGILSQDQRVRRGMQRHLCLILDLSKTLSNQDLKPSRYQVLLQ 124
Query: 113 QVELFIREFFDQNPLSHVGLVTTKDGVANCLTDLGGSPESHIKALMGKLECSGDASLQNA 172
VELFI+EFFDQNP+S + ++ TK+ A +++L G+ HI+A+ + G+ S+QN+
Sbjct: 125 NVELFIKEFFDQNPISQLSIIITKNSKAEKISELSGNRLRHIQAMKDAIAMEGEPSIQNS 184
Query: 173 LELVHSNLNQIPSYGHREVLILYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFIC 232
LE+ S+L +P YG REVL ++S+L+TCDP L +TIQ K IR S I +AAE++IC
Sbjct: 185 LEVALSSLCYVPKYGSREVLFIFSSLTTCDPSSLQKTIQSLKNESIRVSFIHMAAELYIC 244
Query: 233 KHLCQETGGTYSVALDESHFKELILEHSPPPPAIAEYATANLIKMGFPQRAAEGSVAICT 292
K + ++T GT V L+E HF E ++ PPP I + A L++MGFPQ+ + C
Sbjct: 245 KAIAEQTNGTSKVILNEEHFNESLMLKCQPPPTIGK-TEAALVEMGFPQQITSTVPSPCI 303
Query: 293 CHEEAKTGGGYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSS 352
CHE+ K GY CPRC V+ CELPT+C++C L+L+SSPHLARSYHHLF I F E++
Sbjct: 304 CHEKMKY-SGYICPRCGVKSCELPTDCQICNLSLVSSPHLARSYHHLFQIPLFNEVNWKE 362
Query: 353 QNDPNHSFPNTCFGCQQS 370
N TC GC S
Sbjct: 363 LNK-----NVTCIGCLSS 375
>gb|AAH16231.1| Gtf2h2 protein [Mus musculus]
Length = 398
Score = 287 bits (734), Expect = 6e-76
Identities = 151/370 (40%), Positives = 223/370 (59%), Gaps = 32/370 (8%)
Query: 16 DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
D+E E + WE Y +R+WE L+EDE+G L+ + A+ +R
Sbjct: 2 DEEPERT----KRWEGGY--ERTWEILKEDETGSLKATIEDILFKAKRKRVFEH------ 49
Query: 76 TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
+++ G++R+LY+VVD S+ ++D +P+R+ K +E F+ E+FDQNP+S +G++ T
Sbjct: 50 HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109
Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
K A LT+L G+P HI +L ++ C G+ SL N+L + L +P + REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
++S+L+TCDP ++ + I+ K +KIR SVIGL+AE+ +C L +ETGGTY V LDE+H+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKTAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDETHYK 229
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH------EEAKTGG 301
EL+ H PPPA + + +LI+MGFPQ + + H E T G
Sbjct: 230 ELLAHHVSPPPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG 288
Query: 302 GYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFP 361
GY CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+ F EIS +
Sbjct: 289 GYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEISLE-----EYKGE 343
Query: 362 NTCFGCQQSL 371
C+GCQ L
Sbjct: 344 RFCYGCQGEL 353
>sp|Q9JIB4|TF2H2_MOUSE TFIIH basal transcription factor complex p44 subunit (Basic
transcription factor 2 44 kDa subunit) (BTF2-p44)
(General transcription factor IIH polypeptide 2)
gi|9082152|gb|AAF82753.1| general transcription factor
IIH polypeptide 2 [Mus musculus]
Length = 396
Score = 287 bits (734), Expect = 6e-76
Identities = 151/370 (40%), Positives = 223/370 (59%), Gaps = 32/370 (8%)
Query: 16 DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
D+E E + WE Y +R+WE L+EDE+G L+ + A+ +R
Sbjct: 2 DEEPERT----KRWEGGY--ERTWEILKEDETGSLKATIEDILFKAKRKRVFEH------ 49
Query: 76 TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
+++ G++R+LY+VVD S+ ++D +P+R+ K +E F+ E+FDQNP+S +G++ T
Sbjct: 50 HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109
Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
K A LT+L G+P HI +L ++ C G+ SL N+L + L +P + REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
++S+L+TCDP ++ + I+ K +KIR SVIGL+AE+ +C L +ETGGTY V LDE+H+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKTAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDETHYK 229
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH------EEAKTGG 301
EL+ H PPPA + + +LI+MGFPQ + + H E T G
Sbjct: 230 ELLAHHVSPPPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG 288
Query: 302 GYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFP 361
GY CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+ F EIS +
Sbjct: 289 GYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEISLE-----EYKGE 343
Query: 362 NTCFGCQQSL 371
C+GCQ L
Sbjct: 344 RFCYGCQGEL 353
>ref|XP_215466.2| PREDICTED: similar to general transcription factor IIH polypeptide
2 [Rattus norvegicus]
Length = 396
Score = 286 bits (731), Expect = 1e-75
Identities = 151/370 (40%), Positives = 222/370 (59%), Gaps = 32/370 (8%)
Query: 16 DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
D+E E + WE Y +R+WE L+EDESG L+ + A+ +R
Sbjct: 2 DEEPERT----KRWEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------ 49
Query: 76 TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
+++ G++R+LY+VVD S+ ++D +P+R+ K +E F+ E+FDQNP+S +G++ T
Sbjct: 50 HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109
Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
K A LT+L G+P HI +L ++ C G+ SL N+L + L +P + REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
++S+L+TCDP ++ + I+ K +KIR SVIGL+AE+ +C L +ETGGTY V LDE+H+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKTAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDETHYK 229
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH------EEAKTGG 301
EL+ H PPPA + + +LI+MGFPQ + + H E T G
Sbjct: 230 ELLARHVSPPPA-SSGSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG 288
Query: 302 GYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFP 361
GY CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+ F EI +
Sbjct: 289 GYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLE-----EYKGE 343
Query: 362 NTCFGCQQSL 371
C+GCQ L
Sbjct: 344 RFCYGCQGEL 353
>ref|NP_071294.2| general transcription factor II H, polypeptide 2 [Mus musculus]
gi|31418653|gb|AAH53382.1| General transcription factor
II H, polypeptide 2 [Mus musculus]
Length = 396
Score = 283 bits (724), Expect = 8e-75
Identities = 150/370 (40%), Positives = 222/370 (59%), Gaps = 32/370 (8%)
Query: 16 DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
D+E E + WE Y +R+WE L+EDE+G L+ + A+ +R
Sbjct: 2 DEEPERT----KRWEGGY--ERTWEILKEDETGSLKATIEDILFKAKRKRVFEH------ 49
Query: 76 TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
+++ G++R+LY+VVD S+ ++D +P+R+ K +E F+ E+FDQNP+S +G++ T
Sbjct: 50 HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109
Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
K A LT+L G+P HI +L ++ C G+ SL N+L + L +P + REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
++S+L+TCDP ++ + I+ K +KIR SVIGL+AE+ +C L +ETGGTY V LDE+H+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKTAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDETHYK 229
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH------EEAKTGG 301
EL+ H P PA + + +LI+MGFPQ + + H E T G
Sbjct: 230 ELLAHHVSPLPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG 288
Query: 302 GYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFP 361
GY CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+ F EIS +
Sbjct: 289 GYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEISLE-----EYKGE 343
Query: 362 NTCFGCQQSL 371
C+GCQ L
Sbjct: 344 RFCYGCQGEL 353
>gb|AAH71091.1| MGC81060 protein [Xenopus laevis]
Length = 395
Score = 282 bits (722), Expect = 1e-74
Identities = 149/369 (40%), Positives = 225/369 (60%), Gaps = 31/369 (8%)
Query: 16 DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
D+E E + WE Y +R+WE L+EDESG L+ I ++ + + + N
Sbjct: 2 DEEPEKT----KRWEGGY--ERTWEVLKEDESGSLK----ATIDEILFKDKRKRIFENRG 51
Query: 76 TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
R+ G++R+LY++VD S+ ++D +P+R+ K +E F+ E+FDQNP+S +GL+ T
Sbjct: 52 QVRL--GMMRHLYVIVDSSRTMEDQDLKPNRLTCTLKLLEFFVEEYFDQNPISQIGLIVT 109
Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
++ A LT+L G+P HI A+ ++ CSG+ SL N+L L L +P + RE+L+
Sbjct: 110 RNKRAEKLTELAGNPRQHINAMKKAVDMTCSGEPSLYNSLNLALQTLKHMPGHTSREILV 169
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
++S+L+TCDP ++ + I+ K SK+R SVIGL+AE+ +C L +ETGG Y V LDESH+K
Sbjct: 170 IFSSLTTCDPTNIYDMIKCLKASKVRVSVIGLSAEVRVCTVLTRETGGVYHVILDESHYK 229
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA--------AEGSVAICTCHEEAKTG---GG 302
EL++ H PPPA + + +LI+MGFPQ A+ S ++ ++ G GG
Sbjct: 230 ELLMHHVIPPPA-SSSSECSLIRMGFPQHTMGCLSDQDAKPSFSMAHLDNTSEPGLTLGG 288
Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
Y CP+CK + ELP EC+VC LTL+S+PHLARSYHHLFP+ F E+ + +
Sbjct: 289 YFCPQCKAKYSELPVECKVCRLTLVSAPHLARSYHHLFPLDAFKEVRLEEYDGERY---- 344
Query: 363 TCFGCQQSL 371
C GC L
Sbjct: 345 -CRGCDGEL 352
>ref|NP_001011081.1| hypothetical protein LOC496493 [Xenopus tropicalis]
gi|54038231|gb|AAH84471.1| Hypothetical LOC496493
[Xenopus tropicalis]
Length = 393
Score = 282 bits (721), Expect = 2e-74
Identities = 149/369 (40%), Positives = 226/369 (60%), Gaps = 31/369 (8%)
Query: 16 DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
D+E E + WE Y +R+WE L+EDESG L+ + I ++ + + + N
Sbjct: 2 DEEPEKT----KRWEGGY--ERTWEVLKEDESGSLK----STIDEILFKDKRKRIFENRG 51
Query: 76 TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
R+ G++R+LY++VD S+ ++D +P+R+ K +E F+ E+FDQNP+S +GL+ T
Sbjct: 52 QVRL--GMMRHLYVIVDSSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGLIVT 109
Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
++ A LT+L G+P H+ AL ++ C+G+ SL N+L L L +P + RE+L+
Sbjct: 110 RNKRAEKLTELAGNPRQHLNALKKAVDMNCNGEPSLYNSLNLALQTLKHMPGHTSREILV 169
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
++S+L+TCDP ++ + I+ K SKIR SVIGL+AE+ +C L +ETGG Y V LDESH+K
Sbjct: 170 IFSSLTTCDPTNIYDIIKCLKASKIRVSVIGLSAEVRVCTVLTRETGGVYHVILDESHYK 229
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA--------AEGSVAICTCHEEAKTG---GG 302
EL++ H PPPA + + +LI+MGFPQ A+ S ++ ++ G GG
Sbjct: 230 ELLMHHVIPPPA-SSTSECSLIRMGFPQHTMGCLSDQDAKPSFSMAHLDNTSEPGLTLGG 288
Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
Y CP+C+ + ELP EC+VC LTL+S+PHLARSYHHLFP+ F E+S +
Sbjct: 289 YFCPQCRAKYSELPVECKVCRLTLVSAPHLARSYHHLFPLDAFKEVSLEEYEGERY---- 344
Query: 363 TCFGCQQSL 371
C GC L
Sbjct: 345 -CRGCDGEL 352
>gb|AAH45397.1| General transcription factor IIH, polypeptide 2 [Danio rerio]
gi|42415511|ref|NP_963875.1| general transcription
factor IIH, polypeptide 2 [Danio rerio]
Length = 392
Score = 281 bits (719), Expect = 3e-74
Identities = 148/345 (42%), Positives = 224/345 (64%), Gaps = 26/345 (7%)
Query: 16 DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
D+E E + WE Y +R+WE L+EDESG L+ + A +R R S+
Sbjct: 2 DEEPERT----KRWEGGY--ERTWEVLKEDESGSLKATVEDILFQA---KRKRVFESHG- 51
Query: 76 TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
+++ G++R+L++++D S++ ++D +P+R+ K +E F+ E+FDQNP+S +G++TT
Sbjct: 52 --QVRLGMMRHLFVIIDSSRSMEDQDLKPNRLTSTLKLMEHFVEEYFDQNPISQIGIITT 109
Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
K+ A LTDL G+P+ HI AL ++ C G+ SL N+L + L +P++ REVL+
Sbjct: 110 KNKRAEKLTDLAGNPKKHITALRKAVDSTCVGEPSLYNSLNMALQTLKHMPAHTSREVLV 169
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
++S+L+TCDPG++ E I+ KIR SVIGL+AE+ +C L +ETGG+Y+V LDESHFK
Sbjct: 170 IFSSLTTCDPGNIYELIKTLNGLKIRVSVIGLSAEVRVCTILTRETGGSYNVILDESHFK 229
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA--------AEGSVAIC---TCHEEAKTGGG 302
EL+L H PPPA + + +LI+MGFPQ A+ S ++ + E + GG
Sbjct: 230 ELLLLHVKPPPA-SSSSECSLIRMGFPQHVIASLSDQDAKPSFSMAHLDSSSEPELSLGG 288
Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAE 347
Y CP+C+ + ELP EC+VCGLTL+S+PHLARS+HHLFP+ F E
Sbjct: 289 YYCPQCRAKYTELPVECKVCGLTLVSAPHLARSFHHLFPLEAFQE 333
>emb|CAA20673.1| SPCC1682.07 [Schizosaccharomyces pombe]
gi|19075300|ref|NP_587800.1| hypothetical protein
SPCC1682.07 [Schizosaccharomyces pombe 972h-]
gi|26400388|sp|O74995|TFH47_SCHPO TFIIH basal
transcription factor complex p47 subunit (Suppressor of
stem-loop protein 1 homolog) (SSL1 homolog)
gi|3406059|gb|AAC29144.1| TFIIH subunit p47
[Schizosaccharomyces pombe]
Length = 421
Score = 281 bits (718), Expect = 4e-74
Identities = 147/354 (41%), Positives = 213/354 (59%), Gaps = 19/354 (5%)
Query: 20 EANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAATARI 79
+ +D+ WE Y RSW+ +QED G L + I + +R LR T +
Sbjct: 30 KTDDNEGYTWEGEY--QRSWDIVQEDAEGSLVGVIAGLIQSGKRKRLLRD------TTPL 81
Query: 80 QKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKDGV 139
Q+G+IR++ +V+DLS + ERDF R + K F+ EFF+QNP+S + ++ DG+
Sbjct: 82 QRGIIRHMVLVLDLSNSMEERDFHHKRFDLQIKYASEFVLEFFEQNPISQLSIIGVMDGI 141
Query: 140 ANCLTDLGGSPESHIKALMGKLECSGDASLQNALELVHSNLNQIPSYGHREVLILYSALS 199
A+ +TDL G+P+SHI+ L +CSG+ SLQNALE+ ++L+ I S+G REVLI++ ++
Sbjct: 142 AHRITDLHGNPQSHIQKLKSLRDCSGNFSLQNALEMARASLSHIASHGTREVLIIFGSIL 201
Query: 200 TCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGT----YSVALDESHFKEL 255
+ DPGD+ +TI IR ++GLAAE+ ICK +C +T + Y V + E HF+EL
Sbjct: 202 SSDPGDIFKTIDALVHDSIRVRIVGLAAEVAICKEICNKTNSSTKNAYGVVISEQHFREL 261
Query: 256 ILEHS-PPPPAIAEYATANLIKMGFPQRAAEGSVAICTCHEEAKTGGGYTCPRCKVRVCE 314
+LE + PP A+ A+L+ MGFP + E ++C CH + GG+ CPRCK +VC
Sbjct: 262 LLESTIPPATDSAKTTDASLVMMGFPSKVVEQLPSLCACH-SIPSRGGFHCPRCKAKVCT 320
Query: 315 LPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPNTCFGCQ 368
LP EC C L LI S HLARSYHHLFP+ ++EI S+ H CF CQ
Sbjct: 321 LPIECPSCSLVLILSTHLARSYHHLFPLKNWSEIPWSANPKSTH-----CFACQ 369
>dbj|BAA31745.1| SSL1 [Schizosaccharomyces pombe]
Length = 392
Score = 281 bits (718), Expect = 4e-74
Identities = 147/354 (41%), Positives = 213/354 (59%), Gaps = 19/354 (5%)
Query: 20 EANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAATARI 79
+ +D+ WE Y RSW+ +QED G L + I + +R LR T +
Sbjct: 1 KTDDNEGYTWEGEY--QRSWDIVQEDAEGSLVGVIAGLIQSGKRKRLLRD------TTPL 52
Query: 80 QKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKDGV 139
Q+G+IR++ +V+DLS + ERDF R + K F+ EFF+QNP+S + ++ DG+
Sbjct: 53 QRGIIRHMVLVLDLSNSMEERDFHHKRFDLQIKYASEFVLEFFEQNPISQLSIIGVMDGI 112
Query: 140 ANCLTDLGGSPESHIKALMGKLECSGDASLQNALELVHSNLNQIPSYGHREVLILYSALS 199
A+ +TDL G+P+SHI+ L +CSG+ SLQNALE+ ++L+ I S+G REVLI++ ++
Sbjct: 113 AHRITDLHGNPQSHIQKLKSLRDCSGNFSLQNALEMARASLSHIASHGTREVLIIFGSIL 172
Query: 200 TCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGT----YSVALDESHFKEL 255
+ DPGD+ +TI IR ++GLAAE+ ICK +C +T + Y V + E HF+EL
Sbjct: 173 SSDPGDIFKTIDALVHDSIRVRIVGLAAEVAICKEICNKTNSSTKNAYGVVISEQHFREL 232
Query: 256 ILEHS-PPPPAIAEYATANLIKMGFPQRAAEGSVAICTCHEEAKTGGGYTCPRCKVRVCE 314
+LE + PP A+ A+L+ MGFP + E ++C CH + GG+ CPRCK +VC
Sbjct: 233 LLESTIPPATDSAKTTDASLVMMGFPSKVVEQLPSLCACH-SIPSRGGFHCPRCKAKVCT 291
Query: 315 LPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPNTCFGCQ 368
LP EC C L LI S HLARSYHHLFP+ ++EI S+ H CF CQ
Sbjct: 292 LPIECPSCSLVLILSTHLARSYHHLFPLKNWSEIPWSANPKSTH-----CFACQ 340
>ref|XP_424965.1| PREDICTED: similar to TFIIH basal transcription factor complex p44
subunit (Basic transcription factor 2 44 kDa subunit)
(BTF2-p44) (General transcription factor IIH polypeptide
2) [Gallus gallus]
Length = 985
Score = 280 bits (717), Expect = 5e-74
Identities = 152/369 (41%), Positives = 223/369 (60%), Gaps = 31/369 (8%)
Query: 16 DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
DDE E + WE Y +R+WE L+EDESG L+ I ++ + + + +
Sbjct: 592 DDEPERT----KRWEGGY--ERTWEILKEDESGSLK----ATIDDILFKAKRKRIYEHHG 641
Query: 76 TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
R+ G++R+LY+VVD S+ ++D +P+R+ K +E F+ E+FDQNP+S +GL+ T
Sbjct: 642 QVRL--GMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQMGLIVT 699
Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
K A +T+L G+P+ HI AL ++ C G+ SL N+L L L +P + REVLI
Sbjct: 700 KSKRAEKMTELSGNPKKHIAALKKAVDMNCQGEPSLYNSLNLAMQTLKHMPGHTSREVLI 759
Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
++S+L+TCDP ++ + I+ K KIR SVIGL+AE+ +C L +ETGGTY V LDE+H+K
Sbjct: 760 VFSSLTTCDPANIYDLIKCLKAVKIRVSVIGLSAEVRVCTVLTRETGGTYHVILDETHYK 819
Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA--------AEGSVAICTCH---EEAKTGGG 302
EL++ H PPPA + + +LI+MGFPQ A+ S ++ E T G
Sbjct: 820 ELLMHHVSPPPA-SSTSECSLIRMGFPQHTTASLSDQDAKPSFSMAQLENNSEPCLTLDG 878
Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
Y CP+C+ + ELP EC++CGLTL+S+PHLARSYHHLFP+ F EI +
Sbjct: 879 YFCPQCRAKYSELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLEEYQGERY---- 934
Query: 363 TCFGCQQSL 371
C GCQ +
Sbjct: 935 -CQGCQAEI 942
>emb|CAF92597.1| unnamed protein product [Tetraodon nigroviridis]
Length = 395
Score = 280 bits (716), Expect = 7e-74
Identities = 150/353 (42%), Positives = 218/353 (61%), Gaps = 26/353 (7%)
Query: 29 WERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAATARIQKGLIRYLY 88
WE Y +R+WE L+EDESG L+ + A+ RR +++ +++ G++R+LY
Sbjct: 12 WEGGY--ERTWEVLKEDESGSLKASVEEILFQAKKRRLVQS------HGQVRLGMMRHLY 63
Query: 89 IVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKDGVANCLTDLGG 148
+V+D S++ ++D +P+R+ K +E F+ E+FDQNP+S +G++TTK+ A LTDL G
Sbjct: 64 VVIDCSRSMEDQDLKPNRLTSTLKLMEGFVEEYFDQNPISQMGIITTKNKRAEKLTDLAG 123
Query: 149 SPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLILYSALSTCDPGDL 206
+P+ H AL ++ C G+ SL N L L L +P + REVLI+ S+L+TCDPG++
Sbjct: 124 NPKKHAAALKKAVDSACVGEPSLYNCLSLALQTLRHMPGHTSREVLIILSSLTTCDPGNI 183
Query: 207 METIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFKELILEHSPPPPAI 266
E IQ K K+R SV+GL+AE+ +C L +ETGG+Y V LDESHFKEL++ H PPPA
Sbjct: 184 YELIQTLKSLKVRVSVVGLSAEVRVCTVLTRETGGSYHVILDESHFKELLMLHVKPPPAS 243
Query: 267 AEYATANLIKM-GFPQRA------AEGSVAICTCHEEAKTG------GGYTCPRCKVRVC 313
+ +LI+M GFPQ + + H E G GGY CP+C +
Sbjct: 244 CS-SECSLIRMAGFPQHTMASLTDQDAKPSFSMAHLEGGGGGPDLSLGGYFCPQCHAKYT 302
Query: 314 ELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPNTCFG 366
ELP EC+VCGLTL+ +PHLARS+HHLFP+ F E S+++ P F C G
Sbjct: 303 ELPVECKVCGLTLVLAPHLARSFHHLFPLQVFPE--SSAEDPPKDRFCQACQG 353
>ref|XP_314533.2| ENSANGP00000016260 [Anopheles gambiae str. PEST]
gi|55240127|gb|EAA09916.3| ENSANGP00000016260 [Anopheles
gambiae str. PEST]
Length = 403
Score = 276 bits (706), Expect = 1e-72
Identities = 148/359 (41%), Positives = 217/359 (60%), Gaps = 26/359 (7%)
Query: 29 WERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAATARIQKGLIRYLY 88
WE Y +++WE+++ED+ GL+ + I A+ +R+ A +++ G++R+LY
Sbjct: 10 WETGY--EKTWEAIKEDDDGLIEGSVSDIIQKAKRKRQ----AMKRGFSKL--GMMRHLY 61
Query: 89 IVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKDGVANCLTDLGG 148
+++D S+A + D +P+R K +ELFI EFFDQNP+S +G++ K A +++LGG
Sbjct: 62 VLLDCSEAMTVPDLKPTRFICSLKLLELFIEEFFDQNPISQLGVIAMKAKRAEKISELGG 121
Query: 149 SPESHIKA---LMGKLECSGDASLQNALELVHSNLNQIPSYGHREVLILYSALSTCDPGD 205
+ HIKA L + G+ SLQN LEL L IP + RE+L++ +L++CDP D
Sbjct: 122 TSRKHIKAVHALTNGVPLVGEPSLQNGLELALKTLRMIPQHASREILVIMGSLTSCDPND 181
Query: 206 LMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFKELILEHSPPPPA 265
+ TI+ K +RCSV+ L+AE+ ICK LC ETGG + LD++H+K+ +L+H PP A
Sbjct: 182 VHLTIENLKTEGVRCSVLSLSAEIRICKFLCTETGGLFGAVLDDAHYKDQLLQHIDPPQA 241
Query: 266 IAEYATANLIKMGFP----QRAAEGSVAICTCH-----EEAK-TGGGYTCPRCKVRVCEL 315
+LIKMGFP + E + +C CH E AK T GGY CP+C + CEL
Sbjct: 242 -GNQQEFSLIKMGFPHGKTESGKEPPLTMCMCHIDSVDEPAKLTSGGYHCPQCYSKYCEL 300
Query: 316 PTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQ---NDPNHSFPNTCFGCQQSL 371
P EC CGLTL S+PHLARSYHHLFP+ F E+ P Q +P C+ CQ++L
Sbjct: 301 PVECSACGLTLASAPHLARSYHHLFPVPHFNEL-PLEQVQVQEPRDPPVTNCYACQKTL 358
>gb|AAF51879.2| CG11115-PA [Drosophila melanogaster] gi|21356299|ref|NP_649427.1|
CG11115-PA [Drosophila melanogaster]
gi|15010516|gb|AAK77306.1| GH08526p [Drosophila
melanogaster]
Length = 438
Score = 271 bits (692), Expect = 4e-71
Identities = 143/371 (38%), Positives = 225/371 (60%), Gaps = 27/371 (7%)
Query: 18 EDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAATA 77
+DE D WE Y +++WE++++DE G+L I A+ +R+ + N
Sbjct: 3 DDEQEDQKEYRWETGY--EKTWEAIKDDEDGMLDGAIAEIIQKAKRQRQAQKSKQN---- 56
Query: 78 RIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKD 137
+ G++R++++V+D S++ S D +P+R+ K +ELFI EFFDQNP+S +GL+ K
Sbjct: 57 --RLGMMRHMFVVLDCSESMSVPDLKPTRLRCTVKLLELFIEEFFDQNPISQLGLIALKA 114
Query: 138 GVANCLTDLGGSPESHIKAL--MGKLECSGDASLQNALELVHSNLNQIPSYGHREVLILY 195
A +T+L G+ H+KAL + + + + SLQN L+L +L +PS+ RE++I+
Sbjct: 115 KRAEKVTELTGTSRVHLKALESLANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIM 174
Query: 196 SALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFKEL 255
+L+TCDP D+ TI + KK IRCSVI L+AE+ + ++L Q+T GT+ LD++HF++
Sbjct: 175 GSLTTCDPVDINLTIDELKKEGIRCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQ 234
Query: 256 ILEHSPPPPAIAEYATANLIKMGFPQ-----RAAEGSVAICTCHEE------AKTGGGYT 304
++ PPPA A+ +LI+MGFP + +++C CH E T G+
Sbjct: 235 LMSQVDPPPA-AKTQHNSLIRMGFPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHH 293
Query: 305 CPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEI----SPSSQNDPNHSF 360
CP+C + CELP EC+ CGLTL+S+PHLARSYHHLFP+ F E+ P+S +D S
Sbjct: 294 CPQCNSKYCELPVECQSCGLTLVSAPHLARSYHHLFPVPNFEELPFEAMPASSSDLT-SD 352
Query: 361 PNTCFGCQQSL 371
C+GC ++L
Sbjct: 353 VRECYGCAKAL 363
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.319 0.135 0.406
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 821,244,024
Number of Sequences: 2540612
Number of extensions: 34336505
Number of successful extensions: 115813
Number of sequences better than 10.0: 119
Number of HSP's better than 10.0 without gapping: 66
Number of HSP's successfully gapped in prelim test: 53
Number of HSP's that attempted gapping in prelim test: 115549
Number of HSP's gapped (non-prelim): 131
length of query: 471
length of database: 863,360,394
effective HSP length: 132
effective length of query: 339
effective length of database: 527,999,610
effective search space: 178991867790
effective search space used: 178991867790
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)
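
The statistics above can be cross-checked from the printed parameters. The sketch below is a minimal illustration, not BLAST's own code. It assumes the usual length correction (the effective HSP length is subtracted from the query and from every database sequence) and the Karlin-Altschul conversion bits = (lambda * S - ln K) / ln 2. Drop-off values (X1, X2, X3) are assumed to convert as lambda * X / ln 2, with X1 on the ungapped lambda and X2, X3 on the gapped one. The function names are hypothetical.

```python
import math


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumes the effective HSP length is removed once from the query and once
    from each database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert a drop-off threshold to bits: lambda * X / ln 2 (no K term)."""
    return lam * raw_dropoff / math.log(2)


# Values copied from the report trailer.
UNGAPPED = (0.319, 0.135)   # Lambda, K
GAPPED = (0.267, 0.0410)    # Lambda, K

eff_q, eff_db, space = effective_search_space(471, 863_360_394, 2_540_612, 132)
print(eff_q, eff_db, space)

print(round(bit_score(41, *UNGAPPED), 1))      # S1
print(round(bit_score(77, *GAPPED), 1))        # S2
print(round(dropoff_bits(16, UNGAPPED[0]), 1)) # X1
print(round(dropoff_bits(38, GAPPED[0]), 1))   # X2
print(round(dropoff_bits(64, GAPPED[0]), 1))   # X3
```

Under these assumptions the results agree with the trailer: 339, 527,999,610 and 178,991,867,790 for the lengths and search space, and the bit values printed beside S1, S2, X1, X2 and X3. Because the report prints Lambda and K rounded to three significant figures, applying the same conversion to a large raw score can differ from the reported bits by about one unit (for example, raw score 1488 gives roughly 577.8 against the reported 577).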
Medicago: description of AC135795.5