Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC135795.5 + phase: 0 
         (471 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM90909.1| p44/SSL1-like protein [Arabidopsis thaliana] gi|3...   577  e-163
emb|CAE04735.1| OSJNBa0043L24.23 [Oryza sativa (japonica cultiva...   562  e-159
ref|XP_394997.2| PREDICTED: similar to ENSANGP00000016260 [Apis ...   296  9e-79
ref|XP_535266.1| PREDICTED: similar to transcription factor BTF2...   294  5e-78
ref|NP_001506.1| general transcription factor IIH, polypeptide 2...   290  7e-77
gb|AAH64557.1| General transcription factor IIH, polypeptide 2, ...   290  7e-77
ref|XP_645146.1| general transcription factor IIH component [Dic...   290  7e-77
gb|AAH16231.1| Gtf2h2 protein [Mus musculus]                          287  6e-76
sp|Q9JIB4|TF2H2_MOUSE TFIIH basal transcription factor complex p...   287  6e-76
ref|XP_215466.2| PREDICTED: similar to general transcription fac...   286  1e-75
ref|NP_071294.2| general transcription factor II H, polypeptide ...   283  8e-75
gb|AAH71091.1| MGC81060 protein [Xenopus laevis]                      282  1e-74
ref|NP_001011081.1| hypothetical protein LOC496493 [Xenopus trop...   282  2e-74
gb|AAH45397.1| General transcription factor IIH, polypeptide 2 [...   281  3e-74
emb|CAA20673.1| SPCC1682.07 [Schizosaccharomyces pombe] gi|19075...   281  4e-74
dbj|BAA31745.1| SSL1 [Schizosaccharomyces pombe]                      281  4e-74
ref|XP_424965.1| PREDICTED: similar to TFIIH basal transcription...   280  5e-74
emb|CAF92597.1| unnamed protein product [Tetraodon nigroviridis]      280  7e-74
ref|XP_314533.2| ENSANGP00000016260 [Anopheles gambiae str. PEST...   276  1e-72
gb|AAF51879.2| CG11115-PA [Drosophila melanogaster] gi|21356299|...   271  4e-71

>gb|AAM90909.1| p44/SSL1-like protein [Arabidopsis thaliana]
           gi|30679101|ref|NP_683275.2| basic transcription factor
           2, 44kD subunit-related [Arabidopsis thaliana]
           gi|4056421|gb|AAC97995.1| Similar to gb|Z30094 basic
           transcripion factor 2, 44 kD subunit from Homo sapiens. 
           EST gb|W43325 comes from this gene. [Arabidopsis
           thaliana] gi|25347846|pir||E86184 hypothetical protein
           [imported] - Arabidopsis thaliana
          Length = 421

 Score =  577 bits (1488), Expect = e-163
 Identities = 273/360 (75%), Positives = 315/360 (86%), Gaps = 1/360 (0%)

Query: 14  EDDDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASN 73
           E ++ED+ + +G+  WERAY +DRSWE LQEDESGLLRPID +AI+HAQYRRRLR L++ 
Sbjct: 11  EREEEDDEDAEGIGEWERAYVDDRSWEELQEDESGLLRPIDNSAIYHAQYRRRLRMLSAA 70

Query: 74  AATARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLV 133
           AA  RIQKGLIRYLYIV+D S+AA+E DFRPSRMA++AK VE FIREFFDQNPLS +GLV
Sbjct: 71  AAGTRIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAIMAKHVEAFIREFFDQNPLSQIGLV 130

Query: 134 TTKDGVANCLTDLGGSPESHIKALMGKLECSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           + K+GVA+ LTDLGGSPE+HIKALMGKLE  GD+SLQNALELVH +LNQ+PSYGHREVLI
Sbjct: 131 SIKNGVAHTLTDLGGSPETHIKALMGKLEALGDSSLQNALELVHEHLNQVPSYGHREVLI 190

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           LYSAL TCDPGD+METIQKCKKSK+RCSVIGL+AEMFICKHLCQETGG YSVA+DE H K
Sbjct: 191 LYSALCTCDPGDIMETIQKCKKSKLRCSVIGLSAEMFICKHLCQETGGLYSVAVDEVHLK 250

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRAAEGSVAICTCHEEAKTGGGYTCPRCKVRVC 313
           +L+LEH+PPPPAIAE+A ANLIKMGFPQRAAEGS+AIC+CH+E K G GY CPRCK RVC
Sbjct: 251 DLLLEHAPPPPAIAEFAIANLIKMGFPQRAAEGSMAICSCHKEVKIGAGYMCPRCKARVC 310

Query: 314 ELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEI-SPSSQNDPNHSFPNTCFGCQQSLL 372
           +LPTEC +CGLTL+SSPHLARSYHHLFPI PF E+ + SS ND       +CFGCQQSL+
Sbjct: 311 DLPTECTICGLTLVSSPHLARSYHHLFPIAPFDEVPALSSLNDNRRKLGKSCFGCQQSLI 370


>emb|CAE04735.1| OSJNBa0043L24.23 [Oryza sativa (japonica cultivar-group)]
           gi|50926354|ref|XP_473124.1| OSJNBa0043L24.23 [Oryza
           sativa (japonica cultivar-group)]
          Length = 432

 Score =  562 bits (1448), Expect = e-159
 Identities = 268/373 (71%), Positives = 316/373 (83%), Gaps = 5/373 (1%)

Query: 14  EDDDEDEANDDG----LEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRA 69
           EDDD++E  + G    LEAWERAY +DRSWE+LQEDESGLLRPIDT  + HAQYRRRL  
Sbjct: 25  EDDDDEEEEESGEGRVLEAWERAYADDRSWEALQEDESGLLRPIDTKTLVHAQYRRRLLL 84

Query: 70  LASNAATARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSH 129
            ++ +A ARIQKGLIRYLYIV+DLS+AASE D+RPSRMAV+AK  E+FIREFFDQNPLSH
Sbjct: 85  RSAASAAARIQKGLIRYLYIVIDLSRAASEMDYRPSRMAVVAKYAEVFIREFFDQNPLSH 144

Query: 130 VGLVTTKDGVANCLTDLGGSPESHIKALMGKLECSGDASLQNALELVHSNLNQIPSYGHR 189
           VG+VT KDG+++ LT++GGSPES IKALMGKLECSG+ SLQNALELVH  L+Q+PSYGH+
Sbjct: 145 VGIVTMKDGISHRLTEIGGSPESQIKALMGKLECSGEPSLQNALELVHGYLDQVPSYGHK 204

Query: 190 EVLILYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDE 249
           EVL LYSAL+TCDPGD+METI KCKKSKIRCSVIGLAAE+FICK+LC+ETGG+Y+VALDE
Sbjct: 205 EVLFLYSALNTCDPGDIMETIAKCKKSKIRCSVIGLAAEIFICKYLCEETGGSYTVALDE 264

Query: 250 SHFKELILEHSPPPPAIAEYATANLIKMGFPQRAAEGSVAICTCHEEAKTGG-GYTCPRC 308
           SHFKEL+LEH+PPPPAIAEYA ANLIKMGFPQR AE  ++IC+CH++ K+G  GY CPRC
Sbjct: 265 SHFKELLLEHAPPPPAIAEYAAANLIKMGFPQRGAEDLISICSCHKKIKSGAEGYICPRC 324

Query: 309 KVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPNTCFGCQ 368
           KV VCELPTECR CGLTL+SSPHLARSYHHLFP+ PF E+S    N         C+GCQ
Sbjct: 325 KVNVCELPTECRTCGLTLVSSPHLARSYHHLFPVQPFDEVSSVHPNRLGQKGGQKCYGCQ 384

Query: 369 QSLLVLEQETRLN 381
           QS +  + ++ L+
Sbjct: 385 QSFINPDSQSSLH 397


>ref|XP_394997.2| PREDICTED: similar to ENSANGP00000016260 [Apis mellifera]
          Length = 405

 Score =  296 bits (758), Expect = 9e-79
 Identities = 154/366 (42%), Positives = 231/366 (63%), Gaps = 27/366 (7%)

Query: 17  DEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAAT 76
           DE+E  +     WE  Y  +++WE+++ED+ GLL       IH+ + +R++         
Sbjct: 3   DEEEEKE---YRWETGY--EKTWEAIKEDDHGLLEASVADIIHNVKRKRQMEKKIG---- 53

Query: 77  ARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTK 136
           AR+  G++R+LY+++D S++ S +D +P+R     K +E FI EFF QNP+S +G++ T+
Sbjct: 54  ARL--GMMRHLYVILDASESMSIQDLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITR 111

Query: 137 DGVANCLTDLGGSPESHIKAL--MGKLECSGDASLQNALELVHSNLNQIPSYGHREVLIL 194
           +  A  +++L G+ + HIK +  M ++  +G+ SLQN++EL   +L  +PS+  +E+LI+
Sbjct: 112 NKRAEKVSELTGNSKKHIKEVQSMQQITPAGEPSLQNSIELALKSLRLLPSHASKEILII 171

Query: 195 YSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFKE 254
             AL+TCDPGD+ ETI+  K   +RCSVIGLAAE++ICK +   TGG + VALD+ H+KE
Sbjct: 172 VGALTTCDPGDINETIKNMKLDSVRCSVIGLAAELYICKRMATATGGEHGVALDDKHYKE 231

Query: 255 LILEHSPPPPAIAEYATANLIKMGFPQRAAEGS-----VAICTCHEEAK------TGGGY 303
            +  H  PPPA A    A L+KMGFP  A   S     +A+C CH E+          GY
Sbjct: 232 QLNMHIDPPPA-ATRLDAALVKMGFPHHALHSSTNDSAMAVCMCHAESSDESVKLLSTGY 290

Query: 304 TCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPNT 363
            CP+C  + CELP ECR CGLTL+S+PHLARSYH+LFP+ PF E+    +    H  P+ 
Sbjct: 291 LCPQCLSKHCELPVECRACGLTLVSAPHLARSYHYLFPVEPFKEVEYRKEVTFEH--PSI 348

Query: 364 CFGCQQ 369
           C+GCQ+
Sbjct: 349 CYGCQK 354


>ref|XP_535266.1| PREDICTED: similar to transcription factor BTF2 chain p44 - human
           [Canis familiaris]
          Length = 406

 Score =  294 bits (752), Expect = 5e-78
 Identities = 154/369 (41%), Positives = 226/369 (60%), Gaps = 31/369 (8%)

Query: 16  DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
           D+E E      + WE  Y  +R+WE L+EDESG L+      +  A+ +R          
Sbjct: 2   DEEPERT----KRWEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------ 49

Query: 76  TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
             +++ G++R+LY+VVD S+   ++D +P+R+    K +E F+ E+FDQNP+S +G++ T
Sbjct: 50  HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109

Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           K   A  LT+L G+P  HI +L   ++  C G+ SL N+L +    L  +P +  REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           ++S+L+TCDP ++ + I+  K +KIR SVIGL+AE+ +C  L +ETGGTY V LDESH+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDESHYK 229

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA--------AEGSVAICTCHEEAKTG---GG 302
           EL+  H  PPPA +  +  +LI+MGFPQ          A+ S ++       + G   GG
Sbjct: 230 ELLTHHVSPPPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGG 288

Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
           Y CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+ PF EI         H+   
Sbjct: 289 YFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDPFQEIPLE-----EHNGER 343

Query: 363 TCFGCQQSL 371
            C+GCQ  L
Sbjct: 344 FCYGCQGEL 352


>ref|NP_001506.1| general transcription factor IIH, polypeptide 2, 44kD subunit [Homo
           sapiens] gi|17380326|sp|Q13888|TF2H2_HUMAN TFIIH basal
           transcription factor complex p44 subunit (Basic
           transcription factor 2 44 kDa subunit) (BTF2-p44)
           (General transcription factor IIH polypeptide 2)
           gi|5531809|gb|AAD44479.1| basic transcription factor 2
           [Homo sapiens] gi|496609|emb|CAA82910.1| basic
           transcripion factor 2, 44 kD subunit [Homo sapiens]
          Length = 395

 Score =  290 bits (742), Expect = 7e-77
 Identities = 153/369 (41%), Positives = 222/369 (59%), Gaps = 31/369 (8%)

Query: 16  DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
           D+E E      + WE  Y  +R+WE L+EDESG L+      +  A+ +R          
Sbjct: 2   DEEPERT----KRWEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------ 49

Query: 76  TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
             +++ G++R+LY+VVD S+   ++D +P+R+    K +E F+ E+FDQNP+S +G++ T
Sbjct: 50  HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109

Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           K   A  LT+L G+P  HI +L   ++  C G+ SL N+L +    L  +P +  REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSIAMQTLKHMPGHTSREVLI 169

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           ++S+L+TCDP ++ + I+  K +KIR SVIGL+AE+ +C  L +ETGGTY V LDESH+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDESHYK 229

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH-----EEAKTGGG 302
           EL+  H  PPPA +  +  +LI+MGFPQ         +   +    H     E   T GG
Sbjct: 230 ELLTHHVSPPPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGG 288

Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
           Y CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+  F EI     N        
Sbjct: 289 YFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLEEYNGERF---- 344

Query: 363 TCFGCQQSL 371
            C+GCQ  L
Sbjct: 345 -CYGCQGEL 352


>gb|AAH64557.1| General transcription factor IIH, polypeptide 2, 44kD subunit [Homo
           sapiens] gi|40674450|gb|AAH65021.1| GTF2H2 protein [Homo
           sapiens]
          Length = 395

 Score =  290 bits (742), Expect = 7e-77
 Identities = 153/369 (41%), Positives = 222/369 (59%), Gaps = 31/369 (8%)

Query: 16  DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
           D+E E      + WE  Y  +R+WE L+EDESG L+      +  A+ +R          
Sbjct: 2   DEEPERT----KRWEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------ 49

Query: 76  TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
             +++ G++R+LY+VVD S+   ++D +P+R+    K +E F+ E+FDQNP+S +G++ T
Sbjct: 50  HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109

Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           K   A  LT+L G+P  HI +L   ++  C G+ SL N+L +    L  +P +  REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKEAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           ++S+L+TCDP ++ + I+  K +KIR SVIGL+AE+ +C  L +ETGGTY V LDESH+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDESHYK 229

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH-----EEAKTGGG 302
           EL+  H  PPPA +  +  +LI+MGFPQ         +   +    H     E   T GG
Sbjct: 230 ELLTHHLSPPPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGG 288

Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
           Y CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+  F EI     N        
Sbjct: 289 YFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLEEYNGERF---- 344

Query: 363 TCFGCQQSL 371
            C+GCQ  L
Sbjct: 345 -CYGCQGEL 352


>ref|XP_645146.1| general transcription factor IIH component [Dictyostelium
           discoideum] gi|42733664|gb|AAO50835.2| similar to Homo
           sapiens (Human). TFIIH basal transcription factor
           complex p44 subunit (Basic transcription factor 2 44 kDa
           subunit) (BTF2-p44) (General transcription factor IIH
           polypeptide 2) [Dictyostelium discoideum]
           gi|60473387|gb|EAL71333.1| TFIIH subunit [Dictyostelium
           discoideum]
          Length = 461

 Score =  290 bits (742), Expect = 7e-77
 Identities = 155/378 (41%), Positives = 229/378 (60%), Gaps = 19/378 (5%)

Query: 2   NNIAGKPLNGDLEDDDEDEAN------DDGLEAWERAYTEDRSWESLQEDESGLLRPIDT 55
           NN   K  N  L DD++  A+      +DG   ++     +++W ++ EDE GL RP + 
Sbjct: 8   NNAQNKRTNRSLYDDEDGPAHVLQTNDEDGTNKYKWENRFEKTWLTIDEDEHGL-RPSNQ 66

Query: 56  TAIHHAQYRRRLRALASNAATA---RIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAK 112
                    RRL+    +   +   R+++G+ R+L +++DLSK  S +D +PSR  V+ +
Sbjct: 67  E--ERNTRNRRLKNKDRDGILSQDQRVRRGMQRHLCLILDLSKTLSNQDLKPSRYQVLLQ 124

Query: 113 QVELFIREFFDQNPLSHVGLVTTKDGVANCLTDLGGSPESHIKALMGKLECSGDASLQNA 172
            VELFI+EFFDQNP+S + ++ TK+  A  +++L G+   HI+A+   +   G+ S+QN+
Sbjct: 125 NVELFIKEFFDQNPISQLSIIITKNSKAEKISELSGNRLRHIQAMKDAIAMEGEPSIQNS 184

Query: 173 LELVHSNLNQIPSYGHREVLILYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFIC 232
           LE+  S+L  +P YG REVL ++S+L+TCDP  L +TIQ  K   IR S I +AAE++IC
Sbjct: 185 LEVALSSLCYVPKYGSREVLFIFSSLTTCDPSSLQKTIQSLKNESIRVSFIHMAAELYIC 244

Query: 233 KHLCQETGGTYSVALDESHFKELILEHSPPPPAIAEYATANLIKMGFPQRAAEGSVAICT 292
           K + ++T GT  V L+E HF E ++    PPP I +   A L++MGFPQ+      + C 
Sbjct: 245 KAIAEQTNGTSKVILNEEHFNESLMLKCQPPPTIGK-TEAALVEMGFPQQITSTVPSPCI 303

Query: 293 CHEEAKTGGGYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSS 352
           CHE+ K   GY CPRC V+ CELPT+C++C L+L+SSPHLARSYHHLF I  F E++   
Sbjct: 304 CHEKMKY-SGYICPRCGVKSCELPTDCQICNLSLVSSPHLARSYHHLFQIPLFNEVNWKE 362

Query: 353 QNDPNHSFPNTCFGCQQS 370
            N        TC GC  S
Sbjct: 363 LNK-----NVTCIGCLSS 375


>gb|AAH16231.1| Gtf2h2 protein [Mus musculus]
          Length = 398

 Score =  287 bits (734), Expect = 6e-76
 Identities = 151/370 (40%), Positives = 223/370 (59%), Gaps = 32/370 (8%)

Query: 16  DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
           D+E E      + WE  Y  +R+WE L+EDE+G L+      +  A+ +R          
Sbjct: 2   DEEPERT----KRWEGGY--ERTWEILKEDETGSLKATIEDILFKAKRKRVFEH------ 49

Query: 76  TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
             +++ G++R+LY+VVD S+   ++D +P+R+    K +E F+ E+FDQNP+S +G++ T
Sbjct: 50  HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109

Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           K   A  LT+L G+P  HI +L   ++  C G+ SL N+L +    L  +P +  REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           ++S+L+TCDP ++ + I+  K +KIR SVIGL+AE+ +C  L +ETGGTY V LDE+H+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKTAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDETHYK 229

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH------EEAKTGG 301
           EL+  H  PPPA +  +  +LI+MGFPQ         +   +    H      E   T G
Sbjct: 230 ELLAHHVSPPPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG 288

Query: 302 GYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFP 361
           GY CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+  F EIS        +   
Sbjct: 289 GYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEISLE-----EYKGE 343

Query: 362 NTCFGCQQSL 371
             C+GCQ  L
Sbjct: 344 RFCYGCQGEL 353


>sp|Q9JIB4|TF2H2_MOUSE TFIIH basal transcription factor complex p44 subunit (Basic
           transcription factor 2 44 kDa subunit) (BTF2-p44)
           (General transcription factor IIH polypeptide 2)
           gi|9082152|gb|AAF82753.1| general transcription factor
           IIH polypeptide 2 [Mus musculus]
          Length = 396

 Score =  287 bits (734), Expect = 6e-76
 Identities = 151/370 (40%), Positives = 223/370 (59%), Gaps = 32/370 (8%)

Query: 16  DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
           D+E E      + WE  Y  +R+WE L+EDE+G L+      +  A+ +R          
Sbjct: 2   DEEPERT----KRWEGGY--ERTWEILKEDETGSLKATIEDILFKAKRKRVFEH------ 49

Query: 76  TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
             +++ G++R+LY+VVD S+   ++D +P+R+    K +E F+ E+FDQNP+S +G++ T
Sbjct: 50  HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109

Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           K   A  LT+L G+P  HI +L   ++  C G+ SL N+L +    L  +P +  REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           ++S+L+TCDP ++ + I+  K +KIR SVIGL+AE+ +C  L +ETGGTY V LDE+H+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKTAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDETHYK 229

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH------EEAKTGG 301
           EL+  H  PPPA +  +  +LI+MGFPQ         +   +    H      E   T G
Sbjct: 230 ELLAHHVSPPPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG 288

Query: 302 GYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFP 361
           GY CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+  F EIS        +   
Sbjct: 289 GYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEISLE-----EYKGE 343

Query: 362 NTCFGCQQSL 371
             C+GCQ  L
Sbjct: 344 RFCYGCQGEL 353


>ref|XP_215466.2| PREDICTED: similar to general transcription factor IIH polypeptide
           2 [Rattus norvegicus]
          Length = 396

 Score =  286 bits (731), Expect = 1e-75
 Identities = 151/370 (40%), Positives = 222/370 (59%), Gaps = 32/370 (8%)

Query: 16  DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
           D+E E      + WE  Y  +R+WE L+EDESG L+      +  A+ +R          
Sbjct: 2   DEEPERT----KRWEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------ 49

Query: 76  TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
             +++ G++R+LY+VVD S+   ++D +P+R+    K +E F+ E+FDQNP+S +G++ T
Sbjct: 50  HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109

Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           K   A  LT+L G+P  HI +L   ++  C G+ SL N+L +    L  +P +  REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           ++S+L+TCDP ++ + I+  K +KIR SVIGL+AE+ +C  L +ETGGTY V LDE+H+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKTAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDETHYK 229

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH------EEAKTGG 301
           EL+  H  PPPA +  +  +LI+MGFPQ         +   +    H      E   T G
Sbjct: 230 ELLARHVSPPPA-SSGSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG 288

Query: 302 GYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFP 361
           GY CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+  F EI         +   
Sbjct: 289 GYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLE-----EYKGE 343

Query: 362 NTCFGCQQSL 371
             C+GCQ  L
Sbjct: 344 RFCYGCQGEL 353


>ref|NP_071294.2| general transcription factor II H, polypeptide 2 [Mus musculus]
           gi|31418653|gb|AAH53382.1| General transcription factor
           II H, polypeptide 2 [Mus musculus]
          Length = 396

 Score =  283 bits (724), Expect = 8e-75
 Identities = 150/370 (40%), Positives = 222/370 (59%), Gaps = 32/370 (8%)

Query: 16  DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
           D+E E      + WE  Y  +R+WE L+EDE+G L+      +  A+ +R          
Sbjct: 2   DEEPERT----KRWEGGY--ERTWEILKEDETGSLKATIEDILFKAKRKRVFEH------ 49

Query: 76  TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
             +++ G++R+LY+VVD S+   ++D +P+R+    K +E F+ E+FDQNP+S +G++ T
Sbjct: 50  HGQVRLGMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVT 109

Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           K   A  LT+L G+P  HI +L   ++  C G+ SL N+L +    L  +P +  REVLI
Sbjct: 110 KSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLI 169

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           ++S+L+TCDP ++ + I+  K +KIR SVIGL+AE+ +C  L +ETGGTY V LDE+H+K
Sbjct: 170 IFSSLTTCDPSNIYDLIKTLKTAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDETHYK 229

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA------AEGSVAICTCH------EEAKTGG 301
           EL+  H  P PA +  +  +LI+MGFPQ         +   +    H      E   T G
Sbjct: 230 ELLAHHVSPLPA-SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG 288

Query: 302 GYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFP 361
           GY CP+C+ + CELP EC++CGLTL+S+PHLARSYHHLFP+  F EIS        +   
Sbjct: 289 GYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEISLE-----EYKGE 343

Query: 362 NTCFGCQQSL 371
             C+GCQ  L
Sbjct: 344 RFCYGCQGEL 353


>gb|AAH71091.1| MGC81060 protein [Xenopus laevis]
          Length = 395

 Score =  282 bits (722), Expect = 1e-74
 Identities = 149/369 (40%), Positives = 225/369 (60%), Gaps = 31/369 (8%)

Query: 16  DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
           D+E E      + WE  Y  +R+WE L+EDESG L+      I    ++ + + +  N  
Sbjct: 2   DEEPEKT----KRWEGGY--ERTWEVLKEDESGSLK----ATIDEILFKDKRKRIFENRG 51

Query: 76  TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
             R+  G++R+LY++VD S+   ++D +P+R+    K +E F+ E+FDQNP+S +GL+ T
Sbjct: 52  QVRL--GMMRHLYVIVDSSRTMEDQDLKPNRLTCTLKLLEFFVEEYFDQNPISQIGLIVT 109

Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           ++  A  LT+L G+P  HI A+   ++  CSG+ SL N+L L    L  +P +  RE+L+
Sbjct: 110 RNKRAEKLTELAGNPRQHINAMKKAVDMTCSGEPSLYNSLNLALQTLKHMPGHTSREILV 169

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           ++S+L+TCDP ++ + I+  K SK+R SVIGL+AE+ +C  L +ETGG Y V LDESH+K
Sbjct: 170 IFSSLTTCDPTNIYDMIKCLKASKVRVSVIGLSAEVRVCTVLTRETGGVYHVILDESHYK 229

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA--------AEGSVAICTCHEEAKTG---GG 302
           EL++ H  PPPA +  +  +LI+MGFPQ          A+ S ++      ++ G   GG
Sbjct: 230 ELLMHHVIPPPA-SSSSECSLIRMGFPQHTMGCLSDQDAKPSFSMAHLDNTSEPGLTLGG 288

Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
           Y CP+CK +  ELP EC+VC LTL+S+PHLARSYHHLFP+  F E+     +   +    
Sbjct: 289 YFCPQCKAKYSELPVECKVCRLTLVSAPHLARSYHHLFPLDAFKEVRLEEYDGERY---- 344

Query: 363 TCFGCQQSL 371
            C GC   L
Sbjct: 345 -CRGCDGEL 352


>ref|NP_001011081.1| hypothetical protein LOC496493 [Xenopus tropicalis]
           gi|54038231|gb|AAH84471.1| Hypothetical LOC496493
           [Xenopus tropicalis]
          Length = 393

 Score =  282 bits (721), Expect = 2e-74
 Identities = 149/369 (40%), Positives = 226/369 (60%), Gaps = 31/369 (8%)

Query: 16  DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
           D+E E      + WE  Y  +R+WE L+EDESG L+    + I    ++ + + +  N  
Sbjct: 2   DEEPEKT----KRWEGGY--ERTWEVLKEDESGSLK----STIDEILFKDKRKRIFENRG 51

Query: 76  TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
             R+  G++R+LY++VD S+   ++D +P+R+    K +E F+ E+FDQNP+S +GL+ T
Sbjct: 52  QVRL--GMMRHLYVIVDSSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGLIVT 109

Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           ++  A  LT+L G+P  H+ AL   ++  C+G+ SL N+L L    L  +P +  RE+L+
Sbjct: 110 RNKRAEKLTELAGNPRQHLNALKKAVDMNCNGEPSLYNSLNLALQTLKHMPGHTSREILV 169

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           ++S+L+TCDP ++ + I+  K SKIR SVIGL+AE+ +C  L +ETGG Y V LDESH+K
Sbjct: 170 IFSSLTTCDPTNIYDIIKCLKASKIRVSVIGLSAEVRVCTVLTRETGGVYHVILDESHYK 229

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA--------AEGSVAICTCHEEAKTG---GG 302
           EL++ H  PPPA +  +  +LI+MGFPQ          A+ S ++      ++ G   GG
Sbjct: 230 ELLMHHVIPPPA-SSTSECSLIRMGFPQHTMGCLSDQDAKPSFSMAHLDNTSEPGLTLGG 288

Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
           Y CP+C+ +  ELP EC+VC LTL+S+PHLARSYHHLFP+  F E+S        +    
Sbjct: 289 YFCPQCRAKYSELPVECKVCRLTLVSAPHLARSYHHLFPLDAFKEVSLEEYEGERY---- 344

Query: 363 TCFGCQQSL 371
            C GC   L
Sbjct: 345 -CRGCDGEL 352


>gb|AAH45397.1| General transcription factor IIH, polypeptide 2 [Danio rerio]
           gi|42415511|ref|NP_963875.1| general transcription
           factor IIH, polypeptide 2 [Danio rerio]
          Length = 392

 Score =  281 bits (719), Expect = 3e-74
 Identities = 148/345 (42%), Positives = 224/345 (64%), Gaps = 26/345 (7%)

Query: 16  DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
           D+E E      + WE  Y  +R+WE L+EDESG L+      +  A   +R R   S+  
Sbjct: 2   DEEPERT----KRWEGGY--ERTWEVLKEDESGSLKATVEDILFQA---KRKRVFESHG- 51

Query: 76  TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
             +++ G++R+L++++D S++  ++D +P+R+    K +E F+ E+FDQNP+S +G++TT
Sbjct: 52  --QVRLGMMRHLFVIIDSSRSMEDQDLKPNRLTSTLKLMEHFVEEYFDQNPISQIGIITT 109

Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           K+  A  LTDL G+P+ HI AL   ++  C G+ SL N+L +    L  +P++  REVL+
Sbjct: 110 KNKRAEKLTDLAGNPKKHITALRKAVDSTCVGEPSLYNSLNMALQTLKHMPAHTSREVLV 169

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           ++S+L+TCDPG++ E I+     KIR SVIGL+AE+ +C  L +ETGG+Y+V LDESHFK
Sbjct: 170 IFSSLTTCDPGNIYELIKTLNGLKIRVSVIGLSAEVRVCTILTRETGGSYNVILDESHFK 229

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA--------AEGSVAIC---TCHEEAKTGGG 302
           EL+L H  PPPA +  +  +LI+MGFPQ          A+ S ++    +  E   + GG
Sbjct: 230 ELLLLHVKPPPA-SSSSECSLIRMGFPQHVIASLSDQDAKPSFSMAHLDSSSEPELSLGG 288

Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAE 347
           Y CP+C+ +  ELP EC+VCGLTL+S+PHLARS+HHLFP+  F E
Sbjct: 289 YYCPQCRAKYTELPVECKVCGLTLVSAPHLARSFHHLFPLEAFQE 333


>emb|CAA20673.1| SPCC1682.07 [Schizosaccharomyces pombe]
           gi|19075300|ref|NP_587800.1| hypothetical protein
           SPCC1682.07 [Schizosaccharomyces pombe 972h-]
           gi|26400388|sp|O74995|TFH47_SCHPO TFIIH basal
           transcription factor complex p47 subunit (Suppressor of
           stem-loop protein 1 homolog) (SSL1 homolog)
           gi|3406059|gb|AAC29144.1| TFIIH subunit p47
           [Schizosaccharomyces pombe]
          Length = 421

 Score =  281 bits (718), Expect = 4e-74
 Identities = 147/354 (41%), Positives = 213/354 (59%), Gaps = 19/354 (5%)

Query: 20  EANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAATARI 79
           + +D+    WE  Y   RSW+ +QED  G L  +    I   + +R LR       T  +
Sbjct: 30  KTDDNEGYTWEGEY--QRSWDIVQEDAEGSLVGVIAGLIQSGKRKRLLRD------TTPL 81

Query: 80  QKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKDGV 139
           Q+G+IR++ +V+DLS +  ERDF   R  +  K    F+ EFF+QNP+S + ++   DG+
Sbjct: 82  QRGIIRHMVLVLDLSNSMEERDFHHKRFDLQIKYASEFVLEFFEQNPISQLSIIGVMDGI 141

Query: 140 ANCLTDLGGSPESHIKALMGKLECSGDASLQNALELVHSNLNQIPSYGHREVLILYSALS 199
           A+ +TDL G+P+SHI+ L    +CSG+ SLQNALE+  ++L+ I S+G REVLI++ ++ 
Sbjct: 142 AHRITDLHGNPQSHIQKLKSLRDCSGNFSLQNALEMARASLSHIASHGTREVLIIFGSIL 201

Query: 200 TCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGT----YSVALDESHFKEL 255
           + DPGD+ +TI       IR  ++GLAAE+ ICK +C +T  +    Y V + E HF+EL
Sbjct: 202 SSDPGDIFKTIDALVHDSIRVRIVGLAAEVAICKEICNKTNSSTKNAYGVVISEQHFREL 261

Query: 256 ILEHS-PPPPAIAEYATANLIKMGFPQRAAEGSVAICTCHEEAKTGGGYTCPRCKVRVCE 314
           +LE + PP    A+   A+L+ MGFP +  E   ++C CH    + GG+ CPRCK +VC 
Sbjct: 262 LLESTIPPATDSAKTTDASLVMMGFPSKVVEQLPSLCACH-SIPSRGGFHCPRCKAKVCT 320

Query: 315 LPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPNTCFGCQ 368
           LP EC  C L LI S HLARSYHHLFP+  ++EI  S+     H     CF CQ
Sbjct: 321 LPIECPSCSLVLILSTHLARSYHHLFPLKNWSEIPWSANPKSTH-----CFACQ 369


>dbj|BAA31745.1| SSL1 [Schizosaccharomyces pombe]
          Length = 392

 Score =  281 bits (718), Expect = 4e-74
 Identities = 147/354 (41%), Positives = 213/354 (59%), Gaps = 19/354 (5%)

Query: 20  EANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAATARI 79
           + +D+    WE  Y   RSW+ +QED  G L  +    I   + +R LR       T  +
Sbjct: 1   KTDDNEGYTWEGEY--QRSWDIVQEDAEGSLVGVIAGLIQSGKRKRLLRD------TTPL 52

Query: 80  QKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKDGV 139
           Q+G+IR++ +V+DLS +  ERDF   R  +  K    F+ EFF+QNP+S + ++   DG+
Sbjct: 53  QRGIIRHMVLVLDLSNSMEERDFHHKRFDLQIKYASEFVLEFFEQNPISQLSIIGVMDGI 112

Query: 140 ANCLTDLGGSPESHIKALMGKLECSGDASLQNALELVHSNLNQIPSYGHREVLILYSALS 199
           A+ +TDL G+P+SHI+ L    +CSG+ SLQNALE+  ++L+ I S+G REVLI++ ++ 
Sbjct: 113 AHRITDLHGNPQSHIQKLKSLRDCSGNFSLQNALEMARASLSHIASHGTREVLIIFGSIL 172

Query: 200 TCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGT----YSVALDESHFKEL 255
           + DPGD+ +TI       IR  ++GLAAE+ ICK +C +T  +    Y V + E HF+EL
Sbjct: 173 SSDPGDIFKTIDALVHDSIRVRIVGLAAEVAICKEICNKTNSSTKNAYGVVISEQHFREL 232

Query: 256 ILEHS-PPPPAIAEYATANLIKMGFPQRAAEGSVAICTCHEEAKTGGGYTCPRCKVRVCE 314
           +LE + PP    A+   A+L+ MGFP +  E   ++C CH    + GG+ CPRCK +VC 
Sbjct: 233 LLESTIPPATDSAKTTDASLVMMGFPSKVVEQLPSLCACH-SIPSRGGFHCPRCKAKVCT 291

Query: 315 LPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPNTCFGCQ 368
           LP EC  C L LI S HLARSYHHLFP+  ++EI  S+     H     CF CQ
Sbjct: 292 LPIECPSCSLVLILSTHLARSYHHLFPLKNWSEIPWSANPKSTH-----CFACQ 340


>ref|XP_424965.1| PREDICTED: similar to TFIIH basal transcription factor complex p44
           subunit (Basic transcription factor 2 44 kDa subunit)
           (BTF2-p44) (General transcription factor IIH polypeptide
           2) [Gallus gallus]
          Length = 985

 Score =  280 bits (717), Expect = 5e-74
 Identities = 152/369 (41%), Positives = 223/369 (60%), Gaps = 31/369 (8%)

Query: 16  DDEDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAA 75
           DDE E      + WE  Y  +R+WE L+EDESG L+      I    ++ + + +  +  
Sbjct: 592 DDEPERT----KRWEGGY--ERTWEILKEDESGSLK----ATIDDILFKAKRKRIYEHHG 641

Query: 76  TARIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTT 135
             R+  G++R+LY+VVD S+   ++D +P+R+    K +E F+ E+FDQNP+S +GL+ T
Sbjct: 642 QVRL--GMMRHLYVVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQMGLIVT 699

Query: 136 KDGVANCLTDLGGSPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLI 193
           K   A  +T+L G+P+ HI AL   ++  C G+ SL N+L L    L  +P +  REVLI
Sbjct: 700 KSKRAEKMTELSGNPKKHIAALKKAVDMNCQGEPSLYNSLNLAMQTLKHMPGHTSREVLI 759

Query: 194 LYSALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFK 253
           ++S+L+TCDP ++ + I+  K  KIR SVIGL+AE+ +C  L +ETGGTY V LDE+H+K
Sbjct: 760 VFSSLTTCDPANIYDLIKCLKAVKIRVSVIGLSAEVRVCTVLTRETGGTYHVILDETHYK 819

Query: 254 ELILEHSPPPPAIAEYATANLIKMGFPQRA--------AEGSVAICTCH---EEAKTGGG 302
           EL++ H  PPPA +  +  +LI+MGFPQ          A+ S ++       E   T  G
Sbjct: 820 ELLMHHVSPPPA-SSTSECSLIRMGFPQHTTASLSDQDAKPSFSMAQLENNSEPCLTLDG 878

Query: 303 YTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPN 362
           Y CP+C+ +  ELP EC++CGLTL+S+PHLARSYHHLFP+  F EI         +    
Sbjct: 879 YFCPQCRAKYSELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLEEYQGERY---- 934

Query: 363 TCFGCQQSL 371
            C GCQ  +
Sbjct: 935 -CQGCQAEI 942


>emb|CAF92597.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 395

 Score =  280 bits (716), Expect = 7e-74
 Identities = 150/353 (42%), Positives = 218/353 (61%), Gaps = 26/353 (7%)

Query: 29  WERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAATARIQKGLIRYLY 88
           WE  Y  +R+WE L+EDESG L+      +  A+ RR +++        +++ G++R+LY
Sbjct: 12  WEGGY--ERTWEVLKEDESGSLKASVEEILFQAKKRRLVQS------HGQVRLGMMRHLY 63

Query: 89  IVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKDGVANCLTDLGG 148
           +V+D S++  ++D +P+R+    K +E F+ E+FDQNP+S +G++TTK+  A  LTDL G
Sbjct: 64  VVIDCSRSMEDQDLKPNRLTSTLKLMEGFVEEYFDQNPISQMGIITTKNKRAEKLTDLAG 123

Query: 149 SPESHIKALMGKLE--CSGDASLQNALELVHSNLNQIPSYGHREVLILYSALSTCDPGDL 206
           +P+ H  AL   ++  C G+ SL N L L    L  +P +  REVLI+ S+L+TCDPG++
Sbjct: 124 NPKKHAAALKKAVDSACVGEPSLYNCLSLALQTLRHMPGHTSREVLIILSSLTTCDPGNI 183

Query: 207 METIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFKELILEHSPPPPAI 266
            E IQ  K  K+R SV+GL+AE+ +C  L +ETGG+Y V LDESHFKEL++ H  PPPA 
Sbjct: 184 YELIQTLKSLKVRVSVVGLSAEVRVCTVLTRETGGSYHVILDESHFKELLMLHVKPPPAS 243

Query: 267 AEYATANLIKM-GFPQRA------AEGSVAICTCHEEAKTG------GGYTCPRCKVRVC 313
              +  +LI+M GFPQ         +   +    H E   G      GGY CP+C  +  
Sbjct: 244 CS-SECSLIRMAGFPQHTMASLTDQDAKPSFSMAHLEGGGGGPDLSLGGYFCPQCHAKYT 302

Query: 314 ELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQNDPNHSFPNTCFG 366
           ELP EC+VCGLTL+ +PHLARS+HHLFP+  F E   S+++ P   F   C G
Sbjct: 303 ELPVECKVCGLTLVLAPHLARSFHHLFPLQVFPE--SSAEDPPKDRFCQACQG 353


>ref|XP_314533.2| ENSANGP00000016260 [Anopheles gambiae str. PEST]
           gi|55240127|gb|EAA09916.3| ENSANGP00000016260 [Anopheles
           gambiae str. PEST]
          Length = 403

 Score =  276 bits (706), Expect = 1e-72
 Identities = 148/359 (41%), Positives = 217/359 (60%), Gaps = 26/359 (7%)

Query: 29  WERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAATARIQKGLIRYLY 88
           WE  Y  +++WE+++ED+ GL+    +  I  A+ +R+    A     +++  G++R+LY
Sbjct: 10  WETGY--EKTWEAIKEDDDGLIEGSVSDIIQKAKRKRQ----AMKRGFSKL--GMMRHLY 61

Query: 89  IVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKDGVANCLTDLGG 148
           +++D S+A +  D +P+R     K +ELFI EFFDQNP+S +G++  K   A  +++LGG
Sbjct: 62  VLLDCSEAMTVPDLKPTRFICSLKLLELFIEEFFDQNPISQLGVIAMKAKRAEKISELGG 121

Query: 149 SPESHIKA---LMGKLECSGDASLQNALELVHSNLNQIPSYGHREVLILYSALSTCDPGD 205
           +   HIKA   L   +   G+ SLQN LEL    L  IP +  RE+L++  +L++CDP D
Sbjct: 122 TSRKHIKAVHALTNGVPLVGEPSLQNGLELALKTLRMIPQHASREILVIMGSLTSCDPND 181

Query: 206 LMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFKELILEHSPPPPA 265
           +  TI+  K   +RCSV+ L+AE+ ICK LC ETGG +   LD++H+K+ +L+H  PP A
Sbjct: 182 VHLTIENLKTEGVRCSVLSLSAEIRICKFLCTETGGLFGAVLDDAHYKDQLLQHIDPPQA 241

Query: 266 IAEYATANLIKMGFP----QRAAEGSVAICTCH-----EEAK-TGGGYTCPRCKVRVCEL 315
                  +LIKMGFP    +   E  + +C CH     E AK T GGY CP+C  + CEL
Sbjct: 242 -GNQQEFSLIKMGFPHGKTESGKEPPLTMCMCHIDSVDEPAKLTSGGYHCPQCYSKYCEL 300

Query: 316 PTECRVCGLTLISSPHLARSYHHLFPIVPFAEISPSSQ---NDPNHSFPNTCFGCQQSL 371
           P EC  CGLTL S+PHLARSYHHLFP+  F E+ P  Q    +P       C+ CQ++L
Sbjct: 301 PVECSACGLTLASAPHLARSYHHLFPVPHFNEL-PLEQVQVQEPRDPPVTNCYACQKTL 358


>gb|AAF51879.2| CG11115-PA [Drosophila melanogaster] gi|21356299|ref|NP_649427.1|
           CG11115-PA [Drosophila melanogaster]
           gi|15010516|gb|AAK77306.1| GH08526p [Drosophila
           melanogaster]
          Length = 438

 Score =  271 bits (692), Expect = 4e-71
 Identities = 143/371 (38%), Positives = 225/371 (60%), Gaps = 27/371 (7%)

Query: 18  EDEANDDGLEAWERAYTEDRSWESLQEDESGLLRPIDTTAIHHAQYRRRLRALASNAATA 77
           +DE  D     WE  Y  +++WE++++DE G+L       I  A+ +R+ +    N    
Sbjct: 3   DDEQEDQKEYRWETGY--EKTWEAIKDDEDGMLDGAIAEIIQKAKRQRQAQKSKQN---- 56

Query: 78  RIQKGLIRYLYIVVDLSKAASERDFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKD 137
             + G++R++++V+D S++ S  D +P+R+    K +ELFI EFFDQNP+S +GL+  K 
Sbjct: 57  --RLGMMRHMFVVLDCSESMSVPDLKPTRLRCTVKLLELFIEEFFDQNPISQLGLIALKA 114

Query: 138 GVANCLTDLGGSPESHIKAL--MGKLECSGDASLQNALELVHSNLNQIPSYGHREVLILY 195
             A  +T+L G+   H+KAL  +  +  + + SLQN L+L   +L  +PS+  RE++I+ 
Sbjct: 115 KRAEKVTELTGTSRVHLKALESLANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIM 174

Query: 196 SALSTCDPGDLMETIQKCKKSKIRCSVIGLAAEMFICKHLCQETGGTYSVALDESHFKEL 255
            +L+TCDP D+  TI + KK  IRCSVI L+AE+ + ++L Q+T GT+   LD++HF++ 
Sbjct: 175 GSLTTCDPVDINLTIDELKKEGIRCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQ 234

Query: 256 ILEHSPPPPAIAEYATANLIKMGFPQ-----RAAEGSVAICTCHEE------AKTGGGYT 304
           ++    PPPA A+    +LI+MGFP         +  +++C CH E        T  G+ 
Sbjct: 235 LMSQVDPPPA-AKTQHNSLIRMGFPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHH 293

Query: 305 CPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLFPIVPFAEI----SPSSQNDPNHSF 360
           CP+C  + CELP EC+ CGLTL+S+PHLARSYHHLFP+  F E+     P+S +D   S 
Sbjct: 294 CPQCNSKYCELPVECQSCGLTLVSAPHLARSYHHLFPVPNFEELPFEAMPASSSDLT-SD 352

Query: 361 PNTCFGCQQSL 371
              C+GC ++L
Sbjct: 353 VRECYGCAKAL 363


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.319    0.135    0.406 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 821,244,024
Number of Sequences: 2540612
Number of extensions: 34336505
Number of successful extensions: 115813
Number of sequences better than 10.0: 119
Number of HSP's better than 10.0 without gapping: 66
Number of HSP's successfully gapped in prelim test: 53
Number of HSP's that attempted gapping in prelim test: 115549
Number of HSP's gapped (non-prelim): 131
length of query: 471
length of database: 863,360,394
effective HSP length: 132
effective length of query: 339
effective length of database: 527,999,610
effective search space: 178991867790
effective search space used: 178991867790
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)
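
The footer values above can be cross-checked against each other. The sketch below is not part of the BLAST output; it assumes the standard Karlin-Altschul relations (bit score = (lambda*S - ln K)/ln 2, E = m*n*2^-bits) and the length correction implied by the numbers in this report (each sequence shortened by the effective HSP length). All function names are hypothetical, and because the report prints lambda and K rounded, the results only approximate the reported scores.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 471
DB_LETTERS = 863_360_394
DB_SEQS = 2_540_612
HSP_LEN = 132
UNGAPPED = (0.319, 0.135)   # (lambda, K) as printed, i.e. rounded
GAPPED = (0.267, 0.0410)    # (lambda, K) as printed, i.e. rounded


def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(raw, lam, k, search_space):
    """Expected number of chance hits scoring at least `raw`."""
    return search_space * 2.0 ** (-bit_score(raw, lam, k))


def dropoff_bits(raw, lam):
    """Convert an X-dropoff value (a score difference) to bits."""
    return lam * raw / math.log(2)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_lengths(
        QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN)
    print(eff_q, eff_db, space)
    # Raw score 717 is the Gallus gallus hit (reported as 280 bits, 5e-74).
    print(bit_score(717, *GAPPED), expect(717, *GAPPED, space))
    print(round(bit_score(41, *UNGAPPED), 1))   # S1, ungapped parameters
    print(round(bit_score(77, *GAPPED), 1))     # S2, gapped parameters
    print(round(dropoff_bits(16, UNGAPPED[0]), 1))  # X1
```

The integer quantities (effective lengths and search space) reproduce the footer exactly; the bit scores and E-values agree with the report only to within the rounding of the printed parameters.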

