FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3616, 478 aa 1>>>pF1KE3616 478 - 478 aa - 478 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.7847+/-0.000454; mu= 5.8488+/- 0.028 mean_var=133.4470+/-27.685, 0's: 0 Z-trim(114.0): 29 B-trim: 812 in 1/56 Lambda= 0.111025 statistics sampled from 23651 (23670) to 23651 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.65), E-opt: 0.2 (0.278), width: 16 Scan time: 6.680 The best scores are: opt bits E(85289) NP_006863 (OMIM: 605358) TFIIA-alpha and beta-like ( 478) 3148 516.1 8.6e-146 NP_001180416 (OMIM: 605358) TFIIA-alpha and beta-l ( 444) 2884 473.8 4.3e-133 NP_001265869 (OMIM: 600520) transcription initiati ( 326) 412 77.8 5.2e-14 NP_963889 (OMIM: 600520) transcription initiation ( 337) 412 77.8 5.3e-14 NP_056943 (OMIM: 600520) transcription initiation ( 376) 412 77.8 5.8e-14 >>NP_006863 (OMIM: 605358) TFIIA-alpha and beta-like fac (478 aa) initn: 3148 init1: 3148 opt: 3148 Z-score: 2738.4 bits: 516.1 E(85289): 8.6e-146 Smith-Waterman score: 3148; 100.0% identity (100.0% similar) in 478 aa overlap (1-478:1-478) 10 20 30 40 50 60 pF1KE3 MACLNPVPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATEDFFRNS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 MACLNPVPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATEDFFRNS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 IQSPLFTLQLPHSLHQTLQSSTASLVIPAGRTLPSFTTAELGTSNSSANFTFPGYPIHVP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 IQSPLFTLQLPHSLHQTLQSSTASLVIPAGRTLPSFTTAELGTSNSSANFTFPGYPIHVP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 AGVTLQTVSGHLYKVNVPIMVTETSGRAGILQHPIQQVFQQLGQPSVIQTSVPQLNPWSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 AGVTLQTVSGHLYKVNVPIMVTETSGRAGILQHPIQQVFQQLGQPSVIQTSVPQLNPWSL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 QATTEKSQRIETVLQQPAILPSGPVDRKHLENATSDILVSPGNEHKIVPEALLCHQESSH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 QATTEKSQRIETVLQQPAILPSGPVDRKHLENATSDILVSPGNEHKIVPEALLCHQESSH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 YISLPGVVFSPQVSQTNSNVESVLSGSASMAQNLHDESLSTSPHGALHQHVTDIQLHILK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 YISLPGVVFSPQVSQTNSNVESVLSGSASMAQNLHDESLSTSPHGALHQHVTDIQLHILK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 NRMYGCDSVKQPRNIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 NRMYGCDSVKQPRNIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNEE 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 IGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSGD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 IGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSGD 370 380 390 400 410 420 430 440 450 460 470 pF1KE3 DVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 DVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 430 440 450 460 470 >>NP_001180416 (OMIM: 605358) TFIIA-alpha and beta-like (444 aa) initn: 2922 init1: 2884 opt: 2884 Z-score: 2510.4 bits: 473.8 E(85289): 4.3e-133 Smith-Waterman score: 2884; 100.0% identity (100.0% similar) in 437 aa overlap (42-478:8-444) 20 30 40 50 60 70 pF1KE3 RSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATEDFFRNSIQSPLFTLQLP :::::::::::::::::::::::::::::: NP_001 MACLNPVLWETKVLQSKATEDFFRNSIQSPLFTLQLP 10 20 30 80 90 100 110 120 130 pF1KE3 HSLHQTLQSSTASLVIPAGRTLPSFTTAELGTSNSSANFTFPGYPIHVPAGVTLQTVSGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HSLHQTLQSSTASLVIPAGRTLPSFTTAELGTSNSSANFTFPGYPIHVPAGVTLQTVSGH 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE3 LYKVNVPIMVTETSGRAGILQHPIQQVFQQLGQPSVIQTSVPQLNPWSLQATTEKSQRIE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LYKVNVPIMVTETSGRAGILQHPIQQVFQQLGQPSVIQTSVPQLNPWSLQATTEKSQRIE 100 110 120 130 140 150 200 210 220 230 240 250 pF1KE3 TVLQQPAILPSGPVDRKHLENATSDILVSPGNEHKIVPEALLCHQESSHYISLPGVVFSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TVLQQPAILPSGPVDRKHLENATSDILVSPGNEHKIVPEALLCHQESSHYISLPGVVFSP 160 170 180 190 200 210 260 270 280 290 300 310 pF1KE3 QVSQTNSNVESVLSGSASMAQNLHDESLSTSPHGALHQHVTDIQLHILKNRMYGCDSVKQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 QVSQTNSNVESVLSGSASMAQNLHDESLSTSPHGALHQHVTDIQLHILKNRMYGCDSVKQ 220 230 240 250 260 270 320 330 340 350 360 370 pF1KE3 PRNIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNEEIGSTRDADENE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PRNIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSGDTSSNEEIGSTRDADENE 280 290 300 310 320 330 380 390 400 410 420 430 pF1KE3 FLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSGDDVSEQDVPDLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPLNSGDDVSEQDVPDLF 340 350 360 370 380 390 440 450 460 470 pF1KE3 DTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW ::::::::::::::::::::::::::::::::::::::::::::::: NP_001 DTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDAEW 400 410 420 430 440 >>NP_001265869 (OMIM: 600520) transcription initiation f (326 aa) initn: 450 init1: 403 opt: 412 Z-score: 372.6 bits: 77.8 E(85289): 5.2e-14 Smith-Waterman score: 451; 34.6% identity (58.4% similar) in 332 aa overlap (154-478:47-326) 130 140 150 160 170 180 pF1KE3 TLQTVSGHLYKVNVPIMVTETSGRAGILQHPIQQVFQQLGQPSVIQTSVPQ-LNPWS-LQ : : ::. :. :...:: . : : : NP_001 LLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLI 20 30 40 50 60 70 190 200 210 220 230 pF1KE3 ATTEKSQRIETVLQQPAILPSG--PVDRKHLENATSDILVSPGNEHKIVPEALLCHQESS . :. .. ::.: ::.. ::.. :. ..: : ... NP_001 QHMNASNMSAAATAATLALPAGVTPVQQ---------ILTNSGQLLQVVRAA-----NGA 80 90 100 110 120 240 250 260 270 280 290 pF1KE3 HYISLP--GVVFSPQVSQTNSNVESVLSGSASMAQNLHDESLSTSPHGALHQHVTDIQLH .:: : .::.. :: . .. :... . .. :. : : : : . .. NP_001 QYIFQPQQSVVLQQQV------IPQMQPGGVQAP--VIQQVLAPLPGGISPQ--TGVIIQ 130 140 150 160 170 300 310 320 330 340 350 pF1KE3 ILKNRMYGCDSVKQPRNIEEPSNIPVSEK-DSNSQVDLSIRVTDDDIGEIIQVDGSGDTS . . : . : .. :. :.. . ...: . . . .. . ..::::.:::: NP_001 PQQILFTGNKTQVIPTTVAAPT--PAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTS 180 190 200 210 220 230 360 370 380 390 400 410 pF1KE3 SNEEIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPL :.: : ::.: : : .:: : ...:.: :: :: ::.:: NP_001 SEE------DEDEEE-----DYDD----DEEED--KEKDGA------EDGQV---EEEPL 240 250 260 420 430 440 450 460 470 pF1KE3 NSGDDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDA :: ::::... .::::.::.::::::::::::::::.::::.: ..::::.:.:::::: NP_001 NSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDA 270 280 290 300 310 320 pF1KE3 EW :: NP_001 EW >>NP_963889 (OMIM: 600520) transcription initiation fact (337 aa) initn: 511 init1: 403 opt: 412 Z-score: 372.4 bits: 77.8 E(85289): 5.3e-14 Smith-Waterman score: 451; 34.6% identity (58.4% similar) in 332 aa overlap (154-478:58-337) 130 140 150 160 170 180 pF1KE3 TLQTVSGHLYKVNVPIMVTETSGRAGILQHPIQQVFQQLGQPSVIQTSVPQ-LNPWS-LQ : : ::. :. :...:: . : : : NP_963 LLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLI 30 40 50 60 70 80 190 200 210 220 230 pF1KE3 ATTEKSQRIETVLQQPAILPSG--PVDRKHLENATSDILVSPGNEHKIVPEALLCHQESS . :. .. ::.: ::.. ::.. :. ..: : ... NP_963 QHMNASNMSAAATAATLALPAGVTPVQQ---------ILTNSGQLLQVVRAA-----NGA 90 100 110 120 130 240 250 260 270 280 290 pF1KE3 HYISLP--GVVFSPQVSQTNSNVESVLSGSASMAQNLHDESLSTSPHGALHQHVTDIQLH .:: : .::.. :: . .. :... . .. :. : : : : . .. NP_963 QYIFQPQQSVVLQQQV------IPQMQPGGVQAP--VIQQVLAPLPGGISPQ--TGVIIQ 140 150 160 170 180 300 310 320 330 340 350 pF1KE3 ILKNRMYGCDSVKQPRNIEEPSNIPVSEK-DSNSQVDLSIRVTDDDIGEIIQVDGSGDTS . . : . : .. :. :.. . ...: . . . .. . ..::::.:::: NP_963 PQQILFTGNKTQVIPTTVAAPT--PAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTS 190 200 210 220 230 240 360 370 380 390 400 410 pF1KE3 SNEEIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEEDPL :.: : ::.: : : .:: : ...:.: :: :: ::.:: NP_963 SEE------DEDEEE-----DYDD----DEEED--KEKDGA------EDGQV---EEEPL 250 260 270 420 430 440 450 460 470 pF1KE3 NSGDDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAIGDA :: ::::... .::::.::.::::::::::::::::.::::.: ..::::.:.:::::: NP_963 NSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDA 280 290 300 310 320 330 pF1KE3 EW :: NP_963 EW >>NP_056943 (OMIM: 600520) transcription initiation fact (376 aa) initn: 653 init1: 403 opt: 412 Z-score: 371.6 bits: 77.8 E(85289): 5.8e-14 Smith-Waterman score: 597; 33.9% identity (56.1% similar) in 481 aa overlap (2-478:5-376) 10 20 30 40 50 pF1KE3 MACLNPVPKLYRSVIEDVIEGVRNLFAEEGIEEQVLKDLKQLWETKVLQSKATEDFF : : :::::::::::::. ::..: ..:..:::: .:: :::.:..::.:. : : NP_056 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAV-DGF 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 RNSIQSPLFTLQLPHSLHQTLQSSTA--SLVIPAGRTLPSFTTAELGTSNSSANFTFPGY .. :. :. .: :. .: . . . : .:.:. . .. .: . : : NP_056 HSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQ-QTVPQQAQTQQVLIPASQQATAP-- 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 PIHVPAGVTLQTVSGHLYKVNVPIMVTETSGR--AGILQHPIQQVFQQLGQPSVIQTSVP . :: . .: :. :. .: .. ::. :.::.. . :: ..:. : NP_056 QVIVPDSKLIQ----HMNASNMSAAATAATLALPAGVT--PVQQILTNSGQ--LLQV-VR 120 130 140 150 160 180 190 200 210 220 230 pF1KE3 QLNPWSLQATTEKSQRIETVLQQPAILPSGPVDRKHLENATSDILVSPGNEHKIVPEALL : . : . .: . :::: ..:. ..::. . : . .: NP_056 AAN--GAQYIFQPQQSV--VLQQQ-VIPQ----------------MQPGGVQAPVIQQVL 170 180 190 200 240 250 260 270 280 290 pF1KE3 CHQESSHYISLPGVVFSPQVSQTNSNVESVLSGSASMAQNLHDESLSTSPHGALHQHVTD ::: . :::.. . . ...:. ... . ..: : : .. NP_056 A--------PLPGGI-SPQTGVIIQPQQILFTGNKTQVI----PTTVAAPTPAQAQITAT 210 220 230 240 250 300 310 320 330 340 350 pF1KE3 IQLHILKNRMYGCDSVKQPRNIEEPSNIPVSEKDSNSQVDLSIRVTDDDIGEIIQVDGSG : .::. .:.. .:. : ..::::.: NP_056 GQ--------------QQPQA--QPAQ---------TQAPL-----------VLQVDGTG 260 270 360 370 380 390 400 410 pF1KE3 DTSSNEEIGSTRDADENEFLGNIDGGDLKVPEEEADSISNEDSATNSSDNEDPQVNIVEE ::::.: : ::.: : : .:: : ...:.: :: : ::: NP_056 DTSSEE------DEDEEE-----DYDD----DEEED--KEKDGA------EDGQ---VEE 280 290 300 310 420 430 440 450 460 470 pF1KE3 DPLNSGDDVSEQDVPDLFDTDNVIVCQYDKIHRSKNKWKFYLKDGVMCFGGRDYVFAKAI .:::: ::::... .::::.::.::::::::::::::::.::::.: ..::::.:.::: NP_056 EPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAI 320 330 340 350 360 370 pF1KE3 GDAEW ::::: NP_056 GDAEW 478 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 20:25:16 2016 done: Sun Nov 6 20:25:17 2016 Total Scan time: 6.680 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]