FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5274, 349 aa
1>>>pF1KB5274 349 - 349 aa - 349 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.4800+/-0.000433; mu= 3.2331+/- 0.027
mean_var=263.0727+/-54.322, 0's: 0 Z-trim(118.5): 5 B-trim: 522 in 1/60
Lambda= 0.079074
statistics sampled from 31458 (31463) to 31458 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.713), E-opt: 0.2 (0.369), width: 16
Scan time: 8.930
The best scores are:                                       opt bits E(85289)
NP_005633 (OMIM: 600573) transcription initiation  ( 349) 2260 271.0 2.8e-72
NP_001161946 (OMIM: 300314) transcription initiati ( 376)  726  96.0 1.4e-19
XP_005262209 (OMIM: 300314) PREDICTED: transcripti ( 377)  726  96.0 1.4e-19
NP_079161 (OMIM: 300314) transcription initiation  ( 462)  726  96.1 1.6e-19
XP_006724727 (OMIM: 300314) PREDICTED: transcripti ( 463)  726  96.1 1.6e-19
>>NP_005633 (OMIM: 600573) transcription initiation fact (349 aa)
initn: 2260 init1: 2260 opt: 2260 Z-score: 1418.6 bits: 271.0 E(85289): 2.8e-72
Smith-Waterman score: 2260; 100.0% identity (100.0% similar) in 349 aa overlap (1-349:1-349)
10 20 30 40 50 60
pF1KB5 MSKSKDDAPHELESQFILRLPPEYASTVRRAVQSGHVNLKDRLTIELHPDGRHGIVRVDR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 MSKSKDDAPHELESQFILRLPPEYASTVRRAVQSGHVNLKDRLTIELHPDGRHGIVRVDR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 VPLASKLVDLPCVMESLKTIDKKTFYKTADICQMLVSTVDGDLYPPVEEPVASTDPKASK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 VPLASKLVDLPCVMESLKTIDKKTFYKTADICQMLVSTVDGDLYPPVEEPVASTDPKASK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 KKDKDKEKKFIWNHGITLPLKNVRKRRFRKTAKKKYIESPDVEKEVKRLLSTDAEAVSTR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 KKDKDKEKKFIWNHGITLPLKNVRKRRFRKTAKKKYIESPDVEKEVKRLLSTDAEAVSTR
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 WEIIAEDETKEAENQGLDISSPGMSGHRQGHDSLEHDELREIFNDLSSSSEDEDETQHQD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 WEIIAEDETKEAENQGLDISSPGMSGHRQGHDSLEHDELREIFNDLSSSSEDEDETQHQD
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB5 EEDINIIDTEEDLERQLQDKLNESDEQHQENEGTNQLVMGIQKQIDNMKGKLQETQDRAK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 EEDINIIDTEEDLERQLQDKLNESDEQHQENEGTNQLVMGIQKQIDNMKGKLQETQDRAK
250 260 270 280 290 300
310 320 330 340
pF1KB5 RQEDLIMKVENLALKNRFQAVLDELKQKEDREKEQLSSLQEELESLLEK
:::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 RQEDLIMKVENLALKNRFQAVLDELKQKEDREKEQLSSLQEELESLLEK
310 320 330 340
>>NP_001161946 (OMIM: 300314) transcription initiation f (376 aa)
initn: 1213 init1: 669 opt: 726 Z-score: 472.4 bits: 96.0 E(85289): 1.4e-19
Smith-Waterman score: 1288; 57.1% identity (79.1% similar) in 378 aa overlap (1-349:1-376)
10 20 30 40 50 60
pF1KB5 MSKSKDDAPHELESQFILRLPPEYASTVRRAVQSGHVNLKDRLTIELHPDGRHGIVRVDR
::.:.:..: :.:.::::::: :.: ::: ..: :..::.: :.: :::::..:.:.
NP_001 MSESQDEVPDEVENQFILRLPLEHACTVRNLARSQSVKMKDKLKIDLLPDGRHAVVEVED
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 VPLASKLVDLPCVMESLKTIDKKTFYKTADICQMLVSTVDGDLYPPVEEPVASTDPKASK
::::.::::::::.:::.:.::::::::::: :::: :.:::.. :::.:::::. .
NP_001 VPLAAKLVDLPCVIESLRTLDKKTFYKTADISQMLVCTADGDIHLSPEEPAASTDPNIVR
70 80 90 100 110 120
130 140 150 160
pF1KB5 KKDKDKEKKFIWNHGITLPLKNVRKRRFRKTAKK-------------KYIESPDVEKEVK
::.. .:.: .:.:::: :::::::.::::: :: .::::::::.:::
NP_001 KKERGREEKCVWKHGITPPLKNVRKKRFRKTQKKVPDVKEMEKSSFTEYIESPDVENEVK
130 140 150 160 170 180
170 180 190 200 210 220
pF1KB5 RLLSTDAEAVSTRWEIIAEDETKEAENQG----LDISSPGMSGHRQGHDSLEHDELREIF
::: .::::::::::.:::: ::: :.:: . ::: :::.:.::: : :.: :::.:
NP_001 RLLRSDAEAVSTRWEVIAEDGTKEIESQGSIPGFLISS-GMSSHKQGHTSSEYDMLREMF
190 200 210 220 230
230 240 250 260 270
pF1KB5 NDLSSSSED------EDETQHQDE-EDINIIDTEED-----LERQLQDKLNESDEQHQEN
.: :...: ::: . .:: :: . . ::: :::::: .. :: :.. :
NP_001 SDSRSNNDDDEDEDDEDEDEDEDEDEDEDKEEEEEDCSEEYLERQLQAEFIESG-QYRAN
240 250 260 270 280 290
280 290 300 310 320 330
pF1KB5 EGTNQLVMGIQKQIDNMKGKLQETQDRAKRQEDLIMKVENLALKNRFQAVLDELKQKEDR
:::...:: :::::.. . ::.. :..:.::.:::::::::.:::.::.::..:. .: .
NP_001 EGTSSIVMEIQKQIEKKEKKLHKIQNKAQRQKDLIMKVENLTLKNHFQSVLEQLELQEKQ
300 310 320 330 340 350
340
pF1KB5 EKEQLSSLQEELESLLEK
..:.: ::::.:. .:.:
NP_001 KNEKLISLQEQLQRFLKK
360 370
>>XP_005262209 (OMIM: 300314) PREDICTED: transcription i (377 aa)
initn: 1381 init1: 669 opt: 726 Z-score: 472.4 bits: 96.0 E(85289): 1.4e-19
Smith-Waterman score: 1281; 57.0% identity (79.4% similar) in 379 aa overlap (1-349:1-377)
10 20 30 40 50 60
pF1KB5 MSKSKDDAPHELESQFILRLPPEYASTVRRAVQSGHVNLKDRLTIELHPDGRHGIVRVDR
::.:.:..: :.:.::::::: :.: ::: ..: :..::.: :.: :::::..:.:.
XP_005 MSESQDEVPDEVENQFILRLPLEHACTVRNLARSQSVKMKDKLKIDLLPDGRHAVVEVED
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 VPLASKLVDLPCVMESLKTIDKKTFYKTADICQMLVSTVDGDLYPPVEEPVASTDPKASK
::::.::::::::.:::.:.::::::::::: :::: :.:::.. :::.:::::. .
XP_005 VPLAAKLVDLPCVIESLRTLDKKTFYKTADISQMLVCTADGDIHLSPEEPAASTDPNIVR
70 80 90 100 110 120
130 140 150 160
pF1KB5 KKDKDKEKKFIWNHGITLPLKNVRKRRFRKTAKK-------------KYIESPDVEKEVK
::.. .:.: .:.:::: :::::::.::::: :: .::::::::.:::
XP_005 KKERGREEKCVWKHGITPPLKNVRKKRFRKTQKKVPDVKEMEKSSFTEYIESPDVENEVK
130 140 150 160 170 180
170 180 190 200 210 220
pF1KB5 RLLSTDAEAVSTRWEIIAEDETKEAENQG----LDISSPGMSGHRQGH-DSLEHDELREI
::: .::::::::::.:::: ::: :.:: . ::: :::.:.::: .:.:.: :::.
XP_005 RLLRSDAEAVSTRWEVIAEDGTKEIESQGSIPGFLISS-GMSSHKQGHTSSVEYDMLREM
190 200 210 220 230
230 240 250 260 270
pF1KB5 FNDLSSSSED------EDETQHQDE-EDINIIDTEED-----LERQLQDKLNESDEQHQE
:.: :...: ::: . .:: :: . . ::: :::::: .. :: :..
XP_005 FSDSRSNNDDDEDEDDEDEDEDEDEDEDEDKEEEEEDCSEEYLERQLQAEFIESG-QYRA
240 250 260 270 280 290
280 290 300 310 320 330
pF1KB5 NEGTNQLVMGIQKQIDNMKGKLQETQDRAKRQEDLIMKVENLALKNRFQAVLDELKQKED
::::...:: :::::.. . ::.. :..:.::.:::::::::.:::.::.::..:. .:
XP_005 NEGTSSIVMEIQKQIEKKEKKLHKIQNKAQRQKDLIMKVENLTLKNHFQSVLEQLELQEK
300 310 320 330 340 350
340
pF1KB5 REKEQLSSLQEELESLLEK
...:.: ::::.:. .:.:
XP_005 QKNEKLISLQEQLQRFLKK
360 370
>>NP_079161 (OMIM: 300314) transcription initiation fact (462 aa)
initn: 1213 init1: 669 opt: 726 Z-score: 471.3 bits: 96.1 E(85289): 1.6e-19
Smith-Waterman score: 1288; 57.1% identity (79.1% similar) in 378 aa overlap (1-349:87-462)
10 20 30
pF1KB5 MSKSKDDAPHELESQFILRLPPEYASTVRR
::.:.:..: :.:.::::::: :.: :::
NP_079 IPADEDTQTDADSSAQAAAQAPENFQEGKDMSESQDEVPDEVENQFILRLPLEHACTVRN
60 70 80 90 100 110
40 50 60 70 80 90
pF1KB5 AVQSGHVNLKDRLTIELHPDGRHGIVRVDRVPLASKLVDLPCVMESLKTIDKKTFYKTAD
..: :..::.: :.: :::::..:.:. ::::.::::::::.:::.:.::::::::::
NP_079 LARSQSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTAD
120 130 140 150 160 170
100 110 120 130 140 150
pF1KB5 ICQMLVSTVDGDLYPPVEEPVASTDPKASKKKDKDKEKKFIWNHGITLPLKNVRKRRFRK
: :::: :.:::.. :::.:::::. .::.. .:.: .:.:::: :::::::.::::
NP_079 ISQMLVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRK
180 190 200 210 220 230
160 170 180 190
pF1KB5 TAKK-------------KYIESPDVEKEVKRLLSTDAEAVSTRWEIIAEDETKEAENQG-
: :: .::::::::.:::::: .::::::::::.:::: ::: :.::
NP_079 TQKKVPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGS
240 250 260 270 280 290
200 210 220 230 240
pF1KB5 ---LDISSPGMSGHRQGHDSLEHDELREIFNDLSSSSED------EDETQHQDE-EDINI
. ::: :::.:.::: : :.: :::.:.: :...: ::: . .:: :: .
NP_079 IPGFLISS-GMSSHKQGHTSSEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDEDK
300 310 320 330 340 350
250 260 270 280 290 300
pF1KB5 IDTEED-----LERQLQDKLNESDEQHQENEGTNQLVMGIQKQIDNMKGKLQETQDRAKR
. ::: :::::: .. :: :.. ::::...:: :::::.. . ::.. :..:.:
NP_079 EEEEEDCSEEYLERQLQAEFIESG-QYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQR
360 370 380 390 400 410
310 320 330 340
pF1KB5 QEDLIMKVENLALKNRFQAVLDELKQKEDREKEQLSSLQEELESLLEK
:.:::::::::.:::.::.::..:. .: ...:.: ::::.:. .:.:
NP_079 QKDLIMKVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK
420 430 440 450 460
>>XP_006724727 (OMIM: 300314) PREDICTED: transcription i (463 aa)
initn: 1381 init1: 669 opt: 726 Z-score: 471.3 bits: 96.1 E(85289): 1.6e-19
Smith-Waterman score: 1281; 57.0% identity (79.4% similar) in 379 aa overlap (1-349:87-463)
10 20 30
pF1KB5 MSKSKDDAPHELESQFILRLPPEYASTVRR
::.:.:..: :.:.::::::: :.: :::
XP_006 IPADEDTQTDADSSAQAAAQAPENFQEGKDMSESQDEVPDEVENQFILRLPLEHACTVRN
60 70 80 90 100 110
40 50 60 70 80 90
pF1KB5 AVQSGHVNLKDRLTIELHPDGRHGIVRVDRVPLASKLVDLPCVMESLKTIDKKTFYKTAD
..: :..::.: :.: :::::..:.:. ::::.::::::::.:::.:.::::::::::
XP_006 LARSQSVKMKDKLKIDLLPDGRHAVVEVEDVPLAAKLVDLPCVIESLRTLDKKTFYKTAD
120 130 140 150 160 170
100 110 120 130 140 150
pF1KB5 ICQMLVSTVDGDLYPPVEEPVASTDPKASKKKDKDKEKKFIWNHGITLPLKNVRKRRFRK
: :::: :.:::.. :::.:::::. .::.. .:.: .:.:::: :::::::.::::
XP_006 ISQMLVCTADGDIHLSPEEPAASTDPNIVRKKERGREEKCVWKHGITPPLKNVRKKRFRK
180 190 200 210 220 230
160 170 180 190
pF1KB5 TAKK-------------KYIESPDVEKEVKRLLSTDAEAVSTRWEIIAEDETKEAENQG-
: :: .::::::::.:::::: .::::::::::.:::: ::: :.::
XP_006 TQKKVPDVKEMEKSSFTEYIESPDVENEVKRLLRSDAEAVSTRWEVIAEDGTKEIESQGS
240 250 260 270 280 290
200 210 220 230 240
pF1KB5 ---LDISSPGMSGHRQGH-DSLEHDELREIFNDLSSSSED------EDETQHQDE-EDIN
. ::: :::.:.::: .:.:.: :::.:.: :...: ::: . .:: :: .
XP_006 IPGFLISS-GMSSHKQGHTSSVEYDMLREMFSDSRSNNDDDEDEDDEDEDEDEDEDEDED
300 310 320 330 340 350
250 260 270 280 290 300
pF1KB5 IIDTEED-----LERQLQDKLNESDEQHQENEGTNQLVMGIQKQIDNMKGKLQETQDRAK
. ::: :::::: .. :: :.. ::::...:: :::::.. . ::.. :..:.
XP_006 KEEEEEDCSEEYLERQLQAEFIESG-QYRANEGTSSIVMEIQKQIEKKEKKLHKIQNKAQ
360 370 380 390 400 410
310 320 330 340
pF1KB5 RQEDLIMKVENLALKNRFQAVLDELKQKEDREKEQLSSLQEELESLLEK
::.:::::::::.:::.::.::..:. .: ...:.: ::::.:. .:.:
XP_006 RQKDLIMKVENLTLKNHFQSVLEQLELQEKQKNEKLISLQEQLQRFLKK
420 430 440 450 460
349 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 16:29:46 2016 done: Thu Nov 3 16:29:47 2016
Total Scan time: 8.930 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]