FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5542, 365 aa
1>>>pF1KE5542 365 - 365 aa - 365 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.0087+/-0.00109; mu= 12.9234+/- 0.065
mean_var=83.4215+/-16.615, 0's: 0 Z-trim(104.2): 38 B-trim: 0 in 0/51
Lambda= 0.140422
statistics sampled from 7741 (7765) to 7741 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.619), E-opt: 0.2 (0.239), width: 16
Scan time: 2.220
The best scores are:                                       opt  bits E(32554)
CCDS14491.2 NXF5 gene_id:55998|Hs108|chrX          ( 365) 2451 506.7 1.3e-143
CCDS43979.1 NXF2B gene_id:728343|Hs108|chrX        ( 626) 1809 376.7 2.9e-104
CCDS14497.1 NXF2 gene_id:56001|Hs108|chrX          ( 626) 1809 376.7 2.9e-104
CCDS8037.1 NXF1 gene_id:10482|Hs108|chr11          ( 619) 1199 253.1 4.6e-67
CCDS44629.1 NXF1 gene_id:10482|Hs108|chr11         ( 356)  990 210.7 1.6e-54
CCDS14503.1 NXF3 gene_id:56000|Hs108|chrX          ( 531)  807 173.7 3.3e-43
>>CCDS14491.2 NXF5 gene_id:55998|Hs108|chrX (365 aa)
initn: 2451 init1: 2451 opt: 2451 Z-score: 2691.7 bits: 506.7 E(32554): 1.3e-143
Smith-Waterman score: 2451; 100.0% identity (100.0% similar) in 365 aa overlap (1-365:1-365)
               10        20        30        40        50        60
pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLMNSIQSNCSVPFTPVDFHYIRNRACFFVQVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MRRNTQDENMRKWFKVTIPYGIKYDKAWLMNSIQSNCSVPFTPVDFHYIRNRACFFVQVA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 SAASALKDVSYKIYDDENQKICIFVSHFTAPYSVKNKLKPGQMEMLKLTMNKRYNVSQQA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 SAASALKDVSYKIYDDENQKICIFVSHFTAPYSVKNKLKPGQMEMLKLTMNKRYNVSQQA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 LDLQNLRFDPDLMGRDIDIILNRRNCMAATLKITERNFPELLSLNLCNNKLYQLDGLSDI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 LDLQNLRFDPDLMGRDIDIILNRRNCMAATLKITERNFPELLSLNLCNNKLYQLDGLSDI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 TEKAPKVKTLNLSKNKLESAWELGKVKGLKLEELWLEGNPLCSTFSDQSAYVSAIRDCFP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 TEKAPKVKTLNLSKNKLESAWELGKVKGLKLEELWLEGNPLCSTFSDQSAYVSAIRDCFP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE5 KLLRLDGRELSAPVIVDIDSSETMKPCKENFTGSETLKHLVLQFLQQSNLCKYFKDSRNI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KLLRLDGRELSAPVIVDIDSSETMKPCKENFTGSETLKHLVLQFLQQSNLCKYFKDSRNI
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE5 KILKDPYLQRKLLKHTKCPRNVDSLSALPETQHDFTSILVDMWYQTVNTCFLPRAGPESQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KILKDPYLQRKLLKHTKCPRNVDSLSALPETQHDFTSILVDMWYQTVNTCFLPRAGPESQ
              310       320       330       340       350       360

pF1KE5 SLRPL
       :::::
CCDS14 SLRPL
>>CCDS43979.1 NXF2B gene_id:728343|Hs108|chrX (626 aa)
initn: 2063 init1: 1809 opt: 1809 Z-score: 1985.3 bits: 376.7 E(32554): 2.9e-104
Smith-Waterman score: 1996; 81.5% identity (85.9% similar) in 389 aa overlap (1-351:112-498)
10 20 30
pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLM
: .:::: :.::::::::::::::::::
CCDS43 SIRCERRMKWHSEDEIRITTWRNRKPPERKMSQNTQDGYTRNWFKVTIPYGIKYDKAWLM
90 100 110 120 130 140
40 50 60 70 80 90
pF1KE5 NSIQSNCSVPFTPVDFHYIRNRACFFVQVASAASALKDVSYKIYDDENQKICIFVSHFTA
:::::.:: ::::::::.::::::::: ::::::::::::::::::::::::::.: ::
CCDS43 NSIQSHCSDRFTPVDFHYVRNRACFFVQDASAASALKDVSYKIYDDENQKICIFVNHSTA
150 160 170 180 190 200
100 110 120 130 140 150
pF1KE5 PYSVKNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAAT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 PYSVKNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAAT
210 220 230 240 250 260
160 170 180 190 200 210
pF1KE5 LKITERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLK
::: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LKIIERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLK
270 280 290 300 310 320
220 230 240 250 260 270
pF1KE5 LEELWLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LEELWLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKEN
330 340 350 360 370 380
280 290
pF1KE5 FTGSETLKHLVLQFLQQ------------------------------------SNLCKYF
::::::::::::::::: :.:::::
CCDS43 FTGSETLKHLVLQFLQQYYSIYDSGDRQGLLGAYHDEACFSLAIPFDPKDSAPSSLCKYF
390 400 410 420 430 440
300 310 320 330 340 350
pF1KE5 KDSRNIKILKDPYLQRKLLKHTKCPRN-VDSLSALPETQHDFTSILVDMWYQTVNT-CFL
.::::.: ::::::. .::..:: :. ::::::::.::::..:::::.: :: ::
CCDS43 EDSRNMKTLKDPYLKGELLRRTK--RDIVDSLSALPKTQHDLSSILVDVWCQTERMLCFS
450 460 470 480 490
360
pF1KE5 PRAGPESQSLRPL
CCDS43 VNGVFKEVEGQSQGSVLAFTRTFIATPGSSSSLCIVNDELFVRDASPQETQSAFSIPVST
500 510 520 530 540 550
>>CCDS14497.1 NXF2 gene_id:56001|Hs108|chrX (626 aa)
initn: 2063 init1: 1809 opt: 1809 Z-score: 1985.3 bits: 376.7 E(32554): 2.9e-104
Smith-Waterman score: 1996; 81.5% identity (85.9% similar) in 389 aa overlap (1-351:112-498)
10 20 30
pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLM
: .:::: :.::::::::::::::::::
CCDS14 SIRCERRMKWHSEDEIRITTWRNRKPPERKMSQNTQDGYTRNWFKVTIPYGIKYDKAWLM
90 100 110 120 130 140
40 50 60 70 80 90
pF1KE5 NSIQSNCSVPFTPVDFHYIRNRACFFVQVASAASALKDVSYKIYDDENQKICIFVSHFTA
:::::.:: ::::::::.::::::::: ::::::::::::::::::::::::::.: ::
CCDS14 NSIQSHCSDRFTPVDFHYVRNRACFFVQDASAASALKDVSYKIYDDENQKICIFVNHSTA
150 160 170 180 190 200
100 110 120 130 140 150
pF1KE5 PYSVKNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAAT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 PYSVKNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAAT
210 220 230 240 250 260
160 170 180 190 200 210
pF1KE5 LKITERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLK
::: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 LKIIERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLK
270 280 290 300 310 320
220 230 240 250 260 270
pF1KE5 LEELWLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 LEELWLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKEN
330 340 350 360 370 380
280 290
pF1KE5 FTGSETLKHLVLQFLQQ------------------------------------SNLCKYF
::::::::::::::::: :.:::::
CCDS14 FTGSETLKHLVLQFLQQYYSIYDSGDRQGLLGAYHDEACFSLAIPFDPKDSAPSSLCKYF
390 400 410 420 430 440
300 310 320 330 340 350
pF1KE5 KDSRNIKILKDPYLQRKLLKHTKCPRN-VDSLSALPETQHDFTSILVDMWYQTVNT-CFL
.::::.: ::::::. .::..:: :. ::::::::.::::..:::::.: :: ::
CCDS14 EDSRNMKTLKDPYLKGELLRRTK--RDIVDSLSALPKTQHDLSSILVDVWCQTERMLCFS
450 460 470 480 490
360
pF1KE5 PRAGPESQSLRPL
CCDS14 VNGVFKEVEGQSQGSVLAFTRTFIATPGSSSSLCIVNDELFVRDASPQETQSAFSIPVST
500 510 520 530 540 550
>>CCDS8037.1 NXF1 gene_id:10482|Hs108|chr11 (619 aa)
initn: 1262 init1: 1199 opt: 1199 Z-score: 1317.5 bits: 253.1 E(32554): 4.6e-67
Smith-Waterman score: 1316; 54.7% identity (75.5% similar) in 384 aa overlap (5-351:111-493)
10 20 30
pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLMNSIQ
.:: . ..:::.::::: :::::::.. ::
CCDS80 RRGDTWHDRDRIHVTVRRDRAPPERGGAGTSQDGTSKNWFKITIPYGRKYDKAWLLSMIQ
90 100 110 120 130 140
40 50 60 70 80 90
pF1KE5 SNCSVPFTPVDFHYIRNRACFFVQVASAASALKDVSYKIYDDENQKICIFVSHFTAPYSV
:.:::::::..::: .:: :::. ::.::::: :.::: : ::..: :... . :...
CCDS80 SKCSVPFTPIEFHYENTRAQFFVEDASTASALKAVNYKILDRENRRISIIINSSAPPHTI
150 160 170 180 190 200
100 110 120 130 140 150
pF1KE5 KNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAATLKIT
:.::: :.:.::: :.:::. :::::::..:: ::::....::..::::.::::::.:
CCDS80 LNELKPEQVEQLKLIMSKRYDGSQQALDLKGLRSDPDLVAQNIDVVLNRRSCMAATLRII
210 220 230 240 250 260
160 170 180 190 200 210
pF1KE5 ERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLKLEEL
:.:.:::::::: ::.::.:: .:.:..:::..: :::: :.:.: :: :.::::::::
CCDS80 EENIPELLSLNLSNNRLYRLDDMSSIVQKAPNLKILNLSGNELKSERELDKIKGLKLEEL
270 280 290 300 310 320
220 230 240 250 260 270
pF1KE5 WLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKENFTGS
::.:: ::.:: :::.:.::::. :::::::::.:: :. :... :. ::: .. :.
CCDS80 WLDGNSLCDTFRDQSTYISAIRERFPKLLRLDGHELPPPIAFDVEAPTTLPPCKGSYFGT
330 340 350 360 370 380
280 290
pF1KE5 ETLKHLVLQFLQQ------------------------------------SNLCKYFKDSR
:.:: :::.:::: :.: .::::::
CCDS80 ENLKSLVLHFLQQYYAIYDSGDRQGLLDAYHDGACCSLSIPFIPQNPARSSLAEYFKDSR
390 400 410 420 430 440
300 310 320 330 340 350
pF1KE5 NIKILKDPYLQRKLLKHTKCPRNVDSLSALPETQHDFTSILVDMWYQTVNT-CFLPRAGP
:.: :::: :. .:::::. : :. ::.:::: .:..::. :: . ::
CCDS80 NVKKLKDPTLRFRLLKHTRL-NVVAFLNELPKTQHDVNSFVVDISAQTSTLLCFSVNGVF
450 460 470 480 490
360
pF1KE5 ESQSLRPL
CCDS80 KEVDGKSRDSLRAFTRTFIAVPASNSGLCIVNDELFVRNASSEEIQRAFAMPAPTPSSSP
500 510 520 530 540 550
>>CCDS44629.1 NXF1 gene_id:10482|Hs108|chr11 (356 aa)
initn: 980 init1: 980 opt: 990 Z-score: 1092.3 bits: 210.7 E(32554): 1.6e-54
Smith-Waterman score: 990; 61.5% identity (85.9% similar) in 234 aa overlap (5-238:111-344)
10 20 30
pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLMNSIQ
.:: . ..:::.::::: :::::::.. ::
CCDS44 RRGDTWHDRDRIHVTVRRDRAPPERGGAGTSQDGTSKNWFKITIPYGRKYDKAWLLSMIQ
90 100 110 120 130 140
40 50 60 70 80 90
pF1KE5 SNCSVPFTPVDFHYIRNRACFFVQVASAASALKDVSYKIYDDENQKICIFVSHFTAPYSV
:.:::::::..::: .:: :::. ::.::::: :.::: : ::..: :... . :...
CCDS44 SKCSVPFTPIEFHYENTRAQFFVEDASTASALKAVNYKILDRENRRISIIINSSAPPHTI
150 160 170 180 190 200
100 110 120 130 140 150
pF1KE5 KNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAATLKIT
:.::: :.:.::: :.:::. :::::::..:: ::::....::..::::.::::::.:
CCDS44 LNELKPEQVEQLKLIMSKRYDGSQQALDLKGLRSDPDLVAQNIDVVLNRRSCMAATLRII
210 220 230 240 250 260
160 170 180 190 200 210
pF1KE5 ERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLKLEEL
:.:.:::::::: ::.::.:: .:.:..:::..: :::: :.:.: :: :.::::::::
CCDS44 EENIPELLSLNLSNNRLYRLDDMSSIVQKAPNLKILNLSGNELKSERELDKIKGLKLEEL
270 280 290 300 310 320
220 230 240 250 260 270
pF1KE5 WLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKENFTGS
::.:: ::.:: :::.:. .. :
CCDS44 WLDGNSLCDTFRDQSTYIRSVVACVSPPGDLHPLGG
330 340 350
>>CCDS14503.1 NXF3 gene_id:56000|Hs108|chrX (531 aa)
initn: 710 init1: 577 opt: 807 Z-score: 889.3 bits: 173.7 E(32554): 3.3e-43
Smith-Waterman score: 922; 42.8% identity (62.6% similar) in 388 aa overlap (1-351:101-451)
10 20 30
pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLM
:. : : .. .:::.:.:.::::.. ::.
CCDS14 ISPYNRKGSFRKQDQTHVNMEREQKPPERRMEGNMPDGTLGSWFKITVPFGIKYNEKWLL
80 90 100 110 120 130
40 50 60 70 80 90
pF1KE5 NSIQSNCSVPFTPVDFHYIRNRACFFVQVASAASALKDVSYKIYDDENQKICIFVSHFTA
: ::..:::::.::.::: .: :::. :: : :::.:: ::.:..:.:: :::.
CCDS14 NLIQNECSVPFVPVEFHYENMHASFFVENASIAYALKNVSGKIWDEDNEKISIFVNPAGI
140 150 160 170 180 190
100 110 120 130 140 150
pF1KE5 PYSVKNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAAT
:. :. .:: ..:..::.::.. .:::.:::.: : : ::...:: . : :.::::.
CCDS14 PHFVHRELKSEKVEQIKLAMNQQCDVSQEALDIQRLPFYPDMVNRDTKMASNPRKCMAAS
200 210 220 230 240 250
160 170 180 190 200 210
pF1KE5 LKITERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLK
: . :.:.: ..: : :. : ::..
CCDS14 LDVHEENIPTVMS------------------------------------AGEMDKWKGIE
260 270
220 230 240 250 260 270
pF1KE5 LEELWLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKEN
: . .:.:.:::: :. ...: . ::::: :::.. .. .. . . :: .
CCDS14 PGEKCADRSPVCTTFSDTSSNINSILELFPKLLCLDGQQSPRATLCGTEAHKRLPTCKGS
280 290 300 310 320 330
280 290
pF1KE5 FTGSETLKHLVLQFLQQ------------------------------------SNLCKYF
: ::: ::.:::::::: :..::.:
CCDS14 FFGSEMLKNLVLQFLQQYYLIYDSGDRQGLLSAYHDEACFSLSIPFNPEDSAPSSFCKFF
340 350 360 370 380 390
300 310 320 330 340 350
pF1KE5 KDSRNIKILKDPYLQRKLLKHTKCPRNVDSLSALPETQHDFTSILVDMWYQTV-NTCFLP
::::::::::::::. .:::::: ::::::::.::::..:.:::::::: ::
CCDS14 KDSRNIKILKDPYLRGELLKHTKLD-IVDSLSALPKTQHDLSSFLVDMWYQTEWMLCFSV
400 410 420 430 440 450
360
pF1KE5 RAGPESQSLRPL
CCDS14 NGVFKEVEGQSQGSVLAFTRTFIATPGSSSSLCIVNDKLFVRDTSHQGTQSALFTLVPTA
460 470 480 490 500 510
365 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 01:39:27 2016 done: Tue Nov 8 01:39:28 2016
Total Scan time: 2.220 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]
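
The "The best scores are:" table in the report above is whitespace-delimited, so it can be read into records with a short script. The following is a minimal sketch, not part of the FASTA package: the function name `parse_best_scores`, the regular expression, and the dictionary keys are all illustrative assumptions, and it assumes each hit line ends with `( length) opt bits E-value` as in this report.

```python
import re

# Hypothetical pattern for one hit line of the best-scores table, e.g.
# CCDS14491.2 NXF5 gene_id:55998|Hs108|chrX ( 365) 2451 506.7 1.3e-143
HIT_RE = re.compile(
    r"^(?P<accession>\S+)\s+(?P<description>.*?)\s*"
    r"\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)


def parse_best_scores(report_text):
    """Return one dict per hit line in the best-scores table.

    Assumption: the table starts at the line beginning with
    'The best scores are:' and ends at the first '>>' alignment header.
    """
    hits = []
    in_table = False
    for line in report_text.splitlines():
        if line.startswith("The best scores are:"):
            in_table = True
            continue
        if not in_table:
            continue
        if line.startswith(">>"):
            break  # first alignment header: the table is over
        match = HIT_RE.match(line)
        if match:
            hits.append({
                "accession": match.group("accession"),
                "description": match.group("description"),
                "length": int(match.group("length")),
                "opt": int(match.group("opt")),
                "bits": float(match.group("bits")),
                "evalue": float(match.group("evalue")),
            })
    return hits
```

Because the pattern matches on runs of whitespace rather than fixed columns, it should work whether or not the column padding of the table has been preserved.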