FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5542, 365 aa 1>>>pF1KE5542 365 - 365 aa - 365 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0087+/-0.00109; mu= 12.9234+/- 0.065 mean_var=83.4215+/-16.615, 0's: 0 Z-trim(104.2): 38 B-trim: 0 in 0/51 Lambda= 0.140422 statistics sampled from 7741 (7765) to 7741 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.619), E-opt: 0.2 (0.239), width: 16 Scan time: 2.220 The best scores are: opt bits E(32554) CCDS14491.2 NXF5 gene_id:55998|Hs108|chrX ( 365) 2451 506.7 1.3e-143 CCDS43979.1 NXF2B gene_id:728343|Hs108|chrX ( 626) 1809 376.7 2.9e-104 CCDS14497.1 NXF2 gene_id:56001|Hs108|chrX ( 626) 1809 376.7 2.9e-104 CCDS8037.1 NXF1 gene_id:10482|Hs108|chr11 ( 619) 1199 253.1 4.6e-67 CCDS44629.1 NXF1 gene_id:10482|Hs108|chr11 ( 356) 990 210.7 1.6e-54 CCDS14503.1 NXF3 gene_id:56000|Hs108|chrX ( 531) 807 173.7 3.3e-43 >>CCDS14491.2 NXF5 gene_id:55998|Hs108|chrX (365 aa) initn: 2451 init1: 2451 opt: 2451 Z-score: 2691.7 bits: 506.7 E(32554): 1.3e-143 Smith-Waterman score: 2451; 100.0% identity (100.0% similar) in 365 aa overlap (1-365:1-365) 10 20 30 40 50 60 pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLMNSIQSNCSVPFTPVDFHYIRNRACFFVQVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MRRNTQDENMRKWFKVTIPYGIKYDKAWLMNSIQSNCSVPFTPVDFHYIRNRACFFVQVA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 SAASALKDVSYKIYDDENQKICIFVSHFTAPYSVKNKLKPGQMEMLKLTMNKRYNVSQQA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 SAASALKDVSYKIYDDENQKICIFVSHFTAPYSVKNKLKPGQMEMLKLTMNKRYNVSQQA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 LDLQNLRFDPDLMGRDIDIILNRRNCMAATLKITERNFPELLSLNLCNNKLYQLDGLSDI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 LDLQNLRFDPDLMGRDIDIILNRRNCMAATLKITERNFPELLSLNLCNNKLYQLDGLSDI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 TEKAPKVKTLNLSKNKLESAWELGKVKGLKLEELWLEGNPLCSTFSDQSAYVSAIRDCFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 TEKAPKVKTLNLSKNKLESAWELGKVKGLKLEELWLEGNPLCSTFSDQSAYVSAIRDCFP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 KLLRLDGRELSAPVIVDIDSSETMKPCKENFTGSETLKHLVLQFLQQSNLCKYFKDSRNI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 KLLRLDGRELSAPVIVDIDSSETMKPCKENFTGSETLKHLVLQFLQQSNLCKYFKDSRNI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 KILKDPYLQRKLLKHTKCPRNVDSLSALPETQHDFTSILVDMWYQTVNTCFLPRAGPESQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 KILKDPYLQRKLLKHTKCPRNVDSLSALPETQHDFTSILVDMWYQTVNTCFLPRAGPESQ 310 320 330 340 350 360 pF1KE5 SLRPL ::::: CCDS14 SLRPL >>CCDS43979.1 NXF2B gene_id:728343|Hs108|chrX (626 aa) initn: 2063 init1: 1809 opt: 1809 Z-score: 1985.3 bits: 376.7 E(32554): 2.9e-104 Smith-Waterman score: 1996; 81.5% identity (85.9% similar) in 389 aa overlap (1-351:112-498) 10 20 30 pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLM : .:::: :.:::::::::::::::::: CCDS43 SIRCERRMKWHSEDEIRITTWRNRKPPERKMSQNTQDGYTRNWFKVTIPYGIKYDKAWLM 90 100 110 120 130 140 40 50 60 70 80 90 pF1KE5 NSIQSNCSVPFTPVDFHYIRNRACFFVQVASAASALKDVSYKIYDDENQKICIFVSHFTA :::::.:: ::::::::.::::::::: ::::::::::::::::::::::::::.: :: CCDS43 NSIQSHCSDRFTPVDFHYVRNRACFFVQDASAASALKDVSYKIYDDENQKICIFVNHSTA 150 160 170 180 190 200 100 110 120 130 140 150 pF1KE5 PYSVKNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 PYSVKNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAAT 210 220 230 240 250 260 160 170 180 190 200 210 pF1KE5 LKITERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLK ::: :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 LKIIERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLK 270 280 290 300 310 320 220 230 240 250 260 270 pF1KE5 LEELWLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 LEELWLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKEN 330 340 350 360 370 380 280 290 pF1KE5 FTGSETLKHLVLQFLQQ------------------------------------SNLCKYF ::::::::::::::::: :.::::: CCDS43 FTGSETLKHLVLQFLQQYYSIYDSGDRQGLLGAYHDEACFSLAIPFDPKDSAPSSLCKYF 390 400 410 420 430 440 300 310 320 330 340 350 pF1KE5 KDSRNIKILKDPYLQRKLLKHTKCPRN-VDSLSALPETQHDFTSILVDMWYQTVNT-CFL .::::.: ::::::. .::..:: :. ::::::::.::::..:::::.: :: :: CCDS43 EDSRNMKTLKDPYLKGELLRRTK--RDIVDSLSALPKTQHDLSSILVDVWCQTERMLCFS 450 460 470 480 490 360 pF1KE5 PRAGPESQSLRPL CCDS43 VNGVFKEVEGQSQGSVLAFTRTFIATPGSSSSLCIVNDELFVRDASPQETQSAFSIPVST 500 510 520 530 540 550 >>CCDS14497.1 NXF2 gene_id:56001|Hs108|chrX (626 aa) initn: 2063 init1: 1809 opt: 1809 Z-score: 1985.3 bits: 376.7 E(32554): 2.9e-104 Smith-Waterman score: 1996; 81.5% identity (85.9% similar) in 389 aa overlap (1-351:112-498) 10 20 30 pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLM : .:::: :.:::::::::::::::::: CCDS14 SIRCERRMKWHSEDEIRITTWRNRKPPERKMSQNTQDGYTRNWFKVTIPYGIKYDKAWLM 90 100 110 120 130 140 40 50 60 70 80 90 pF1KE5 NSIQSNCSVPFTPVDFHYIRNRACFFVQVASAASALKDVSYKIYDDENQKICIFVSHFTA :::::.:: ::::::::.::::::::: ::::::::::::::::::::::::::.: :: CCDS14 NSIQSHCSDRFTPVDFHYVRNRACFFVQDASAASALKDVSYKIYDDENQKICIFVNHSTA 150 160 170 180 190 200 100 110 120 130 140 150 pF1KE5 PYSVKNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 PYSVKNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAAT 210 220 230 240 250 260 160 170 180 190 200 210 pF1KE5 LKITERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLK ::: :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 LKIIERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLK 270 280 290 300 310 320 220 230 240 250 260 270 pF1KE5 LEELWLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 LEELWLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKEN 330 340 350 360 370 380 280 290 pF1KE5 FTGSETLKHLVLQFLQQ------------------------------------SNLCKYF ::::::::::::::::: :.::::: CCDS14 FTGSETLKHLVLQFLQQYYSIYDSGDRQGLLGAYHDEACFSLAIPFDPKDSAPSSLCKYF 390 400 410 420 430 440 300 310 320 330 340 350 pF1KE5 KDSRNIKILKDPYLQRKLLKHTKCPRN-VDSLSALPETQHDFTSILVDMWYQTVNT-CFL .::::.: ::::::. .::..:: :. ::::::::.::::..:::::.: :: :: CCDS14 EDSRNMKTLKDPYLKGELLRRTK--RDIVDSLSALPKTQHDLSSILVDVWCQTERMLCFS 450 460 470 480 490 360 pF1KE5 PRAGPESQSLRPL CCDS14 VNGVFKEVEGQSQGSVLAFTRTFIATPGSSSSLCIVNDELFVRDASPQETQSAFSIPVST 500 510 520 530 540 550 >>CCDS8037.1 NXF1 gene_id:10482|Hs108|chr11 (619 aa) initn: 1262 init1: 1199 opt: 1199 Z-score: 1317.5 bits: 253.1 E(32554): 4.6e-67 Smith-Waterman score: 1316; 54.7% identity (75.5% similar) in 384 aa overlap (5-351:111-493) 10 20 30 pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLMNSIQ .:: . ..:::.::::: :::::::.. :: CCDS80 RRGDTWHDRDRIHVTVRRDRAPPERGGAGTSQDGTSKNWFKITIPYGRKYDKAWLLSMIQ 90 100 110 120 130 140 40 50 60 70 80 90 pF1KE5 SNCSVPFTPVDFHYIRNRACFFVQVASAASALKDVSYKIYDDENQKICIFVSHFTAPYSV :.:::::::..::: .:: :::. ::.::::: :.::: : ::..: :... . :... CCDS80 SKCSVPFTPIEFHYENTRAQFFVEDASTASALKAVNYKILDRENRRISIIINSSAPPHTI 150 160 170 180 190 200 100 110 120 130 140 150 pF1KE5 KNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAATLKIT :.::: :.:.::: :.:::. :::::::..:: ::::....::..::::.::::::.: CCDS80 LNELKPEQVEQLKLIMSKRYDGSQQALDLKGLRSDPDLVAQNIDVVLNRRSCMAATLRII 210 220 230 240 250 260 160 170 180 190 200 210 pF1KE5 ERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLKLEEL :.:.:::::::: ::.::.:: .:.:..:::..: :::: :.:.: :: :.:::::::: CCDS80 EENIPELLSLNLSNNRLYRLDDMSSIVQKAPNLKILNLSGNELKSERELDKIKGLKLEEL 270 280 290 300 310 320 220 230 240 250 260 270 pF1KE5 WLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKENFTGS ::.:: ::.:: :::.:.::::. :::::::::.:: :. :... :. ::: .. :. CCDS80 WLDGNSLCDTFRDQSTYISAIRERFPKLLRLDGHELPPPIAFDVEAPTTLPPCKGSYFGT 330 340 350 360 370 380 280 290 pF1KE5 ETLKHLVLQFLQQ------------------------------------SNLCKYFKDSR :.:: :::.:::: :.: .:::::: CCDS80 ENLKSLVLHFLQQYYAIYDSGDRQGLLDAYHDGACCSLSIPFIPQNPARSSLAEYFKDSR 390 400 410 420 430 440 300 310 320 330 340 350 pF1KE5 NIKILKDPYLQRKLLKHTKCPRNVDSLSALPETQHDFTSILVDMWYQTVNT-CFLPRAGP :.: :::: :. .:::::. : :. ::.:::: .:..::. :: . :: CCDS80 NVKKLKDPTLRFRLLKHTRL-NVVAFLNELPKTQHDVNSFVVDISAQTSTLLCFSVNGVF 450 460 470 480 490 360 pF1KE5 ESQSLRPL CCDS80 KEVDGKSRDSLRAFTRTFIAVPASNSGLCIVNDELFVRNASSEEIQRAFAMPAPTPSSSP 500 510 520 530 540 550 >>CCDS44629.1 NXF1 gene_id:10482|Hs108|chr11 (356 aa) initn: 980 init1: 980 opt: 990 Z-score: 1092.3 bits: 210.7 E(32554): 1.6e-54 Smith-Waterman score: 990; 61.5% identity (85.9% similar) in 234 aa overlap (5-238:111-344) 10 20 30 pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLMNSIQ .:: . ..:::.::::: :::::::.. :: CCDS44 RRGDTWHDRDRIHVTVRRDRAPPERGGAGTSQDGTSKNWFKITIPYGRKYDKAWLLSMIQ 90 100 110 120 130 140 40 50 60 70 80 90 pF1KE5 SNCSVPFTPVDFHYIRNRACFFVQVASAASALKDVSYKIYDDENQKICIFVSHFTAPYSV :.:::::::..::: .:: :::. ::.::::: :.::: : ::..: :... . :... CCDS44 SKCSVPFTPIEFHYENTRAQFFVEDASTASALKAVNYKILDRENRRISIIINSSAPPHTI 150 160 170 180 190 200 100 110 120 130 140 150 pF1KE5 KNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAATLKIT :.::: :.:.::: :.:::. :::::::..:: ::::....::..::::.::::::.: CCDS44 LNELKPEQVEQLKLIMSKRYDGSQQALDLKGLRSDPDLVAQNIDVVLNRRSCMAATLRII 210 220 230 240 250 260 160 170 180 190 200 210 pF1KE5 ERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLKLEEL :.:.:::::::: ::.::.:: .:.:..:::..: :::: :.:.: :: :.:::::::: CCDS44 EENIPELLSLNLSNNRLYRLDDMSSIVQKAPNLKILNLSGNELKSERELDKIKGLKLEEL 270 280 290 300 310 320 220 230 240 250 260 270 pF1KE5 WLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKENFTGS ::.:: ::.:: :::.:. .. : CCDS44 WLDGNSLCDTFRDQSTYIRSVVACVSPPGDLHPLGG 330 340 350 >>CCDS14503.1 NXF3 gene_id:56000|Hs108|chrX (531 aa) initn: 710 init1: 577 opt: 807 Z-score: 889.3 bits: 173.7 E(32554): 3.3e-43 Smith-Waterman score: 922; 42.8% identity (62.6% similar) in 388 aa overlap (1-351:101-451) 10 20 30 pF1KE5 MRRNTQDENMRKWFKVTIPYGIKYDKAWLM :. : : .. .:::.:.:.::::.. ::. CCDS14 ISPYNRKGSFRKQDQTHVNMEREQKPPERRMEGNMPDGTLGSWFKITVPFGIKYNEKWLL 80 90 100 110 120 130 40 50 60 70 80 90 pF1KE5 NSIQSNCSVPFTPVDFHYIRNRACFFVQVASAASALKDVSYKIYDDENQKICIFVSHFTA : ::..:::::.::.::: .: :::. :: : :::.:: ::.:..:.:: :::. CCDS14 NLIQNECSVPFVPVEFHYENMHASFFVENASIAYALKNVSGKIWDEDNEKISIFVNPAGI 140 150 160 170 180 190 100 110 120 130 140 150 pF1KE5 PYSVKNKLKPGQMEMLKLTMNKRYNVSQQALDLQNLRFDPDLMGRDIDIILNRRNCMAAT :. :. .:: ..:..::.::.. .:::.:::.: : : ::...:: . : :.::::. CCDS14 PHFVHRELKSEKVEQIKLAMNQQCDVSQEALDIQRLPFYPDMVNRDTKMASNPRKCMAAS 200 210 220 230 240 250 160 170 180 190 200 210 pF1KE5 LKITERNFPELLSLNLCNNKLYQLDGLSDITEKAPKVKTLNLSKNKLESAWELGKVKGLK : . :.:.: ..: : :. : ::.. CCDS14 LDVHEENIPTVMS------------------------------------AGEMDKWKGIE 260 270 220 230 240 250 260 270 pF1KE5 LEELWLEGNPLCSTFSDQSAYVSAIRDCFPKLLRLDGRELSAPVIVDIDSSETMKPCKEN : . .:.:.:::: :. ...: . ::::: :::.. .. .. . . :: . CCDS14 PGEKCADRSPVCTTFSDTSSNINSILELFPKLLCLDGQQSPRATLCGTEAHKRLPTCKGS 280 290 300 310 320 330 280 290 pF1KE5 FTGSETLKHLVLQFLQQ------------------------------------SNLCKYF : ::: ::.:::::::: :..::.: CCDS14 FFGSEMLKNLVLQFLQQYYLIYDSGDRQGLLSAYHDEACFSLSIPFNPEDSAPSSFCKFF 340 350 360 370 380 390 300 310 320 330 340 350 pF1KE5 KDSRNIKILKDPYLQRKLLKHTKCPRNVDSLSALPETQHDFTSILVDMWYQTV-NTCFLP ::::::::::::::. .:::::: ::::::::.::::..:.:::::::: :: CCDS14 KDSRNIKILKDPYLRGELLKHTKLD-IVDSLSALPKTQHDLSSFLVDMWYQTEWMLCFSV 400 410 420 430 440 450 360 pF1KE5 RAGPESQSLRPL CCDS14 NGVFKEVEGQSQGSVLAFTRTFIATPGSSSSLCIVNDKLFVRDTSHQGTQSALFTLVPTA 460 470 480 490 500 510 365 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 01:39:27 2016 done: Tue Nov 8 01:39:28 2016 Total Scan time: 2.220 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]