FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3134, 340 aa 1>>>pF1KE3134 340 - 340 aa - 340 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1694+/-0.000837; mu= 17.1250+/- 0.050 mean_var=66.4448+/-13.362, 0's: 0 Z-trim(106.5): 25 B-trim: 63 in 2/48 Lambda= 0.157342 statistics sampled from 9007 (9028) to 9007 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.65), E-opt: 0.2 (0.277), width: 16 Scan time: 2.560 The best scores are: opt bits E(32554) CCDS2927.1 NSUN3 gene_id:63899|Hs108|chr3 ( 340) 2314 534.0 6.5e-152 CCDS57996.1 NSUN4 gene_id:387338|Hs108|chr1 ( 335) 565 137.0 2.1e-32 CCDS534.1 NSUN4 gene_id:387338|Hs108|chr1 ( 384) 565 137.1 2.3e-32 CCDS58202.1 NOP2 gene_id:4839|Hs108|chr12 ( 628) 278 72.0 1.4e-12 CCDS44811.1 NOP2 gene_id:4839|Hs108|chr12 ( 808) 278 72.1 1.8e-12 CCDS58203.1 NOP2 gene_id:4839|Hs108|chr12 ( 812) 278 72.1 1.8e-12 CCDS58204.1 NOP2 gene_id:4839|Hs108|chr12 ( 845) 278 72.1 1.8e-12 >>CCDS2927.1 NSUN3 gene_id:63899|Hs108|chr3 (340 aa) initn: 2314 init1: 2314 opt: 2314 Z-score: 2840.8 bits: 534.0 E(32554): 6.5e-152 Smith-Waterman score: 2314; 100.0% identity (100.0% similar) in 340 aa overlap (1-340:1-340) 10 20 30 40 50 60 pF1KE3 MLTQLKAKSEGKLAKQICKVVLDHFEKQYSKELGDAWNTVREILTSPSCWQYAVLLNRFN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 MLTQLKAKSEGKLAKQICKVVLDHFEKQYSKELGDAWNTVREILTSPSCWQYAVLLNRFN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 YPFELEKDLHLKGYHTLSQGSLPNYPKSVKCYLSRTPGRIPSERHQIGNLKKYYLLNAAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 YPFELEKDLHLKGYHTLSQGSLPNYPKSVKCYLSRTPGRIPSERHQIGNLKKYYLLNAAS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 LLPVLALELRDGEKVLDLCAAPGGKSIALLQCACPGYLHCNEYDSLRLRWLRQTLESFIP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 LLPVLALELRDGEKVLDLCAAPGGKSIALLQCACPGYLHCNEYDSLRLRWLRQTLESFIP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 QPLINVIKVSELDGRKMGDAQPEMFDKVLVDAPCSNDRSWLFSSDSQKASCRISQRRNLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 QPLINVIKVSELDGRKMGDAQPEMFDKVLVDAPCSNDRSWLFSSDSQKASCRISQRRNLP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 LLQIELLRSAIKALRPGGILVYSTCTLSKAENQDVISEILNSHGNIMPMDIKGIARTCSH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 LLQIELLRSAIKALRPGGILVYSTCTLSKAENQDVISEILNSHGNIMPMDIKGIARTCSH 250 260 270 280 290 300 310 320 330 340 pF1KE3 DFTFAPTGQECGLLVIPDKGKAWGPMYVAKLKKSWSTGKW :::::::::::::::::::::::::::::::::::::::: CCDS29 DFTFAPTGQECGLLVIPDKGKAWGPMYVAKLKKSWSTGKW 310 320 330 340 >>CCDS57996.1 NSUN4 gene_id:387338|Hs108|chr1 (335 aa) initn: 534 init1: 206 opt: 565 Z-score: 695.3 bits: 137.0 E(32554): 2.1e-32 Smith-Waterman score: 597; 35.8% identity (63.6% similar) in 335 aa overlap (29-333:3-333) 10 20 30 40 50 pF1KE3 MLTQLKAKSEGKLAKQICKVVLDHFEKQYSKELGDAWNTVREILTSPSCWQYAVLLNRF- :: ..:: : ..: : : . .:..:.: : CCDS57 MTYSVQFGDLWPSIRVSLLSEQ--KYGALVNNFA 10 20 30 60 70 80 90 pF1KE3 ---NYPFELE----KDL---HLKGYHTLSQGSLPNYPK--------SVKCY-LSRTP-GR . .:: ::. .. .. :.:. :. ...:. ..: .: CCDS57 AWDHVSAKLEQLSAKDFVNEAISHWELQSEGGQSAAPSPASWACSPNLRCFTFDRGDISR 40 50 60 70 80 90 100 110 120 130 140 150 pF1KE3 IPSERHQIGNLKKYYLLNAASLLPVLALELRDGEKVLDLCAAPGGKSIALLQCACPGYLH .: : .. .:::..:::::::::: :. :. :::::::::::..:::: .: : CCDS57 FPPARPGSLGVMEYYLMDAASLLPVLALGLQPGDIVLDLCAAPGGKTLALLQTGCCRNLA 100 110 120 130 140 150 160 170 180 190 200 210 pF1KE3 CNEYDSLRLRWLRQTLESFIPQPLI--NVIKVSELDGRKMGDAQPEMFDKVLVDAPCSND :. . :. :.. :.:..:. . : ..:. :::: :. . . .:.::::.::..: CCDS57 ANDLSPSRIARLQKILHSYVPEEIRDGNQVRVTSWDGRKWGELEGDTYDRVLVDVPCTTD 160 170 180 190 200 210 220 230 240 250 260 270 pF1KE3 RSWLFSSDSQ--KASCRISQRRNLPLLQIELLRSAIKALRPGGILVYSTCTLSKAENQDV : : ... : : : ..:. ::.::..:: ... : .::: .:::::.::. .:. : CCDS57 RHSLHEEENNIFKRS-RKKERQILPVLQVQLLAAGLLATKPGGHVVYSTCSLSHLQNEYV 220 230 240 250 260 270 280 290 300 310 320 330 pF1KE3 IS---EILNSHGNIMPM--DIKGIARTCSHDFTFAPTGQECGLLVIPDKGKAWGPMYVAK .. :.: .. .:. . :. . :. : : . : : ::::. .:::: : CCDS57 VQGAIELLANQYSIQVQVEDLTHFRRVFMDTFCFFSSCQ-VGELVIPNLMANFGPMYFCK 280 290 300 310 320 330 340 pF1KE3 LKKSWSTGKW ... CCDS57 MRRLT >>CCDS534.1 NSUN4 gene_id:387338|Hs108|chr1 (384 aa) initn: 534 init1: 206 opt: 565 Z-score: 694.4 bits: 137.1 E(32554): 2.3e-32 Smith-Waterman score: 614; 35.4% identity (64.1% similar) in 345 aa overlap (19-333:42-382) 10 20 30 40 pF1KE3 MLTQLKAKSEGKLAKQICKVVLDHFEKQYSKELGDAWNTVREILTSPS ...:..:. :: ..:: : ..: : : . CCDS53 LLKRVDLATVPRRHRYKKKWAATEPKFPAVRLALQNFDMTYSVQFGDLWPSIRVSLLSEQ 20 30 40 50 60 70 50 60 70 80 pF1KE3 CWQYAVLLNRF----NYPFELE----KDL---HLKGYHTLSQGSLPNYPK--------SV .:..:.: : . .:: ::. .. .. :.:. :. .. CCDS53 --KYGALVNNFAAWDHVSAKLEQLSAKDFVNEAISHWELQSEGGQSAAPSPASWACSPNL 80 90 100 110 120 90 100 110 120 130 140 pF1KE3 KCY-LSRTP-GRIPSERHQIGNLKKYYLLNAASLLPVLALELRDGEKVLDLCAAPGGKSI .:. ..: .:.: : .. .:::..:::::::::: :. :. :::::::::::.. CCDS53 RCFTFDRGDISRFPPARPGSLGVMEYYLMDAASLLPVLALGLQPGDIVLDLCAAPGGKTL 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE3 ALLQCACPGYLHCNEYDSLRLRWLRQTLESFIPQPLI--NVIKVSELDGRKMGDAQPEMF :::: .: : :. . :. :.. :.:..:. . : ..:. :::: :. . . . CCDS53 ALLQTGCCRNLAANDLSPSRIARLQKILHSYVPEEIRDGNQVRVTSWDGRKWGELEGDTY 190 200 210 220 230 240 210 220 230 240 250 260 pF1KE3 DKVLVDAPCSNDRSWLFSSDSQ--KASCRISQRRNLPLLQIELLRSAIKALRPGGILVYS :.::::.::..:: : ... : : : ..:. ::.::..:: ... : .::: .::: CCDS53 DRVLVDVPCTTDRHSLHEEENNIFKRS-RKKERQILPVLQVQLLAAGLLATKPGGHVVYS 250 260 270 280 290 300 270 280 290 300 310 pF1KE3 TCTLSKAENQDVIS---EILNSHGNIMPM--DIKGIARTCSHDFTFAPTGQECGLLVIPD ::.::. .:. :.. :.: .. .:. . :. . :. : : . : : ::::. CCDS53 TCSLSHLQNEYVVQGAIELLANQYSIQVQVEDLTHFRRVFMDTFCFFSSCQ-VGELVIPN 310 320 330 340 350 360 320 330 340 pF1KE3 KGKAWGPMYVAKLKKSWSTGKW .:::: :... CCDS53 LMANFGPMYFCKMRRLT 370 380 >>CCDS58202.1 NOP2 gene_id:4839|Hs108|chr12 (628 aa) initn: 271 init1: 137 opt: 278 Z-score: 339.2 bits: 72.0 E(32554): 1.4e-12 Smith-Waterman score: 278; 31.2% identity (60.7% similar) in 234 aa overlap (113-333:362-582) 90 100 110 120 130 140 pF1KE3 PNYPKSVKCYLSRTPGRIPSERHQIGNLKKYYLLNAASLLPVLALELRDGEKVLDLCAAP :.: .:.:.:::.:: .. :..::.: :: CCDS58 DPLGKWSKTGLVVYDSSVPIGATPEYLAGHYMLQGASSMLPVMALAPQEHERILDMCCAP 340 350 360 370 380 390 150 160 170 180 190 200 pF1KE3 GGKSIALLQCA-CPGYLHCNEYDSLRLRWLRQTLESFIPQPLINVIKVSELDGRKMGDAQ :::. . : : . :. .. ::. . .:. . . :.: .:. :::.. CCDS58 GGKTSYMAQLMKNTGVILANDANAERLKSVVGNLHRL---GVTNTI-ISHYDGRQF---- 400 410 420 430 440 210 220 230 240 250 pF1KE3 PEM---FDKVLVDAPCSNDRSWLFSSDSQKASCRISQRRNL--PLLQIELLRSAIKAL-- :.. ::.::.:::::. . ..:.: . . ... : :: ::: ::: .. CCDS58 PKVVGGFDRVLLDAPCSG--TGVISKDPAVKTNK-DEKDILRCAHLQKELLLSAIDSVNA 450 460 470 480 490 500 260 270 280 290 300 pF1KE3 --RPGGILVYSTCTLSKAENQDVISEILNSHG-NIMP--MDIKGIARTCSHDFTFAPTGQ . :: ::: ::... ::. :.. :.... ..: .:. . : .. : :. . CCDS58 TSKTGGYLVYCTCSITVEENEWVVDYALKKRNVRLVPTGLDFGQEGFTRFRERRFHPSLR 510 520 530 540 550 560 310 320 330 340 pF1KE3 ECGLLVIPDKGKAWGPMYVAKLKKSWSTGKW . : . : ...::.:: CCDS58 STRRFY-PHTHNMDG-FFIAKFKKFSNSIPQSQTDGVLLCRSGWTAVVQSQLIATSTFQV 570 580 590 600 610 >>CCDS44811.1 NOP2 gene_id:4839|Hs108|chr12 (808 aa) initn: 271 init1: 137 opt: 278 Z-score: 337.6 bits: 72.1 E(32554): 1.8e-12 Smith-Waterman score: 278; 31.2% identity (60.7% similar) in 234 aa overlap (113-333:362-582) 90 100 110 120 130 140 pF1KE3 PNYPKSVKCYLSRTPGRIPSERHQIGNLKKYYLLNAASLLPVLALELRDGEKVLDLCAAP :.: .:.:.:::.:: .. :..::.: :: CCDS44 DPLGKWSKTGLVVYDSSVPIGATPEYLAGHYMLQGASSMLPVMALAPQEHERILDMCCAP 340 350 360 370 380 390 150 160 170 180 190 200 pF1KE3 GGKSIALLQCA-CPGYLHCNEYDSLRLRWLRQTLESFIPQPLINVIKVSELDGRKMGDAQ :::. . : : . :. .. ::. . .:. . . :.: .:. :::.. CCDS44 GGKTSYMAQLMKNTGVILANDANAERLKSVVGNLHRL---GVTNTI-ISHYDGRQF---- 400 410 420 430 440 210 220 230 240 250 pF1KE3 PEM---FDKVLVDAPCSNDRSWLFSSDSQKASCRISQRRNL--PLLQIELLRSAIKAL-- :.. ::.::.:::::. . ..:.: . . ... : :: ::: ::: .. CCDS44 PKVVGGFDRVLLDAPCSG--TGVISKDPAVKTNK-DEKDILRCAHLQKELLLSAIDSVNA 450 460 470 480 490 500 260 270 280 290 300 pF1KE3 --RPGGILVYSTCTLSKAENQDVISEILNSHG-NIMP--MDIKGIARTCSHDFTFAPTGQ . :: ::: ::... ::. :.. :.... ..: .:. . : .. : :. . CCDS44 TSKTGGYLVYCTCSITVEENEWVVDYALKKRNVRLVPTGLDFGQEGFTRFRERRFHPSLR 510 520 530 540 550 560 310 320 330 340 pF1KE3 ECGLLVIPDKGKAWGPMYVAKLKKSWSTGKW . : . : ...::.:: CCDS44 STRRFY-PHTHNMDG-FFIAKFKKFSNSIPQSQTGNSETATPTNVDLPQVIPKSENSSQP 570 580 590 600 610 >>CCDS58203.1 NOP2 gene_id:4839|Hs108|chr12 (812 aa) initn: 271 init1: 137 opt: 278 Z-score: 337.6 bits: 72.1 E(32554): 1.8e-12 Smith-Waterman score: 278; 31.2% identity (60.7% similar) in 234 aa overlap (113-333:366-586) 90 100 110 120 130 140 pF1KE3 PNYPKSVKCYLSRTPGRIPSERHQIGNLKKYYLLNAASLLPVLALELRDGEKVLDLCAAP :.: .:.:.:::.:: .. :..::.: :: CCDS58 DPLGKWSKTGLVVYDSSVPIGATPEYLAGHYMLQGASSMLPVMALAPQEHERILDMCCAP 340 350 360 370 380 390 150 160 170 180 190 200 pF1KE3 GGKSIALLQCA-CPGYLHCNEYDSLRLRWLRQTLESFIPQPLINVIKVSELDGRKMGDAQ :::. . : : . :. .. ::. . .:. . . :.: .:. :::.. CCDS58 GGKTSYMAQLMKNTGVILANDANAERLKSVVGNLHRL---GVTNTI-ISHYDGRQF---- 400 410 420 430 440 210 220 230 240 250 pF1KE3 PEM---FDKVLVDAPCSNDRSWLFSSDSQKASCRISQRRNL--PLLQIELLRSAIKAL-- :.. ::.::.:::::. . ..:.: . . ... : :: ::: ::: .. CCDS58 PKVVGGFDRVLLDAPCSG--TGVISKDPAVKTNK-DEKDILRCAHLQKELLLSAIDSVNA 450 460 470 480 490 500 260 270 280 290 300 pF1KE3 --RPGGILVYSTCTLSKAENQDVISEILNSHG-NIMP--MDIKGIARTCSHDFTFAPTGQ . :: ::: ::... ::. :.. :.... ..: .:. . : .. : :. . CCDS58 TSKTGGYLVYCTCSITVEENEWVVDYALKKRNVRLVPTGLDFGQEGFTRFRERRFHPSLR 510 520 530 540 550 560 310 320 330 340 pF1KE3 ECGLLVIPDKGKAWGPMYVAKLKKSWSTGKW . : . : ...::.:: CCDS58 STRRFY-PHTHNMDG-FFIAKFKKFSNSIPQSQTGNSETATPTNVDLPQVIPKSENSSQP 570 580 590 600 610 620 >>CCDS58204.1 NOP2 gene_id:4839|Hs108|chr12 (845 aa) initn: 271 init1: 137 opt: 278 Z-score: 337.3 bits: 72.1 E(32554): 1.8e-12 Smith-Waterman score: 278; 31.2% identity (60.7% similar) in 234 aa overlap (113-333:399-619) 90 100 110 120 130 140 pF1KE3 PNYPKSVKCYLSRTPGRIPSERHQIGNLKKYYLLNAASLLPVLALELRDGEKVLDLCAAP :.: .:.:.:::.:: .. :..::.: :: CCDS58 DPLGKWSKTGLVVYDSSVPIGATPEYLAGHYMLQGASSMLPVMALAPQEHERILDMCCAP 370 380 390 400 410 420 150 160 170 180 190 200 pF1KE3 GGKSIALLQCA-CPGYLHCNEYDSLRLRWLRQTLESFIPQPLINVIKVSELDGRKMGDAQ :::. . : : . :. .. ::. . .:. . . :.: .:. :::.. CCDS58 GGKTSYMAQLMKNTGVILANDANAERLKSVVGNLHRL---GVTNTI-ISHYDGRQF---- 430 440 450 460 470 480 210 220 230 240 250 pF1KE3 PEM---FDKVLVDAPCSNDRSWLFSSDSQKASCRISQRRNL--PLLQIELLRSAIKAL-- :.. ::.::.:::::. . ..:.: . . ... : :: ::: ::: .. CCDS58 PKVVGGFDRVLLDAPCSG--TGVISKDPAVKTNK-DEKDILRCAHLQKELLLSAIDSVNA 490 500 510 520 530 260 270 280 290 300 pF1KE3 --RPGGILVYSTCTLSKAENQDVISEILNSHG-NIMP--MDIKGIARTCSHDFTFAPTGQ . :: ::: ::... ::. :.. :.... ..: .:. . : .. : :. . CCDS58 TSKTGGYLVYCTCSITVEENEWVVDYALKKRNVRLVPTGLDFGQEGFTRFRERRFHPSLR 540 550 560 570 580 590 310 320 330 340 pF1KE3 ECGLLVIPDKGKAWGPMYVAKLKKSWSTGKW . : . : ...::.:: CCDS58 STRRFY-PHTHNMDG-FFIAKFKKFSNSIPQSQTGNSETATPTNVDLPQVIPKSENSSQP 600 610 620 630 640 650 340 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 03:21:52 2016 done: Mon Nov 7 03:21:53 2016 Total Scan time: 2.560 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]