FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6306, 322 aa
  1>>>pF1KE6306 322 - 322 aa - 322 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.1320+/-0.000888; mu= 16.5375+/- 0.053
 mean_var=60.3805+/-12.192, 0's: 0 Z-trim(104.1): 20 B-trim: 0 in 0/52
 Lambda= 0.165054
 statistics sampled from 7747 (7755) to 7747 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.606), E-opt: 0.2 (0.238), width: 16
 Scan time: 2.090

The best scores are:                                  opt  bits E(32554)
CCDS7539.1 SFXN2 gene_id:118980|Hs108|chr10  ( 322) 2150 520.5 6.8e-148
CCDS4394.1 SFXN1 gene_id:94081|Hs108|chr5    ( 322) 1192 292.4 3.2e-79
CCDS7508.2 SFXN3 gene_id:81855|Hs108|chr10   ( 325) 1164 285.7 3.3e-77
CCDS1922.1 SFXN5 gene_id:94097|Hs108|chr2    ( 340)  761 189.8 2.6e-48
CCDS82469.1 SFXN5 gene_id:94097|Hs108|chr2   ( 253)  522 132.8 2.8e-31
CCDS7610.1 SFXN4 gene_id:119559|Hs108|chr10  ( 337)  283  76.0 4.8e-14

>>CCDS7539.1 SFXN2 gene_id:118980|Hs108|chr10 (322 aa)
 initn: 2150 init1: 2150 opt: 2150 Z-score: 2768.6 bits: 520.5 E(32554): 6.8e-148
Smith-Waterman score: 2150; 100.0% identity (100.0% similar) in 322 aa overlap (1-322:1-322)

               10        20        30        40        50        60
pF1KE6 MEADLSGFNIDAPRWDQRTFLGRVKHFLNITDPRTVFVSERELDWAKVMVEKSRMGVVPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 MEADLSGFNIDAPRWDQRTFLGRVKHFLNITDPRTVFVSERELDWAKVMVEKSRMGVVPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 GTQVEQLLYAKKLYDSAFHPDTGEKMNVIGRMSFQLPGGMIITGFMLQFYRTMPAVIFWQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 GTQVEQLLYAKKLYDSAFHPDTGEKMNVIGRMSFQLPGGMIITGFMLQFYRTMPAVIFWQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 WVNQSFNALVNYTNRNAASPTSVRQMALSYFTATTTAVATAVGMNMLTKKAPPLVGRWVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75
WVNQSFNALVNYTNRNAASPTSVRQMALSYFTATTTAVATAVGMNMLTKKAPPLVGRWVP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 FAAVAAANCVNIPMMRQQELIKGICVKDRNENEIGHSRRAAAIGITQVVISRITMSAPGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 FAAVAAANCVNIPMMRQQELIKGICVKDRNENEIGHSRRAAAIGITQVVISRITMSAPGM 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 ILLPVIMERLEKLHFMQKVKVLHAPLQVMLSGCFLIFMVPVACGLFPQKCELPVSYLEPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 ILLPVIMERLEKLHFMQKVKVLHAPLQVMLSGCFLIFMVPVACGLFPQKCELPVSYLEPK 250 260 270 280 290 300 310 320 pF1KE6 LQDTIKAKYGELEPYVYFNKGL :::::::::::::::::::::: CCDS75 LQDTIKAKYGELEPYVYFNKGL 310 320 >>CCDS4394.1 SFXN1 gene_id:94081|Hs108|chr5 (322 aa) initn: 1191 init1: 1158 opt: 1192 Z-score: 1535.8 bits: 292.4 E(32554): 3.2e-79 Smith-Waterman score: 1192; 56.1% identity (79.3% similar) in 314 aa overlap (9-322:10-322) 10 20 30 40 50 pF1KE6 MEADLSGFNIDAPRWDQRTFLGRVKHFLNITDPRTVFVSERELDWAKVMVEKSRMGVVP :: ::::: ::.::..::...::::........:. :. .:. :.:.:: CCDS43 MSGELPPNINIKEPRWDQSTFIGRANHFFTVTDPRNILLTNEQLESARKIVHDYRQGIVP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 PGTQVEQLLYAKKLYDSAFHPDTGEKMNVIGRMSFQLPGGMIITGFMLQFYRTMPAVIFW :: ..: :: .::::::::::::: .::::: :.: .: ::: :. :::: :::.:: CCDS43 PGLTENELWRAKYIYDSAFHPDTGEKMILIGRMSAQVPMNMTITGCMMTFYRTTPAVLFW 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE6 QWVNQSFNALVNYTNRNAASPTSVRQMALSYFTATTTAVATAVGMNMLTKKAPPLVGRWV ::.::::::.::::::.. .: .: ... .: .::: :::::.:.: :::.. ::.::.: CCDS43 QWINQSFNAVVNYTNRSGDAPLTVNELGTAYVSATTGAVATALGLNALTKHVSPLIGRFV 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE6 PFAAVAAANCVNIPMMRQQELIKGICVKDRNENEIGHSRRAAAIGITQVVISRITMSAPG ::::::::::.:::.:::.:: :: : :.: :..:.: :: .:::::.::: :.::: CCDS43 PFAAVAAANCINIPLMRQRELKVGIPVTDENGNRLGESANAAKQAITQVVVSRILMAAPG 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE6 MILLPVIMERLEKLHFMQKVKVLHAPLQVMLSGCFLIFMVPVACGLFPQKCELPVSYLEP : . : ::. ::: :... . ::.:: : : :.: .:. :.::::: . :. 
:: CCDS43 MAIPPFIMNTLEKKAFLKRFPWMSAPIQVGLVGFCLVFATPLCCALFPQKSSMSVTSLEA 250 260 270 280 290 300 300 310 320 pF1KE6 KLQDTIKAKYGELEPYVYFNKGL .:: :. .. ::. ::::::: CCDS43 ELQAKIQESHPELR-RVYFNKGL 310 320 >>CCDS7508.2 SFXN3 gene_id:81855|Hs108|chr10 (325 aa) initn: 1161 init1: 1132 opt: 1164 Z-score: 1499.7 bits: 285.7 E(32554): 3.3e-77 Smith-Waterman score: 1164; 56.1% identity (77.7% similar) in 314 aa overlap (9-322:13-325) 10 20 30 40 50 pF1KE6 MEADLSGFNIDAPRWDQRTFLGRVKHFLNITDPRTVFVSERELDWAKVMVEKSRMG ::. ::::: :::::..::...::::....: .:. .. .:.. : : CCDS75 MESKMGELPLDINIQEPRWDQSTFLGRARHFFTVTDPRNLLLSGAQLEASRNIVQNYRAG 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 VVPPGTQVEQLLYAKKLYDSAFHPDTGEKMNVIGRMSFQLPGGMIITGFMLQFYRTMPAV :: :: .:: :: .::::::::::::. .::::: :.: .: ::: :: ::: :.: CCDS75 VVTPGITEDQLWRAKYVYDSAFHPDTGEKVVLIGRMSAQVPMNMTITGCMLTFYRKTPTV 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE6 IFWQWVNQSFNALVNYTNRNAASPTSVRQMALSYFTATTTAVATAVGMNMLTKKAPPLVG .:::::::::::.:::.::.. .: .:::.. .: .::: :::::.:.. :::. ::::: CCDS75 VFWQWVNQSFNAIVNYSNRSGDTPITVRQLGTAYVSATTGAVATALGLKSLTKHLPPLVG 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE6 RWVPFAAVAAANCVNIPMMRQQELIKGICVKDRNENEIGHSRRAAAIGITQVVISRITMS :.:::::::::::.:::.:::.:: :: : :. ...:.: :: :: ::::::: :. CCDS75 RFVPFAAVAAANCINIPLMRQRELQVGIPVADEAGQRLGYSVTAAKQGIFQVVISRICMA 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE6 APGMILLPVIMERLEKLHFMQKVKVLHAPLQVMLSGCFLIFMVPVACGLFPQKCELPVSY :.: . :.::. ::: :... : ::::: : : :.: .:. :.::::: . .: CCDS75 IPAMAIPPLIMDTLEKKDFLKRRPWLGAPLQVGLVGFCLVFATPLCCALFPQKSSIHISN 250 260 270 280 290 300 300 310 320 pF1KE6 LEPKLQDTIKAKYGELEPYVYFNKGL :::.:. :. . .: ::.:::: CCDS75 LEPELRAQIHEQNPSVE-VVYYNKGL 310 320 >>CCDS1922.1 SFXN5 gene_id:94097|Hs108|chr2 (340 aa) initn: 756 init1: 474 opt: 761 Z-score: 980.8 bits: 189.8 E(32554): 2.6e-48 Smith-Waterman score: 761; 39.1% identity (67.6% similar) in 330 aa overlap (3-322:20-340) 10 20 30 40 pF1KE6 MEADLSGFNIDAPRWDQRTFLGRVKHFLNITDPRTVFVSEREL .: :.. 
::..: .: :: .:::.: ::::.::.::.: CCDS19 MADTATTASAAAASAASASSDAPPFQLGKPRFQQTSFYGRFRHFLDIIDPRTLFVTERRL 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 DWAKVMVEKSRMGVVPPGTQVEQLLYAKKLYDSAFHPDTGEKMNVIGRMSFQLPGGM-II : ..: . :.. ::. ::: :.:. .. .::::.::. . ::: .: : :. CCDS19 REAVQLLEDYKHGTLRPGVTNEQLWSAQKIKQAILHPDTNEKIFMPFRMSGYIPFGTPIV 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 TGFMLQFYRTMPAVIFWQWVNQSFNALVNYTNRNAASPTSVRQMALSYFTATTTAVATAV .:..: .:. ...::::.::: :: :::.::::..:. . .. .:. :. .::. :: CCDS19 VGLLLP-NQTLASTVFWQWLNQSHNACVNYANRNATKPSPASKFIQGYLGAVISAVSIAV 130 140 150 160 170 170 180 190 200 210 pF1KE6 GMNMLTKKA---PP----LVGRWVPFAAVAAANCVNIPMMRQQELIKGICVKDRNENEIG :.:.:..:: : :. :.::: :::.:: :. .:: :: .:: : : . : .: CCDS19 GLNVLVQKANKFTPATRLLIQRFVPFPAVASANICNVVLMRYGELEEGIDVLDSDGNLVG 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE6 HSRRAAAIGITQVVISRITMSAPGMILLPVIMERLEKLHFMQKVKVLHAPLQVMLSGCFL :. :: .. .....:... : ..: :..: ::: ..: : :.: .. :. CCDS19 SSKIAARHALLETALTRVVLPMPILVLPPIVMSMLEKTALLQARPRLLLPVQSLV--CLA 240 250 260 270 280 290 280 290 300 310 320 pF1KE6 IF--MVPVACGLFPQKCELPVSYLEPKLQDTIKAKYGELEPYVYFNKGL : .:.: .:::: :. .: :::.. .. ... : .:::: CCDS19 AFGLALPLAISLFPQMSEIETSQLEPEIAQATSSRT------VVYNKGL 300 310 320 330 340 >>CCDS82469.1 SFXN5 gene_id:94097|Hs108|chr2 (253 aa) initn: 509 init1: 474 opt: 522 Z-score: 675.1 bits: 132.8 E(32554): 2.8e-31 Smith-Waterman score: 522; 42.2% identity (70.9% similar) in 199 aa overlap (3-193:20-217) 10 20 30 40 pF1KE6 MEADLSGFNIDAPRWDQRTFLGRVKHFLNITDPRTVFVSEREL .: :.. ::..: .: :: .:::.: ::::.::.::.: CCDS82 MADTATTASAAAASAASASSDAPPFQLGKPRFQQTSFYGRFRHFLDIIDPRTLFVTERRL 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 DWAKVMVEKSRMGVVPPGTQVEQLLYAKKLYDSAFHPDTGEKMNVIGRMSFQLPGGM-II : ..: . :.. ::. ::: :.:. .. .::::.::. . ::: .: : :. CCDS82 REAVQLLEDYKHGTLRPGVTNEQLWSAQKIKQAILHPDTNEKIFMPFRMSGYIPFGTPIV 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 TGFMLQFYRTMPAVIFWQWVNQSFNALVNYTNRNAASPTSVRQMALSYFTATTTAVATAV .:..: .:. 
...::::.::: :: :::.::::..:. . .. .:. :. .::. :: CCDS82 VGLLLP-NQTLASTVFWQWLNQSHNACVNYANRNATKPSPASKFIQGYLGAVISAVSIAV 130 140 150 160 170 170 180 190 200 210 pF1KE6 GMNMLTKKA---PP----LVGRWVPFAAVAAANCVNIPMMRQQELIKGICVKDRNENEIG :.:.:..:: : :. :.::: ::. .: . : CCDS82 GLNVLVQKANKFTPATRLLIQRFVPFPAVGRLSCRHAPGCSSLCKASCAWQPSAWPCRWP 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE6 HSRRAAAIGITQVVISRITMSAPGMILLPVIMERLEKLHFMQKVKVLHAPLQVMLSGCFL CCDS82 SASSRKCQRLKHPN 240 250 >>CCDS7610.1 SFXN4 gene_id:119559|Hs108|chr10 (337 aa) initn: 220 init1: 144 opt: 283 Z-score: 365.7 bits: 76.0 E(32554): 4.8e-14 Smith-Waterman score: 283; 22.9% identity (56.7% similar) in 319 aa overlap (15-322:32-337) 10 20 30 40 pF1KE6 MEADLSGFNIDAPRW--DQRTFLGRVKHFLNITDPRTVFVSERE : ....:. : .. .. :: .::.: . CCDS76 SLEQEEETQPGRLLGRRDAVPAFIEPNVRFWITERQSFIRRFLQWTELLDPTNVFISVES 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 LDWAKVMVEKSRMGVVPPGTQVEQLLYAKKLYDSAFHPDTGEKMNVIGRMSFQLPGGMII .. .. .. .. : :.. ... : : .. :::... . . : . :: : CCDS76 IENSRQLLCTNE-DVSSPASADQRIQEAWKRSLATVHPDSSNLIPKLFRPAAFLPF-MAP 70 80 90 100 110 110 120 130 140 150 160 pF1KE6 TGFM-LQFYRTMPAVIFWQWVNQSFNALVNYTNRNAASPTSVRQMALSYFTATTTAVATA : :. . . . .::. : .. : : : : . . . . : . : ..: .: CCDS76 TVFLSMTPLKGIKSVILPQVFLCAYMAAFNSINGNRS--YTCKPLERSLLMAGAVASSTF 120 130 140 150 160 170 170 180 190 200 210 pF1KE6 VG-MNMLTKKAPPLVGRWV----PFAAVAAANCVNIPMMRQQELIKGICVKDRNENEIGH .: . .... :.: :. : .. :. .:. : :. : :::: : :.. : .:: CCDS76 LGVIPQFVQMKYGLTGPWIKRLLPVIFLVQASGMNVYMSRSLESIKGIAVMDKEGNVLGH 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE6 SRRAAAIGITQVVISRITMSAPGMILLPVIMERLEKLHFMQKVKVLHAPLQVMLSGCFLI :: :.. .. ... :::.. . . .. :. ... ....: . : .. .: .. CCDS76 SRIAGTKAVRETLASRIVLFGTSALIPEVFTYFFKRTQYFRKNP---GSLWILKLSCTVL 240 250 260 270 280 290 280 290 300 310 320 pF1KE6 ---FMVPVACGLFPQKCELPVSYLEPKLQDTIKAKYGELEPYVYFNKGL .::: . ..::: .. :: :.:. . : .....:. 
CCDS76 AMGLMVPFSFSIFPQIGQIQYCSLEEKIQSPTE------ETEIFYHRGV
          300       310       320             330

322 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 11:56:40 2016 done: Tue Nov 8 11:56:40 2016
 Total Scan time: 2.090 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]