FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4422, 437 aa 1>>>pF1KE4422 437 - 437 aa - 437 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.7046+/-0.000818; mu= 15.3542+/- 0.049 mean_var=68.9591+/-13.768, 0's: 0 Z-trim(107.6): 15 B-trim: 0 in 0/49 Lambda= 0.154447 statistics sampled from 9676 (9683) to 9676 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.671), E-opt: 0.2 (0.297), width: 16 Scan time: 2.640 The best scores are: opt bits E(32554) CCDS5637.1 SGCE gene_id:8910|Hs108|chr7 ( 437) 2980 673.0 1.6e-193 CCDS47643.1 SGCE gene_id:8910|Hs108|chr7 ( 462) 2838 641.3 5.7e-184 CCDS75634.1 SGCE gene_id:8910|Hs108|chr7 ( 396) 2430 550.4 1.2e-156 CCDS47642.1 SGCE gene_id:8910|Hs108|chr7 ( 451) 2360 534.8 6.4e-152 CCDS32679.1 SGCA gene_id:6442|Hs108|chr17 ( 387) 1082 250.0 2.9e-66 CCDS45729.1 SGCA gene_id:6442|Hs108|chr17 ( 263) 505 121.4 1.1e-27 >>CCDS5637.1 SGCE gene_id:8910|Hs108|chr7 (437 aa) initn: 2980 init1: 2980 opt: 2980 Z-score: 3587.8 bits: 673.0 E(32554): 1.6e-193 Smith-Waterman score: 2980; 99.5% identity (99.8% similar) in 437 aa overlap (1-437:1-437) 10 20 30 40 50 60 pF1KE4 MQLPRWWELGDPCAWTGQGRGTRRMSPATTGTFLLTVYSIFSKVHSDRSVYPSAGVLFVH ::::::::::::::::::::::::::::::::::::::::::::::::.::::::::::: CCDS56 MQLPRWWELGDPCAWTGQGRGTRRMSPATTGTFLLTVYSIFSKVHSDRNVYPSAGVLFVH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 VLEREYFKGEFPPYPKPGEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 VLEREYFKGEFPPYPKPGEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 ENVGKPTIIEITAYNRRTFETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 ENVGKPTIIEITAYNRRTFETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 GDFLGAVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 GDFLGAVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 NPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 NPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 YKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 YKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVH 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 HSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEIIHPLHTDNYDSTNMPLMQTQQNL :::::::::::::::::::::::::::::::::::::: ::::::::::::::::::::: CCDS56 HSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEIIPPLHTDNYDSTNMPLMQTQQNL 370 380 390 400 410 420 430 pF1KE4 PHQTQIPQQQTTGKWYP ::::::::::::::::: CCDS56 PHQTQIPQQQTTGKWYP 430 >>CCDS47643.1 SGCE gene_id:8910|Hs108|chr7 (462 aa) initn: 2836 init1: 2836 opt: 2838 Z-score: 3416.4 bits: 641.3 E(32554): 5.7e-184 Smith-Waterman score: 2920; 94.2% identity (94.4% similar) in 462 aa overlap (1-437:1-462) 10 20 30 40 50 60 pF1KE4 MQLPRWWELGDPCAWTGQGRGTRRMSPATTGTFLLTVYSIFSKVHSDRSVYPSAGVLFVH ::::::::::::::::::::::::::::::::::::::::::::::::.::::::::::: CCDS47 MQLPRWWELGDPCAWTGQGRGTRRMSPATTGTFLLTVYSIFSKVHSDRNVYPSAGVLFVH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 VLEREYFKGEFPPYPKPGEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 VLEREYFKGEFPPYPKPGEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 ENVGKPTIIEITAYNRRTFETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 ENVGKPTIIEITAYNRRTFETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 GDFLGAVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 GDFLGAVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 NPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 NPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 YKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 YKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVH 310 320 330 340 350 360 370 380 390 400 410 pF1KE4 HSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEIIHPLHTDNYDSTNMPLMQTQQ-- :::::::::::::::::::::::::::::::::::::: ::::::::::::::::::: CCDS47 HSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEIIPPLHTDNYDSTNMPLMQTQQWS 370 380 390 400 410 420 420 430 pF1KE4 -----------------------NLPHQTQIPQQQTTGKWYP ::::::::::::::::::: CCDS47 FAPVAQAGVQWSDLGSLQPPPPRNLPHQTQIPQQQTTGKWYP 430 440 450 460 >>CCDS75634.1 SGCE gene_id:8910|Hs108|chr7 (396 aa) initn: 2687 init1: 2430 opt: 2430 Z-score: 2926.1 bits: 550.4 E(32554): 1.2e-156 Smith-Waterman score: 2609; 90.4% identity (90.4% similar) in 437 aa overlap (1-437:1-396) 10 20 30 40 50 60 pF1KE4 MQLPRWWELGDPCAWTGQGRGTRRMSPATTGTFLLTVYSIFSKVHSDRSVYPSAGVLFVH :::::::::::::::::::::::::::::::::::: CCDS75 MQLPRWWELGDPCAWTGQGRGTRRMSPATTGTFLLT------------------------ 10 20 30 70 80 90 100 110 120 pF1KE4 VLEREYFKGEFPPYPKPGEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTA ::::::::::::::::::::::::::::::::::::::::::: CCDS75 -----------------GEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTA 40 50 60 70 130 140 150 160 170 180 pF1KE4 ENVGKPTIIEITAYNRRTFETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 ENVGKPTIIEITAYNRRTFETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVL 80 90 100 110 120 130 190 200 210 220 230 240 pF1KE4 GDFLGAVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 GDFLGAVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVE 140 150 160 170 180 190 250 260 270 280 290 300 pF1KE4 NPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 NPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGE 200 210 220 230 240 250 310 320 330 340 350 360 pF1KE4 YKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 YKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVH 260 270 280 290 300 310 370 380 390 400 410 420 pF1KE4 HSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEIIHPLHTDNYDSTNMPLMQTQQNL :::::::::::::::::::::::::::::::::::::: ::::::::::::::::::::: CCDS75 HSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEIIPPLHTDNYDSTNMPLMQTQQNL 320 330 340 350 360 370 430 pF1KE4 PHQTQIPQQQTTGKWYP ::::::::::::::::: CCDS75 PHQTQIPQQQTTGKWYP 380 390 >>CCDS47642.1 SGCE gene_id:8910|Hs108|chr7 (451 aa) initn: 2864 init1: 2360 opt: 2360 Z-score: 2840.9 bits: 534.8 E(32554): 6.4e-152 Smith-Waterman score: 2850; 97.5% identity (97.7% similar) in 433 aa overlap (1-433:1-424) 10 20 30 40 50 60 pF1KE4 MQLPRWWELGDPCAWTGQGRGTRRMSPATTGTFLLTVYSIFSKVHSDRSVYPSAGVLFVH ::::::::::::::::::::::::::::::::::::::::::::::::.::::::::::: CCDS47 MQLPRWWELGDPCAWTGQGRGTRRMSPATTGTFLLTVYSIFSKVHSDRNVYPSAGVLFVH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 VLEREYFKGEFPPYPKPGEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 VLEREYFKGEFPPYPKPGEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 ENVGKPTIIEITAYNRRTFETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 ENVGKPTIIEITAYNRRTFETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 GDFLGAVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 GDFLGAVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 NPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 NPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 YKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVH :::::::::::::::::::::::::::::::::::::::::::::: ::::: CCDS47 YKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGV---------IQLVH 310 320 330 340 350 370 380 390 400 410 420 pF1KE4 HSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEIIHPLHTDNYDSTNMPLMQTQQNL :::::::::::::::::::::::::::::::::::::: ::::::::::::::::::::: CCDS47 HSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEIIPPLHTDNYDSTNMPLMQTQQNL 360 370 380 390 400 410 430 pF1KE4 PHQTQIPQQQTTGKWYP ::::::::::::: CCDS47 PHQTQIPQQQTTGDFRLTTFQRFEVNGIPEERKLTEAMNL 420 430 440 450 >>CCDS32679.1 SGCA gene_id:6442|Hs108|chr17 (387 aa) initn: 1055 init1: 554 opt: 1082 Z-score: 1303.0 bits: 250.0 E(32554): 2.9e-66 Smith-Waterman score: 1082; 44.6% identity (71.6% similar) in 370 aa overlap (49-418:27-387) 20 30 40 50 60 70 pF1KE4 GRGTRRMSPATTGTFLLTVYSIFSKVHSDRSVYPSAGVLFVHVLEREYFKGEFPPYPKPG ...: .: .:::.:..: : . : CCDS32 MAETLFWTPLLVVLLAGLGDTEAQQTTLHPLVGRVFVHTLDHETFLSLPEHVAVPP 10 20 30 40 50 80 90 100 110 120 130 pF1KE4 EISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTAENVGKPTIIEITAYNRRT . ::....:.:.:: : :::: ::.:. : :::: : :. : .::.::::: . CCDS32 AVH---ITYHAHLQGHPDLPRWLRYTQRSPHHPGFLYGSATPEDRGLQ-VIEVTAYNRDS 60 70 80 90 100 110 140 150 160 170 180 190 pF1KE4 FETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVLGDFLGAVKNVWQPERLNA :.:.:. :...: . : :::::::.... ..::.: : . ::.:. ..:.: .:. CCDS32 FDTTRQRLVLEIGDPEGPLLPYQAEFLVRSHDAEEVLPSTPASRFLSALGGLWEPGELQL 120 130 140 150 160 170 200 210 220 230 240 250 pF1KE4 INITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVENPQNQLRCSQEMEPVITC .:.::::::::::::::. :::::. ::. :::.::. : .:... ::.: . :...: CCDS32 LNVTSALDRGGRVPLPIEGRKEGVYIKVGSASPFSTCLKMVASPDSHARCAQGQPPLLSC 180 190 200 210 220 230 260 270 280 290 300 310 pF1KE4 DKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGEYKPPSDSLKSRDYYTDFL . .: .:::...::::. . . :.::: . ::... .::. .: : CCDS32 YDTLAPHFRVDWCNVTLVDKSVPEPADEVPTPGDGILEHDPFFCPPTEA-PDRDFLVDAL 240 250 260 270 280 290 320 330 340 350 360 370 pF1KE4 ITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVHHSAIQKSTKELRDMSKNR .:: :: :::.: :.:::.::::::: ::.. : :::.::: .:. .:.:::.:. .: CCDS32 VTLLVPLLVALLLTLLLAYVMCCRREGRLKRDLATSDIQMVHHCTIHGNTEELRQMAASR 300 310 320 330 340 350 380 390 400 410 420 430 pF1KE4 EIAWPLSTLPVFHPVTGEIIHPLHTDNYDSTNMPLMQTQQNLPHQTQIPQQQTTGKWYP :. ::::::.:. ::: . : ::...::. :. CCDS32 EVPRPLSTLPMFNVHTGERLPP----RVDSAQVPLILDQH 360 370 380 >>CCDS45729.1 SGCA gene_id:6442|Hs108|chr17 (263 aa) initn: 495 init1: 323 opt: 505 Z-score: 610.8 bits: 121.4 E(32554): 1.1e-27 Smith-Waterman score: 505; 45.4% identity (73.0% similar) in 174 aa overlap (49-222:27-196) 20 30 40 50 60 70 pF1KE4 GRGTRRMSPATTGTFLLTVYSIFSKVHSDRSVYPSAGVLFVHVLEREYFKGEFPPYPKPG ...: .: .:::.:..: : . : CCDS45 MAETLFWTPLLVVLLAGLGDTEAQQTTLHPLVGRVFVHTLDHETFLSLPEHVAVPP 10 20 30 40 50 80 90 100 110 120 130 pF1KE4 EISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTAENVGKPTIIEITAYNRRT . ::....:.:.:: : :::: ::.:. : :::: : :. : .::.::::: . CCDS45 AVH---ITYHAHLQGHPDLPRWLRYTQRSPHHPGFLYGSATPEDRGLQ-VIEVTAYNRDS 60 70 80 90 100 110 140 150 160 170 180 190 pF1KE4 FETARHNLIINIMSAEDFPLPYQAEFFIKNMNVEEMLASEVLGDFLGAVKNVWQPERLNA :.:.:. :...: . : :::::::.... ..::.: : . ::.:. ..:.: .:. CCDS45 FDTTRQRLVLEIGDPEGPLLPYQAEFLVRSHDAEEVLPSTPASRFLSALGGLWEPGELQL 120 130 140 150 160 170 200 210 220 230 240 250 pF1KE4 INITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVENPQNQLRCSQEMEPVITC .:.::::::::::::::. :::. CCDS45 LNVTSALDRGGRVPLPIEGRKEGLKRDLATSDIQMVHHCTIHGNTEELRQMAASREVPRP 180 190 200 210 220 230 437 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 01:13:07 2016 done: Sun Nov 6 01:13:08 2016 Total Scan time: 2.640 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]