FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5230, 208 aa 1>>>pF1KE5230 208 - 208 aa - 208 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.3314+/-0.0007; mu= 9.4562+/- 0.042 mean_var=93.8482+/-18.699, 0's: 0 Z-trim(111.9): 35 B-trim: 0 in 0/53 Lambda= 0.132392 statistics sampled from 12732 (12763) to 12732 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.769), E-opt: 0.2 (0.392), width: 16 Scan time: 1.740 The best scores are: opt bits E(32554) CCDS35211.1 SCML1 gene_id:6322|Hs108|chrX ( 208) 1411 278.8 1.6e-75 CCDS14182.2 SCML1 gene_id:6322|Hs108|chrX ( 302) 1411 278.9 2.2e-75 CCDS35210.1 SCML1 gene_id:6322|Hs108|chrX ( 329) 1411 278.9 2.4e-75 CCDS14185.1 SCML2 gene_id:10389|Hs108|chrX ( 700) 380 82.2 8.6e-16 CCDS75500.1 SCML4 gene_id:256380|Hs108|chr6 ( 172) 333 72.9 1.3e-13 CCDS69163.1 SCML4 gene_id:256380|Hs108|chr6 ( 356) 333 73.1 2.5e-13 CCDS5060.2 SCML4 gene_id:256380|Hs108|chr6 ( 414) 333 73.1 2.8e-13 CCDS53302.1 SCMH1 gene_id:22955|Hs108|chr1 ( 599) 318 70.3 2.8e-12 CCDS30688.1 SCMH1 gene_id:22955|Hs108|chr1 ( 660) 318 70.3 3e-12 CCDS53303.1 SCMH1 gene_id:22955|Hs108|chr1 ( 480) 309 68.5 7.6e-12 CCDS53301.1 SCMH1 gene_id:22955|Hs108|chr1 ( 577) 309 68.6 8.8e-12 CCDS461.1 SCMH1 gene_id:22955|Hs108|chr1 ( 591) 309 68.6 9e-12 CCDS53304.1 SCMH1 gene_id:22955|Hs108|chr1 ( 648) 309 68.6 9.8e-12 >>CCDS35211.1 SCML1 gene_id:6322|Hs108|chrX (208 aa) initn: 1411 init1: 1411 opt: 1411 Z-score: 1469.2 bits: 278.8 E(32554): 1.6e-75 Smith-Waterman score: 1411; 100.0% identity (100.0% similar) in 208 aa overlap (1-208:1-208) 10 20 30 40 50 60 pF1KE5 MKKNEVYETFSYPESYSPTLPVSRRENNSPSNLPRPSFCMEEYQRAELEEDPILSRTPSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 MKKNEVYETFSYPESYSPTLPVSRRENNSPSNLPRPSFCMEEYQRAELEEDPILSRTPSP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VHPSDFSEHNCQPYYASDGATYGSSSGLCLGNPRADSIHNTYSTDHASAAPPSVTRSPVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 VHPSDFSEHNCQPYYASDGATYGSSSGLCLGNPRADSIHNTYSTDHASAAPPSVTRSPVE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 NDGYIEEGSITKHPSTWSVEAVVLFLKQTDPLALCPLVDLFRSHEIDGKALLLLTSDVLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 NDGYIEEGSITKHPSTWSVEAVVLFLKQTDPLALCPLVDLFRSHEIDGKALLLLTSDVLL 130 140 150 160 170 180 190 200 pF1KE5 KHLGVKLGTAVKLCYYIDRLKQGKCFEN :::::::::::::::::::::::::::: CCDS35 KHLGVKLGTAVKLCYYIDRLKQGKCFEN 190 200 >>CCDS14182.2 SCML1 gene_id:6322|Hs108|chrX (302 aa) initn: 1411 init1: 1411 opt: 1411 Z-score: 1466.7 bits: 278.9 E(32554): 2.2e-75 Smith-Waterman score: 1411; 100.0% identity (100.0% similar) in 208 aa overlap (1-208:95-302) 10 20 30 pF1KE5 MKKNEVYETFSYPESYSPTLPVSRRENNSP :::::::::::::::::::::::::::::: CCDS14 FHARSLWTNHKRYGYKKHSYRLVKKLKLQKMKKNEVYETFSYPESYSPTLPVSRRENNSP 70 80 90 100 110 120 40 50 60 70 80 90 pF1KE5 SNLPRPSFCMEEYQRAELEEDPILSRTPSPVHPSDFSEHNCQPYYASDGATYGSSSGLCL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 SNLPRPSFCMEEYQRAELEEDPILSRTPSPVHPSDFSEHNCQPYYASDGATYGSSSGLCL 130 140 150 160 170 180 100 110 120 130 140 150 pF1KE5 GNPRADSIHNTYSTDHASAAPPSVTRSPVENDGYIEEGSITKHPSTWSVEAVVLFLKQTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 GNPRADSIHNTYSTDHASAAPPSVTRSPVENDGYIEEGSITKHPSTWSVEAVVLFLKQTD 190 200 210 220 230 240 160 170 180 190 200 pF1KE5 PLALCPLVDLFRSHEIDGKALLLLTSDVLLKHLGVKLGTAVKLCYYIDRLKQGKCFEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 PLALCPLVDLFRSHEIDGKALLLLTSDVLLKHLGVKLGTAVKLCYYIDRLKQGKCFEN 250 260 270 280 290 300 >>CCDS35210.1 SCML1 gene_id:6322|Hs108|chrX (329 aa) initn: 1411 init1: 1411 opt: 1411 Z-score: 1466.2 bits: 278.9 E(32554): 2.4e-75 Smith-Waterman score: 1411; 100.0% identity (100.0% similar) in 208 aa overlap (1-208:122-329) 10 20 30 pF1KE5 MKKNEVYETFSYPESYSPTLPVSRRENNSP :::::::::::::::::::::::::::::: CCDS35 FHARSLWTNHKRYGYKKHSYRLVKKLKLQKMKKNEVYETFSYPESYSPTLPVSRRENNSP 100 110 120 130 140 150 40 50 60 70 80 90 pF1KE5 SNLPRPSFCMEEYQRAELEEDPILSRTPSPVHPSDFSEHNCQPYYASDGATYGSSSGLCL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 SNLPRPSFCMEEYQRAELEEDPILSRTPSPVHPSDFSEHNCQPYYASDGATYGSSSGLCL 160 170 180 190 200 210 100 110 120 130 140 150 pF1KE5 GNPRADSIHNTYSTDHASAAPPSVTRSPVENDGYIEEGSITKHPSTWSVEAVVLFLKQTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 GNPRADSIHNTYSTDHASAAPPSVTRSPVENDGYIEEGSITKHPSTWSVEAVVLFLKQTD 220 230 240 250 260 270 160 170 180 190 200 pF1KE5 PLALCPLVDLFRSHEIDGKALLLLTSDVLLKHLGVKLGTAVKLCYYIDRLKQGKCFEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 PLALCPLVDLFRSHEIDGKALLLLTSDVLLKHLGVKLGTAVKLCYYIDRLKQGKCFEN 280 290 300 310 320 >>CCDS14185.1 SCML2 gene_id:10389|Hs108|chrX (700 aa) initn: 347 init1: 347 opt: 380 Z-score: 397.0 bits: 82.2 E(32554): 8.6e-16 Smith-Waterman score: 384; 35.8% identity (62.3% similar) in 212 aa overlap (8-204:502-698) 10 20 30 pF1KE5 MKKNEVYETFSYPESYSPTLPVSRR--ENNSP----- .: : :: :: ... .. : CCDS14 GHTHSSAEHDKNQSAKEDVTERQSTKRSPQQTVPYVVPLSPKLPKTKEYASEGEPLFAGG 480 490 500 510 520 530 40 50 60 70 80 pF1KE5 SNLPRPSFCMEEYQRAELEEDPILS---RTPSPVHPS---DFSEHNCQPYYASDGATYGS : .:. :. . . :. :. :.: .: : :::. .. :. CCDS14 SAIPKEENLSEDSKSSSLNSGNYLNPACRNPMYIHTSVSQDFSR-----------SVPGT 540 550 560 570 580 90 100 110 120 130 140 pF1KE5 SSGLCLGN--PRADSIHNTYSTDHASAAPPSVTRSPVENDGYIEEGSITKHPSTWSVEAV .:. .:. :... . .. .. : :: .. : . . ...: ..: ::::::. : CCDS14 TSSPLVGDISPKSSPHEVKFQMQRKSEAPSYIA---VPDPSVLKQG-FSKDPSTWSVDEV 590 600 610 620 630 150 160 170 180 190 200 pF1KE5 VLFLKQTDPLALCPLVDLFRSHEIDGKALLLLTSDVLLKHLGVKLGTAVKLCYYIDRLKQ . :.:.::: ::.::::.::::::::.:: :::..:..:.::: :.::::::..::. CCDS14 IQFMKHTDPQISGPLADLFRQHEIDGKALFLLKSDVMMKYMGLKLGPALKLCYYIEKLKE 640 650 660 670 680 690 pF1KE5 GKCFEN :: CCDS14 GKYS 700 >>CCDS75500.1 SCML4 gene_id:256380|Hs108|chr6 (172 aa) initn: 371 init1: 320 opt: 333 Z-score: 357.6 bits: 72.9 E(32554): 1.3e-13 Smith-Waterman score: 338; 41.7% identity (67.9% similar) in 156 aa overlap (61-204:19-171) 40 50 60 70 80 pF1KE5 SNLPRPSFCMEEYQRAELEEDPILSRTPSPVHPSDFSEHNCQPYYASDG-------ATYG .::: : :. ..:. :: : CCDS75 MNRYSVDTSASTFNHRGSLHPS--SSLYCKRQNSGDSHLGGGPAATAG 10 20 30 40 90 100 110 120 130 pF1KE5 S--SSGLCLGNPRADSIHNTYSTDHASAAPPSVTR---SPVENDGYIEEGSITKHPSTWS . .: . :.: : ... :. . ... .: :: . :. . ...::.:. CCDS75 GPRTSPMSSGGPSAPGLRPPASSPKRNTTSLEGNRCASSPSQ-DAQDARRPRSRNPSAWT 50 60 70 80 90 100 140 150 160 170 180 190 pF1KE5 VEAVVLFLKQTDPLALCPLVDLFRSHEIDGKALLLLTSDVLLKHLGVKLGTAVKLCYYID :: :: :.:..:: :: : :.:::.:::::.::::: ::...:.::.::: :.::::.:: CCDS75 VEDVVWFVKDADPQALGPHVELFRKHEIDGNALLLLKSDMVMKYLGLKLGPALKLCYHID 110 120 130 140 150 160 200 pF1KE5 RLKQGKCFEN .:::.: CCDS75 KLKQAKF 170 >>CCDS69163.1 SCML4 gene_id:256380|Hs108|chr6 (356 aa) initn: 351 init1: 320 opt: 333 Z-score: 352.9 bits: 73.1 E(32554): 2.5e-13 Smith-Waterman score: 338; 41.7% identity (67.9% similar) in 156 aa overlap (61-204:203-355) 40 50 60 70 80 pF1KE5 SNLPRPSFCMEEYQRAELEEDPILSRTPSPVHPSDFSEHNCQPYYASDG-------ATYG .::: : :. ..:. :: : CCDS69 VTTEEYLVNPVGMNRYSVDTSASTFNHRGSLHPS--SSLYCKRQNSGDSHLGGGPAATAG 180 190 200 210 220 230 90 100 110 120 130 pF1KE5 S--SSGLCLGNPRADSIHNTYSTDHASAAPPSVTR---SPVENDGYIEEGSITKHPSTWS . .: . :.: : ... :. . ... .: :: . :. . ...::.:. CCDS69 GPRTSPMSSGGPSAPGLRPPASSPKRNTTSLEGNRCASSPSQ-DAQDARRPRSRNPSAWT 240 250 260 270 280 140 150 160 170 180 190 pF1KE5 VEAVVLFLKQTDPLALCPLVDLFRSHEIDGKALLLLTSDVLLKHLGVKLGTAVKLCYYID :: :: :.:..:: :: : :.:::.:::::.::::: ::...:.::.::: :.::::.:: CCDS69 VEDVVWFVKDADPQALGPHVELFRKHEIDGNALLLLKSDMVMKYLGLKLGPALKLCYHID 290 300 310 320 330 340 200 pF1KE5 RLKQGKCFEN .:::.: CCDS69 KLKQAKF 350 >>CCDS5060.2 SCML4 gene_id:256380|Hs108|chr6 (414 aa) initn: 370 init1: 320 opt: 333 Z-score: 351.9 bits: 73.1 E(32554): 2.8e-13 Smith-Waterman score: 338; 41.7% identity (67.9% similar) in 156 aa overlap (61-204:261-413) 40 50 60 70 80 pF1KE5 SNLPRPSFCMEEYQRAELEEDPILSRTPSPVHPSDFSEHNCQPYYASDG-------ATYG .::: : :. ..:. :: : CCDS50 VTTEEYLVNPVGMNRYSVDTSASTFNHRGSLHPS--SSLYCKRQNSGDSHLGGGPAATAG 240 250 260 270 280 90 100 110 120 130 pF1KE5 S--SSGLCLGNPRADSIHNTYSTDHASAAPPSVTR---SPVENDGYIEEGSITKHPSTWS . .: . :.: : ... :. . ... .: :: . :. . ...::.:. CCDS50 GPRTSPMSSGGPSAPGLRPPASSPKRNTTSLEGNRCASSPSQ-DAQDARRPRSRNPSAWT 290 300 310 320 330 340 140 150 160 170 180 190 pF1KE5 VEAVVLFLKQTDPLALCPLVDLFRSHEIDGKALLLLTSDVLLKHLGVKLGTAVKLCYYID :: :: :.:..:: :: : :.:::.:::::.::::: ::...:.::.::: :.::::.:: CCDS50 VEDVVWFVKDADPQALGPHVELFRKHEIDGNALLLLKSDMVMKYLGLKLGPALKLCYHID 350 360 370 380 390 400 200 pF1KE5 RLKQGKCFEN .:::.: CCDS50 KLKQAKF 410 >>CCDS53302.1 SCMH1 gene_id:22955|Hs108|chr1 (599 aa) initn: 327 init1: 240 opt: 318 Z-score: 334.0 bits: 70.3 E(32554): 2.8e-12 Smith-Waterman score: 318; 37.1% identity (62.4% similar) in 178 aa overlap (33-204:427-598) 10 20 30 40 50 60 pF1KE5 KNEVYETFSYPESYSPTLPVSRRENNSPSNLPRPSFCMEEYQRAELEEDPILSRTPSPVH :: .: . . :: : . : . CCDS53 RSDNLFGNQPFTQTHLSLTAIEYSHSHDRYLPGETFVLGNSLARSLE--PHSDSMDSASN 400 410 420 430 440 450 70 80 90 100 110 120 pF1KE5 PSDF--SEHNCQPYYASDGATYGSSSGLCLGNPRADSIHNTYSTDHASAAPPSVTRSPVE :... . . .: .: : ...:.. :. .. : : ...::: CCDS53 PTNLVSTSQRHRPLLSSCGLPPSTASAVRRLCSRGVLKGSNERRDMESFW--KLNRSP-G 460 470 480 490 500 510 130 140 150 160 170 pF1KE5 NDGYIEEGSITK----HPSTWSVEAVVLFLKQTDPLALCPLVDLFRSHEIDGKALLLLTS .: :.: . .. ::.:.:: :. :....:: : : .::::.::::::::::: : CCDS53 SDRYLESRDASRLSGRDPSSWTVEDVMQFVREADP-QLGPHADLFRKHEIDGKALLLLRS 520 530 540 550 560 570 180 190 200 pF1KE5 DVLLKHLGVKLGTAVKLCYYIDRLKQGKCFEN :...:..:.::: :.:: :.:::::::: CCDS53 DMMMKYMGLKLGPALKLSYHIDRLKQGKF 580 590 >>CCDS30688.1 SCMH1 gene_id:22955|Hs108|chr1 (660 aa) initn: 329 init1: 240 opt: 318 Z-score: 333.4 bits: 70.3 E(32554): 3e-12 Smith-Waterman score: 318; 37.1% identity (62.4% similar) in 178 aa overlap (33-204:488-659) 10 20 30 40 50 60 pF1KE5 KNEVYETFSYPESYSPTLPVSRRENNSPSNLPRPSFCMEEYQRAELEEDPILSRTPSPVH :: .: . . :: : . : . CCDS30 RSDNLFGNQPFTQTHLSLTAIEYSHSHDRYLPGETFVLGNSLARSLE--PHSDSMDSASN 460 470 480 490 500 510 70 80 90 100 110 120 pF1KE5 PSDF--SEHNCQPYYASDGATYGSSSGLCLGNPRADSIHNTYSTDHASAAPPSVTRSPVE :... . . .: .: : ...:.. :. .. : : ...::: CCDS30 PTNLVSTSQRHRPLLSSCGLPPSTASAVRRLCSRGVLKGSNERRDMESFW--KLNRSP-G 520 530 540 550 560 570 130 140 150 160 170 pF1KE5 NDGYIEEGSITK----HPSTWSVEAVVLFLKQTDPLALCPLVDLFRSHEIDGKALLLLTS .: :.: . .. ::.:.:: :. :....:: : : .::::.::::::::::: : CCDS30 SDRYLESRDASRLSGRDPSSWTVEDVMQFVREADP-QLGPHADLFRKHEIDGKALLLLRS 580 590 600 610 620 630 180 190 200 pF1KE5 DVLLKHLGVKLGTAVKLCYYIDRLKQGKCFEN :...:..:.::: :.:: :.:::::::: CCDS30 DMMMKYMGLKLGPALKLSYHIDRLKQGKF 640 650 660 >>CCDS53303.1 SCMH1 gene_id:22955|Hs108|chr1 (480 aa) initn: 326 init1: 240 opt: 309 Z-score: 326.2 bits: 68.5 E(32554): 7.6e-12 Smith-Waterman score: 314; 38.4% identity (63.2% similar) in 185 aa overlap (48-204:299-479) 20 30 40 50 60 70 pF1KE5 PTLPVSRRENNSPSNLPRPSFCMEEYQRAELEEDPILSRTP-SPVHPS----DFSEHNCQ :. : ... : . .: : ..: :. . CCDS53 VFDREQHTLNLPAVNSITYVLRFLEKLCHNLRSDNLFGNQPFTQTHLSLTAIEYS-HSHD 270 280 290 300 310 320 80 90 100 110 pF1KE5 PYYASDGATYGSSSGLCLGNPRADSIH------NTYSTDH-------ASAAPPSVTRSPV : .. . :.: . : .:..::. : ::.. . . ::: : : : CCDS53 RYLPGETFVLGNSLARSL-EPHSDSMDSASNPTNLVSTSQRHRPLLSSCGLPPS-TASAV 330 340 350 360 370 380 120 130 140 150 160 pF1KE5 E------NDGYIEEGSITK----HPSTWSVEAVVLFLKQTDPLALCPLVDLFRSHEIDGK . .: :.: . .. ::.:.:: :. :....:: : : .::::.:::::: CCDS53 RRLCSRGSDRYLESRDASRLSGRDPSSWTVEDVMQFVREADP-QLGPHADLFRKHEIDGK 390 400 410 420 430 440 170 180 190 200 pF1KE5 ALLLLTSDVLLKHLGVKLGTAVKLCYYIDRLKQGKCFEN ::::: ::...:..:.::: :.:: :.:::::::: CCDS53 ALLLLRSDMMMKYMGLKLGPALKLSYHIDRLKQGKF 450 460 470 480 208 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:42:43 2016 done: Mon Nov 7 22:42:44 2016 Total Scan time: 1.740 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]