FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4444, 460 aa 1>>>pF1KE4444 460 - 460 aa - 460 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3928+/-0.000772; mu= 17.3938+/- 0.047 mean_var=64.7201+/-13.413, 0's: 0 Z-trim(107.5): 25 B-trim: 0 in 0/51 Lambda= 0.159424 statistics sampled from 9593 (9607) to 9593 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.659), E-opt: 0.2 (0.295), width: 16 Scan time: 3.190 The best scores are: opt bits E(32554) CCDS43308.1 SLC45A2 gene_id:51151|Hs108|chr5 ( 460) 3109 723.8 9.3e-209 CCDS3901.1 SLC45A2 gene_id:51151|Hs108|chr5 ( 530) 3068 714.4 7.2e-206 CCDS75232.1 SLC45A2 gene_id:51151|Hs108|chr5 ( 243) 1262 298.8 4.1e-81 CCDS75795.1 SLC45A4 gene_id:57210|Hs108|chr8 ( 808) 760 183.6 6.5e-46 CCDS69550.1 SLC45A4 gene_id:57210|Hs108|chr8 ( 717) 620 151.4 2.9e-36 CCDS34948.1 SLC45A4 gene_id:57210|Hs108|chr8 ( 798) 620 151.4 3.2e-36 CCDS30577.2 SLC45A1 gene_id:50651|Hs108|chr1 ( 782) 477 118.5 2.5e-26 CCDS1458.1 SLC45A3 gene_id:85414|Hs108|chr1 ( 553) 427 106.9 5.4e-23 >>CCDS43308.1 SLC45A2 gene_id:51151|Hs108|chr5 (460 aa) initn: 3109 init1: 3109 opt: 3109 Z-score: 3861.4 bits: 723.8 E(32554): 9.3e-209 Smith-Waterman score: 3109; 99.8% identity (100.0% similar) in 460 aa overlap (1-460:1-460) 10 20 30 40 50 60 pF1KE4 MGSNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVTPVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MGSNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVTPVL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LSVGLPSSLYSIVWFLSPILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGMALY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 LSVGLPSSLYSIVWFLSPILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGMALY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 LNGATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFDVCSHQDKEKGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 LNGATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFDVCSHQDKEKGL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 HYHALFTGFGGALGYLLGAIDWAHLELGRLLGTEFQVMFFFSALVLTLCFTVHLCSISEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 HYHALFTGFGGALGYLLGAIDWAHLELGRLLGTEFQVMFFFSALVLTLCFTVHLCSISEA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 PLTEVAKGIPPQQTPQDPPLSSDGMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTRRA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 PLTEVAKGIPPQQTPQDPPLSSDGMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTRRA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 MTLKSLLRALVNMPPHYRYLCISHLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNSTEF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MTLKSLLRALVNMPPHYRYLCISHLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNSTEF 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 LIYERGVEVGCWGFCINSVFSSLYSYFQKVLVSYIGLKGLYFTGYLLFGLGTGFIGLFPN :::::::::::::.:::::::::::::::::::::::::::::::::::::::::::::: CCDS43 LIYERGVEVGCWGLCINSVFSSLYSYFQKVLVSYIGLKGLYFTGYLLFGLGTGFIGLFPN 370 380 390 400 410 420 430 440 450 460 pF1KE4 VYSTLVLCSLFGVMSSTLYTVPFNLITEYHREEEKEVCCH :::::::::::::::::::::::::::::::::::::::: CCDS43 VYSTLVLCSLFGVMSSTLYTVPFNLITEYHREEEKEVCCH 430 440 450 460 >>CCDS3901.1 SLC45A2 gene_id:51151|Hs108|chr5 (530 aa) initn: 3068 init1: 3068 opt: 3068 Z-score: 3809.5 bits: 714.4 E(32554): 7.2e-206 Smith-Waterman score: 3068; 99.8% identity (100.0% similar) in 456 aa overlap (1-456:1-456) 10 20 30 40 50 60 pF1KE4 MGSNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVTPVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 MGSNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVTPVL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LSVGLPSSLYSIVWFLSPILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGMALY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 LSVGLPSSLYSIVWFLSPILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGMALY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 LNGATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFDVCSHQDKEKGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 LNGATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFDVCSHQDKEKGL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 HYHALFTGFGGALGYLLGAIDWAHLELGRLLGTEFQVMFFFSALVLTLCFTVHLCSISEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 HYHALFTGFGGALGYLLGAIDWAHLELGRLLGTEFQVMFFFSALVLTLCFTVHLCSISEA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 PLTEVAKGIPPQQTPQDPPLSSDGMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTRRA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 PLTEVAKGIPPQQTPQDPPLSSDGMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTRRA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 MTLKSLLRALVNMPPHYRYLCISHLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNSTEF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 MTLKSLLRALVNMPPHYRYLCISHLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNSTEF 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 LIYERGVEVGCWGFCINSVFSSLYSYFQKVLVSYIGLKGLYFTGYLLFGLGTGFIGLFPN :::::::::::::.:::::::::::::::::::::::::::::::::::::::::::::: CCDS39 LIYERGVEVGCWGLCINSVFSSLYSYFQKVLVSYIGLKGLYFTGYLLFGLGTGFIGLFPN 370 380 390 400 410 420 430 440 450 460 pF1KE4 VYSTLVLCSLFGVMSSTLYTVPFNLITEYHREEEKEVCCH :::::::::::::::::::::::::::::::::::: CCDS39 VYSTLVLCSLFGVMSSTLYTVPFNLITEYHREEEKERQQAPGGDPDNSVRGKGMDCATLT 430 440 450 460 470 480 CCDS39 CMVQLAQILVGGGLGFLVNTAGTVVVVVITASAVALIGCCFVALFVRYVD 490 500 510 520 530 >>CCDS75232.1 SLC45A2 gene_id:51151|Hs108|chr5 (243 aa) initn: 1260 init1: 1260 opt: 1262 Z-score: 1569.8 bits: 298.8 E(32554): 4.1e-81 Smith-Waterman score: 1262; 98.4% identity (98.4% similar) in 191 aa overlap (1-191:1-191) 10 20 30 40 50 60 pF1KE4 MGSNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVTPVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 MGSNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVTPVL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LSVGLPSSLYSIVWFLSPILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGMALY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 LSVGLPSSLYSIVWFLSPILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGMALY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 LNGATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFDVCSHQDKEKGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 LNGATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFDVCSHQDKEKGL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 HYHALFTGFGGALGYLLGAIDWAHLELGRLLGTEFQVMFFFSALVLTLCFTVHLCSISEA ::::::: : CCDS75 HYHALFTDSQGNDIKVTAESTGEHASSLPLPLHQPPHWMDSLPVQHAVLHRFHGPDCVPR 190 200 210 220 230 240 >>CCDS75795.1 SLC45A4 gene_id:57210|Hs108|chr8 (808 aa) initn: 1109 init1: 438 opt: 760 Z-score: 937.7 bits: 183.6 E(32554): 6.5e-46 Smith-Waterman score: 760; 46.4% identity (75.3% similar) in 239 aa overlap (31-269:48-279) 10 20 30 40 50 60 pF1KE4 MGSNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVTPVL : .::. .:::::::::.:.: :::.: CCDS75 LSVPLPDPQKAGGAEAENCETISEGSIDRIPMRLWVMHGAVMFGREFCYAMETALVTPIL 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE4 LSVGLPSSLYSIVWFLSPILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGMALY :..::: . ::..::::::::... :..:::::.: ::::::.::.: : .: :.::. CCDS75 LQIGLPEQYYSLTWFLSPILGLIFTPLIGSASDRCTLSWGRRRPFILALCVGVLFGVALF 80 90 100 110 120 130 130 140 150 160 170 180 pF1KE4 LNGATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFDVCSHQDKEKGL :::... :: : :. . .: .:..:::..::.:: .:::.:::.:: . .... .: CCDS75 LNGSAIGLALGDVPNRQPI-GIVLTVLGVVVLDFSADATEGPIRAYLLDVVDSEEQDMAL 140 150 160 170 180 190 190 200 210 220 230 240 pF1KE4 HYHALFTGFGGALGYLLGAIDWAHLELGRLLGTEFQVMFFFSALVLTLCFTVHLCSISEA . ::. .:.:::.::.::..::.. :: . :. ::.:::.:...:. ..:: ::.: CCDS75 NIHAFSAGLGGAIGYVLGGLDWTQTFLGSWFRTQNQVLFFFAAIIFTVSVALHLFSIDEE 200 210 220 230 240 250 250 260 270 280 290 300 pF1KE4 PLTEVAKGIPPQQTPQDPPLSSDGMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTRRA . : :. . : . :: .: CCDS75 QYS------PQQERSAEEPGALDGGEPHGVPAFPDEVQSEHELALDYPDVDIMRSKSDSA 260 270 280 290 300 310 >-- initn: 379 init1: 236 opt: 388 Z-score: 475.3 bits: 98.0 E(32554): 3.7e-20 Smith-Waterman score: 388; 34.6% identity (69.8% similar) in 159 aa overlap (294-450:485-643) 270 280 290 300 310 320 pF1KE4 GMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTRRAMTLKSLLRALVNMPPHYRYLCIS .:. . :.. : ....:: . ::. CCDS75 MSDLYDMQKRQRQHRHRNQSGATTSSGDTESEEGEGETTVRLLWLSMLKMPRELMRLCLC 460 470 480 490 500 510 330 340 350 360 370 380 pF1KE4 HLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNSTEFLIYERGVEVGCWGFCINSVFSSL ::. : . ... .:.::::::....::: . ::: . :. ::..::::. : .. ... CCDS75 HLLTWFSVIAEAVFYTDFMGQVIFEGDPKAPSNSTAWQAYNAGVKMGCWGLVIYAATGAI 520 530 540 550 560 570 390 400 410 420 430 440 pF1KE4 YS-YFQKVLVSY-IGLKGLYFTGYLLFGLGTGFIGLFPNVYSTLVLCSLFGVMSSTLYTV : .:: : .: .... .: : : :..::. ...::::: ..: : .:..: .. CCDS75 CSALLQKYLDNYDLSVRVIYVLGTLGFSVGTAVMAMFPNVYVAMVTISTMGIVSMSISYC 580 590 600 610 620 630 450 460 pF1KE4 PFNLITEYHREEEKEVCCH :. :. .:: CCDS75 PYALLGQYHDIKQYIHHSPGNSKRGFGIDCAILSCQVYISQILVASALGGVVDAVGTVRV 640 650 660 670 680 690 >>CCDS69550.1 SLC45A4 gene_id:57210|Hs108|chr8 (717 aa) initn: 972 init1: 309 opt: 620 Z-score: 764.5 bits: 151.4 E(32554): 2.9e-36 Smith-Waterman score: 620; 44.5% identity (74.2% similar) in 209 aa overlap (61-269:27-228) 40 50 60 70 80 90 pF1KE4 PTSRLIMHSMAMFGREFCYAVEAAYVTPVLLSVGLPSSLYSIVWFLSPILGFLLQPVVGS :. ::: . ::..::::::::... :..:: CCDS69 MGKASPASGLSRPKTLVSPLRSNQWSLQKGLPEQYYSLTWFLSPILGLIFTPLIGS 10 20 30 40 50 100 110 120 130 140 150 pF1KE4 ASDHCRSRWGRRRPYILTLGVMMLVGMALYLNGATVVAALIANPRRKLVWAISVTMIGVV :::.: ::::::.::.: : .: :.::.:::... :: : :. . .: .:..::: CCDS69 ASDRCTLSWGRRRPFILALCVGVLFGVALFLNGSAIGLALGDVPNRQPI-GIVLTVLGVV 60 70 80 90 100 110 160 170 180 190 200 210 pF1KE4 LFDFAADFIDGPIKAYLFDVCSHQDKEKGLHYHALFTGFGGALGYLLGAIDWAHLELGRL ..::.:: .:::.:::.:: . .... .:. ::. .:.:::.::.::..::.. :: CCDS69 VLDFSADATEGPIRAYLLDVVDSEEQDMALNIHAFSAGLGGAIGYVLGGLDWTQTFLGSW 120 130 140 150 160 170 220 230 240 250 260 270 pF1KE4 LGTEFQVMFFFSALVLTLCFTVHLCSISEAPLTEVAKGIPPQQTPQDPPLSSDGMYEYGS . :. ::.:::.:...:. ..:: ::.: . : :. . : . :: .: CCDS69 FRTQNQVLFFFAAIIFTVSVALHLFSIDEEQYS------PQQERSAEEPGALDGGEPHGV 180 190 200 210 220 280 290 300 310 320 330 pF1KE4 IEKVKNGYVNPELAMQGAKNKNHAEQTRRAMTLKSLLRALVNMPPHYRYLCISHLIGWTA CCDS69 PAFPDEVQSEHELALDYPDVDIMRSKSDSALHVPDTALDLEPELLFLHDIEPSIFHDASY 230 240 250 260 270 280 >-- initn: 379 init1: 236 opt: 388 Z-score: 476.1 bits: 98.0 E(32554): 3.4e-20 Smith-Waterman score: 388; 34.6% identity (69.8% similar) in 159 aa overlap (294-450:434-592) 270 280 290 300 310 320 pF1KE4 GMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTRRAMTLKSLLRALVNMPPHYRYLCIS .:. . :.. : ....:: . ::. CCDS69 MSDLYDMQKRQRQHRHRNQSGATTSSGDTESEEGEGETTVRLLWLSMLKMPRELMRLCLC 410 420 430 440 450 460 330 340 350 360 370 380 pF1KE4 HLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNSTEFLIYERGVEVGCWGFCINSVFSSL ::. : . ... .:.::::::....::: . ::: . :. ::..::::. : .. ... CCDS69 HLLTWFSVIAEAVFYTDFMGQVIFEGDPKAPSNSTAWQAYNAGVKMGCWGLVIYAATGAI 470 480 490 500 510 520 390 400 410 420 430 440 pF1KE4 YS-YFQKVLVSY-IGLKGLYFTGYLLFGLGTGFIGLFPNVYSTLVLCSLFGVMSSTLYTV : .:: : .: .... .: : : :..::. ...::::: ..: : .:..: .. CCDS69 CSALLQKYLDNYDLSVRVIYVLGTLGFSVGTAVMAMFPNVYVAMVTISTMGIVSMSISYC 530 540 550 560 570 580 450 460 pF1KE4 PFNLITEYHREEEKEVCCH :. :. .:: CCDS69 PYALLGQYHDIKQYIHHSPGNSKRGFGIDCAILSCQVYISQILVASALGGVVDAVGTVRV 590 600 610 620 630 640 >>CCDS34948.1 SLC45A4 gene_id:57210|Hs108|chr8 (798 aa) initn: 972 init1: 309 opt: 620 Z-score: 763.8 bits: 151.4 E(32554): 3.2e-36 Smith-Waterman score: 620; 44.5% identity (74.2% similar) in 209 aa overlap (61-269:27-228) 40 50 60 70 80 90 pF1KE4 PTSRLIMHSMAMFGREFCYAVEAAYVTPVLLSVGLPSSLYSIVWFLSPILGFLLQPVVGS :. ::: . ::..::::::::... :..:: CCDS34 MGKASPASGLSRPKTLVSPLRSNQWSLQKGLPEQYYSLTWFLSPILGLIFTPLIGS 10 20 30 40 50 100 110 120 130 140 150 pF1KE4 ASDHCRSRWGRRRPYILTLGVMMLVGMALYLNGATVVAALIANPRRKLVWAISVTMIGVV :::.: ::::::.::.: : .: :.::.:::... :: : :. . .: .:..::: CCDS34 ASDRCTLSWGRRRPFILALCVGVLFGVALFLNGSAIGLALGDVPNRQPI-GIVLTVLGVV 60 70 80 90 100 110 160 170 180 190 200 210 pF1KE4 LFDFAADFIDGPIKAYLFDVCSHQDKEKGLHYHALFTGFGGALGYLLGAIDWAHLELGRL ..::.:: .:::.:::.:: . .... .:. ::. .:.:::.::.::..::.. :: CCDS34 VLDFSADATEGPIRAYLLDVVDSEEQDMALNIHAFSAGLGGAIGYVLGGLDWTQTFLGSW 120 130 140 150 160 170 220 230 240 250 260 270 pF1KE4 LGTEFQVMFFFSALVLTLCFTVHLCSISEAPLTEVAKGIPPQQTPQDPPLSSDGMYEYGS . :. ::.:::.:...:. ..:: ::.: . : :. . : . :: .: CCDS34 FRTQNQVLFFFAAIIFTVSVALHLFSIDEEQYS------PQQERSAEEPGALDGGEPHGV 180 190 200 210 220 280 290 300 310 320 330 pF1KE4 IEKVKNGYVNPELAMQGAKNKNHAEQTRRAMTLKSLLRALVNMPPHYRYLCISHLIGWTA CCDS34 PAFPDEVQSEHELALDYPDVDIMRSKSDSALHVPDTALDLEPELLFLHDIEPSIFHDASY 230 240 250 260 270 280 >-- initn: 379 init1: 236 opt: 388 Z-score: 475.4 bits: 98.0 E(32554): 3.7e-20 Smith-Waterman score: 388; 34.6% identity (69.8% similar) in 159 aa overlap (294-450:434-592) 270 280 290 300 310 320 pF1KE4 GMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTRRAMTLKSLLRALVNMPPHYRYLCIS .:. . :.. : ....:: . ::. CCDS34 MSDLYDMQKRQRQHRHRNQSGATTSSGDTESEEGEGETTVRLLWLSMLKMPRELMRLCLC 410 420 430 440 450 460 330 340 350 360 370 380 pF1KE4 HLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNSTEFLIYERGVEVGCWGFCINSVFSSL ::. : . ... .:.::::::....::: . ::: . :. ::..::::. : .. ... CCDS34 HLLTWFSVIAEAVFYTDFMGQVIFEGDPKAPSNSTAWQAYNAGVKMGCWGLVIYAATGAI 470 480 490 500 510 520 390 400 410 420 430 440 pF1KE4 YS-YFQKVLVSY-IGLKGLYFTGYLLFGLGTGFIGLFPNVYSTLVLCSLFGVMSSTLYTV : .:: : .: .... .: : : :..::. ...::::: ..: : .:..: .. CCDS34 CSALLQKYLDNYDLSVRVIYVLGTLGFSVGTAVMAMFPNVYVAMVTISTMGIVSMSISYC 530 540 550 560 570 580 450 460 pF1KE4 PFNLITEYHREEEKEVCCH :. :. .:: CCDS34 PYALLGQYHDIKQYIHHSPGNSKRGFGIDCAILSCQVYISQILVASALGGVVDAVGTVRV 590 600 610 620 630 640 >>CCDS30577.2 SLC45A1 gene_id:50651|Hs108|chr1 (782 aa) initn: 1186 init1: 461 opt: 477 Z-score: 586.2 bits: 118.5 E(32554): 2.5e-26 Smith-Waterman score: 558; 26.8% identity (50.4% similar) in 470 aa overlap (140-454:225-694) 110 120 130 140 150 160 pF1KE4 GVMMLVGMALYLNGATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFD :.. .:. ::::.::.:: :.: .::..: CCDS30 LAIGALLGLSLLLNGRDIGIALADVTGNHKWGLLLTVCGVVLMDFSADSADNPSHAYMMD 200 210 220 230 240 250 170 180 190 200 210 220 pF1KE4 VCSHQDKEKGLHYHALFTGFGGALGYLLGAIDWAHLELGRLLGTEFQVMFFFSALVLTLC ::: :...::. :::..:.::..::..:.: : . .:: :: ...:...:.:..:.. CCDS30 VCSPADQDRGLNIHALLAGLGGGFGYVVGGIHWDKTGFGRALGGQLRVIYLFTAVTLSVT 260 270 280 290 300 310 230 240 250 pF1KE4 FTVHLCSISEAPLTEVAK--------GIP--------PQQTPQD---------------- .. : :: : :: .. ..: :.. : : CCDS30 TVLTLVSIPERPLRPPSEKRAAMKSPSLPLPPSPPVLPEEGPGDSLPSHTATNFSSPISP 320 330 340 350 360 370 260 pF1KE4 ---------------------------------------------------------PPL ::. CCDS30 PSPLTPKYGSFISRDSSLTGISEFASSFGTANIDSVLIDCFTGGHDSYLAIPGSVPRPPI 380 390 400 410 420 430 270 pF1KE4 S------SDGMYEY-----------------------GSIEKVK---------------- : ::.:. ::.. : CCDS30 SVSFPRAPDGFYRQDRGLLEGREGALTSGCDGDILRVGSLDTSKPRSSGILKRPQTLAIP 440 450 460 470 480 490 280 290 300 310 pF1KE4 --NGYVNPELA---------------MQGAKNKNH----AEQTRRAMTLKSLLRALVNMP : .:: . ..:.: ... .:.... ... : .. ::: CCDS30 DAAGGGGPETSRRRNVTFSQQVANILLNGVKYESELTGSSERAEQPLSVGRLCSTICNMP 500 510 520 530 540 550 320 330 340 350 360 370 pF1KE4 PHYRYLCISHLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNSTEFLIYERGVEVGCWGF : ::..:..:: .: . .::.:::::..:..::: . :.: . :. :: .::::. CCDS30 KALRTLCVNHFLGWLSFEGMLLFYTDFMGEVVFQGDPKAPHTSEAYQKYNSGVTMGCWGM 560 570 580 590 600 610 380 390 400 410 420 430 pF1KE4 CINSVFSSLYSYFQKVLVSYIGLKGLYFTGYLLFGLGTGFIGLFPNVYSTLVLCSLFGVM :: . ...:: . . : ..... ::: .:: ::::::. : :.: .: :: .:.. CCDS30 CIYAFSAAFYSAILEKLEEFLSVRTLYFIAYLAFGLGTGLATLSRNLYVVLSLCITYGIL 620 630 640 650 660 670 440 450 460 pF1KE4 SSTLYTVPFNLITEYHREEEKEVCCH ::: :.:..:. .:.. .. CCDS30 FSTLCTLPYSLLCDYYQSKKFAGSSADGTRRGMGVDISLLSCQYFLAQILVSLVLGPLTS 680 690 700 710 720 730 >-- initn: 410 init1: 410 opt: 410 Z-score: 502.9 bits: 103.1 E(32554): 1.1e-21 Smith-Waterman score: 410; 57.3% identity (84.5% similar) in 103 aa overlap (28-130:114-216) 10 20 30 40 50 pF1KE4 MGSNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVT :.: .:.... .:: :: ::.:.:::: CCDS30 PKRRKCIRPSPPPPPNTPCPLELVDFGDLHPQRSFRELLFNGCILFGIEFSYAMETAYVT 90 100 110 120 130 140 60 70 80 90 100 110 pF1KE4 PVLLSVGLPSSLYSIVWFLSPILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGM ::::..:::..:::.:::.::::::::::..:. ::.: ::.:::::.::.:.. :.:. CCDS30 PVLLQMGLPDQLYSLVWFISPILGFLLQPLLGAWSDRCTSRFGRRRPFILVLAIGALLGL 150 160 170 180 190 200 120 130 140 150 160 170 pF1KE4 ALYLNGATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFDVCSHQDKE .: ::: . :: CCDS30 SLLLNGRDIGIALADVTGNHKWGLLLTVCGVVLMDFSADSADNPSHAYMMDVCSPADQDR 210 220 230 240 250 260 >>CCDS1458.1 SLC45A3 gene_id:85414|Hs108|chr1 (553 aa) initn: 599 init1: 251 opt: 427 Z-score: 526.3 bits: 106.9 E(32554): 5.4e-23 Smith-Waterman score: 598; 29.1% identity (59.2% similar) in 426 aa overlap (33-454:16-408) 10 20 30 40 50 60 pF1KE4 SNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVTPVLLS ..:.. .. :: : : :. .:: :.:: CCDS14 MVQRLWVSRLLRHRKAQLLLVNLLTFGLEVCLAAGITYVPPLLLE 10 20 30 40 70 80 90 100 110 120 pF1KE4 VGLPSSLYSIVWFLSPILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGMALYLN ::. .....: ..:.::.. :..:::::: :.:.:::::.: .:.. .:... : CCDS14 VGVEEKFMTMVLGIGPVLGLVCVPLLGSASDHWRGRYGRRRPFIWALSLGILLSLFLIPR 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE4 GATVVAALIANPRRKLVWAISVTMIGVVLFDFAADFIDGPIKAYLFDVCSHQDK-EKGLH .. ... : .:: ... ..:: :.:: .. :..: : :. :. ... CCDS14 AGWLAGLLCPDPRP---LELALLILGVGLLDFCGQVCFTPLEALLSDLFRDPDHCRQAYS 110 120 130 140 150 160 190 200 210 220 230 240 pF1KE4 YHALFTGFGGALGYLLGAIDWAHLELGRLLGTEFQVMFFFSALVLTLCFTVHLCSISEAP .:.. ..:: ::::: :::: :. :::. . .: . .:.. : .. : :: CCDS14 VYAFMISLGGCLGYLLPAIDWDTSALAPYLGTQEECLFGLLTLIFLTCVAATLLVAEEAA 170 180 190 200 210 220 250 260 270 280 290 pF1KE4 L--TEVAKGIP-PQQTPQDPPLSSDGMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTR : :: :.:. :. .:. : . .. . : . :.: : : CCDS14 LGPTEPAEGLSAPSLSPHCCPCRARLAFR-------NLGALLPRL---------HQLCCR 230 240 250 260 300 310 320 330 340 350 pF1KE4 RAMTLKSLLRALVNMPPHYRYLCISHLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNST :: : : ...: .: :... ::.:::.:. .:.: : . .. CCDS14 --------------MPRTLRRLFVAELCSWMALMTFTLFYTDFVGEGLYQGVPRAEPGTE 270 280 290 300 310 360 370 380 390 400 410 pF1KE4 EFLIYERGVEVGCWGFCINSVFSSLYSYFQKVLVSYIGLKGLYFTGYLLFGLGTGFIGLF :..::..: :. .. ..: ..: . ::. .: ...:... : ...: : CCDS14 ARRHYDEGVRMGSLGLFLQCAISLVFSLVMDRLVQRFGTRAVYLASVAAFPVAAGATCLS 320 330 340 350 360 370 420 430 440 450 460 pF1KE4 PNVYSTLVLCSLFGVMSSTLYTVPFNLITEYHREEEKEVCCH .: . . .: : :.: .:..: . ::::.. CCDS14 HSVAVVTASAALTGFTFSALQILPYTLASLYHREKQVFLPKYRGDTGGASSEDSLMTSFL 380 390 400 410 420 430 CCDS14 PGPKPGAPFPNGHVGAGGSGLLPPPPALCGASACDVSVRVVVGEPTEARVVPGRGICLDL 440 450 460 470 480 490 460 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:36:06 2016 done: Sun Nov 6 00:36:06 2016 Total Scan time: 3.190 Total Display time: 0.070 Function used was FASTA [36.3.4 Apr, 2011]