FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4557, 679 aa
  1>>>pF1KE4557 679 - 679 aa - 679 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4210+/-0.00113; mu= 19.4954+/- 0.068
 mean_var=75.2533+/-14.991, 0's: 0 Z-trim(102.5): 25 B-trim: 0 in 0/47
 Lambda= 0.147847
 statistics sampled from 6984 (6987) to 6984 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.561), E-opt: 0.2 (0.215), width: 16
 Scan time: 3.480

The best scores are:                                      opt bits E(32554)
CCDS2099.1 SLC20A1 gene_id:6574|Hs108|chr2         ( 679) 4456 960.6       0
CCDS6132.1 SLC20A2 gene_id:6575|Hs108|chr8         ( 652) 1238 274.2 4.1e-73

>>CCDS2099.1 SLC20A1 gene_id:6574|Hs108|chr2 (679 aa)
 initn: 4456 init1: 4456 opt: 4456 Z-score: 5135.5 bits: 960.6 E(32554): 0
Smith-Waterman score: 4456; 100.0% identity (100.0% similar) in 679 aa overlap (1-679:1-679)

               10        20        30        40        50        60
pF1KE4 MATLITSTTAATAASGPLVDYLWMLILGFIIAFVLAFSVGANDVANSFGTAVGSGVVTLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 MATLITSTTAATAASGPLVDYLWMLILGFIIAFVLAFSVGANDVANSFGTAVGSGVVTLK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 QACILASIFETVGSVLLGAKVSETIRKGLIDVEMYNSTQGLLMAGSVSAMFGSAVWQLVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 QACILASIFETVGSVLLGAKVSETIRKGLIDVEMYNSTQGLLMAGSVSAMFGSAVWQLVA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 SFLKLPISGTHCIVGATIGFSLVAKGQEGVKWSELIKIVMSWFVSPLLSGIMSGILFFLV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 SFLKLPISGTHCIVGATIGFSLVAKGQEGVKWSELIKIVMSWFVSPLLSGIMSGILFFLV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 RAFILHKADPVPNGLRALPVFYACTVGINLFSIMYTGAPLLGFDKLPLWGTILISVGCAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 RAFILHKADPVPNGLRALPVFYACTVGINLFSIMYTGAPLLGFDKLPLWGTILISVGCAV
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 FCALIVWFFVCPRMKRKIEREIKCSPSESPLMEKKNSLKEDHEETKLSVGDIENKHPVSE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 FCALIVWFFVCPRMKRKIEREIKCSPSESPLMEKKNSLKEDHEETKLSVGDIENKHPVSE
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE4 VGPATVPLQAVVEERTVSFKLGDLEEAPERERLPSVDLKEETSIDSTVNGAVQLPNGNLV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 VGPATVPLQAVVEERTVSFKLGDLEEAPERERLPSVDLKEETSIDSTVNGAVQLPNGNLV
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE4 QFSQAVSNQINSSGHYQYHTVHKDSGLYKELLHKLHLAKVGDCMGDSGDKPLRRNNSYTS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 QFSQAVSNQINSSGHYQYHTVHKDSGLYKELLHKLHLAKVGDCMGDSGDKPLRRNNSYTS
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE4 YTMAICGMPLDSFRAKEGEQKGEEMEKLTWPNADSKKRIRMDSYTSYCNAVSDLHSASEI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 YTMAICGMPLDSFRAKEGEQKGEEMEKLTWPNADSKKRIRMDSYTSYCNAVSDLHSASEI
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KE4 DMSVKAEMGLGDRKGSNGSLEEWYDQDKPEVSLLFQFLQILTACFGSFAHGGNDVSNAIG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 DMSVKAEMGLGDRKGSNGSLEEWYDQDKPEVSLLFQFLQILTACFGSFAHGGNDVSNAIG
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KE4 PLVALYLVYDTGDVSSKVATPIWLLLYGGVGICVGLWVWGRRVIQTMGKDLTPITPSSGF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 PLVALYLVYDTGDVSSKVATPIWLLLYGGVGICVGLWVWGRRVIQTMGKDLTPITPSSGF
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KE4 SIELASALTVVIASNIGLPISTTHCKVGSVVSVGWLRSKKAVDWRLFRNIFMAWFVTVPI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 SIELASALTVVIASNIGLPISTTHCKVGSVVSVGWLRSKKAVDWRLFRNIFMAWFVTVPI
              610       620       630       640       650       660

              670
pF1KE4 SGVISAAIMAIFRYVILRM
       :::::::::::::::::::
CCDS20 SGVISAAIMAIFRYVILRM
              670

>>CCDS6132.1 SLC20A2 gene_id:6575|Hs108|chr8 (652 aa)
 initn: 2430 init1: 1096 opt: 1238 Z-score: 1426.1 bits: 274.2 E(32554): 4.1e-73
Smith-Waterman score: 2521; 60.7% identity (80.1% similar) in 672 aa overlap (20-677:5-649)

               10        20        30        40        50        60
pF1KE4 MATLITSTTAATAASGPLVDYLWMLILGFIIAFVLAFSVGANDVANSFGTAVGSGVVTLK
                          .::::.::::::::.:::::::::::::::::::::::::.
CCDS61                MAMDEYLWMVILGFIIAFILAFSVGANDVANSFGTAVGSGVVTLR
                              10        20        30        40

               70        80        90       100       110       120
pF1KE4 QACILASIFETVGSVLLGAKVSETIRKGLIDVEMYNSTQGLLMAGSVSAMFGSAVWQLVA
       :::::::::::.:::::::::.::::::.:::..:: :   :::: :::: :::::::.:
CCDS61 QACILASIFETTGSVLLGAKVGETIRKGIIDVNLYNETVETLMAGEVSAMVGSAVWQLIA
          50        60        70        80        90       100

              130       140       150       160       170       180
pF1KE4 SFLKLPISGTHCIVGATIGFSLVAKGQEGVKWSELIKIVMSWFVSPLLSGIMSGILFFLV
       :::.:::::::::::.:::::::: : .::.: ::.::: :::.::::::.:::.:: :.
CCDS61 SFLRLPISGTHCIVGSTIGFSLVAIGTKGVQWMELVKIVASWFISPLLSGFMSGLLFVLI
         110       120       130       140       150       160

              190       200       210       220       230       240
pF1KE4 RAFILHKADPVPNGLRALPVFYACTVGINLFSIMYTGAPLLGFDKLPLWGTILISVGCAV
       : :::.: ::::::::::::::: :..::.:::::::::.::.  ::.:.  ::: : :.
CCDS61 RIFILKKEDPVPNGLRALPVFYAATIAINVFSIMYTGAPVLGL-VLPMWAIALISFGVAL
         170       180       190       200        210       220

              250       260       270       280       290       300
pF1KE4 FCALIVWFFVCPRMKRKIEREIKCSPSESPLMEKKNSLKEDHEETKLSVGDIENKHPVSE
       . :..::.:::: :.:::  .          ..:...:..  .:.  .: . :.  ::  .
CCDS61 LFAFFVWLFVCPWMRRKITGK----------LQKEGALSRVSDESLSKVQEAES--PVFK
          230       240                 250       260         270

                     310       320       330       340       350
pF1KE4 VGP-------ATVPLQAVVEERTVSFKLGDLEEAPERERLPSVDLKEETSIDSTVNGAVQ
         :       .:.:: ... :      ::   :.      : .   .  :.   ..:.:.
CCDS61 ELPGAKANDDSTIPLTGAAGET-----LGT-SEGTSAGSHPRAAYGRALSM---THGSVK
            280       290             300       310          320

           360       370       380       390       400         410
pF1KE4 LPNGNLVQFSQAVSNQINSSGHYQYHTVHKDSGLYKELLHKLHLAKVGDC--MGDSGDKP
        : .: . :.   ...  :.::  ::::::::::::.::::.:. .  .     .:. .
CCDS61 SPISNGT-FG--FDGHTRSDGHV-YHTVHKDSGLYKDLLHKIHIDRGPEEKPAQESNYRL
           330          340        350       360       370

             420       430        440       450        460
pF1KE4 LRRNNSYTSYTMAICGMPLD-SFRAKEGEQKGEEMEKLTWPNAD-SKKRIRMDSYTSYCN
       :::::::: :: ::::.:.  .::: .. .  :. :::.  ... ::::.:.:::.::::
CCDS61 LRRNNSYTCYTAAICGLPVHATFRAADS-SAPEDSEKLVGDTVSYSKKRLRYDSYSSYCN
     380       390       400        410       420       430

     470          480       490       500       510       520
pF1KE4 AVSDLHSASE---IDMSVKAEMGLGDRKGSNGSLEEWYDQDKPEVSLLFQFLQILTACFG
       ::.. .  .:   ..:.. .:..  :.   . . ::  ..: ::: :::.:::.::::::
CCDS61 AVAEAEIEAEEGGVEMKLASELADPDQPREDPAEEEKEEKDAPEVHLLFHFLQVLTACFG
      440       450       460       470       480       490

        530       540       550       560       570       580
pF1KE4 SFAHGGNDVSNAIGPLVALYLVYDTGDVSSKVATPIWLLLYGGVGICVGLWVWGRRVIQT
       :::::::::::::::::::.:.:  : :....:::.:::.:::::::.::::::::::::
CCDS61 SFAHGGNDVSNAIGPLVALWLIYKQGGVTQEAATPVWLLFYGGVGICTGLWVWGRRVIQT
      500       510       520       530       540       550

        590       600       610       620       630       640
pF1KE4 MGKDLTPITPSSGFSIELASALTVVIASNIGLPISTTHCKVGSVVSVGWLRSKKAVDWRL
       ::::::::::::::.::::::.:::::::::::.:::::::::::.:::.::.:::::::
CCDS61 MGKDLTPITPSSGFTIELASAFTVVIASNIGLPVSTTHCKVGSVVAVGWIRSRKAVDWRL
      560       570       580       590       600       610

        650       660       670
pF1KE4 FRNIFMAWFVTVPISGVISAAIMAIFRYVILRM
       :::::.:::::::..:..:::.::.. : ::
CCDS61 FRNIFVAWFVTVPVAGLFSAAVMALLMYGILPYV
      620       630       640       650

679 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 23:57:34 2016 done: Sat Nov 5 23:57:34 2016
 Total Scan time: 3.480 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]