FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6388, 377 aa
  1>>>pF1KE6388 377 - 377 aa - 377 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.1025+/-0.000755; mu= 17.8344+/- 0.046
 mean_var=63.0037+/-12.841, 0's: 0 Z-trim(108.0): 18 B-trim: 518 in 1/52
 Lambda= 0.161581
 statistics sampled from 9945 (9955) to 9945 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.672), E-opt: 0.2 (0.306), width: 16
 Scan time: 2.360

The best scores are:                                  opt  bits E(32554)
CCDS3614.1 SLC10A6 gene_id:345274|Hs108|chr4  ( 377) 2533 598.9 2.4e-171
CCDS9506.1 SLC10A2 gene_id:6555|Hs108|chr13   ( 348) 1114 268.1 8.3e-72
CCDS9797.1 SLC10A1 gene_id:6554|Hs108|chr14   ( 349)  723 177.0 2.3e-44
CCDS3482.1 SLC10A4 gene_id:201780|Hs108|chr4  ( 437)  711 174.2 1.9e-43
CCDS48195.1 SLC10A3 gene_id:8273|Hs108|chrX   ( 448)  451 113.6 3.4e-25
CCDS14755.1 SLC10A3 gene_id:8273|Hs108|chrX   ( 477)  451 113.6 3.6e-25
CCDS34915.1 SLC10A5 gene_id:347051|Hs108|chr8 ( 438)  444 112.0 1e-24

>>CCDS3614.1 SLC10A6 gene_id:345274|Hs108|chr4 (377 aa)
 initn: 2533 init1: 2533 opt: 2533 Z-score: 3189.8 bits: 598.9 E(32554): 2.4e-171
Smith-Waterman score: 2533; 99.7% identity (100.0% similar) in 377 aa overlap (1-377:1-377)

               10        20        30        40        50        60
pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEIRKLWS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEIRKLWS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 HIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVLIMGCCPGGTISNVFTFWVD
       :::::::::::::::::::::::::::::::::::::::::::::::::::::.::::::
CCDS36 HIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVLIMGCCPGGTISNIFTFWVD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 GDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIPYQNIGITLVCLTIPVAFGV
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 GDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIPYQNIGITLVCLTIPVAFGV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 YVNYRWPKQSKIILKIGAVVGGVLLLVVAVAGVVLAKGSWNSDITLLTISFIFPLIGHVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 YVNYRWPKQSKIILKIGAVVGGVLLLVVAVAGVVLAKGSWNSDITLLTISFIFPLIGHVT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 GFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLSFTAEHLVQMLSFPLAYGLFQLID :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 GFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLSFTAEHLVQMLSFPLAYGLFQLID 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 GFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEEGAITPGPPGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 GFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEEGAITPGPPGP 310 320 330 340 350 360 370 pF1KE6 MDCHRALEPVGHITSCE ::::::::::::::::: CCDS36 MDCHRALEPVGHITSCE 370 >>CCDS9506.1 SLC10A2 gene_id:6555|Hs108|chr13 (348 aa) initn: 1175 init1: 1069 opt: 1114 Z-score: 1402.6 bits: 268.1 E(32554): 8.3e-72 Smith-Waterman score: 1117; 47.5% identity (81.0% similar) in 326 aa overlap (30-355:30-345) 10 20 30 40 50 60 pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEIRKLWS : .:...: :....:.:::.::.:::.:. . CCDS95 MNDPNSCVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMFSMGCNVEIKKFLG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 HIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVLIMGCCPGGTISNVFTFWVD ::.::::: ::.:::::.::.:...:...:.. :.::..:::.::::::: ::....::: CCDS95 HIKRPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCCPGGTASNILAYWVD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 GDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIPYQNIGITLVCLTIPVAFGV ::::::.:::::::. ::::::::. .:: : . ...:::.::: .:: :..::..:. 
CCDS95 GDMDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVSLVVPVSIGM 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 YVNYRWPKQSKIILKIGAVVGGVLLLVVAVAGVVLAKGSWNSDITLLTISFIFPLIGHVT .::..::...:::::::...:..:....::.: .: ...: : :. :::. :. CCDS95 FVNHKWPQKAKIILKIGSIAGAILIVLIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYSL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 GFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLSFTAEHLVQMLSFPLAYGLFQLID ::::: .. : ::::...::: :: :.: :..::::: :.: ...::: :..::: CCDS95 GFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLAF 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 GFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEEGAITPGPPGP . .... : .::. :::... :. ..... . :.. : ..: :.. : CCDS95 AAIFLGFYVAYKK----CHGKNKA---EIPESKENGTEPESS-FYKAN--GGFQPDEK 310 320 330 340 370 pF1KE6 MDCHRALEPVGHITSCE >>CCDS9797.1 SLC10A1 gene_id:6554|Hs108|chr14 (349 aa) initn: 696 init1: 403 opt: 723 Z-score: 910.0 bits: 177.0 E(32554): 2.3e-44 Smith-Waterman score: 723; 39.4% identity (73.3% similar) in 292 aa overlap (31-313:24-309) 10 20 30 40 50 60 pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEIRKLWS .:...:. . :. ..:.::::..:. :. . CCDS97 MEAHNASAPFNFTLPPNFGKRPTDLALSVILVFMLFFIMLSLGCTMEFSKIKA 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 HIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVLIMGCCPGGTISNVFTFWVD :. .: :.:..:. :.:.::.::..:. : :: ..:.:.:. :: :::..::::.. . CCDS97 HLWKPKGLAIALVAQYGIMPLTAFVLGKVFRLKNIEALAILVCGCSPGGNLSNVFSLAMK 60 70 80 90 100 110 130 140 150 160 170 pF1KE6 GDMDLSISMTTCSTVAALGMMPLCIYLYTWS-WSLQQNLTIPYQNIGITLVCLTIPVAFG :::.::: :::::: ::::::: .:.:. . .. . . .::..: :.:: . :: ..: CCDS97 GDMNLSIVMTTCSTFCALGMMPLLLYIYSRGIYDGDLKDKVPYKGIVISLVLVLIPCTIG 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 VYVNYRWPKQSKIILKIGAVVGGVLLLVVAVAGVVLAKGSWNSDIT------LLTISFIF . .. . :. . ..: : .. .:: ::: .::. . ...: :.. : .. 
CCDS97 IVLKSKRPQYMRYVIKGGMII--ILLCSVAV--TVLSAINVGKSIMFAMTPLLIATSSLM 180 190 200 210 220 240 250 260 270 280 290 pF1KE6 PLIGHVTGFLL-ALFTHQSWQRCR-TISLETGAQNIQMCITMLQLSFTAEHLVQMLSFPL :.:: . :..: ::: .. ::: :.:.::: ::.:.: :.:...: : . .. ::: CCDS97 PFIGFLLGYVLSALFCLNG--RCRRTVSMETGCQNVQLCSTILNVAFPPEVIGPLFFFPL 230 240 250 260 270 280 300 310 320 330 340 350 pF1KE6 AYGLFQLIDGFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEEG : .::: .:.:..: . :.. CCDS97 LYMIFQLGEGLLLIAIFWCYEKFKTPKDKTKMIYTAATTEETIPGALGNGTYKGEDCSPC 290 300 310 320 330 340 >>CCDS3482.1 SLC10A4 gene_id:201780|Hs108|chr4 (437 aa) initn: 688 init1: 446 opt: 711 Z-score: 893.4 bits: 174.2 E(32554): 1.9e-43 Smith-Waterman score: 711; 38.1% identity (70.9% similar) in 278 aa overlap (44-318:115-392) 20 30 40 50 60 70 pF1KE6 NSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEIRKLWSHIRRPWGIAVGLL . :..:::.:.. .. .:.::: : .. : CCDS34 FPRPWAPHALPFWDTPLNHGLNVFVGAALCITMLGLGCTVDVNHFGAHVRRPVGALLAAL 90 100 110 120 130 140 80 90 100 110 120 130 pF1KE6 CQFGLMPFTAYLLAISFSLKPVQAIAVLIMGCCPGGTISNVFTFWVDGDMDLSISMTTCS :::::.:. :.:::..:.: : :.:::. ::::::..::.... :::::.::: :: : CCDS34 CQFGLLPLLAFLLALAFKLDEVAAVAVLLCGCCPGGNLSNLMSLLVDGDMNLSIIMTISS 150 160 170 180 190 200 140 150 160 170 180 190 pF1KE6 TVAALGMMPLCIYLYTWSW-SLQQNLTIPYQNIGITLVCLTIPVAFGVYVNYRWPKQSKI :. :: .::::...:.:.: . .: .. .:: ::...::.. :.. . . CCDS34 TLLALVLMPLCLWIYSWAWINTPIVQLLPLGTVTLTLCSTLIPIGLGVFIRYKYSRVADY 210 220 230 240 250 260 200 210 220 230 240 250 pF1KE6 ILKIGAVVGGVLLLVVAV-AGVVLAKGSWNS-DITLLTISFIFPLIGHVTGFLLALFTHQ :.:.. : :.:. . .:..:. : .. .:....:: :...:. :: . : CCDS34 IVKVSLWSLLVTLVVLFIMTGTMLGPELLASIPAAVYVIAIFMPLAGYASGYGLATLFHL 270 280 290 300 310 320 260 270 280 290 300 310 pF1KE6 SWQRCRTISLETGAQNIQMCITMLQLSFTAEHLVQMLSFPLAYGLFQLIDGFLIVAAYQT . ::. ::::.::.:.: ..:.:.: . . .: ::: :.::: .. ..: :. 
CCDS34 PPNCKRTVCLETGSQNVQLCTAILKLAFPPQFIGSMYMFPLLYALFQSAEAGIFVLIYKM 330 340 350 360 370 380 320 330 340 350 360 370 pF1KE6 YKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEEGAITPGPPGPMDCHRALEPV : .. .: CCDS34 YGSEMLHKRDPLDEDEDTDISYKKLKEEEMADTSYGTVKAENIIMMETAQTSL 390 400 410 420 430 >>CCDS48195.1 SLC10A3 gene_id:8273|Hs108|chrX (448 aa) initn: 453 init1: 356 opt: 451 Z-score: 565.7 bits: 113.6 E(32554): 3.4e-25 Smith-Waterman score: 451; 31.7% identity (63.5% similar) in 271 aa overlap (12-278:143-408) 10 20 30 40 pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVM ::... : . : .. ... .. .. CCDS48 EVLTIKNLVDAHEAPPTLIEERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIF 120 130 140 150 160 170 50 60 70 80 90 100 pF1KE6 MGLLMFSLGCSVEIRKLWSHIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVL .. :.::.::.. : . .. : . .::: :: .::. :.:.: : : . :.... CCDS48 VN--KCSFGCKVELEVLKGLMQSPQPMLLGLLGQFLVMPLYAFLMAKVFMLPKALALGLI 180 190 200 210 220 230 110 120 130 140 150 160 pF1KE6 IMGCCPGGTISNVFTFWVDGDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIP : ::: : .:.. . ::. :.:::: ::::: :..:: .:. :....: .: CCDS48 ITCSSPGGGGSYLFSLLLGGDVTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVP 240 250 260 270 280 290 170 180 190 200 210 pF1KE6 YQNIGITLVCLTIPVAFGVYVNYRWPKQSKIILKIGAVVGGVLLL----VVAVAGVVLAK ..: ::. ..::.: :: .. . :: :...:.. . :::: .. :: . CCDS48 ISKILGTLLFIAIPIAVGVLIKSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGVFILA 300 310 320 330 340 350 220 230 240 250 260 270 pF1KE6 GSWNSDITLLTISFIFPLIGHVTGFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLS : . .. ... ::.: ..:. :: . . ::.:.:.:.:: . ..::::: CCDS48 GI---RLPIVLVGITVPLVGLLVGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLS 360 370 380 390 400 280 290 300 310 320 330 pF1KE6 FTAEHLVQMLSFPLAYGLFQLIDGFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTS . 
CCDS48 LRRLQADYASQAPFIVALSGTSEMLALVIGHFIYSSLFPVP 410 420 430 440 >>CCDS14755.1 SLC10A3 gene_id:8273|Hs108|chrX (477 aa) initn: 453 init1: 356 opt: 451 Z-score: 565.3 bits: 113.6 E(32554): 3.6e-25 Smith-Waterman score: 451; 31.7% identity (63.5% similar) in 271 aa overlap (12-278:172-437) 10 20 30 40 pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVM ::... : . : .. ... .. .. CCDS14 LAPLHIQLVDAHEAPPTLIEERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIF 150 160 170 180 190 200 50 60 70 80 90 100 pF1KE6 MGLLMFSLGCSVEIRKLWSHIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVL .. :.::.::.. : . .. : . .::: :: .::. :.:.: : : . :.... CCDS14 VN--KCSFGCKVELEVLKGLMQSPQPMLLGLLGQFLVMPLYAFLMAKVFMLPKALALGLI 210 220 230 240 250 110 120 130 140 150 160 pF1KE6 IMGCCPGGTISNVFTFWVDGDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIP : ::: : .:.. . ::. :.:::: ::::: :..:: .:. :....: .: CCDS14 ITCSSPGGGGSYLFSLLLGGDVTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVP 260 270 280 290 300 310 170 180 190 200 210 pF1KE6 YQNIGITLVCLTIPVAFGVYVNYRWPKQSKIILKIGAVVGGVLLL----VVAVAGVVLAK ..: ::. ..::.: :: .. . :: :...:.. . :::: .. :: . CCDS14 ISKILGTLLFIAIPIAVGVLIKSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGVFILA 320 330 340 350 360 370 220 230 240 250 260 270 pF1KE6 GSWNSDITLLTISFIFPLIGHVTGFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLS : . .. ... ::.: ..:. :: . . ::.:.:.:.:: . ..::::: CCDS14 GI---RLPIVLVGITVPLVGLLVGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLS 380 390 400 410 420 430 280 290 300 310 320 330 pF1KE6 FTAEHLVQMLSFPLAYGLFQLIDGFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTS . CCDS14 LRRLQADYASQAPFIVALSGTSEMLALVIGHFIYSSLFPVP 440 450 460 470 >>CCDS34915.1 SLC10A5 gene_id:347051|Hs108|chr8 (438 aa) initn: 370 init1: 333 opt: 444 Z-score: 557.0 bits: 112.0 E(32554): 1e-24 Smith-Waterman score: 444; 27.0% identity (64.2% similar) in 293 aa overlap (26-313:137-423) 10 20 30 40 50 pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEI .: . .... .. .... : ::..:. 
CCDS34 SEGRQERLIEEIKNVKVKVLKQKDSLLQAPMHIDRNILMLILPLILLNKCAF--GCKIEL 110 120 130 140 150 160 60 70 80 90 100 110 pF1KE6 RKLWSHIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVLIMGC-CPGGTISNV . . . .:: . .: . :: :::: ..::. .: .::..: .: : :::: . . CCDS34 QLFQTVWKRPLPVILGAVTQFFLMPFCGFLLSQIVALPEAQAFGV-VMTCTCPGGGGGYL 170 180 190 200 210 220 120 130 140 150 160 170 pF1KE6 FTFWVDGDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIPYQNIGITLVCLTI :.. .:::. :.: :: ::. :: :::. :.:. .:. .. :: ..: ::. . . CCDS34 FALLLDGDFTLAILMTCTSTLLALIMMPVNSYIYSRILGLSGTFHIPVSKIVSTLLFILV 230 240 250 260 270 280 180 190 200 210 220 230 pF1KE6 PVAFGVYVNYRWPKQSKIILKIGAVVGGVLLLV----VAVAGVVLAKGSWNSDITLLTIS ::..:. ...: :...... .: .. .:..: . ..:.:. : .... .. .. CCDS34 PVSIGIVIKHRIPEKASFLERIIRPLSFILMFVGIYLTFTVGLVFLK---TDNLEVILLG 290 300 310 320 330 340 240 250 260 270 280 290 pF1KE6 FIFPLIGHVTGFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLSFTAEHLVQMLSFP .. : .: . :. .: :.:...:.: : . ....:::: . : CCDS34 LLVPALGLLFGYSFAKVCTLPLPVCKTVAIESGMLNSFLALAVIQLSFPQSKANLASVAP 350 360 370 380 390 400 300 310 320 330 340 350 pF1KE6 LAYGLFQLIDGFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEE .. .. . . .::. .:.. :: CCDS34 FTVAMCSGCEMLLIILVYKAKKRCIFFLQDKRKRNFLI 410 420 430 377 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:49:35 2016 done: Tue Nov 8 12:49:36 2016 Total Scan time: 2.360 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]