FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6388, 377 aa
1>>>pF1KE6388 377 - 377 aa - 377 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1025+/-0.000755; mu= 17.8344+/- 0.046
mean_var=63.0037+/-12.841, 0's: 0 Z-trim(108.0): 18 B-trim: 518 in 1/52
Lambda= 0.161581
statistics sampled from 9945 (9955) to 9945 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.672), E-opt: 0.2 (0.306), width: 16
Scan time: 2.360
The best scores are: opt bits E(32554)
CCDS3614.1 SLC10A6 gene_id:345274|Hs108|chr4 ( 377) 2533 598.9 2.4e-171
CCDS9506.1 SLC10A2 gene_id:6555|Hs108|chr13 ( 348) 1114 268.1 8.3e-72
CCDS9797.1 SLC10A1 gene_id:6554|Hs108|chr14 ( 349) 723 177.0 2.3e-44
CCDS3482.1 SLC10A4 gene_id:201780|Hs108|chr4 ( 437) 711 174.2 1.9e-43
CCDS48195.1 SLC10A3 gene_id:8273|Hs108|chrX ( 448) 451 113.6 3.4e-25
CCDS14755.1 SLC10A3 gene_id:8273|Hs108|chrX ( 477) 451 113.6 3.6e-25
CCDS34915.1 SLC10A5 gene_id:347051|Hs108|chr8 ( 438) 444 112.0 1e-24
>>CCDS3614.1 SLC10A6 gene_id:345274|Hs108|chr4 (377 aa)
initn: 2533 init1: 2533 opt: 2533 Z-score: 3189.8 bits: 598.9 E(32554): 2.4e-171
Smith-Waterman score: 2533; 99.7% identity (100.0% similar) in 377 aa overlap (1-377:1-377)
10 20 30 40 50 60
pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEIRKLWS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEIRKLWS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 HIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVLIMGCCPGGTISNVFTFWVD
:::::::::::::::::::::::::::::::::::::::::::::::::::::.::::::
CCDS36 HIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVLIMGCCPGGTISNIFTFWVD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 GDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIPYQNIGITLVCLTIPVAFGV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 GDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIPYQNIGITLVCLTIPVAFGV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 YVNYRWPKQSKIILKIGAVVGGVLLLVVAVAGVVLAKGSWNSDITLLTISFIFPLIGHVT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 YVNYRWPKQSKIILKIGAVVGGVLLLVVAVAGVVLAKGSWNSDITLLTISFIFPLIGHVT
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE6 GFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLSFTAEHLVQMLSFPLAYGLFQLID
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 GFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLSFTAEHLVQMLSFPLAYGLFQLID
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE6 GFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEEGAITPGPPGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 GFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEEGAITPGPPGP
310 320 330 340 350 360
370
pF1KE6 MDCHRALEPVGHITSCE
:::::::::::::::::
CCDS36 MDCHRALEPVGHITSCE
370
>>CCDS9506.1 SLC10A2 gene_id:6555|Hs108|chr13 (348 aa)
initn: 1175 init1: 1069 opt: 1114 Z-score: 1402.6 bits: 268.1 E(32554): 8.3e-72
Smith-Waterman score: 1117; 47.5% identity (81.0% similar) in 326 aa overlap (30-355:30-345)
10 20 30 40 50 60
pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEIRKLWS
: .:...: :....:.:::.::.:::.:. .
CCDS95 MNDPNSCVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMFSMGCNVEIKKFLG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 HIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVLIMGCCPGGTISNVFTFWVD
::.::::: ::.:::::.::.:...:...:.. :.::..:::.::::::: ::....:::
CCDS95 HIKRPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCCPGGTASNILAYWVD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 GDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIPYQNIGITLVCLTIPVAFGV
::::::.:::::::. ::::::::. .:: : . ...:::.::: .:: :..::..:.
CCDS95 GDMDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVSLVVPVSIGM
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 YVNYRWPKQSKIILKIGAVVGGVLLLVVAVAGVVLAKGSWNSDITLLTISFIFPLIGHVT
.::..::...:::::::...:..:....::.: .: ...: : :. :::. :.
CCDS95 FVNHKWPQKAKIILKIGSIAGAILIVLIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYSL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE6 GFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLSFTAEHLVQMLSFPLAYGLFQLID
::::: .. : ::::...::: :: :.: :..::::: :.: ...::: :..:::
CCDS95 GFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLAF
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE6 GFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEEGAITPGPPGP
. .... : .::. :::... :. ..... . :.. : ..: :.. :
CCDS95 AAIFLGFYVAYKK----CHGKNKA---EIPESKENGTEPESS-FYKAN--GGFQPDEK
310 320 330 340
370
pF1KE6 MDCHRALEPVGHITSCE
>>CCDS9797.1 SLC10A1 gene_id:6554|Hs108|chr14 (349 aa)
initn: 696 init1: 403 opt: 723 Z-score: 910.0 bits: 177.0 E(32554): 2.3e-44
Smith-Waterman score: 723; 39.4% identity (73.3% similar) in 292 aa overlap (31-313:24-309)
10 20 30 40 50 60
pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEIRKLWS
.:...:. . :. ..:.::::..:. :. .
CCDS97 MEAHNASAPFNFTLPPNFGKRPTDLALSVILVFMLFFIMLSLGCTMEFSKIKA
10 20 30 40 50
70 80 90 100 110 120
pF1KE6 HIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVLIMGCCPGGTISNVFTFWVD
:. .: :.:..:. :.:.::.::..:. : :: ..:.:.:. :: :::..::::.. .
CCDS97 HLWKPKGLAIALVAQYGIMPLTAFVLGKVFRLKNIEALAILVCGCSPGGNLSNVFSLAMK
60 70 80 90 100 110
130 140 150 160 170
pF1KE6 GDMDLSISMTTCSTVAALGMMPLCIYLYTWS-WSLQQNLTIPYQNIGITLVCLTIPVAFG
:::.::: :::::: ::::::: .:.:. . .. . . .::..: :.:: . :: ..:
CCDS97 GDMNLSIVMTTCSTFCALGMMPLLLYIYSRGIYDGDLKDKVPYKGIVISLVLVLIPCTIG
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE6 VYVNYRWPKQSKIILKIGAVVGGVLLLVVAVAGVVLAKGSWNSDIT------LLTISFIF
. .. . :. . ..: : .. .:: ::: .::. . ...: :.. : ..
CCDS97 IVLKSKRPQYMRYVIKGGMII--ILLCSVAV--TVLSAINVGKSIMFAMTPLLIATSSLM
180 190 200 210 220
240 250 260 270 280 290
pF1KE6 PLIGHVTGFLL-ALFTHQSWQRCR-TISLETGAQNIQMCITMLQLSFTAEHLVQMLSFPL
:.:: . :..: ::: .. ::: :.:.::: ::.:.: :.:...: : . .. :::
CCDS97 PFIGFLLGYVLSALFCLNG--RCRRTVSMETGCQNVQLCSTILNVAFPPEVIGPLFFFPL
230 240 250 260 270 280
300 310 320 330 340 350
pF1KE6 AYGLFQLIDGFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEEG
: .::: .:.:..: . :..
CCDS97 LYMIFQLGEGLLLIAIFWCYEKFKTPKDKTKMIYTAATTEETIPGALGNGTYKGEDCSPC
290 300 310 320 330 340
>>CCDS3482.1 SLC10A4 gene_id:201780|Hs108|chr4 (437 aa)
initn: 688 init1: 446 opt: 711 Z-score: 893.4 bits: 174.2 E(32554): 1.9e-43
Smith-Waterman score: 711; 38.1% identity (70.9% similar) in 278 aa overlap (44-318:115-392)
20 30 40 50 60 70
pF1KE6 NSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEIRKLWSHIRRPWGIAVGLL
. :..:::.:.. .. .:.::: : .. :
CCDS34 FPRPWAPHALPFWDTPLNHGLNVFVGAALCITMLGLGCTVDVNHFGAHVRRPVGALLAAL
90 100 110 120 130 140
80 90 100 110 120 130
pF1KE6 CQFGLMPFTAYLLAISFSLKPVQAIAVLIMGCCPGGTISNVFTFWVDGDMDLSISMTTCS
:::::.:. :.:::..:.: : :.:::. ::::::..::.... :::::.::: :: :
CCDS34 CQFGLLPLLAFLLALAFKLDEVAAVAVLLCGCCPGGNLSNLMSLLVDGDMNLSIIMTISS
150 160 170 180 190 200
140 150 160 170 180 190
pF1KE6 TVAALGMMPLCIYLYTWSW-SLQQNLTIPYQNIGITLVCLTIPVAFGVYVNYRWPKQSKI
:. :: .::::...:.:.: . .: .. .:: ::...::.. :.. . .
CCDS34 TLLALVLMPLCLWIYSWAWINTPIVQLLPLGTVTLTLCSTLIPIGLGVFIRYKYSRVADY
210 220 230 240 250 260
200 210 220 230 240 250
pF1KE6 ILKIGAVVGGVLLLVVAV-AGVVLAKGSWNS-DITLLTISFIFPLIGHVTGFLLALFTHQ
:.:.. : :.:. . .:..:. : .. .:....:: :...:. :: . :
CCDS34 IVKVSLWSLLVTLVVLFIMTGTMLGPELLASIPAAVYVIAIFMPLAGYASGYGLATLFHL
270 280 290 300 310 320
260 270 280 290 300 310
pF1KE6 SWQRCRTISLETGAQNIQMCITMLQLSFTAEHLVQMLSFPLAYGLFQLIDGFLIVAAYQT
. ::. ::::.::.:.: ..:.:.: . . .: ::: :.::: .. ..: :.
CCDS34 PPNCKRTVCLETGSQNVQLCTAILKLAFPPQFIGSMYMFPLLYALFQSAEAGIFVLIYKM
330 340 350 360 370 380
320 330 340 350 360 370
pF1KE6 YKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEEGAITPGPPGPMDCHRALEPV
: .. .:
CCDS34 YGSEMLHKRDPLDEDEDTDISYKKLKEEEMADTSYGTVKAENIIMMETAQTSL
390 400 410 420 430
>>CCDS48195.1 SLC10A3 gene_id:8273|Hs108|chrX (448 aa)
initn: 453 init1: 356 opt: 451 Z-score: 565.7 bits: 113.6 E(32554): 3.4e-25
Smith-Waterman score: 451; 31.7% identity (63.5% similar) in 271 aa overlap (12-278:143-408)
10 20 30 40
pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVM
::... : . : .. ... .. ..
CCDS48 EVLTIKNLVDAHEAPPTLIEERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIF
120 130 140 150 160 170
50 60 70 80 90 100
pF1KE6 MGLLMFSLGCSVEIRKLWSHIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVL
.. :.::.::.. : . .. : . .::: :: .::. :.:.: : : . :....
CCDS48 VN--KCSFGCKVELEVLKGLMQSPQPMLLGLLGQFLVMPLYAFLMAKVFMLPKALALGLI
180 190 200 210 220 230
110 120 130 140 150 160
pF1KE6 IMGCCPGGTISNVFTFWVDGDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIP
: ::: : .:.. . ::. :.:::: ::::: :..:: .:. :....: .:
CCDS48 ITCSSPGGGGSYLFSLLLGGDVTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVP
240 250 260 270 280 290
170 180 190 200 210
pF1KE6 YQNIGITLVCLTIPVAFGVYVNYRWPKQSKIILKIGAVVGGVLLL----VVAVAGVVLAK
..: ::. ..::.: :: .. . :: :...:.. . :::: .. :: .
CCDS48 ISKILGTLLFIAIPIAVGVLIKSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGVFILA
300 310 320 330 340 350
220 230 240 250 260 270
pF1KE6 GSWNSDITLLTISFIFPLIGHVTGFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLS
: . .. ... ::.: ..:. :: . . ::.:.:.:.:: . ..:::::
CCDS48 GI---RLPIVLVGITVPLVGLLVGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLS
360 370 380 390 400
280 290 300 310 320 330
pF1KE6 FTAEHLVQMLSFPLAYGLFQLIDGFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTS
.
CCDS48 LRRLQADYASQAPFIVALSGTSEMLALVIGHFIYSSLFPVP
410 420 430 440
>>CCDS14755.1 SLC10A3 gene_id:8273|Hs108|chrX (477 aa)
initn: 453 init1: 356 opt: 451 Z-score: 565.3 bits: 113.6 E(32554): 3.6e-25
Smith-Waterman score: 451; 31.7% identity (63.5% similar) in 271 aa overlap (12-278:172-437)
10 20 30 40
pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVM
::... : . : .. ... .. ..
CCDS14 LAPLHIQLVDAHEAPPTLIEERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIF
150 160 170 180 190 200
50 60 70 80 90 100
pF1KE6 MGLLMFSLGCSVEIRKLWSHIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVL
.. :.::.::.. : . .. : . .::: :: .::. :.:.: : : . :....
CCDS14 VN--KCSFGCKVELEVLKGLMQSPQPMLLGLLGQFLVMPLYAFLMAKVFMLPKALALGLI
210 220 230 240 250
110 120 130 140 150 160
pF1KE6 IMGCCPGGTISNVFTFWVDGDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIP
: ::: : .:.. . ::. :.:::: ::::: :..:: .:. :....: .:
CCDS14 ITCSSPGGGGSYLFSLLLGGDVTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVP
260 270 280 290 300 310
170 180 190 200 210
pF1KE6 YQNIGITLVCLTIPVAFGVYVNYRWPKQSKIILKIGAVVGGVLLL----VVAVAGVVLAK
..: ::. ..::.: :: .. . :: :...:.. . :::: .. :: .
CCDS14 ISKILGTLLFIAIPIAVGVLIKSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGVFILA
320 330 340 350 360 370
220 230 240 250 260 270
pF1KE6 GSWNSDITLLTISFIFPLIGHVTGFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLS
: . .. ... ::.: ..:. :: . . ::.:.:.:.:: . ..:::::
CCDS14 GI---RLPIVLVGITVPLVGLLVGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLS
380 390 400 410 420 430
280 290 300 310 320 330
pF1KE6 FTAEHLVQMLSFPLAYGLFQLIDGFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTS
.
CCDS14 LRRLQADYASQAPFIVALSGTSEMLALVIGHFIYSSLFPVP
440 450 460 470
>>CCDS34915.1 SLC10A5 gene_id:347051|Hs108|chr8 (438 aa)
initn: 370 init1: 333 opt: 444 Z-score: 557.0 bits: 112.0 E(32554): 1e-24
Smith-Waterman score: 444; 27.0% identity (64.2% similar) in 293 aa overlap (26-313:137-423)
10 20 30 40 50
pF1KE6 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEI
.: . .... .. .... : ::..:.
CCDS34 SEGRQERLIEEIKNVKVKVLKQKDSLLQAPMHIDRNILMLILPLILLNKCAF--GCKIEL
110 120 130 140 150 160
60 70 80 90 100 110
pF1KE6 RKLWSHIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVLIMGC-CPGGTISNV
. . . .:: . .: . :: :::: ..::. .: .::..: .: : :::: . .
CCDS34 QLFQTVWKRPLPVILGAVTQFFLMPFCGFLLSQIVALPEAQAFGV-VMTCTCPGGGGGYL
170 180 190 200 210 220
120 130 140 150 160 170
pF1KE6 FTFWVDGDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIPYQNIGITLVCLTI
:.. .:::. :.: :: ::. :: :::. :.:. .:. .. :: ..: ::. . .
CCDS34 FALLLDGDFTLAILMTCTSTLLALIMMPVNSYIYSRILGLSGTFHIPVSKIVSTLLFILV
230 240 250 260 270 280
180 190 200 210 220 230
pF1KE6 PVAFGVYVNYRWPKQSKIILKIGAVVGGVLLLV----VAVAGVVLAKGSWNSDITLLTIS
::..:. ...: :...... .: .. .:..: . ..:.:. : .... .. ..
CCDS34 PVSIGIVIKHRIPEKASFLERIIRPLSFILMFVGIYLTFTVGLVFLK---TDNLEVILLG
290 300 310 320 330 340
240 250 260 270 280 290
pF1KE6 FIFPLIGHVTGFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLSFTAEHLVQMLSFP
.. : .: . :. .: :.:...:.: : . ....:::: . :
CCDS34 LLVPALGLLFGYSFAKVCTLPLPVCKTVAIESGMLNSFLALAVIQLSFPQSKANLASVAP
350 360 370 380 390 400
300 310 320 330 340 350
pF1KE6 LAYGLFQLIDGFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEE
.. .. . . .::. .:.. ::
CCDS34 FTVAMCSGCEMLLIILVYKAKKRCIFFLQDKRKRNFLI
410 420 430
377 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 12:49:35 2016 done: Tue Nov 8 12:49:36 2016
Total Scan time: 2.360 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]