FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4495, 519 aa
1>>>pF1KE4495 519 - 519 aa - 519 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.3122+/-0.000847; mu= 18.6492+/- 0.051
mean_var=83.3882+/-16.305, 0's: 0 Z-trim(108.3): 24 B-trim: 11 in 1/50
Lambda= 0.140450
statistics sampled from 10098 (10112) to 10098 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.687), E-opt: 0.2 (0.311), width: 16
Scan time: 2.520
The best scores are: opt bits E(32554)
CCDS9470.1 DCT gene_id:1638|Hs108|chr13 ( 519) 3703 760.4 0
CCDS45060.1 DCT gene_id:1638|Hs108|chr13 ( 552) 2857 589.0 4.7e-168
CCDS34990.1 TYRP1 gene_id:7306|Hs108|chr9 ( 537) 1763 367.3 2.5e-101
CCDS8284.1 TYR gene_id:7299|Hs108|chr11 ( 529) 1389 291.5 1.6e-78
>>CCDS9470.1 DCT gene_id:1638|Hs108|chr13 (519 aa)
initn: 3703 init1: 3703 opt: 3703 Z-score: 4057.4 bits: 760.4 E(32554): 0
Smith-Waterman score: 3703; 100.0% identity (100.0% similar) in 519 aa overlap (1-519:1-519)
10 20 30 40 50 60
pF1KE4 MSPLWWGFLLSCLGCKILPGAQGQFPRVCMTVDSLVNKECCPRLGAESANVCGSQQGRGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 MSPLWWGFLLSCLGCKILPGAQGQFPRVCMTVDSLVNKECCPRLGAESANVCGSQQGRGQ
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 CTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFGWTGPNCER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 CTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFGWTGPNCER
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 KKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGPNGTQPQFANCS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 KKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGPNGTQPQFANCS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 VYDFFVWLHYYSVRDTLLGPGRPYRAIDFSHQGPAFVTWHRYHLLCLERDLQRLIGNESF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 VYDFFVWLHYYSVRDTLLGPGRPYRAIDFSHQGPAFVTWHRYHLLCLERDLQRLIGNESF
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 ALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDDYNHLVTLCN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 ALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDDYNHLVTLCN
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 GTYEGLLRRNQMGRNSMKLPTLKDIRDCLSLQKFDNPPFFQNSTFSFRNALEGFDKADGT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 GTYEGLLRRNQMGRNSMKLPTLKDIRDCLSLQKFDNPPFFQNSTFSFRNALEGFDKADGT
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 LDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVVLHSFTDAIFDEWMKRFNPPADAWPQE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 LDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVVLHSFTDAIFDEWMKRFNPPADAWPQE
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 LAPIGHNRMYNMVPFFPPVTNEELFLTSDQLGYSYAIDLPVSVEETPGWPTTLLVVMGTL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 LAPIGHNRMYNMVPFFPPVTNEELFLTSDQLGYSYAIDLPVSVEETPGWPTTLLVVMGTL
430 440 450 460 470 480
490 500 510
pF1KE4 VALVGLFVLLAFLQYRRLRKGYTPLMETHLSSKRYTEEA
:::::::::::::::::::::::::::::::::::::::
CCDS94 VALVGLFVLLAFLQYRRLRKGYTPLMETHLSSKRYTEEA
490 500 510
>>CCDS45060.1 DCT gene_id:1638|Hs108|chr13 (552 aa)
initn: 2857 init1: 2857 opt: 2857 Z-score: 3130.6 bits: 589.0 E(32554): 4.7e-168
Smith-Waterman score: 3627; 94.0% identity (94.0% similar) in 552 aa overlap (1-519:1-552)
10 20 30 40 50 60
pF1KE4 MSPLWWGFLLSCLGCKILPGAQGQFPRVCMTVDSLVNKECCPRLGAESANVCGSQQGRGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MSPLWWGFLLSCLGCKILPGAQGQFPRVCMTVDSLVNKECCPRLGAESANVCGSQQGRGQ
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 CTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFGWTGPNCER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 CTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFGWTGPNCER
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 KKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGPNGTQPQFANCS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 KKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGPNGTQPQFANCS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 VYDFFVWLHYYSVRDTLLGPGRPYRAIDFSHQGPAFVTWHRYHLLCLERDLQRLIGNESF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 VYDFFVWLHYYSVRDTLLGPGRPYRAIDFSHQGPAFVTWHRYHLLCLERDLQRLIGNESF
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 ALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDDYNHLVTLCN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 ALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDDYNHLVTLCN
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 GTYEGLLRRNQMGRNSMKLPTLKDIRDCLSLQKFDNPPFFQNSTFSFRNALEGFDKADGT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 GTYEGLLRRNQMGRNSMKLPTLKDIRDCLSLQKFDNPPFFQNSTFSFRNALEGFDKADGT
310 320 330 340 350 360
370 380 390
pF1KE4 LDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVV--------------------------
::::::::::::::::::::::::::::::::::
CCDS45 LDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVVISNRLLYNATTNILEHVRKEKATKEL
370 380 390 400 410 420
400 410 420 430 440
pF1KE4 -------LHSFTDAIFDEWMKRFNPPADAWPQELAPIGHNRMYNMVPFFPPVTNEELFLT
:::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 PSLHVLVLHSFTDAIFDEWMKRFNPPADAWPQELAPIGHNRMYNMVPFFPPVTNEELFLT
430 440 450 460 470 480
450 460 470 480 490 500
pF1KE4 SDQLGYSYAIDLPVSVEETPGWPTTLLVVMGTLVALVGLFVLLAFLQYRRLRKGYTPLME
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 SDQLGYSYAIDLPVSVEETPGWPTTLLVVMGTLVALVGLFVLLAFLQYRRLRKGYTPLME
490 500 510 520 530 540
510
pF1KE4 THLSSKRYTEEA
::::::::::::
CCDS45 THLSSKRYTEEA
550
>>CCDS34990.1 TYRP1 gene_id:7306|Hs108|chr9 (537 aa)
initn: 1362 init1: 746 opt: 1763 Z-score: 1932.8 bits: 367.3 E(32554): 2.5e-101
Smith-Waterman score: 1763; 49.0% identity (73.8% similar) in 526 aa overlap (9-518:6-525)
10 20 30 40 50
pF1KE4 MSPLWWGFLLSCLGCKILP-----GAQGQFPRVCMTVDSLVNKECCPRLGAESA---NVC
::: ::: ..: :..:::: : ::..: . ::: :. :. . :
CCDS34 MSAPKLLS-LGCIFFPLLLFQQARAQFPRQCATVEALRSGMCCPDLSPVSGPGTDRC
10 20 30 40 50
60 70 80 90 100 110
pF1KE4 GSQQGRGQCTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFG
::..:::.: : ::.:: : : ..::::.:: .::.:::.:.:::.:.::: :. :
CCDS34 GSSSGRGRCEAVTADSRPHSPQYPHDGRDDREVWPLRFFNRTCHCNGNFSGHNCGTCRPG
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE4 WTGPNCERKKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGPNGT
: : :. .. ..:.:. .:: .:...:. :::.::. .:: .::.:.. .:::.:.
CCDS34 WRGAACD-QRVLIVRRNLLDLSKEEKNHFVRALDMAKRTTHPLFVIATRRSEEILGPDGN
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE4 QPQFANCSVYDFFVWLHYYSVRDTLLGPGRP-YRAIDFSHQGPAFVTWHRYHLLCLERDL
::: : :.:..::: :::::. :.:: :. . .::::.::::.:::::::: ::.:.
CCDS34 TPQFENISIYNYFVWTHYYSVKKTFLGVGQESFGEVDFSHEGPAFLTWHRYHLLRLEKDM
180 190 200 210 220 230
240 250 260 270 280 290
pF1KE4 QRLIGNESFALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDD
:... . ::.:::::::::.: ::.:::.:.:. : :::: :: ::.:..:::::.:
CCDS34 QEMLQEPSFSLPYWNFATGKNVCDICTDDLMGSRSNFDSTLISPNSVFSQWRVVCDSLED
240 250 260 270 280 290
300 310 320 330 340
pF1KE4 YNHLVTLCNGTYEGLLRRNQMGRNS----MKLPTLKDIRDCLSLQKFDNPPFFQNSTFSF
:. : ::::.: .: .::: : . ..:: .:. .:: . ::.:::..::: ::
CCDS34 YDTLGTLCNSTEDGPIRRNPAGNVARPMVQRLPEPQDVAQCLEVGLFDTPPFYSNSTNSF
300 310 320 330 340 350
350 360 370 380 390 400
pF1KE4 RNALEGFDKADGTLDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVVLHSFTDAIFDEWM
::..::.. : : : :::::.: :::::.. : . ::::::.::.::::.::::.
CCDS34 RNTVEGYSDPTGKYDPAVRSLHNLAHLFLNGTGGQTHLSPNDPIFVLLHTFTDAVFDEWL
360 370 380 390 400 410
410 420 430 440 450 460
pF1KE4 KRFNPPADAWPQELAPIGHNRMYNMVPFFPPVTNEELFLTS-DQLGYSYAIDLPVSVEET
.:.: ...: : :::::::.::::::.::::: :.:.:. :.:::.: :. : .
CCDS34 RRYNADISTFPLENAPIGHNRQYNMVPFWPPVTNTEMFVTAPDNLGYTYEIQWPSREFSV
420 430 440 450 460 470
470 480 490 500 510
pF1KE4 PGWPTTLLVVMGTLVALVGLFVLLAFL--QYRRLRKGYTPLMETHLSSKRYTEEA
: ..:.:.:. .. .: ..: : . .. ::. . . :.::
CCDS34 P--EIIAIAVVGALLLVALIFGTASYLIRARRSMDEANQPLLTDQYQC--YAEEYEKLQN
480 490 500 510 520 530
CCDS34 PNQSVV
>>CCDS8284.1 TYR gene_id:7299|Hs108|chr11 (529 aa)
initn: 1053 init1: 325 opt: 1389 Z-score: 1523.3 bits: 291.5 E(32554): 1.6e-78
Smith-Waterman score: 1391; 40.1% identity (68.9% similar) in 521 aa overlap (9-506:2-515)
10 20 30 40 50
pF1KE4 MSPLWWGFLLSCLGCKI--LPGAQGQFPRVCMTVDSLVNKECCPRLGAESANVCGSQQGR
::. : : . . . :.:::.:.. .:..::::: ... . ::. .::
CCDS82 MLLAVLYCLLWSFQTSAGHFPRACVSSKNLMEKECCPPWSGDRSP-CGQLSGR
10 20 30 40 50
60 70 80 90 100 110
pF1KE4 GQCTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFGWTGPNC
:.: .. .. : . . . . :::: :: :..:::.:.::: :.:::.::::. ::::
CCDS82 GSCQNILLSNAPLGPQFPFTGVDDRESWPSVFYNRTCQCSGNFMGFNCGNCKFGFWGPNC
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE4 ERKKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGP--NGTQPQF
... ..:.:: .:: :...:.. : :::. . :::: .: : ::. :.:
CCDS82 TERRL-LVRRNIFDLSAPEKDKFFAYLTLAKHTISSDYVIP----IGTYGQMKNGSTPMF
120 130 140 150 160
180 190 200 210 220 230
pF1KE4 ANCSVYDFFVWLHYYSVRDTLLGPGRPYRAIDFSHQGPAFVTWHRYHLLCLERDLQRLIG
. ..::.:::.::: :.::: .. .: :::.:..:::. ::: :: :...:.: :
CCDS82 NDINIYDLFVWMHYYVSMDALLGGSEIWRDIDFAHEAPAFLPWHRLFLLRWEQEIQKLTG
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE4 NESFALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDDYNHLV
.:.:..:::.. .. .::.:::. .:. .: .:.:.: : ::::. ::. :..::
CCDS82 DENFTIPYWDWRDAE-KCDICTDEYMGGQHPTNPNLLSPASFFSSWQIVCSRLEEYNSHQ
230 240 250 260 270 280
300 310 320 330 340 350
pF1KE4 TLCNGTYEGLLRRNQMGRN---SMKLPTLKDIRDCLSLQKFDNPPFFQNSTFSFRNALEG
.::::: :: :::: ... . .::. :.. :::: .... . . ..:::::.:::
CCDS82 SLCNGTPEGPLRRNPGNHDKSRTPRLPSSADVEFCLSLTQYESGSMDKAANFSFRNTLEG
290 300 310 320 330 340
360 370 380 390 400 410
pF1KE4 FDKA-DGTLDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVVLHSFTDAIFDEWMKRFNP
: . : :.. :.:: .: ..::: . ...::::::.. :.:.:.::..:..: :
CCDS82 FASPLTGIADASQSSMHNALHIYMNGTMSQVQGSANDPIFLLHHAFVDSIFEQWLRRHRP
350 360 370 380 390 400
420 430 440 450 460
pF1KE4 PADAWPQELAPIGHNRMYNMVPFFPPVTNEELFLTSDQLGYSYAI----------DLPVS
...:. ::::::: ::::.: : ..:..: .:::.:. : :
CCDS82 LQEVYPEANAPIGHNRESYMVPFIPLYRNGDFFISSKDLGYDYSYLQDSDPDSFQDYIKS
410 420 430 440 450 460
470 480 490 500 510
pF1KE4 VEETPG--WPTTLLVVM--GTLVALV-GLFVLLAFLQYRRLRKGYTPLMETHLSSKRYTE
: . : : ..: ..:.::. :: :: . ..: . ::.
CCDS82 YLEQASRIWSWLLGAAMVGAVLTALLAGLVSLLCRHKRKQLPEEKQPLLMEKEDYHSLYQ
470 480 490 500 510 520
pF1KE4 EA
CCDS82 SHL
519 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 00:45:22 2016 done: Sun Nov 6 00:45:22 2016
Total Scan time: 2.520 Total Display time: 0.040
Function used was FASTA [36.3.4 Apr, 2011]