FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4495, 519 aa 1>>>pF1KE4495 519 - 519 aa - 519 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3122+/-0.000847; mu= 18.6492+/- 0.051 mean_var=83.3882+/-16.305, 0's: 0 Z-trim(108.3): 24 B-trim: 11 in 1/50 Lambda= 0.140450 statistics sampled from 10098 (10112) to 10098 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.687), E-opt: 0.2 (0.311), width: 16 Scan time: 2.520 The best scores are: opt bits E(32554) CCDS9470.1 DCT gene_id:1638|Hs108|chr13 ( 519) 3703 760.4 0 CCDS45060.1 DCT gene_id:1638|Hs108|chr13 ( 552) 2857 589.0 4.7e-168 CCDS34990.1 TYRP1 gene_id:7306|Hs108|chr9 ( 537) 1763 367.3 2.5e-101 CCDS8284.1 TYR gene_id:7299|Hs108|chr11 ( 529) 1389 291.5 1.6e-78 >>CCDS9470.1 DCT gene_id:1638|Hs108|chr13 (519 aa) initn: 3703 init1: 3703 opt: 3703 Z-score: 4057.4 bits: 760.4 E(32554): 0 Smith-Waterman score: 3703; 100.0% identity (100.0% similar) in 519 aa overlap (1-519:1-519) 10 20 30 40 50 60 pF1KE4 MSPLWWGFLLSCLGCKILPGAQGQFPRVCMTVDSLVNKECCPRLGAESANVCGSQQGRGQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 MSPLWWGFLLSCLGCKILPGAQGQFPRVCMTVDSLVNKECCPRLGAESANVCGSQQGRGQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 CTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFGWTGPNCER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 CTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFGWTGPNCER 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 KKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGPNGTQPQFANCS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 KKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGPNGTQPQFANCS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 VYDFFVWLHYYSVRDTLLGPGRPYRAIDFSHQGPAFVTWHRYHLLCLERDLQRLIGNESF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 VYDFFVWLHYYSVRDTLLGPGRPYRAIDFSHQGPAFVTWHRYHLLCLERDLQRLIGNESF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 ALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDDYNHLVTLCN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 ALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDDYNHLVTLCN 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 GTYEGLLRRNQMGRNSMKLPTLKDIRDCLSLQKFDNPPFFQNSTFSFRNALEGFDKADGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 GTYEGLLRRNQMGRNSMKLPTLKDIRDCLSLQKFDNPPFFQNSTFSFRNALEGFDKADGT 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 LDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVVLHSFTDAIFDEWMKRFNPPADAWPQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 LDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVVLHSFTDAIFDEWMKRFNPPADAWPQE 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 LAPIGHNRMYNMVPFFPPVTNEELFLTSDQLGYSYAIDLPVSVEETPGWPTTLLVVMGTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 LAPIGHNRMYNMVPFFPPVTNEELFLTSDQLGYSYAIDLPVSVEETPGWPTTLLVVMGTL 430 440 450 460 470 480 490 500 510 pF1KE4 VALVGLFVLLAFLQYRRLRKGYTPLMETHLSSKRYTEEA ::::::::::::::::::::::::::::::::::::::: CCDS94 VALVGLFVLLAFLQYRRLRKGYTPLMETHLSSKRYTEEA 490 500 510 >>CCDS45060.1 DCT gene_id:1638|Hs108|chr13 (552 aa) initn: 2857 init1: 2857 opt: 2857 Z-score: 3130.6 bits: 589.0 E(32554): 4.7e-168 Smith-Waterman score: 3627; 94.0% identity (94.0% similar) in 552 aa overlap (1-519:1-552) 10 20 30 40 50 60 pF1KE4 MSPLWWGFLLSCLGCKILPGAQGQFPRVCMTVDSLVNKECCPRLGAESANVCGSQQGRGQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MSPLWWGFLLSCLGCKILPGAQGQFPRVCMTVDSLVNKECCPRLGAESANVCGSQQGRGQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 CTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFGWTGPNCER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 CTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFGWTGPNCER 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 KKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGPNGTQPQFANCS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 KKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGPNGTQPQFANCS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 VYDFFVWLHYYSVRDTLLGPGRPYRAIDFSHQGPAFVTWHRYHLLCLERDLQRLIGNESF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 VYDFFVWLHYYSVRDTLLGPGRPYRAIDFSHQGPAFVTWHRYHLLCLERDLQRLIGNESF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 ALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDDYNHLVTLCN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 ALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDDYNHLVTLCN 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 GTYEGLLRRNQMGRNSMKLPTLKDIRDCLSLQKFDNPPFFQNSTFSFRNALEGFDKADGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GTYEGLLRRNQMGRNSMKLPTLKDIRDCLSLQKFDNPPFFQNSTFSFRNALEGFDKADGT 310 320 330 340 350 360 370 380 390 pF1KE4 LDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVV-------------------------- :::::::::::::::::::::::::::::::::: CCDS45 LDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVVISNRLLYNATTNILEHVRKEKATKEL 370 380 390 400 410 420 400 410 420 430 440 pF1KE4 -------LHSFTDAIFDEWMKRFNPPADAWPQELAPIGHNRMYNMVPFFPPVTNEELFLT ::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PSLHVLVLHSFTDAIFDEWMKRFNPPADAWPQELAPIGHNRMYNMVPFFPPVTNEELFLT 430 440 450 460 470 480 450 460 470 480 490 500 pF1KE4 SDQLGYSYAIDLPVSVEETPGWPTTLLVVMGTLVALVGLFVLLAFLQYRRLRKGYTPLME :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 SDQLGYSYAIDLPVSVEETPGWPTTLLVVMGTLVALVGLFVLLAFLQYRRLRKGYTPLME 490 500 510 520 530 540 510 pF1KE4 THLSSKRYTEEA :::::::::::: CCDS45 THLSSKRYTEEA 550 >>CCDS34990.1 TYRP1 gene_id:7306|Hs108|chr9 (537 aa) initn: 1362 init1: 746 opt: 1763 Z-score: 1932.8 bits: 367.3 E(32554): 2.5e-101 Smith-Waterman score: 1763; 49.0% identity (73.8% similar) in 526 aa overlap (9-518:6-525) 10 20 30 40 50 pF1KE4 MSPLWWGFLLSCLGCKILP-----GAQGQFPRVCMTVDSLVNKECCPRLGAESA---NVC ::: ::: ..: :..:::: : ::..: . ::: :. :. . : CCDS34 MSAPKLLS-LGCIFFPLLLFQQARAQFPRQCATVEALRSGMCCPDLSPVSGPGTDRC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 GSQQGRGQCTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFG ::..:::.: : ::.:: : : ..::::.:: .::.:::.:.:::.:.::: :. : CCDS34 GSSSGRGRCEAVTADSRPHSPQYPHDGRDDREVWPLRFFNRTCHCNGNFSGHNCGTCRPG 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 WTGPNCERKKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGPNGT : : :. .. ..:.:. .:: .:...:. :::.::. .:: .::.:.. .:::.:. CCDS34 WRGAACD-QRVLIVRRNLLDLSKEEKNHFVRALDMAKRTTHPLFVIATRRSEEILGPDGN 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE4 QPQFANCSVYDFFVWLHYYSVRDTLLGPGRP-YRAIDFSHQGPAFVTWHRYHLLCLERDL ::: : :.:..::: :::::. :.:: :. . .::::.::::.:::::::: ::.:. CCDS34 TPQFENISIYNYFVWTHYYSVKKTFLGVGQESFGEVDFSHEGPAFLTWHRYHLLRLEKDM 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE4 QRLIGNESFALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDD :... . ::.:::::::::.: ::.:::.:.:. : :::: :: ::.:..:::::.: CCDS34 QEMLQEPSFSLPYWNFATGKNVCDICTDDLMGSRSNFDSTLISPNSVFSQWRVVCDSLED 240 250 260 270 280 290 300 310 320 330 340 pF1KE4 YNHLVTLCNGTYEGLLRRNQMGRNS----MKLPTLKDIRDCLSLQKFDNPPFFQNSTFSF :. : ::::.: .: .::: : . ..:: .:. .:: . ::.:::..::: :: CCDS34 YDTLGTLCNSTEDGPIRRNPAGNVARPMVQRLPEPQDVAQCLEVGLFDTPPFYSNSTNSF 300 310 320 330 340 350 350 360 370 380 390 400 pF1KE4 RNALEGFDKADGTLDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVVLHSFTDAIFDEWM ::..::.. : : : :::::.: :::::.. : . ::::::.::.::::.::::. CCDS34 RNTVEGYSDPTGKYDPAVRSLHNLAHLFLNGTGGQTHLSPNDPIFVLLHTFTDAVFDEWL 360 370 380 390 400 410 410 420 430 440 450 460 pF1KE4 KRFNPPADAWPQELAPIGHNRMYNMVPFFPPVTNEELFLTS-DQLGYSYAIDLPVSVEET .:.: ...: : :::::::.::::::.::::: :.:.:. :.:::.: :. : . CCDS34 RRYNADISTFPLENAPIGHNRQYNMVPFWPPVTNTEMFVTAPDNLGYTYEIQWPSREFSV 420 430 440 450 460 470 470 480 490 500 510 pF1KE4 PGWPTTLLVVMGTLVALVGLFVLLAFL--QYRRLRKGYTPLMETHLSSKRYTEEA : ..:.:.:. .. .: ..: : . .. ::. . . :.:: CCDS34 P--EIIAIAVVGALLLVALIFGTASYLIRARRSMDEANQPLLTDQYQC--YAEEYEKLQN 480 490 500 510 520 530 CCDS34 PNQSVV >>CCDS8284.1 TYR gene_id:7299|Hs108|chr11 (529 aa) initn: 1053 init1: 325 opt: 1389 Z-score: 1523.3 bits: 291.5 E(32554): 1.6e-78 Smith-Waterman score: 1391; 40.1% identity (68.9% similar) in 521 aa overlap (9-506:2-515) 10 20 30 40 50 pF1KE4 MSPLWWGFLLSCLGCKI--LPGAQGQFPRVCMTVDSLVNKECCPRLGAESANVCGSQQGR ::. : : . . . :.:::.:.. .:..::::: ... . ::. .:: CCDS82 MLLAVLYCLLWSFQTSAGHFPRACVSSKNLMEKECCPPWSGDRSP-CGQLSGR 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 GQCTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFAGYNCGDCKFGWTGPNC :.: .. .. : . . . . :::: :: :..:::.:.::: :.:::.::::. :::: CCDS82 GSCQNILLSNAPLGPQFPFTGVDDRESWPSVFYNRTCQCSGNFMGFNCGNCKFGFWGPNC 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 ERKKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITTQHWLGLLGP--NGTQPQF ... ..:.:: .:: :...:.. : :::. . :::: .: : ::. :.: CCDS82 TERRL-LVRRNIFDLSAPEKDKFFAYLTLAKHTISSDYVIP----IGTYGQMKNGSTPMF 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 ANCSVYDFFVWLHYYSVRDTLLGPGRPYRAIDFSHQGPAFVTWHRYHLLCLERDLQRLIG . ..::.:::.::: :.::: .. .: :::.:..:::. ::: :: :...:.: : CCDS82 NDINIYDLFVWMHYYVSMDALLGGSEIWRDIDFAHEAPAFLPWHRLFLLRWEQEIQKLTG 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE4 NESFALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRNSRFSSWETVCDSLDDYNHLV .:.:..:::.. .. .::.:::. .:. .: .:.:.: : ::::. ::. :..:: CCDS82 DENFTIPYWDWRDAE-KCDICTDEYMGGQHPTNPNLLSPASFFSSWQIVCSRLEEYNSHQ 230 240 250 260 270 280 300 310 320 330 340 350 pF1KE4 TLCNGTYEGLLRRNQMGRN---SMKLPTLKDIRDCLSLQKFDNPPFFQNSTFSFRNALEG .::::: :: :::: ... . .::. :.. :::: .... . . ..:::::.::: CCDS82 SLCNGTPEGPLRRNPGNHDKSRTPRLPSSADVEFCLSLTQYESGSMDKAANFSFRNTLEG 290 300 310 320 330 340 360 370 380 390 400 410 pF1KE4 FDKA-DGTLDSQVMSLHNLVHSFLNGTNALPHSAANDPIFVVLHSFTDAIFDEWMKRFNP : . : :.. :.:: .: ..::: . ...::::::.. :.:.:.::..:..: : CCDS82 FASPLTGIADASQSSMHNALHIYMNGTMSQVQGSANDPIFLLHHAFVDSIFEQWLRRHRP 350 360 370 380 390 400 420 430 440 450 460 pF1KE4 PADAWPQELAPIGHNRMYNMVPFFPPVTNEELFLTSDQLGYSYAI----------DLPVS ...:. ::::::: ::::.: : ..:..: .:::.:. : : CCDS82 LQEVYPEANAPIGHNRESYMVPFIPLYRNGDFFISSKDLGYDYSYLQDSDPDSFQDYIKS 410 420 430 440 450 460 470 480 490 500 510 pF1KE4 VEETPG--WPTTLLVVM--GTLVALV-GLFVLLAFLQYRRLRKGYTPLMETHLSSKRYTE : . : : ..: ..:.::. :: :: . ..: . ::. CCDS82 YLEQASRIWSWLLGAAMVGAVLTALLAGLVSLLCRHKRKQLPEEKQPLLMEKEDYHSLYQ 470 480 490 500 510 520 pF1KE4 EA CCDS82 SHL 519 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:45:22 2016 done: Sun Nov 6 00:45:22 2016 Total Scan time: 2.520 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]