FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5822, 543 aa
1>>>pF1KB5822 543 - 543 aa - 543 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.6923+/-0.000905; mu= 1.2715+/- 0.055
mean_var=272.9383+/-54.850, 0's: 0 Z-trim(116.2): 27 B-trim: 2 in 1/52
Lambda= 0.077632
statistics sampled from 16751 (16775) to 16751 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.805), E-opt: 0.2 (0.515), width: 16
Scan time: 4.310
The best scores are:                              opt  bits E(32554)
CCDS64425.1 TFEB gene_id:7942|Hs108|chr6  ( 490) 3244 376.3 4.7e-104
CCDS4858.1 TFEB gene_id:7942|Hs108|chr6   ( 476) 3198 371.1 1.6e-102
CCDS64424.1 TFEB gene_id:7942|Hs108|chr6  ( 391) 2138 252.3 7.7e-67
CCDS14315.3 TFE3 gene_id:7030|Hs108|chrX  ( 575) 1129 139.4 1.1e-32
CCDS54607.1 MITF gene_id:4286|Hs108|chr3  ( 468)  770  99.2 1.2e-20
CCDS46865.1 MITF gene_id:4286|Hs108|chr3  ( 504)  770  99.2 1.2e-20
CCDS43106.1 MITF gene_id:4286|Hs108|chr3  ( 520)  770  99.2 1.3e-20
CCDS46866.2 MITF gene_id:4286|Hs108|chr3  ( 357)  755  97.4 3e-20
CCDS43107.1 MITF gene_id:4286|Hs108|chr3  ( 413)  755  97.4 3.4e-20
CCDS5762.1 TFEC gene_id:22797|Hs108|chr7  ( 347)  728  94.4 2.4e-19
CCDS2913.1 MITF gene_id:4286|Hs108|chr3   ( 419)  706  92.0 1.5e-18
CCDS59076.1 TFEC gene_id:22797|Hs108|chr7 ( 280)  669  87.7 2e-17
CCDS34738.1 TFEC gene_id:22797|Hs108|chr7 ( 318)  669  87.7 2.2e-17
>>CCDS64425.1 TFEB gene_id:7942|Hs108|chr6 (490 aa)
initn: 3244 init1: 3244 opt: 3244 Z-score: 1981.6 bits: 376.3 E(32554): 4.7e-104
Smith-Waterman score: 3244; 100.0% identity (100.0% similar) in 483 aa overlap (61-543:8-490)
40 50 60 70 80 90
pF1KB5 VPVILASPCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQRE
::::::::::::::::::::::::::::::
CCDS64 MTASSGWEPAPAATMASRIGLRMQLMREQAQQEEQRE
10 20 30
100 110 120 130 140 150
pF1KB5 RMQQQAVMHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 RMQQQAVMHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYH
40 50 60 70 80 90
160 170 180 190 200 210
pF1KB5 LQQSQHQKVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 LQQSQHQKVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSP
100 110 120 130 140 150
220 230 240 250 260 270
pF1KB5 MAMLHIGSNPERELDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 MAMLHIGSNPERELDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTAS
160 170 180 190 200 210
280 290 300 310 320 330
pF1KB5 LVGVTSSSCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 LVGVTSSSCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIP
220 230 240 250 260 270
340 350 360 370 380 390
pF1KB5 KANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 KANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQA
280 290 300 310 320 330
400 410 420 430 440 450
pF1KB5 RVHGLPTTSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 RVHGLPTTSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPL
340 350 360 370 380 390
460 470 480 490 500 510
pF1KB5 PTQPPSPFHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 PTQPPSPFHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLL
400 410 420 430 440 450
520 530 540
pF1KB5 PLASDPLLSTMSPEASKASSRRSSFSMEEGDVL
:::::::::::::::::::::::::::::::::
CCDS64 PLASDPLLSTMSPEASKASSRRSSFSMEEGDVL
460 470 480 490
>>CCDS4858.1 TFEB gene_id:7942|Hs108|chr6 (476 aa)
initn: 3198 init1: 3198 opt: 3198 Z-score: 1954.0 bits: 371.1 E(32554): 1.6e-102
Smith-Waterman score: 3198; 100.0% identity (100.0% similar) in 476 aa overlap (68-543:1-476)
40 50 60 70 80 90
pF1KB5 PCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQRERMQQQAV
::::::::::::::::::::::::::::::
CCDS48 MASRIGLRMQLMREQAQQEEQRERMQQQAV
10 20 30
100 110 120 130 140 150
pF1KB5 MHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 MHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQ
40 50 60 70 80 90
160 170 180 190 200 210
pF1KB5 KVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 KVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIG
100 110 120 130 140 150
220 230 240 250 260 270
pF1KB5 SNPERELDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 SNPERELDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSS
160 170 180 190 200 210
280 290 300 310 320 330
pF1KB5 SCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 SCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDV
220 230 240 250 260 270
340 350 360 370 380 390
pF1KB5 RWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 RWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPT
280 290 300 310 320 330
400 410 420 430 440 450
pF1KB5 TSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 TSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSP
340 350 360 370 380 390
460 470 480 490 500 510
pF1KB5 FHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLASDPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 FHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLASDPL
400 410 420 430 440 450
520 530 540
pF1KB5 LSTMSPEASKASSRRSSFSMEEGDVL
::::::::::::::::::::::::::
CCDS48 LSTMSPEASKASSRRSSFSMEEGDVL
460 470
>>CCDS64424.1 TFEB gene_id:7942|Hs108|chr6 (391 aa)
initn: 2226 init1: 2135 opt: 2138 Z-score: 1313.5 bits: 252.3 E(32554): 7.7e-67
Smith-Waterman score: 2445; 82.1% identity (82.1% similar) in 476 aa overlap (68-543:1-391)
40 50 60 70 80 90
pF1KB5 PCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQRERMQQQAV
::::::::::::::::::::::::::::::
CCDS64 MASRIGLRMQLMREQAQQEEQRERMQQQAV
10 20 30
100 110 120 130 140 150
pF1KB5 MHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQ
:::::::::::::::::::::::::::::::::::::::::
CCDS64 MHYMQQQQQQQQQQLGGPPTPAINTPVHFQSPPPVPGEVLK-------------------
40 50 60 70
160 170 180 190 200 210
pF1KB5 KVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIG
CCDS64 ------------------------------------------------------------
220 230 240 250 260 270
pF1KB5 SNPERELDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 ------LDDVIDNIMRLDDVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSS
80 90 100 110 120
280 290 300 310 320 330
pF1KB5 SCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 SCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDV
130 140 150 160 170 180
340 350 360 370 380 390
pF1KB5 RWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 RWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPT
190 200 210 220 230 240
400 410 420 430 440 450
pF1KB5 TSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 TSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSP
250 260 270 280 290 300
460 470 480 490 500 510
pF1KB5 FHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLASDPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 FHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLASDPL
310 320 330 340 350 360
520 530 540
pF1KB5 LSTMSPEASKASSRRSSFSMEEGDVL
::::::::::::::::::::::::::
CCDS64 LSTMSPEASKASSRRSSFSMEEGDVL
370 380 390
>>CCDS14315.3 TFE3 gene_id:7030|Hs108|chrX (575 aa)
initn: 1059 init1: 553 opt: 1129 Z-score: 700.5 bits: 139.4 E(32554): 1.1e-32
Smith-Waterman score: 1223; 46.8% identity (69.1% similar) in 511 aa overlap (60-539:102-573)
30 40 50 60 70 80
pF1KB5 AVPVILASPCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQR
: :: ... .::. ::.:::: :::..:.:
CCDS14 LPLRSSLPISLQATPATPATLSASSSAGGSRTPAMSSSSSSRVLLRQQLMRAQAQEQERR
80 90 100 110 120 130
90 100 110 120 130 140
pF1KB5 ERMQQQAVMHYMQQQQQQQQQQLGGPPTPAINT-----PVHFQS-PPP--VPGEVLKVQS
:: .: :. . . .: .:::.. : : ::: :: ::::::.
CCDS14 ERREQAAAAPFPSP----------APASPAISVVGVSAGGHTLSRPPPAQVPREVLKVQT
140 150 160 170 180
150 160 170 180 190 200
pF1KB5 YLENPTSYHLQQSQHQKVREYLSETYGNKFAAH-ISPAQGSPKPPPAASPGVRAGHVLSS
.::::: :::::...:.:..::: : : :.:.. ..: : . : .: .:.: ..
CCDS14 HLENPTRYHLQQARRQQVKQYLSTTLGPKLASQALTPPPGPASAQPLPAP--EAAH--TT
190 200 210 220 230
210 220 230 240 250
pF1KB5 SAGNSAPNSPMAMLHIGSNPERELDDVIDNIMRL-----DDVLGYI---NPEMQMPNTLP
. .::::::::.: :::. :.:.:::::.:. : :..:.:. . .:.:.:::
CCDS14 GPTGSAPNSPMALLTIGSSSEKEIDDVIDEIISLESSYNDEMLSYLPGGTTGLQLPSTLP
240 250 260 270 280 290
260 270 280 290 300 310
pF1KB5 LSSSHLNVYSSDPQVTASLVGVTSSSCPADLTQ-KRELTDAESRALAKERQKKDNHNLIE
.:.. :.:::: : .:. . ..:.::::.: . :::....:..:: :::::::::::::
CCDS14 VSGNLLDVYSS--QGVATPAITVSNSCPAELPNIKREISETEAKALLKERQKKDNHNLIE
300 310 320 330 340 350
320 330 340 350 360 370
pF1KB5 RRRRFNINDRIKELGMLIPKANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRR
::::::::::::::: ::::..: ..::::::::::::::::..::. :.:..::...:
CCDS14 RRRRFNINDRIKELGTLIPKSSDPEMRWNKGTILKASVDYIRKLQKEQQRSKDLESRQRS
360 370 380 390 400 410
380 390 400 410 420
pF1KB5 LEMTNKQLWLRIQELEMQARVHGLPTT-SPSGMNMAEL-AQQVVKQELPS--EEG-PGEA
::..:..: :::::::.::..::::. .:. ...: :.. .: : . ::: :: :
CCDS14 LEQANRSLQLRIQELELQAQIHGLPVPPTPGLLSLATTSASDSLKPEQLDIEEEGRPGAA
420 430 440 450 460 470
430 440 450 460 470
pF1KB5 LML---GAEVPDPEPLPALPPQAPLPLPTQPPSPFHHLDFSHSLSFG-----GREDEGPP
. : :. : ::. : : . :: : :.. . .: .:.::
CCDS14 TFHVGGGPAQNAPHQQPPAPPSDAL-LDLHFPSD-HLGDLGDPFHLGLEDILMEEEEGVV
480 490 500 510 520 530
480 490 500 510 520 530
pF1KB5 GYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLASDPLLSTMSPEASKASSRRSSFSME
: :. : :: : :::::::..:: .:::::::::::::
CCDS14 G---GLSGGALSP------------------LRAASDPLLSSVSPAVSKASSRRSSFSME
540 550 560 570
540
pF1KB5 EGDVL
:
CCDS14 EES
>>CCDS54607.1 MITF gene_id:4286|Hs108|chr3 (468 aa)
initn: 1101 init1: 612 opt: 770 Z-score: 484.4 bits: 99.2 E(32554): 1.2e-20
Smith-Waterman score: 1209; 45.8% identity (67.0% similar) in 509 aa overlap (68-539:1-463)
40 50 60 70 80 90
pF1KB5 PCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQRERMQQQAV
:.::: ::.:::::: :..:.::..:. .
CCDS54 MTSRILLRQQLMREQMQEQERREQQQKLQA
10 20 30
100 110 120 130 140 150
pF1KB5 MHYMQQQQQQQQQQLGGPPTPAINT--PVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQ
..:::. .: :::::. :. . : :: ::::::..:::::.::.::.:
CCDS54 AQFMQQRVPVSQ-------TPAINVSVPTTLPSATQVPMEVLKVQTHLENPTKYHIQQAQ
40 50 60 70 80
160 170 180 190 200 210
pF1KB5 HQKVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLH
.:.:..::: : .:: : . . . : : .:: ::. :.::::::::::
CCDS54 RQQVKQYLSTTLANK---HANQVLSLPCP---NQPG---DHVMPPVPGSSAPNSPMAMLT
90 100 110 120 130
220 230 240
pF1KB5 IGSNPERE----------------------------LDDVIDNIMRLD-----DVLGYIN
..:: :.: .:::::.:. :. ..:: ..
CCDS54 LNSNCEKEGFYKFEEQNRAESECPGMNTHSRASCMQMDDVIDDIISLESSYNEEILGLMD
140 150 160 170 180 190
250 260 270 280 290 300
pF1KB5 PEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSSSCPADLTQ-KRELTDAESRALAKER
: .:: ::::.:.. ...:... .: . :.::::.: . :::::..:.:::::::
CCDS54 PALQMANTLPVSGNLIDLYGNQGLPPPGL--TISNSCPANLPNIKRELTESEARALAKER
200 210 220 230 240 250
310 320 330 340 350 360
pF1KB5 QKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDVRWNKGTILKASVDYIRRMQKDLQK
::::::::::::::::::::::::: ::::.:: :.::::::::::::::::..:.. :.
CCDS54 QKKDNHNLIERRRRFNINDRIKELGTLIPKSNDPDMRWNKGTILKASVDYIRKLQREQQR
260 270 280 290 300 310
370 380 390 400 410 420
pF1KB5 SRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPTTSPSGMNMAELAQQVVKQELPSEE
..::::....:: .:..: :::::::::::.::: .:. .:.....::: : :
CCDS54 AKELENRQKKLEHANRHLLLRIQELEMQARAHGLSLIPSTGLCSPDLVNRIIKQE-PVLE
320 330 340 350 360 370
430 440 450 460 470 480
pF1KB5 GPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSPFHHLDFSHSLSFGGREDEGPPGYP
. .. : : .: : : . :...:. : :. .:
CCDS54 NCSQDL--------------LQHHADLTCTTTLDLTDGTITFNNNLGTG---TEANQAYS
380 390 400 410
490 500 510 520 530 540
pF1KB5 EPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLA-SDPLLSTMSPEASKASSRRSSFSMEEG
: : :: :. .:.::.: :.. .:::::..:: :::.::::::.::::
CCDS54 VPTKMG--------SK--LEDILMDDTLSPVGVTDPLLSSVSPGASKTSSRRSSMSMEET
420 430 440 450 460
pF1KB5 DVL
CCDS54 EHTC
>>CCDS46865.1 MITF gene_id:4286|Hs108|chr3 (504 aa)
initn: 1101 init1: 612 opt: 770 Z-score: 484.0 bits: 99.2 E(32554): 1.2e-20
Smith-Waterman score: 1225; 44.0% identity (65.2% similar) in 552 aa overlap (28-539:1-499)
10 20 30 40 50
pF1KB5 MSQLSPACSVTLGKSLPLSGLGVFSSKMDAVPVILASPCQPLCFEEDTCLIYLLPLL---
:.:. : . ::. :: .::
CCDS46 MEALRVQMFMPCS---FES----LYLSSAEHPG
10 20
60 70 80 90 100 110
pF1KB5 IHREPAPAATMASRIGLRMQLMREQAQQEEQRERMQQQAVMHYMQQQQQQQQQQLGGPPT
. : ...:.::: ::.:::::: :..:.::..:. . ..:::. .: :
CCDS46 ASKPPISSSSMTSRILLRQQLMREQMQEQERREQQQKLQAAQFMQQRVPVSQ-------T
30 40 50 60 70
120 130 140 150 160 170
pF1KB5 PAINT--PVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQKVREYLSETYGNKFAAHI
::::. :. . : :: ::::::..:::::.::.::.:.:.:..::: : .:: :
CCDS46 PAINVSVPTTLPSATQVPMEVLKVQTHLENPTKYHIQQAQRQQVKQYLSTTLANK---HA
80 90 100 110 120 130
180 190 200 210 220
pF1KB5 SPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIGSNPERE------------
. . . : : .:: ::. :.:::::::::: ..:: :.:
CCDS46 NQVLSLPCP---NQPG---DHVMPPVPGSSAPNSPMAMLTLNSNCEKEGFYKFEEQNRAE
140 150 160 170 180 190
230 240 250 260
pF1KB5 ----------------LDDVIDNIMRLD-----DVLGYINPEMQMPNTLPLSSSHLNVYS
.:::::.:. :. ..:: ..: .:: ::::.:.. ...:.
CCDS46 SECPGMNTHSRASCMQMDDVIDDIISLESSYNEEILGLMDPALQMANTLPVSGNLIDLYG
200 210 220 230 240 250
270 280 290 300 310 320
pF1KB5 SDPQVTASLVGVTSSSCPADLTQ-KRELTDAESRALAKERQKKDNHNLIERRRRFNINDR
.. .: . :.::::.: . :::::..:.:::::::::::::::::::::::::::
CCDS46 NQGLPPPGL--TISNSCPANLPNIKRELTESEARALAKERQKKDNHNLIERRRRFNINDR
260 270 280 290 300
330 340 350 360 370 380
pF1KB5 IKELGMLIPKANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWL
::::: ::::.:: :.::::::::::::::::..:.. :...::::....:: .:..: :
CCDS46 IKELGTLIPKSNDPDMRWNKGTILKASVDYIRKLQREQQRAKELENRQKKLEHANRHLLL
310 320 330 340 350 360
390 400 410 420 430 440
pF1KB5 RIQELEMQARVHGLPTTSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPA
::::::::::.::: .:. .:.....::: : :. .. :
CCDS46 RIQELEMQARAHGLSLIPSTGLCSPDLVNRIIKQE-PVLENCSQDL--------------
370 380 390 400 410
450 460 470 480 490 500
pF1KB5 LPPQAPLPLPTQPPSPFHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLD
: .: : : . :...:. : :. .: : : :: :.
CCDS46 LQHHADLTCTTTLDLTDGTITFNNNLGTG---TEANQAYSVPTKMG--------SK--LE
420 430 440 450 460
510 520 530 540
pF1KB5 LMLLDDSLLPLA-SDPLLSTMSPEASKASSRRSSFSMEEGDVL
.:.::.: :.. .:::::..:: :::.::::::.::::
CCDS46 DILMDDTLSPVGVTDPLLSSVSPGASKTSSRRSSMSMEETEHTC
470 480 490 500
>>CCDS43106.1 MITF gene_id:4286|Hs108|chr3 (520 aa)
initn: 1101 init1: 612 opt: 770 Z-score: 483.8 bits: 99.2 E(32554): 1.3e-20
Smith-Waterman score: 1223; 45.3% identity (66.9% similar) in 517 aa overlap (60-539:45-515)
30 40 50 60 70 80
pF1KB5 AVPVILASPCQPLCFEEDTCLIYLLPLLIHREPAPAATMASRIGLRMQLMREQAQQEEQR
. : ...:.::: ::.:::::: :..:.:
CCDS43 EEFHEEPKTYYELKSQPLKSSSSAEHPGASKPPISSSSMTSRILLRQQLMREQMQEQERR
20 30 40 50 60 70
90 100 110 120 130 140
pF1KB5 ERMQQQAVMHYMQQQQQQQQQQLGGPPTPAINT--PVHFQSPPPVPGEVLKVQSYLENPT
:..:. . ..:::. .: :::::. :. . : :: ::::::..:::::
CCDS43 EQQQKLQAAQFMQQRVPVSQ-------TPAINVSVPTTLPSATQVPMEVLKVQTHLENPT
80 90 100 110 120
150 160 170 180 190 200
pF1KB5 SYHLQQSQHQKVREYLSETYGNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAP
.::.::.:.:.:..::: : .:: : . . . : : .:: ::. :.:::
CCDS43 KYHIQQAQRQQVKQYLSTTLANK---HANQVLSLPCP---NQPG---DHVMPPVPGSSAP
130 140 150 160 170
210 220 230
pF1KB5 NSPMAMLHIGSNPERE----------------------------LDDVIDNIMRLD----
::::::: ..:: :.: .:::::.:. :.
CCDS43 NSPMAMLTLNSNCEKEGFYKFEEQNRAESECPGMNTHSRASCMQMDDVIDDIISLESSYN
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB5 -DVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSSSCPADLTQ-KRELTDAE
..:: ..: .:: ::::.:.. ...:... .:. :.::::.: . :::::..:
CCDS43 EEILGLMDPALQMANTLPVSGNLIDLYGNQGLPPPGLT--ISNSCPANLPNIKRELTESE
240 250 260 270 280 290
300 310 320 330 340 350
pF1KB5 SRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDVRWNKGTILKASVDYIR
.:::::::::::::::::::::::::::::::: ::::.:: :.::::::::::::::::
CCDS43 ARALAKERQKKDNHNLIERRRRFNINDRIKELGTLIPKSNDPDMRWNKGTILKASVDYIR
300 310 320 330 340 350
360 370 380 390 400 410
pF1KB5 RMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPTTSPSGMNMAELAQQVV
..:.. :...::::....:: .:..: :::::::::::.::: .:. .:.....
CCDS43 KLQREQQRAKELENRQKKLEHANRHLLLRIQELEMQARAHGLSLIPSTGLCSPDLVNRII
360 370 380 390 400 410
420 430 440 450 460 470
pF1KB5 KQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSPFHHLDFSHSLSFGGRE
::: : :. .. : : .: : : . :...:. :
CCDS43 KQE-PVLENCSQDL--------------LQHHADLTCTTTLDLTDGTITFNNNLGTG---
420 430 440 450
480 490 500 510 520 530
pF1KB5 DEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLA-SDPLLSTMSPEASKASSRR
:. .: : : :: :. .:.::.: :.. .:::::..:: :::.::::
CCDS43 TEANQAYSVPTKMG--------SK--LEDILMDDTLSPVGVTDPLLSSVSPGASKTSSRR
460 470 480 490 500
540
pF1KB5 SSFSMEEGDVL
::.::::
CCDS43 SSMSMEETEHTC
510 520
>>CCDS46866.2 MITF gene_id:4286|Hs108|chr3 (357 aa)
initn: 925 init1: 612 opt: 755 Z-score: 476.9 bits: 97.4 E(32554): 3e-20
Smith-Waterman score: 877; 43.0% identity (64.5% similar) in 409 aa overlap (138-539:11-352)
110 120 130 140 150 160
pF1KB5 QQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQKVREYLSETY
.::..:::::.::.::.:.:. . :
CCDS46 MLEMLEYNHYQVQTHLENPTKYHIQQAQRQQGFYKFEEQ-
10 20 30
170 180 190 200 210 220
pF1KB5 GNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIGSNPERELDDV
:. . ::. . : .: ..:::
CCDS46 -NR--------------AESECPGMNT-HSRASCM--------------------QMDDV
40 50 60
230 240 250 260 270 280
pF1KB5 IDNIMRLD-----DVLGYINPEMQMPNTLPLSSSHLNVYSSDPQVTASLVGVTSSSCPAD
::.:. :. ..:: ..: .:: ::::.:.. ...:... .:. :.::::.
CCDS46 IDDIISLESSYNEEILGLMDPALQMANTLPVSGNLIDLYGNQGLPPPGLT--ISNSCPAN
70 80 90 100 110 120
290 300 310 320 330 340
pF1KB5 LTQ-KRELTDAESRALAKERQKKDNHNLIERRRRFNINDRIKELGMLIPKANDLDVRWNK
: . :::::..:.:::::::::::::::::::::::::::::::: ::::.:: :.::::
CCDS46 LPNIKRELTESEARALAKERQKKDNHNLIERRRRFNINDRIKELGTLIPKSNDPDMRWNK
130 140 150 160 170 180
350 360 370 380 390 400
pF1KB5 GTILKASVDYIRRMQKDLQKSRELENHSRRLEMTNKQLWLRIQELEMQARVHGLPTTSPS
::::::::::::..:.. :...::::....:: .:..: :::::::::::.::: .
CCDS46 GTILKASVDYIRKLQREQQRAKELENRQKKLEHANRHLLLRIQELEMQARAHGLSLIPST
190 200 210 220 230 240
410 420 430 440 450 460
pF1KB5 GMNMAELAQQVVKQELPSEEGPGEALMLGAEVPDPEPLPALPPQAPLPLPTQPPSPFHHL
:. .:.....::: : :. .. : : .: : : .
CCDS46 GLCSPDLVNRIIKQE-PVLENCSQDL--------------LQHHADLTCTTTLDLTDGTI
250 260 270 280
470 480 490 500 510 520
pF1KB5 DFSHSLSFGGREDEGPPGYPEPLAPGHGSPFPSLSKKDLDLMLLDDSLLPLA-SDPLLST
:...:. : :. .: : : :: :. .:.::.: :.. .:::::.
CCDS46 TFNNNLGTG---TEANQAYSVPTKMG--------SK--LEDILMDDTLSPVGVTDPLLSS
290 300 310 320 330
530 540
pF1KB5 MSPEASKASSRRSSFSMEEGDVL
.:: :::.::::::.::::
CCDS46 VSPGASKTSSRRSSMSMEETEHTC
340 350
>>CCDS43107.1 MITF gene_id:4286|Hs108|chr3 (413 aa)
initn: 1079 init1: 612 opt: 755 Z-score: 476.1 bits: 97.4 E(32554): 3.4e-20
Smith-Waterman score: 1039; 45.5% identity (66.8% similar) in 437 aa overlap (138-539:11-408)
110 120 130 140 150 160
pF1KB5 QQQQLGGPPTPAINTPVHFQSPPPVPGEVLKVQSYLENPTSYHLQQSQHQKVREYLSETY
.::..:::::.::.::.:.:.:..::: :
CCDS43 MLEMLEYNHYQVQTHLENPTKYHIQQAQRQQVKQYLSTTL
10 20 30 40
170 180 190 200 210 220
pF1KB5 GNKFAAHISPAQGSPKPPPAASPGVRAGHVLSSSAGNSAPNSPMAMLHIGSNPERE----
.:: : . . . : : .:: ::. :.:::::::::: ..:: :.:
CCDS43 ANK---HANQVLSLPCP---NQPG---DHVMPPVPGSSAPNSPMAMLTLNSNCEKEGFYK
50 60 70 80 90
230 240 250
pF1KB5 ------------------------LDDVIDNIMRLD-----DVLGYINPEMQMPNTLPLS
.:::::.:. :. ..:: ..: .:: ::::.:
CCDS43 FEEQNRAESECPGMNTHSRASCMQMDDVIDDIISLESSYNEEILGLMDPALQMANTLPVS
100 110 120 130 140 150
260 270 280 290 300 310
pF1KB5 SSHLNVYSSDPQVTASLVGVTSSSCPADLTQ-KRELTDAESRALAKERQKKDNHNLIERR
.. ...:... .:. :.::::.: . :::::..:.:::::::::::::::::::
CCDS43 GNLIDLYGNQGLPPPGLT--ISNSCPANLPNIKRELTESEARALAKERQKKDNHNLIERR
160 170 180 190 200
320 330 340 350 360 370
pF1KB5 RRFNINDRIKELGMLIPKANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLE
::::::::::::: ::::.:: :.::::::::::::::::..:.. :...::::....::
CCDS43 RRFNINDRIKELGTLIPKSNDPDMRWNKGTILKASVDYIRKLQREQQRAKELENRQKKLE
210 220 230 240 250 260
380 390 400 410 420 430
pF1KB5 MTNKQLWLRIQELEMQARVHGLPTTSPSGMNMAELAQQVVKQELPSEEGPGEALMLGAEV
.:..: :::::::::::.::: .:. .:.....::: : :. .. :
CCDS43 HANRHLLLRIQELEMQARAHGLSLIPSTGLCSPDLVNRIIKQE-PVLENCSQDL------
270 280 290 300 310 320
440 450 460 470 480 490
pF1KB5 PDPEPLPALPPQAPLPLPTQPPSPFHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFP
: .: : : . :...:. : :. .: : :
CCDS43 --------LQHHADLTCTTTLDLTDGTITFNNNLGTG---TEANQAYSVPTKMG------
330 340 350 360
500 510 520 530 540
pF1KB5 SLSKKDLDLMLLDDSLLPLA-SDPLLSTMSPEASKASSRRSSFSMEEGDVL
:: :. .:.::.: :.. .:::::..:: :::.::::::.::::
CCDS43 --SK--LEDILMDDTLSPVGVTDPLLSSVSPGASKTSSRRSSMSMEETEHTC
370 380 390 400 410
>>CCDS5762.1 TFEC gene_id:22797|Hs108|chr7 (347 aa)
initn: 826 init1: 661 opt: 728 Z-score: 460.7 bits: 94.4 E(32554): 2.4e-19
Smith-Waterman score: 840; 43.9% identity (70.3% similar) in 380 aa overlap (175-543:8-347)
150 160 170 180 190 200
pF1KB5 NPTSYHLQQSQHQKVREYLSETYGNKFAAHISPAQGSPKPP-PAASPGVRAGHV-LSSSA
:.:. .: :...: :. .:. :.:.:
CCDS57 MTLDHQIINPTLKWSQPAVPSGGPLVQHAHTTLDSDA
10 20 30
210 220 230 240 250
pF1KB5 GNSAPNSPMA-MLHIGS---NPERELDDVIDNIMRLDDVL---GYINPEMQMPNTLPLSS
: . ..:.. .: ::. : . ...:::..:. ... . : .: . : :: :.
CCDS57 GLT--ENPLTKLLAIGKEDDNAQWHMEDVIEDIIGMESSFKEEGADSP-LLMQRTL--SG
40 50 60 70 80 90
260 270 280 290 300 310
pF1KB5 SHLNVYSSDPQVTASLVGVTSSSCPADLTQKRELTDAESRALAKERQKKDNHNLIERRRR
: :.:::.. .. .:.::.:::..: .:::.:....:::::::::::::::::::::
CCDS57 SILDVYSGEQGISPINMGLTSASCPSSLPMKREITETDTRALAKERQKKDNHNLIERRRR
100 110 120 130 140 150
320 330 340 350 360 370
pF1KB5 FNINDRIKELGMLIPKANDLDVRWNKGTILKASVDYIRRMQKDLQKSRELENHSRRLEMT
.::: :::::: ::::.:: :.::::::::::::.::. .::. :..::::.....::..
CCDS57 YNINYRIKELGTLIPKSNDPDMRWNKGTILKASVEYIKWLQKEQQRARELEHRQKKLEQA
160 170 180 190 200 210
380 390 400 410 420 430
pF1KB5 NKQLWLRIQELEMQARVHGLPTTSPSGMNMAELAQQVVKQELPSEEGPGE--ALMLGAEV
:..: :::::::.:::.::::: . : ..:. .:.::. :.. . . ..
CCDS57 NRRLLLRIQELEIQARTHGLPTLASLG--TVDLGAHVTKQQSHPEQNSVDYCQQLTVSQG
220 230 240 250 260 270
440 450 460 470 480 490
pF1KB5 PDPEPLPALPPQAPLPLPTQPPSPFHHLDFSHSLSFGGREDEGPPGYPEPLAPGHGSPFP
:.:: : :: . . ..: : : :.:: .: .:..
CCDS57 PSPE----LCDQA-IAF-SDPLSYFTDLSFSAAL----KEEQ------------------
280 290 300
500 510 520 530 540
pF1KB5 SLSKKDLDLMLLDDSLLPLASDPLLSTMSPEASKASSRRSSFSMEEGDVL
:: :::::.. :...:::::. :: .:: :::::::: ..:: :
CCDS57 -----RLDGMLLDDTISPFGTDPLLSATSPAVSKESSRRSSFSSDDGDEL
310 320 330 340
543 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 10:26:16 2016 done: Sat Nov 5 10:26:16 2016
Total Scan time: 4.310 Total Display time: 0.090
Function used was FASTA [36.3.4 Apr, 2011]