FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3212, 479 aa
1>>>pF1KE3212 479 - 479 aa - 479 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.2633+/-0.000726; mu= 16.0763+/- 0.045
mean_var=119.1578+/-24.228, 0's: 0 Z-trim(113.7): 114 B-trim: 527 in 1/52
Lambda= 0.117493
statistics sampled from 14163 (14285) to 14163 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.763), E-opt: 0.2 (0.439), width: 16
Scan time: 3.010
The best scores are: opt bits E(32554)
CCDS35137.1 NR6A1 gene_id:2649|Hs108|chr9 ( 480) 3297 569.4 2.9e-162
CCDS55340.1 NR6A1 gene_id:2649|Hs108|chr9 ( 475) 3265 564.0 1.2e-160
CCDS65127.1 NR6A1 gene_id:2649|Hs108|chr9 ( 476) 3253 562.0 5.1e-160
CCDS60383.1 NR5A2 gene_id:2494|Hs108|chr1 ( 469) 407 79.5 8.4e-15
CCDS1400.1 NR5A2 gene_id:2494|Hs108|chr1 ( 495) 407 79.6 8.7e-15
CCDS1401.1 NR5A2 gene_id:2494|Hs108|chr1 ( 541) 407 79.6 9.3e-15
CCDS41790.1 RARG gene_id:5916|Hs108|chr12 ( 443) 403 78.8 1.3e-14
CCDS4768.1 RXRB gene_id:6257|Hs108|chr6 ( 533) 396 77.7 3.3e-14
CCDS59007.1 RXRB gene_id:6257|Hs108|chr6 ( 537) 396 77.7 3.4e-14
CCDS1248.1 RXRG gene_id:6258|Hs108|chr1 ( 463) 392 77.0 4.8e-14
CCDS8850.1 RARG gene_id:5916|Hs108|chr12 ( 454) 386 76.0 9.6e-14
CCDS35172.1 RXRA gene_id:6256|Hs108|chr9 ( 462) 386 76.0 9.8e-14
CCDS58236.1 RARG gene_id:5916|Hs108|chr12 ( 382) 382 75.2 1.4e-13
CCDS6646.1 RORB gene_id:6096|Hs108|chr9 ( 459) 382 75.3 1.6e-13
CCDS6744.1 NR4A3 gene_id:8013|Hs108|chr9 ( 443) 381 75.1 1.7e-13
CCDS72970.1 RXRG gene_id:6258|Hs108|chr1 ( 340) 379 74.7 1.8e-13
CCDS6743.1 NR4A3 gene_id:8013|Hs108|chr9 ( 626) 381 75.2 2.2e-13
CCDS6742.1 NR4A3 gene_id:8013|Hs108|chr9 ( 637) 381 75.2 2.2e-13
CCDS11366.1 RARA gene_id:5914|Hs108|chr17 ( 462) 378 74.6 2.5e-13
CCDS6856.1 NR5A1 gene_id:2516|Hs108|chr9 ( 461) 375 74.1 3.5e-13
CCDS2642.1 RARB gene_id:5915|Hs108|chr3 ( 448) 372 73.6 4.9e-13
CCDS4068.1 NR2F1 gene_id:7025|Hs108|chr5 ( 423) 371 73.4 5.3e-13
CCDS10177.1 RORA gene_id:6095|Hs108|chr15 ( 523) 372 73.6 5.5e-13
CCDS42317.1 RARA gene_id:5914|Hs108|chr17 ( 457) 371 73.4 5.6e-13
CCDS10179.1 RORA gene_id:6095|Hs108|chr15 ( 556) 372 73.7 5.8e-13
CCDS30856.1 RORC gene_id:6097|Hs108|chr1 ( 497) 363 72.1 1.5e-12
CCDS1004.1 RORC gene_id:6097|Hs108|chr1 ( 518) 363 72.1 1.6e-12
CCDS33669.1 PPARA gene_id:5465|Hs108|chr22 ( 468) 361 71.7 1.9e-12
CCDS45271.1 RORA gene_id:6095|Hs108|chr15 ( 468) 361 71.7 1.9e-12
CCDS10178.1 RORA gene_id:6095|Hs108|chr15 ( 548) 361 71.8 2.1e-12
CCDS12352.1 NR2F6 gene_id:2063|Hs108|chr19 ( 404) 354 70.5 3.8e-12
CCDS10375.1 NR2F2 gene_id:7026|Hs108|chr15 ( 414) 354 70.5 3.9e-12
CCDS33718.1 NR1D2 gene_id:9975|Hs108|chr3 ( 579) 355 70.8 4.4e-12
CCDS74905.1 NR2C2 gene_id:7182|Hs108|chr3 ( 596) 355 70.8 4.5e-12
CCDS2621.1 NR2C2 gene_id:7182|Hs108|chr3 ( 615) 355 70.8 4.6e-12
CCDS11361.1 NR1D1 gene_id:9572|Hs108|chr17 ( 614) 351 70.1 7.3e-12
CCDS2201.1 NR4A2 gene_id:4929|Hs108|chr2 ( 598) 348 69.6 1e-11
CCDS41821.1 NR2C1 gene_id:7181|Hs108|chr12 ( 467) 346 69.2 1.1e-11
CCDS44953.1 NR2C1 gene_id:7181|Hs108|chr12 ( 483) 346 69.2 1.1e-11
CCDS9051.1 NR2C1 gene_id:7181|Hs108|chr12 ( 603) 346 69.3 1.3e-11
CCDS1517.1 ESRRG gene_id:2104|Hs108|chr1 ( 435) 342 68.5 1.6e-11
CCDS41468.1 ESRRG gene_id:2104|Hs108|chr1 ( 458) 342 68.5 1.7e-11
CCDS58061.1 ESRRG gene_id:2104|Hs108|chr1 ( 470) 342 68.5 1.7e-11
CCDS2610.2 PPARG gene_id:5468|Hs108|chr3 ( 477) 341 68.4 2e-11
CCDS2609.1 PPARG gene_id:5468|Hs108|chr3 ( 505) 341 68.4 2.1e-11
CCDS9850.2 ESRRB gene_id:2103|Hs108|chr14 ( 508) 338 67.9 2.9e-11
CCDS6220.2 HNF4G gene_id:3174|Hs108|chr8 ( 445) 337 67.6 3e-11
CCDS60830.1 ESRRA gene_id:2101|Hs108|chr11 ( 422) 334 67.1 4.1e-11
CCDS41667.1 ESRRA gene_id:2101|Hs108|chr11 ( 423) 334 67.1 4.1e-11
CCDS74728.1 HNF4A gene_id:3172|Hs108|chr20 ( 449) 334 67.1 4.3e-11
>>CCDS35137.1 NR6A1 gene_id:2649|Hs108|chr9 (480 aa)
initn: 1901 init1: 1901 opt: 3297 Z-score: 3026.6 bits: 569.4 E(32554): 2.9e-162
Smith-Waterman score: 3297; 99.8% identity (99.8% similar) in 480 aa overlap (1-479:1-480)
10 20 30 40 50 60
pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTC
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE3 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP
130 140 150 160 170 180
190 200 210 220 230
pF1KE3 GNRASESNQPSPGSTLSS-RSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLP
:::::::::::::::::: :::::::::::::::::::::::::::::::::::::::::
CCDS35 GNRASESNQPSPGSTLSSSRSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLP
190 200 210 220 230 240
240 250 260 270 280 290
pF1KE3 QQARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 QQARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIA
250 260 270 280 290 300
300 310 320 330 340 350
pF1KE3 WIKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 WIKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHR
310 320 330 340 350 360
360 370 380 390 400 410
pF1KE3 FSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 FSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWY
370 380 390 400 410 420
420 430 440 450 460 470
pF1KE3 ICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 ICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE
430 440 450 460 470 480
>>CCDS55340.1 NR6A1 gene_id:2649|Hs108|chr9 (475 aa)
initn: 2932 init1: 2932 opt: 3265 Z-score: 2997.4 bits: 564.0 E(32554): 1.2e-160
Smith-Waterman score: 3265; 99.0% identity (99.2% similar) in 479 aa overlap (1-479:1-475)
10 20 30 40 50 60
pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTC
::::::::::::::::::::::::::::::::::::::::::::::: .::::::::
CCDS55 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGT----NDRAEQRTC
10 20 30 40 50
70 80 90 100 110 120
pF1KE3 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE3 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP
120 130 140 150 160 170
190 200 210 220 230 240
pF1KE3 GNRASESNQPSPGSTLSSRSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLPQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 GNRASESNQPSPGSTLSSRSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLPQ
180 190 200 210 220 230
250 260 270 280 290 300
pF1KE3 QARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIAW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 QARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIAW
240 250 260 270 280 290
310 320 330 340 350 360
pF1KE3 IKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHRF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 IKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHRF
300 310 320 330 340 350
370 380 390 400 410 420
pF1KE3 SDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWYI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 SDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWYI
360 370 380 390 400 410
430 440 450 460 470
pF1KE3 CQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 CQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE
420 430 440 450 460 470
>>CCDS65127.1 NR6A1 gene_id:2649|Hs108|chr9 (476 aa)
initn: 2244 init1: 1901 opt: 3253 Z-score: 2986.4 bits: 562.0 E(32554): 5.1e-160
Smith-Waterman score: 3253; 98.8% identity (99.0% similar) in 480 aa overlap (1-479:1-476)
10 20 30 40 50 60
pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTC
::::::::::::::::::::::::::::::::::::::::::::::: .::::::::
CCDS65 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGT----NDRAEQRTC
10 20 30 40 50
70 80 90 100 110 120
pF1KE3 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS65 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE3 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS65 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP
120 130 140 150 160 170
190 200 210 220 230
pF1KE3 GNRASESNQPSPGSTLSS-RSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLP
:::::::::::::::::: :::::::::::::::::::::::::::::::::::::::::
CCDS65 GNRASESNQPSPGSTLSSSRSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLP
180 190 200 210 220 230
240 250 260 270 280 290
pF1KE3 QQARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS65 QQARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIA
240 250 260 270 280 290
300 310 320 330 340 350
pF1KE3 WIKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS65 WIKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHR
300 310 320 330 340 350
360 370 380 390 400 410
pF1KE3 FSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS65 FSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWY
360 370 380 390 400 410
420 430 440 450 460 470
pF1KE3 ICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS65 ICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE
420 430 440 450 460 470
>>CCDS60383.1 NR5A2 gene_id:2494|Hs108|chr1 (469 aa)
initn: 528 init1: 372 opt: 407 Z-score: 379.3 bits: 79.5 E(32554): 8.4e-15
Smith-Waterman score: 623; 27.7% identity (59.5% similar) in 447 aa overlap (48-450:2-438)
20 30 40 50 60 70
pF1KE3 AGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISC
.. : :. .. : .:::...: :::...:
CCDS60 MVNYSYDEDLEELCPVCGDKVSGYHYGLLTC
10 20 30
80 90 100 110 120 130
pF1KE3 EGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPG
:.:::::::.. :.. : : ...:: ... ::.:: :::. :::..::. .:.: : : :
CCDS60 ESCKGFFKRTVQNNKRYTCIENQNCQIDKTQRKRCPYCRFQKCLSVGMKLEAVRADRMRG
40 50 60 70 80 90
140 150 160 170 180 190
pF1KE3 GRNKSIGPVQISEEEIER----IMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPG
:::: .::. .. ... .. .. .. :: .. . . .. .. .. : :
CCDS60 GRNK-FGPMYKRDRALKQQKKALIRANGLKLEAMSQVIQAMPSDLTISSAIQNIHSASKG
100 110 120 130 140 150
200 210 220 230
pF1KE3 STLSSRSVELNGF--MAFREQYMGMSVPPH-----YQYIPHLFSYS--------------
:. .. . . : . ..:..::: :: :. : .
CCDS60 LPLNHAALPPTDYDRSPFVTSPISMTMPPHGSLQGYQTYGHFPSRAIKSEYPDPYTSSPE
160 170 180 190 200 210
240 250 260 270 280
pF1KE3 ---GHSPLLPQQARSLDPQSYS-LIHQLLSAEDLEPLGTPMLI-----EDGYAVTQAEL-
:.: . :. : : : :: .::. : :: .. :.. . .:
CCDS60 SIMGYSYMDSYQTSS--PASIPHLILELLKCEPDEPQVQAKIMAYLQQEQANRSKHEKLS
220 230 240 250 260
290 300 310 320 330
pF1KE3 -FALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILLSSL---TVYSKQ
:.:.:..::. :: . : .. :: ::.. : ::.. :.::..:. . .:..:.
CCDS60 TFGLMCKMADQTLFSIVEWARSSIFFRELKVDDQMKLLQNCWSELLILDHIYRQVVHGKE
270 280 290 300 310 320
340 350 360 370 380 390
pF1KE3 --IF---GELADVTAKYSPSDEELHRFSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKA
:: :. .: . : . :. . ....:.. .: ..:. ...:..:.:
CCDS60 GSIFLVTGQQVDYSIIASQAGATLNNLMSHAQELVAKL-------RSLQFDQREFVCLKF
330 340 350 360 370 380
400 410 420 430 440 450
pF1KE3 INFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKM
. ... :...: . . .: .... :.: .: .: ..: .:.. ::::: :.
CCDS60 LVLFSLDVKNLENFQLVEGVQEQVNAALLDYTMCNYPQQTEKFGQLLLRLPEIRAISMQA
390 400 410 420 430 440
460 470
pF1KE3 VNVPLEQLPLLFKVVLHSCKTSVGKE
CCDS60 EEYLYYKHLNGDVPYNNLLIEMLHAKRA
450 460
>>CCDS1400.1 NR5A2 gene_id:2494|Hs108|chr1 (495 aa)
initn: 528 init1: 372 opt: 407 Z-score: 378.9 bits: 79.6 E(32554): 8.7e-15
Smith-Waterman score: 623; 27.7% identity (59.5% similar) in 447 aa overlap (48-450:28-464)
20 30 40 50 60 70
pF1KE3 AGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISC
.. : :. .. : .:::...: :::...:
CCDS14 MSSNSDTGDLQESLKHGLTPIVSQFKMVNYSYDEDLEELCPVCGDKVSGYHYGLLTC
10 20 30 40 50
80 90 100 110 120 130
pF1KE3 EGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPG
:.:::::::.. :.. : : ...:: ... ::.:: :::. :::..::. .:.: : : :
CCDS14 ESCKGFFKRTVQNNKRYTCIENQNCQIDKTQRKRCPYCRFQKCLSVGMKLEAVRADRMRG
60 70 80 90 100 110
140 150 160 170 180 190
pF1KE3 GRNKSIGPVQISEEEIER----IMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPG
:::: .::. .. ... .. .. .. :: .. . . .. .. .. : :
CCDS14 GRNK-FGPMYKRDRALKQQKKALIRANGLKLEAMSQVIQAMPSDLTISSAIQNIHSASKG
120 130 140 150 160 170
200 210 220 230
pF1KE3 STLSSRSVELNGF--MAFREQYMGMSVPPH-----YQYIPHLFSYS--------------
:. .. . . : . ..:..::: :: :. : .
CCDS14 LPLNHAALPPTDYDRSPFVTSPISMTMPPHGSLQGYQTYGHFPSRAIKSEYPDPYTSSPE
180 190 200 210 220 230
240 250 260 270 280
pF1KE3 ---GHSPLLPQQARSLDPQSYS-LIHQLLSAEDLEPLGTPMLI-----EDGYAVTQAEL-
:.: . :. : : : :: .::. : :: .. :.. . .:
CCDS14 SIMGYSYMDSYQTSS--PASIPHLILELLKCEPDEPQVQAKIMAYLQQEQANRSKHEKLS
240 250 260 270 280 290
290 300 310 320 330
pF1KE3 -FALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILLSSL---TVYSKQ
:.:.:..::. :: . : .. :: ::.. : ::.. :.::..:. . .:..:.
CCDS14 TFGLMCKMADQTLFSIVEWARSSIFFRELKVDDQMKLLQNCWSELLILDHIYRQVVHGKE
300 310 320 330 340 350
340 350 360 370 380 390
pF1KE3 --IF---GELADVTAKYSPSDEELHRFSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKA
:: :. .: . : . :. . ....:.. .: ..:. ...:..:.:
CCDS14 GSIFLVTGQQVDYSIIASQAGATLNNLMSHAQELVAKL-------RSLQFDQREFVCLKF
360 370 380 390 400
400 410 420 430 440 450
pF1KE3 INFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKM
. ... :...: . . .: .... :.: .: .: ..: .:.. ::::: :.
CCDS14 LVLFSLDVKNLENFQLVEGVQEQVNAALLDYTMCNYPQQTEKFGQLLLRLPEIRAISMQA
410 420 430 440 450 460
460 470
pF1KE3 VNVPLEQLPLLFKVVLHSCKTSVGKE
CCDS14 EEYLYYKHLNGDVPYNNLLIEMLHAKRA
470 480 490
>>CCDS1401.1 NR5A2 gene_id:2494|Hs108|chr1 (541 aa)
initn: 505 init1: 372 opt: 407 Z-score: 378.4 bits: 79.6 E(32554): 9.3e-15
Smith-Waterman score: 623; 27.7% identity (59.5% similar) in 447 aa overlap (48-450:74-510)
20 30 40 50 60 70
pF1KE3 AGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISC
.. : :. .. : .:::...: :::...:
CCDS14 KVETEALGLARSHGEQGQMPENMQVSQFKMVNYSYDEDLEELCPVCGDKVSGYHYGLLTC
50 60 70 80 90 100
80 90 100 110 120 130
pF1KE3 EGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPG
:.:::::::.. :.. : : ...:: ... ::.:: :::. :::..::. .:.: : : :
CCDS14 ESCKGFFKRTVQNNKRYTCIENQNCQIDKTQRKRCPYCRFQKCLSVGMKLEAVRADRMRG
110 120 130 140 150 160
140 150 160 170 180 190
pF1KE3 GRNKSIGPVQISEEEIER----IMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPG
:::: .::. .. ... .. .. .. :: .. . . .. .. .. : :
CCDS14 GRNK-FGPMYKRDRALKQQKKALIRANGLKLEAMSQVIQAMPSDLTISSAIQNIHSASKG
170 180 190 200 210 220
200 210 220 230
pF1KE3 STLSSRSVELNGF--MAFREQYMGMSVPPH-----YQYIPHLFSYS--------------
:. .. . . : . ..:..::: :: :. : .
CCDS14 LPLNHAALPPTDYDRSPFVTSPISMTMPPHGSLQGYQTYGHFPSRAIKSEYPDPYTSSPE
230 240 250 260 270 280
240 250 260 270 280
pF1KE3 ---GHSPLLPQQARSLDPQSYS-LIHQLLSAEDLEPLGTPMLI-----EDGYAVTQAEL-
:.: . :. : : : :: .::. : :: .. :.. . .:
CCDS14 SIMGYSYMDSYQTSS--PASIPHLILELLKCEPDEPQVQAKIMAYLQQEQANRSKHEKLS
290 300 310 320 330 340
290 300 310 320 330
pF1KE3 -FALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILLSSL---TVYSKQ
:.:.:..::. :: . : .. :: ::.. : ::.. :.::..:. . .:..:.
CCDS14 TFGLMCKMADQTLFSIVEWARSSIFFRELKVDDQMKLLQNCWSELLILDHIYRQVVHGKE
350 360 370 380 390 400
340 350 360 370 380 390
pF1KE3 --IF---GELADVTAKYSPSDEELHRFSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKA
:: :. .: . : . :. . ....:.. .: ..:. ...:..:.:
CCDS14 GSIFLVTGQQVDYSIIASQAGATLNNLMSHAQELVAKL-------RSLQFDQREFVCLKF
410 420 430 440 450
400 410 420 430 440 450
pF1KE3 INFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKM
. ... :...: . . .: .... :.: .: .: ..: .:.. ::::: :.
CCDS14 LVLFSLDVKNLENFQLVEGVQEQVNAALLDYTMCNYPQQTEKFGQLLLRLPEIRAISMQA
460 470 480 490 500 510
460 470
pF1KE3 VNVPLEQLPLLFKVVLHSCKTSVGKE
CCDS14 EEYLYYKHLNGDVPYNNLLIEMLHAKRA
520 530 540
>>CCDS41790.1 RARG gene_id:5916|Hs108|chr12 (443 aa)
initn: 511 init1: 381 opt: 403 Z-score: 375.9 bits: 78.8 E(32554): 1.3e-14
Smith-Waterman score: 403; 41.4% identity (68.6% similar) in 140 aa overlap (9-147:30-161)
10 20 30
pF1KE3 MERDEPPPSGGGGGGGSAGFLEP-PAALPPPPRNGFCQD
.::. .: .: : ::.: .. ..
CCDS41 MYDCMETFAPGPRRLYGAAGPGAGLLRRATGGSCFAGLESFAWPQPASLQSVETQSTSSE
10 20 30 40 50 60
40 50 60 70 80 90
pF1KE3 ELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSR
:.. :.. : . :..:.:...: :::. :::::::::.::: .. :: : :
CCDS41 EMV---PSSPSPPPPPRVYKPCFVCNDKSSGYHYGVSSCEGCKGFFRRSIQKNMVYTCHR
70 80 90 100 110
100 110 120 130 140 150
pF1KE3 DKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMS
::::.... ::::::::: ::...::...:.:.: :::. :.
CCDS41 DKNCIINKVTRNRCQYCRLQKCFEVGMSKEAVRND-----RNKKKKEVKEEGSPDSYELS
120 130 140 150 160 170
160 170 180 190 200 210
pF1KE3 GQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPGSTLSSRSVELNGFMAFREQYMGMSV
CCDS41 PQLEELITKVSKAHQETFPSLCQLGKYTTNSSADHRVQLDLGLWDKFSELATKCIIKIVE
180 190 200 210 220 230
>>CCDS4768.1 RXRB gene_id:6257|Hs108|chr6 (533 aa)
initn: 457 init1: 385 opt: 396 Z-score: 368.4 bits: 77.7 E(32554): 3.3e-14
Smith-Waterman score: 502; 28.7% identity (50.1% similar) in 471 aa overlap (12-468:170-513)
10 20 30
pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAA------LPPPPRNGF
:::.: ..::. :::: .:
CCDS47 MGSPGLPPPAPPGFSGPVSSPQINSTVSLPGGGSGPPEDVKPPVLGVRGLHCPPPP-GG-
140 150 160 170 180 190
40 50 60 70 80 90
pF1KE3 CQDELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISCEGCKGFFKRSICNKRVYR
:: : .: : :::::..: :::. :::::::::::.: . .:
CCDS47 ---------PG--------AGKRLCAICGDRSSGKHYGVYSCEGCKGFFKRTIRKDLTYS
200 210 220 230 240
100 110 120 130 140 150
pF1KE3 CSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPG----GRNKSIGPVQISEE
: .:.:.....::::::::: ::: ::.:.:..:. . : : ... : . :
CCDS47 CRDNKDCTVDKRQRNRCQYCRYQKCLATGMKREAVQEERQRGKDKDGDGEGAGGAP-EEM
250 260 270 280 290
160 170 180 190 200 210
pF1KE3 EIERIMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPGSTLSSRSVELNGFMAFRE
..::. .. :. ::.. : ::.: .:
CCDS47 PVDRILEAELAVEQK--------SDQGVEG----------PGGT--------GG------
300 310 320
220 230 240 250 260 270
pF1KE3 QYMGMSVPPHYQYIPHLFSYSGHSPLLPQQARSLDPQSYSLIHQLLSAEDLEPLGTPMLI
:: :: ::
CCDS47 --------------------SGSSPN--------DP------------------------
330
280 290 300 310 320 330
pF1KE3 EDGYAVTQAELFALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILLSS
::. .:. ::. :: . : :..: : : . : . :: . :.:: :..:
CCDS47 -----VTN------ICQAADKQLFTLVEWAKRIPHFSSLPLDDQVILLRAGWNEL-LIAS
340 350 360 370 380
340 350 360 370 380
pF1KE3 LTVYSKQIFGELADVTAKYSPSDEELHRFS--DEGM-EVIER-LIYLYHKFHQLKVSNEE
.. : .. . .:. . .:: : . :. ...: : : :........ :
CCDS47 FSHRSIDVRDGILLATGLH------VHRNSAHSAGVGAIFDRVLTELVSKMRDMRMDKTE
390 400 410 420 430
390 400 410 420 430 440
pF1KE3 YACMKAINFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPEIR
.:..:: ..: : .::.. :..: : .. . . . . :: .: .:: :.. :: .:
CCDS47 LGCLRAIILFNPDAKGLSNPSEVEVLREKVYASLETYCKQKYPEQQGRFAKLLLRLPALR
440 450 460 470 480 490
450 460 470
pF1KE3 YIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE
:. : ::.: ..::..
CCDS47 SIGLKC----LEHL-FFFKLIGDTPIDTFLMEMLEAPHQLA
500 510 520 530
>>CCDS59007.1 RXRB gene_id:6257|Hs108|chr6 (537 aa)
initn: 457 init1: 385 opt: 396 Z-score: 368.4 bits: 77.7 E(32554): 3.4e-14
Smith-Waterman score: 497; 28.0% identity (49.7% similar) in 471 aa overlap (12-468:170-517)
10 20 30
pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAA------LPPPPRNGF
:::.: ..::. :::: .:
CCDS59 MGSPGLPPPAPPGFSGPVSSPQINSTVSLPGGGSGPPEDVKPPVLGVRGLHCPPPP-GG-
140 150 160 170 180 190
40 50 60 70 80 90
pF1KE3 CQDELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISCEGCKGFFKRSICNKRVYR
:: : .: : :::::..: :::. :::::::::::.: . .:
CCDS59 ---------PG--------AGKRLCAICGDRSSGKHYGVYSCEGCKGFFKRTIRKDLTYS
200 210 220 230 240
100 110 120 130 140 150
pF1KE3 CSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPG----GRNKSIGPVQISEE
: .:.:.....::::::::: ::: ::.:.:..:. . : : ... : . :
CCDS59 CRDNKDCTVDKRQRNRCQYCRYQKCLATGMKREAVQEERQRGKDKDGDGEGAGGAP-EEM
250 260 270 280 290
160 170 180 190 200 210
pF1KE3 EIERIMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPGSTLSSRSVELNGFMAFRE
..::. .. :. ::.. : ::.: .:
CCDS59 PVDRILEAELAVEQK--------SDQGVEG----------PGGT--------GG------
300 310 320
220 230 240 250 260 270
pF1KE3 QYMGMSVPPHYQYIPHLFSYSGHSPLLPQQARSLDPQSYSLIHQLLSAEDLEPLGTPMLI
:: :: ::
CCDS59 --------------------SGSSPN--------DP------------------------
330
280 290 300 310 320 330
pF1KE3 EDGYAVTQAELFALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILLSS
::. .:. ::. :: . : :..: : : . : . :: . :.:: :..:
CCDS59 -----VTN------ICQAADKQLFTLVEWAKRIPHFSSLPLDDQVILLRAGWNEL-LIAS
340 350 360 370 380
340 350 360 370 380
pF1KE3 LTVYSKQIFGELADVTA----KYSPSDEELHRFSDEGMEVIERLIYLYHKFHQLKVSNEE
.. : .. . .:. . : . . . :... . : : :........ :
CCDS59 FSHRSIDVRDGILLATGLHVHRNSAHSAGVGAIFDRSLSRV--LTELVSKMRDMRMDKTE
390 400 410 420 430 440
390 400 410 420 430 440
pF1KE3 YACMKAINFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPEIR
.:..:: ..: : .::.. :..: : .. . . . . :: .: .:: :.. :: .:
CCDS59 LGCLRAIILFNPDAKGLSNPSEVEVLREKVYASLETYCKQKYPEQQGRFAKLLLRLPALR
450 460 470 480 490 500
450 460 470
pF1KE3 YIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE
:. : ::.: ..::..
CCDS59 SIGLKC----LEHL-FFFKLIGDTPIDTFLMEMLEAPHQLA
510 520 530
>>CCDS1248.1 RXRG gene_id:6258|Hs108|chr1 (463 aa)
initn: 645 init1: 372 opt: 392 Z-score: 365.6 bits: 77.0 E(32554): 4.8e-14
Smith-Waterman score: 524; 27.7% identity (52.6% similar) in 473 aa overlap (7-468:79-443)
10 20 30
pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFC
::::. .. . ... ::.. .
CCDS12 PVSAPRTLSAVGTPLNALGSPYRVITSAMGPPSGALAAPPGINLVAPPSSQLNVVNSVSS
50 60 70 80 90 100
40 50 60 70 80
pF1KE3 QDELAELD--PGT-----ISVSDDRAEQRTCLICGDRATGLHYGIISCEGCKGFFKRSIC
.... : :: :.: .. : :::::..: :::. :::::::::::.:
CCDS12 SEDIKPLPGLPGIGNMNYPSTSPGSLVKHICAICGDRSSGKHYGVYSCEGCKGFFKRTIR
110 120 130 140 150 160
90 100 110 120 130 140
pF1KE3 NKRVYRCSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPGGRNKSIGPVQIS
. .: : .:.:.....::::::::: ::: :::.:.:..:. :..: . .
CCDS12 KDLIYTCRDNKDCLIDKRQRNRCQYCRYQKCLVMGMKREAVQEE-----RQRS---RERA
170 180 190 200 210 220
150 160 170 180 190 200
pF1KE3 EEEIERIMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPGSTLSSRSVELNGFMAF
: : : ::.: . : .: :..
CCDS12 ESEAECATSGHE----------------DMPVERILEAE---------------------
230 240
210 220 230 240 250 260
pF1KE3 REQYMGMSVPPHYQYIPHLFSYSGHSPLLPQQARSLDPQSYSLIHQLLSAEDLEPLGTPM
..: :. . ::. . .. . ::
CCDS12 ------LAVEPKTE------SYGD----MNMENSTNDP----------------------
250 260
270 280 290 300 310 320
pF1KE3 LIEDGYAVTQAELFALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILL
::. .:. ::. :: . : :..: : .:...: . :: . :.:: :.
CCDS12 -------VTN------ICHAADKQLFTLVEWAKRIPHFSDLTLEDQVILLRAGWNEL-LI
270 280 290 300 310
330 340 350 360 370 380
pF1KE3 SSLTVYSKQIFGELADVTAKYSPSDEELHRFS--DEGM-EVIER-LIYLYHKFHQLKVSN
.:.. : .. . .:. . .:: : . :. ...: : : :........
CCDS12 ASFSHRSVSVQDGILLATGLH------VHRSSAHSAGVGSIFDRVLTELVSKMKDMQMDK
320 330 340 350 360
390 400 410 420 430 440
pF1KE3 EEYACMKAINFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPE
: .:..:: ..: : .::.. :..: : .. . . .:. :: .::.:: :.. ::
CCDS12 SELGCLRAIVLFNPDAKGLSNPSEVETLREKVYATLEAYTKQKYPEQPGRFAKLLLRLPA
370 380 390 400 410 420
450 460 470
pF1KE3 IRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE
.: :. : ::.: ..::..
CCDS12 LRSIGLKC----LEHL-FFFKLIGDTPIDTFLMEMLETPLQIT
430 440 450 460
479 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 17:15:02 2016 done: Mon Nov 7 17:15:03 2016
Total Scan time: 3.010 Total Display time: 0.050
Function used was FASTA [36.3.4 Apr, 2011]