FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3212, 479 aa 1>>>pF1KE3212 479 - 479 aa - 479 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.2633+/-0.000726; mu= 16.0763+/- 0.045 mean_var=119.1578+/-24.228, 0's: 0 Z-trim(113.7): 114 B-trim: 527 in 1/52 Lambda= 0.117493 statistics sampled from 14163 (14285) to 14163 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.763), E-opt: 0.2 (0.439), width: 16 Scan time: 3.010 The best scores are: opt bits E(32554) CCDS35137.1 NR6A1 gene_id:2649|Hs108|chr9 ( 480) 3297 569.4 2.9e-162 CCDS55340.1 NR6A1 gene_id:2649|Hs108|chr9 ( 475) 3265 564.0 1.2e-160 CCDS65127.1 NR6A1 gene_id:2649|Hs108|chr9 ( 476) 3253 562.0 5.1e-160 CCDS60383.1 NR5A2 gene_id:2494|Hs108|chr1 ( 469) 407 79.5 8.4e-15 CCDS1400.1 NR5A2 gene_id:2494|Hs108|chr1 ( 495) 407 79.6 8.7e-15 CCDS1401.1 NR5A2 gene_id:2494|Hs108|chr1 ( 541) 407 79.6 9.3e-15 CCDS41790.1 RARG gene_id:5916|Hs108|chr12 ( 443) 403 78.8 1.3e-14 CCDS4768.1 RXRB gene_id:6257|Hs108|chr6 ( 533) 396 77.7 3.3e-14 CCDS59007.1 RXRB gene_id:6257|Hs108|chr6 ( 537) 396 77.7 3.4e-14 CCDS1248.1 RXRG gene_id:6258|Hs108|chr1 ( 463) 392 77.0 4.8e-14 CCDS8850.1 RARG gene_id:5916|Hs108|chr12 ( 454) 386 76.0 9.6e-14 CCDS35172.1 RXRA gene_id:6256|Hs108|chr9 ( 462) 386 76.0 9.8e-14 CCDS58236.1 RARG gene_id:5916|Hs108|chr12 ( 382) 382 75.2 1.4e-13 CCDS6646.1 RORB gene_id:6096|Hs108|chr9 ( 459) 382 75.3 1.6e-13 CCDS6744.1 NR4A3 gene_id:8013|Hs108|chr9 ( 443) 381 75.1 1.7e-13 CCDS72970.1 RXRG gene_id:6258|Hs108|chr1 ( 340) 379 74.7 1.8e-13 CCDS6743.1 NR4A3 gene_id:8013|Hs108|chr9 ( 626) 381 75.2 2.2e-13 CCDS6742.1 NR4A3 gene_id:8013|Hs108|chr9 ( 637) 381 75.2 2.2e-13 CCDS11366.1 RARA gene_id:5914|Hs108|chr17 ( 462) 378 74.6 2.5e-13 CCDS6856.1 NR5A1 gene_id:2516|Hs108|chr9 ( 461) 375 74.1 3.5e-13 CCDS2642.1 RARB gene_id:5915|Hs108|chr3 ( 448) 372 73.6 4.9e-13 CCDS4068.1 NR2F1 gene_id:7025|Hs108|chr5 ( 423) 371 73.4 5.3e-13 CCDS10177.1 RORA gene_id:6095|Hs108|chr15 ( 523) 372 73.6 5.5e-13 CCDS42317.1 RARA gene_id:5914|Hs108|chr17 ( 457) 371 73.4 5.6e-13 CCDS10179.1 RORA gene_id:6095|Hs108|chr15 ( 556) 372 73.7 5.8e-13 CCDS30856.1 RORC gene_id:6097|Hs108|chr1 ( 497) 363 72.1 1.5e-12 CCDS1004.1 RORC gene_id:6097|Hs108|chr1 ( 518) 363 72.1 1.6e-12 CCDS33669.1 PPARA gene_id:5465|Hs108|chr22 ( 468) 361 71.7 1.9e-12 CCDS45271.1 RORA gene_id:6095|Hs108|chr15 ( 468) 361 71.7 1.9e-12 CCDS10178.1 RORA gene_id:6095|Hs108|chr15 ( 548) 361 71.8 2.1e-12 CCDS12352.1 NR2F6 gene_id:2063|Hs108|chr19 ( 404) 354 70.5 3.8e-12 CCDS10375.1 NR2F2 gene_id:7026|Hs108|chr15 ( 414) 354 70.5 3.9e-12 CCDS33718.1 NR1D2 gene_id:9975|Hs108|chr3 ( 579) 355 70.8 4.4e-12 CCDS74905.1 NR2C2 gene_id:7182|Hs108|chr3 ( 596) 355 70.8 4.5e-12 CCDS2621.1 NR2C2 gene_id:7182|Hs108|chr3 ( 615) 355 70.8 4.6e-12 CCDS11361.1 NR1D1 gene_id:9572|Hs108|chr17 ( 614) 351 70.1 7.3e-12 CCDS2201.1 NR4A2 gene_id:4929|Hs108|chr2 ( 598) 348 69.6 1e-11 CCDS41821.1 NR2C1 gene_id:7181|Hs108|chr12 ( 467) 346 69.2 1.1e-11 CCDS44953.1 NR2C1 gene_id:7181|Hs108|chr12 ( 483) 346 69.2 1.1e-11 CCDS9051.1 NR2C1 gene_id:7181|Hs108|chr12 ( 603) 346 69.3 1.3e-11 CCDS1517.1 ESRRG gene_id:2104|Hs108|chr1 ( 435) 342 68.5 1.6e-11 CCDS41468.1 ESRRG gene_id:2104|Hs108|chr1 ( 458) 342 68.5 1.7e-11 CCDS58061.1 ESRRG gene_id:2104|Hs108|chr1 ( 470) 342 68.5 1.7e-11 CCDS2610.2 PPARG gene_id:5468|Hs108|chr3 ( 477) 341 68.4 2e-11 CCDS2609.1 PPARG gene_id:5468|Hs108|chr3 ( 505) 341 68.4 2.1e-11 CCDS9850.2 ESRRB gene_id:2103|Hs108|chr14 ( 508) 338 67.9 2.9e-11 CCDS6220.2 HNF4G gene_id:3174|Hs108|chr8 ( 445) 337 67.6 3e-11 CCDS60830.1 ESRRA gene_id:2101|Hs108|chr11 ( 422) 334 67.1 4.1e-11 CCDS41667.1 ESRRA gene_id:2101|Hs108|chr11 ( 423) 334 67.1 4.1e-11 CCDS74728.1 HNF4A gene_id:3172|Hs108|chr20 ( 449) 334 67.1 4.3e-11 >>CCDS35137.1 NR6A1 gene_id:2649|Hs108|chr9 (480 aa) initn: 1901 init1: 1901 opt: 3297 Z-score: 3026.6 bits: 569.4 E(32554): 2.9e-162 Smith-Waterman score: 3297; 99.8% identity (99.8% similar) in 480 aa overlap (1-479:1-480) 10 20 30 40 50 60 pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTC 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP 130 140 150 160 170 180 190 200 210 220 230 pF1KE3 GNRASESNQPSPGSTLSS-RSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLP :::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::: CCDS35 GNRASESNQPSPGSTLSSSRSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLP 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE3 QQARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 QQARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIA 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE3 WIKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 WIKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHR 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE3 FSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 FSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWY 370 380 390 400 410 420 420 430 440 450 460 470 pF1KE3 ICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 ICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE 430 440 450 460 470 480 >>CCDS55340.1 NR6A1 gene_id:2649|Hs108|chr9 (475 aa) initn: 2932 init1: 2932 opt: 3265 Z-score: 2997.4 bits: 564.0 E(32554): 1.2e-160 Smith-Waterman score: 3265; 99.0% identity (99.2% similar) in 479 aa overlap (1-479:1-475) 10 20 30 40 50 60 pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTC ::::::::::::::::::::::::::::::::::::::::::::::: .:::::::: CCDS55 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGT----NDRAEQRTC 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE3 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE3 GNRASESNQPSPGSTLSSRSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 GNRASESNQPSPGSTLSSRSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLPQ 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE3 QARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIAW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 QARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIAW 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE3 IKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 IKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHRF 300 310 320 330 340 350 370 380 390 400 410 420 pF1KE3 SDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWYI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 SDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWYI 360 370 380 390 400 410 430 440 450 460 470 pF1KE3 CQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 CQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE 420 430 440 450 460 470 >>CCDS65127.1 NR6A1 gene_id:2649|Hs108|chr9 (476 aa) initn: 2244 init1: 1901 opt: 3253 Z-score: 2986.4 bits: 562.0 E(32554): 5.1e-160 Smith-Waterman score: 3253; 98.8% identity (99.0% similar) in 480 aa overlap (1-479:1-476) 10 20 30 40 50 60 pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTC ::::::::::::::::::::::::::::::::::::::::::::::: .:::::::: CCDS65 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFCQDELAELDPGT----NDRAEQRTC 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 LICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKC 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE3 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 LQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSP 120 130 140 150 160 170 190 200 210 220 230 pF1KE3 GNRASESNQPSPGSTLSS-RSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLP :::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::: CCDS65 GNRASESNQPSPGSTLSSSRSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLP 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE3 QQARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 QQARSLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIA 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE3 WIKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 WIKKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHR 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE3 FSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 FSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRYWY 360 370 380 390 400 410 420 430 440 450 460 470 pF1KE3 ICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 ICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE 420 430 440 450 460 470 >>CCDS60383.1 NR5A2 gene_id:2494|Hs108|chr1 (469 aa) initn: 528 init1: 372 opt: 407 Z-score: 379.3 bits: 79.5 E(32554): 8.4e-15 Smith-Waterman score: 623; 27.7% identity (59.5% similar) in 447 aa overlap (48-450:2-438) 20 30 40 50 60 70 pF1KE3 AGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISC .. : :. .. : .:::...: :::...: CCDS60 MVNYSYDEDLEELCPVCGDKVSGYHYGLLTC 10 20 30 80 90 100 110 120 130 pF1KE3 EGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPG :.:::::::.. :.. : : ...:: ... ::.:: :::. :::..::. .:.: : : : CCDS60 ESCKGFFKRTVQNNKRYTCIENQNCQIDKTQRKRCPYCRFQKCLSVGMKLEAVRADRMRG 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE3 GRNKSIGPVQISEEEIER----IMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPG :::: .::. .. ... .. .. .. :: .. . . .. .. .. : : CCDS60 GRNK-FGPMYKRDRALKQQKKALIRANGLKLEAMSQVIQAMPSDLTISSAIQNIHSASKG 100 110 120 130 140 150 200 210 220 230 pF1KE3 STLSSRSVELNGF--MAFREQYMGMSVPPH-----YQYIPHLFSYS-------------- :. .. . . : . ..:..::: :: :. : . CCDS60 LPLNHAALPPTDYDRSPFVTSPISMTMPPHGSLQGYQTYGHFPSRAIKSEYPDPYTSSPE 160 170 180 190 200 210 240 250 260 270 280 pF1KE3 ---GHSPLLPQQARSLDPQSYS-LIHQLLSAEDLEPLGTPMLI-----EDGYAVTQAEL- :.: . :. : : : :: .::. : :: .. :.. . .: CCDS60 SIMGYSYMDSYQTSS--PASIPHLILELLKCEPDEPQVQAKIMAYLQQEQANRSKHEKLS 220 230 240 250 260 290 300 310 320 330 pF1KE3 -FALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILLSSL---TVYSKQ :.:.:..::. :: . : .. :: ::.. : ::.. :.::..:. . .:..:. CCDS60 TFGLMCKMADQTLFSIVEWARSSIFFRELKVDDQMKLLQNCWSELLILDHIYRQVVHGKE 270 280 290 300 310 320 340 350 360 370 380 390 pF1KE3 --IF---GELADVTAKYSPSDEELHRFSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKA :: :. .: . : . :. . ....:.. .: ..:. ...:..:.: CCDS60 GSIFLVTGQQVDYSIIASQAGATLNNLMSHAQELVAKL-------RSLQFDQREFVCLKF 330 340 350 360 370 380 400 410 420 430 440 450 pF1KE3 INFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKM . ... :...: . . .: .... :.: .: .: ..: .:.. ::::: :. CCDS60 LVLFSLDVKNLENFQLVEGVQEQVNAALLDYTMCNYPQQTEKFGQLLLRLPEIRAISMQA 390 400 410 420 430 440 460 470 pF1KE3 VNVPLEQLPLLFKVVLHSCKTSVGKE CCDS60 EEYLYYKHLNGDVPYNNLLIEMLHAKRA 450 460 >>CCDS1400.1 NR5A2 gene_id:2494|Hs108|chr1 (495 aa) initn: 528 init1: 372 opt: 407 Z-score: 378.9 bits: 79.6 E(32554): 8.7e-15 Smith-Waterman score: 623; 27.7% identity (59.5% similar) in 447 aa overlap (48-450:28-464) 20 30 40 50 60 70 pF1KE3 AGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISC .. : :. .. : .:::...: :::...: CCDS14 MSSNSDTGDLQESLKHGLTPIVSQFKMVNYSYDEDLEELCPVCGDKVSGYHYGLLTC 10 20 30 40 50 80 90 100 110 120 130 pF1KE3 EGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPG :.:::::::.. :.. : : ...:: ... ::.:: :::. :::..::. .:.: : : : CCDS14 ESCKGFFKRTVQNNKRYTCIENQNCQIDKTQRKRCPYCRFQKCLSVGMKLEAVRADRMRG 60 70 80 90 100 110 140 150 160 170 180 190 pF1KE3 GRNKSIGPVQISEEEIER----IMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPG :::: .::. .. ... .. .. .. :: .. . . .. .. .. : : CCDS14 GRNK-FGPMYKRDRALKQQKKALIRANGLKLEAMSQVIQAMPSDLTISSAIQNIHSASKG 120 130 140 150 160 170 200 210 220 230 pF1KE3 STLSSRSVELNGF--MAFREQYMGMSVPPH-----YQYIPHLFSYS-------------- :. .. . . : . ..:..::: :: :. : . CCDS14 LPLNHAALPPTDYDRSPFVTSPISMTMPPHGSLQGYQTYGHFPSRAIKSEYPDPYTSSPE 180 190 200 210 220 230 240 250 260 270 280 pF1KE3 ---GHSPLLPQQARSLDPQSYS-LIHQLLSAEDLEPLGTPMLI-----EDGYAVTQAEL- :.: . :. : : : :: .::. : :: .. :.. . .: CCDS14 SIMGYSYMDSYQTSS--PASIPHLILELLKCEPDEPQVQAKIMAYLQQEQANRSKHEKLS 240 250 260 270 280 290 290 300 310 320 330 pF1KE3 -FALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILLSSL---TVYSKQ :.:.:..::. :: . : .. :: ::.. : ::.. :.::..:. . .:..:. CCDS14 TFGLMCKMADQTLFSIVEWARSSIFFRELKVDDQMKLLQNCWSELLILDHIYRQVVHGKE 300 310 320 330 340 350 340 350 360 370 380 390 pF1KE3 --IF---GELADVTAKYSPSDEELHRFSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKA :: :. .: . : . :. . ....:.. .: ..:. ...:..:.: CCDS14 GSIFLVTGQQVDYSIIASQAGATLNNLMSHAQELVAKL-------RSLQFDQREFVCLKF 360 370 380 390 400 400 410 420 430 440 450 pF1KE3 INFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKM . ... :...: . . .: .... :.: .: .: ..: .:.. ::::: :. CCDS14 LVLFSLDVKNLENFQLVEGVQEQVNAALLDYTMCNYPQQTEKFGQLLLRLPEIRAISMQA 410 420 430 440 450 460 460 470 pF1KE3 VNVPLEQLPLLFKVVLHSCKTSVGKE CCDS14 EEYLYYKHLNGDVPYNNLLIEMLHAKRA 470 480 490 >>CCDS1401.1 NR5A2 gene_id:2494|Hs108|chr1 (541 aa) initn: 505 init1: 372 opt: 407 Z-score: 378.4 bits: 79.6 E(32554): 9.3e-15 Smith-Waterman score: 623; 27.7% identity (59.5% similar) in 447 aa overlap (48-450:74-510) 20 30 40 50 60 70 pF1KE3 AGFLEPPAALPPPPRNGFCQDELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISC .. : :. .. : .:::...: :::...: CCDS14 KVETEALGLARSHGEQGQMPENMQVSQFKMVNYSYDEDLEELCPVCGDKVSGYHYGLLTC 50 60 70 80 90 100 80 90 100 110 120 130 pF1KE3 EGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPG :.:::::::.. :.. : : ...:: ... ::.:: :::. :::..::. .:.: : : : CCDS14 ESCKGFFKRTVQNNKRYTCIENQNCQIDKTQRKRCPYCRFQKCLSVGMKLEAVRADRMRG 110 120 130 140 150 160 140 150 160 170 180 190 pF1KE3 GRNKSIGPVQISEEEIER----IMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPG :::: .::. .. ... .. .. .. :: .. . . .. .. .. : : CCDS14 GRNK-FGPMYKRDRALKQQKKALIRANGLKLEAMSQVIQAMPSDLTISSAIQNIHSASKG 170 180 190 200 210 220 200 210 220 230 pF1KE3 STLSSRSVELNGF--MAFREQYMGMSVPPH-----YQYIPHLFSYS-------------- :. .. . . : . ..:..::: :: :. : . CCDS14 LPLNHAALPPTDYDRSPFVTSPISMTMPPHGSLQGYQTYGHFPSRAIKSEYPDPYTSSPE 230 240 250 260 270 280 240 250 260 270 280 pF1KE3 ---GHSPLLPQQARSLDPQSYS-LIHQLLSAEDLEPLGTPMLI-----EDGYAVTQAEL- :.: . :. : : : :: .::. : :: .. :.. . .: CCDS14 SIMGYSYMDSYQTSS--PASIPHLILELLKCEPDEPQVQAKIMAYLQQEQANRSKHEKLS 290 300 310 320 330 340 290 300 310 320 330 pF1KE3 -FALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILLSSL---TVYSKQ :.:.:..::. :: . : .. :: ::.. : ::.. :.::..:. . .:..:. CCDS14 TFGLMCKMADQTLFSIVEWARSSIFFRELKVDDQMKLLQNCWSELLILDHIYRQVVHGKE 350 360 370 380 390 400 340 350 360 370 380 390 pF1KE3 --IF---GELADVTAKYSPSDEELHRFSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKA :: :. .: . : . :. . ....:.. .: ..:. ...:..:.: CCDS14 GSIFLVTGQQVDYSIIASQAGATLNNLMSHAQELVAKL-------RSLQFDQREFVCLKF 410 420 430 440 450 400 410 420 430 440 450 pF1KE3 INFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKM . ... :...: . . .: .... :.: .: .: ..: .:.. ::::: :. CCDS14 LVLFSLDVKNLENFQLVEGVQEQVNAALLDYTMCNYPQQTEKFGQLLLRLPEIRAISMQA 460 470 480 490 500 510 460 470 pF1KE3 VNVPLEQLPLLFKVVLHSCKTSVGKE CCDS14 EEYLYYKHLNGDVPYNNLLIEMLHAKRA 520 530 540 >>CCDS41790.1 RARG gene_id:5916|Hs108|chr12 (443 aa) initn: 511 init1: 381 opt: 403 Z-score: 375.9 bits: 78.8 E(32554): 1.3e-14 Smith-Waterman score: 403; 41.4% identity (68.6% similar) in 140 aa overlap (9-147:30-161) 10 20 30 pF1KE3 MERDEPPPSGGGGGGGSAGFLEP-PAALPPPPRNGFCQD .::. .: .: : ::.: .. .. CCDS41 MYDCMETFAPGPRRLYGAAGPGAGLLRRATGGSCFAGLESFAWPQPASLQSVETQSTSSE 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE3 ELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISCEGCKGFFKRSICNKRVYRCSR :.. :.. : . :..:.:...: :::. :::::::::.::: .. :: : : CCDS41 EMV---PSSPSPPPPPRVYKPCFVCNDKSSGYHYGVSSCEGCKGFFRRSIQKNMVYTCHR 70 80 90 100 110 100 110 120 130 140 150 pF1KE3 DKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPGGRNKSIGPVQISEEEIERIMS ::::.... ::::::::: ::...::...:.:.: :::. :. CCDS41 DKNCIINKVTRNRCQYCRLQKCFEVGMSKEAVRND-----RNKKKKEVKEEGSPDSYELS 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE3 GQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPGSTLSSRSVELNGFMAFREQYMGMSV CCDS41 PQLEELITKVSKAHQETFPSLCQLGKYTTNSSADHRVQLDLGLWDKFSELATKCIIKIVE 180 190 200 210 220 230 >>CCDS4768.1 RXRB gene_id:6257|Hs108|chr6 (533 aa) initn: 457 init1: 385 opt: 396 Z-score: 368.4 bits: 77.7 E(32554): 3.3e-14 Smith-Waterman score: 502; 28.7% identity (50.1% similar) in 471 aa overlap (12-468:170-513) 10 20 30 pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAA------LPPPPRNGF :::.: ..::. :::: .: CCDS47 MGSPGLPPPAPPGFSGPVSSPQINSTVSLPGGGSGPPEDVKPPVLGVRGLHCPPPP-GG- 140 150 160 170 180 190 40 50 60 70 80 90 pF1KE3 CQDELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISCEGCKGFFKRSICNKRVYR :: : .: : :::::..: :::. :::::::::::.: . .: CCDS47 ---------PG--------AGKRLCAICGDRSSGKHYGVYSCEGCKGFFKRTIRKDLTYS 200 210 220 230 240 100 110 120 130 140 150 pF1KE3 CSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPG----GRNKSIGPVQISEE : .:.:.....::::::::: ::: ::.:.:..:. . : : ... : . : CCDS47 CRDNKDCTVDKRQRNRCQYCRYQKCLATGMKREAVQEERQRGKDKDGDGEGAGGAP-EEM 250 260 270 280 290 160 170 180 190 200 210 pF1KE3 EIERIMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPGSTLSSRSVELNGFMAFRE ..::. .. :. ::.. : ::.: .: CCDS47 PVDRILEAELAVEQK--------SDQGVEG----------PGGT--------GG------ 300 310 320 220 230 240 250 260 270 pF1KE3 QYMGMSVPPHYQYIPHLFSYSGHSPLLPQQARSLDPQSYSLIHQLLSAEDLEPLGTPMLI :: :: :: CCDS47 --------------------SGSSPN--------DP------------------------ 330 280 290 300 310 320 330 pF1KE3 EDGYAVTQAELFALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILLSS ::. .:. ::. :: . : :..: : : . : . :: . :.:: :..: CCDS47 -----VTN------ICQAADKQLFTLVEWAKRIPHFSSLPLDDQVILLRAGWNEL-LIAS 340 350 360 370 380 340 350 360 370 380 pF1KE3 LTVYSKQIFGELADVTAKYSPSDEELHRFS--DEGM-EVIER-LIYLYHKFHQLKVSNEE .. : .. . .:. . .:: : . :. ...: : : :........ : CCDS47 FSHRSIDVRDGILLATGLH------VHRNSAHSAGVGAIFDRVLTELVSKMRDMRMDKTE 390 400 410 420 430 390 400 410 420 430 440 pF1KE3 YACMKAINFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPEIR .:..:: ..: : .::.. :..: : .. . . . . :: .: .:: :.. :: .: CCDS47 LGCLRAIILFNPDAKGLSNPSEVEVLREKVYASLETYCKQKYPEQQGRFAKLLLRLPALR 440 450 460 470 480 490 450 460 470 pF1KE3 YIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE :. : ::.: ..::.. CCDS47 SIGLKC----LEHL-FFFKLIGDTPIDTFLMEMLEAPHQLA 500 510 520 530 >>CCDS59007.1 RXRB gene_id:6257|Hs108|chr6 (537 aa) initn: 457 init1: 385 opt: 396 Z-score: 368.4 bits: 77.7 E(32554): 3.4e-14 Smith-Waterman score: 497; 28.0% identity (49.7% similar) in 471 aa overlap (12-468:170-517) 10 20 30 pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAA------LPPPPRNGF :::.: ..::. :::: .: CCDS59 MGSPGLPPPAPPGFSGPVSSPQINSTVSLPGGGSGPPEDVKPPVLGVRGLHCPPPP-GG- 140 150 160 170 180 190 40 50 60 70 80 90 pF1KE3 CQDELAELDPGTISVSDDRAEQRTCLICGDRATGLHYGIISCEGCKGFFKRSICNKRVYR :: : .: : :::::..: :::. :::::::::::.: . .: CCDS59 ---------PG--------AGKRLCAICGDRSSGKHYGVYSCEGCKGFFKRTIRKDLTYS 200 210 220 230 240 100 110 120 130 140 150 pF1KE3 CSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPG----GRNKSIGPVQISEE : .:.:.....::::::::: ::: ::.:.:..:. . : : ... : . : CCDS59 CRDNKDCTVDKRQRNRCQYCRYQKCLATGMKREAVQEERQRGKDKDGDGEGAGGAP-EEM 250 260 270 280 290 160 170 180 190 200 210 pF1KE3 EIERIMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPGSTLSSRSVELNGFMAFRE ..::. .. :. ::.. : ::.: .: CCDS59 PVDRILEAELAVEQK--------SDQGVEG----------PGGT--------GG------ 300 310 320 220 230 240 250 260 270 pF1KE3 QYMGMSVPPHYQYIPHLFSYSGHSPLLPQQARSLDPQSYSLIHQLLSAEDLEPLGTPMLI :: :: :: CCDS59 --------------------SGSSPN--------DP------------------------ 330 280 290 300 310 320 330 pF1KE3 EDGYAVTQAELFALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILLSS ::. .:. ::. :: . : :..: : : . : . :: . :.:: :..: CCDS59 -----VTN------ICQAADKQLFTLVEWAKRIPHFSSLPLDDQVILLRAGWNEL-LIAS 340 350 360 370 380 340 350 360 370 380 pF1KE3 LTVYSKQIFGELADVTA----KYSPSDEELHRFSDEGMEVIERLIYLYHKFHQLKVSNEE .. : .. . .:. . : . . . :... . : : :........ : CCDS59 FSHRSIDVRDGILLATGLHVHRNSAHSAGVGAIFDRSLSRV--LTELVSKMRDMRMDKTE 390 400 410 420 430 440 390 400 410 420 430 440 pF1KE3 YACMKAINFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPEIR .:..:: ..: : .::.. :..: : .. . . . . :: .: .:: :.. :: .: CCDS59 LGCLRAIILFNPDAKGLSNPSEVEVLREKVYASLETYCKQKYPEQQGRFAKLLLRLPALR 450 460 470 480 490 500 450 460 470 pF1KE3 YIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE :. : ::.: ..::.. CCDS59 SIGLKC----LEHL-FFFKLIGDTPIDTFLMEMLEAPHQLA 510 520 530 >>CCDS1248.1 RXRG gene_id:6258|Hs108|chr1 (463 aa) initn: 645 init1: 372 opt: 392 Z-score: 365.6 bits: 77.0 E(32554): 4.8e-14 Smith-Waterman score: 524; 27.7% identity (52.6% similar) in 473 aa overlap (7-468:79-443) 10 20 30 pF1KE3 MERDEPPPSGGGGGGGSAGFLEPPAALPPPPRNGFC ::::. .. . ... ::.. . CCDS12 PVSAPRTLSAVGTPLNALGSPYRVITSAMGPPSGALAAPPGINLVAPPSSQLNVVNSVSS 50 60 70 80 90 100 40 50 60 70 80 pF1KE3 QDELAELD--PGT-----ISVSDDRAEQRTCLICGDRATGLHYGIISCEGCKGFFKRSIC .... : :: :.: .. : :::::..: :::. :::::::::::.: CCDS12 SEDIKPLPGLPGIGNMNYPSTSPGSLVKHICAICGDRSSGKHYGVYSCEGCKGFFKRTIR 110 120 130 140 150 160 90 100 110 120 130 140 pF1KE3 NKRVYRCSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRKAIREDGMPGGRNKSIGPVQIS . .: : .:.:.....::::::::: ::: :::.:.:..:. :..: . . CCDS12 KDLIYTCRDNKDCLIDKRQRNRCQYCRYQKCLVMGMKREAVQEE-----RQRS---RERA 170 180 190 200 210 220 150 160 170 180 190 200 pF1KE3 EEEIERIMSGQEFEEEANHWSNHGDSDHSSPGNRASESNQPSPGSTLSSRSVELNGFMAF : : : ::.: . : .: :.. CCDS12 ESEAECATSGHE----------------DMPVERILEAE--------------------- 230 240 210 220 230 240 250 260 pF1KE3 REQYMGMSVPPHYQYIPHLFSYSGHSPLLPQQARSLDPQSYSLIHQLLSAEDLEPLGTPM ..: :. . ::. . .. . :: CCDS12 ------LAVEPKTE------SYGD----MNMENSTNDP---------------------- 250 260 270 280 290 300 310 320 pF1KE3 LIEDGYAVTQAELFALLCRLADELLFRQIAWIKKLPFFCELSIKDYTCLLSSTWQELILL ::. .:. ::. :: . : :..: : .:...: . :: . :.:: :. CCDS12 -------VTN------ICHAADKQLFTLVEWAKRIPHFSDLTLEDQVILLRAGWNEL-LI 270 280 290 300 310 330 340 350 360 370 380 pF1KE3 SSLTVYSKQIFGELADVTAKYSPSDEELHRFS--DEGM-EVIER-LIYLYHKFHQLKVSN .:.. : .. . .:. . .:: : . :. ...: : : :........ CCDS12 ASFSHRSVSVQDGILLATGLH------VHRSSAHSAGVGSIFDRVLTELVSKMKDMQMDK 320 330 340 350 360 390 400 410 420 430 440 pF1KE3 EEYACMKAINFLNQDIRGLTSASQLEQLNKRYWYICQDFTEYKYTHQPNRFPDLMMCLPE : .:..:: ..: : .::.. :..: : .. . . .:. :: .::.:: :.. :: CCDS12 SELGCLRAIVLFNPDAKGLSNPSEVETLREKVYATLEAYTKQKYPEQPGRFAKLLLRLPA 370 380 390 400 410 420 450 460 470 pF1KE3 IRYIAGKMVNVPLEQLPLLFKVVLHSCKTSVGKE .: :. : ::.: ..::.. CCDS12 LRSIGLKC----LEHL-FFFKLIGDTPIDTFLMEMLETPLQIT 430 440 450 460 479 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 17:15:02 2016 done: Mon Nov 7 17:15:03 2016 Total Scan time: 3.010 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]