FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6769, 534 aa 1>>>pF1KE6769 534 - 534 aa - 534 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8022+/-0.00104; mu= 15.0685+/- 0.062 mean_var=62.9713+/-12.453, 0's: 0 Z-trim(103.0): 47 B-trim: 2 in 1/50 Lambda= 0.161623 statistics sampled from 7166 (7210) to 7166 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.577), E-opt: 0.2 (0.221), width: 16 Scan time: 2.310 The best scores are: opt bits E(32554) CCDS33405.1 UGT1A4 gene_id:54657|Hs108|chr2 ( 534) 3596 847.6 0 CCDS2509.1 UGT1A3 gene_id:54659|Hs108|chr2 ( 534) 3359 792.3 0 CCDS33404.1 UGT1A5 gene_id:54579|Hs108|chr2 ( 534) 3357 791.9 0 CCDS2510.1 UGT1A1 gene_id:54658|Hs108|chr2 ( 533) 2605 616.5 2.4e-176 CCDS2505.1 UGT1A9 gene_id:54600|Hs108|chr2 ( 530) 2402 569.2 4.2e-162 CCDS33402.1 UGT1A8 gene_id:54576|Hs108|chr2 ( 530) 2396 567.8 1.1e-161 CCDS2507.1 UGT1A6 gene_id:54578|Hs108|chr2 ( 532) 2379 563.8 1.7e-160 CCDS2506.1 UGT1A7 gene_id:54577|Hs108|chr2 ( 530) 2343 555.4 5.8e-158 CCDS33403.1 UGT1A10 gene_id:54575|Hs108|chr2 ( 530) 2339 554.5 1.1e-157 CCDS2508.1 UGT1A6 gene_id:54578|Hs108|chr2 ( 265) 1751 417.3 1.1e-116 CCDS3523.1 UGT2B17 gene_id:7367|Hs108|chr4 ( 530) 1509 361.0 2e-99 CCDS43234.1 UGT2B4 gene_id:7363|Hs108|chr4 ( 528) 1493 357.2 2.6e-98 CCDS75136.1 UGT2B10 gene_id:7365|Hs108|chr4 ( 528) 1485 355.4 9.6e-98 CCDS56331.1 UGT2A2 gene_id:574537|Hs108|chr4 ( 536) 1477 353.5 3.6e-97 CCDS3529.1 UGT2A1 gene_id:10941|Hs108|chr4 ( 527) 1466 350.9 2.1e-96 CCDS3524.1 UGT2B15 gene_id:7366|Hs108|chr4 ( 530) 1466 350.9 2.1e-96 CCDS3526.1 UGT2B7 gene_id:7364|Hs108|chr4 ( 529) 1428 342.1 9.7e-94 CCDS3527.1 UGT2B11 gene_id:10720|Hs108|chr4 ( 529) 1427 341.9 1.1e-93 CCDS3528.1 UGT2B28 gene_id:54490|Hs108|chr4 ( 529) 1412 338.4 1.3e-92 CCDS3525.1 UGT2A3 gene_id:79799|Hs108|chr4 ( 527) 1411 338.1 1.5e-92 CCDS75135.1 UGT2B10 gene_id:7365|Hs108|chr4 ( 444) 1159 279.3 6.2e-75 CCDS58901.1 UGT2A1 gene_id:10941|Hs108|chr4 ( 527) 1099 265.4 1.2e-70 CCDS3705.1 UGT8 gene_id:7368|Hs108|chr4 ( 541) 993 240.7 3.4e-63 CCDS77925.1 UGT2A1 gene_id:10941|Hs108|chr4 ( 483) 858 209.2 9.1e-54 CCDS77924.1 UGT2A2 gene_id:574537|Hs108|chr4 ( 492) 858 209.2 9.2e-54 CCDS58902.1 UGT2A1 gene_id:10941|Hs108|chr4 ( 693) 858 209.2 1.3e-53 CCDS3914.1 UGT3A2 gene_id:167127|Hs108|chr5 ( 523) 841 205.2 1.5e-52 CCDS3913.1 UGT3A1 gene_id:133688|Hs108|chr5 ( 523) 811 198.2 1.9e-50 CCDS54842.1 UGT3A2 gene_id:167127|Hs108|chr5 ( 489) 802 196.1 7.8e-50 CCDS75137.1 UGT2B4 gene_id:7363|Hs108|chr4 ( 369) 784 191.9 1.1e-48 CCDS82930.1 UGT2B7 gene_id:7364|Hs108|chr4 ( 369) 720 177.0 3.4e-44 CCDS56330.1 UGT2B28 gene_id:54490|Hs108|chr4 ( 335) 618 153.2 4.5e-37 CCDS54841.1 UGT3A1 gene_id:133688|Hs108|chr5 ( 252) 270 72.0 9.2e-13 >>CCDS33405.1 UGT1A4 gene_id:54657|Hs108|chr2 (534 aa) initn: 3596 init1: 3596 opt: 3596 Z-score: 4528.4 bits: 847.6 E(32554): 0 Smith-Waterman score: 3596; 100.0% identity (100.0% similar) in 534 aa overlap (1-534:1-534) 10 20 30 40 50 60 pF1KE6 MARGLQVPLPRLATGLLLLLSVQPWAESGKVLVVPTDGSPWLSMREALRELHARGHQAVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MARGLQVPLPRLATGLLLLLSVQPWAESGKVLVVPTDGSPWLSMREALRELHARGHQAVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 LTPEVNMHIKEEKFFTLTAYAVPWTQKEFDRVTLGYTQGFFETEHLLKRYSRSMAIMNNV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LTPEVNMHIKEEKFFTLTAYAVPWTQKEFDRVTLGYTQGFFETEHLLKRYSRSMAIMNNV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 SLALHRCCVELLHNEALIRHLNATSFDVVLTDPVNLCGAVLAKYLSIPAVFFWRYIPCDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SLALHRCCVELLHNEALIRHLNATSFDVVLTDPVNLCGAVLAKYLSIPAVFFWRYIPCDL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 DFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLASELFQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 DFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLASELFQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 REVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 REVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASGE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 HGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 HGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQND 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 LLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMT 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE6 SEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAH 430 440 450 460 470 480 490 500 510 520 530 pF1KE6 DLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 DLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH 490 500 510 520 530 >>CCDS2509.1 UGT1A3 gene_id:54659|Hs108|chr2 (534 aa) initn: 3359 init1: 3359 opt: 3359 Z-score: 4229.8 bits: 792.3 E(32554): 0 Smith-Waterman score: 3359; 93.3% identity (97.4% similar) in 534 aa overlap (1-534:1-534) 10 20 30 40 50 60 pF1KE6 MARGLQVPLPRLATGLLLLLSVQPWAESGKVLVVPTDGSPWLSMREALRELHARGHQAVV :: ::::::: :::::::::::::::::::::::: ::: ::::::.::::::::::::: CCDS25 MATGLQVPLPWLATGLLLLLSVQPWAESGKVLVVPIDGSHWLSMREVLRELHARGHQAVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 LTPEVNMHIKEEKFFTLTAYAVPWTQKEFDRVTLGYTQGFFETEHLLKRYSRSMAIMNNV ::::::::::::.:::::.::. ::: :::: .::.:: .:::::.::.. ::::..::. CCDS25 LTPEVNMHIKEENFFTLTTYAISWTQDEFDRHVLGHTQLYFETEHFLKKFFRSMAMLNNM 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 SLALHRCCVELLHNEALIRHLNATSFDVVLTDPVNLCGAVLAKYLSIPAVFFWRYIPCDL ::. :: ::::::::::::::::::::::::::::::.::::::::::.::: : ::::: CCDS25 SLVYHRSCVELLHNEALIRHLNATSFDVVLTDPVNLCAAVLAKYLSIPTVFFLRNIPCDL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 DFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLASELFQ :::::::::::::::.:::::::::::.:::::::::::::::::.:::::::::::::: CCDS25 DFKGTQCPNPSSYIPRLLTTNSDHMTFMQRVKNMLYPLALSYICHAFSAPYASLASELFQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 REVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASGE :::::::..:.:::::::::::::::::::::::::::::::: :::::::::::::::: CCDS25 REVSVVDILSHASVWLFRGDFVMDYPRPIMPNMVFIGGINCANRKPLSQEFEAYINASGE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 HGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 HGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQND 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 LLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 LLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMT 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE6 SEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 SEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAH 430 440 450 460 470 480 490 500 510 520 530 pF1KE6 DLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 DLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH 490 500 510 520 530 >>CCDS33404.1 UGT1A5 gene_id:54579|Hs108|chr2 (534 aa) initn: 3357 init1: 3357 opt: 3357 Z-score: 4227.2 bits: 791.9 E(32554): 0 Smith-Waterman score: 3357; 93.4% identity (97.4% similar) in 534 aa overlap (1-534:1-534) 10 20 30 40 50 60 pF1KE6 MARGLQVPLPRLATGLLLLLSVQPWAESGKVLVVPTDGSPWLSMREALRELHARGHQAVV :: :::::::.:::::::::::::::::::::::::::: :::::::::.:::::::.:: CCDS33 MATGLQVPLPQLATGLLLLLSVQPWAESGKVLVVPTDGSHWLSMREALRDLHARGHQVVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 LTPEVNMHIKEEKFFTLTAYAVPWTQKEFDRVTLGYTQGFFETEHLLKRYSRSMAIMNNV :: ::::.::::.:::::.::. ::: ::::. ::.::.:::::::: ..:: ::::::. CCDS33 LTLEVNMYIKEENFFTLTTYAISWTQDEFDRLLLGHTQSFFETEHLLMKFSRRMAIMNNM 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 SLALHRCCVELLHNEALIRHLNATSFDVVLTDPVNLCGAVLAKYLSIPAVFFWRYIPCDL :: .:: ::::::::::::::.::::::::::: .::.:::::::::::::: : ::::: CCDS33 SLIIHRSCVELLHNEALIRHLHATSFDVVLTDPFHLCAAVLAKYLSIPAVFFLRNIPCDL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 DFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLASELFQ :::::::::::::::.::::::::::::::::::::::::::.::. ::::::::::::: CCDS33 DFKGTQCPNPSSYIPRLLTTNSDHMTFLQRVKNMLYPLALSYLCHAVSAPYASLASELFQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 REVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASGE ::::::::::.::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 REVSVVDLVSHASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASGE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 HGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 HGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQND 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 LLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMT 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE6 SEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAH 430 440 450 460 470 480 490 500 510 520 530 pF1KE6 DLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 DLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH 490 500 510 520 530 >>CCDS2510.1 UGT1A1 gene_id:54658|Hs108|chr2 (533 aa) initn: 2605 init1: 2605 opt: 2605 Z-score: 3279.6 bits: 616.5 E(32554): 2.4e-176 Smith-Waterman score: 2605; 73.0% identity (88.5% similar) in 523 aa overlap (12-534:11-533) 10 20 30 40 50 60 pF1KE6 MARGLQVPLPRLATGLLLLLSVQPWAESGKVLVVPTDGSPWLSMREALRELHARGHQAVV :. :::: . ...::.:..:.::: :::: :...:. :::. :: CCDS25 MAVESQGGRPLVLGLLLCVLGPVVSHAGKILLIPVDGSHWLSMLGAIQQLQQRGHEIVV 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 LTPEVNMHIKEEKFFTLTAYAVPWTQKEFDRVTLGYTQGFFETEHLLKRYSRSMAIMNNV :.:.....:.. :.:: .: ::. ... . .. .. ::.. .:.: ... ... CCDS25 LAPDASLYIRDGAFYTLKTYPVPFQREDVKESFVSLGHNVFENDSFLQRVIKTYKKIKKD 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 SLALHRCCVELLHNEALIRHLNATSFDVVLTDPVNLCGAVLAKYLSIPAVFFWRYIPCDL : : : .::::. :. : .::::.:::: :. ..:.:::.:.::: . .::.: CCDS25 SAMLLSGCSHLLHNKELMASLAESSFDVMLTDPFLPCSPIVAQYLSLPTVFFLHALPCSL 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE6 DFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLASELFQ .:..:::::: ::.:. :...:::::::::::::: .. ...: . .:::.::::..: CCDS25 EFEATQCPNPFSYVPRPLSSHSDHMTFLQRVKNMLIAFSQNFLCDVVYSPYATLASEFLQ 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE6 REVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASGE :::.: ::.: :::::::.::: ::::::::::::.::::: . .::::::::::::::: CCDS25 REVTVQDLLSSASVWLFRSDFVKDYPRPIMPNMVFVGGINCLHQNPLSQEFEAYINASGE 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE6 HGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 HGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQND 300 310 320 330 340 350 370 380 390 400 410 420 pF1KE6 LLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 LLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMT 360 370 380 390 400 410 430 440 450 460 470 480 pF1KE6 SEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 SEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAH 420 430 440 450 460 470 490 500 510 520 530 pF1KE6 DLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 DLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH 480 490 500 510 520 530 >>CCDS2505.1 UGT1A9 gene_id:54600|Hs108|chr2 (530 aa) initn: 2386 init1: 2209 opt: 2402 Z-score: 3023.8 bits: 569.2 E(32554): 4.2e-162 Smith-Waterman score: 2402; 68.2% identity (83.8% similar) in 531 aa overlap (4-534:5-530) 10 20 30 40 50 pF1KE6 MARGLQVPLPRLATGLLLLLSVQPWAESGKVLVVPTDGSPWLSMREALRELHARGHQAV : ::: : . ::: . .::.::.:::: ::: :..:: ....: :::..: CCDS25 MACTGWTSPLP-LCVCLLLTCG---FAEAGKLLVVPMDGSHWFTMRSVVEKLILRGHEVV 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 VLTPEVNMHIKEEKFFTLTAYAVPWTQKEFDRVTLGYTQGFFETEHLLKRYSRSMAIMNN :. :::. .. . :. .:.. .: ...:: ..... .... . . :: :. .:. CCDS25 VVMPEVSWQLGRSLNCTVKTYSTSYTLEDLDREFKAFAHAQWKAQ-VRSIYSLLMGSYND 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 VSLALHRCCVELLHNEALIRHLNATSFDVVLTDPVNLCGAVLAKYLSIPAVFFWRYIPCD . . : :.... :...:. .:::.:. :: . :: ..:::.:.:.: : : : : CCDS25 IFDLFFSNCRSLFKDKKLVEYLKESSFDAVFLDPFDNCGLIVAKYFSLPSVVFARGILCH 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 LDFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLASELF .:.::: : ::.:..: :: ::: .::.: .. : .:: : .:::.. CCDS25 YLEEGAQCPAPLSYVPRILLGFSDAMTFKERVRNHIMHLEEHLLCHRFFKNALEIASEIL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 QREVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASG : :. :: :..:.::.: :::.:::.:.::::.::::::: .:::: .:::::::::: CCDS25 QTPVTEYDLYSHTSIWLLRTDFVLDYPKPVMPNMIFIGGINCHQGKPLPMEFEAYINASG 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE6 EHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 EHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQN 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE6 DLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 DLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEM 360 370 380 390 400 410 420 430 440 450 460 470 pF1KE6 TSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 TSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAA 420 430 440 450 460 470 480 490 500 510 520 530 pF1KE6 HDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 HDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH 480 490 500 510 520 530 >>CCDS33402.1 UGT1A8 gene_id:54576|Hs108|chr2 (530 aa) initn: 2386 init1: 2213 opt: 2396 Z-score: 3016.3 bits: 567.8 E(32554): 1.1e-161 Smith-Waterman score: 2396; 67.5% identity (83.9% similar) in 535 aa overlap (1-534:1-530) 10 20 30 40 50 pF1KE6 MAR-GLQVPLPRLATGLLLLLSVQPWAESGKVLVVPTDGSPWLSMREALRELHARGHQAV ::: : :.: : ..::: . .::.::.:::: ::: :..:. ....: :::..: CCDS33 MARTGWTSPIP-LCVSLLLTCG---FAEAGKLLVVPMDGSHWFTMQSVVEKLILRGHEVV 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 VLTPEVNMHIKEEKFFTLTAYAVPWTQKEFDRVTLGYTQGFFETEHLLKRYSRSMAIMNN :. :::. .. . :. .:.. .: ...:: . .... .... . . .: .. :. CCDS33 VVMPEVSWQLGKSLNCTVKTYSTSYTLEDLDREFMDFADAQWKAQ-VRSLFSLFLSSSNG 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 VSLALHRCCVELLHNEALIRHLNATSFDVVLTDPVNLCGAVLAKYLSIPAVFFWRYIPCD . : :.... :...:. .:::.:. :: . :: ..:::.:.:.: : : : : CCDS33 FFNLFFSHCRSLFNDRKLVEYLKESSFDAVFLDPFDACGLIVAKYFSLPSVVFARGIACH 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 LDFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLASELF .:.::: : ::.:..: :: ::: .::.: .. : .:. :: .:::.. CCDS33 YLEEGAQCPAPLSYVPRILLGFSDAMTFKERVRNHIMHLEEHLFCQYFSKNALEIASEIL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 QREVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASG : :.. :: :..:.::.: :::.:::.:.::::.::::::: .:::: .:::::::::: CCDS33 QTPVTAYDLYSHTSIWLLRTDFVLDYPKPVMPNMIFIGGINCHQGKPLPMEFEAYINASG 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE6 EHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 EHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQN 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE6 DLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 DLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEM 360 370 380 390 400 410 420 430 440 450 460 470 pF1KE6 TSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 TSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAA 420 430 440 450 460 470 480 490 500 510 520 530 pF1KE6 HDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 HDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH 480 490 500 510 520 530 >>CCDS2507.1 UGT1A6 gene_id:54578|Hs108|chr2 (532 aa) initn: 2363 init1: 2363 opt: 2379 Z-score: 2994.8 bits: 563.8 E(32554): 1.7e-160 Smith-Waterman score: 2379; 68.2% identity (84.2% similar) in 525 aa overlap (11-534:10-532) 10 20 30 40 50 60 pF1KE6 MARGLQVPLPRLATGLLLLLSVQPWAESGKVLVVPTDGSPWLSMREALRELHARGHQAVV :...:...: .. . . :.:::: ::: ::::.. .. : :::. :: CCDS25 MACLLRSFQRISAGVFFL-ALWGMVVGDKLLVVPQDGSHWLSMKDIVEVLSDRGHEIVV 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 LTPEVNMHIKEEKFFTLTAYAVPWTQKEFDRVTLGYTQGFFETEHLLKRYSRSMAIMNNV ..::::. .:: :..: : ::. :.:. .. .. : . .: .. : . CCDS25 VVPEVNLLLKESKYYTRKIYPVPYDQEELKNRYQSFGNNHFAERSFLTA-PQTEYRNNMI 60 70 80 90 100 110 130 140 150 160 170 pF1KE6 SLALHRC-CVELLHNEALIRHLNATSFDVVLTDPVNLCGAVLAKYLSIPAVFFWRYIPCD ..:. : ::... . .. ..::...:::. ::..::.::..:.:...: .::. CCDS25 VIGLYFINCQSLLQDRDTLNFFKESKFDALFTDPALPCGVILAEYLGLPSVYLFRGFPCS 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 LDFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLASELF :. .. :.: ::::. : :::::: ::: :.: : :. . . . : ::: .. CCDS25 LEHTFSRSPDPVSYIPRCYTKFSDHMTFSQRVANFLVNLLEPYLFYCLFSKYEELASAVL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 QREVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASG .:.:... : . .::::.: :::..::::.:::::::::::: . : ::::::::::::: CCDS25 KRDVDIITLYQKVSVWLLRYDFVLEYPRPVMPNMVFIGGINCKKRKDLSQEFEAYINASG 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE6 EHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 EHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQN 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE6 DLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 DLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEM 360 370 380 390 400 410 420 430 440 450 460 470 pF1KE6 TSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 TSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAA 420 430 440 450 460 470 480 490 500 510 520 530 pF1KE6 HDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 HDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH 480 490 500 510 520 530 >>CCDS2506.1 UGT1A7 gene_id:54577|Hs108|chr2 (530 aa) initn: 2342 init1: 2181 opt: 2343 Z-score: 2949.5 bits: 555.4 E(32554): 5.8e-158 Smith-Waterman score: 2343; 66.9% identity (82.4% similar) in 534 aa overlap (1-534:1-530) 10 20 30 40 50 60 pF1KE6 MARGLQVPLPRLATGLLLLLSVQPWAESGKVLVVPTDGSPWLSMREALRELHARGHQAVV :::. . : : . ::: . .:..::.:::: ::: :..:. ....: :::..:: CCDS25 MARAGWTGLLPLYVCLLLTCG---FAKAGKLLVVPMDGSHWFTMQSVVEKLILRGHEVVV 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 LTPEVNMHIKEEKFFTLTAYAVPWTQKEFDRVTLGYTQGFFETEHLLKRYSRSMAIMNNV . :::. .. . :. .:.. .: .. :: . .... . : : . .: . :.. CCDS25 VMPEVSWQLGRSLNCTVKTYSTSYTLEDQDREFMVFADARW-TAPLRSAFSLLTSSSNGI 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 SLALHRCCVELLHNEALIRHLNATSFDVVLTDPVNLCGAVLAKYLSIPAVFFWRYIPCDL . : :.... :...:. . ::.:. :: . :: ..:::.:.:.: : : : : CCDS25 FDLFFSNCRSLFNDRKLVEYLKESCFDAVFLDPFDACGLIVAKYFSLPSVVFARGIFCHY 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE6 DFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLASELFQ .:.::: : ::.:.:: :: ::: .:: : .. : .: : .:::..: CCDS25 LEEGAQCPAPLSYVPRLLLGFSDAMTFKERVWNHIMHLEEHLFCPYFFKNVLEIASEILQ 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE6 REVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASGE :.. :: :..:.::.: :::..::.:.::::.::::::: .:::. .::::::::::: CCDS25 TPVTAYDLYSHTSIWLLRTDFVLEYPKPVMPNMIFIGGINCHQGKPVPMEFEAYINASGE 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE6 HGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 HGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQND 300 310 320 330 340 350 370 380 390 400 410 420 pF1KE6 LLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 LLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEMT 360 370 380 390 400 410 430 440 450 460 470 480 pF1KE6 SEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 SEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAAH 420 430 440 450 460 470 490 500 510 520 530 pF1KE6 DLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 DLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH 480 490 500 510 520 530 >>CCDS33403.1 UGT1A10 gene_id:54575|Hs108|chr2 (530 aa) initn: 2339 init1: 2176 opt: 2339 Z-score: 2944.4 bits: 554.5 E(32554): 1.1e-157 Smith-Waterman score: 2339; 66.9% identity (82.8% similar) in 535 aa overlap (1-534:1-530) 10 20 30 40 50 pF1KE6 MAR-GLQVPLPRLATGLLLLLSVQPWAESGKVLVVPTDGSPWLSMREALRELHARGHQAV ::: : :.: : . ::: . .::.::.:::: ::: :..:. ....: :::..: CCDS33 MARAGWTSPVP-LCVCLLLTCG---FAEAGKLLVVPMDGSHWFTMQSVVEKLILRGHEVV 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 VLTPEVNMHIKEEKFFTLTAYAVPWTQKEFDRVTLGYTQGFFETEHLLKRYSRSMAIMNN :. :::. .... :. .:.. .: .. .: . .... .... . .: :. .. CCDS33 VVMPEVSWQLERSLNCTVKTYSTSYTLEDQNREFMVFAHAQWKAQAQ-SIFSLLMSSSSG 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 VSLALHRCCVELLHNEALIRHLNATSFDVVLTDPVNLCGAVLAKYLSIPAVFFWRYIPCD . : :.... :...:. .:::.:. :: . :: ..:::.:.:.: : : : : CCDS33 FLDLFFSHCRSLFNDRKLVEYLKESSFDAVFLDPFDTCGLIVAKYFSLPSVVFTRGIFCH 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 LDFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLASELF .:.::: : ::.:. : :: ::: .:: : . : .:. . .:::.. CCDS33 HLEEGAQCPAPLSYVPNDLLGFSDAMTFKERVWNHIVHLEDHLFCQYLFRNALEIASEIL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 QREVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASG : :.. :: :..:.::.: :::.:::.:.::::.::::::: .:::: .:::::::::: CCDS33 QTPVTAYDLYSHTSIWLLRTDFVLDYPKPVMPNMIFIGGINCHQGKPLPMEFEAYINASG 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE6 EHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 EHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQN 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE6 DLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 DLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEM 360 370 380 390 400 410 420 430 440 450 460 470 pF1KE6 TSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 TSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAA 420 430 440 450 460 470 480 490 500 510 520 530 pF1KE6 HDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 HDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH 480 490 500 510 520 530 >>CCDS2508.1 UGT1A6 gene_id:54578|Hs108|chr2 (265 aa) initn: 1751 init1: 1751 opt: 1751 Z-score: 2208.5 bits: 417.3 E(32554): 1.1e-116 Smith-Waterman score: 1751; 98.5% identity (98.9% similar) in 265 aa overlap (270-534:1-265) 240 250 260 270 280 290 pF1KE6 QREVSVVDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFEAYINASG :::::::::::: . : ::::::::::::: CCDS25 MPNMVFIGGINCKKRKDLSQEFEAYINASG 10 20 30 300 310 320 330 340 350 pF1KE6 EHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 EHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVLWRYTGTRPSNLANNTILVKWLPQN 40 50 60 70 80 90 360 370 380 390 400 410 pF1KE6 DLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 DLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGVTLNVLEM 100 110 120 130 140 150 420 430 440 450 460 470 pF1KE6 TSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 TSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGAPHLRPAA 160 170 180 190 200 210 480 490 500 510 520 530 pF1KE6 HDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 HDLTWYQYHSLDVIGFLLAVVLTVAFITFKCCAYGYRKCLGKKGRVKKAHKSKTH 220 230 240 250 260 534 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 16:11:53 2016 done: Tue Nov 8 16:11:54 2016 Total Scan time: 2.310 Total Display time: 0.090 Function used was FASTA [36.3.4 Apr, 2011]