FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4140, 557 aa 1>>>pF1KE4140 557 - 557 aa - 557 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.7313+/-0.00108; mu= 5.6387+/- 0.065 mean_var=268.4494+/-54.819, 0's: 0 Z-trim(112.8): 39 B-trim: 0 in 0/51 Lambda= 0.078279 statistics sampled from 13485 (13516) to 13485 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.73), E-opt: 0.2 (0.415), width: 16 Scan time: 3.720 The best scores are: opt bits E(32554) CCDS10244.1 ARIH1 gene_id:25820|Hs108|chr15 ( 557) 4005 465.8 6.2e-131 CCDS2780.1 ARIH2 gene_id:10425|Hs108|chr3 ( 493) 1135 141.6 2.1e-33 CCDS4890.1 CUL9 gene_id:23113|Hs108|chr6 (2517) 517 72.5 6.6e-12 >>CCDS10244.1 ARIH1 gene_id:25820|Hs108|chr15 (557 aa) initn: 4005 init1: 4005 opt: 4005 Z-score: 2464.1 bits: 465.8 E(32554): 6.2e-131 Smith-Waterman score: 4005; 100.0% identity (100.0% similar) in 557 aa overlap (1-557:1-557) 10 20 30 40 50 60 pF1KE4 MDSDEGYNYEFDEDEECSEEDSGAEEEEDEDDDEPDDDTLDLGEVELVEPGLGVGGERDG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MDSDEGYNYEFDEDEECSEEDSGAEEEEDEDDDEPDDDTLDLGEVELVEPGLGVGGERDG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LLCGETGGGGGSALGPGGGGGGGGGGGGGGPGHEQEEDYRYEVLTAEQILQHMVECIREV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LLCGETGGGGGSALGPGGGGGGGGGGGGGGPGHEQEEDYRYEVLTAEQILQHMVECIREV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 NEVIQNPATITRILLSHFNWDKEKLMERYFDGNLEKLFAECHVINPSKKSRTRQMNTRSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 NEVIQNPATITRILLSHFNWDKEKLMERYFDGNLEKLFAECHVINPSKKSRTRQMNTRSS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 AQDMPCQICYLNYPNSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDILV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AQDMPCQICYLNYPNSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDILV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 DDNTVMRLITDSKVKLKYQHLITNSFVECNRLLKWCPAPDCHHVVKVQYPDAKPVRCKCG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 DDNTVMRLITDSKVKLKYQHLITNSFVECNRLLKWCPAPDCHHVVKVQYPDAKPVRCKCG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 RQFCFNCGENWHDPVKCKWLKKWIKKCDDDSETSNWIAANTKECPKCHVTIEKDGGCNHM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 RQFCFNCGENWHDPVKCKWLKKWIKKCDDDSETSNWIAANTKECPKCHVTIEKDGGCNHM 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 VCRNQNCKAEFCWVCLGPWEPHGSAWYNCNRYNEDDAKAARDAQERSRAALQRYLFYCNR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VCRNQNCKAEFCWVCLGPWEPHGSAWYNCNRYNEDDAKAARDAQERSRAALQRYLFYCNR 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 YMNHMQSLRFEHKLYAQVKQKMEEMQQHNMSWIEVQFLKKAVDVLCQCRATLMYTYVFAF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 YMNHMQSLRFEHKLYAQVKQKMEEMQQHNMSWIEVQFLKKAVDVLCQCRATLMYTYVFAF 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE4 YLKKNNQSIIFENNQADLENATEVLSGYLERDISQDSLQDIKQKVQDKYRYCESRRRVLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 YLKKNNQSIIFENNQADLENATEVLSGYLERDISQDSLQDIKQKVQDKYRYCESRRRVLL 490 500 510 520 530 540 550 pF1KE4 QHVHEGYEKDLWEYIED ::::::::::::::::: CCDS10 QHVHEGYEKDLWEYIED 550 >>CCDS2780.1 ARIH2 gene_id:10425|Hs108|chr3 (493 aa) initn: 950 init1: 319 opt: 1135 Z-score: 713.1 bits: 141.6 E(32554): 2.1e-33 Smith-Waterman score: 1142; 33.0% identity (64.4% similar) in 537 aa overlap (12-545:11-492) 10 20 30 40 50 60 pF1KE4 MDSDEGYNYEFDEDEECSEEDSGAEEEEDEDDDEPDDDTLDLGEVELVEPGLGVGGERDG : .:: . : . ::::.:..:.: :..: :.. :..: CCDS27 MSVDMNSQGSDSNEE--DYDPNCEEEEEEEEDDP-------GDIEDYYVGVASDVEQQG 10 20 30 40 50 70 80 90 100 110 120 pF1KE4 LLCGETGGGGGSALGPGGGGGGGGGGGGGGPGHEQEEDYRYEVLTAEQILQHMVECIREV ..:. : :.:.. :: .. . : . . CCDS27 ----------ADAFDP--------------------EEYQFTCLTYKESEGALNEHMTSL 60 70 80 130 140 150 160 170 pF1KE4 NEVIQNPATITRILLSHFNWDKEKLMERYFDGNLEKLFAECHVI-NPSKKSRTRQMNTRS :.. ......: .:.:. ....:: .: .:..: .: :::: .. . CCDS27 ASVLKVSHSVAKLILVNFHWQVSEILDRY-KSNSAQLLVEARVQPNPSK-------HVPT 90 100 110 120 130 180 190 200 210 220 230 pF1KE4 SAQDMPCQICYLNYPNSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDIL : : .:. . . .: : :.:: .:: .. .. ....:.: .:: :. : . CCDS27 SHPPHHCAVCMQFVRKENLLSLACQHQFCRSCWEQHCSV-LVKDGVGVGVSCMAQDCPLR 140 150 160 170 180 190 240 250 260 270 280 290 pF1KE4 VDDNTVMRLITDSKVKLKYQHLITNSFVECNRLLKWCPAPDCHHVVKVQYPDAKPVRC-K . .. :. :. . ... ::.. . ..:: . :. ::. :: :..:: : :. :.: . CCDS27 TPEDFVFPLLPNEELREKYRRYLFRDYVESHYQLQLCPGADCPMVIRVQEPRARRVQCNR 200 210 220 230 240 250 300 310 320 330 340 350 pF1KE4 CGRQFCFNCGENWHDPVKCKWLKKWIKKCDDDSETSNWIAANTKECPKCHVTIEKDGGCN :.. :::.: . .: :. : ..::. :: :::::.:.:.:.::.::::.. :::.:::: CCDS27 CNEVFCFKCRQMYHAPTDCATIRKWLTKCADDSETANYISAHTKDCPKCNICIEKNGGCN 260 270 280 290 300 310 360 370 380 390 400 410 pF1KE4 HMVCRNQNCKAEFCWVCLGPWEPHGSAWYNCNRYNEDDAKAARDAQERSRAALQRYLFYC :: : ..:: .:::.::: :. ::: .:.:.::.:. . .. : ..: ::..:::: CCDS27 HMQC--SKCKHDFCWMCLGDWKTHGSEYYECSRYKENPDIVNQSQQAQAREALKKYLFYF 320 330 340 350 360 420 430 440 450 460 470 pF1KE4 NRYMNHMQSLRFEHKLYAQVKQKMEEMQQHNM-SWIEVQFLKKAVDVLCQCRATLMYTYV .:. :: .::..: . : ....:..: ..:. .::. :.:..:. .: .:: ::.::: CCDS27 ERWENHNKSLQLEAQTYQRIHEKIQERVMNNLGTWIDWQYLQNAAKLLAKCRYTLQYTYP 370 380 390 400 410 420 480 490 500 510 520 530 pF1KE4 FAFYLKKNNQSIIFENNQADLENATEVLSGYLERDISQDSLQDIKQKVQDKYRYCESRRR .:.:.... .. .:: .::.:: : :: .:: : : . ...... :.::: CCDS27 YAYYMESGPRKKLFEYQQAQLEAEIENLSWKVERADSYD-----RGDLENQMHIAEQRRR 430 440 450 460 470 480 540 550 pF1KE4 VLLQHVHEGYEKDLWEYIED .::. :. CCDS27 TLLKDFHDT 490 >>CCDS4890.1 CUL9 gene_id:23113|Hs108|chr6 (2517 aa) initn: 279 init1: 106 opt: 517 Z-score: 327.2 bits: 72.5 E(32554): 6.6e-12 Smith-Waterman score: 578; 26.0% identity (54.0% similar) in 454 aa overlap (104-545:1997-2417) 80 90 100 110 120 130 pF1KE4 LGPGGGGGGGGGGGGGGPGHEQEEDYRYEVLTAEQILQHMVECIREVNEVIQNPATITRI .. ... : . .:.:.:... ... CCDS48 PFCGSQSETSKPSPEAVATLASLQLPAGRTMSPQEVEGLMKQTVRQVQETLNLEPDVAQH 1970 1980 1990 2000 2010 2020 140 150 160 170 180 190 pF1KE4 LLSHFNWDKEKLMERYFDGNLEKLFAECHVINPSKKSRTRQMNTRSSAQDMPCQICYLNY ::.: .: :.:.. : . :.: .. .. .: . : .: CCDS48 LLAHSHWGAEQLLQSYSEDPEPLLLAAGLCVHQAQAVPVRPDH---------CPVCVSPL 2030 2040 2050 2060 2070 200 210 220 230 240 250 pF1KE4 P-NSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDILVDDNTVMRLITDS .. . .: : : : .::.:::::.: :... . .:: : . .... CCDS48 GCDDDLPSLCCMHYCCKSCWNEYLTTRI-EQNLVLNCTCPIADCPAQPTGAFIRAIVSSP 2080 2090 2100 2110 2120 2130 260 270 280 290 300 310 pF1KE4 KVKLKYQHLITNSFVECNRLLKWCPAPD-CHHVVKVQYPDAKPVRCKCGRQFCFNCG-EN .: ::.. . ..:: : :: :. : ... : . ::: ::::. . CCDS48 EVISKYEKALLRGYVESCSNLTWCTNPQGCDRILCRQGLGCGTTCSKCGWASCFNCSFPE 2140 2150 2160 2170 2180 2190 320 330 340 350 360 pF1KE4 WHDPVKCKWLKKWIKKCDD---------DSETSNWIAANTKECPKCHVTIEKDGGCNHMV : :..: ...:. :: ...... .:.::.:.. :::. :: ::. CCDS48 AHYPASCGHMSQWV---DDGGYYDGMSVEAQSKHLAKLISKRCPSCQAPIEKNEGCLHMT 2200 2210 2220 2230 2240 2250 370 380 390 400 410 420 pF1KE4 CRNQNCKAEFCWVCLGPWEPHGSAWYNCNRYNEDDAKAARDAQERSRAALQRYLFYCNRY : .:. ::: :: :.:. . .:::. . .:::: ::. :. : .: CCDS48 C--AKCNHGFCWRCLKSWKPNHKDYYNCSAMV---SKAAR--QEK------RFQDYNERC 2260 2270 2280 2290 2300 430 440 450 460 470 480 pF1KE4 MNHMQSLRFEHKLYAQVKQKMEEMQQHNMSWIEVQFLKKAVDVLCQCRATLMYTYVFAFY : :. .: .: .:. : .... ::. : . : : : .: :. :..:: CCDS48 TFHHQAREFAVNLRNRVSAIHEVPPPRSFT-----FLNDACQGLEQARKVLAYACVYSFY 2310 2320 2330 2340 2350 490 500 510 520 530 540 pF1KE4 LKKNNQSIIFENNQADLENATEVLSGYLERDISQDSLQDIKQKVQDKYRYCESRRRVLLQ . . . :.. .:: :..:. ::. . . .:. .... : : ::. CCDS48 SQDAEYMDVVEQQTENLELHTNALQILLEETLLR--CRDLASSLRLLRADCLSTGMELLR 2360 2370 2380 2390 2400 2410 550 pF1KE4 HVHEGYEKDLWEYIED ...: CCDS48 RIQERLLAILQHSAQDFRVGLQSPSVEAWEAKGPNMPGSQPQASSGPEAEEEEEDDEDDV 2420 2430 2440 2450 2460 2470 557 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 01:50:27 2016 done: Sun Nov 6 01:50:27 2016 Total Scan time: 3.720 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]