FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3670, 493 aa 1>>>pF1KE3670 493 - 493 aa - 493 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.3399+/-0.000942; mu= 10.1158+/- 0.057 mean_var=146.8961+/-29.956, 0's: 0 Z-trim(110.5): 35 B-trim: 41 in 1/50 Lambda= 0.105820 statistics sampled from 11646 (11674) to 11646 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.707), E-opt: 0.2 (0.359), width: 16 Scan time: 3.280 The best scores are: opt bits E(32554) CCDS2780.1 ARIH2 gene_id:10425|Hs108|chr3 ( 493) 3509 547.4 1.3e-155 CCDS10244.1 ARIH1 gene_id:25820|Hs108|chr15 ( 557) 1135 185.0 1.8e-46 CCDS4890.1 CUL9 gene_id:23113|Hs108|chr6 (2517) 588 102.0 7.9e-21 CCDS1657.1 RNF144A gene_id:9781|Hs108|chr2 ( 292) 380 69.6 5.4e-12 CCDS34345.1 RNF144B gene_id:255488|Hs108|chr6 ( 303) 379 69.4 6.2e-12 >>CCDS2780.1 ARIH2 gene_id:10425|Hs108|chr3 (493 aa) initn: 3509 init1: 3509 opt: 3509 Z-score: 2907.3 bits: 547.4 E(32554): 1.3e-155 Smith-Waterman score: 3509; 100.0% identity (100.0% similar) in 493 aa overlap (1-493:1-493) 10 20 30 40 50 60 pF1KE3 MSVDMNSQGSDSNEEDYDPNCEEEEEEEEDDPGDIEDYYVGVASDVEQQGADAFDPEEYQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS27 MSVDMNSQGSDSNEEDYDPNCEEEEEEEEDDPGDIEDYYVGVASDVEQQGADAFDPEEYQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 FTCLTYKESEGALNEHMTSLASVLKVSHSVAKLILVNFHWQVSEILDRYKSNSAQLLVEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS27 FTCLTYKESEGALNEHMTSLASVLKVSHSVAKLILVNFHWQVSEILDRYKSNSAQLLVEA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 RVQPNPSKHVPTSHPPHHCAVCMQFVRKENLLSLACQHQFCRSCWEQHCSVLVKDGVGVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS27 RVQPNPSKHVPTSHPPHHCAVCMQFVRKENLLSLACQHQFCRSCWEQHCSVLVKDGVGVG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 VSCMAQDCPLRTPEDFVFPLLPNEELREKYRRYLFRDYVESHYQLQLCPGADCPMVIRVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS27 VSCMAQDCPLRTPEDFVFPLLPNEELREKYRRYLFRDYVESHYQLQLCPGADCPMVIRVQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 EPRARRVQCNRCNEVFCFKCRQMYHAPTDCATIRKWLTKCADDSETANYISAHTKDCPKC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS27 EPRARRVQCNRCNEVFCFKCRQMYHAPTDCATIRKWLTKCADDSETANYISAHTKDCPKC 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 NICIEKNGGCNHMQCSKCKHDFCWMCLGDWKTHGSEYYECSRYKENPDIVNQSQQAQARE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS27 NICIEKNGGCNHMQCSKCKHDFCWMCLGDWKTHGSEYYECSRYKENPDIVNQSQQAQARE 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 ALKKYLFYFERWENHNKSLQLEAQTYQRIHEKIQERVMNNLGTWIDWQYLQNAAKLLAKC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS27 ALKKYLFYFERWENHNKSLQLEAQTYQRIHEKIQERVMNNLGTWIDWQYLQNAAKLLAKC 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE3 RYTLQYTYPYAYYMESGPRKKLFEYQQAQLEAEIENLSWKVERADSYDRGDLENQMHIAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS27 RYTLQYTYPYAYYMESGPRKKLFEYQQAQLEAEIENLSWKVERADSYDRGDLENQMHIAE 430 440 450 460 470 480 490 pF1KE3 QRRRTLLKDFHDT ::::::::::::: CCDS27 QRRRTLLKDFHDT 490 >>CCDS10244.1 ARIH1 gene_id:25820|Hs108|chr15 (557 aa) initn: 950 init1: 319 opt: 1135 Z-score: 947.8 bits: 185.0 E(32554): 1.8e-46 Smith-Waterman score: 1135; 35.4% identity (68.8% similar) in 452 aa overlap (57-492:97-545) 30 40 50 60 70 80 pF1KE3 EEEDDPGDIEDYYVGVASDVEQQGADAFDPEEYQFTCLTYKESEGALNEHMTSLASVLKV :.:.. :: .. . : . . :.. CCDS10 GGGGGSALGPGGGGGGGGGGGGGGPGHEQEEDYRYEVLTAEQILQHMVECIREVNEVIQN 70 80 90 100 110 120 90 100 110 120 130 pF1KE3 SHSVAKLILVNFHWQVSEILDRY-KSNSAQLLVEARVQPNPSKHVPT-------SHPPHH ......: .:.:. ....:: .: .:..: .: ::::. : : CCDS10 PATITRILLSHFNWDKEKLMERYFDGNLEKLFAECHVI-NPSKKSRTRQMNTRSSAQDMP 130 140 150 160 170 180 140 150 160 170 180 190 pF1KE3 CAVCMQFVRKENLLSLACQHQFCRSCWEQHCSV-LVKDGVGVGVSCMAQDCPLRTPEDFV : .:. . . .: : :.:: .:: .. .. ....:.: .:: :. : . . .. : CCDS10 CQICYLNYPNSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDILVDDNTV 190 200 210 220 230 240 200 210 220 230 240 250 pF1KE3 FPLLPNEELREKYRRYLFRDYVESHYQLQLCPGADCPMVIRVQEPRARRVQCNRCNEVFC . :. . ... ::.. . ..:: . :. ::. :: :..:: : :. :.: .:.. :: CCDS10 MRLITDSKVKLKYQHLITNSFVECNRLLKWCPAPDCHHVVKVQYPDAKPVRC-KCGRQFC 250 260 270 280 290 300 260 270 280 290 300 310 pF1KE3 FKCRQMYHAPTDCATIRKWLTKCADDSETANYISAHTKDCPKCNICIEKNGGCNHMQC-- :.: . .: :. : ..::. :: :::::.:.:.:.::.::::.. :::.:::::: : CCDS10 FNCGENWHDPVKCKWLKKWIKKCDDDSETSNWIAANTKECPKCHVTIEKDGGCNHMVCRN 310 320 330 340 350 360 320 330 340 350 360 370 pF1KE3 SKCKHDFCWMCLGDWKTHGSEYYECSRYKENPDIVNQSQQAQAREALKKYLFYFERWENH ..:: .:::.::: :. ::: .:.:.::.:. . .. : ..: ::..:::: .:. :: CCDS10 QNCKAEFCWVCLGPWEPHGSAWYNCNRYNEDDAKAARDAQERSRAALQRYLFYCNRYMNH 370 380 390 400 410 420 380 390 400 410 420 430 pF1KE3 NKSLQLEAQTYQRIHEKIQERVMNNLGTWIDWQYLQNAAKLLAKCRYTLQYTYPYAYYME .::..: . : ....:..: ..:. .::. :.:..:. .: .:: ::.::: .:.:.. CCDS10 MQSLRFEHKLYAQVKQKMEEMQQHNM-SWIEVQFLKKAVDVLCQCRATLMYTYVFAFYLK 430 440 450 460 470 480 440 450 460 470 480 490 pF1KE3 SGPRKKLFEYQQAQLEAEIENLSWKVERADSYD-----RGDLENQMHIAEQRRRTLLKDF .. .. .:: .::.:: : :: .:: : : . ...... :.:::.::. CCDS10 KNNQSIIFENNQADLENATEVLSGYLERDISQDSLQDIKQKVQDKYRYCESRRRVLLQHV 490 500 510 520 530 540 pF1KE3 HDT :. CCDS10 HEGYEKDLWEYIED 550 >>CCDS4890.1 CUL9 gene_id:23113|Hs108|chr6 (2517 aa) initn: 646 init1: 238 opt: 588 Z-score: 487.4 bits: 102.0 E(32554): 7.9e-21 Smith-Waterman score: 634; 27.2% identity (58.8% similar) in 427 aa overlap (64-478:1997-2399) 40 50 60 70 80 90 pF1KE3 DIEDYYVGVASDVEQQGADAFDPEEYQFTCLTYKESEGALNEHMTSLASVLKVSHSVAKL .. .: :: ... . .. .:.. .::. CCDS48 PFCGSQSETSKPSPEAVATLASLQLPAGRTMSPQEVEGLMKQTVRQVQETLNLEPDVAQH 1970 1980 1990 2000 2010 2020 100 110 120 130 140 150 pF1KE3 ILVNFHWQVSEILDRYKSNSAQLLVEARVQPNPSKHVPTSHPPHHCAVCMQFVR-KENLL .:.. :: . ..:. :. . ::. : . . .. ::. : :: ::.. . ..: CCDS48 LLAHSHWGAEQLLQSYSEDPEPLLLAAGLCVHQAQAVPVR--PDHCPVCVSPLGCDDDLP 2030 2040 2050 2060 2070 2080 160 170 180 190 200 210 pF1KE3 SLACQHQFCRSCWEQHCSVLVKDGVGVGVSCMAQDCPLRTPEDFVFPLLPNEELREKYRR :: :.: :.:::... .. ..... .. .: ::: . :. .. . :. ::.. CCDS48 SLCCMHYCCKSCWNEYLTTRIEQNLVLNCTCPIADCPAQPTGAFIRAIVSSPEVISKYEK 2090 2100 2110 2120 2130 2140 220 230 240 250 260 270 pF1KE3 YLFRDYVESHYQLQLCPGAD-CPMVIRVQEPRARRVQCNRCNEVFCFKCR-QMYHAPTDC :.: :::: .: : . . : .. .. . . :..:. . ::.: : :..: CCDS48 ALLRGYVESCSNLTWCTNPQGCDRIL-CRQGLGCGTTCSKCGWASCFNCSFPEAHYPASC 2150 2160 2170 2180 2190 2200 280 290 300 310 320 pF1KE3 ATIRKWLTKCAD-DSETANYISAH-----TKDCPKCNICIEKNGGCNHMQCSKCKHDFCW . . .:. . :. ... : : .: ::.:. :::: :: :: :.::.: ::: CCDS48 GHMSQWVDDGGYYDGMSVEAQSKHLAKLISKRCPSCQAPIEKNEGCLHMTCAKCNHGFCW 2210 2220 2230 2240 2250 2260 330 340 350 360 370 380 pF1KE3 MCLGDWKTHGSEYYECSRYKENPDIVNQSQQAQAREALKKYLFYFERWENHNKSLQLEAQ :: .:: . ..::.:: .:... ::. :.. : :: :... .. .. CCDS48 RCLKSWKPNHKDYYNCSA------MVSKA----ARQE-KRFQDYNERCTFHHQAREFAVN 2270 2280 2290 2300 2310 390 400 410 420 430 440 pF1KE3 TYQR---IHEKIQERVMNNLGTWIDWQYLQNAAKLLAKCRYTLQYTYPYAYYMESGPRKK .: ::: : .. .:..: . : . : .: :. :..: ... CCDS48 LRNRVSAIHEVPPPR---------SFTFLNDACQGLEQARKVLAYACVYSFYSQDAEYMD 2320 2330 2340 2350 2360 450 460 470 480 490 pF1KE3 LFEYQQAQLEAEIENLSWKVERADSYDRGDLENQMHIAEQRRRTLLKDFHDT . : : .:: . . :. .:.. : :: ..... CCDS48 VVEQQTENLELHTNALQILLEETLLRCR-DLASSLRLLRADCLSTGMELLRRIQERLLAI 2370 2380 2390 2400 2410 2420 CCDS48 LQHSAQDFRVGLQSPSVEAWEAKGPNMPGSQPQASSGPEAEEEEEDDEDDVPEWQQDEFD 2430 2440 2450 2460 2470 2480 >>CCDS1657.1 RNF144A gene_id:9781|Hs108|chr2 (292 aa) initn: 339 init1: 184 opt: 380 Z-score: 328.8 bits: 69.6 E(32554): 5.4e-12 Smith-Waterman score: 380; 28.9% identity (53.7% similar) in 201 aa overlap (135-327:16-215) 110 120 130 140 150 160 pF1KE3 ILDRYKSNSAQLLVEARVQPNPSKHVPTSHPPHHCAVCMQFVRKENLLSLA-CQHQFCRS : : .:. :.. ..: :: :: CCDS16 MTTTRYRPTWDLALDPLVSCKLCLGEYPVEQMTTIAQCQCIFCTL 10 20 30 40 170 180 190 200 210 220 pF1KE3 CWEQHCSVLVKDGVGVGVSCMAQDCPLRTP-EDFVFPLLPNEELREKYRRYLFRDYVESH : .:. .:.:.:. ...:: :: . .. . . :. ..:.. :. : CCDS16 CLKQYVELLIKEGLETAISCPDAACPKQGHLQENEIECMVAAEIMQRYKKLQFEREVLFD 50 60 70 80 90 100 230 240 250 260 270 pF1KE3 YQLQLCPGADCPMVIRVQE---PRARRVQCNRCNEVFCFKCRQMYHAPTDCATIRKWLTK ::.. : : ..:. . :::. : :: :. .: : .: CCDS16 PCRTWCPASTCQAVCQLQDVGLQTPQPVQCKACRMEFCSTCKASWHPGQGCPETMP-ITF 110 120 130 140 150 160 280 290 300 310 320 330 pF1KE3 CADDSETANYI---SAHTKDCPKCNICIEKNGGCNHMQCSKCKHDFCWMCLGDWKTHGSE .. .: . .: : ::::.. ::.. :: .:.:..::: :::.:: CCDS16 LPGETSAAFKMEEDDAPIKRCPKCKVYIERDEGCAQMMCKNCKHAFCWYCLESLDDDFLL 170 180 190 200 210 220 340 350 360 370 380 390 pF1KE3 YYECSRYKENPDIVNQSQQAQAREALKKYLFYFERWENHNKSLQLEAQTYQRIHEKIQER CCDS16 IHYDKGPCRNKLGHSRASVIWHRTQVVGIFAGFGLLLLVASPFLLLATPFVLCCKCKCSK 230 240 250 260 270 280 >>CCDS34345.1 RNF144B gene_id:255488|Hs108|chr6 (303 aa) initn: 329 init1: 217 opt: 379 Z-score: 327.8 bits: 69.4 E(32554): 6.2e-12 Smith-Waterman score: 379; 29.0% identity (54.3% similar) in 210 aa overlap (124-327:17-223) 100 110 120 130 140 150 pF1KE3 ILVNFHWQVSEILDRYKSNSAQLLVEARVQPNPSKHVPTSHPPHHCAVCMQFVRKENLLS :.:. .:. : : .:. ... . CCDS34 MGSAGRLHYLAMTAENPTPGDLAPA--PLITCKLCLCEQSLDKMTT 10 20 30 40 160 170 180 190 200 210 pF1KE3 LA-CQHQFCRSCWEQHCSVLVKDGVGVGVSCMAQDC--PLRTPEDFVFPLLPNEELREKY : :: :: .: .:. .. ...: : ..: . : : . :.: ... . : CCDS34 LQECQCIFCTACLKQYMQLAIREGCGSPITCPDMVCLNHGTLQEAEIACLVPVDQF-QLY 50 60 70 80 90 100 220 230 240 250 260 pF1KE3 RRYLFRDYVESHYQLQLCPGADCPMVIRV--QEP-RARRVQCNRCNEVFCFKCRQMYHAP .: :. :. :: ::: : : ..: . :.: :. :: :.. .:: CCDS34 QRLKFEREVHLDPYRTWCPVADCQTVCPVASSDPGQPVLVECPSCHLKFCSCCKDAWHAE 110 120 130 140 150 160 270 280 290 300 310 320 pF1KE3 TDCATIRKWLTKCADDSETANYISAHTKDCPKCNICIEKNGGCNHMQCSKCKHDFCWMCL ..: . . . .. : :.:: : . ::.: :: .:.:..::: :::.:: CCDS34 VSCRDSQPIVLPTEHRALFGTDAEAPIKQCPVCRVYIERNEGCAQMMCKNCKHTFCWYCL 170 180 190 200 210 220 330 340 350 360 370 380 pF1KE3 GDWKTHGSEYYECSRYKENPDIVNQSQQAQAREALKKYLFYFERWENHNKSLQLEAQTYQ CCDS34 QNLDNDIFLRHYDKGPCRNKLGHSRASVMWNRTQVVGILVGLGIIALVTSPLLLLASPCI 230 240 250 260 270 280 493 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 23:36:06 2016 done: Sun Nov 6 23:36:07 2016 Total Scan time: 3.280 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]