FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3903, 465 aa 1>>>pF1KE3903 465 - 465 aa - 465 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.1275+/-0.000971; mu= -2.4616+/- 0.059 mean_var=321.6713+/-64.858, 0's: 0 Z-trim(116.1): 58 B-trim: 0 in 0/52 Lambda= 0.071510 statistics sampled from 16592 (16647) to 16592 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.807), E-opt: 0.2 (0.511), width: 16 Scan time: 3.540 The best scores are: opt bits E(32554) CCDS6604.1 RNF38 gene_id:152006|Hs108|chr9 ( 465) 3360 360.0 3e-99 CCDS6603.1 RNF38 gene_id:152006|Hs108|chr9 ( 515) 3331 357.1 2.6e-98 CCDS4404.1 RNF44 gene_id:22838|Hs108|chr5 ( 432) 1836 202.7 6.1e-52 >>CCDS6604.1 RNF38 gene_id:152006|Hs108|chr9 (465 aa) initn: 3360 init1: 3360 opt: 3360 Z-score: 1895.3 bits: 360.0 E(32554): 3e-99 Smith-Waterman score: 3360; 100.0% identity (100.0% similar) in 465 aa overlap (1-465:1-465) 10 20 30 40 50 60 pF1KE3 MACKSEDSPSPKRQRLSHSVFDYTSASPAPSPPMRPWEMTSNRQPPSVRPSQHHFSGERC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 MACKSEDSPSPKRQRLSHSVFDYTSASPAPSPPMRPWEMTSNRQPPSVRPSQHHFSGERC 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 NTPARNRRSPPVRRQRGRRDRLSRHNSISQDENYHHLPYAQQQAIEEPRAFHPPNVSPRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 NTPARNRRSPPVRRQRGRRDRLSRHNSISQDENYHHLPYAQQQAIEEPRAFHPPNVSPRL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 LHPAAHPPQQNAVMVDIHDQLHQGTVPVSYTVTTVAPHGIPLCTGQHIPACSTQQVPGCS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 LHPAAHPPQQNAVMVDIHDQLHQGTVPVSYTVTTVAPHGIPLCTGQHIPACSTQQVPGCS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 VVFSGQHLPVCSVPPPMLQACSVQHLPVPYAAFPPLISSDPFLIHPPHLSPHHPPHLPPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 VVFSGQHLPVCSVPPPMLQACSVQHLPVPYAAFPPLISSDPFLIHPPHLSPHHPPHLPPP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 GQFVPFQTQQSRSPLQRIENEVELLGEHLPVGGFTYPPSAHPPTLPPSAPLQFLTHDPLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 GQFVPFQTQQSRSPLQRIENEVELLGEHLPVGGFTYPPSAHPPTLPPSAPLQFLTHDPLH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 QEVSFGVPYPPFMPRRLTGRSRYRSQQPIPPPPYHPSLLPYVLSMLPVPPAVGPTFSFEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 QEVSFGVPYPPFMPRRLTGRSRYRSQQPIPPPPYHPSLLPYVLSMLPVPPAVGPTFSFEL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 DVEDGEVENYEALLNLAERLGEAKPRGLTKADIEQLPSYRFNPNNHQSEQTLCVVCMCDF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 DVEDGEVENYEALLNLAERLGEAKPRGLTKADIEQLPSYRFNPNNHQSEQTLCVVCMCDF 370 380 390 400 410 420 430 440 450 460 pF1KE3 ESRQLLRVLPCNHEFHAKCVDKWLKANRTCPICRADASEVHRDSE ::::::::::::::::::::::::::::::::::::::::::::: CCDS66 ESRQLLRVLPCNHEFHAKCVDKWLKANRTCPICRADASEVHRDSE 430 440 450 460 >>CCDS6603.1 RNF38 gene_id:152006|Hs108|chr9 (515 aa) initn: 3331 init1: 3331 opt: 3331 Z-score: 1878.6 bits: 357.1 E(32554): 2.6e-98 Smith-Waterman score: 3331; 99.8% identity (100.0% similar) in 462 aa overlap (4-465:54-515) 10 20 30 pF1KE3 MACKSEDSPSPKRQRLSHSVFDYTSASPAPSPP .::::::::::::::::::::::::::::: CCDS66 ERVRLQSLFPLLPSDQNTTVQEDAHFKAFFQSEDSPSPKRQRLSHSVFDYTSASPAPSPP 30 40 50 60 70 80 40 50 60 70 80 90 pF1KE3 MRPWEMTSNRQPPSVRPSQHHFSGERCNTPARNRRSPPVRRQRGRRDRLSRHNSISQDEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 MRPWEMTSNRQPPSVRPSQHHFSGERCNTPARNRRSPPVRRQRGRRDRLSRHNSISQDEN 90 100 110 120 130 140 100 110 120 130 140 150 pF1KE3 YHHLPYAQQQAIEEPRAFHPPNVSPRLLHPAAHPPQQNAVMVDIHDQLHQGTVPVSYTVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 YHHLPYAQQQAIEEPRAFHPPNVSPRLLHPAAHPPQQNAVMVDIHDQLHQGTVPVSYTVT 150 160 170 180 190 200 160 170 180 190 200 210 pF1KE3 TVAPHGIPLCTGQHIPACSTQQVPGCSVVFSGQHLPVCSVPPPMLQACSVQHLPVPYAAF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 TVAPHGIPLCTGQHIPACSTQQVPGCSVVFSGQHLPVCSVPPPMLQACSVQHLPVPYAAF 210 220 230 240 250 260 220 230 240 250 260 270 pF1KE3 PPLISSDPFLIHPPHLSPHHPPHLPPPGQFVPFQTQQSRSPLQRIENEVELLGEHLPVGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 PPLISSDPFLIHPPHLSPHHPPHLPPPGQFVPFQTQQSRSPLQRIENEVELLGEHLPVGG 270 280 290 300 310 320 280 290 300 310 320 330 pF1KE3 FTYPPSAHPPTLPPSAPLQFLTHDPLHQEVSFGVPYPPFMPRRLTGRSRYRSQQPIPPPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 FTYPPSAHPPTLPPSAPLQFLTHDPLHQEVSFGVPYPPFMPRRLTGRSRYRSQQPIPPPP 330 340 350 360 370 380 340 350 360 370 380 390 pF1KE3 YHPSLLPYVLSMLPVPPAVGPTFSFELDVEDGEVENYEALLNLAERLGEAKPRGLTKADI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 YHPSLLPYVLSMLPVPPAVGPTFSFELDVEDGEVENYEALLNLAERLGEAKPRGLTKADI 390 400 410 420 430 440 400 410 420 430 440 450 pF1KE3 EQLPSYRFNPNNHQSEQTLCVVCMCDFESRQLLRVLPCNHEFHAKCVDKWLKANRTCPIC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 EQLPSYRFNPNNHQSEQTLCVVCMCDFESRQLLRVLPCNHEFHAKCVDKWLKANRTCPIC 450 460 470 480 490 500 460 pF1KE3 RADASEVHRDSE :::::::::::: CCDS66 RADASEVHRDSE 510 >>CCDS4404.1 RNF44 gene_id:22838|Hs108|chr5 (432 aa) initn: 1381 init1: 730 opt: 1836 Z-score: 1046.0 bits: 202.7 E(32554): 6.1e-52 Smith-Waterman score: 1836; 60.9% identity (79.8% similar) in 445 aa overlap (34-465:1-432) 10 20 30 40 50 60 pF1KE3 KSEDSPSPKRQRLSHSVFDYTSASPAPSPPMRPWEMTSNRQPPSVRPSQHHFSGERCNTP :::: .. .: :::. .:..::. .:: CCDS44 MRPWALAVTRWPPSAPVGQRRFSAGPGSTP 10 20 30 70 80 90 100 110 pF1KE3 ARNRRSP----PVRRQRGRRDRLSRHNSISQDENYHHLPYAQQQAIEEPRAFHPPNVSPR .. :: :. .: .:: .. :. ::: .:: :: : . ::: CCDS44 GQLWGSPGLEGPLASPPARDERLPSQQPPSRPP---HLP------VEERRASAPAGGSPR 40 50 60 70 80 120 130 140 150 160 170 pF1KE3 LLHPAAHPPQQNAVMVDIHDQLHQGTVPVSYTVTTVAPHGIPLCTGQHIPACSTQQVPGC .::::. ::. :::.:.:.::: ::.:::::::. .:.:: ::::::.::.::.:.: CCDS44 MLHPAT---QQSPFMVDLHEQVHQGPVPLSYTVTTVTTQGFPLPTGQHIPGCSAQQLPAC 90 100 110 120 130 180 190 200 210 220 230 pF1KE3 SVVFSGQHLPVCSVPPPMLQACSVQHLPVPYAAFPPLISSDPFLIHPPHLSPH-HPPHLP ::.::::: :.: .:::..:::..:.::::: :.: ::::: ...::: .: .: :. CCDS44 SVMFSGQHYPLCCLPPPLIQACTMQQLPVPYQAYPHLISSDHYILHPPPPAPPPQPTHMA 140 150 160 170 180 190 240 250 260 270 280 290 pF1KE3 PPGQFVPFQTQQSRSPLQRIENEVELLGEHLPVGGFTYPPSAHPPTLPPSAPLQFLTHDP : :::: .:::. : ::::..:.:.: :.. .:.::: :: :.: ::.::..: ::: CCDS44 PLGQFVSLQTQHPRMPLQRLDNDVDLRGDQPSLGSFTYSTSAPGPALSPSVPLHYLPHDP 200 210 220 230 240 250 300 310 320 330 340 350 pF1KE3 LHQEVSFGVPYPPFMPRRLTGRSRYRSQQPIPPPP-------YHPSLLPYVLSMLPVPP- ::::.:::::: .:::::. .::: :::.:::: :.::.::: :::::. : CCDS44 LHQELSFGVPYSHMMPRRLS-TQRYRLQQPLPPPPPPPPPPPYYPSFLPYFLSMLPMSPT 260 270 280 290 300 310 360 370 380 390 400 410 pF1KE3 AVGPTFSFELDVEDGEVENYEALLNLAERLGEAKPRGLTKADIEQLPSYRFNPNNHQSEQ :.:::.:..:::.: :.::::::::::::::.:::::::::::::::::::::..::::: CCDS44 AMGPTISLDLDVDDVEMENYEALLNLAERLGDAKPRGLTKADIEQLPSYRFNPDSHQSEQ 320 330 340 350 360 370 420 430 440 450 460 pF1KE3 TLCVVCMCDFESRQLLRVLPCNHEFHAKCVDKWLKANRTCPICRADASEVHRDSE ::::::. :::.::::::::::::::.::::::::::::::::::::::: :..: CCDS44 TLCVVCFSDFEARQLLRVLPCNHEFHTKCVDKWLKANRTCPICRADASEVPREAE 380 390 400 410 420 430 465 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 09:06:07 2016 done: Sun Nov 6 09:06:07 2016 Total Scan time: 3.540 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]