FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4065, 346 aa 1>>>pF1KE4065 346 - 346 aa - 346 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.4661+/-0.000889; mu= 7.9809+/- 0.054 mean_var=173.5470+/-35.054, 0's: 0 Z-trim(112.7): 69 B-trim: 51 in 1/51 Lambda= 0.097357 statistics sampled from 13319 (13388) to 13319 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.768), E-opt: 0.2 (0.411), width: 16 Scan time: 3.010 The best scores are: opt bits E(32554) CCDS32823.1 RNF165 gene_id:494470|Hs108|chr18 ( 346) 2438 354.2 9.5e-98 CCDS58621.1 RNF165 gene_id:494470|Hs108|chr18 ( 154) 1073 162.1 2.7e-40 CCDS10169.1 RNF111 gene_id:54778|Hs108|chr15 ( 986) 865 133.6 6.5e-31 CCDS58366.1 RNF111 gene_id:54778|Hs108|chr15 ( 994) 850 131.5 2.8e-30 CCDS58365.1 RNF111 gene_id:54778|Hs108|chr15 ( 995) 810 125.9 1.4e-28 CCDS81888.1 RNF111 gene_id:54778|Hs108|chr15 (1003) 777 121.3 3.5e-27 >>CCDS32823.1 RNF165 gene_id:494470|Hs108|chr18 (346 aa) initn: 2438 init1: 2438 opt: 2438 Z-score: 1868.4 bits: 354.2 E(32554): 9.5e-98 Smith-Waterman score: 2438; 100.0% identity (100.0% similar) in 346 aa overlap (1-346:1-346) 10 20 30 40 50 60 pF1KE4 MVLVHVGYLVLPVFGSVRNRGAPFQRSQHPHATSCRHFHLGPPQPQQLAPDFPLAHPVQS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MVLVHVGYLVLPVFGSVRNRGAPFQRSQHPHATSCRHFHLGPPQPQQLAPDFPLAHPVQS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 QPGLSAHMAPAHQHSGALHQSLTPLPTLQFQDVTGPSFLPQALHQQYLLQQQLLEAQHRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 QPGLSAHMAPAHQHSGALHQSLTPLPTLQFQDVTGPSFLPQALHQQYLLQQQLLEAQHRR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 LVSHPRRSQERVSVHPHRLHPSFDFGQLQTPQPRYLAEGTDWDLSVDAGLSPAQFQVRPI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 LVSHPRRSQERVSVHPHRLHPSFDFGQLQTPQPRYLAEGTDWDLSVDAGLSPAQFQVRPI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 PQHYQHYLATPRMHHFPRNSSSTQMVVHEIRNYPYPQLHFLALQGLNPSRHTSAVRESYE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 PQHYQHYLATPRMHHFPRNSSSTQMVVHEIRNYPYPQLHFLALQGLNPSRHTSAVRESYE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 ELLQLEDRLGNVTRGAVQNTIERFTFPHKYKKRRPQDGKGKKDEGEESDTDEKCTICLSM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 ELLQLEDRLGNVTRGAVQNTIERFTFPHKYKKRRPQDGKGKKDEGEESDTDEKCTICLSM 250 260 270 280 290 300 310 320 330 340 pF1KE4 LEDGEDVRRLPCMHLFHQLCVDQWLAMSKKCPICRVDIETQLGADS :::::::::::::::::::::::::::::::::::::::::::::: CCDS32 LEDGEDVRRLPCMHLFHQLCVDQWLAMSKKCPICRVDIETQLGADS 310 320 330 340 >>CCDS58621.1 RNF165 gene_id:494470|Hs108|chr18 (154 aa) initn: 1073 init1: 1073 opt: 1073 Z-score: 836.8 bits: 162.1 E(32554): 2.7e-40 Smith-Waterman score: 1073; 100.0% identity (100.0% similar) in 154 aa overlap (193-346:1-154) 170 180 190 200 210 220 pF1KE4 DLSVDAGLSPAQFQVRPIPQHYQHYLATPRMHHFPRNSSSTQMVVHEIRNYPYPQLHFLA :::::::::::::::::::::::::::::: CCDS58 MHHFPRNSSSTQMVVHEIRNYPYPQLHFLA 10 20 30 230 240 250 260 270 280 pF1KE4 LQGLNPSRHTSAVRESYEELLQLEDRLGNVTRGAVQNTIERFTFPHKYKKRRPQDGKGKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 LQGLNPSRHTSAVRESYEELLQLEDRLGNVTRGAVQNTIERFTFPHKYKKRRPQDGKGKK 40 50 60 70 80 90 290 300 310 320 330 340 pF1KE4 DEGEESDTDEKCTICLSMLEDGEDVRRLPCMHLFHQLCVDQWLAMSKKCPICRVDIETQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 DEGEESDTDEKCTICLSMLEDGEDVRRLPCMHLFHQLCVDQWLAMSKKCPICRVDIETQL 100 110 120 130 140 150 pF1KE4 GADS :::: CCDS58 GADS >>CCDS10169.1 RNF111 gene_id:54778|Hs108|chr15 (986 aa) initn: 690 init1: 389 opt: 865 Z-score: 668.4 bits: 133.6 E(32554): 6.5e-31 Smith-Waterman score: 923; 42.3% identity (66.2% similar) in 364 aa overlap (12-346:637-986) 10 20 30 pF1KE4 MVLVHVGYLVLPVFGSVRNRGAP----FQRSQHPHATSCRH : ..: :. : . : : .:..: : CCDS10 AAAAPSQPLSSIDGYGSSMVAQPQPQPPPQPSLSSCRHYMPPPYASLTRPLHHQASACPH 610 620 630 640 650 660 40 50 60 70 80 pF1KE4 FHLGPPQPQQLAP---DFPLAHPVQS-QPGLSAHMA--------PAHQHSGA--LHQSLT : .:: :: : :. . :::.. . .:.: . :.: : : . : : CCDS10 SHGNPP-PQTQPPPQVDYVIPHPVHAFHSQISSHATSHPVAPPPPTHLASTAAPIPQHLP 670 680 690 700 710 720 90 100 110 120 130 140 pF1KE4 PL--PTLQFQDVTGPSFLPQALHQQYLLQQQLLEAQHRRLVSHPRRSQERVSVHPHRLHP : : . .:.: : :: . ..:. .:.:.::...:: :..:: ::::.:: CCDS10 PTHQPISHHIPATAPP--AQRLHPHEVMQR--MEVQRRRMMQHPTRAHERPPPHPHRMHP 730 740 750 760 770 780 150 160 170 180 190 pF1KE4 SFDFGQ-LQTPQ-----PRYLAEGTDWDLSVDAGLSPAQFQVRPIPQHYQHYLATPRMHH .. :. ...:: :: : . :.:...::.. : . . : :: : ::.:: CCDS10 NYGHGHHIHVPQTMSSHPRQAPERSAWELGIEAGVTAATYTPGALHPHLAHYHAPPRLHH 790 800 810 820 830 840 200 210 220 230 240 250 pF1KE4 FPRNSSSTQMVVHEIRNYPYPQLHFLALQGLNPSRHTSAVRESYEELLQLEDRLGNVTRG . . .. ..: .. .::. ..... .::. . . : ..:::..::.:::::.:: CCDS10 L--QLGALPLMVPDMAGYPH--IRYIS-SGLDGTSFRGPFRGNFEELIHLEERLGNVNRG 850 860 870 880 890 260 270 280 290 300 310 pF1KE4 AVQNTIERFTFPHKYKKRR---PQDGKGKKDEGEESDTDEKCTICLSMLEDGEDVRRLPC : :.:::: :.:::::::. ::: .:: : ::.::::::::.::.::::::::: CCDS10 ASQGTIERCTYPHKYKKRKLHCKQDG----EEGTEEDTEEKCTICLSILEEGEDVRRLPC 900 910 920 930 940 950 320 330 340 pF1KE4 MHLFHQLCVDQWLAMSKKCPICRVDIETQLGADS ::::::.:::::: .:::::::::::.:: ..: CCDS10 MHLFHQVCVDQWLITNKKCPICRVDIEAQLPSES 960 970 980 >>CCDS58366.1 RNF111 gene_id:54778|Hs108|chr15 (994 aa) initn: 798 init1: 377 opt: 850 Z-score: 657.0 bits: 131.5 E(32554): 2.8e-30 Smith-Waterman score: 903; 41.3% identity (64.9% similar) in 368 aa overlap (12-346:637-994) 10 20 30 pF1KE4 MVLVHVGYLVLPVFGSVRNRGAP----FQRSQHPHATSCRH : ..: :. : . : : .:..: : CCDS58 AAAAPSQPLSSIDGYGSSMVAQPQPQPPPQPSLSSCRHYMPPPYASLTRPLHHQASACPH 610 620 630 640 650 660 40 50 60 70 80 pF1KE4 FHLGPPQPQQLAP---DFPLAHPVQS-QPGLSAHMA--------PAHQHSGA--LHQSLT : .:: :: : :. . :::.. . .:.: . :.: : : . : : CCDS58 SHGNPP-PQTQPPPQVDYVIPHPVHAFHSQISSHATSHPVAPPPPTHLASTAAPIPQHLP 670 680 690 700 710 720 90 100 110 120 130 140 pF1KE4 PL--PTLQFQDVTGPSFLPQALHQQYLLQQQLLEAQHRRLVSHPRRSQERVSVHPHRLHP : : . .:.: : :: . ..:. .:.:.::...:: :..:: ::::.:: CCDS58 PTHQPISHHIPATAPP--AQRLHPHEVMQR--MEVQRRRMMQHPTRAHERPPPHPHRMHP 730 740 750 760 770 780 150 160 170 180 190 pF1KE4 SFDFGQ-LQTPQ-----PRYLAEGTDWDLSVDAGLSPAQFQVRPIPQHYQHYLATPRMHH .. :. ...:: :: : . :.:...::.. : . . : :: : ::.:: CCDS58 NYGHGHHIHVPQTMSSHPRQAPERSAWELGIEAGVTAATYTPGALHPHLAHYHAPPRLHH 790 800 810 820 830 840 200 210 220 230 240 250 pF1KE4 FPRNSSSTQMVVHEIRNYPYPQLHFLALQGLNPSRHTSAVRESYEELLQLEDRLGNVTRG . . .. ..: .. .::. ..... .::. . . : ..:::..::.:::::.:: CCDS58 L--QLGALPLMVPDMAGYPH--IRYIS-SGLDGTSFRGPFRGNFEELIHLEERLGNVNRG 850 860 870 880 890 260 270 280 290 300 pF1KE4 AVQNTIERFTFPHKYKK-------RRPQDGKGKKDEGEESDTDEKCTICLSMLEDGEDVR : :.:::: :.:::::: .: : .:: : ::.::::::::.::.::::: CCDS58 ASQGTIERCTYPHKYKKVTTDWFSQRKLHCKQDGEEGTEEDTEEKCTICLSILEEGEDVR 900 910 920 930 940 950 310 320 330 340 pF1KE4 RLPCMHLFHQLCVDQWLAMSKKCPICRVDIETQLGADS ::::::::::.:::::: .:::::::::::.:: ..: CCDS58 RLPCMHLFHQVCVDQWLITNKKCPICRVDIEAQLPSES 960 970 980 990 >>CCDS58365.1 RNF111 gene_id:54778|Hs108|chr15 (995 aa) initn: 682 init1: 389 opt: 810 Z-score: 626.6 bits: 125.9 E(32554): 1.4e-28 Smith-Waterman score: 903; 41.6% identity (64.9% similar) in 373 aa overlap (12-346:637-995) 10 20 30 pF1KE4 MVLVHVGYLVLPVFGSVRNRGAP----FQRSQHPHATSCRH : ..: :. : . : : .:..: : CCDS58 AAAAPSQPLSSIDGYGSSMVAQPQPQPPPQPSLSSCRHYMPPPYASLTRPLHHQASACPH 610 620 630 640 650 660 40 50 60 70 80 pF1KE4 FHLGPPQPQQLAP---DFPLAHPVQS-QPGLSAHMA--------PAHQHSGA--LHQSLT : .:: :: : :. . :::.. . .:.: . :.: : : . : : CCDS58 SHGNPP-PQTQPPPQVDYVIPHPVHAFHSQISSHATSHPVAPPPPTHLASTAAPIPQHLP 670 680 690 700 710 720 90 100 110 120 130 pF1KE4 PL--PTLQFQDVTGPSFLPQALHQQYLLQQQLLEAQHRRLVSHP---------RRSQERV : : . .:.: : :: . ..:. .:.:.::...:: ::..:: CCDS58 PTHQPISHHIPATAPP--AQRLHPHEVMQR--MEVQRRRMMQHPTGLFVFCVSRRAHERP 730 740 750 760 770 780 140 150 160 170 180 pF1KE4 SVHPHRLHPSFDFGQ-LQTPQ-----PRYLAEGTDWDLSVDAGLSPAQFQVRPIPQHYQH ::::.::.. :. ...:: :: : . :.:...::.. : . . : : CCDS58 PPHPHRMHPNYGHGHHIHVPQTMSSHPRQAPERSAWELGIEAGVTAATYTPGALHPHLAH 790 800 810 820 830 840 190 200 210 220 230 240 pF1KE4 YLATPRMHHFPRNSSSTQMVVHEIRNYPYPQLHFLALQGLNPSRHTSAVRESYEELLQLE : : ::.::. . .. ..: .. .::. ..... .::. . . : ..:::..:: CCDS58 YHAPPRLHHL--QLGALPLMVPDMAGYPH--IRYIS-SGLDGTSFRGPFRGNFEELIHLE 850 860 870 880 890 250 260 270 280 290 300 pF1KE4 DRLGNVTRGAVQNTIERFTFPHKYKKRR---PQDGKGKKDEGEESDTDEKCTICLSMLED .:::::.::: :.:::: :.:::::::. ::: .:: : ::.::::::::.::. CCDS58 ERLGNVNRGASQGTIERCTYPHKYKKRKLHCKQDG----EEGTEEDTEEKCTICLSILEE 900 910 920 930 940 950 310 320 330 340 pF1KE4 GEDVRRLPCMHLFHQLCVDQWLAMSKKCPICRVDIETQLGADS :::::::::::::::.:::::: .:::::::::::.:: ..: CCDS58 GEDVRRLPCMHLFHQVCVDQWLITNKKCPICRVDIEAQLPSES 960 970 980 990 >>CCDS81888.1 RNF111 gene_id:54778|Hs108|chr15 (1003 aa) initn: 740 init1: 377 opt: 777 Z-score: 601.5 bits: 121.3 E(32554): 3.5e-27 Smith-Waterman score: 883; 40.6% identity (63.7% similar) in 377 aa overlap (12-346:637-1003) 10 20 30 pF1KE4 MVLVHVGYLVLPVFGSVRNRGAP----FQRSQHPHATSCRH : ..: :. : . : : .:..: : CCDS81 AAAAPSQPLSSIDGYGSSMVAQPQPQPPPQPSLSSCRHYMPPPYASLTRPLHHQASACPH 610 620 630 640 650 660 40 50 60 70 80 pF1KE4 FHLGPPQPQQLAP---DFPLAHPVQS-QPGLSAHMA--------PAHQHSGA--LHQSLT : .:: :: : :. . :::.. . .:.: . :.: : : . : : CCDS81 SHGNPP-PQTQPPPQVDYVIPHPVHAFHSQISSHATSHPVAPPPPTHLASTAAPIPQHLP 670 680 690 700 710 720 90 100 110 120 130 pF1KE4 PL--PTLQFQDVTGPSFLPQALHQQYLLQQQLLEAQHRRLVSHP---------RRSQERV : : . .:.: : :: . ..:. .:.:.::...:: ::..:: CCDS81 PTHQPISHHIPATAPP--AQRLHPHEVMQR--MEVQRRRMMQHPTGLFVFCVSRRAHERP 730 740 750 760 770 780 140 150 160 170 180 pF1KE4 SVHPHRLHPSFDFGQ-LQTPQ-----PRYLAEGTDWDLSVDAGLSPAQFQVRPIPQHYQH ::::.::.. :. ...:: :: : . :.:...::.. : . . : : CCDS81 PPHPHRMHPNYGHGHHIHVPQTMSSHPRQAPERSAWELGIEAGVTAATYTPGALHPHLAH 790 800 810 820 830 840 190 200 210 220 230 240 pF1KE4 YLATPRMHHFPRNSSSTQMVVHEIRNYPYPQLHFLALQGLNPSRHTSAVRESYEELLQLE : : ::.::. . .. ..: .. .::. ..... .::. . . : ..:::..:: CCDS81 YHAPPRLHHL--QLGALPLMVPDMAGYPH--IRYIS-SGLDGTSFRGPFRGNFEELIHLE 850 860 870 880 890 250 260 270 280 290 pF1KE4 DRLGNVTRGAVQNTIERFTFPHKYKK-------RRPQDGKGKKDEGEESDTDEKCTICLS .:::::.::: :.:::: :.:::::: .: : .:: : ::.:::::::: CCDS81 ERLGNVNRGASQGTIERCTYPHKYKKVTTDWFSQRKLHCKQDGEEGTEEDTEEKCTICLS 900 910 920 930 940 950 300 310 320 330 340 pF1KE4 MLEDGEDVRRLPCMHLFHQLCVDQWLAMSKKCPICRVDIETQLGADS .::.:::::::::::::::.:::::: .:::::::::::.:: ..: CCDS81 ILEEGEDVRRLPCMHLFHQVCVDQWLITNKKCPICRVDIEAQLPSES 960 970 980 990 1000 346 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:00:41 2016 done: Sun Nov 6 04:00:41 2016 Total Scan time: 3.010 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]