FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3287, 740 aa 1>>>pF1KE3287 740 - 740 aa - 740 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.1802+/-0.00109; mu= 17.0492+/- 0.066 mean_var=118.1419+/-23.716, 0's: 0 Z-trim(106.1): 82 B-trim: 0 in 0/50 Lambda= 0.117997 statistics sampled from 8734 (8806) to 8734 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.63), E-opt: 0.2 (0.271), width: 16 Scan time: 3.580 The best scores are: opt bits E(32554) CCDS3015.1 DTX3L gene_id:151636|Hs108|chr3 ( 740) 4918 849.1 0 CCDS41800.1 DTX3 gene_id:196403|Hs108|chr12 ( 347) 751 139.5 8.6e-33 CCDS66410.1 DTX3 gene_id:196403|Hs108|chr12 ( 350) 751 139.5 8.7e-33 CCDS43605.1 DTX2 gene_id:113878|Hs108|chr7 ( 575) 521 100.5 7.6e-21 CCDS5587.1 DTX2 gene_id:113878|Hs108|chr7 ( 622) 521 100.6 8.1e-21 CCDS76408.1 DTX4 gene_id:23220|Hs108|chr11 ( 513) 508 98.3 3.2e-20 CCDS44612.1 DTX4 gene_id:23220|Hs108|chr11 ( 619) 508 98.3 3.7e-20 CCDS9164.1 DTX1 gene_id:1840|Hs108|chr12 ( 620) 498 96.6 1.2e-19 >>CCDS3015.1 DTX3L gene_id:151636|Hs108|chr3 (740 aa) initn: 4918 init1: 4918 opt: 4918 Z-score: 4531.6 bits: 849.1 E(32554): 0 Smith-Waterman score: 4918; 100.0% identity (100.0% similar) in 740 aa overlap (1-740:1-740) 10 20 30 40 50 60 pF1KE3 MASHLRPPSPLLVRVYKSGPRVRRKLESYFQSSKSSGGGECTVSTQEHEAPGTFRVEFSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 MASHLRPPSPLLVRVYKSGPRVRRKLESYFQSSKSSGGGECTVSTQEHEAPGTFRVEFSE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 RAAKERVLKKGEHQILVDEKPVPIFLVPTENSIKKNTRPQISSLTQSQAETPSGDMHQHE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 RAAKERVLKKGEHQILVDEKPVPIFLVPTENSIKKNTRPQISSLTQSQAETPSGDMHQHE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 GHIPNAVDSCLQKIFLTVTADLNCNLFSKEQRAYITTLCPSIRKMEGHDGIEKVCGDFQD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 GHIPNAVDSCLQKIFLTVTADLNCNLFSKEQRAYITTLCPSIRKMEGHDGIEKVCGDFQD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 IERIHQFLSEQFLESEQKQQFSPSMTERKPLSQQERDSCISPSEPETKAEQKSNYFEVPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 IERIHQFLSEQFLESEQKQQFSPSMTERKPLSQQERDSCISPSEPETKAEQKSNYFEVPL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 PYFEYFKYICPDKINSIEKRFGVNIEIQESSPNMVCLDFTSSRSGDLEAARESFASEFQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 PYFEYFKYICPDKINSIEKRFGVNIEIQESSPNMVCLDFTSSRSGDLEAARESFASEFQK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 NTEPLKQECVSLADSKQANKFKQELNHQFTKLLIKEKGGELTLLGTQDDISAAKQKISEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 NTEPLKQECVSLADSKQANKFKQELNHQFTKLLIKEKGGELTLLGTQDDISAAKQKISEA 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 FVKIPVKLFAANYMMNVIEVDSAHYKLLETELLQEISEIEKRYDICSKVSEKGQKTCILF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 FVKIPVKLFAANYMMNVIEVDSAHYKLLETELLQEISEIEKRYDICSKVSEKGQKTCILF 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE3 ESKDRQVDLSVHAYASFIDAFQHASCQLMREVLLLKSLGKERKHLHQTKFADDFRKRHPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 ESKDRQVDLSVHAYASFIDAFQHASCQLMREVLLLKSLGKERKHLHQTKFADDFRKRHPN 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE3 VHFVLNQESMTLTGLPNHLAKAKQYVLKGGGMSSLAGKKLKEGHETPMDIDSDDSKAASP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 VHFVLNQESMTLTGLPNHLAKAKQYVLKGGGMSSLAGKKLKEGHETPMDIDSDDSKAASP 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE3 PLKGSVSSEASELDKKEKGICVICMDTISNKKVLPKCKHEFCAPCINKAMSYKPICPTCQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 PLKGSVSSEASELDKKEKGICVICMDTISNKKVLPKCKHEFCAPCINKAMSYKPICPTCQ 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE3 TSYGIQKGNQPEGSMVFTVSRDSLPGYESFGTIVITYSMKAGIQTEEHPNPGKRYPGIQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 TSYGIQKGNQPEGSMVFTVSRDSLPGYESFGTIVITYSMKAGIQTEEHPNPGKRYPGIQR 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE3 TAYLPDNKEGRKVLKLLYRAFDQKLIFTVGYSRVLGVSDVITWNDIHHKTSRFGGPEMYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 TAYLPDNKEGRKVLKLLYRAFDQKLIFTVGYSRVLGVSDVITWNDIHHKTSRFGGPEMYG 670 680 690 700 710 720 730 740 pF1KE3 YPDPSYLKRVKEELKAKGIE :::::::::::::::::::: CCDS30 YPDPSYLKRVKEELKAKGIE 730 740 >>CCDS41800.1 DTX3 gene_id:196403|Hs108|chr12 (347 aa) initn: 742 init1: 532 opt: 751 Z-score: 702.2 bits: 139.5 E(32554): 8.6e-33 Smith-Waterman score: 751; 53.4% identity (71.6% similar) in 204 aa overlap (540-739:143-344) 510 520 530 540 550 560 pF1KE3 GGMSSLAGKKLKEGHETPMDIDSDDSKAASPPLKGSVSSEASELDKKEKGICVICMDTIS ::: . . : ..... : ::. :. CCDS41 EHPEMHRAGPPPLRAAPLLPPGARGLPPPPPPLPPPLPPRLREEAEEQESTCPICLGEIQ 120 130 140 150 160 170 570 580 590 600 610 620 pF1KE3 NKKVLPKCKHEFCAPCINKAMSYKPICPTCQTSYGIQKGNQPE-GSMVFTVSRDS---LP : :.: ::.: :: ::..:.. : :: : :: ::::. : :. ::.:. :: CCDS41 NAKTLEKCRHSFCEGCITRALQVKKACPMCGRFYGQLVGNQPQNGRML--VSKDATLLLP 180 190 200 210 220 230 630 640 650 660 670 680 pF1KE3 GYESFGTIVITYSMKAGIQTEEHPNPGKRYPGIQRTAYLPDNKEGRKVLKLLYRAFDQKL .::..::::: : . :.: :::::: :::: :.::::: :: ::: :. .::::.: CCDS41 SYEKYGTIVIQYVFPPGVQGAEHPNPGVRYPGTTRVAYLPDCPEGNKVLTLFRKAFDQRL 240 250 260 270 280 290 690 700 710 720 730 740 pF1KE3 IFTVGYSRVLGVSDVITWNDIHHKTSRFGGPEMYGYPDPSYLKRVKEELKAKGIE ::.: : . : .:::::::::::: :::...:::::.:: ::.:::.:::: CCDS41 TFTIGTSMTTGRPNVITWNDIHHKTSCTGGPQLFGYPDPTYLTRVQEELRAKGITDD 300 310 320 330 340 >>CCDS66410.1 DTX3 gene_id:196403|Hs108|chr12 (350 aa) initn: 742 init1: 532 opt: 751 Z-score: 702.1 bits: 139.5 E(32554): 8.7e-33 Smith-Waterman score: 751; 53.4% identity (71.6% similar) in 204 aa overlap (540-739:146-347) 510 520 530 540 550 560 pF1KE3 GGMSSLAGKKLKEGHETPMDIDSDDSKAASPPLKGSVSSEASELDKKEKGICVICMDTIS ::: . . : ..... : ::. :. CCDS66 EHPEMHRAGPPPLRAAPLLPPGARGLPPPPPPLPPPLPPRLREEAEEQESTCPICLGEIQ 120 130 140 150 160 170 570 580 590 600 610 620 pF1KE3 NKKVLPKCKHEFCAPCINKAMSYKPICPTCQTSYGIQKGNQPE-GSMVFTVSRDS---LP : :.: ::.: :: ::..:.. : :: : :: ::::. : :. ::.:. :: CCDS66 NAKTLEKCRHSFCEGCITRALQVKKACPMCGRFYGQLVGNQPQNGRML--VSKDATLLLP 180 190 200 210 220 230 630 640 650 660 670 680 pF1KE3 GYESFGTIVITYSMKAGIQTEEHPNPGKRYPGIQRTAYLPDNKEGRKVLKLLYRAFDQKL .::..::::: : . :.: :::::: :::: :.::::: :: ::: :. .::::.: CCDS66 SYEKYGTIVIQYVFPPGVQGAEHPNPGVRYPGTTRVAYLPDCPEGNKVLTLFRKAFDQRL 240 250 260 270 280 290 690 700 710 720 730 740 pF1KE3 IFTVGYSRVLGVSDVITWNDIHHKTSRFGGPEMYGYPDPSYLKRVKEELKAKGIE ::.: : . : .:::::::::::: :::...:::::.:: ::.:::.:::: CCDS66 TFTIGTSMTTGRPNVITWNDIHHKTSCTGGPQLFGYPDPTYLTRVQEELRAKGITDD 300 310 320 330 340 350 >>CCDS43605.1 DTX2 gene_id:113878|Hs108|chr7 (575 aa) initn: 560 init1: 274 opt: 521 Z-score: 487.7 bits: 100.5 E(32554): 7.6e-21 Smith-Waterman score: 528; 41.2% identity (60.6% similar) in 226 aa overlap (540-739:344-567) 510 520 530 540 550 560 pF1KE3 GGMSSLAGKKLKEGHETPMDIDSDDSKAASPPLKGSVSSEASELDKKEKGICVICMDTIS : . ... . :: :.:::. .: CCDS43 PGSVPATVPMQMPKPSRVQQALAGATPKPEPEPEQVIKNYTEELKVPPDEDCIICMEKLS 320 330 340 350 360 370 570 580 590 600 pF1KE3 ----------NKKV-------LPKCKHEFCAPCI-------NKAMSYKPICPTCQTSYGI .: . : ::.: : :. :: : . ::.:.: :: CCDS43 TASGYSDVTDSKAIGSLAVGHLTKCSHAFHLLCLLAMYCNGNKDGSLQ--CPSCKTIYGE 380 390 400 410 420 430 610 620 630 640 650 660 pF1KE3 QKGNQPEGSMVFTVSRDSLPGYESFGTIVITYSMKAGIQTEEHPNPGKRYP--GIQRTAY . :.::.:.: . ::::.:. :::.:.::. ::: ::::::: . :. : : CCDS43 KTGTQPQGKMEVLRFQMSLPGHEDCGTILIVYSIPHGIQGPEHPNPGKPFTARGFPRQCY 440 450 460 470 480 490 670 680 690 700 710 720 pF1KE3 LPDNKEGRKVLKLLYRAFDQKLIFTVGYSRVLGVSDVITWNDIHHKTSRFGGPEMYGYPD :::: .:::::.:: :. ..:::::: : . : .:...::.::::: . .:::: CCDS43 LPDNAQGRKVLELLKVAWKRRLIFTVGTSSTTGETDTVVWNEIHHKTEMDRNITGHGYPD 500 510 520 530 540 550 730 740 pF1KE3 PSYLKRVKEELKAKGIE :.::. : :: :.:. CCDS43 PNYLQNVLAELAAQGVTEDCLEQQ 560 570 >>CCDS5587.1 DTX2 gene_id:113878|Hs108|chr7 (622 aa) initn: 560 init1: 274 opt: 521 Z-score: 487.3 bits: 100.6 E(32554): 8.1e-21 Smith-Waterman score: 546; 38.3% identity (58.5% similar) in 277 aa overlap (490-739:343-614) 460 470 480 490 500 510 pF1KE3 KERKHLHQTKFADDFRKRHPNVHFVLNQESMTLTGLPNHLAKAKQYVLKGGGMSSLAGKK :. ::: :..: : . . : ::.:. CCDS55 SPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIGLPVCLSRAPQPT--SPPASRLASKS 320 330 340 350 360 370 520 530 540 550 560 pF1KE3 LKEGHET-PMDIDSDDSKAASPPLKGSVSSEASELDKKEKGICVICMDTIS--------- .. :.. . : : . ... . :: :.:::. .: CCDS55 HGSVKRLRKMSVKGATPKPEPEPEQ-VIKNYTEELKVPPDEDCIICMEKLSTASGYSDVT 380 390 400 410 420 570 580 590 600 610 pF1KE3 -NKKV-------LPKCKHEFCAPCI-------NKAMSYKPICPTCQTSYGIQKGNQPEGS .: . : ::.: : :. :: : . ::.:.: :: . :.::.:. CCDS55 DSKAIGSLAVGHLTKCSHAFHLLCLLAMYCNGNKDGSLQ--CPSCKTIYGEKTGTQPQGK 430 440 450 460 470 480 620 630 640 650 660 670 pF1KE3 MVFTVSRDSLPGYESFGTIVITYSMKAGIQTEEHPNPGKRYP--GIQRTAYLPDNKEGRK : . ::::.:. :::.:.::. ::: ::::::: . :. : ::::: .::: CCDS55 MEVLRFQMSLPGHEDCGTILIVYSIPHGIQGPEHPNPGKPFTARGFPRQCYLPDNAQGRK 490 500 510 520 530 540 680 690 700 710 720 730 pF1KE3 VLKLLYRAFDQKLIFTVGYSRVLGVSDVITWNDIHHKTSRFGGPEMYGYPDPSYLKRVKE ::.:: :. ..:::::: : . : .:...::.::::: . .:::::.::. : CCDS55 VLELLKVAWKRRLIFTVGTSSTTGETDTVVWNEIHHKTEMDRNITGHGYPDPNYLQNVLA 550 560 570 580 590 600 740 pF1KE3 ELKAKGIE :: :.:. CCDS55 ELAAQGVTEDCLEQQ 610 620 >>CCDS76408.1 DTX4 gene_id:23220|Hs108|chr11 (513 aa) initn: 489 init1: 279 opt: 508 Z-score: 476.4 bits: 98.3 E(32554): 3.2e-20 Smith-Waterman score: 518; 36.0% identity (58.9% similar) in 275 aa overlap (490-739:233-503) 460 470 480 490 500 510 pF1KE3 KERKHLHQTKFADDFRKRHPNVHFVLNQESMTLTGLPNHLAKAKQYVLKGGGMSSLAGKK :. .::: :.. . ::. .:. :. CCDS76 IASGVPTVPVKNLNGSSPVNPALAGITGILMSAAGLPVCLTRPPKLVLHPPPVSKSEIKS 210 220 230 240 250 260 520 530 540 550 560 570 pF1KE3 LKEGHETPMDIDSDDSKAASPPLKGSVSSEASELDKKEKGICVICMDTISNKK------- . .: . ..: .. : . ... ... . :.:::. .. . CCDS76 IPGVSNTSRKTTKKQAKKGKTPEE-VLKKYLQKVRHPPDEDCTICMERLTAPSGYKGPQP 270 280 290 300 310 320 580 590 600 610 pF1KE3 -VLP-------KCKHEFCAPCI-------NKAMSYKPICPTCQTSYGIQKGNQPEGSMVF : : .: : . :. :: : . ::::.: ::.. :.:: :.: . CCDS76 TVKPDLVGKLSRCGHVYHIYCLVAMYNNGNKDGSLQ--CPTCKTIYGVKTGTQPPGKMEY 330 340 350 360 370 620 630 640 650 660 670 pF1KE3 TVSRDSLPGYESFGTIVITYSMKAGIQTEEHPNPGKRYP--GIQRTAYLPDNKEGRKVLK . ::::. . :: : ::. ::: ::::::: . :. : ::::...:::::: CCDS76 HLIPHSLPGHPDCKTIRIIYSIPPGIQGPEHPNPGKSFSARGFPRHCYLPDSEKGRKVLK 380 390 400 410 420 430 680 690 700 710 720 730 pF1KE3 LLYRAFDQKLIFTVGYSRVLGVSDVITWNDIHHKTSRFGGPEM-YGYPDPSYLKRVKEEL :: :.:..:::..: : . : ::.. ::..:::: .::. .:::: .:: : :: CCDS76 LLLVAWDRRLIFAIGTSSTTGESDTVIWNEVHHKT-EFGSNLTGHGYPDANYLDNVLAEL 440 450 460 470 480 490 740 pF1KE3 KAKGIE :.:: CCDS76 AAQGISEDSTAQEKD 500 510 >>CCDS44612.1 DTX4 gene_id:23220|Hs108|chr11 (619 aa) initn: 489 init1: 279 opt: 508 Z-score: 475.3 bits: 98.3 E(32554): 3.7e-20 Smith-Waterman score: 518; 36.0% identity (58.9% similar) in 275 aa overlap (490-739:339-609) 460 470 480 490 500 510 pF1KE3 KERKHLHQTKFADDFRKRHPNVHFVLNQESMTLTGLPNHLAKAKQYVLKGGGMSSLAGKK :. .::: :.. . ::. .:. :. CCDS44 IASGVPTVPVKNLNGSSPVNPALAGITGILMSAAGLPVCLTRPPKLVLHPPPVSKSEIKS 310 320 330 340 350 360 520 530 540 550 560 570 pF1KE3 LKEGHETPMDIDSDDSKAASPPLKGSVSSEASELDKKEKGICVICMDTISNKK------- . .: . ..: .. : . ... ... . :.:::. .. . CCDS44 IPGVSNTSRKTTKKQAKKGKTPEE-VLKKYLQKVRHPPDEDCTICMERLTAPSGYKGPQP 370 380 390 400 410 420 580 590 600 610 pF1KE3 -VLP-------KCKHEFCAPCI-------NKAMSYKPICPTCQTSYGIQKGNQPEGSMVF : : .: : . :. :: : . ::::.: ::.. :.:: :.: . CCDS44 TVKPDLVGKLSRCGHVYHIYCLVAMYNNGNKDGSLQ--CPTCKTIYGVKTGTQPPGKMEY 430 440 450 460 470 480 620 630 640 650 660 670 pF1KE3 TVSRDSLPGYESFGTIVITYSMKAGIQTEEHPNPGKRYP--GIQRTAYLPDNKEGRKVLK . ::::. . :: : ::. ::: ::::::: . :. : ::::...:::::: CCDS44 HLIPHSLPGHPDCKTIRIIYSIPPGIQGPEHPNPGKSFSARGFPRHCYLPDSEKGRKVLK 490 500 510 520 530 540 680 690 700 710 720 730 pF1KE3 LLYRAFDQKLIFTVGYSRVLGVSDVITWNDIHHKTSRFGGPEM-YGYPDPSYLKRVKEEL :: :.:..:::..: : . : ::.. ::..:::: .::. .:::: .:: : :: CCDS44 LLLVAWDRRLIFAIGTSSTTGESDTVIWNEVHHKT-EFGSNLTGHGYPDANYLDNVLAEL 550 560 570 580 590 600 740 pF1KE3 KAKGIE :.:: CCDS44 AAQGISEDSTAQEKD 610 >>CCDS9164.1 DTX1 gene_id:1840|Hs108|chr12 (620 aa) initn: 475 init1: 271 opt: 498 Z-score: 466.1 bits: 96.6 E(32554): 1.2e-19 Smith-Waterman score: 499; 42.2% identity (63.1% similar) in 206 aa overlap (561-739:411-613) 540 550 560 570 pF1KE3 DSDDSKAASPPLKGSVSSEASELDKKEKGICVICMD----------TISNKKVLP----- :.:::. .. .: : : CCDS91 TKKKHLKKSKNPEDVVRRYMQKVKNPPDEDCTICMERLVTASGYEGVLRHKGVRPELVGR 390 400 410 420 430 440 580 590 600 610 620 pF1KE3 --KCKHEFCAPCI-------NKAMSYKPICPTCQTSYGIQKGNQPEGSMVFTVSRDSLPG .: : . :. :: : . ::::.. :: . :.:: :.: : . :::: CCDS91 LGRCGHMYHLLCLVAMYSNGNKDGSLQ--CPTCKAIYGEKTGTQPPGKMEFHLIPHSLPG 450 460 470 480 490 630 640 650 660 670 680 pF1KE3 YESFGTIVITYSMKAGIQTEEHPNPGKRYP--GIQRTAYLPDNKEGRKVLKLLYRAFDQK . . :: :.:.. .::: :::::::.. :. : :::.:..:::::.:: :.... CCDS91 FPDTQTIRIVYDIPTGIQGPEHPNPGKKFTARGFPRHCYLPNNEKGRKVLRLLITAWERR 500 510 520 530 540 550 690 700 710 720 730 740 pF1KE3 LIFTVGYSRVLGVSDVITWNDIHHKTSRFGGPEM-YGYPDPSYLKRVKEELKAKGIE ::::.: : . : ::...::.::::: .::. .:::: ::: : :: :.:. CCDS91 LIFTIGTSNTTGESDTVVWNEIHHKT-EFGSNLTGHGYPDASYLDNVLAELTAQGVSEAA 560 570 580 590 600 610 CCDS91 AKA 620 740 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 17:42:32 2016 done: Sun Nov 6 17:42:33 2016 Total Scan time: 3.580 Total Display time: 0.080 Function used was FASTA [36.3.4 Apr, 2011]