FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3888, 304 aa
  1>>>pF1KE3888 304 - 304 aa - 304 aa
Library: human.CCDS.faa
  18921897 residues in 33420 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6604+/-0.000645; mu= 14.4844+/- 0.039
 mean_var=76.1269+/-14.883, 0's: 0 Z-trim(111.3): 12 B-trim: 0 in 0/53
 Lambda= 0.146996
 statistics sampled from 12427 (12436) to 12427 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.75), E-opt: 0.2 (0.372), width: 16
 Scan time: 1.240

The best scores are:                                   opt  bits E(33420)
CCDS10592.1 DCUN1D3 gene_id:123879|Hs109|chr16 ( 304) 2093 452.7 1.7e-127
CCDS32013.1 DCUN1D2 gene_id:55208|Hs109|chr13  ( 259)  585 132.8 2.7e-31
CCDS77862.1 DCUN1D1 gene_id:54165|Hs109|chr3   ( 244)  564 128.4 5.6e-30
CCDS3240.1 DCUN1D1 gene_id:54165|Hs109|chr3    ( 259)  564 128.4 5.9e-30
CCDS8325.1 DCUN1D5 gene_id:84259|Hs109|chr11   ( 237)  397  93.0 2.5e-19
CCDS33982.1 DCUN1D4 gene_id:23142|Hs109|chr4   ( 292)  396  92.8 3.5e-19
CCDS75123.1 DCUN1D4 gene_id:23142|Hs109|chr4   ( 336)  396  92.8 3.9e-19

>>CCDS10592.1 DCUN1D3 gene_id:123879|Hs109|chr16 (304 aa)
 initn: 2093 init1: 2093 opt: 2093 Z-score: 2402.9 bits: 452.7 E(33420): 1.7e-127
Smith-Waterman score: 2093; 100.0% identity (100.0% similar) in 304 aa overlap (1-304:1-304)

               10        20        30        40        50        60
pF1KE3 MGQCVTKCKNPSSTLGSKNGDREPSNKSHSRRGAGHREEQVPPCGKPGGDILVNGTKKAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MGQCVTKCKNPSSTLGSKNGDREPSNKSHSRRGAGHREEQVPPCGKPGGDILVNGTKKAE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 AATEACQLPTSSGDAGRESKSNAEESSLQRLEELFRRYKDEREDAILEEGMERFCNDLCV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 AATEACQLPTSSGDAGRESKSNAEESSLQRLEELFRRYKDEREDAILEEGMERFCNDLCV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 DPTEFRVLLLAWKFQAATMCKFTRKEFFDGCKAISADSIDGICARFPSLLTEAKQEDKFK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 DPTEFRVLLLAWKFQAATMCKFTRKEFFDGCKAISADSIDGICARFPSLLTEAKQEDKFK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE3 DLYRFTFQFGLDSEEGQRSLHREIAIALWKLVFTQNNPPVLDQWLNFLTENPSGIKGISR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 DLYRFTFQFGLDSEEGQRSLHREIAIALWKLVFTQNNPPVLDQWLNFLTENPSGIKGISR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE3 DTWNMFLNFTQVIGPDLSNYSEDEAWPSLFDTFVEWEMERRKREGEGRGALSSGPEGLCP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 DTWNMFLNFTQVIGPDLSNYSEDEAWPSLFDTFVEWEMERRKREGEGRGALSSGPEGLCP
              250       260       270       280       290       300

pF1KE3 EEQT
       ::::
CCDS10 EEQT

>>CCDS32013.1 DCUN1D2 gene_id:55208|Hs109|chr13 (259 aa)
 initn: 550 init1: 280 opt: 585 Z-score: 675.5 bits: 132.8 E(33420): 2.7e-31
Smith-Waterman score: 585; 44.5% identity (71.6% similar) in 218 aa overlap (60-276:39-246)
30 40 50 60 70 80 pF1KE3 SRRGAGHREEQVPPCGKPGGDILVNGTKKAEAATEACQLPTSSGDAGRESKSNAEESSLQ ::. : : : ::: :: .. . CCDS32 KDKVRQFMACTQAGERTAIYCLTQNEWRLDEATDSFFQNPDS---LHRESMRNAVDK--K 10 20 30 40 50 60 90 100 110 120 130 140 pF1KE3 RLEELFRRYKDER-EDAILEEGMERFCNDLCVDPTEFRVLLLAWKFQAATMCKFTRKEFF .::.:. :::: . :. : .:...::.:: .::. . ::..::::.:::.:.:.::::. CCDS32 KLERLYGRYKDPQDENKIGVDGIQQFCDDLSLDPASISVLVIAWKFRAATQCEFSRKEFL 70 80 90 100 110 120 150 160 170 180 190 200 pF1KE3 DGCKAISADSIDGICARFPSLLTEAKQEDKFKDLYRFTFQFGLDSEEGQRSLHREIAIAL :: .. ::.. . : .: : : :. ::::.:.::: :. .. ::..: :.:.: CCDS32 DGMTELGCDSMEKLKALLPRLEQELKDTAKFKDFYQFTFTFA--KNPGQKGLDLEMAVAY 130 140 150 160 170 180 210 220 230 240 250 260 pF1KE3 WKLVFTQNNPPVLDQWLNFLTENPSGIKGISRDTWNMFLNFTQVIGPDLSNYSEDEAWPS ::::.. . :: : .:: :. . ..: :::::..:.: ..:. :.:::.:. ::: CCDS32 WKLVLS-GRFKFLDLWNTFLMEHHK--RSIPRDTWNLLLDFGNMIADDMSNYDEEGAWPV 190 200 210 220 230 270 280 290 300 pF1KE3 LFDTFVEWEMERRKREGEGRGALSSGPEGLCPEEQT :.: :::.
CCDS32 LIDDFVEYARPVVTGGKRSLF 240 250

>>CCDS77862.1 DCUN1D1 gene_id:54165|Hs109|chr3 (244 aa)
 initn: 556 init1: 295 opt: 564 Z-score: 651.9 bits: 128.4 E(33420): 5.6e-30
Smith-Waterman score: 564; 43.9% identity (77.2% similar) in 189 aa overlap (89-276:48-231)
60 70 80 90 100 110 pF1KE3 AEAATEACQLPTSSGDAGRESKSNAEESSLQRLEELFRRYKDER-EDAILEEGMERFCND ..::.:. :::: . :. : .:...::.: CCDS77 NDWKLDVATDNFFQNPELYIRESVKGSLDRKKLEQLYNRYKDPQDENKIGIDGIQQFCDD 20 30 40 50 60 70 120 130 140 150 160 170 pF1KE3 LCVDPTEFRVLLLAWKFQAATMCKFTRKEFFDGCKAISADSIDGICARFPSLLTEAKQED : .::. . ::..::::.:::.:.:...::.:: .. :::. . :..:.. : :. CCDS77 LALDPASISVLIIAWKFRAATQCEFSKQEFMDGMTELGCDSIEKLKAQIPKMEQELKEPG 80 90 100 110 120 130 180 190 200 210 220 230 pF1KE3 KFKDLYRFTFQFGLDSEEGQRSLHREIAIALWKLVFTQNNPPVLDQWLNFLTENPSGIKG .:::.:.:::.:. .. ::..: :.::: :.::.. . :: : .:: :. . .. CCDS77 RFKDFYQFTFNFA--KNPGQKGLDLEMAIAYWNLVLN-GRFKFLDLWNKFLLEHHK--RS 140 150 160 170 180 190 240 250 260 270 280 290 pF1KE3 ISRDTWNMFLNFTQVIGPDLSNYSEDEAWPSLFDTFVEWEMERRKREGEGRGALSSGPEG : .::::..:.:. .:. :.:::.:. ::: :.: :::. CCDS77 IPKDTWNLLLDFSTMIADDMSNYDEEGAWPVLIDDFVEFARPQIAGTKSTTV 200 210 220 230 240 300 pF1KE3 LCPEEQT

>>CCDS3240.1 DCUN1D1 gene_id:54165|Hs109|chr3 (259 aa)
 initn: 556 init1: 295 opt: 564 Z-score: 651.5 bits: 128.4 E(33420): 5.9e-30
Smith-Waterman score: 564; 43.9% identity (77.2% similar) in 189 aa overlap (89-276:63-246)
60 70 80 90 100 110 pF1KE3 AEAATEACQLPTSSGDAGRESKSNAEESSLQRLEELFRRYKDER-EDAILEEGMERFCND ..::.:. :::: . :. : .:...::.: CCDS32 NDWKLDVATDNFFQNPELYIRESVKGSLDRKKLEQLYNRYKDPQDENKIGIDGIQQFCDD 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE3 LCVDPTEFRVLLLAWKFQAATMCKFTRKEFFDGCKAISADSIDGICARFPSLLTEAKQED : .::. . ::..::::.:::.:.:...::.:: .. :::. . :..:.. : :. CCDS32 LALDPASISVLIIAWKFRAATQCEFSKQEFMDGMTELGCDSIEKLKAQIPKMEQELKEPG 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE3 KFKDLYRFTFQFGLDSEEGQRSLHREIAIALWKLVFTQNNPPVLDQWLNFLTENPSGIKG .:::.:.:::.:. .. ::..: :.::: :.::.. . :: : .:: :. . ..
CCDS32 RFKDFYQFTFNFA--KNPGQKGLDLEMAIAYWNLVLN-GRFKFLDLWNKFLLEHHK--RS 160 170 180 190 200 240 250 260 270 280 290 pF1KE3 ISRDTWNMFLNFTQVIGPDLSNYSEDEAWPSLFDTFVEWEMERRKREGEGRGALSSGPEG : .::::..:.:. .:. :.:::.:. ::: :.: :::. CCDS32 IPKDTWNLLLDFSTMIADDMSNYDEEGAWPVLIDDFVEFARPQIAGTKSTTV 210 220 230 240 250 300 pF1KE3 LCPEEQT

>>CCDS8325.1 DCUN1D5 gene_id:84259|Hs109|chr11 (237 aa)
 initn: 357 init1: 224 opt: 397 Z-score: 460.6 bits: 93.0 E(33420): 2.5e-19
Smith-Waterman score: 397; 32.8% identity (67.2% similar) in 201 aa overlap (81-281:41-235)
60 70 80 90 100 110 pF1KE3 ILVNGTKKAEAATEACQLPTSSGDAGRESKSNAEESSLQRLEELFRRYKDEREDAILEEG :. :. : .. : .: : .. :: CCDS83 GVAAAVAEDGGLKKCKISSYCRSQPPARLISGEEHFSSKKCLAWFYEYAGPDE-VVGPEG 20 30 40 50 60 120 130 140 150 160 170 pF1KE3 MERFCNDLCVDPTEFRVLLLAWKFQAATMCKFTRKEFFDGCKAISADSIDGICARFPSLL ::.::.:. :.: .. .:.::::..: .: ::..:.. : ... : . . .: : CCDS83 MEKFCEDIGVEPENIIMLVLAWKLEAESMGFFTKEEWLKGMTSLQCDCTEKLQNKFDFLR 70 80 90 100 110 120 180 190 200 210 220 230 pF1KE3 TEAKQEDKFKDLYRFTFQFGLDSEEGQRSLHREIAIALWKLVFTQNNPPVLDQWLNFLTE .. .. ..::..::..:.:. :.. :::: . : .. :.. .. : ... . ..: . CCDS83 SQLNDISSFKNIYRYAFDFARDKD--QRSLDIDTAKSMLALLLGRTWP-LFSVFYQYLEQ 130 140 150 160 170 180 240 250 260 270 280 290 pF1KE3 NPSGIKGISRDTWNMFLNFTQVIGPDLSNYSEDEAWPSLFDTFVEWEMERRKREGEGRGA : . ...: : :.:.... :::::.:: ::: :.: ::::. :. CCDS83 --SKYRVMNKDQWYNVLEFSRTVHADLSNYDEDGAWPVLLDEFVEWQKVRQTS 190 200 210 220 230 300 pF1KE3 LSSGPEGLCPEEQT

>>CCDS33982.1 DCUN1D4 gene_id:23142|Hs109|chr4 (292 aa)
 initn: 332 init1: 200 opt: 396 Z-score: 458.1 bits: 92.8 E(33420): 3.5e-19
Smith-Waterman score: 396; 35.3% identity (63.2% similar) in 204 aa overlap (80-281:95-290)
50 60 70 80 90 100 pF1KE3 DILVNGTKKAEAATEACQLPTSSGDAGRESKSNAEESSLQRLEELFRRYKDEREDAILEE :.. : : .: : : .: .:.. : CCDS33 KKRRPASGDDLSAKKSRHDSMYRKYDSTRIKTEEEAFSSKRCLEWFYEYAGT-DDVVGPE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE3 GMERFCNDLCVDPTEFRVLLLAWKFQAATMCKFTRKEFFDGCKAISADSIDGICARFPSL :::.::.:. :.: .
.:.::::..: .: :: .:.. : ... :. . . . : CCDS33 GMEKFCEDIGVEPENVVMLVLAWKLDAQNMGYFTLQEWLKGMTSLQCDTTEKLRNTLDYL 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE3 LTEAKQEDKFKDLYRFTFQFGLDSEEGQRSLHREIAIALWKLVFTQNNP--PVLDQWLNF . .. .:: .::..:.:. :. :::: . : . :.. . : ::. :.: CCDS33 RSFLNDSTNFKLIYRYAFDFA--REKDQRSLDINTAKCMLGLLLGKIWPLFPVFHQFL-- 190 200 210 220 230 230 240 250 260 270 280 pF1KE3 LTENPSGIKGISRDTWNMFLNFTQVIGPDLSNYSEDEAWPSLFDTFVEWEMERRKREGEG . : : :..: : :.:...:. :::::.:: ::: :.: :::: ... CCDS33 ---EQSKYKVINKDQWCNVLEFSRTINLDLSNYDEDGAWPVLLDEFVEWYKDKQMS 240 250 260 270 280 290 290 300 pF1KE3 RGALSSGPEGLCPEEQT

>>CCDS75123.1 DCUN1D4 gene_id:23142|Hs109|chr4 (336 aa)
 initn: 332 init1: 200 opt: 396 Z-score: 457.2 bits: 92.8 E(33420): 3.9e-19
Smith-Waterman score: 396; 35.3% identity (63.2% similar) in 204 aa overlap (80-281:139-334)
50 60 70 80 90 100 pF1KE3 DILVNGTKKAEAATEACQLPTSSGDAGRESKSNAEESSLQRLEELFRRYKDEREDAILEE :.. : : .: : : .: .:.. : CCDS75 KKRRPASGDDLSAKKSRHDSMYRKYDSTRIKTEEEAFSSKRCLEWFYEYAGT-DDVVGPE 110 120 130 140 150 160 110 120 130 140 150 160 pF1KE3 GMERFCNDLCVDPTEFRVLLLAWKFQAATMCKFTRKEFFDGCKAISADSIDGICARFPSL :::.::.:. :.: . .:.::::..: .: :: .:.. : ... :. . . . : CCDS75 GMEKFCEDIGVEPENVVMLVLAWKLDAQNMGYFTLQEWLKGMTSLQCDTTEKLRNTLDYL 170 180 190 200 210 220 170 180 190 200 210 220 pF1KE3 LTEAKQEDKFKDLYRFTFQFGLDSEEGQRSLHREIAIALWKLVFTQNNP--PVLDQWLNF . .. .:: .::..:.:. :. :::: . : . :.. . : ::. :.: CCDS75 RSFLNDSTNFKLIYRYAFDFA--REKDQRSLDINTAKCMLGLLLGKIWPLFPVFHQFL-- 230 240 250 260 270 280 230 240 250 260 270 280 pF1KE3 LTENPSGIKGISRDTWNMFLNFTQVIGPDLSNYSEDEAWPSLFDTFVEWEMERRKREGEG . : : :..: : :.:...:. :::::.:: ::: :.: :::: ...
CCDS75 ---EQSKYKVINKDQWCNVLEFSRTINLDLSNYDEDGAWPVLLDEFVEWYKDKQMS 290 300 310 320 330 290 300 pF1KE3 RGALSSGPEGLCPEEQT

304 residues in 1 query sequences
18921897 residues in 33420 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Aug 4 20:41:31 2021 done: Wed Aug 4 20:41:31 2021
 Total Scan time: 1.240 Total Display time: 0.040

Function used was FASTA [36.3.4 Apr, 2011]
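
The entries under "The best scores are:" in this report share one layout: an accession, a free-text description, the library sequence length in parentheses, then the opt score, bit score and E-value. A minimal Python sketch for pulling those fields out of a single summary line is given here. The names `Hit`, `HIT_RE` and `parse_hit_line` are hypothetical helpers introduced for illustration and are not part of the FASTA package; the regular expression is an assumption based only on the seven lines shown in this report and may need adjusting for other output modes.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'best scores' table (hypothetical container)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed layout: ACCESSION DESCRIPTION ( LEN) OPT BITS EVALUE
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<e>\S+)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None if the line does not match."""
    m = HIT_RE.match(line.strip())
    if m is None:
        return None
    try:
        evalue = float(m.group("e"))
    except ValueError:
        # Last field was not a number, so this is not a summary row.
        return None
    return Hit(
        accession=m.group("acc"),
        description=m.group("desc"),
        length=int(m.group("len")),
        opt=int(m.group("opt")),
        bits=float(m.group("bits")),
        evalue=evalue,
    )


if __name__ == "__main__":
    row = "CCDS32013.1 DCUN1D2 gene_id:55208|Hs109|chr13 ( 259) 585 132.8 2.7e-31"
    print(parse_hit_line(row))
```

Alignment header lines (those starting with `>>`) end in `(259 aa)` rather than `( 259)` followed by scores, so the sketch returns `None` for them instead of misreading them as table rows.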