FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5213, 244 aa 1>>>pF1KE5213 244 - 244 aa - 244 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6401+/-0.000824; mu= 11.8702+/- 0.049 mean_var=57.0939+/-11.479, 0's: 0 Z-trim(106.4): 22 B-trim: 0 in 0/49 Lambda= 0.169738 statistics sampled from 8927 (8940) to 8927 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.662), E-opt: 0.2 (0.275), width: 16 Scan time: 1.690 The best scores are: opt bits E(32554) CCDS11093.1 CTDNEP1 gene_id:23399|Hs108|chr17 ( 244) 1623 405.5 1.7e-113 CCDS33734.1 CTDSPL gene_id:10217|Hs108|chr3 ( 276) 462 121.2 7.2e-28 CCDS33735.1 CTDSPL gene_id:10217|Hs108|chr3 ( 265) 457 120.0 1.6e-27 CCDS56166.1 CTDSP1 gene_id:58190|Hs108|chr2 ( 260) 436 114.8 5.6e-26 CCDS2416.1 CTDSP1 gene_id:58190|Hs108|chr2 ( 261) 432 113.9 1.1e-25 CCDS41801.1 CTDSP2 gene_id:10106|Hs108|chr12 ( 271) 415 109.7 2.1e-24 CCDS10110.1 CTDSPL2 gene_id:51496|Hs108|chr15 ( 466) 406 107.5 1.6e-23 >>CCDS11093.1 CTDNEP1 gene_id:23399|Hs108|chr17 (244 aa) initn: 1623 init1: 1623 opt: 1623 Z-score: 2151.3 bits: 405.5 E(32554): 1.7e-113 Smith-Waterman score: 1623; 100.0% identity (100.0% similar) in 244 aa overlap (1-244:1-244) 10 20 30 40 50 60 pF1KE5 MMRTQCLLGLRTFVAFAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPLSPVSRNRLAQVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MMRTQCLLGLRTFVAFAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPLSPVSRNRLAQVK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 RKILVLDLDETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 RKILVLDLDETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 VSQWYELVVFTASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 VSQWYELVVFTASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 SIVILDNSPGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 SIVILDNSPGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQ 190 200 210 220 230 240 pF1KE5 HRLW :::: CCDS11 HRLW >>CCDS33734.1 CTDSPL gene_id:10217|Hs108|chr3 (276 aa) initn: 443 init1: 237 opt: 462 Z-score: 613.8 bits: 121.2 E(32554): 7.2e-28 Smith-Waterman score: 501; 41.4% identity (69.7% similar) in 198 aa overlap (45-236:84-272) 20 30 40 50 60 pF1KE5 AFAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPL-SPVSRNRLAQVK-----RKILVLDL ..:. :: .. : .: .: .:.:: CCDS33 NVEAPPPSSPSVLPPLVEENGGLQKGDQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDL 60 70 80 90 100 110 70 80 90 100 110 120 pF1KE5 DETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELV ::::.:: . .: . :::. : :: . .: :::::: ::. ..: .: : CCDS33 DETLVHS--------SFKPISNADFIVPVEIDGTIHQVYVLKRPHVDEFLQRMGQLFECV 120 130 140 150 160 130 140 150 160 170 180 pF1KE5 VFTASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNS .::::. :.. ::: :: .... : .:. :... :.:.:::: . .::...:.::: CCDS33 LFTASLAKYADPVADLLDR-WGVFRARLFRESCVFHRGNYVKDLSRLGRELSKVIIVDNS 170 180 190 200 210 220 190 200 210 220 230 240 pF1KE5 PGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW :..: ::.::.:..:::.: .:: ::.:.:....: :: :.: : CCDS33 PASYIFHPENAVPVQSWFDDMTDTELLDLIPFFEGLSREDDVYSMLHRLCNR 230 240 250 260 270 >>CCDS33735.1 CTDSPL gene_id:10217|Hs108|chr3 (265 aa) initn: 443 init1: 237 opt: 457 Z-score: 607.5 bits: 120.0 E(32554): 1.6e-27 Smith-Waterman score: 494; 43.8% identity (72.2% similar) in 176 aa overlap (61-236:95-261) 40 50 60 70 80 90 pF1KE5 QIRTVIQYQTVRYDILPLSPVSRNRLAQVKRKILVLDLDETLIHSHHDGVLRPTVRPGTP .: .:.::::::.:: . .: . CCDS33 VLPPLVEENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHS--------SFKPISN 70 80 90 100 110 100 110 120 130 140 150 pF1KE5 PDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELVVFTASMEIYGSAVADKLDNSRS :::. : :: . .: :::::: ::. ..: .: :.::::. :.. ::: :: . CCDS33 ADFIVPVEIDGTIHQVYVLKRPHVDEFLQRMGQLFECVLFTASLAKYADPVADLLDR-WG 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE5 ILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNSPGAYRSHPDNAIPIKSWFSDPS ... : .:. :... :.:.:::: . .::...:.::::..: ::.::.:..:::.: . CCDS33 VFRARLFRESCVFHRGNYVKDLSRLGRELSKVIIVDNSPASYIFHPENAVPVQSWFDDMT 180 190 200 210 220 230 220 230 240 pF1KE5 DTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW :: ::.:.:....: :: :.: : CCDS33 DTELLDLIPFFEGLSREDDVYSMLHRLCNR 240 250 260 >>CCDS56166.1 CTDSP1 gene_id:58190|Hs108|chr2 (260 aa) initn: 476 init1: 283 opt: 436 Z-score: 579.9 bits: 114.8 E(32554): 5.6e-26 Smith-Waterman score: 478; 43.5% identity (68.4% similar) in 193 aa overlap (46-234:70-253) 20 30 40 50 60 70 pF1KE5 FAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPLSPVSR---NRLAQVKRKI-LVLDLDET .: .::. . :: . :: .:.::::: CCDS56 HSLFCCVCRDDGEALPAHSGAPLLVEENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDET 40 50 60 70 80 90 80 90 100 110 120 130 pF1KE5 LIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELVVFT :.:: . .: . :::. : :: . .: :::::: ::. ... .: :.:: CCDS56 LVHS--------SFKPVNNADFIIPVEIDGVVHQVYVLKRPHVDEFLQRMGELFECVLFT 100 110 120 130 140 150 140 150 160 170 180 190 pF1KE5 ASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNSPGA ::. :.. ::: ::. . .. : .:. :... :.:.:::: . :: ..::::::.. CCDS56 ASLAKYADPVADLLDK-WGAFRARLFRESCVFHRGNYVKDLSRLGRDLRRVLILDNSPAS 160 170 180 190 200 210 200 210 220 230 240 pF1KE5 YRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW : :::::.:. :::.. ::: : .:::... : . :: ::: CCDS56 YVFHPDNAVPVASWFDNMSDTELHDLLPFFEQLSRVDDVYSVLRQPRPGS 220 230 240 250 260 >>CCDS2416.1 CTDSP1 gene_id:58190|Hs108|chr2 (261 aa) initn: 476 init1: 283 opt: 432 Z-score: 574.6 bits: 113.9 E(32554): 1.1e-25 Smith-Waterman score: 474; 45.3% identity (69.8% similar) in 179 aa overlap (57-234:85-254) 30 40 50 60 70 80 pF1KE5 LLRRQIRTVIQYQTVRYDILPLSPVSRNRLAQVKRKI-LVLDLDETLIHSHHDGVLRPTV :: . :: .:.::::::.:: . CCDS24 PAHSGAPLLVEENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHS--------SF 60 70 80 90 100 90 100 110 120 130 140 pF1KE5 RPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELVVFTASMEIYGSAVADKL .: . :::. : :: . .: :::::: ::. ... .: :.::::. :.. ::: : CCDS24 KPVNNADFIIPVEIDGVVHQVYVLKRPHVDEFLQRMGELFECVLFTASLAKYADPVADLL 110 120 130 140 150 160 150 160 170 180 190 200 pF1KE5 DNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNSPGAYRSHPDNAIPIKSW :. . .. : .:. :... :.:.:::: . :: ..::::::..: :::::.:. :: CCDS24 DK-WGAFRARLFRESCVFHRGNYVKDLSRLGRDLRRVLILDNSPASYVFHPDNAVPVASW 170 180 190 200 210 220 210 220 230 240 pF1KE5 FSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW :.. ::: : .:::... : . :: ::: CCDS24 FDNMSDTELHDLLPFFEQLSRVDDVYSVLRQPRPGS 230 240 250 260 >>CCDS41801.1 CTDSP2 gene_id:10106|Hs108|chr12 (271 aa) initn: 402 init1: 228 opt: 415 Z-score: 551.8 bits: 109.7 E(32554): 2.1e-24 Smith-Waterman score: 454; 40.9% identity (69.9% similar) in 176 aa overlap (61-236:101-267) 40 50 60 70 80 90 pF1KE5 QIRTVIQYQTVRYDILPLSPVSRNRLAQVKRKILVLDLDETLIHSHHDGVLRPTVRPGTP : .:.::::::.:: . .: . CCDS41 KSDLLQCLQYQFYQIPGTCLLPEVTEEDQGRICVVIDLDETLVHS--------SFKPINN 80 90 100 110 120 100 110 120 130 140 150 pF1KE5 PDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELVVFTASMEIYGSAVADKLDNSRS :::. . :. . .: :::.:: ::. ... .: :.::::. :.. :.: :: . CCDS41 ADFIVPIEIEGTTHQVYVLKRPYVDEFLRRMGELFECVLFTASLAKYADPVTDLLDRC-G 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE5 ILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNSPGAYRSHPDNAIPIKSWFSDPS ... : .:. :... : :.:::: . :: . .::::::..: ::.::.:..:::.: . CCDS41 VFRARLFRESCVFHQGCYVKDLSRLGRDLRKTLILDNSPASYIFHPENAVPVQSWFDDMA 190 200 210 220 230 240 220 230 240 pF1KE5 DTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW :: ::::.:... : . :: . :.. CCDS41 DTELLNLIPIFEELSGAEDVYTSLGQLRAP 250 260 270 >>CCDS10110.1 CTDSPL2 gene_id:51496|Hs108|chr15 (466 aa) initn: 426 init1: 354 opt: 406 Z-score: 535.8 bits: 107.5 E(32554): 1.6e-23 Smith-Waterman score: 424; 34.7% identity (63.6% similar) in 225 aa overlap (21-243:252-462) 10 20 30 40 pF1KE5 MMRTQCLLGLRTFVAFAAKLWSFF-IYLLRRQIRTVIQYQTVRYDILPLS : : : . ... . . : : :::. CCDS10 VNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLK 230 240 250 260 270 280 50 60 70 80 90 100 pF1KE5 PVSRNRLAQVKRKILVLDLDETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVH : ... :::::::::.: . . .. : : .. :. . .:. CCDS10 TRSTPEFS------LVLDLDETLVHCSLNELEDAAL---TFPVLFQDVI-----YQVYVR 290 300 310 320 110 120 130 140 150 160 pF1KE5 KRPHVDFFLEVVSQWYELVVFTASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYI :: ::: .:: ::...:::: ..:.. . . :: .......: .:.::. :.:: CCDS10 LRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFREHCVCVQGNYI 330 340 350 360 370 380 170 180 190 200 210 220 pF1KE5 KDLSVVHSDLSSIVILDNSPGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDAL-RFTA :::... :::. .:.:::: :. . .:.:::.::: : .:. ::.:.:.:. : ... CCDS10 KDLNILGRDLSKTIIIDNSPQAFAYQLSNGIPIESWFMDKNDNELLKLIPFLEKLVELNE 390 400 410 420 430 440 230 240 pF1KE5 DVRSVLSRNLHQHRLW ::: . .. : : CCDS10 DVRPHIRDRFRLHDLLPPD 450 460 244 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:33:52 2016 done: Mon Nov 7 22:33:53 2016 Total Scan time: 1.690 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]