FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6648, 223 aa 1>>>pF1KE6648 223 - 223 aa - 223 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4864+/-0.000296; mu= 14.5922+/- 0.018 mean_var=118.0230+/-22.271, 0's: 0 Z-trim(121.2): 123 B-trim: 61 in 1/55 Lambda= 0.118057 statistics sampled from 37377 (37586) to 37377 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.793), E-opt: 0.2 (0.441), width: 16 Scan time: 5.330 The best scores are: opt bits E(85289) NP_115934 (OMIM: 217095,605194,605376,613853) cryp ( 223) 1631 287.9 9.3e-78 NP_001257349 (OMIM: 217095,605194,605376,613853) c ( 191) 585 109.6 3.6e-24 XP_011509788 (OMIM: 217095,605194,605376,613853) P ( 142) 565 106.1 3.1e-23 NP_001257350 (OMIM: 217095,605194,605376,613853) c ( 148) 565 106.1 3.2e-23 NP_001167607 (OMIM: 187395) teratocarcinoma-derive ( 172) 319 64.3 1.5e-10 NP_003203 (OMIM: 187395) teratocarcinoma-derived g ( 188) 319 64.3 1.5e-10 NP_963845 (OMIM: 602319) protein kinase C-binding ( 763) 186 42.4 0.0025 >>NP_115934 (OMIM: 217095,605194,605376,613853) cryptic (223 aa) initn: 1631 init1: 1631 opt: 1631 Z-score: 1517.0 bits: 287.9 E(85289): 9.3e-78 Smith-Waterman score: 1631; 100.0% identity (100.0% similar) in 223 aa overlap (1-223:1-223) 10 20 30 40 50 60 pF1KE6 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_115 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VTGSAEGWGPEEPLPYSRAFGEGASARPRCCRNGGTCVLGSFCVCPAHFTGRYCEHDQRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_115 VTGSAEGWGPEEPLPYSRAFGEGASARPRCCRNGGTCVLGSFCVCPAHFTGRYCEHDQRR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 SECGALEHGAWTLRACHLCRCIFGALHCLPLQTPDRCDPKDFLASHAHGPSAGGAPSLLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_115 SECGALEHGAWTLRACHLCRCIFGALHCLPLQTPDRCDPKDFLASHAHGPSAGGAPSLLL 130 140 150 160 170 180 190 200 210 220 pF1KE6 LLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL ::::::::::::::::::::::::::::::::::::::::::: NP_115 LLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL 190 200 210 220 >>NP_001257349 (OMIM: 217095,605194,605376,613853) crypt (191 aa) initn: 623 init1: 565 opt: 585 Z-score: 554.9 bits: 109.6 E(85289): 3.6e-24 Smith-Waterman score: 585; 58.2% identity (68.1% similar) in 182 aa overlap (1-177:1-176) 10 20 30 40 50 60 pF1KE6 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 VTGSAEGWGPEEPLPYSRAFGEGASARPRCCRNG----GTCVLGSFCVCPAHFTGRYCEH :::::::::::::::::::::: ..: : . : .: . .: : : . : NP_001 VTGSAEGWGPEEPLPYSRAFGE-VNAAPWSTEPGPSAPATSAGASSGPCTASPSRRLTAV 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 DQRRSECGALEHGAWTLR-ACHLCRCIFGALHCLPLQTPDRCDPKDFLASHAHGPSAGGA .. : .: : : ::. : : : : . : :. . . :..:: NP_001 TRKTSWPPTLTGRAPGARPACYSC-C--PAHSCT--ASCARMRPRTLGPWSLPSSSGSGA 120 130 140 150 160 170 180 190 200 210 220 pF1KE6 PSLLLLLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL :. NP_001 PAEGRDLGIAFNFLCCK 180 190 >>XP_011509788 (OMIM: 217095,605194,605376,613853) PREDI (142 aa) initn: 565 init1: 565 opt: 565 Z-score: 538.0 bits: 106.1 E(85289): 3.1e-23 Smith-Waterman score: 565; 100.0% identity (100.0% similar) in 82 aa overlap (1-82:1-82) 10 20 30 40 50 60 pF1KE6 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VTGSAEGWGPEEPLPYSRAFGEGASARPRCCRNGGTCVLGSFCVCPAHFTGRYCEHDQRR :::::::::::::::::::::: XP_011 VTGSAEGWGPEEPLPYSRAFGEDLENCDLNVLKINIPSQDNSGLLLGFSQTPRPQEAPRR 70 80 90 100 110 120 >>NP_001257350 (OMIM: 217095,605194,605376,613853) crypt (148 aa) initn: 565 init1: 565 opt: 565 Z-score: 537.8 bits: 106.1 E(85289): 3.2e-23 Smith-Waterman score: 876; 66.4% identity (66.4% similar) in 223 aa overlap (1-223:1-148) 10 20 30 40 50 60 pF1KE6 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VTGSAEGWGPEEPLPYSRAFGEGASARPRCCRNGGTCVLGSFCVCPAHFTGRYCEHDQRR :::::::::::::::::::::: NP_001 VTGSAEGWGPEEPLPYSRAFGE-------------------------------------- 70 80 130 140 150 160 170 180 pF1KE6 SECGALEHGAWTLRACHLCRCIFGALHCLPLQTPDRCDPKDFLASHAHGPSAGGAPSLLL ::::::::::::::::::::::: NP_001 -------------------------------------DPKDFLASHAHGPSAGGAPSLLL 90 100 190 200 210 220 pF1KE6 LLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL ::::::::::::::::::::::::::::::::::::::::::: NP_001 LLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL 110 120 130 140 >>NP_001167607 (OMIM: 187395) teratocarcinoma-derived gr (172 aa) initn: 336 init1: 316 opt: 319 Z-score: 310.6 bits: 64.3 E(85289): 1.5e-10 Smith-Waterman score: 319; 37.7% identity (58.5% similar) in 130 aa overlap (68-184:36-165) 40 50 60 70 80 pF1KE6 VTKVATQKHRQSPLNWTSSHFGEVTGSAEGWGPEEPL--PYS--RAFGEGASARPR---- : ::: : : :. : . . NP_001 VFELGLVAGLGHQEFARPSRGYLAFRDDSIWPQEEPAIRPRSSQRVPPMGIQHSKELNRT 10 20 30 40 50 60 90 100 110 120 130 140 pF1KE6 CCRNGGTCVLGSFCVCPAHFTGRYCEHDQRRSECGALEHGAWTLRACHLCRCIFGALHCL :: :::::.:::::.:: : :: :::: :. .::.. : .: . : ::.: : :.:. NP_001 CCLNGGTCMLGSFCACPPSFYGRNCEHDVRKENCGSVPHDTWLPKKCSLCKCWHGQLRCF 70 80 90 100 110 120 150 160 170 180 190 200 pF1KE6 PLQTPDRCD----PKDFLASHA-HGPSAGGAPSLLLLLPCALLHRLLRPDAPAHPRSLVP : :: . ..::.. . : .. . ...:. : NP_001 PQAFLPGCDGLVMDEHLVASRTPELPPSARTTTFMLVGICLSIQSYY 130 140 150 160 170 210 220 pF1KE6 SVLQRERRPCGRPGLGHRL >>NP_003203 (OMIM: 187395) teratocarcinoma-derived growt (188 aa) initn: 316 init1: 316 opt: 319 Z-score: 310.1 bits: 64.3 E(85289): 1.5e-10 Smith-Waterman score: 319; 37.7% identity (58.5% similar) in 130 aa overlap (68-184:52-181) 40 50 60 70 80 pF1KE6 VTKVATQKHRQSPLNWTSSHFGEVTGSAEGWGPEEPL--PYS--RAFGEGASARPR---- : ::: : : :. : . . NP_003 VFELGLVAGLGHQEFARPSRGYLAFRDDSIWPQEEPAIRPRSSQRVPPMGIQHSKELNRT 30 40 50 60 70 80 90 100 110 120 130 140 pF1KE6 CCRNGGTCVLGSFCVCPAHFTGRYCEHDQRRSECGALEHGAWTLRACHLCRCIFGALHCL :: :::::.:::::.:: : :: :::: :. .::.. : .: . : ::.: : :.:. NP_003 CCLNGGTCMLGSFCACPPSFYGRNCEHDVRKENCGSVPHDTWLPKKCSLCKCWHGQLRCF 90 100 110 120 130 140 150 160 170 180 190 200 pF1KE6 PLQTPDRCD----PKDFLASHA-HGPSAGGAPSLLLLLPCALLHRLLRPDAPAHPRSLVP : :: . ..::.. . : .. . ...:. : NP_003 PQAFLPGCDGLVMDEHLVASRTPELPPSARTTTFMLVGICLSIQSYY 150 160 170 180 210 220 pF1KE6 SVLQRERRPCGRPGLGHRL >>NP_963845 (OMIM: 602319) protein kinase C-binding prot (763 aa) initn: 200 init1: 139 opt: 186 Z-score: 180.6 bits: 42.4 E(85289): 0.0025 Smith-Waterman score: 186; 40.0% identity (57.5% similar) in 80 aa overlap (81-157:510-583) 60 70 80 90 100 pF1KE6 LNWTSSHFGEVTGSAEGWGPEEPLPYSRAFGEGASARPRC---CRNGGTCVLGSFCVCPA :.:. : : :: ::::: . ::::. NP_963 CGSGQHNCDENAICTNTVQGHSCTCKPGYVGNGTICRAFCEEGCRYGGTCVAPNKCVCPS 480 490 500 510 520 530 110 120 130 140 150 160 pF1KE6 HFTGRYCEHDQRRSECGALEHGAWTLRACHLCRCIFGALHCLPLQTPDRCDPKDFLASHA ::: .::.: .::. : :. :: . :.. :: . :. : NP_963 GFTGSHCEKDI--DECALRTHTCWNDSACI---NLAGGFDCLCPSGPS-CSGDCPHEGGL 540 550 560 570 580 590 170 180 190 200 210 220 pF1KE6 HGPSAGGAPSLLLLLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL NP_963 KHNGQVWTLKEDRCSVCSCKDGKIFCRRTACDCQNPSADLFCCPECDTRVTSQCLDQNGH 600 610 620 630 640 650 223 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 15:06:18 2016 done: Tue Nov 8 15:06:19 2016 Total Scan time: 5.330 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]