FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2458, 276 aa
  1>>>pF1KE2458 276 - 276 aa - 276 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.8748+/-0.00114; mu= 12.1565+/- 0.069
 mean_var=83.3755+/-16.289, 0's: 0 Z-trim(103.7): 56  B-trim: 76 in 2/49
 Lambda= 0.140461
 statistics sampled from 7508 (7540) to 7508 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.598), E-opt: 0.2 (0.232), width:  16
 Scan time:  1.920

The best scores are:                                      opt bits E(32554)
CCDS4561.1 SCGN gene_id:10590|Hs108|chr6           ( 276) 1827 380.2 8.6e-106
CCDS10899.1 CALB2 gene_id:794|Hs108|chr16          ( 271)  611 133.8 1.3e-31
CCDS6251.1 CALB1 gene_id:793|Hs108|chr8            ( 261)  561 123.7 1.4e-28

>>CCDS4561.1 SCGN gene_id:10590|Hs108|chr6               (276 aa)
 initn: 1827 init1: 1827 opt: 1827  Z-score: 2012.8  bits: 380.2 E(32554): 8.6e-106
Smith-Waterman score: 1827; 100.0% identity (100.0% similar) in 276 aa overlap (1-276:1-276)

               10        20        30        40        50        60
pF1KE2 MDSSREPTLGRLDAAGFWQVWQRFDADEKGYIEEKELDAFFLHMLMKLGTDDTVMKANLH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MDSSREPTLGRLDAAGFWQVWQRFDADEKGYIEEKELDAFFLHMLMKLGTDDTVMKANLH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 KVKQQFMTTQDASKDGRIRMKELAGMFLSEDENFLLLFRRENPLDSSVEFMQIWRKYDAD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 KVKQQFMTTQDASKDGRIRMKELAGMFLSEDENFLLLFRRENPLDSSVEFMQIWRKYDAD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 SSGFISAAELRNFLRDLFLHHKKAISEAKLEEYTGTMMKIFDRNKDGRLDLNDLARILAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 SSGFISAAELRNFLRDLFLHHKKAISEAKLEEYTGTMMKIFDRNKDGRLDLNDLARILAL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 QENFLLQFKMDACSTEERKRDFEKIFAYYDVSKTGALEGPEVDGFVKDMMELVQPSISGV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 QENFLLQFKMDACSTEERKRDFEKIFAYYDVSKTGALEGPEVDGFVKDMMELVQPSISGV
              190       200       210       220       230       240

              250       260       270
pF1KE2 DLDKFREILLRHCDVNKDGKIQKSELALCLGLKINP
       ::::::::::::::::::::::::::::::::::::
CCDS45 DLDKFREILLRHCDVNKDGKIQKSELALCLGLKINP
              250       260       270

>>CCDS10899.1 CALB2 gene_id:794|Hs108|chr16             (271 aa)
 initn: 343 init1: 317 opt: 611  Z-score: 681.2  bits: 133.8 E(32554): 1.3e-31
Smith-Waterman score: 611; 38.4% identity (71.1% similar) in 263 aa overlap (9-270:13-265)

                   10        20        30        40         50
pF1KE2     MDSSREPTLGRLDAAGFWQVWQRFDADEKGYIEEKELDAFFLHML-MKLGTDDTVM
                   :..: :. : ..:..:::: .:::: :::. :: ..   . :.
CCDS10 MAGPQQQPPYLHLAELTASQFLEIWKHFDADGNGYIEGKELENFFQELEKARKGSGMMSK
               10        20        30        40        50        60

          60        70        80        90       100       110
pF1KE2 KANLHKVKQQFMTTQDASKDGRIRMKELAGMFLSEDENFLLLFRRENPLDSSVEFMQIWR
       . :. .  ..::   : ..::.:.: ::: ..:  .::::: ::..  . ::.:::. ::
CCDS10 SDNFGEKMKEFMQKYDKNSDGKIEMAELA-QILPTEENFLLCFRQH--VGSSAEFMEAWR
               70        80         90       100         110

         120       130       140       150       160       170
pF1KE2 KYDADSSGFISAAELRNFLRDLFLHHKKAISEAKLEEYTGTMMKIFDRNKDGRLDLNDLA
       :::.: ::.: : ::..:: ::. . ..  .: ::.::: :....:: : ::.: :....
CCDS10 KYDTDRSGYIEANELKGFLSDLLKKANRPYDEPKLQEYTQTILRMFDLNGDGKLGLSEMS
       120       130       140       150       160       170

         180       190       200       210       220       230
pF1KE2 RILALQENFLLQFKMDACSTEERKRDFEKIFAYYDVSKTGALEGPEVDGFVKDMMELVQP
       :.: .::::::.:.    ..::    :. ::..:: ...: ..  :.:...::..:  .
CCDS10 RLLPVQENFLLKFQGMKLTSEE----FNAIFTFYDKDRSGYIDEHELDALLKDLYEKNKK
       180       190           200       210       220       230

         240       250       260       270
pF1KE2 SISGVDLDKFREILLRHCDVNKDGKIQKSELALCLGLKINP
        ..  .: ..:. ..   .. . ::. ...: . :
CCDS10 EMNIQQLTNYRKSVM---SLAEAGKLYRKDLEIVLCSEPPM
           240          250       260       270

>>CCDS6251.1 CALB1 gene_id:793|Hs108|chr8               (261 aa)
 initn: 352 init1: 352 opt: 561  Z-score: 626.7  bits: 123.7 E(32554): 1.4e-28
Smith-Waterman score: 561; 37.6% identity (66.2% similar) in 263 aa overlap (12-271:11-259)

               10        20        30        40        50        60
pF1KE2 MDSSREPTLGRLDAAGFWQVWQRFDADEKGYIEEKELDAFFLHMLMKLGTDDTVMKANLH
                  . :. :...: .:::: .::.: :::.    .....:         .:
CCDS62  MAESHLQSSLITASQFFEIWLHFDADGSGYLEGKELQ----NLIQELQQARKKAGLELS
                10        20        30            40        50

               70        80        90       100       110       120
pF1KE2 KVKQQFMTTQDASKDGRIRMKELAGMFLSEDENFLLLFRRENPLDSSVEFMQIWRKYDAD
          . :.       ::.: . ::: . :  .:::::::: .. : :  :::. :::::.:
CCDS62 PEMKTFVDQYGQRDDGKIGIVELAHV-LPTEENFLLLFRCQQ-LKSCEEFMKTWRKYDTD
          60        70        80         90        100       110

              130       140       150       160       170       180
pF1KE2 SSGFISAAELRNFLRDLFLHHKKAISEAKLEEYTGTMMKIFDRNKDGRLDLNDLARILAL
        :::: . ::.:::.::. . .:.....:: :::  :.:.:: :.::.:.:...::.: .
CCDS62 HSGFIETEELKNFLKDLLEKANKTVDDTKLAEYTDLMLKLFDSNNDGKLELTEMARLLPV
           120       130       140       150       160       170

               190       200       210       220       230
pF1KE2 QENFLLQFK-MDACSTEERKRDFEKIFAYYDVSKTGALEGPEVDGFVKDMMELVQPSISG
       ::::::.:. .  :. :     :.: :  :: . .: ..  :.:...::. :  . ...
CCDS62 QENFLLKFQGIKMCGKE-----FNKAFELYDQDGNGYIDENELDALLKDLCEKNKQDLDI
           180       190            200       210       220

     240       250       260         270
pF1KE2 VDLDKFREILLRHCDVNKDGKIQKSELAL--CLGLKINP
        ..  ... ..   :    ::. ...:::  : :
CCDS62 NNITTYKKNIMALSD---GGKLYRTDLALILCAGDN
      230       240          250       260

276 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov  7 20:12:39 2016 done: Mon Nov  7 20:12:40 2016
 Total Scan time:  1.920 Total Display time:  0.000

Function used was FASTA [36.3.4 Apr, 2011]