FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9715, 392 aa 1>>>pF1KB9715 392 - 392 aa - 392 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.3561+/-0.00102; mu= -4.4505+/- 0.062 mean_var=552.6418+/-112.568, 0's: 0 Z-trim(118.5): 47 B-trim: 0 in 0/53 Lambda= 0.054557 statistics sampled from 19421 (19465) to 19421 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.838), E-opt: 0.2 (0.598), width: 16 Scan time: 3.330 The best scores are: opt bits E(32554) CCDS2123.1 EN1 gene_id:2019|Hs108|chr2 ( 392) 2686 225.2 7.9e-59 CCDS5940.1 EN2 gene_id:2020|Hs108|chr7 ( 333) 774 74.6 1.4e-13 >>CCDS2123.1 EN1 gene_id:2019|Hs108|chr2 (392 aa) initn: 2686 init1: 2686 opt: 2686 Z-score: 1169.6 bits: 225.2 E(32554): 7.9e-59 Smith-Waterman score: 2686; 100.0% identity (100.0% similar) in 392 aa overlap (1-392:1-392) 10 20 30 40 50 60 pF1KB9 MEEQQPEPKSQRDSALGAAAAATPGGLSLSLSPGASGSSGSGSDGDSVPVSPQPAPPSPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 MEEQQPEPKSQRDSALGAAAAATPGGLSLSLSPGASGSSGSGSDGDSVPVSPQPAPPSPP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 AAPCLPPLAHHPHLPPHPPPPPPQHLAAPAHQPQPAAQLHRTTNFFIDNILRPDFGCKKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 AAPCLPPLAHHPHLPPHPPPPPPQHLAAPAHQPQPAAQLHRTTNFFIDNILRPDFGCKKE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 QPPPQLLVAAAARGGAGGGGRVERDRGQTAAGRDPVHPLGTRAPGAASLLCAPDANCGPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 QPPPQLLVAAAARGGAGGGGRVERDRGQTAAGRDPVHPLGTRAPGAASLLCAPDANCGPP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 DGSQPAAAGAGASKAGNPAAAAAAAAAAVAAAAAAAAAKPSDTGGGGSGGGAGSPGAQGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 DGSQPAAAGAGASKAGNPAAAAAAAAAAVAAAAAAAAAKPSDTGGGGSGGGAGSPGAQGT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 KYPEHGNPAILLMGSANGGPVVKTDSQQPLVWPAWVYCTRYSDRPSSGPRTRKLKKKKNE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 KYPEHGNPAILLMGSANGGPVVKTDSQQPLVWPAWVYCTRYSDRPSSGPRTRKLKKKKNE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 KEDKRPRTAFTAEQLQRLKAEFQANRYITEQRRQTLAQELSLNESQIKIWFQNKRAKIKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 KEDKRPRTAFTAEQLQRLKAEFQANRYITEQRRQTLAQELSLNESQIKIWFQNKRAKIKK 310 320 330 340 350 360 370 380 390 pF1KB9 ATGIKNGLALHLMAQGLYNHSTTTVQDKDESE :::::::::::::::::::::::::::::::: CCDS21 ATGIKNGLALHLMAQGLYNHSTTTVQDKDESE 370 380 390 >>CCDS5940.1 EN2 gene_id:2020|Hs108|chr7 (333 aa) initn: 897 init1: 740 opt: 774 Z-score: 357.1 bits: 74.6 E(32554): 1.4e-13 Smith-Waterman score: 919; 47.1% identity (62.3% similar) in 395 aa overlap (1-392:1-333) 10 20 30 40 50 60 pF1KB9 MEEQQPEPKSQRDSALGAAAAATPGGLSLSLSPGASGSSGSGSDGDSVPVSPQPAPPSPP :::..:.: : ::::. : . :::. :::. : : :: : . CCDS59 MEENDPKP--------GEAAAAVEGQRQPESSPGG----GSGGGGGS---SPGEADTGRR 10 20 30 40 70 80 90 100 110 120 pF1KB9 AAPCLPPLAHHPHLPPHPPPPPPQHLAAPAHQPQPAAQLHRTTNFFIDNILRPDFGCKKE : :: . : ::... .: :: :::::::::::.:: .:. CCDS59 RALMLPAV-----------------LQAPGNHQHP----HRITNFFIDNILRPEFGRRKD 50 60 70 80 130 140 150 160 170 pF1KB9 QPPPQLLVAAAARGGAGG-GGRVERDRGQTAAGRDPVHPLGTRAPGAASLLCAPDANCGP .... ::::: :: . : :.: . . :.: : . ::: :. :: CCDS59 AGTCCAGAGGGRGGGAGGEGGASGAEGGGGAGGSEQLLGSGSREP-RQNPPCAPGAG-GP 90 100 110 120 130 140 180 190 200 210 220 230 pF1KB9 -PD-GSQPAAAGAGASKAGNPAAAAAAAAAAVAAAAAAAAAKPSDTGGGGSGGGAGSPGA : ::. . : :.::. : ::. .:: :.: CCDS59 LPAAGSDSPGDGEGGSKTL------------------------SLHGGAKKGGDPGGPLD 150 160 170 240 250 260 270 280 290 pF1KB9 QGTKYPEHGNPAILLMGSANGGPVVKTDSQQPLVWPAWVYCTRYSDRPSSGPRTRKLKKK . : :. . . ...... . . . ::..:::::::::::::::::::.:: ::: CCDS59 GSLKARGLGGGDLSVSSDSDSSQAGANLGAQPMLWPAWVYCTRYSDRPSSGPRSRKPKKK 180 190 200 210 220 230 300 310 320 330 340 350 pF1KB9 KNEKEDKRPRTAFTAEQLQRLKAEFQANRYITEQRRQTLAQELSLNESQIKIWFQNKRAK . .:::::::::::::::::::::::.:::.::::::.:::::::::::::::::::::: CCDS59 NPNKEDKRPRTAFTAEQLQRLKAEFQTNRYLTEQRRQSLAQELSLNESQIKIWFQNKRAK 240 250 260 270 280 290 360 370 380 390 pF1KB9 IKKATGIKNGLALHLMAQGLYNHSTTTVQDKDESE :::::: :: ::.:::::::::::::. . :..:: CCDS59 IKKATGNKNTLAVHLMAQGLYNHSTTAKEGKSDSE 300 310 320 330 392 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 06:08:10 2016 done: Sun Nov 6 06:08:11 2016 Total Scan time: 3.330 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]