FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
 Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9604, 376 aa
  1>>>pF1KB9604 376 - 376 aa - 376 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 11.4170+/-0.00101; mu= -6.2691+/- 0.061
 mean_var=530.4880+/-108.826, 0's: 0 Z-trim(118.5): 73 B-trim: 192 in 1/54
 Lambda= 0.055685
 statistics sampled from 19413 (19486) to 19413 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.829), E-opt: 0.2 (0.599), width: 16
 Scan time: 3.600

The best scores are:                                 opt  bits E(32554)
CCDS3105.1 FOXL2 gene_id:668|Hs108|chr3      ( 376) 2703 230.9 1.4e-60
CCDS75259.1 FOXD1 gene_id:2297|Hs108|chr5    ( 465)  742  73.5 4.4e-13

>>CCDS3105.1 FOXL2 gene_id:668|Hs108|chr3 (376 aa)
 initn: 2703 init1: 2703 opt: 2703 Z-score: 1200.9 bits: 230.9 E(32554): 1.4e-60
Smith-Waterman score: 2703; 100.0% identity (100.0% similar) in 376 aa overlap (1-376:1-376)

               10        20        30        40        50        60
pF1KB9 MMASYPEPEDAAGALLAPETGRTVKEPEGPPPSPGKGGGGGGGTAPEKPDPAQKPPYSYV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MMASYPEPEDAAGALLAPETGRTVKEPEGPPPSPGKGGGGGGGTAPEKPDPAQKPPYSYV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 ALIAMAIRESAEKRLTLSGIYQYIIAKFPFYEKNKKGWQNSIRHNLSLNECFIKVPREGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 ALIAMAIRESAEKRLTLSGIYQYIIAKFPFYEKNKKGWQNSIRHNLSLNECFIKVPREGG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 GERKGNYWTLDPACEDMFEKGNYRRRRRMKRPFRPPPAHFQPGKGLFGAGGAAGGCGVAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 GERKGNYWTLDPACEDMFEKGNYRRRRRMKRPFRPPPAHFQPGKGLFGAGGAAGGCGVAG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 AGADGYGYLAPPKYLQSGFLNNSWPLPQPPSPMPYASCQMAAAAAAAAAAAAAAGPGSPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 AGADGYGYLAPPKYLQSGFLNNSWPLPQPPSPMPYASCQMAAAAAAAAAAAAAAGPGSPG
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 AAAVVKGLAGPAASYGPYTRVQSMALPPGVVNSYNGLGGPPAAPPPPPHPHPHPHAHHLH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 AAAVVKGLAGPAASYGPYTRVQSMALPPGVVNSYNGLGGPPAAPPPPPHPHPHPHAHHLH
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 AAAAPPPAPPHHGAAAPPPGQLSPASPATAAPPAPAPTSAPGLQFACARQPELAMMHCSY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 AAAAPPPAPPHHGAAAPPPGQLSPASPATAAPPAPAPTSAPGLQFACARQPELAMMHCSY
              310       320       330       340       350       360

              370
pF1KB9 WDHDSKTGALHSRLDL
       ::::::::::::::::
CCDS31 WDHDSKTGALHSRLDL
              370

>>CCDS75259.1 FOXD1 gene_id:2297|Hs108|chr5 (465 aa)
 initn: 506 init1: 506 opt: 742 Z-score: 348.4 bits: 73.5 E(32554): 4.4e-13
Smith-Waterman score: 751; 42.4% identity (56.7% similar) in 363 aa overlap (7-355:70-422)

10 20 30
pF1KB9 MMASYPEPEDAAGALLAPETGRTVKEPEGPPPSPGK
: :: :::: .: . : :: :. :
CCDS75 GGGGPRLAVPAQRRRRRRSYAGEDELEDLEEEEDDDDILLAPPAGGS-PAPPGPAPAAGA
40 50 60 70 80 90

40 50 60 70 80
pF1KB9 G--GGGGGGTA-------PEKPDPAQKPPYSYVALIAMAIRESAEKRLTLSGIYQYIIAK
: :::::: : .: ::::::.:::.::: .: .:::::: : ..: ..
CCDS75 GAGGGGGGGAGGGGSAGSGAKNPLVKPPYSYIALITMAILQSPKKRLTLSEICEFISGR
100 110 120 130 140 150

90 100 110 120 130 140
pF1KB9 FPFYEKNKKGWQNSIRHNLSLNECFIKVPREGGGERKGNYWTLDPACEDMFEKGNY-RRR
::.:... .::::::::::::.::.:.::: :. ::::::::: :::..:.. :::
CCDS75 FPYYREKFPAWQNSIRHNLSLNDCFVKIPREPGNPGKGNYWTLDPESADMFDNGSFLRRR
160 170 180 190 200 210

150 160 170 180 190 200
pF1KB9 RRMKR-PFRPPPAHFQPGKGLFGAGGAAGGCGVAGAGADGYGYLAPPKYLQSGF--LNNS
.:.:: :. :: : . : ::: :::: : .:.: . :: :. . .
CCDS75 KRFKRQPLLPPNAAAAESLLLRGAG-AAGGAGDPAAAAALFPPAPPPPPHAYGYGPYGCG
220 230 240 250 260 270

210 220 230 240 250 260
pF1KB9 WPLPQPPSPMPYASCQMAAAAAAAAAAAAAAGPGSPGAAAVVKGLAGPAASYGPYTRVQS
. : :: : : ::::::::: . : : ... :: : .: :. .
CCDS75 YGLQLPPYAPPSALFAAAAAAAAAAAFHPHSPPPPPPPHGAAAELARTAFGYRPHP--LG
280 290 300 310 320 330

270 280 290 300 310 320
pF1KB9 MALPPGVVNSYNGLGGPPA-APPPPPHPHPHPHAHHLHAAAAPPPAPPHHGAAAPPPGQL
::: . : ::: : : : . : ::: : : : .:
CCDS75 AALPGPLPASAAKAGGPGASALARSPFSIESIIGGSLGPAAAAAAA-----AQAAAAAQA
340 350 360 370 380 390

330 340 350 360 370
pF1KB9 SPASPATAAPPAPAPTSAPGLQFACARQPELAMMHCSYWDHDSKTGALHSRLDL
::. .::::::. .:. : : : :.
CCDS75 SPSPSPVAAPPAPG-SSGGGCAAQAAVGPAAALTRSLVAAAAAAASSVSSSAALGTLHQG
400 410 420 430 440

CCDS75 TALSSVENFTARISNC
450 460

376 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 02:05:09 2016 done: Tue Nov 8 02:05:10 2016
 Total Scan time: 3.600 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]