FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE0271, 358 aa 1>>>pF1KE0271 358 - 358 aa - 358 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.2618+/-0.000653; mu= 13.6553+/- 0.040 mean_var=105.5384+/-20.935, 0's: 0 Z-trim(114.2): 5 B-trim: 0 in 0/52 Lambda= 0.124844 statistics sampled from 14806 (14809) to 14806 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.789), E-opt: 0.2 (0.455), width: 16 Scan time: 3.410 The best scores are: opt bits E(32554) CCDS41668.1 SAC3D1 gene_id:29901|Hs108|chr11 ( 358) 2398 441.8 4.4e-124 CCDS13734.1 MCM3AP gene_id:8888|Hs108|chr21 (1980) 450 91.4 6.9e-18 >>CCDS41668.1 SAC3D1 gene_id:29901|Hs108|chr11 (358 aa) initn: 2398 init1: 2398 opt: 2398 Z-score: 2341.2 bits: 441.8 E(32554): 4.4e-124 Smith-Waterman score: 2398; 100.0% identity (100.0% similar) in 358 aa overlap (1-358:1-358) 10 20 30 40 50 60 pF1KE0 MPGCELPVGTCPDMCPAAERAQREREHRLHRLEVVPGCRQDPPRADPQRAVKEYSRPAAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 MPGCELPVGTCPDMCPAAERAQREREHRLHRLEVVPGCRQDPPRADPQRAVKEYSRPAAG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 KPRPPPSQLRPPSVLLATVRYLAGEVAESADIARAEVASFVADRLRAVLLDLALQGAGDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 KPRPPPSQLRPPSVLLATVRYLAGEVAESADIARAEVASFVADRLRAVLLDLALQGAGDA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 EAAVVLEAALATLLTVVARLGPDAARGPADPVLLQAQVQEGFGSLRRCYARGAGPHPRQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 EAAVVLEAALATLLTVVARLGPDAARGPADPVLLQAQVQEGFGSLRRCYARGAGPHPRQP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 AFQGLFLLYNLGSVEALHEVLQLPAALRACPPLRKALAVDAAFREGNAARLFRLLQTLPY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 AFQGLFLLYNLGSVEALHEVLQLPAALRACPPLRKALAVDAAFREGNAARLFRLLQTLPY 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 LPSCAVQCHVGHARREALARFARAFSTPKGQTLPLGFMVNLLALDGLREARDLCQAHGLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 LPSCAVQCHVGHARREALARFARAFSTPKGQTLPLGFMVNLLALDGLREARDLCQAHGLP 250 260 270 280 290 300 310 320 330 340 350 pF1KE0 LDGEERVVFLRGRYVEEGLPPASTCKVLVESKLRGRTLEEVVMAEEEDEGTDRPGSPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 LDGEERVVFLRGRYVEEGLPPASTCKVLVESKLRGRTLEEVVMAEEEDEGTDRPGSPA 310 320 330 340 350 >>CCDS13734.1 MCM3AP gene_id:8888|Hs108|chr21 (1980 aa) initn: 397 init1: 159 opt: 450 Z-score: 434.6 bits: 91.4 E(32554): 6.9e-18 Smith-Waterman score: 450; 31.5% identity (58.3% similar) in 333 aa overlap (8-333:634-958) 10 20 30 pF1KE0 MPGCELPVGTCPDMCPAAERAQREREHRLHRLEVVPG :::: :::: :: .:: . .: .::::: CCDS13 KEKYRLLDQRDRIMRQARVKRTDLDKARTFVGTCLDMCPEKERYMRETRSQLSVFEVVPG 610 620 630 640 650 660 40 50 60 70 80 90 pF1KE0 CRQDPPRADPQRAVKEYSRPAAGKPRPPPSQLRPPSVLLATVRYLAGEVAESADIARAEV : .: ::::::: .: . .: : .::: :: :. ::. .. .. . . . CCDS13 TDQ----VDHAAAVKEYSRSSADQEEPLPHELRPLPVLSRTMDYLVTQIMDQKEGSLRDW 670 680 690 700 710 100 110 120 130 140 150 pF1KE0 ASFVADRLRAVLLDLALQGAGDAEAAVVLEAALATLLTVVARLGPDAARGPADPVLLQAQ .:: .: :.. :.. : : .. ..: . . :.. . . : . . . CCDS13 YDFVWNRTRGIRKDITQQHLCDPLTVSLIEKC-TRFHIHCAHFMCEEPMSSFDAKINNEN 720 730 740 750 760 770 160 170 180 190 200 210 pF1KE0 VQEGFGSLRRCYA--RGAGPHPRQPA-FQGLFLLYNLGSVEALHEVLQLPAALRACPPLR . . . ::.. : :. : . : ::: .: .:.. . :.:: :. :.: .. CCDS13 MTKCLQSLKEMYQDLRNKGVFCASEAEFQGYNVLLSLNKGDILREVQQFHPAVRNSSEVK 780 790 800 810 820 830 220 230 240 250 260 270 pF1KE0 KALAVDAAFREGNAARLFRLLQTLPYLPSCAVQCHVGHARREALA--RFARAFSTPKGQT :. . ::. .: .:.:.:.:. :: .: ..:. .. :..:: :: . :: .. CCDS13 FAVQAFAALNSNNFVRFFKLVQSASYLNACLLHCYFSQIRKDALRALNFAYTVSTQRSTI 840 850 860 870 880 890 280 290 300 310 320 330 pF1KE0 LPLGFMVNLLALDGLREARDLCQAHGLPL-DGEERVVFLRGRYVE-EGLPPASTCKVLVE .:: .: .: . .:: :. ::: . :: : . :. ..: ::: . .:.. CCDS13 FPLDGVVRMLLFRDCEEATDFLTCHGLTVSDGC--VELNRSAFLEPEGLSK-TRKSVFIT 900 910 920 930 940 950 340 350 pF1KE0 SKLRGRTLEEVVMAEEEDEGTDRPGSPA :: CCDS13 RKLTVSVGEIVNGGPLPPVPRHTPVCSFNSQNKYIGESLAAELPVSTQRPGSDTVGGGRG 960 970 980 990 1000 1010 358 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 04:07:43 2016 done: Mon Nov 7 04:07:43 2016 Total Scan time: 3.410 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]