FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9267, 137 aa
  1>>>pF1KB9267 137 - 137 aa - 137 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6951+/-0.000656; mu= 10.2172+/- 0.039
 mean_var=80.2641+/-15.001, 0's: 0 Z-trim(112.1): 31 B-trim: 0 in 0/52
 Lambda= 0.143157
 statistics sampled from 12921 (12952) to 12921 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.766), E-opt: 0.2 (0.398), width: 16
 Scan time: 1.720

The best scores are:                                      opt bits E(32554)
CCDS31186.1 CXCL12 gene_id:6387|Hs108|chr10        ( 119)  677 148.2 1.3e-36
CCDS53527.1 CXCL12 gene_id:6387|Hs108|chr10        ( 140)  597 131.8 1.3e-31
CCDS7207.1 CXCL12 gene_id:6387|Hs108|chr10         (  89)  592 130.6 1.9e-31
CCDS44373.1 CXCL12 gene_id:6387|Hs108|chr10        (  93)  592 130.6   2e-31

>>CCDS31186.1 CXCL12 gene_id:6387|Hs108|chr10           (119 aa)
 initn: 806 init1: 677 opt: 677 Z-score: 771.1 bits: 148.2 E(32554): 1.3e-36
Smith-Waterman score: 677; 87.3% identity (94.9% similar) in 118 aa overlap (1-118:1-118)

               10        20        30        40        50        60
pF1KB9 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 ARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKRKDRKKEATEEEKGCPEKEKL
       :::::::::::::::::::::::::::::::::::::::.:  ::.  ...:.  ...
CCDS31 ARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKEKIGKKKRQKKRKAAQKRKN
               70        80        90       100       110

              130
pF1KB9 VICHLEMDHSSLALGAL

>>CCDS53527.1 CXCL12 gene_id:6387|Hs108|chr10           (140 aa)
 initn: 638 init1: 591 opt: 597 Z-score: 680.7 bits: 131.8 E(32554): 1.3e-31
Smith-Waterman score: 597; 73.4% identity (82.0% similar) in 128 aa overlap (1-127:1-126)

               10        20        30        40        50        60
pF1KB9 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
               10        20        30        40        50        60

               70        80        90       100        110
pF1KB9 ARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKR-KDRKKEATEEEKGCPEKEK
       ::::::::::::::::::::::::::::.      .::.     .       ..::  .
CCDS53 ARLKNNNRQVCIDPKLKWIQEYLEKALNNLISAAPAGKRVIAGARALHPSPPRACPTARA
               70        80        90       100       110       120

     120       130
pF1KB9 LVICHLEMDHSSLALGAL
       :  :....
CCDS53 L--CEIRLWPPPEWSWPSPGDV
                130       140

>>CCDS7207.1 CXCL12 gene_id:6387|Hs108|chr10            (89 aa)
 initn: 592 init1: 592 opt: 592 Z-score: 678.0 bits: 130.6 E(32554): 1.9e-31
Smith-Waterman score: 592; 100.0% identity (100.0% similar) in 89 aa overlap (1-89:1-89)

               10        20        30        40        50        60
pF1KB9 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 ARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKRKDRKKEATEEEKGCPEKEKL
       :::::::::::::::::::::::::::::
CCDS72 ARLKNNNRQVCIDPKLKWIQEYLEKALNK
               70        80

>>CCDS44373.1 CXCL12 gene_id:6387|Hs108|chr10           (93 aa)
 initn: 592 init1: 592 opt: 592 Z-score: 677.8 bits: 130.6 E(32554): 2e-31
Smith-Waterman score: 592; 100.0% identity (100.0% similar) in 89 aa overlap (1-89:1-89)

               10        20        30        40        50        60
pF1KB9 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 ARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKRKDRKKEATEEEKGCPEKEKL
       :::::::::::::::::::::::::::::
CCDS44 ARLKNNNRQVCIDPKLKWIQEYLEKALNKRFKM
               70        80        90

137 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 18:21:45 2016 done: Thu Nov 3 18:21:45 2016
 Total Scan time: 1.720 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]