FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9267, 137 aa
  1>>>pF1KB9267 137 - 137 aa - 137 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.7657+/-0.000291; mu= 9.6567+/- 0.018
 mean_var=80.0092+/-14.945, 0's: 0 Z-trim(118.9): 40 B-trim: 0 in 0/54
 Lambda= 0.143385
 statistics sampled from 32385 (32426) to 32385 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.751), E-opt: 0.2 (0.38), width: 16
 Scan time: 5.340

The best scores are:                                       opt  bits E(85289)
NP_001029058 (OMIM: 600835,609423) stromal cell-de ( 119)  677 148.6 2.6e-36
NP_001171605 (OMIM: 600835,609423) stromal cell-de ( 140)  597 132.1 2.9e-31
NP_954637 (OMIM: 600835,609423) stromal cell-deriv (  89)  592 130.9 4.1e-31
NP_000600 (OMIM: 600835,609423) stromal cell-deriv (  93)  592 130.9 4.2e-31
NP_001264919 (OMIM: 600835,609423) stromal cell-de ( 103)  255  61.2 4.4e-10
NP_000575 (OMIM: 146930) interleukin-8 precursor [ (  99)  164  42.4  0.0002
NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 p ( 107)  153  40.1   0.001
NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 p ( 107)  152  39.9  0.0012
NP_002976 (OMIM: 187011) C-C motif chemokine 5 iso (  91)  147  38.9  0.0021
NP_002984 (OMIM: 138965) C-X-C motif chemokine 6 p ( 114)  142  37.9  0.0052
NP_001502 (OMIM: 155730) growth-regulated alpha pr ( 107)  140  37.5  0.0066

>>NP_001029058 (OMIM: 600835,609423) stromal cell-derive (119 aa)
 initn: 806 init1: 677 opt: 677 Z-score: 772.8 bits: 148.6 E(85289): 2.6e-36
Smith-Waterman score: 677; 87.3% identity (94.9% similar) in 118 aa overlap (1-118:1-118)

       10 20 30 40 50 60
pF1KB9 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KB9 ARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKRKDRKKEATEEEKGCPEKEKL
       :::::::::::::::::::::::::::::::::::::::.: ::. ...:. ...
NP_001 ARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKEKIGKKKRQKKRKAAQKRKN
       70 80 90 100 110

       130
pF1KB9 VICHLEMDHSSLALGAL

>>NP_001171605 (OMIM: 600835,609423) stromal cell-derive (140 aa)
 initn: 638 init1: 591 opt: 597 Z-score: 682.3 bits: 132.1 E(85289): 2.9e-31
Smith-Waterman score: 597; 73.4% identity (82.0% similar) in 128 aa overlap (1-127:1-126)

       10 20 30 40 50 60
pF1KB9 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       10 20 30 40 50 60

       70 80 90 100 110
pF1KB9 ARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKR-KDRKKEATEEEKGCPEKEK
       ::::::::::::::::::::::::::::. .::. . ..:: .
NP_001 ARLKNNNRQVCIDPKLKWIQEYLEKALNNLISAAPAGKRVIAGARALHPSPPRACPTARA
       70 80 90 100 110 120

       120 130
pF1KB9 LVICHLEMDHSSLALGAL
       : :....
NP_001 L--CEIRLWPPPEWSWPSPGDV
       130 140

>>NP_954637 (OMIM: 600835,609423) stromal cell-derived f (89 aa)
 initn: 592 init1: 592 opt: 592 Z-score: 679.6 bits: 130.9 E(85289): 4.1e-31
Smith-Waterman score: 592; 100.0% identity (100.0% similar) in 89 aa overlap (1-89:1-89)

       10 20 30 40 50 60
pF1KB9 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_954 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KB9 ARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKRKDRKKEATEEEKGCPEKEKL
       :::::::::::::::::::::::::::::
NP_954 ARLKNNNRQVCIDPKLKWIQEYLEKALNK
       70 80

>>NP_000600 (OMIM: 600835,609423) stromal cell-derived f (93 aa)
 initn: 592 init1: 592 opt: 592 Z-score: 679.4 bits: 130.9 E(85289): 4.2e-31
Smith-Waterman score: 592; 100.0% identity (100.0% similar) in 89 aa overlap (1-89:1-89)

       10 20 30 40 50 60
pF1KB9 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KB9 ARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKRKDRKKEATEEEKGCPEKEKL
       :::::::::::::::::::::::::::::
NP_000 ARLKNNNRQVCIDPKLKWIQEYLEKALNKRFKM
       70 80 90

>>NP_001264919 (OMIM: 600835,609423) stromal cell-derive (103 aa)
 initn: 265 init1: 255 opt: 255 Z-score: 302.0 bits: 61.2 E(85289): 4.4e-10
Smith-Waterman score: 255; 100.0% identity (100.0% similar) in 38 aa overlap (1-38:1-38)

       10 20 30 40 50 60
pF1KB9 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV
       ::::::::::::::::::::::::::::::::::::::
NP_001 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHYCTCLIRVSFHGATPLTQGSWV
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KB9 ARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKRKDRKKEATEEEKGCPEKEKL
NP_001 LYSLSCAGGETGLREPGPMVSPRVESHQEGRLGVPGPVNLGKA
       70 80 90 100

>>NP_000575 (OMIM: 146930) interleukin-8 precursor [Homo (99 aa)
 initn: 138 init1: 95 opt: 164 Z-score: 200.5 bits: 42.4 E(85289): 0.0002
Smith-Waterman score: 164; 31.2% identity (69.8% similar) in 96 aa overlap (1-89:1-95)

       10 20 30 40 50
pF1KB9 MNAKVVVVLV---LVLTALCLSDGKPVSLS-YRCPC-RFFESHVARANVKHLKILNT-PN
       :..:..:.:. :. .::: . : : . :: : . . . .:.:..... :.
NP_000 MTSKLAVALLAAFLISAALCEGAVLPRSAKELRCQCIKTYSKPFHPKFIKELRVIESGPH
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KB9 CA-LQIVARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKRKDRKKEATEEEKG
       :: .:...: ...:..:.::: .:.:. .:: :..
NP_000 CANTEIIVKL-SDGRELCLDPKENWVQRVVEKFLKRAENS
       70 80 90

       120 130
pF1KB9 CPEKEKLVICHLEMDHSSLALGAL

>>NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 precu (107 aa)
 initn: 133 init1: 103 opt: 153 Z-score: 187.7 bits: 40.1 E(85289): 0.001
Smith-Waterman score: 153; 29.2% identity (62.9% similar) in 89 aa overlap (4-90:17-104)

       10 20 30 40
pF1KB9 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHL
       .:...:.:...: . : : :: : . . :.. .
NP_002 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV
       10 20 30 40 50 60

       50 60 70 80 90 100
pF1KB9 KILNT-PNCA-LQIVARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKRKDRKK
       .. . :.:: ...: :::. ...:..: .:. .:: ::::
NP_002 NVRSPGPHCAQTEVIATLKNG-KKACLNPASPMVQKIIEKILNKGSTN
       70 80 90 100

       110 120 130
pF1KB9 EATEEEKGCPEKEKLVICHLEMDHSSLALGAL

>>NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 precu (107 aa)
 initn: 93 init1: 75 opt: 152 Z-score: 186.6 bits: 39.9 E(85289): 0.0012
Smith-Waterman score: 152; 26.7% identity (65.6% similar) in 90 aa overlap (4-91:17-105)

       10 20 30 40
pF1KB9 MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHL
       .:...:.:...: . : :.. :: : . . :.. .
NP_002 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNIQSV
       10 20 30 40 50 60

       50 60 70 80 90 100
pF1KB9 KILNT-PNCA-LQIVARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKRKDRKK
       :. . :.:: ...: :::. ...:..: ... .:: :..:.
NP_002 KVKSPGPHCAQTEVIATLKNG-QKACLNPASPMVKKIIEKMLKNGKSN
       70 80 90 100

       110 120 130
pF1KB9 EATEEEKGCPEKEKLVICHLEMDHSSLALGAL

>>NP_002976 (OMIM: 187011) C-C motif chemokine 5 isoform (91 aa)
 initn: 116 init1: 90 opt: 147 Z-score: 182.0 bits: 38.9 E(85289): 0.0021
Smith-Waterman score: 147; 33.3% identity (66.7% similar) in 81 aa overlap (7-83:8-85)

       10 20 30 40 50
pF1KB9 MNAKVVVVLVLVLTALCL-SDGKPVSLSYRCPCRFFESHVARANVK-HLK--ILNTPNC
       ....:. :::: ....: : : :: : ...:: . :.: . .. .:
NP_002 MKVSAAALAVILIATALCAPASASPYS-SDTTPCCF--AYIARPLPRAHIKEYFYTSGKC
       10 20 30 40 50

       60 70 80 90 100 110
pF1KB9 ALQIVARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKKRKDRKKEATEEEKGCP
       . :. . .::::: .:. ::..::.
NP_002 SNPAVVFVTRKNRQVCANPEKKWVREYINSLEMS
       60 70 80 90

>>NP_002984 (OMIM: 138965) C-X-C motif chemokine 6 precu (114 aa)
 initn: 117 init1: 75 opt: 142 Z-score: 175.0 bits: 37.9 E(85289): 0.0052
Smith-Waterman score: 142; 29.8% identity (67.0% similar) in 94 aa overlap (6-93:21-113)

       10 20 30 40
pF1KB9 MNAKVVVVLVLVLTALC-LSDGKPVS---LSYRCPCRFFESHVAR
       ...:.:.:: :... ::: :: : .:
NP_002 MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTLRVNP
       10 20 30 40 50 60

       50 60 70 80 90
pF1KB9 ANVKHLKILNT-PNCA-LQIVARLKNNNRQVCIDPKLKWIQEYLEKALNKGRREEKVGKK
       .. .:... . :.:. ...:: :::. .:::.::. .... ..: :..: ..
NP_002 KTIGKLQVFPAGPQCSKVEVVASLKNG-KQVCLDPEAPFLKKVIQKILDSGNKKN
       70 80 90 100 110

       100 110 120 130
pF1KB9 RKDRKKEATEEEKGCPEKEKLVICHLEMDHSSLALGAL


137 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 18:21:45 2016 done: Thu Nov 3 18:21:46 2016
 Total Scan time: 5.340 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]
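
The per-hit summaries in this report (the ">>" line, the "initn:" line and the "Smith-Waterman score:" line) follow a regular pattern, so they can be pulled into records for sorting or filtering. The following is a minimal sketch, not part of the FASTA package: the function name parse_hits, the field names, and the regular expression are assumptions derived only from the layout visible in this report, and other FASTA versions or output options may need a different pattern. It matches on whitespace in general, so it should work whether or not the report's line breaks are intact.

```python
import re

# Assumed layout of one hit record, inferred from this report only:
#   >>ACCESSION description (LEN aa)
#    initn: N init1: N opt: N Z-score: F bits: F E(DBSIZE): F
#   Smith-Waterman score: N; P% identity (P% similar) in N aa overlap (a-b:c-d)
# The lookarounds skip the ">>>" that introduces the query line.
HIT_RE = re.compile(
    r"(?<!>)>>(?!>)(?P<acc>\S+)\s+(?P<desc>[^>]*?)\s*\(\s*(?P<length>\d+) aa\)\s+"
    r"initn:\s*(?P<initn>\d+)\s+init1:\s*(?P<init1>\d+)\s+opt:\s*(?P<opt>\d+)\s+"
    r"Z-score:\s*(?P<zscore>[\d.]+)\s+bits:\s*(?P<bits>[\d.]+)\s+"
    r"E\(\d+\):\s*(?P<evalue>\S+)\s+"
    r"Smith-Waterman score:\s*\d+;\s*(?P<identity>[\d.]+)% identity\s+"
    r"\((?P<similar>[\d.]+)% similar\)\s+in\s+(?P<overlap>\d+) aa overlap\s+"
    r"\((?P<q_start>\d+)-(?P<q_end>\d+):(?P<s_start>\d+)-(?P<s_end>\d+)\)"
)

INT_FIELDS = ("length", "initn", "init1", "opt", "overlap",
              "q_start", "q_end", "s_start", "s_end")
FLOAT_FIELDS = ("zscore", "bits", "evalue", "identity", "similar")


def parse_hits(report):
    """Return one dict per hit summary found in a FASTA report string."""
    hits = []
    for match in HIT_RE.finditer(report):
        record = match.groupdict()
        for key in INT_FIELDS:
            record[key] = int(record[key])
        for key in FLOAT_FIELDS:
            record[key] = float(record[key])
        hits.append(record)
    return hits


if __name__ == "__main__":
    # Two summaries copied from the report above, alignments omitted.
    sample = (
        "1>>>pF1KB9267 137 - 137 aa - 137 aa "
        ">>NP_001029058 (OMIM: 600835,609423) stromal cell-derive (119 aa) "
        "initn: 806 init1: 677 opt: 677 Z-score: 772.8 bits: 148.6 "
        "E(85289): 2.6e-36 Smith-Waterman score: 677; 87.3% identity "
        "(94.9% similar) in 118 aa overlap (1-118:1-118) "
        ">>NP_000575 (OMIM: 146930) interleukin-8 precursor [Homo (99 aa) "
        "initn: 138 init1: 95 opt: 164 Z-score: 200.5 bits: 42.4 "
        "E(85289): 0.0002 Smith-Waterman score: 164; 31.2% identity "
        "(69.8% similar) in 96 aa overlap (1-89:1-95)"
    )
    for hit in parse_hits(sample):
        print(hit["acc"], hit["length"], hit["bits"], hit["evalue"], hit["identity"])
```

Applied to the full report, a filter such as `[h for h in parse_hits(text) if h["evalue"] < 1e-5]` would keep only the stromal cell-derived factor entries, since the remaining chemokine hits all have E-values of 0.0002 or larger.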