FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7544, 262 aa 1>>>pF1KB7544 262 - 262 aa - 262 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.4722+/-0.000817; mu= 2.0627+/- 0.049 mean_var=205.7253+/-41.103, 0's: 0 Z-trim(115.0): 67 B-trim: 0 in 0/54 Lambda= 0.089419 statistics sampled from 15478 (15545) to 15478 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.478), width: 16 Scan time: 2.630 The best scores are: opt bits E(32554) CCDS33080.1 SPIB gene_id:6689|Hs108|chr19 ( 262) 1805 244.5 5.7e-65 CCDS59412.1 SPIB gene_id:6689|Hs108|chr19 ( 177) 1138 158.3 3.4e-39 CCDS58674.1 SPIB gene_id:6689|Hs108|chr19 ( 171) 994 139.7 1.3e-33 CCDS7933.2 SPI1 gene_id:6688|Hs108|chr11 ( 270) 617 91.2 8e-19 CCDS44591.1 SPI1 gene_id:6688|Hs108|chr11 ( 271) 615 91.0 9.6e-19 >>CCDS33080.1 SPIB gene_id:6689|Hs108|chr19 (262 aa) initn: 1805 init1: 1805 opt: 1805 Z-score: 1279.9 bits: 244.5 E(32554): 5.7e-65 Smith-Waterman score: 1805; 100.0% identity (100.0% similar) in 262 aa overlap (1-262:1-262) 10 20 30 40 50 60 pF1KB7 MLALEAAQLDGPHFSCLYPDGVFYDLDSCKHSSYPDSEGAPDSLWDWTVAPPVPATPYEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MLALEAAQLDGPHFSCLYPDGVFYDLDSCKHSSYPDSEGAPDSLWDWTVAPPVPATPYEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 FDPAAAAFSHPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 FDPAAAAFSHPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 RGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 RGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEI 190 200 210 220 230 240 250 260 pF1KB7 RKVKRKLTYQFDSALLPAVRRA :::::::::::::::::::::: CCDS33 RKVKRKLTYQFDSALLPAVRRA 250 260 >>CCDS59412.1 SPIB gene_id:6689|Hs108|chr19 (177 aa) initn: 1170 init1: 1138 opt: 1138 Z-score: 817.2 bits: 158.3 E(32554): 3.4e-39 Smith-Waterman score: 1138; 98.2% identity (99.4% similar) in 166 aa overlap (1-166:1-166) 10 20 30 40 50 60 pF1KB7 MLALEAAQLDGPHFSCLYPDGVFYDLDSCKHSSYPDSEGAPDSLWDWTVAPPVPATPYEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MLALEAAQLDGPHFSCLYPDGVFYDLDSCKHSSYPDSEGAPDSLWDWTVAPPVPATPYEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 FDPAAAAFSHPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 FDPAAAAFSHPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLT ::::::::::::::::::::::::::::::::::::::::::. .: CCDS59 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEGLARSCACTSSCWGY 130 140 150 160 170 190 200 210 220 230 240 pF1KB7 RGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEI >>CCDS58674.1 SPIB gene_id:6689|Hs108|chr19 (171 aa) initn: 991 init1: 991 opt: 994 Z-score: 717.0 bits: 139.7 E(32554): 1.3e-33 Smith-Waterman score: 994; 89.0% identity (93.0% similar) in 172 aa overlap (91-262:1-171) 70 80 90 100 110 120 pF1KB7 FDPAAAAFSHPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAY .: :. . ..:: . . ::::::: CCDS58 MASSMTWTAASIPATLIQR-GLLTLVPPAY 10 20 130 140 150 160 170 180 pF1KB7 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLT 30 40 50 60 70 80 190 200 210 220 230 240 pF1KB7 RGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 RGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEI 90 100 110 120 130 140 250 260 pF1KB7 RKVKRKLTYQFDSALLPAVRRA :::::::::::::::::::::: CCDS58 RKVKRKLTYQFDSALLPAVRRA 150 160 170 >>CCDS7933.2 SPI1 gene_id:6688|Hs108|chr11 (270 aa) initn: 530 init1: 480 opt: 617 Z-score: 451.4 bits: 91.2 E(32554): 8e-19 Smith-Waterman score: 617; 44.4% identity (63.4% similar) in 268 aa overlap (4-256:2-257) 10 20 30 40 50 pF1KB7 MLALEAAQLDGPHFSCLYP---DGVFYDLDSCK---HSSYP----DSEGAPDSLWDWTVA :.: ...: : . : : : :: : . : :: :.:. : ::. CCDS79 MLQACKMEG--FPLVPPPSEDLVPYDTDLYQRQTHEYYPYLSSDGESHSDHYWDFH-- 10 20 30 40 50 60 70 80 90 100 pF1KB7 PPVPATPYEAFDPAAAAFSHPQAAQLCYEPPTYSPAG-NLELAPSLEAPGPGLPAYPTEN : . .:.: : :.. :..: :: . ..:: : .: .:. . CCDS79 PHHVHSEFESF--AENNFTELQSVQ----PPQLQQLYRHMELEQMHVLDTPMVPPHPSLG 60 70 80 90 100 110 120 130 140 150 160 pF1KB7 FASQTLVPPAYAPYPS--PVL--SEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGT . : : ::: :. :.::. .:: :::::.:.: .: :: .:.:. CCDS79 HQVSYL-PRMCLQYPSLSPAQPSSDEEEGERQSPPLEVSDGEAD-GLEPGPGLLPGETGS 110 120 130 140 150 160 170 180 190 200 210 220 pF1KB7 RKKLRLYQFLLGLLTRGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQ .::.::::::: :: :::.. .:::. :.::::::::: ::.::: ::::::.:::: CCDS79 KKKIRLYQFLLDLLRSGDMKDSIWWVDKDKGTFQFSSKHKEALAHRWGIQKGNRKKMTYQ 170 180 190 200 210 220 230 240 250 260 pF1KB7 KLARALRNYAKTGEIRKVKRKLTYQFDSALLPAVRRA :.:::::::.::::..:::.::::::.. .: CCDS79 KMARALRNYGKTGEVKKVKKKLTYQFSGEVLGRGGLAERRHPPH 230 240 250 260 270 >>CCDS44591.1 SPI1 gene_id:6688|Hs108|chr11 (271 aa) initn: 530 init1: 480 opt: 615 Z-score: 450.0 bits: 91.0 E(32554): 9.6e-19 Smith-Waterman score: 615; 44.2% identity (63.2% similar) in 269 aa overlap (4-256:2-258) 10 20 30 40 pF1KB7 MLALEAAQLDGPHFSCLYP----DGVFYDLDSCK---HSSYP----DSEGAPDSLWDWTV :.: ...: : . : : : :: : . : :: :.:. : ::. CCDS44 MLQACKMEG--FPLVPPQPSEDLVPYDTDLYQRQTHEYYPYLSSDGESHSDHYWDFH- 10 20 30 40 50 50 60 70 80 90 100 pF1KB7 APPVPATPYEAFDPAAAAFSHPQAAQLCYEPPTYSPAG-NLELAPSLEAPGPGLPAYPTE : . .:.: : :.. :..: :: . ..:: : .: .:. CCDS44 -PHHVHSEFESF--AENNFTELQSVQ----PPQLQQLYRHMELEQMHVLDTPMVPPHPSL 60 70 80 90 100 110 120 130 140 150 160 pF1KB7 NFASQTLVPPAYAPYPS--PVL--SEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAG . . : : ::: :. :.::. .:: :::::.:.: .: :: .:.: CCDS44 GHQVSYL-PRMCLQYPSLSPAQPSSDEEEGERQSPPLEVSDGEAD-GLEPGPGLLPGETG 110 120 130 140 150 160 170 180 190 200 210 220 pF1KB7 TRKKLRLYQFLLGLLTRGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTY ..::.::::::: :: :::.. .:::. :.::::::::: ::.::: ::::::.::: CCDS44 SKKKIRLYQFLLDLLRSGDMKDSIWWVDKDKGTFQFSSKHKEALAHRWGIQKGNRKKMTY 170 180 190 200 210 220 230 240 250 260 pF1KB7 QKLARALRNYAKTGEIRKVKRKLTYQFDSALLPAVRRA ::.:::::::.::::..:::.::::::.. .: CCDS44 QKMARALRNYGKTGEVKKVKKKLTYQFSGEVLGRGGLAERRHPPH 230 240 250 260 270 262 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 12:09:28 2016 done: Sun Nov 6 12:09:28 2016 Total Scan time: 2.630 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]