FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1651, 145 aa
  1>>>pF1KE1651 145 - 145 aa - 145 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.9332+/-0.000615; mu= 13.5755+/- 0.037
 mean_var=59.2330+/-11.843, 0's: 0 Z-trim(110.8): 20 B-trim: 0 in 0/49
 Lambda= 0.166645
 statistics sampled from 11898 (11918) to 11898 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.748), E-opt: 0.2 (0.366), width: 16
 Scan time: 1.750

The best scores are:                               opt  bits E(32554)
CCDS13165.2 CST7 gene_id:8530|Hs108|chr20  ( 145) 1002 248.4 1.2e-66
CCDS13158.1 CST3 gene_id:1471|Hs108|chr20  ( 146)  254  68.5 1.6e-12
CCDS13159.1 CST4 gene_id:1472|Hs108|chr20  ( 141)  235  64.0 3.7e-11
CCDS13160.1 CST1 gene_id:1469|Hs108|chr20  ( 141)  233  63.5 5.2e-11

>>CCDS13165.2 CST7 gene_id:8530|Hs108|chr20 (145 aa)
 initn: 1002 init1: 1002 opt: 1002 Z-score: 1310.2 bits: 248.4 E(32554): 1.2e-66
Smith-Waterman score: 1002; 100.0% identity (100.0% similar) in 145 aa overlap (1-145:1-145)

               10        20        30        40        50        60
pF1KE1 MRAAGTLLAFCCLVLSTTGGPSPDTCSQDLNSRVKPGFPKTIKTNDPGVLQAARYSVEKF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MRAAGTLLAFCCLVLSTTGGPSPDTCSQDLNSRVKPGFPKTIKTNDPGVLQAARYSVEKF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 NNCTNDMFLFKESRITRALVQIVKGLKYMLEVEIGRTTCKKNQHLRLDDCDFQTNHTLKQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 NNCTNDMFLFKESRITRALVQIVKGLKYMLEVEIGRTTCKKNQHLRLDDCDFQTNHTLKQ
               70        80        90       100       110       120

              130       140
pF1KE1 TLSCYSEVWVVPWLQHFEVPVLRCH
       :::::::::::::::::::::::::
CCDS13 TLSCYSEVWVVPWLQHFEVPVLRCH
              130       140

>>CCDS13158.1 CST3 gene_id:1471|Hs108|chr20 (146 aa)
 initn: 270 init1: 176 opt: 254 Z-score: 338.3 bits: 68.5 E(32554): 1.6e-12
Smith-Waterman score: 254; 35.6% identity (63.0% similar) in 135 aa overlap (1-133:5-132)

                   10          20        30        40        50
pF1KE1     MRAAGTLLAF--CCLVLSTTGGPSPDTCSQDLNSRVKPGFPKTIKTNDPGVLQAAR
           .::   :::.    :..: ..: ::         :.  : :   .... :: .:
CCDS13 MAGPLRAPLLLLAILAVALAVSPAAGSSPGK-----PPRLVGG-PMDASVEEEGVRRALD
               10        20        30              40        50

           60        70        80        90       100       110
pF1KE1 YSVEKFNNCTNDMFLFKESRITRALVQIVKGLKYMLEVEIGRTTCKKNQHLRLDDCDFQT
       ..: ..:. .:::.  .  ...::  ::: :..:.:.::.::::: :.:   ::.: :.
CCDS13 FAVGEYNKASNDMYHSRALQVVRARKQIVAGVNYFLDVELGRTTCTKTQP-NLDNCPFHD
           60        70        80        90       100        110

          120       130       140
pF1KE1 NHTLKQTLSCYSEVWVVPWLQHFEVPVLRCH
       .  ::.   :  ....:::
CCDS13 QPHLKRKAFCSFQIYAVPWQGTMTLSKSTCQDA
           120       130       140

>>CCDS13159.1 CST4 gene_id:1472|Hs108|chr20 (141 aa)
 initn: 218 init1: 145 opt: 235 Z-score: 313.8 bits: 64.0 E(32554): 3.7e-11
Smith-Waterman score: 235; 29.0% identity (59.4% similar) in 138 aa overlap (10-145:5-139)

               10          20        30        40        50
pF1KE1 MRAAGTLLAFCCLVL--STTGGPSPDTCSQDLNSRVKPGFPKTIKTNDPGVLQAARYSVE
                .: :.:  .: .:   .. ...  .:. ::       ::  : .: ....
CCDS13      MARPLCTLLLLMATLAGALASSSKEE--NRIIPGGIYDADLNDEWVQRALHFAIS
                    10        20          30        40        50

       60        70        80        90       100       110
pF1KE1 KFNNCTNDMFLFKESRITRALVQIVKGLKYMLEVEIGRTTCKKNQHLRLDDCDFQTNHTL
       ..:. :.: .  .  .. ::  :   :..:...::.::: : :.:   :: : :. .  :
CCDS13 EYNKATEDEYYRRPLQVLRAREQTFGGVNYFFDVEVGRTICTKSQP-NLDTCAFHEQPEL
            60        70        80        90        100       110

      120       130       140
pF1KE1 KQTLSCYSEVWVVPWLQHFEVPVLRCH
       ..   :  :.. ::: ... .   ::.
CCDS13 QKKQLCSFEIYEVPWEDRMSLVNSRCQEA
            120       130       140

>>CCDS13160.1 CST1 gene_id:1469|Hs108|chr20 (141 aa)
 initn: 226 init1: 156 opt: 233 Z-score: 311.2 bits: 63.5 E(32554): 5.2e-11
Smith-Waterman score: 233; 32.7% identity (60.2% similar) in 113 aa overlap (33-145:28-139)

             10        20        30        40        50        60
pF1KE1 AAGTLLAFCCLVLSTTGGPSPDTCSQDLNSRVKPGFPKTIKTNDPGVLQAARYSVEKFNN
                                     :. ::   .   ::  : .: .... ..:.
CCDS13    MAQYLSTLLLLLATLAVALAWSPKEEDRIIPGGIYNADLNDEWVQRALHFAISEYNK
                  10        20        30        40        50

             70        80        90       100       110       120
pF1KE1 CTNDMFLFKESRITRALVQIVKGLKYMLEVEIGRTTCKKNQHLRLDDCDFQTNHTLKQTL
        :.: .  .  :. ::  : : :..:...::.::: : :.:   :: : :. .  :..
CCDS13 ATKDDYYRRPLRVLRARQQTVGGVNYFFDVEVGRTICTKSQP-NLDTCAFHEQPELQKKQ
        60        70        80        90        100       110

            130       140
pF1KE1 SCYSEVWVVPWLQHFEVPVLRCH
        :  :.. ::: ..  .   ::.
CCDS13 LCSFEIYEVPWENRRSLVKSRCQES
        120       130       140

145 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 13:06:20 2016 done: Sun Nov 6 13:06:20 2016
 Total Scan time: 1.750 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]