FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2270, 146 aa 1>>>pF1KE2270 146 - 146 aa - 146 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2194+/-0.000539; mu= 13.2849+/- 0.033 mean_var=65.8325+/-12.911, 0's: 0 Z-trim(113.9): 20 B-trim: 0 in 0/50 Lambda= 0.158072 statistics sampled from 14475 (14495) to 14475 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.445), width: 16 Scan time: 1.860 The best scores are: opt bits E(32554) CCDS13158.1 CST3 gene_id:1471|Hs108|chr20 ( 146) 982 231.4 1.6e-61 CCDS13161.1 CST2 gene_id:1470|Hs108|chr20 ( 141) 542 131.0 2.4e-31 CCDS13159.1 CST4 gene_id:1472|Hs108|chr20 ( 141) 529 128.0 1.9e-30 CCDS13160.1 CST1 gene_id:1469|Hs108|chr20 ( 141) 523 126.7 4.9e-30 CCDS13162.1 CST5 gene_id:1473|Hs108|chr20 ( 142) 488 118.7 1.3e-27 CCDS8126.1 CST6 gene_id:1474|Hs108|chr11 ( 149) 282 71.7 1.8e-13 CCDS13156.1 CST8 gene_id:10047|Hs108|chr20 ( 142) 260 66.7 5.6e-12 CCDS13165.2 CST7 gene_id:8530|Hs108|chr20 ( 145) 254 65.3 1.5e-11 >>CCDS13158.1 CST3 gene_id:1471|Hs108|chr20 (146 aa) initn: 982 init1: 982 opt: 982 Z-score: 1218.2 bits: 231.4 E(32554): 1.6e-61 Smith-Waterman score: 982; 100.0% identity (100.0% similar) in 146 aa overlap (1-146:1-146) 10 20 30 40 50 60 pF1KE2 MAGPLRAPLLLLAILAVALAVSPAAGSSPGKPPRLVGGPMDASVEEEGVRRALDFAVGEY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MAGPLRAPLLLLAILAVALAVSPAAGSSPGKPPRLVGGPMDASVEEEGVRRALDFAVGEY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 NKASNDMYHSRALQVVRARKQIVAGVNYFLDVELGRTTCTKTQPNLDNCPFHDQPHLKRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 NKASNDMYHSRALQVVRARKQIVAGVNYFLDVELGRTTCTKTQPNLDNCPFHDQPHLKRK 70 80 90 100 110 120 130 140 pF1KE2 AFCSFQIYAVPWQGTMTLSKSTCQDA :::::::::::::::::::::::::: CCDS13 AFCSFQIYAVPWQGTMTLSKSTCQDA 130 140 >>CCDS13161.1 CST2 gene_id:1470|Hs108|chr20 (141 aa) initn: 584 init1: 490 opt: 542 Z-score: 676.1 bits: 131.0 E(32554): 2.4e-31 Smith-Waterman score: 542; 56.5% identity (79.6% similar) in 147 aa overlap (1-146:1-141) 10 20 30 40 50 pF1KE2 MAGPLRAPLLLLAILAVALAVSPAAGSSPGKPPRLV-GGPMDASVEEEGVRRALDFAVGE :: :: . ::::: ::::: :: . :.. :: .::....: :.::: :...: CCDS13 MAWPLCTLLLLLATQAVALAWSPQ------EEDRIIEGGIYDADLNDERVQRALHFVISE 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 YNKASNDMYHSRALQVVRARKQIVAGVNYFLDVELGRTTCTKTQPNLDNCPFHDQPHLKR ::::..: :. : :.:.:::.:::.:::::.:.:.::: :::.:::::.: ::.::.:.. CCDS13 YNKATEDEYYRRLLRVLRAREQIVGGVNYFFDIEVGRTICTKSQPNLDTCAFHEQPELQK 60 70 80 90 100 110 120 130 140 pF1KE2 KAFCSFQIYAVPWQGTMTLSKSTCQDA : .:::::: :::. :.: .: ::.: CCDS13 KQLCSFQIYEVPWEDRMSLVNSRCQEA 120 130 140 >>CCDS13159.1 CST4 gene_id:1472|Hs108|chr20 (141 aa) initn: 583 init1: 486 opt: 529 Z-score: 660.1 bits: 128.0 E(32554): 1.9e-30 Smith-Waterman score: 529; 55.8% identity (78.2% similar) in 147 aa overlap (1-146:1-141) 10 20 30 40 50 pF1KE2 MAGPLRAPLLLLAILAVALAVSPAAGSSPGKPPRLV-GGPMDASVEEEGVRRALDFAVGE :: :: . :::.: :: ::: :: . :.. :: .::....: :.::: ::..: CCDS13 MARPLCTLLLLMATLAGALA------SSSKEENRIIPGGIYDADLNDEWVQRALHFAISE 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 YNKASNDMYHSRALQVVRARKQIVAGVNYFLDVELGRTTCTKTQPNLDNCPFHDQPHLKR ::::..: :. : :::.:::.: .:::::.:::.::: :::.:::::.: ::.::.:.. CCDS13 YNKATEDEYYRRPLQVLRAREQTFGGVNYFFDVEVGRTICTKSQPNLDTCAFHEQPELQK 60 70 80 90 100 110 120 130 140 pF1KE2 KAFCSFQIYAVPWQGTMTLSKSTCQDA : .:::.:: :::. :.: .: ::.: CCDS13 KQLCSFEIYEVPWEDRMSLVNSRCQEA 120 130 140 >>CCDS13160.1 CST1 gene_id:1469|Hs108|chr20 (141 aa) initn: 573 init1: 473 opt: 523 Z-score: 652.7 bits: 126.7 E(32554): 4.9e-30 Smith-Waterman score: 523; 55.1% identity (78.9% similar) in 147 aa overlap (1-146:1-141) 10 20 30 40 50 pF1KE2 MAGPLRAPLLLLAILAVALAVSPAAGSSPGKPPRLV-GGPMDASVEEEGVRRALDFAVGE :: : . ::::: :::::: :: . :.. :: ..:....: :.::: ::..: CCDS13 MAQYLSTLLLLLATLAVALAWSPK------EEDRIIPGGIYNADLNDEWVQRALHFAISE 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 YNKASNDMYHSRALQVVRARKQIVAGVNYFLDVELGRTTCTKTQPNLDNCPFHDQPHLKR ::::..: :. : :.:.:::.: :.:::::.:::.::: :::.:::::.: ::.::.:.. CCDS13 YNKATKDDYYRRPLRVLRARQQTVGGVNYFFDVEVGRTICTKSQPNLDTCAFHEQPELQK 60 70 80 90 100 110 120 130 140 pF1KE2 KAFCSFQIYAVPWQGTMTLSKSTCQDA : .:::.:: :::.. .: :: ::.. CCDS13 KQLCSFEIYEVPWENRRSLVKSRCQES 120 130 140 >>CCDS13162.1 CST5 gene_id:1473|Hs108|chr20 (142 aa) initn: 482 init1: 334 opt: 488 Z-score: 609.5 bits: 118.7 E(32554): 1.3e-27 Smith-Waterman score: 488; 51.0% identity (78.6% similar) in 145 aa overlap (1-144:1-140) 10 20 30 40 50 60 pF1KE2 MAGPLRAPLLLLAILAVALAVSPAAGSSPGKPPRLVGGPMDASVEEEGVRRALDFAVGEY : :...:::::. : ::.: ::. .. :.:: .......:. :::::..:: CCDS13 MMWPMHTPLLLLTALMVAVA-----GSASAQSRTLAGGIHATDLNDKSVQCALDFAISEY 10 20 30 40 50 70 80 90 100 110 pF1KE2 NKASN-DMYHSRALQVVRARKQIVAGVNYFLDVELGRTTCTKTQPNLDNCPFHDQPHLKR ::. : : :.:: :::. : .:::.::::...:..:::::::.:::::::::.:::.::. CCDS13 NKVINKDEYYSRPLQVMAAYQQIVGGVNYYFNVKFGRTTCTKSQPNLDNCPFNDQPKLKE 60 70 80 90 100 110 120 130 140 pF1KE2 KAFCSFQIYAVPWQGTMTLSKSTCQDA . :::::: :::. ... . :. CCDS13 EEFCSFQINEVPWEDKISILNYKCRKV 120 130 140 >>CCDS8126.1 CST6 gene_id:1474|Hs108|chr11 (149 aa) initn: 251 init1: 193 opt: 282 Z-score: 355.3 bits: 71.7 E(32554): 1.8e-13 Smith-Waterman score: 282; 36.4% identity (65.0% similar) in 143 aa overlap (8-143:7-146) 10 20 30 40 50 pF1KE2 MAGPLRAPLLL-LAILAVALAVSPA-AGSSPGKPPRLVGGPMDASVEEEGVRRALDFAVG :: : ::..: : . : : . : . :.:: : : .. :..: . ::. CCDS81 MARSNLPLALGLALVAFCLLALPRDARARPQE--RMVGELRDLSPDDPQVQKAAQAAVA 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 EYNKASNDMYHSRALQVVRARKQIVAGVNYFLDVELGRTTCTKTQP-----NLDNCPFHD :: .::..:. : ....:..:.:::..::: .:.: : : ::. .: .::. CCDS81 SYNMGSNSIYYFRDTHIIKAQSQLVAGIKYFLTMEMGSTDCRKTRVTGDHVDLTTCPLAA 60 70 80 90 100 110 120 130 140 pF1KE2 QPHLKRKAFCSFQIYAVPWQGTMTLSKSTCQDA . ..: :.:.. .::::.. : : .: CCDS81 GAQ-QEKLRCDFEVLVVPWQNSSQLLKHNCVQM 120 130 140 >>CCDS13156.1 CST8 gene_id:10047|Hs108|chr20 (142 aa) initn: 239 init1: 239 opt: 260 Z-score: 328.5 bits: 66.7 E(32554): 5.6e-12 Smith-Waterman score: 260; 34.0% identity (66.0% similar) in 144 aa overlap (6-146:3-142) 10 20 30 40 50 pF1KE2 MAGPLRAPLLLLAILAVALAVSPAAGSSPGKPPRLVG---GPMDASVEEEGVRRALDFAV : : : .:.. ::. .: ..: : : :..:: . .:.. : ::. CCDS13 MPRCRWLSLILLTIPLAL--VARKDPKKNETGVLRKLKPVNAS--NANVKQCLWFAM 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 GEYNKASNDMYHSRALQVVRARKQIVAGVNYFLDVELGRTTCTKTQPNLDNCPFHDQPHL :::: :.: : ......:. :.. ..:..:::..:. : : . . : .... .: CCDS13 QEYNKESEDKYVFLVVKTLQAQLQVTNLLEYLIDVEIARSDCRKPLSTNEICAIQENSKL 60 70 80 90 100 110 120 130 140 pF1KE2 KRKAFCSFQIYAVPWQGTMTLSKSTCQDA ::: ::: . :.::.: .:. .. :.:: CCDS13 KRKLSCSFLVGALPWNGEFTVMEKKCEDA 120 130 140 >>CCDS13165.2 CST7 gene_id:8530|Hs108|chr20 (145 aa) initn: 270 init1: 176 opt: 254 Z-score: 321.0 bits: 65.3 E(32554): 1.5e-11 Smith-Waterman score: 254; 35.6% identity (63.0% similar) in 135 aa overlap (5-132:1-133) 10 20 30 40 50 pF1KE2 MAGPLRAPLLLLAILAVALAVSPAAGSSPGK-----PPRLVGG-PMDASVEEEGVRRALD .:: :::. :..: ..: :: :. : : .... :: .: CCDS13 MRAAGTLLAF--CCLVLSTTGGPSPDTCSQDLNSRVKPGFPKTIKTNDPGVLQAAR 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 FAVGEYNKASNDMYHSRALQVVRARKQIVAGVNYFLDVELGRTTCTKTQP-NLDNCPFHD ..: ..:. .:::. . ...:: ::: :..:.:.::.::::: :.: ::.: :. CCDS13 YSVEKFNNCTNDMFLFKESRITRALVQIVKGLKYMLEVEIGRTTCKKNQHLRLDDCDFQT 60 70 80 90 100 110 120 130 140 pF1KE2 QPHLKRKAFCSFQIYAVPWQGTMTLSKSTCQDA . ::. : ....::: CCDS13 NHTLKQTLSCYSEVWVVPWLQHFEVPVLRCH 120 130 140 146 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:36:59 2016 done: Mon Nov 7 01:36:59 2016 Total Scan time: 1.860 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]