FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB6998, 217 aa 1>>>pF1KB6998 217 - 217 aa - 217 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5602+/-0.000785; mu= 12.0401+/- 0.047 mean_var=61.8370+/-12.418, 0's: 0 Z-trim(107.3): 28 B-trim: 62 in 1/50 Lambda= 0.163098 statistics sampled from 9445 (9464) to 9445 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.68), E-opt: 0.2 (0.291), width: 16 Scan time: 2.170 The best scores are: opt bits E(32554) CCDS11649.1 CSH1 gene_id:1442|Hs108|chr17 ( 217) 1448 349.0 1.4e-96 CCDS42369.1 CSH2 gene_id:1443|Hs108|chr17 ( 217) 1437 346.4 8.1e-96 CCDS11652.1 CSHL1 gene_id:1444|Hs108|chr17 ( 222) 1284 310.4 5.7e-85 CCDS11653.1 GH1 gene_id:2688|Hs108|chr17 ( 217) 1228 297.2 5.2e-81 CCDS11647.1 GH2 gene_id:2689|Hs108|chr17 ( 217) 1129 273.9 5.3e-74 CCDS11646.1 CSH2 gene_id:1443|Hs108|chr17 ( 167) 996 242.6 1.1e-64 CCDS45760.1 GH1 gene_id:2688|Hs108|chr17 ( 202) 837 205.2 2.4e-53 CCDS45759.1 CSHL1 gene_id:1444|Hs108|chr17 ( 139) 819 200.9 3.2e-52 CCDS82189.1 CSHL1 gene_id:1444|Hs108|chr17 ( 160) 819 200.9 3.7e-52 CCDS45758.1 GH2 gene_id:2689|Hs108|chr17 ( 202) 782 192.3 1.9e-49 CCDS42370.1 CSHL1 gene_id:1444|Hs108|chr17 ( 128) 774 190.3 4.6e-49 CCDS45757.1 GH2 gene_id:2689|Hs108|chr17 ( 245) 735 181.2 4.8e-46 CCDS11648.1 GH2 gene_id:2689|Hs108|chr17 ( 256) 732 180.5 8.2e-46 CCDS11654.1 GH1 gene_id:2688|Hs108|chr17 ( 177) 714 176.2 1.1e-44 CCDS42368.1 CSH2 gene_id:1443|Hs108|chr17 ( 122) 446 113.1 7.6e-26 CCDS4548.1 PRL gene_id:5617|Hs108|chr6 ( 227) 249 66.9 1.2e-11 >>CCDS11649.1 CSH1 gene_id:1442|Hs108|chr17 (217 aa) initn: 1448 init1: 1448 opt: 1448 Z-score: 1847.7 bits: 349.0 E(32554): 1.4e-96 Smith-Waterman score: 1448; 100.0% identity (100.0% similar) in 217 aa overlap (1-217:1-217) 10 20 30 40 50 60 pF1KB6 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD 130 140 150 160 170 180 190 200 210 pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF ::::::::::::::::::::::::::::::::::::: CCDS11 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF 190 200 210 >>CCDS42369.1 CSH2 gene_id:1443|Hs108|chr17 (217 aa) initn: 1437 init1: 1437 opt: 1437 Z-score: 1833.7 bits: 346.4 E(32554): 8.1e-96 Smith-Waterman score: 1437; 99.5% identity (99.5% similar) in 217 aa overlap (1-217:1-217) 10 20 30 40 50 60 pF1KB6 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET :: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MAAGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD 130 140 150 160 170 180 190 200 210 pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF ::::::::::::::::::::::::::::::::::::: CCDS42 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF 190 200 210 >>CCDS11652.1 CSHL1 gene_id:1444|Hs108|chr17 (222 aa) initn: 1292 init1: 957 opt: 1284 Z-score: 1638.9 bits: 310.4 E(32554): 5.7e-85 Smith-Waterman score: 1284; 90.1% identity (93.2% similar) in 222 aa overlap (1-217:1-222) 10 20 30 40 50 pF1KB6 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEF--- :: ::::::::::::::::::::::::::::::::: .::::::::::::::::::: CCDS11 MAAGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFKEAMLQAHRAHQLAIDTYQEFISS 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB6 --EETYIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEP :.:: :.::::::::::::::::::::: :::::::::::::::.:::::::: ::: CCDS11 WGMEAYITKEQKYSFLHDSQTSFCFSDSIPTSSNMEETQQKSNLELLHISLLLIESRLEP 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB6 VRFLRSMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTN :::::: :.:::::::::::::::::::::::: :::::::::. ::: ::::::::::: CCDS11 VRFLRSTFTNNLVYDTSDSDDYHLLKDLEEGIQMLMGRLEDGSHLTGQTLKQTYSKFDTN 130 140 150 160 170 180 180 190 200 210 pF1KB6 SHNHDALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF ::::::::::::::.::::::::::::::::::::::::::: CCDS11 SHNHDALLKNYGLLHCFRKDMDKVETFLRMVQCRSVEGSCGF 190 200 210 220 >>CCDS11653.1 GH1 gene_id:2688|Hs108|chr17 (217 aa) initn: 1228 init1: 1228 opt: 1228 Z-score: 1567.9 bits: 297.2 E(32554): 5.2e-81 Smith-Waterman score: 1228; 85.3% identity (94.9% similar) in 217 aa overlap (1-217:1-217) 10 20 30 40 50 60 pF1KB6 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET :: ::::::::::.:::::::::..: :.:::::::.:::.::: ::::.::::::::. CCDS11 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR ::::.::::::.. :::.:::.::::::: :::::::::::::::::::.::::::.::: CCDS11 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD :.:::.::: .:::. : :::::::::::::::::::: :::::.::::::::::::: : CCDS11 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD 130 140 150 160 170 180 190 200 210 pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF ::::::::::::::::::::::::.:::::::::::: CCDS11 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF 190 200 210 >>CCDS11647.1 GH2 gene_id:2689|Hs108|chr17 (217 aa) initn: 1129 init1: 1129 opt: 1129 Z-score: 1442.0 bits: 273.9 E(32554): 5.3e-74 Smith-Waterman score: 1129; 79.7% identity (92.6% similar) in 217 aa overlap (1-217:1-217) 10 20 30 40 50 60 pF1KB6 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET :: ::::::::::.:::: ::::..: :.:::::::.:::.:.: .::: ::::::::. CCDS11 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR :: :.::::::.. :::.:::.::::::: .:::::::::::::::::.::::::..:: CCDS11 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD :.:::.::: .:::. :. :::::::::::: :::::: :::::..:.::::::.::: : CCDS11 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD 130 140 150 160 170 180 190 200 210 pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF ::::::::::::::::::::::::.:::::::::::: CCDS11 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF 190 200 210 >>CCDS11646.1 CSH2 gene_id:1443|Hs108|chr17 (167 aa) initn: 991 init1: 991 opt: 996 Z-score: 1274.7 bits: 242.6 E(32554): 1.1e-64 Smith-Waterman score: 996; 93.4% identity (94.0% similar) in 166 aa overlap (1-165:1-166) 10 20 30 40 50 60 pF1KB6 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET :: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MAAGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR 70 80 90 100 110 120 130 140 150 160 170 pF1KB6 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMG-RLEDGSRRTGQILKQTYSKFDTNSHNH :::::::::::::::::::::::::::::::: :. : : : CCDS11 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGVRVAPGVANPGTPLA 130 140 150 160 180 190 200 210 pF1KB6 DALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF >>CCDS45760.1 GH1 gene_id:2688|Hs108|chr17 (202 aa) initn: 1131 init1: 837 opt: 837 Z-score: 1071.2 bits: 205.2 E(32554): 2.4e-53 Smith-Waterman score: 1105; 79.7% identity (88.0% similar) in 217 aa overlap (1-217:1-202) 10 20 30 40 50 60 pF1KB6 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET :: ::::::::::.:::::::::..: :.:::::::.:::.::: ::::.:::::: CCDS45 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEF--- 10 20 30 40 50 70 80 90 100 110 120 pF1KB6 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR . :::.:::.::::::: :::::::::::::::::::.::::::.::: CCDS45 ------------NPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR 60 70 80 90 100 130 140 150 160 170 180 pF1KB6 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD :.:::.::: .:::. : :::::::::::::::::::: :::::.::::::::::::: : CCDS45 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD 110 120 130 140 150 160 190 200 210 pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF ::::::::::::::::::::::::.:::::::::::: CCDS45 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF 170 180 190 200 >>CCDS45759.1 CSHL1 gene_id:1444|Hs108|chr17 (139 aa) initn: 819 init1: 819 opt: 819 Z-score: 1050.9 bits: 200.9 E(32554): 3.2e-52 Smith-Waterman score: 819; 92.0% identity (95.6% similar) in 137 aa overlap (81-217:3-139) 60 70 80 90 100 110 pF1KB6 IDTYQEFEETYIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIE .::::: :::::::::::::::.::::::: CCDS45 MAADSIPTSSNMEETQQKSNLELLHISLLLIE 10 20 30 120 130 140 150 160 170 pF1KB6 SWLEPVRFLRSMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYS : ::::::::: :.:::::::::::::::::::::::: :::::::::. ::: :::::: CCDS45 SRLEPVRFLRSTFTNNLVYDTSDSDDYHLLKDLEEGIQMLMGRLEDGSHLTGQTLKQTYS 40 50 60 70 80 90 180 190 200 210 pF1KB6 KFDTNSHNHDALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF :::::::::::::::::::.::::::::::::::::::::::::::: CCDS45 KFDTNSHNHDALLKNYGLLHCFRKDMDKVETFLRMVQCRSVEGSCGF 100 110 120 130 >>CCDS82189.1 CSHL1 gene_id:1444|Hs108|chr17 (160 aa) initn: 819 init1: 819 opt: 819 Z-score: 1049.9 bits: 200.9 E(32554): 3.7e-52 Smith-Waterman score: 844; 68.7% identity (70.5% similar) in 217 aa overlap (1-217:1-160) 10 20 30 40 50 60 pF1KB6 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET :: ::::::::::::::::::::: CCDS82 MAAGSRTSLLLAFALLCLPWLQEA------------------------------------ 10 20 70 80 90 100 110 120 pF1KB6 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR ::::: :::::::::::::::.:::::::: :::::::: CCDS82 ---------------------DSIPTSSNMEETQQKSNLELLHISLLLIESRLEPVRFLR 30 40 50 60 130 140 150 160 170 180 pF1KB6 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD : :.:::::::::::::::::::::::: :::::::::. ::: :::::::::::::::: CCDS82 STFTNNLVYDTSDSDDYHLLKDLEEGIQMLMGRLEDGSHLTGQTLKQTYSKFDTNSHNHD 70 80 90 100 110 120 190 200 210 pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF :::::::::.::::::::::::::::::::::::::: CCDS82 ALLKNYGLLHCFRKDMDKVETFLRMVQCRSVEGSCGF 130 140 150 160 >>CCDS45758.1 GH2 gene_id:2689|Hs108|chr17 (202 aa) initn: 1046 init1: 782 opt: 782 Z-score: 1001.2 bits: 192.3 E(32554): 1.9e-49 Smith-Waterman score: 1020; 74.7% identity (86.2% similar) in 217 aa overlap (1-217:1-202) 10 20 30 40 50 60 pF1KB6 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET :: ::::::::::.:::: ::::..: :.:::::::.:::.:.: .::: :::::: CCDS45 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEF--- 10 20 30 40 50 70 80 90 100 110 120 pF1KB6 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR . :::.:::.::::::: .:::::::::::::::::.::::::..:: CCDS45 ------------NPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR 60 70 80 90 100 130 140 150 160 170 180 pF1KB6 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD :.:::.::: .:::. :. :::::::::::: :::::: :::::..:.::::::.::: : CCDS45 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD 110 120 130 140 150 160 190 200 210 pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF ::::::::::::::::::::::::.:::::::::::: CCDS45 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF 170 180 190 200 217 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 02:17:02 2016 done: Sat Nov 5 02:17:03 2016 Total Scan time: 2.170 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]