FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB6977, 217 aa
  1>>>pF1KB6977 217 - 217 aa - 217 aa
Library: human.CCDS.faa
  18921897 residues in 33420 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4595+/-0.000769; mu= 12.9240+/- 0.046
 mean_var=64.5087+/-12.852, 0's: 0 Z-trim(108.2): 23 B-trim: 278 in 1/52
 Lambda= 0.159685
 statistics sampled from 10141 (10164) to 10141 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.69), E-opt: 0.2 (0.304), width: 16
 Scan time: 1.000

The best scores are:                                opt  bits E(33420)
CCDS11653.1 GH1 gene_id:2688|Hs109|chr17    ( 217) 1439 339.8 7.8e-94
CCDS11647.1 GH2 gene_id:2689|Hs109|chr17    ( 217) 1321 312.7 1.2e-85
CCDS42369.1 CSH2 gene_id:1443|Hs109|chr17   ( 217) 1229 291.5 2.9e-79
CCDS11649.1 CSH1 gene_id:1442|Hs109|chr17   ( 217) 1228 291.2 3.4e-79
CCDS11652.1 CSHL1 gene_id:1444|Hs109|chr17  ( 222) 1120 266.4 1.1e-71
CCDS45760.1 GH1 gene_id:2688|Hs109|chr17    ( 202)  958 229.0 1.7e-60
CCDS11648.1 GH2 gene_id:2689|Hs109|chr17    ( 256)  904 216.6 1.1e-56
CCDS45757.1 GH2 gene_id:2689|Hs109|chr17    ( 245)  895 214.5 4.6e-56
CCDS45758.1 GH2 gene_id:2689|Hs109|chr17    ( 202)  892 213.8 6.3e-56
CCDS11646.1 CSH2 gene_id:1443|Hs109|chr17   ( 167)  818 196.7 7.3e-51
CCDS11654.1 GH1 gene_id:2688|Hs109|chr17    ( 177)  802 193.1 9.8e-50
CCDS45759.1 CSHL1 gene_id:1444|Hs109|chr17  ( 139)  724 175.0 2e-44
CCDS82189.1 CSHL1 gene_id:1444|Hs109|chr17  ( 160)  724 175.1 2.3e-44
CCDS42370.1 CSHL1 gene_id:1444|Hs109|chr17  ( 128)  687 166.5 7e-42
CCDS42368.1 CSH2 gene_id:1443|Hs109|chr17   ( 122)  421 105.2 1.9e-23
CCDS4548.1 PRL gene_id:5617|Hs109|chr6      ( 227)  270  70.5 9.6e-13

>>CCDS11653.1 GH1 gene_id:2688|Hs109|chr17 (217 aa)
initn: 1439 init1: 1439 opt: 1439 Z-score: 1798.3 bits: 339.8 E(33420): 7.8e-94
Smith-Waterman score: 1439; 100.0% identity (100.0% similar) in 217 aa overlap (1-217:1-217)

               10        20        30        40        50        60
pF1KB6 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD
              130       140       150       160       170       180

              190       200       210
pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF
       :::::::::::::::::::::::::::::::::::::
CCDS11 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF
              190       200       210

>>CCDS11647.1 GH2 gene_id:2689|Hs109|chr17 (217 aa)
initn: 1321 init1: 1321 opt: 1321 Z-score: 1651.3 bits: 312.7 E(33420): 1.2e-85
Smith-Waterman score: 1321; 93.1% identity (97.2% similar) in 217 aa overlap (1-217:1-217)

               10        20        30        40        50        60
pF1KB6 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA
       ::.::::::::::::::: ::::::::::::::::::::::::.::.:::.:::::::::
CCDS11 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR
       :: ::::::::::::::::::::::::::: .:::::::::::::::::::::::::.::
CCDS11 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD
       :::::::::::::::::  :::::::::::: :::::::::::::.:.::::::.:::::
CCDS11 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD
              130       140       150       160       170       180

              190       200       210
pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF
       :::::::::::::::::::::::::::::::::::::
CCDS11 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF
              190       200       210

>>CCDS42369.1 CSH2 gene_id:1443|Hs109|chr17 (217 aa)
initn: 1229 init1: 1229 opt: 1229 Z-score: 1536.8 bits: 291.5 E(33420): 2.9e-79
Smith-Waterman score: 1229; 85.3% identity (95.4% similar) in 217 aa overlap (1-217:1-217)

               10        20        30        40        50        60
pF1KB6 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA
       ::.::::::::::.:::::::::..:  :.:::::::.:::.::: ::::.::::::::.
CCDS42 MAAGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR
       ::::.::::::.. :::.:::.::::::: :::::::::::::::::::.::::::.:::
CCDS42 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD
       :.:::.::: .:::. : :::::::::::::::::::: :::::.::::::::::::: :
CCDS42 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD
              130       140       150       160       170       180

              190       200       210
pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF
       ::::::::::::::::::::::::.::::::::::::
CCDS42 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF
              190       200       210

>>CCDS11649.1 CSH1 gene_id:1442|Hs109|chr17 (217 aa)
initn: 1228 init1: 1228 opt: 1228 Z-score: 1535.6 bits: 291.2 E(33420): 3.4e-79
Smith-Waterman score: 1228; 85.3% identity (94.9% similar) in 217 aa overlap (1-217:1-217)

               10        20        30        40        50        60
pF1KB6 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA
       :: ::::::::::.:::::::::..:  :.:::::::.:::.::: ::::.::::::::.
CCDS11 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR
       ::::.::::::.. :::.:::.::::::: :::::::::::::::::::.::::::.:::
CCDS11 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD
       :.:::.::: .:::. : :::::::::::::::::::: :::::.::::::::::::: :
CCDS11 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD
              130       140       150       160       170       180

              190       200       210
pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF
       ::::::::::::::::::::::::.::::::::::::
CCDS11 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF
              190       200       210

>>CCDS11652.1 CSHL1 gene_id:1444|Hs109|chr17 (222 aa)
initn: 1128 init1: 843 opt: 1120 Z-score: 1400.9 bits: 266.4 E(33420): 1.1e-71
Smith-Waterman score: 1120; 79.3% identity (89.6% similar) in 222 aa overlap (1-217:1-222)

               10        20        30        40        50
pF1KB6 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEF---
       ::.::::::::::.:::::::::..:  :.:::::: .:::.::: ::::.::::::
CCDS11 MAAGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFKEAMLQAHRAHQLAIDTYQEFISS
               10        20        30        40        50        60

          60        70        80        90       100       110
pF1KB6 --EEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEP
          :::: ::::::::.. :::.:::.:::: :: ::::::::::::.::::::.: :::
CCDS11 WGMEAYITKEQKYSFLHDSQTSFCFSDSIPTSSNMEETQQKSNLELLHISLLLIESRLEP
               70        80        90       100       110       120

         120       130       140       150       160       170
pF1KB6 VQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTN
       :.::::.:.:.::: .:::. : :::::::::: :::::::::  ::: .::::::::::
CCDS11 VRFLRSTFTNNLVYDTSDSDDYHLLKDLEEGIQMLMGRLEDGSHLTGQTLKQTYSKFDTN
              130       140       150       160       170       180

         180       190       200       210
pF1KB6 SHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF
       ::: ::::::::::.::::::::::::::.::::::::::::
CCDS11 SHNHDALLKNYGLLHCFRKDMDKVETFLRMVQCRSVEGSCGF
              190       200       210       220

>>CCDS45760.1 GH1 gene_id:2688|Hs109|chr17 (202 aa)
initn: 1327 init1: 958 opt: 958 Z-score: 1199.9 bits: 229.0 E(33420): 1.7e-60
Smith-Waterman score: 1301; 93.1% identity (93.1% similar) in 217 aa overlap (1-217:1-202)

               10        20        30        40        50        60
pF1KB6 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEF---
               10        20        30        40        50

               70        80        90       100       110       120
pF1KB6 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR
                   ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 ------------NPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR
                    60        70        80        90       100

              130       140       150       160       170       180
pF1KB6 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD
         110       120       130       140       150       160

              190       200       210
pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF
       :::::::::::::::::::::::::::::::::::::
CCDS45 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF
         170       180       190       200

>>CCDS11648.1 GH2 gene_id:2689|Hs109|chr17 (256 aa)
initn: 892 init1: 892 opt: 904 Z-score: 1131.0 bits: 216.6 E(33420): 1.1e-56
Smith-Waterman score: 904; 88.3% identity (92.6% similar) in 163 aa overlap (1-162:1-163)

               10        20        30        40        50        60
pF1KB6 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA
       ::.::::::::::::::: ::::::::::::::::::::::::.::.:::.:::::::::
CCDS11 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR
       :: ::::::::::::::::::::::::::: .:::::::::::::::::::::::::.::
CCDS11 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR
               70        80        90       100       110       120

              130       140       150        160       170
pF1KB6 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMG-RLEDGSPRTGQIFKQTYSKFDTNSHND
       :::::::::::::::::  ::::::::::::  :.  : :  :
CCDS11 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWVRVAPGIPNPGAPLASRDWGEKHCCPLF
              130       140       150       160       170       180

     180       190       200       210
pF1KB6 DALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF

CCDS11 SSQALTQENSPYSSFPLVNPPGLSLQPGGEGGKWMNERGREQCPSAWPLLLFLHFAEAGR
              190       200       210       220       230       240

>>CCDS45757.1 GH2 gene_id:2689|Hs109|chr17 (245 aa)
initn: 931 init1: 895 opt: 895 Z-score: 1120.1 bits: 214.5 E(33420): 4.6e-56
Smith-Waterman score: 895; 92.1% identity (96.7% similar) in 152 aa overlap (1-152:1-152)

               10        20        30        40        50        60
pF1KB6 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA
       ::.::::::::::::::: ::::::::::::::::::::::::.::.:::.:::::::::
CCDS45 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR
       :: ::::::::::::::::::::::::::: .:::::::::::::::::::::::::.::
CCDS45 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD
       :::::::::::::::::  :::::::::::.:
CCDS45 SVFANSLVYGASDSNVYRHLKDLEEGIQTLIGWKMAAPGLGRSSISPTASLTQNRTTMTH
              130       140       150       160       170       180

              190       200       210
pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF

CCDS45 CSRTTGCSTASGRTWTRSRHSCASCSAALWRAAVASSCPGGIPVTPPQCLSWSWKVLLQC
              190       200       210       220       230       240

>>CCDS45758.1 GH2 gene_id:2689|Hs109|chr17 (202 aa)
initn: 1223 init1: 892 opt: 892 Z-score: 1117.7 bits: 213.8 E(33420): 6.3e-56
Smith-Waterman score: 1197; 86.6% identity (90.8% similar) in 217 aa overlap (1-217:1-202)

               10        20        30        40        50        60
pF1KB6 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA
       ::.::::::::::::::: ::::::::::::::::::::::::.::.:::.::::::
CCDS45 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEF---
               10        20        30        40        50

               70        80        90       100       110       120
pF1KB6 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR
                   :::::::::::::::::: .:::::::::::::::::::::::::.::
CCDS45 ------------NPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR
                    60        70        80        90       100

              130       140       150       160       170       180
pF1KB6 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD
       :::::::::::::::::  :::::::::::: :::::::::::::.:.::::::.:::::
CCDS45 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD
         110       120       130       140       150       160

              190       200       210
pF1KB6 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF
       :::::::::::::::::::::::::::::::::::::
CCDS45 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF
         170       180       190       200

>>CCDS11646.1 CSH2 gene_id:1443|Hs109|chr17 (167 aa)
initn: 813 init1: 813 opt: 818 Z-score: 1026.9 bits: 196.7 E(33420): 7.3e-51
Smith-Waterman score: 818; 77.9% identity (90.8% similar) in 163 aa overlap (1-162:1-163)

               10        20        30        40        50        60
pF1KB6 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA
       ::.::::::::::.:::::::::..:  :.:::::::.:::.::: ::::.::::::::.
CCDS11 MAAGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR
       ::::.::::::.. :::.:::.::::::: :::::::::::::::::::.::::::.:::
CCDS11 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR
               70        80        90       100       110       120

              130       140       150        160       170
pF1KB6 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMG-RLEDGSPRTGQIFKQTYSKFDTNSHND
       :.:::.::: .:::. : :::::::::::::: :.  :    :
CCDS11 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGVRVAPGVANPGTPLA
              130       140       150       160

     180       190       200       210
pF1KB6 DALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF


217 residues in 1 query sequences
18921897 residues in 33420 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Jul 3 20:36:07 2020 done: Fri Jul 3 20:36:08 2020
 Total Scan time: 1.000 Total Display time: 0.030

Function used was FASTA [36.3.4 Apr, 2011]