FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5275, 217 aa 1>>>pF1KE5275 217 - 217 aa - 217 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4180+/-0.000794; mu= 13.1804+/- 0.048 mean_var=64.6059+/-12.810, 0's: 0 Z-trim(106.6): 20 B-trim: 0 in 0/53 Lambda= 0.159565 statistics sampled from 9066 (9082) to 9066 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.661), E-opt: 0.2 (0.279), width: 16 Scan time: 2.070 The best scores are: opt bits E(32554) CCDS11647.1 GH2 gene_id:2689|Hs108|chr17 ( 217) 1431 337.9 3e-93 CCDS11653.1 GH1 gene_id:2688|Hs108|chr17 ( 217) 1321 312.6 1.3e-85 CCDS42369.1 CSH2 gene_id:1443|Hs108|chr17 ( 217) 1135 269.7 9.7e-73 CCDS11649.1 CSH1 gene_id:1442|Hs108|chr17 ( 217) 1129 268.4 2.5e-72 CCDS11652.1 CSHL1 gene_id:1444|Hs108|chr17 ( 222) 1040 247.9 3.8e-66 CCDS11648.1 GH2 gene_id:2689|Hs108|chr17 ( 256) 998 238.2 3.5e-63 CCDS45757.1 GH2 gene_id:2689|Hs108|chr17 ( 245) 974 232.7 1.5e-61 CCDS45758.1 GH2 gene_id:2689|Hs108|chr17 ( 202) 972 232.2 1.8e-61 CCDS45760.1 GH1 gene_id:2688|Hs108|chr17 ( 202) 893 214.0 5.4e-56 CCDS11654.1 GH1 gene_id:2688|Hs108|chr17 ( 177) 748 180.6 5.4e-46 CCDS11646.1 CSH2 gene_id:1443|Hs108|chr17 ( 167) 740 178.8 1.8e-45 CCDS45759.1 CSHL1 gene_id:1444|Hs108|chr17 ( 139) 669 162.4 1.3e-40 CCDS82189.1 CSHL1 gene_id:1444|Hs108|chr17 ( 160) 669 162.4 1.5e-40 CCDS42370.1 CSHL1 gene_id:1444|Hs108|chr17 ( 128) 635 154.5 2.7e-38 CCDS42368.1 CSH2 gene_id:1443|Hs108|chr17 ( 122) 411 103.0 8.8e-23 >>CCDS11647.1 GH2 gene_id:2689|Hs108|chr17 (217 aa) initn: 1431 init1: 1431 opt: 1431 Z-score: 1787.6 bits: 337.9 E(32554): 3e-93 Smith-Waterman score: 1431; 100.0% identity (100.0% similar) in 217 aa overlap (1-217:1-217) 10 20 30 40 50 60 pF1KE5 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD 130 140 150 160 170 180 190 200 210 pF1KE5 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF ::::::::::::::::::::::::::::::::::::: CCDS11 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF 190 200 210 >>CCDS11653.1 GH1 gene_id:2688|Hs108|chr17 (217 aa) initn: 1321 init1: 1321 opt: 1321 Z-score: 1650.8 bits: 312.6 E(32554): 1.3e-85 Smith-Waterman score: 1321; 93.1% identity (97.2% similar) in 217 aa overlap (1-217:1-217) 10 20 30 40 50 60 pF1KE5 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA ::.::::::::::::::: ::::::::::::::::::::::::.::.:::.::::::::: CCDS11 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR :: ::::::::::::::::::::::::::: .:::::::::::::::::::::::::.:: CCDS11 YIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD ::::::::::::::::: :::::::::::: :::::::::::::.:.::::::.::::: CCDS11 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD 130 140 150 160 170 180 190 200 210 pF1KE5 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF ::::::::::::::::::::::::::::::::::::: CCDS11 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF 190 200 210 >>CCDS42369.1 CSH2 gene_id:1443|Hs108|chr17 (217 aa) initn: 1135 init1: 1135 opt: 1135 Z-score: 1419.4 bits: 269.7 E(32554): 9.7e-73 Smith-Waterman score: 1135; 80.2% identity (93.1% similar) in 217 aa overlap (1-217:1-217) 10 20 30 40 50 60 pF1KE5 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA :::::::::::::.:::: ::::..: :.:::::::.:::.:.: .::: ::::::::. CCDS42 MAAGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR :: :.::::::.. :::.:::.::::::: .:::::::::::::::::.::::::..:: CCDS42 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD :.:::.::: .:::. :. :::::::::::: :::::: :::::..:.::::::.::: : CCDS42 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD 130 140 150 160 170 180 190 200 210 pF1KE5 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF ::::::::::::::::::::::::.:::::::::::: CCDS42 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF 190 200 210 >>CCDS11649.1 CSH1 gene_id:1442|Hs108|chr17 (217 aa) initn: 1129 init1: 1129 opt: 1129 Z-score: 1411.9 bits: 268.4 E(32554): 2.5e-72 Smith-Waterman score: 1129; 79.7% identity (92.6% similar) in 217 aa overlap (1-217:1-217) 10 20 30 40 50 60 pF1KE5 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA :: ::::::::::.:::: ::::..: :.:::::::.:::.:.: .::: ::::::::. CCDS11 MAPGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEET 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR :: :.::::::.. :::.:::.::::::: .:::::::::::::::::.::::::..:: CCDS11 YIPKDQKYSFLHDSQTSFCFSDSIPTPSNMEETQQKSNLELLRISLLLIESWLEPVRFLR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD :.:::.::: .:::. :. :::::::::::: :::::: :::::..:.::::::.::: : CCDS11 SMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEDGSRRTGQILKQTYSKFDTNSHNHD 130 140 150 160 170 180 190 200 210 pF1KE5 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF ::::::::::::::::::::::::.:::::::::::: CCDS11 ALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSCGF 190 200 210 >>CCDS11652.1 CSHL1 gene_id:1444|Hs108|chr17 (222 aa) initn: 1048 init1: 788 opt: 1040 Z-score: 1301.0 bits: 247.9 E(32554): 3.8e-66 Smith-Waterman score: 1040; 74.8% identity (87.8% similar) in 222 aa overlap (1-217:1-222) 10 20 30 40 50 pF1KE5 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEF--- :::::::::::::.:::: ::::..: :.:::::: .:::.:.: .::: :::::: CCDS11 MAAGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFKEAMLQAHRAHQLAIDTYQEFISS 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 --EEAYILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEP :::: ::::::::.. :::.:::.:::: :: .::::::::::.::::::.: ::: CCDS11 WGMEAYITKEQKYSFLHDSQTSFCFSDSIPTSSNMEETQQKSNLELLHISLLLIESRLEP 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 VQLLRSVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTK :..:::.:.:.::: .:::. :. ::::::::: :: :::::: ::: ..:.::::::. CCDS11 VRFLRSTFTNNLVYDTSDSDDYHLLKDLEEGIQMLMGRLEDGSHLTGQTLKQTYSKFDTN 130 140 150 160 170 180 180 190 200 210 pF1KE5 SHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF ::: ::::::::::.::::::::::::::.:::::::::::: CCDS11 SHNHDALLKNYGLLHCFRKDMDKVETFLRMVQCRSVEGSCGF 190 200 210 220 >>CCDS11648.1 GH2 gene_id:2689|Hs108|chr17 (256 aa) initn: 983 init1: 983 opt: 998 Z-score: 1247.8 bits: 238.2 E(32554): 3.5e-63 Smith-Waterman score: 998; 95.7% identity (96.3% similar) in 163 aa overlap (1-162:1-163) 10 20 30 40 50 60 pF1KE5 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMW-RLEDGSPRTGQIFNQSYSKFDTKSHND :::::::::::::::::::::::::::::::: :. : : : CCDS11 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWVRVAPGIPNPGAPLASRDWGEKHCCPLF 130 140 150 160 170 180 180 190 200 210 pF1KE5 DALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF CCDS11 SSQALTQENSPYSSFPLVNPPGLSLQPGGEGGKWMNERGREQCPSAWPLLLFLHFAEAGR 190 200 210 220 230 240 >>CCDS45757.1 GH2 gene_id:2689|Hs108|chr17 (245 aa) initn: 982 init1: 963 opt: 974 Z-score: 1218.3 bits: 232.7 E(32554): 1.5e-61 Smith-Waterman score: 974; 93.3% identity (97.0% similar) in 164 aa overlap (1-163:1-162) 10 20 30 40 50 60 pF1KE5 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 SVFANSLVYGASDSNVYRHLKDLEEGIQTLM-WRLEDGSPRTGQIFNQSYSKFDTKSHND ::::::::::::::::::::::::::::::. :.. ..: :. CCDS45 SVFANSLVYGASDSNVYRHLKDLEEGIQTLIGWKM--AAPGLGRSSISPTASLTQNRTTM 130 140 150 160 170 180 190 200 210 pF1KE5 DALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF CCDS45 THCSRTTGCSTASGRTWTRSRHSCASCSAALWRAAVASSCPGGIPVTPPQCLSWSWKVLL 180 190 200 210 220 230 >>CCDS45758.1 GH2 gene_id:2689|Hs108|chr17 (202 aa) initn: 1324 init1: 965 opt: 972 Z-score: 1217.1 bits: 232.2 E(32554): 1.8e-61 Smith-Waterman score: 1298; 93.1% identity (93.1% similar) in 217 aa overlap (1-217:1-202) 10 20 30 40 50 60 pF1KE5 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEF--- 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR :::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 ------------NPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR 60 70 80 90 100 130 140 150 160 170 180 pF1KE5 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD 110 120 130 140 150 160 190 200 210 pF1KE5 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF ::::::::::::::::::::::::::::::::::::: CCDS45 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF 170 180 190 200 >>CCDS45760.1 GH1 gene_id:2688|Hs108|chr17 (202 aa) initn: 1223 init1: 892 opt: 893 Z-score: 1118.8 bits: 214.0 E(32554): 5.4e-56 Smith-Waterman score: 1197; 86.6% identity (90.8% similar) in 217 aa overlap (1-217:1-202) 10 20 30 40 50 60 pF1KE5 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA ::.::::::::::::::: ::::::::::::::::::::::::.::.:::.:::::: CCDS45 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEF--- 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR :::::::::::::::::: .:::::::::::::::::::::::::.:: CCDS45 ------------NPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLR 60 70 80 90 100 130 140 150 160 170 180 pF1KE5 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD ::::::::::::::::: :::::::::::: :::::::::::::.:.::::::.::::: CCDS45 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD 110 120 130 140 150 160 190 200 210 pF1KE5 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF ::::::::::::::::::::::::::::::::::::: CCDS45 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF 170 180 190 200 >>CCDS11654.1 GH1 gene_id:2688|Hs108|chr17 (177 aa) initn: 748 init1: 748 opt: 748 Z-score: 939.3 bits: 180.6 E(32554): 5.4e-46 Smith-Waterman score: 994; 76.0% identity (79.7% similar) in 217 aa overlap (1-217:1-177) 10 20 30 40 50 60 pF1KE5 MAAGSRTSLLLAFGLLCLSWLQEGSAFPTIPLSRLFDNAMLRARRLYQLAYDTYQEFEEA ::.::::::::::::::: ::::::::::::::::::::::::.::.:::.:::::: CCDS11 MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEF--- 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 YILKEQKYSFLQNPQTSLCFSESIPTPSNRVKTQQKSNLELLRISLLLIQSWLEPVQLLR ::::::::::::::::::::.:: CCDS11 -------------------------------------NLELLRISLLLIQSWLEPVQFLR 60 70 80 130 140 150 160 170 180 pF1KE5 SVFANSLVYGASDSNVYRHLKDLEEGIQTLMWRLEDGSPRTGQIFNQSYSKFDTKSHNDD ::::::::::::::::: :::::::::::: :::::::::::::.:.::::::.::::: CCDS11 SVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDD 90 100 110 120 130 140 190 200 210 pF1KE5 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF ::::::::::::::::::::::::::::::::::::: CCDS11 ALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF 150 160 170 217 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 07:26:39 2016 done: Tue Nov 8 07:26:39 2016 Total Scan time: 2.070 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]