FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB3389, 452 aa 1>>>pF1KB3389 452 - 452 aa - 452 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.3315+/-0.000834; mu= 9.8640+/- 0.051 mean_var=120.3064+/-24.281, 0's: 0 Z-trim(111.4): 14 B-trim: 0 in 0/52 Lambda= 0.116931 statistics sampled from 12376 (12384) to 12376 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.723), E-opt: 0.2 (0.38), width: 16 Scan time: 3.110 The best scores are: opt bits E(32554) CCDS8698.1 ETNK1 gene_id:55500|Hs108|chr12 ( 452) 3059 526.7 1.9e-149 CCDS41760.1 ETNK1 gene_id:55500|Hs108|chr12 ( 258) 1544 271.0 1e-72 CCDS1442.2 ETNK2 gene_id:55224|Hs108|chr1 ( 386) 1149 204.4 1.6e-52 CCDS73006.1 ETNK2 gene_id:55224|Hs108|chr1 ( 394) 1144 203.6 3e-52 CCDS14099.1 CHKB gene_id:1120|Hs108|chr22 ( 395) 419 81.3 2e-15 >>CCDS8698.1 ETNK1 gene_id:55500|Hs108|chr12 (452 aa) initn: 3059 init1: 3059 opt: 3059 Z-score: 2796.6 bits: 526.7 E(32554): 1.9e-149 Smith-Waterman score: 3059; 100.0% identity (100.0% similar) in 452 aa overlap (1-452:1-452) 10 20 30 40 50 60 pF1KB3 MLCGRPRSSSDNRNFLRERAGLSSAAVQTRIGNSAASRRSPAARPPVPAPPALPRGRPGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 MLCGRPRSSSDNRNFLRERAGLSSAAVQTRIGNSAASRRSPAARPPVPAPPALPRGRPGT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 EGSTSLSAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHRCRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 EGSTSLSAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHRCRE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 GALSLLQHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 GALSLLQHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 EEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQLAKIHAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 EEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQLAKIHAI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 HAHNGWIPKSNLWLKMGKYFSLIPTGFADEDINKRFLSDIPSSQILQEEMTWMKEILSNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 HAHNGWIPKSNLWLKMGKYFSLIPTGFADEDINKRFLSDIPSSQILQEEMTWMKEILSNL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB3 GSPVVLCHNDLLCKNIIYNEKQGDVQFIDYEYSGYNYLAYDIGNHFNEFAGVSDVDYSLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 GSPVVLCHNDLLCKNIIYNEKQGDVQFIDYEYSGYNYLAYDIGNHFNEFAGVSDVDYSLY 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB3 PDRELQSQWLRAYLEAYKEFKGFGTEVTEKEVEILFIQVNQFALASHFFWGLWALIQAKY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 PDRELQSQWLRAYLEAYKEFKGFGTEVTEKEVEILFIQVNQFALASHFFWGLWALIQAKY 370 380 390 400 410 420 430 440 450 pF1KB3 STIEFDFLGYAIVRFNQYFKMKPEVTALKVPE :::::::::::::::::::::::::::::::: CCDS86 STIEFDFLGYAIVRFNQYFKMKPEVTALKVPE 430 440 450 >>CCDS41760.1 ETNK1 gene_id:55500|Hs108|chr12 (258 aa) initn: 1616 init1: 1540 opt: 1544 Z-score: 1419.1 bits: 271.0 E(32554): 1e-72 Smith-Waterman score: 1544; 97.5% identity (97.9% similar) in 236 aa overlap (1-236:1-236) 10 20 30 40 50 60 pF1KB3 MLCGRPRSSSDNRNFLRERAGLSSAAVQTRIGNSAASRRSPAARPPVPAPPALPRGRPGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 MLCGRPRSSSDNRNFLRERAGLSSAAVQTRIGNSAASRRSPAARPPVPAPPALPRGRPGT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 EGSTSLSAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHRCRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 EGSTSLSAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHRCRE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 GALSLLQHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 GALSLLQHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 EEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQLAKIHAI ::::::::::::::::::::::::::::::::::::::::::::::: : . : : CCDS41 EEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFSLSSLTLCKGKTT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 HAHNGWIPKSNLWLKMGKYFSLIPTGFADEDINKRFLSDIPSSQILQEEMTWMKEILSNL CCDS41 RCFGLTGCRGSRLLLSFF 250 >>CCDS1442.2 ETNK2 gene_id:55224|Hs108|chr1 (386 aa) initn: 1467 init1: 596 opt: 1149 Z-score: 1056.3 bits: 204.4 E(32554): 1.6e-52 Smith-Waterman score: 1461; 58.4% identity (81.5% similar) in 356 aa overlap (97-452:42-386) 70 80 90 100 110 120 pF1KB3 SAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHRCREGALSLL ::: :.. . . . ::: :. CCDS14 ASFHLRRHTPCPQCSWGMEEKAAASASCREPPGPPRAAAVAYFGISVDPDDILPGALRLI 20 30 40 50 60 70 130 140 150 160 170 180 pF1KB3 QHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRDEEVKSF :.::::: :..: . :::::::::..::: . :.: ::::.::..:::::::..::..: CCDS14 QELRPHWKPEQVRTKRFTDGITNKLVACYVEEDMQDCVLVRVYGERTELLVDRENEVRNF 80 90 100 110 120 130 190 200 210 220 230 240 pF1KB3 RVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQLAKIHAIHAHNGW ..:.::.:::.:::::.::::::..:: ::.:.:. .: .::::: ..::::.::: :: CCDS14 QLLRAHSCAPKLYCTFQNGLCYEYMQGVALEPEHIREPRLFRLIALEMAKIHTIHA-NGS 140 150 160 170 180 190 250 260 270 280 290 300 pF1KB3 IPKSNLWLKMGKYFSLIPTGFADEDINKRFLSDIPSSQILQEEMTWMKEILSNLGSPVVL .:: :: :: .::.:. ..:: . .:.:. ..:..:..:.:: ::.: ::::. CCDS14 LPKPILWHKMHNYFTLV-----KNEINPSLSADVPKVEVLERELAWLKEHLSQLESPVVF 200 210 220 230 240 310 320 330 340 350 360 pF1KB3 CHNDLLCKNIIYNEKQGDVQFIDYEYSGYNYLAYDIGNHFNEFAGVSDVDYSLYPDRELQ ::::::::::::. .: :.::::::.:::: :.::::::::::::..::: ::: :: : CCDS14 CHNDLLCKNIIYDSIKGHVRFIDYEYAGYNYQAFDIGNHFNEFAGVNEVDYCLYPARETQ 250 260 270 280 290 300 370 380 390 400 410 420 pF1KB3 SQWLRAYLEAYKEFKGFGTEVTEKEVEILFIQVNQFALASHFFWGLWALIQAKYSTIEFD :::. ::.: : : :: .::. :..:::.:::::::::.:::::: .::::.:: CCDS14 LQWLHYYLQAQK-----GMAVTPREVQRLYVQVNKFALASHFFWALWALIQNQYSTIDFD 310 320 330 340 350 360 430 440 450 pF1KB3 FLGYAIVRFNQYFKMKPEVTALKVPE :: ::..:::::::.::...::..:. CCDS14 FLRYAVIRFNQYFKVKPQASALEMPK 370 380 >>CCDS73006.1 ETNK2 gene_id:55224|Hs108|chr1 (394 aa) initn: 1212 init1: 596 opt: 1144 Z-score: 1051.6 bits: 203.6 E(32554): 3e-52 Smith-Waterman score: 1200; 56.9% identity (80.1% similar) in 311 aa overlap (97-404:42-338) 70 80 90 100 110 120 pF1KB3 SAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPE---VPKLNVTVQDQEEHRCREGAL ::: :. : ....: : .. ::: CCDS73 ASFHLRRHTPCPQCSWGMEEKAAASASCREPPGPPRAAAVAYFGISV-DPDD--ILPGAL 20 30 40 50 60 130 140 150 160 170 180 pF1KB3 SLLQHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRDEEV :.:.::::: :..: . :::::::::..::: . :.: ::::.::..:::::::..:: CCDS73 RLIQELRPHWKPEQVRTKRFTDGITNKLVACYVEEDMQDCVLVRVYGERTELLVDRENEV 70 80 90 100 110 120 190 200 210 220 230 240 pF1KB3 KSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQLAKIHAIHAH ..:..:.::.:::.:::::.::::::..:: ::.:.:. .: .::::: ..::::.::: CCDS73 RNFQLLRAHSCAPKLYCTFQNGLCYEYMQGVALEPEHIREPRLFRLIALEMAKIHTIHA- 130 140 150 160 170 180 250 260 270 280 290 300 pF1KB3 NGWIPKSNLWLKMGKYFSLIPTGFADEDINKRFLSDIPSSQILQEEMTWMKEILSNLGSP :: .:: :: :: .::.:. ..:: . .:.:. ..:..:..:.:: ::.: :: CCDS73 NGSLPKPILWHKMHNYFTLVK-----NEINPSLSADVPKVEVLERELAWLKEHLSQLESP 190 200 210 220 230 240 310 320 330 340 350 360 pF1KB3 VVLCHNDLLCKNIIYNEKQGDVQFIDYEYSGYNYLAYDIGNHFNEFAGVSDVDYSLYPDR ::.::::::::::::. .: :.::::::.:::: :.::::::::::::..::: ::: : CCDS73 VVFCHNDLLCKNIIYDSIKGHVRFIDYEYAGYNYQAFDIGNHFNEFAGVNEVDYCLYPAR 250 260 270 280 290 300 370 380 390 400 410 420 pF1KB3 ELQSQWLRAYLEAYKEFKGFGTEVTEKEVEILFIQVNQFALASHFFWGLWALIQAKYSTI : : :::. ::.: : : :: .::. :..:::.::: CCDS73 ETQLQWLHYYLQAQK-----GMAVTPREVQRLYVQVNKFALGPSCVSSTMTASLQCCRVG 310 320 330 340 350 430 440 450 pF1KB3 EFDFLGYAIVRFNQYFKMKPEVTALKVPE CCDS73 NRHGEIARLTLSGLFPGVSLLLGSLGPHPEPVLHHRL 360 370 380 390 >>CCDS14099.1 CHKB gene_id:1120|Hs108|chr22 (395 aa) initn: 527 init1: 266 opt: 419 Z-score: 390.6 bits: 81.3 E(32554): 2e-15 Smith-Waterman score: 555; 30.5% identity (59.0% similar) in 383 aa overlap (97-448:29-391) 70 80 90 100 110 120 pF1KB3 SAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHR---CREGAL : .:. . . .: :.. ::: CCDS14 MAAEATAVAGSGAVGGCLAKDGLQQSKCPDTTPKRRRASSLSRDAERRAYQWCRE--- 10 20 30 40 50 130 140 150 160 170 pF1KB3 SLLQHLRPHW---DPQEVTLQLFTDGITNKLIGCYVGNTMEDV------VLVRIYGNKTE .: : .:.:. . . :..: :. : . . . .: ::.:.:: . CCDS14 ----YLGGAWRRVQPEELRVYPVSGGLSNLLFRCSLPDHLPSVGEEPREVLLRLYGAILQ 60 70 80 90 100 110 180 190 200 210 220 230 pF1KB3 LLVDRDEEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQL . . : : .: .. .:::: .: .: ..: .. : ... .:.. :: .. CCDS14 GVDSLVLESVMFAILAERSLGPQLYGVFPEGRLEQYIPSRPLKTQELREPVLSAAIATKM 120 130 140 150 160 170 240 250 260 270 280 pF1KB3 AKIHAIHAHNGWIPKSNLWL--KMGKYFSLI----PTGFADEDINKRFLSDIPSSQILQE :..:... . : :: : .:.. : :::. . .. . . :.. CCDS14 AQFHGMEMP---FTKEPHWLFGTMERYLKQIQDLPPTGLPEMNLLEMYS--------LKD 180 190 200 210 220 290 300 310 320 330 340 pF1KB3 EMTWMKEILSNLGSPVVLCHNDLLCKNIIY---NEKQGDVQFIDYEYSGYNYLAYDIGNH :: ....: . ::::.::::. ::. :. .....:.:::.::: ..::::: CCDS14 EMGNLRKLLESTPSPVVFCHNDIQEGNILLLSEPENADSLMLVDFEYSSYNYRGFDIGNH 230 240 250 260 270 280 350 360 370 380 390 pF1KB3 FNEFAGVSDVDY----------SLYPDRELQSQWLRAYLEAYKEFKGFGTEVTEKEVEIL : :. : : . . :: .: : ...: :: :. . .. : .: : : CCDS14 FCEW--VYDYTHEEWPFYKARPTDYPTQEQQLHFIRHYLAEAKKGETLSQEEQRKLEEDL 290 300 310 320 330 400 410 420 430 440 450 pF1KB3 FIQVNQFALASHFFWGLWALIQAKYSTIEFDFLGYAIVRFNQYFKMKPEVTALKVPE ...:...:::::::::::...::..::::: .: :: ::. ::..: ..:.. CCDS14 LVEVSRYALASHFFWGLWSILQASMSTIEFGYLDYAQSRFQFYFQQKGQLTSVHSSS 340 350 360 370 380 390 452 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 03:25:56 2016 done: Tue Nov 8 03:25:57 2016 Total Scan time: 3.110 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]