FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB3389, 452 aa
1>>>pF1KB3389 452 - 452 aa - 452 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.3315+/-0.000834; mu= 9.8640+/- 0.051
mean_var=120.3064+/-24.281, 0's: 0 Z-trim(111.4): 14 B-trim: 0 in 0/52
Lambda= 0.116931
statistics sampled from 12376 (12384) to 12376 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.723), E-opt: 0.2 (0.38), width: 16
Scan time: 3.110
The best scores are:                                      opt  bits E(32554)
CCDS8698.1 ETNK1 gene_id:55500|Hs108|chr12        ( 452) 3059 526.7 1.9e-149
CCDS41760.1 ETNK1 gene_id:55500|Hs108|chr12       ( 258) 1544 271.0 1e-72
CCDS1442.2 ETNK2 gene_id:55224|Hs108|chr1         ( 386) 1149 204.4 1.6e-52
CCDS73006.1 ETNK2 gene_id:55224|Hs108|chr1        ( 394) 1144 203.6 3e-52
CCDS14099.1 CHKB gene_id:1120|Hs108|chr22         ( 395)  419  81.3 2e-15
>>CCDS8698.1 ETNK1 gene_id:55500|Hs108|chr12 (452 aa)
initn: 3059 init1: 3059 opt: 3059 Z-score: 2796.6 bits: 526.7 E(32554): 1.9e-149
Smith-Waterman score: 3059; 100.0% identity (100.0% similar) in 452 aa overlap (1-452:1-452)
               10        20        30        40        50        60
pF1KB3 MLCGRPRSSSDNRNFLRERAGLSSAAVQTRIGNSAASRRSPAARPPVPAPPALPRGRPGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 MLCGRPRSSSDNRNFLRERAGLSSAAVQTRIGNSAASRRSPAARPPVPAPPALPRGRPGT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB3 EGSTSLSAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHRCRE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 EGSTSLSAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHRCRE
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB3 GALSLLQHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 GALSLLQHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRD
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB3 EEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQLAKIHAI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 EEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQLAKIHAI
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB3 HAHNGWIPKSNLWLKMGKYFSLIPTGFADEDINKRFLSDIPSSQILQEEMTWMKEILSNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 HAHNGWIPKSNLWLKMGKYFSLIPTGFADEDINKRFLSDIPSSQILQEEMTWMKEILSNL
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB3 GSPVVLCHNDLLCKNIIYNEKQGDVQFIDYEYSGYNYLAYDIGNHFNEFAGVSDVDYSLY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 GSPVVLCHNDLLCKNIIYNEKQGDVQFIDYEYSGYNYLAYDIGNHFNEFAGVSDVDYSLY
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KB3 PDRELQSQWLRAYLEAYKEFKGFGTEVTEKEVEILFIQVNQFALASHFFWGLWALIQAKY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 PDRELQSQWLRAYLEAYKEFKGFGTEVTEKEVEILFIQVNQFALASHFFWGLWALIQAKY
              370       380       390       400       410       420
              430       440       450
pF1KB3 STIEFDFLGYAIVRFNQYFKMKPEVTALKVPE
       ::::::::::::::::::::::::::::::::
CCDS86 STIEFDFLGYAIVRFNQYFKMKPEVTALKVPE
              430       440       450
>>CCDS41760.1 ETNK1 gene_id:55500|Hs108|chr12 (258 aa)
initn: 1616 init1: 1540 opt: 1544 Z-score: 1419.1 bits: 271.0 E(32554): 1e-72
Smith-Waterman score: 1544; 97.5% identity (97.9% similar) in 236 aa overlap (1-236:1-236)
               10        20        30        40        50        60
pF1KB3 MLCGRPRSSSDNRNFLRERAGLSSAAVQTRIGNSAASRRSPAARPPVPAPPALPRGRPGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MLCGRPRSSSDNRNFLRERAGLSSAAVQTRIGNSAASRRSPAARPPVPAPPALPRGRPGT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB3 EGSTSLSAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHRCRE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EGSTSLSAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHRCRE
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB3 GALSLLQHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 GALSLLQHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRD
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB3 EEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQLAKIHAI
       ::::::::::::::::::::::::::::::::::::::::::::::: : .  : :
CCDS41 EEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFSLSSLTLCKGKTT
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB3 HAHNGWIPKSNLWLKMGKYFSLIPTGFADEDINKRFLSDIPSSQILQEEMTWMKEILSNL
CCDS41 RCFGLTGCRGSRLLLSFF
              250
>>CCDS1442.2 ETNK2 gene_id:55224|Hs108|chr1 (386 aa)
initn: 1467 init1: 596 opt: 1149 Z-score: 1056.3 bits: 204.4 E(32554): 1.6e-52
Smith-Waterman score: 1461; 58.4% identity (81.5% similar) in 356 aa overlap (97-452:42-386)
70 80 90 100 110 120
pF1KB3 SAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHRCREGALSLL
::: :.. . . . ::: :.
CCDS14 ASFHLRRHTPCPQCSWGMEEKAAASASCREPPGPPRAAAVAYFGISVDPDDILPGALRLI
20 30 40 50 60 70
130 140 150 160 170 180
pF1KB3 QHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRDEEVKSF
:.::::: :..: . :::::::::..::: . :.: ::::.::..:::::::..::..:
CCDS14 QELRPHWKPEQVRTKRFTDGITNKLVACYVEEDMQDCVLVRVYGERTELLVDRENEVRNF
80 90 100 110 120 130
190 200 210 220 230 240
pF1KB3 RVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQLAKIHAIHAHNGW
..:.::.:::.:::::.::::::..:: ::.:.:. .: .::::: ..::::.::: ::
CCDS14 QLLRAHSCAPKLYCTFQNGLCYEYMQGVALEPEHIREPRLFRLIALEMAKIHTIHA-NGS
140 150 160 170 180 190
250 260 270 280 290 300
pF1KB3 IPKSNLWLKMGKYFSLIPTGFADEDINKRFLSDIPSSQILQEEMTWMKEILSNLGSPVVL
.:: :: :: .::.:. ..:: . .:.:. ..:..:..:.:: ::.: ::::.
CCDS14 LPKPILWHKMHNYFTLV-----KNEINPSLSADVPKVEVLERELAWLKEHLSQLESPVVF
200 210 220 230 240
310 320 330 340 350 360
pF1KB3 CHNDLLCKNIIYNEKQGDVQFIDYEYSGYNYLAYDIGNHFNEFAGVSDVDYSLYPDRELQ
::::::::::::. .: :.::::::.:::: :.::::::::::::..::: ::: :: :
CCDS14 CHNDLLCKNIIYDSIKGHVRFIDYEYAGYNYQAFDIGNHFNEFAGVNEVDYCLYPARETQ
250 260 270 280 290 300
370 380 390 400 410 420
pF1KB3 SQWLRAYLEAYKEFKGFGTEVTEKEVEILFIQVNQFALASHFFWGLWALIQAKYSTIEFD
:::. ::.: : : :: .::. :..:::.:::::::::.:::::: .::::.::
CCDS14 LQWLHYYLQAQK-----GMAVTPREVQRLYVQVNKFALASHFFWALWALIQNQYSTIDFD
310 320 330 340 350 360
430 440 450
pF1KB3 FLGYAIVRFNQYFKMKPEVTALKVPE
:: ::..:::::::.::...::..:.
CCDS14 FLRYAVIRFNQYFKVKPQASALEMPK
370 380
>>CCDS73006.1 ETNK2 gene_id:55224|Hs108|chr1 (394 aa)
initn: 1212 init1: 596 opt: 1144 Z-score: 1051.6 bits: 203.6 E(32554): 3e-52
Smith-Waterman score: 1200; 56.9% identity (80.1% similar) in 311 aa overlap (97-404:42-338)
70 80 90 100 110 120
pF1KB3 SAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPE---VPKLNVTVQDQEEHRCREGAL
::: :. : ....: : .. :::
CCDS73 ASFHLRRHTPCPQCSWGMEEKAAASASCREPPGPPRAAAVAYFGISV-DPDD--ILPGAL
20 30 40 50 60
130 140 150 160 170 180
pF1KB3 SLLQHLRPHWDPQEVTLQLFTDGITNKLIGCYVGNTMEDVVLVRIYGNKTELLVDRDEEV
:.:.::::: :..: . :::::::::..::: . :.: ::::.::..:::::::..::
CCDS73 RLIQELRPHWKPEQVRTKRFTDGITNKLVACYVEEDMQDCVLVRVYGERTELLVDRENEV
70 80 90 100 110 120
190 200 210 220 230 240
pF1KB3 KSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQLAKIHAIHAH
..:..:.::.:::.:::::.::::::..:: ::.:.:. .: .::::: ..::::.:::
CCDS73 RNFQLLRAHSCAPKLYCTFQNGLCYEYMQGVALEPEHIREPRLFRLIALEMAKIHTIHA-
130 140 150 160 170 180
250 260 270 280 290 300
pF1KB3 NGWIPKSNLWLKMGKYFSLIPTGFADEDINKRFLSDIPSSQILQEEMTWMKEILSNLGSP
:: .:: :: :: .::.:. ..:: . .:.:. ..:..:..:.:: ::.: ::
CCDS73 NGSLPKPILWHKMHNYFTLVK-----NEINPSLSADVPKVEVLERELAWLKEHLSQLESP
190 200 210 220 230 240
310 320 330 340 350 360
pF1KB3 VVLCHNDLLCKNIIYNEKQGDVQFIDYEYSGYNYLAYDIGNHFNEFAGVSDVDYSLYPDR
::.::::::::::::. .: :.::::::.:::: :.::::::::::::..::: ::: :
CCDS73 VVFCHNDLLCKNIIYDSIKGHVRFIDYEYAGYNYQAFDIGNHFNEFAGVNEVDYCLYPAR
250 260 270 280 290 300
370 380 390 400 410 420
pF1KB3 ELQSQWLRAYLEAYKEFKGFGTEVTEKEVEILFIQVNQFALASHFFWGLWALIQAKYSTI
: : :::. ::.: : : :: .::. :..:::.:::
CCDS73 ETQLQWLHYYLQAQK-----GMAVTPREVQRLYVQVNKFALGPSCVSSTMTASLQCCRVG
310 320 330 340 350
430 440 450
pF1KB3 EFDFLGYAIVRFNQYFKMKPEVTALKVPE
CCDS73 NRHGEIARLTLSGLFPGVSLLLGSLGPHPEPVLHHRL
360 370 380 390
>>CCDS14099.1 CHKB gene_id:1120|Hs108|chr22 (395 aa)
initn: 527 init1: 266 opt: 419 Z-score: 390.6 bits: 81.3 E(32554): 2e-15
Smith-Waterman score: 555; 30.5% identity (59.0% similar) in 383 aa overlap (97-448:29-391)
70 80 90 100 110 120
pF1KB3 SAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHR---CREGAL
: .:. . . .: :.. :::
CCDS14 MAAEATAVAGSGAVGGCLAKDGLQQSKCPDTTPKRRRASSLSRDAERRAYQWCRE---
10 20 30 40 50
130 140 150 160 170
pF1KB3 SLLQHLRPHW---DPQEVTLQLFTDGITNKLIGCYVGNTMEDV------VLVRIYGNKTE
.: : .:.:. . . :..: :. : . . . .: ::.:.:: .
CCDS14 ----YLGGAWRRVQPEELRVYPVSGGLSNLLFRCSLPDHLPSVGEEPREVLLRLYGAILQ
60 70 80 90 100 110
180 190 200 210 220 230
pF1KB3 LLVDRDEEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQL
. . : : .: .. .:::: .: .: ..: .. : ... .:.. :: ..
CCDS14 GVDSLVLESVMFAILAERSLGPQLYGVFPEGRLEQYIPSRPLKTQELREPVLSAAIATKM
120 130 140 150 160 170
240 250 260 270 280
pF1KB3 AKIHAIHAHNGWIPKSNLWL--KMGKYFSLI----PTGFADEDINKRFLSDIPSSQILQE
:..:... . : :: : .:.. : :::. . .. . . :..
CCDS14 AQFHGMEMP---FTKEPHWLFGTMERYLKQIQDLPPTGLPEMNLLEMYS--------LKD
180 190 200 210 220
290 300 310 320 330 340
pF1KB3 EMTWMKEILSNLGSPVVLCHNDLLCKNIIY---NEKQGDVQFIDYEYSGYNYLAYDIGNH
:: ....: . ::::.::::. ::. :. .....:.:::.::: ..:::::
CCDS14 EMGNLRKLLESTPSPVVFCHNDIQEGNILLLSEPENADSLMLVDFEYSSYNYRGFDIGNH
230 240 250 260 270 280
350 360 370 380 390
pF1KB3 FNEFAGVSDVDY----------SLYPDRELQSQWLRAYLEAYKEFKGFGTEVTEKEVEIL
: :. : : . . :: .: : ...: :: :. . .. : .: : :
CCDS14 FCEW--VYDYTHEEWPFYKARPTDYPTQEQQLHFIRHYLAEAKKGETLSQEEQRKLEEDL
290 300 310 320 330
400 410 420 430 440 450
pF1KB3 FIQVNQFALASHFFWGLWALIQAKYSTIEFDFLGYAIVRFNQYFKMKPEVTALKVPE
...:...:::::::::::...::..::::: .: :: ::. ::..: ..:..
CCDS14 LVEVSRYALASHFFWGLWSILQASMSTIEFGYLDYAQSRFQFYFQQKGQLTSVHSSS
340 350 360 370 380 390
452 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 03:25:56 2016 done: Tue Nov 8 03:25:57 2016
Total Scan time: 3.110 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]