FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6285, 279 aa
  1>>>pF1KE6285 279 - 279 aa - 279 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6705+/-0.00087; mu= 13.2323+/- 0.052
 mean_var=66.5851+/-13.277, 0's: 0 Z-trim(106.0): 15 B-trim: 0 in 0/51
 Lambda= 0.157176
 statistics sampled from 8721 (8730) to 8721 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.657), E-opt: 0.2 (0.268), width: 16
 Scan time: 1.880

The best scores are:                                 opt  bits E(32554)
CCDS3121.1 ATP1B3 gene_id:483|Hs108|chr3     ( 279) 1896 438.7 2.2e-123
CCDS48158.1 ATP1B4 gene_id:23439|Hs108|chrX  ( 357)  673 161.4 8.5e-40
CCDS14598.1 ATP1B4 gene_id:23439|Hs108|chrX  ( 353)  641 154.1 1.3e-37
CCDS32550.1 ATP1B2 gene_id:482|Hs108|chr17   ( 290)  597 144.1 1.1e-34
CCDS9539.1 ATP4B gene_id:496|Hs108|chr13     ( 291)  457 112.4 3.9e-25
CCDS1276.1 ATP1B1 gene_id:481|Hs108|chr1     ( 303)  317  80.6 1.5e-15

>>CCDS3121.1 ATP1B3 gene_id:483|Hs108|chr3 (279 aa)
 initn: 1896 init1: 1896 opt: 1896 Z-score: 2328.5 bits: 438.7 E(32554): 2.2e-123
Smith-Waterman score: 1896; 100.0% identity (100.0% similar) in 279 aa overlap (1-279:1-279)

               10        20        30        40        50        60
pF1KE6 MTKNEKKSLNQSLAEWKLFIYNPTTGEFLGRTAKSWGLILLFYLVFYGFLAALFSFTMWV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MTKNEKKSLNQSLAEWKLFIYNPTTGEFLGRTAKSWGLILLFYLVFYGFLAALFSFTMWV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 MLQTLNDEVPKYRDQIPSPGLMVFPKPVTALEYTFSRSDPTSYAGYIEDLKKFLKPYTLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MLQTLNDEVPKYRDQIPSPGLMVFPKPVTALEYTFSRSDPTSYAGYIEDLKKFLKPYTLE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 EQKNLTVCPDGALFEQKGPVYVACQFPISLLQACSGMNDPDFGYSQGNPCILVKMNRIIG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 
EQKNLTVCPDGALFEQKGPVYVACQFPISLLQACSGMNDPDFGYSQGNPCILVKMNRIIG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 LKPEGVPRIDCVSKNEDIPNVAVYPHNGMIDLKYFPYYGKKLHVGYLQPLVAVQVSFAPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 LKPEGVPRIDCVSKNEDIPNVAVYPHNGMIDLKYFPYYGKKLHVGYLQPLVAVQVSFAPN 190 200 210 220 230 240 250 260 270 pF1KE6 NTGKEVTVECKIDGSANLKSQDDRDKFLGRVMFKITARA ::::::::::::::::::::::::::::::::::::::: CCDS31 NTGKEVTVECKIDGSANLKSQDDRDKFLGRVMFKITARA 250 260 270 >>CCDS48158.1 ATP1B4 gene_id:23439|Hs108|chrX (357 aa) initn: 518 init1: 220 opt: 673 Z-score: 828.0 bits: 161.4 E(32554): 8.5e-40 Smith-Waterman score: 673; 38.3% identity (72.6% similar) in 274 aa overlap (11-275:84-353) 10 20 30 40 pF1KE6 MTKNEKKSLNQSLAEWKLFIYNPTTGEFLGRTAKSWGLIL :.: . ....: ::.::..::.::: CCDS48 EEEEEEKEEEEEEEKEEEEGQGQPTGNAWWQKLQIMSEYLWDPERRMFLARTGQSWSLIL 60 70 80 90 100 110 50 60 70 80 90 100 pF1KE6 LFYLVFYGFLAALFSFTMWVMLQTLNDEVPKYRDQIPSPGLMVFPKPVTALEYTFSRSDP :.:. ::. :::.... :.... :.. .: . ... ::.:. : . .:...:. :.: CCDS48 LIYFFFYASLAAVITLCMYTLFLTISPYIPTFTERVKPPGVMIRPF-AHSLNFNFNVSEP 120 130 140 150 160 170 110 120 130 140 150 pF1KE6 TSYAGYIEDLKKFLKPYTLEEQKNLTV-CPDGALFEQKGPV---YVACQFPISLLQACSG .. :. .:. ::. :. :....: :: : : : : :::: :.:. ::: CCDS48 DTWQHYVISLNGFLQGYNDSLQEEMNVDCPPGQYFIQDGNEDEDKKACQFKRSFLKNCSG 180 190 200 210 220 230 160 170 180 190 200 210 pF1KE6 MNDPDFGYSQGNPCILVKMNRIIGLKPE-GVP-RIDC-VSKNE--DIPNVAVYPHNGMID ..:: :::: :.::::.:::::.:..:: : : ...: :.... :: ... ::... .: CCDS48 LEDPTFGYSTGQPCILLKMNRIVGFRPELGDPVKVSCKVQRGDENDIRSISYYPESASFD 240 250 260 270 280 290 220 230 240 250 260 270 pF1KE6 LKYFPYYGKKLHVGYLQPLVAVQVSFAPNNTGKEVTVECKIDGSANLKSQDDRDKFLGRV :.:.::::: ::.: .::::.. . . .: . : :.:.. :.. ... . :.:.::: CCDS48 LRYYPYYGKLTHVNYTSPLVAMHFTDVVKN--QAVPVQCQLKGKGVINDVIN-DRFVGRV 300 310 320 330 340 pF1KE6 MFKITARA .: . 
CCDS48 IFTLNIET 350 >>CCDS14598.1 ATP1B4 gene_id:23439|Hs108|chrX (353 aa) initn: 479 init1: 220 opt: 641 Z-score: 788.8 bits: 154.1 E(32554): 1.3e-37 Smith-Waterman score: 641; 38.0% identity (71.2% similar) in 274 aa overlap (11-275:84-349) 10 20 30 40 pF1KE6 MTKNEKKSLNQSLAEWKLFIYNPTTGEFLGRTAKSWGLIL :.: . ....: ::.:: :::: CCDS14 EEEEEEKEEEEEEEKEEEEGQGQPTGNAWWQKLQIMSEYLWDPERRMFLART----GLIL 60 70 80 90 100 50 60 70 80 90 100 pF1KE6 LFYLVFYGFLAALFSFTMWVMLQTLNDEVPKYRDQIPSPGLMVFPKPVTALEYTFSRSDP :.:. ::. :::.... :.... :.. .: . ... ::.:. : . .:...:. :.: CCDS14 LIYFFFYASLAAVITLCMYTLFLTISPYIPTFTERVKPPGVMIRPF-AHSLNFNFNVSEP 110 120 130 140 150 160 110 120 130 140 150 pF1KE6 TSYAGYIEDLKKFLKPYTLEEQKNLTV-CPDGALFEQKGPV---YVACQFPISLLQACSG .. :. .:. ::. :. :....: :: : : : : :::: :.:. ::: CCDS14 DTWQHYVISLNGFLQGYNDSLQEEMNVDCPPGQYFIQDGNEDEDKKACQFKRSFLKNCSG 170 180 190 200 210 220 160 170 180 190 200 210 pF1KE6 MNDPDFGYSQGNPCILVKMNRIIGLKPE-GVP-RIDC-VSKNE--DIPNVAVYPHNGMID ..:: :::: :.::::.:::::.:..:: : : ...: :.... :: ... ::... .: CCDS14 LEDPTFGYSTGQPCILLKMNRIVGFRPELGDPVKVSCKVQRGDENDIRSISYYPESASFD 230 240 250 260 270 280 220 230 240 250 260 270 pF1KE6 LKYFPYYGKKLHVGYLQPLVAVQVSFAPNNTGKEVTVECKIDGSANLKSQDDRDKFLGRV :.:.::::: ::.: .::::.. . . .: . : :.:.. :.. ... . :.:.::: CCDS14 LRYYPYYGKLTHVNYTSPLVAMHFTDVVKN--QAVPVQCQLKGKGVINDVIN-DRFVGRV 290 300 310 320 330 340 pF1KE6 MFKITARA .: . CCDS14 IFTLNIET 350 >>CCDS32550.1 ATP1B2 gene_id:482|Hs108|chr17 (290 aa) initn: 817 init1: 385 opt: 597 Z-score: 736.3 bits: 144.1 E(32554): 1.1e-34 Smith-Waterman score: 863; 47.6% identity (73.6% similar) in 288 aa overlap (3-275:4-285) 10 20 30 40 50 pF1KE6 MTKNEKKSLNQSLAEWKLFIYNPTTGEFLGRTAKSWGLILLFYLVFYGFLAALFSFTMW ..:::: .: . ::: :..:: : .:.:::. ::..::::::::::::.:.:..::: CCDS32 MVIQKEKKSCGQVVEEWKEFVWNPRTHQFMGRTGTSWAFILLFYLVFYGFLTAMFTLTMW 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 VMLQTLNDEVPKYRDQIPSPGLMVFPKPVTALEYTFSRSDPTSYAGYIEDLKKFLKPY-- :::::..:..:::.:.. .::::. :: . :. . :: :. ... 
:.:::.:: CCDS32 VMLQTVSDHTPKYQDRLATPGLMIRPK-TENLDVIVNVSDTESWDQHVQKLNKFLEPYND 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 TLEEQKNLTVCPDGALFEQ--KGPVYV---ACQFPISLLQACSGMNDPD-FGYSQGNPCI ... ::: :: : .:: .: . :::: . : :::..: .::: :.::. CCDS32 SIQAQKN-DVCRPGRYYEQPDNGVLNYPKRACQFNRTQLGNCSGIGDSTHYGYSTGQPCV 120 130 140 150 160 170 180 190 200 210 220 pF1KE6 LVKMNRIIGLKPEGVPRID--CVSKN----EDIPNVAVYPHNGMIDLKYFPYYGKKLHVG ..::::.:.. . .. :..: :.. : ...: :: ::: ::::::::.::. CCDS32 FIKMNRVINFYAGANQSMNVTCAGKRDEDAENLGNFVMFPANGNIDLMYFPYYGKKFHVN 180 190 200 210 220 230 230 240 250 260 270 pF1KE6 YLQPLVAVQ-VSFAPNNTGKEVTVECKIDGSANLKSQDDRDKFLGRVMFKITARA : ::::::. .. .:: ::.:::.:. .::. ..:.:::: ::: ::. CCDS32 YTQPLVAVKFLNVTPNV---EVNVECRIN-AANIATDDERDKFAGRVAFKLRINKT 240 250 260 270 280 290 >>CCDS9539.1 ATP4B gene_id:496|Hs108|chr13 (291 aa) initn: 580 init1: 237 opt: 457 Z-score: 564.7 bits: 112.4 E(32554): 3.9e-25 Smith-Waterman score: 632; 35.4% identity (67.0% similar) in 285 aa overlap (5-275:6-287) 10 20 30 40 50 pF1KE6 MTKNEKKSLNQSLAEWKLFIYNPTTGEFLGRTAKSWGLILLFYLVFYGFLAALFSFTMW :::. .: . :.. . .:: ::..:::: . : : :.:..:: ...::.. .. CCDS95 MAALQEKKTCGQRMEEFQRYCWNPDTGQMLGRTLSRWVWISLYYVAFYVVMTGLFALCLY 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 VMLQTLNDEVPKYRDQIPSPGLMVFPKPV--TALEYTFSRSDPTSYAGYIEDLKKFLKPY :..::.. .: :.::. :::. . : .:: ... :: ..: . :. :: : CCDS95 VLMQTVDPYTPDYQDQLRSPGVTLRPDVYGEKGLEIVYNVSDNRTWADLTQTLHAFLAGY 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE6 TLEEQKNLTVCPDGALFEQ---KGPVYV--ACQFPISLLQACSGMNDPDFGYSQGNPCIL . :.. : . : : ..: .. .:.: ..:: :::. ::.::. .:.::.. CCDS95 SPAAQEDSINCTSEQYFFQESFRAPNHTKFSCKFTADMLQNCSGLADPNFGFEEGKPCFI 130 140 150 160 170 180 180 190 200 210 220 pF1KE6 VKMNRIIGLKPEG--VPRIDC--VSKNEDI--P-NVAVYPHNGMIDLKYFPYYGKKLHVG .:::::. . : . .::.:: ... ... : .: :: :: ..:.:::::::: . 
CCDS95 IKMNRIVKFLPSNGSAPRVDCAFLDQPRELGQPLQVKYYPPNGTFSLHYFPYYGKKAQPH 190 200 210 220 230 240 230 240 250 260 270 pF1KE6 YLQPLVAVQVSFAPNNTGKEVTVECKIDGSANLKSQDDRDKFLGRVMFKITARA : .::::... : :. ::.. ::. . .. .. .: . :.: ::. CCDS95 YSNPLVAAKLLNIPRNA--EVAIVCKV-MAEHVTFNNPHDPYEGKVEFKLKIEK 250 260 270 280 290 >>CCDS1276.1 ATP1B1 gene_id:481|Hs108|chr1 (303 aa) initn: 580 init1: 241 opt: 317 Z-score: 392.8 bits: 80.6 E(32554): 1.5e-15 Smith-Waterman score: 603; 36.6% identity (60.4% similar) in 298 aa overlap (16-279:12-303) 10 20 30 40 50 60 pF1KE6 MTKNEKKSLNQSLAEWKLFIYNPTTGEFLGRTAKSWGLILLFYLVFYGFLAALFSFTMWV :: ::.: ::::::. :: :::::..::: ::..: :. : CCDS12 MARGKAKEEGSWKKFIWNSEKKEFLGRTGGSWFKILLFYVIFYGCLAGIFIGTIQV 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 MLQTLNDEVPKYRDQIPSPGLMVFPKPVTALEYTFSRSDPTSYAGYIEDLKKFLKPYTLE :: :... : :.:.. ::: .:. . : .: .:: :: .:. .. .::. : CCDS12 MLLTISEFKPTYQDRVAPPGLTQIPQ-IQKTEISFRPNDPKSYEAYVLNIVRFLEKYKDS 60 70 80 90 100 110 130 140 150 160 pF1KE6 EQKNLTV---CPD--------GALFEQKGPVYVACQFPISLLQACSGMNDPDFGYSQGNP :.. . : : : . ...: : :.: . : :::.:: .::..:.: CCDS12 AQRDDMIFEDCGDVPSEPKERGDFNHERGERKV-CRFKLEWLGNCSGLNDETYGYKEGKP 120 130 140 150 160 170 170 180 190 200 pF1KE6 CILVKMNRIIGLKP--------EGVPR---------IDCVSK-NEDIPNVAVYPHNGM-- ::..:.::..:.:: : : ..:..: .:: .:. . :. CCDS12 CIIIKLNRVLGFKPKPPKNESLETYPVMKYNPNVLPVQCTGKRDEDKDKVGNVEYFGLGN 180 190 200 210 220 230 210 220 230 240 250 260 pF1KE6 ---IDLKYFPYYGKKLHVGYLQPLVAVQVSFAPNNTGKEVTVECKIDGSANLKSQDDRDK . :.:.::::: :. :::::.::: :. . :. .::: : :. . ...:. CCDS12 SPGFPLQYYPYYGKLLQPKYLQPLLAVQ--FTNLTMDTEIRIECKAYGE-NI-GYSEKDR 240 250 260 270 280 290 270 pF1KE6 FLGRVMFKITARA : :: :: ... CCDS12 FQGRFDVKIEVKS 300 279 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:47:08 2016 done: Tue Nov 8 11:47:08 2016 Total Scan time: 1.880 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]