FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3040, 177 aa 1>>>pF1KE3040 177 - 177 aa - 177 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.0905+/-0.000736; mu= 13.3182+/- 0.044 mean_var=58.0549+/-11.445, 0's: 0 Z-trim(107.8): 18 B-trim: 0 in 0/50 Lambda= 0.168327 statistics sampled from 9791 (9809) to 9791 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.697), E-opt: 0.2 (0.301), width: 16 Scan time: 1.880 The best scores are: opt bits E(32554) CCDS11578.1 NME1 gene_id:4830|Hs108|chr17 ( 177) 1204 300.3 4.2e-82 CCDS11579.1 NME1 gene_id:4830|Hs108|chr17 ( 152) 1038 259.9 5e-70 CCDS32682.1 NME2 gene_id:654364|Hs108|chr17 ( 267) 933 234.5 3.9e-62 CCDS11580.1 NME2 gene_id:4831|Hs108|chr17 ( 152) 928 233.2 5.5e-62 CCDS10443.1 NME3 gene_id:4832|Hs108|chr16 ( 169) 726 184.2 3.6e-47 CCDS10408.1 NME4 gene_id:4833|Hs108|chr16 ( 187) 621 158.7 1.8e-39 CCDS66886.1 NME4 gene_id:4833|Hs108|chr16 ( 117) 452 117.5 2.8e-27 CCDS74107.1 NME2 gene_id:4831|Hs108|chr17 ( 82) 439 114.3 1.8e-26 CCDS73797.1 NME4 gene_id:4833|Hs108|chr16 ( 153) 289 78.0 2.9e-15 CCDS44274.1 NME7 gene_id:29922|Hs108|chr1 ( 340) 245 67.5 9.4e-12 CCDS1277.1 NME7 gene_id:29922|Hs108|chr1 ( 376) 245 67.5 1e-11 >>CCDS11578.1 NME1 gene_id:4830|Hs108|chr17 (177 aa) initn: 1204 init1: 1204 opt: 1204 Z-score: 1587.5 bits: 300.3 E(32554): 4.2e-82 Smith-Waterman score: 1204; 100.0% identity (100.0% similar) in 177 aa overlap (1-177:1-177) 10 20 30 40 50 60 pF1KE3 MVLLSTLGIVFQGEGPPISSCDTGTMANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MVLLSTLGIVFQGEGPPISSCDTGTMANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFRL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 VGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETN 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 VGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETN 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 PADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNWIYE ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 PADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNWIYE 130 140 150 160 170 >>CCDS11579.1 NME1 gene_id:4830|Hs108|chr17 (152 aa) initn: 1038 init1: 1038 opt: 1038 Z-score: 1370.6 bits: 259.9 E(32554): 5e-70 Smith-Waterman score: 1038; 100.0% identity (100.0% similar) in 152 aa overlap (26-177:1-152) 10 20 30 40 50 60 pF1KE3 MVLLSTLGIVFQGEGPPISSCDTGTMANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFRL ::::::::::::::::::::::::::::::::::: CCDS11 MANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFRL 10 20 30 70 80 90 100 110 120 pF1KE3 VGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 VGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETN 40 50 60 70 80 90 130 140 150 160 170 pF1KE3 PADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNWIYE ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 PADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNWIYE 100 110 120 130 140 150 >>CCDS32682.1 NME2 gene_id:654364|Hs108|chr17 (267 aa) initn: 933 init1: 933 opt: 933 Z-score: 1229.1 bits: 234.5 E(32554): 3.9e-62 Smith-Waterman score: 933; 88.2% identity (97.4% similar) in 153 aa overlap (25-177:115-267) 10 20 30 40 50 pF1KE3 MVLLSTLGIVFQGEGPPISSCDTGTMANCERTFIAIKPDGVQRGLVGEIIKRFE :::: ::::::::::::::::::::::::: CCDS32 KTGRVMLGETNPADSKPGTIRGDFCIQVGRTMANLERTFIAIKPDGVQRGLVGEIIKRFE 90 100 110 120 130 140 60 70 80 90 100 110 pF1KE3 QKGFRLVGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRV :::::::..::..:::. 
::.::.:::::::: ::::::.:::::::::::::::::::: CCDS32 QKGFRLVAMKFLRASEEHLKQHYIDLKDRPFFPGLVKYMNSGPVVAMVWEGLNVVKTGRV 150 160 170 180 190 200 120 130 140 150 160 170 pF1KE3 MLGETNPADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNW ::::::::::::::::::::::::::::::::::.::::::.:::.::::::: :::..: CCDS32 MLGETNPADSKPGTIRGDFCIQVGRNIIHGSDSVKSAEKEISLWFKPEELVDYKSCAHDW 210 220 230 240 250 260 pF1KE3 IYE .:: CCDS32 VYE >-- initn: 771 init1: 771 opt: 771 Z-score: 1016.4 bits: 195.2 E(32554): 2.7e-50 Smith-Waterman score: 771; 100.0% identity (100.0% similar) in 114 aa overlap (26-139:1-114) 10 20 30 40 50 60 pF1KE3 MVLLSTLGIVFQGEGPPISSCDTGTMANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFRL ::::::::::::::::::::::::::::::::::: CCDS32 MANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFRL 10 20 30 70 80 90 100 110 120 pF1KE3 VGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 VGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETN 40 50 60 70 80 90 130 140 150 160 170 pF1KE3 PADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNWIYE ::::::::::::::::::: CCDS32 PADSKPGTIRGDFCIQVGRTMANLERTFIAIKPDGVQRGLVGEIIKRFEQKGFRLVAMKF 100 110 120 130 140 150 >>CCDS11580.1 NME2 gene_id:4831|Hs108|chr17 (152 aa) initn: 928 init1: 928 opt: 928 Z-score: 1226.3 bits: 233.2 E(32554): 5.5e-62 Smith-Waterman score: 928; 88.2% identity (97.4% similar) in 152 aa overlap (26-177:1-152) 10 20 30 40 50 60 pF1KE3 MVLLSTLGIVFQGEGPPISSCDTGTMANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFRL ::: ::::::::::::::::::::::::::::::: CCDS11 MANLERTFIAIKPDGVQRGLVGEIIKRFEQKGFRL 10 20 30 70 80 90 100 110 120 pF1KE3 VGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETN :..::..:::. 
::.::.:::::::: ::::::.:::::::::::::::::::::::::: CCDS11 VAMKFLRASEEHLKQHYIDLKDRPFFPGLVKYMNSGPVVAMVWEGLNVVKTGRVMLGETN 40 50 60 70 80 90 130 140 150 160 170 pF1KE3 PADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNWIYE ::::::::::::::::::::::::::::.::::::.:::.::::::: :::..:.:: CCDS11 PADSKPGTIRGDFCIQVGRNIIHGSDSVKSAEKEISLWFKPEELVDYKSCAHDWVYE 100 110 120 130 140 150 >>CCDS10443.1 NME3 gene_id:4832|Hs108|chr16 (169 aa) initn: 726 init1: 726 opt: 726 Z-score: 960.4 bits: 184.2 E(32554): 3.6e-47 Smith-Waterman score: 726; 67.6% identity (93.2% similar) in 148 aa overlap (30-177:22-169) 10 20 30 40 50 60 pF1KE3 MVLLSTLGIVFQGEGPPISSCDTGTMANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFRL ::::.:.::::::: :::::..:::.:::.: CCDS10 MICLVLTIFANLFPAACTGAHERTFLAVKPDGVQRRLVGEIVRRFERKGFKL 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 VGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETN :.::..::::.::.:::..:..:::.. ::::: :::::::::.::.::.:.:...: :: CCDS10 VALKLVQASEELLREHYAELRERPFYGRLVKYMASGPVVAMVWQGLDVVRTSRALIGATN 60 70 80 90 100 110 130 140 150 160 170 pF1KE3 PADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNWIYE :::. ::::::::::.::.:.::::::::::..::.:::. .::. . . : .:.:: CCDS10 PADAPPGTIRGDFCIEVGKNLIHGSDSVESARREIALWFRADELLCWEDSAGHWLYE 120 130 140 150 160 >>CCDS10408.1 NME4 gene_id:4833|Hs108|chr16 (187 aa) initn: 636 init1: 621 opt: 621 Z-score: 822.0 bits: 158.7 E(32554): 1.8e-39 Smith-Waterman score: 621; 58.3% identity (88.9% similar) in 144 aa overlap (30-173:38-181) 10 20 30 40 50 pF1KE3 MVLLSTLGIVFQGEGPPISSCDTGTMANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFR :::..:.::::::: :::..:.:::..:: CCDS10 SALRGLRCGPRAPGPSLLVRHGSGGPSWTRERTLVAVKPDGVQRRLVGDVIQRFERRGFT 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 LVGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGET :::.:..:: :..: ::: ::. .::. .:..:: ::::::::::: :::...:.:.:.: CCDS10 LVGMKMLQAPESVLAEHYQDLRRKPFYPALIRYMSSGPVVAMVWEGYNVVRASRAMIGHT 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE3 NPADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNWIYE . :.. 
:::::::: ....::.::.:::::.:..:: :::. :::.... .:. CCDS10 DSAEAAPGTIRGDFSVHISRNVIHASDSVEGAQREIQLWFQSSELVSWADGGQHSSIHPA 130 140 150 160 170 180 >>CCDS66886.1 NME4 gene_id:4833|Hs108|chr16 (117 aa) initn: 440 init1: 440 opt: 452 Z-score: 603.3 bits: 117.5 E(32554): 2.8e-27 Smith-Waterman score: 452; 55.0% identity (87.4% similar) in 111 aa overlap (63-173:1-111) 40 50 60 70 80 90 pF1KE3 FIAIKPDGVQRGLVGEIIKRFEQKGFRLVGLKFMQASEDLLKEHYVDLKDRPFFAGLVKY .:..:: :..: ::: ::. .::. .:..: CCDS66 MKMLQAPESVLAEHYQDLRRKPFYPALIRY 10 20 30 100 110 120 130 140 150 pF1KE3 MHSGPVVAMVWEGLNVVKTGRVMLGETNPADSKPGTIRGDFCIQVGRNIIHGSDSVESAE : ::::::::::: :::...:.:.:.:. :.. :::::::: ....::.::.:::::.:. CCDS66 MSSGPVVAMVWEGYNVVRASRAMIGHTDSAEAAPGTIRGDFSVHISRNVIHASDSVEGAQ 40 50 60 70 80 90 160 170 pF1KE3 KEIGLWFHPEELVDYTSCAQNWIYE .:: :::. :::.... .:. CCDS66 REIQLWFQSSELVSWADGGQHSSIHPA 100 110 >>CCDS74107.1 NME2 gene_id:4831|Hs108|chr17 (82 aa) initn: 439 init1: 439 opt: 439 Z-score: 588.6 bits: 114.3 E(32554): 1.8e-26 Smith-Waterman score: 439; 85.5% identity (96.1% similar) in 76 aa overlap (26-101:1-76) 10 20 30 40 50 60 pF1KE3 MVLLSTLGIVFQGEGPPISSCDTGTMANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFRL ::: ::::::::::::::::::::::::::::::: CCDS74 MANLERTFIAIKPDGVQRGLVGEIIKRFEQKGFRL 10 20 30 70 80 90 100 110 120 pF1KE3 VGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETN :..::..:::. 
::.::.:::::::: ::::::.::::::: CCDS74 VAMKFLRASEEHLKQHYIDLKDRPFFPGLVKYMNSGPVVAMEHHSWQ 40 50 60 70 80 130 140 150 160 170 pF1KE3 PADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNWIYE >>CCDS73797.1 NME4 gene_id:4833|Hs108|chr16 (153 aa) initn: 464 init1: 288 opt: 289 Z-score: 387.6 bits: 78.0 E(32554): 2.9e-15 Smith-Waterman score: 398; 44.4% identity (69.4% similar) in 144 aa overlap (30-173:38-147) 10 20 30 40 50 pF1KE3 MVLLSTLGIVFQGEGPPISSCDTGTMANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFR :::..:.::::::: :::..:.:::..:: CCDS73 SALRGLRCGPRAPGPSLLVRHGSGGPSWTRERTLVAVKPDGVQRRLVGDVIQRFERRGFT 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 LVGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGET :::.:..: :::: :::...:.:.:.: CCDS73 LVGMKMLQ----------------------------------VWEGYNVVRASRAMIGHT 70 80 90 120 130 140 150 160 170 pF1KE3 NPADSKPGTIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNWIYE . :.. :::::::: ....::.::.:::::.:..:: :::. :::.... .:. CCDS73 DSAEAAPGTIRGDFSVHISRNVIHASDSVEGAQREIQLWFQSSELVSWADGGQHSSIHPA 100 110 120 130 140 150 >>CCDS44274.1 NME7 gene_id:29922|Hs108|chr1 (340 aa) initn: 231 init1: 148 opt: 245 Z-score: 324.5 bits: 67.5 E(32554): 9.4e-12 Smith-Waterman score: 251; 31.3% identity (62.0% similar) in 150 aa overlap (30-170:56-203) 10 20 30 40 50 pF1KE3 MVLLSTLGIVFQGEGPPISSCDTGTMANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFR :.:. ::::..... ::::. ... :: CCDS44 IGNKVNVFSRQLVLIDYGDQYTARQLGSRKEKTLALIKPDAISKA--GEIIEIINKAGFT 30 40 50 60 70 80 60 70 80 90 100 110 pF1KE3 LVGLKFMQASEDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGET .. ::.:. :. . .:: ..:::: :.... .::..:: ... . .:: . CCDS44 ITKLKMMMLSRKEALDFHVDHQSRPFFNELIQFITTGPIIAMEILRDDAICEWKRLLGPA 90 100 110 120 130 140 120 130 140 150 160 170 pF1KE3 NPADSKPG---TIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFH------PEELVDYTSC : . .. .::. : . :: :: :: :: .:. :.: : . . 
.:.:
CCDS44 NSGVARTDASESIRALFGTDGIRNAAHGPDSFASAAREMELFFPSSGGCGPANTAKFTNC
150 160 170 180 190 200

pF1KE3 AQNWIYE
CCDS44 TCCIVKPHAVSEGLLGKILMAIRDAGFEISAMQMFNMDRVNVEEFYEVYKGVVTEYHDMV
210 220 230 240 250 260

177 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 19:23:35 2016 done: Thu Nov 3 19:23:35 2016
Total Scan time: 1.880 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]