FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3095, 263 aa 1>>>pF1KE3095 263 - 263 aa - 263 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3637+/-0.000822; mu= 14.4162+/- 0.049 mean_var=65.2373+/-12.888, 0's: 0 Z-trim(106.8): 27 B-trim: 0 in 0/51 Lambda= 0.158791 statistics sampled from 9183 (9208) to 9183 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.67), E-opt: 0.2 (0.283), width: 16 Scan time: 2.330 The best scores are: opt bits E(32554) CCDS3099.1 NME9 gene_id:347736|Hs108|chr3 ( 263) 1757 411.1 4e-115 CCDS4197.1 NME5 gene_id:8382|Hs108|chr5 ( 212) 287 74.3 7.9e-14 CCDS77734.1 NME6 gene_id:10201|Hs108|chr3 ( 186) 278 72.2 2.9e-13 CCDS2763.1 NME6 gene_id:10201|Hs108|chr3 ( 194) 278 72.2 3.1e-13 CCDS44274.1 NME7 gene_id:29922|Hs108|chr1 ( 340) 278 72.3 4.9e-13 CCDS1277.1 NME7 gene_id:29922|Hs108|chr1 ( 376) 278 72.3 5.4e-13 CCDS5452.1 NME8 gene_id:51314|Hs108|chr7 ( 588) 279 72.7 6.7e-13 >>CCDS3099.1 NME9 gene_id:347736|Hs108|chr3 (263 aa) initn: 1757 init1: 1757 opt: 1757 Z-score: 2180.3 bits: 411.1 E(32554): 4e-115 Smith-Waterman score: 1757; 100.0% identity (100.0% similar) in 263 aa overlap (1-263:1-263) 10 20 30 40 50 60 pF1KE3 MLSSKGLTVVDVYQGWCGPCKPVVSLFQKMRIEVGLDLLHFALAEADRLDVLEKYRGKCE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 MLSSKGLTVVDVYQGWCGPCKPVVSLFQKMRIEVGLDLLHFALAEADRLDVLEKYRGKCE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 PTFLFYAIKDEALSDEDECVSHGKNNGEDEDMVSSERTCTLAIIKPDAVAHGKTDEIIMK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 PTFLFYAIKDEALSDEDECVSHGKNNGEDEDMVSSERTCTLAIIKPDAVAHGKTDEIIMK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 IQEAGFEILTNEERTMTEAEVRLFYQHKAGEEAFEKLVHHMCSGPSHLLILTRTEGFEDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 IQEAGFEILTNEERTMTEAEVRLFYQHKAGEEAFEKLVHHMCSGPSHLLILTRTEGFEDV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 VTTWRTVMGPRDPNVARREQPESLRAQYGTEMPFNAVHGSRDREDADRELALLFPSLKFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 VTTWRTVMGPRDPNVARREQPESLRAQYGTEMPFNAVHGSRDREDADRELALLFPSLKFS 190 200 210 220 230 240 250 260 pF1KE3 DKDTEAPQGESSTQPRLKITDLD ::::::::::::::::::::::: CCDS30 DKDTEAPQGESSTQPRLKITDLD 250 260 >>CCDS4197.1 NME5 gene_id:8382|Hs108|chr5 (212 aa) initn: 286 init1: 168 opt: 287 Z-score: 361.8 bits: 74.3 E(32554): 7.9e-14 Smith-Waterman score: 287; 39.0% identity (63.2% similar) in 136 aa overlap (100-235:15-144) 70 80 90 100 110 120 pF1KE3 DEALSDEDECVSHGKNNGEDEDMVSSERTCTLAIIKPDAVAHGKTDEIIMKIQEAGFEIL :::::::: : : .:: : ..:: :. CCDS41 MEISMPPPQIYVEKTLAIIKPDIV--DKEEEIQDIILRSGFTIV 10 20 30 40 130 140 150 160 170 180 pF1KE3 TNEERTMTEAEVRLFYQHKAGEEAFEKLVHHMCSGPSHLLILTRTEGFEDVVTTWRTVMG .. .. . :: .: :. : .:. .: ::: .::.: . ... : ..: CCDS41 QRRKLRLSPEQCSNFYVEKYGKMFFPNLTAYMSSGPLVAMILARHK----AISYWLELLG 50 60 70 80 90 190 200 210 220 230 240 pF1KE3 PRDPNVARREQPESLRAQYGTEMPFNAVHGSRDREDADRELALLFPSLKFSDKDTEAPQG : . ::.. .:.:::: :::. ::.::: : :.::. ..:: CCDS41 PNNSLVAKETHPDSLRAIYGTDDLRNALHGSNDFAAAEREIRFMFPEVIVEPIPIGQAAK 100 110 120 130 140 150 250 260 pF1KE3 ESSTQPRLKITDLD CCDS41 DYLNLHIMPTLLEGLTELCKQKPADPLIWLADWLLKNNPNKPKLCHHPIVEEPY 160 170 180 190 200 210 >>CCDS77734.1 NME6 gene_id:10201|Hs108|chr3 (186 aa) initn: 256 init1: 141 opt: 278 Z-score: 351.5 bits: 72.2 E(32554): 2.9e-13 Smith-Waterman score: 278; 35.9% identity (62.7% similar) in 153 aa overlap (100-248:14-160) 70 80 90 100 110 120 pF1KE3 DEALSDEDECVSHGKNNGEDEDMVSSERTCTLAIIKPDAVAHGKTDEII-MKIQEAGFEI :::.:::::::: : . ..: : : CCDS77 MASILRSPQALQLTLALIKPDAVAHPLILEAVHQQILSNKFLI 10 20 30 40 130 140 150 160 170 180 pF1KE3 LTNEERTMTEAEVRLFYQHKAGEEAFEKLVHHMCSGPSHLLILTRTEGFEDVVTTWRTVM . .: . . . ::... :. ...::. : ::: . ::.. .:.. :::.: CCDS77 VRMRELLWRKEDCQRFYREHEGRFFYQRLVEFMASGPIRAYILAH----KDAIQLWRTLM 50 60 70 80 90 190 200 210 220 230 240 pF1KE3 GPRDPNVARREQPESLRAQYGTEMPFNAVHGSRDREDADRELALLFPSLKFSDK---DTE :: ::. :.:.:...: :..::: . .:.::.: .::. ::.. . : CCDS77 GPTRVFRARHVAPDSIRGSFGLTDTRNTTHGSDSVVSASREIAAFFPD--FSEQRWYEEE 100 110 120 130 140 150 250 260 pF1KE3 APQGESSTQPRLKITDLD :: CCDS77 EPQLRCGPVCYSPEGGVHYVAGTGGLGPA 160 170 180 >>CCDS2763.1 NME6 gene_id:10201|Hs108|chr3 (194 aa) initn: 256 init1: 141 opt: 278 Z-score: 351.2 bits: 72.2 E(32554): 3.1e-13 Smith-Waterman score: 278; 35.9% identity (62.7% similar) in 153 aa overlap (100-248:22-168) 70 80 90 100 110 120 pF1KE3 DEALSDEDECVSHGKNNGEDEDMVSSERTCTLAIIKPDAVAHGKTDEII-MKIQEAGFEI :::.:::::::: : . ..: : : CCDS27 MTQNLGSEMASILRSPQALQLTLALIKPDAVAHPLILEAVHQQILSNKFLI 10 20 30 40 50 130 140 150 160 170 180 pF1KE3 LTNEERTMTEAEVRLFYQHKAGEEAFEKLVHHMCSGPSHLLILTRTEGFEDVVTTWRTVM . .: . . . ::... :. ...::. : ::: . ::.. .:.. :::.: CCDS27 VRMRELLWRKEDCQRFYREHEGRFFYQRLVEFMASGPIRAYILAH----KDAIQLWRTLM 60 70 80 90 100 190 200 210 220 230 240 pF1KE3 GPRDPNVARREQPESLRAQYGTEMPFNAVHGSRDREDADRELALLFPSLKFSDK---DTE :: ::. :.:.:...: :..::: . .:.::.: .::. ::.. . : CCDS27 GPTRVFRARHVAPDSIRGSFGLTDTRNTTHGSDSVVSASREIAAFFPD--FSEQRWYEEE 110 120 130 140 150 160 250 260 pF1KE3 APQGESSTQPRLKITDLD :: CCDS27 EPQLRCGPVCYSPEGGVHYVAGTGGLGPA 170 180 190 >>CCDS44274.1 NME7 gene_id:29922|Hs108|chr1 (340 aa) initn: 271 init1: 169 opt: 278 Z-score: 347.5 bits: 72.3 E(32554): 4.9e-13 Smith-Waterman score: 278; 31.2% identity (62.4% similar) in 173 aa overlap (64-236:24-188) 40 50 60 70 80 90 pF1KE3 VGLDLLHFALAEADRLDVLEKYRGKCEPTFLFYAIKDEALSDEDECVSHGKNNGEDEDMV :: . : ...: . ...: . . . CCDS44 MHDVKNHRTFLKRTKYDNLHLEDLFIGNKVNVFSRQLVLIDYGDQYTARQ--L 10 20 30 40 50 100 110 120 130 140 150 pF1KE3 SSERTCTLAIIKPDAVAHGKTDEIIMKIQEAGFEILTNEERTMTEAEVRLFYQHKAGEEA .:.. :::.:::::.. :. ::: :..::: : . ... :. :. . .. CCDS44 GSRKEKTLALIKPDAIS--KAGEIIEIINKAGFTITKLKMMMLSRKEALDFHVDHQSRPF 60 70 80 90 100 160 170 180 190 200 210 pF1KE3 FEKLVHHMCSGPSHLLILTRTEGFEDVVTTWRTVMGPRDPNVARREQPESLRAQYGTEMP :..:.. . .:: . . : .:.. :. ..:: . .::: . ::.:: .::. CCDS44 FNELIQFITTGPIIAMEILR----DDAICEWKRLLGPANSGVARTDASESIRALFGTDGI 110 120 130 140 150 160 220 230 240 250 260 pF1KE3 FNAVHGSRDREDADRELALLFPSLKFSDKDTEAPQGESSTQPRLKITDLD ::.:: . .: ::. :.::: CCDS44 RNAAHGPDSFASAAREMELFFPSSGGCGPANTAKFTNCTCCIVKPHAVSEGLLGKILMAI 170 180 190 200 210 220 >>CCDS1277.1 NME7 gene_id:29922|Hs108|chr1 (376 aa) initn: 271 init1: 169 opt: 278 Z-score: 346.8 bits: 72.3 E(32554): 5.4e-13 Smith-Waterman score: 278; 31.2% identity (62.4% similar) in 173 aa overlap (64-236:60-224) 40 50 60 70 80 90 pF1KE3 VGLDLLHFALAEADRLDVLEKYRGKCEPTFLFYAIKDEALSDEDECVSHGKNNGEDEDMV :: . : ...: . ...: . . . CCDS12 PGDGSVEMHDVKNHRTFLKRTKYDNLHLEDLFIGNKVNVFSRQLVLIDYGDQYTARQ--L 30 40 50 60 70 80 100 110 120 130 140 150 pF1KE3 SSERTCTLAIIKPDAVAHGKTDEIIMKIQEAGFEILTNEERTMTEAEVRLFYQHKAGEEA .:.. :::.:::::.. :. ::: :..::: : . ... :. :. . .. CCDS12 GSRKEKTLALIKPDAIS--KAGEIIEIINKAGFTITKLKMMMLSRKEALDFHVDHQSRPF 90 100 110 120 130 140 160 170 180 190 200 210 pF1KE3 FEKLVHHMCSGPSHLLILTRTEGFEDVVTTWRTVMGPRDPNVARREQPESLRAQYGTEMP :..:.. . .:: . . : .:.. :. ..:: . .::: . ::.:: .::. CCDS12 FNELIQFITTGPIIAMEILR----DDAICEWKRLLGPANSGVARTDASESIRALFGTDGI 150 160 170 180 190 200 220 230 240 250 260 pF1KE3 FNAVHGSRDREDADRELALLFPSLKFSDKDTEAPQGESSTQPRLKITDLD ::.:: . .: ::. :.::: CCDS12 RNAAHGPDSFASAAREMELFFPSSGGCGPANTAKFTNCTCCIVKPHAVSEGLLGKILMAI 210 220 230 240 250 260 >>CCDS5452.1 NME8 gene_id:51314|Hs108|chr7 (588 aa) initn: 648 init1: 167 opt: 279 Z-score: 345.1 bits: 72.7 E(32554): 6.7e-13 Smith-Waterman score: 404; 36.0% identity (61.2% similar) in 242 aa overlap (1-207:23-261) 10 20 30 pF1KE3 MLSSKGLTVVDVYQGWCGPCKPVVSLFQKMRIEVGLD- ::..:::::.::::.:::::. . ::.:.. :.. : CCDS54 MASKKREVQLQTVINNQSLWDEMLQNKGLTVIDVYQAWCGPCRAMQPLFRKLKNELNEDE 10 20 30 40 50 60 40 50 60 70 80 pF1KE3 LLHFALAEADRLDVLEKYRGKCEPTFLFYA---------------IKDEALS--DEDECV .::::.:::: . .:. .: ::::.::: . .. .... ::.. . CCDS54 ILHFAVAEADNIVTLQPFRDKCEPVFLFSVNGKIIEKIQGANAPLVNKKVINLIDEERKI 70 80 90 100 110 120 90 100 110 120 pF1KE3 SHGKNNGE--------DEDM-VSSERTC-------TLAIIKPDAVAHGKTDEIIMKIQEA . :. : : :: : : ..:::::::: :. :: :: .: CCDS54 AAGEMARPQYPEIPLVDSDSEVSEESPCESVQELYSIAIIKPDAVISKKVLEIKRKITKA 130 140 150 160 170 180 130 140 150 160 170 180 pF1KE3 GFEILTNEERTMTEAEVRLFYQHKAGEEAFEKLVHHMCSGPSHLLILTRTEGFEDVVTTW :: : .... ..:: .: ::.. : . ::..: : :: :..:... .: . . CCDS54 GFIIEAEHKTVLTEEQVVNFYSRIADQCDFEEFVSFMTSGLSYILVVS--QGSKHNPPSE 190 200 210 220 230 190 200 210 220 230 240 pF1KE3 RTV-MGPRDPNVARREQPESLRAQYGTEMPFNAVHGSRDREDADRELALLFPSLKFSDKD .: . .:: ..::: ..:: CCDS54 ETEPQTDTEPNERSEDQPE-VEAQVTPGMMKNKQDSLQEYLERQHLAQLCDIEEDAANVA 240 250 260 270 280 290 263 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 02:24:35 2016 done: Sun Nov 6 02:24:35 2016 Total Scan time: 2.330 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]