FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2744, 215 aa 1>>>pF1KE2744 215 - 215 aa - 215 aa Library: human.CCDS.faa 18921897 residues in 33420 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4881+/-0.000733; mu= 12.9166+/- 0.044 mean_var=64.2296+/-12.743, 0's: 0 Z-trim(108.4): 18 B-trim: 6 in 1/51 Lambda= 0.160032 statistics sampled from 10340 (10353) to 10340 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.703), E-opt: 0.2 (0.31), width: 16 Scan time: 0.960 The best scores are: opt bits E(33420) CCDS3111.1 NMNAT3 gene_id:349565|Hs109|chr3 ( 215) 1456 344.4 3.3e-95 CCDS82846.1 NMNAT3 gene_id:349565|Hs109|chr3 ( 252) 1456 344.4 3.8e-95 CCDS87147.1 NMNAT3 gene_id:349565|Hs109|chr3 ( 346) 1456 344.5 5e-95 CCDS56282.1 NMNAT3 gene_id:349565|Hs109|chr3 ( 163) 849 204.2 4e-53 CCDS87145.1 NMNAT3 gene_id:349565|Hs109|chr3 ( 170) 611 149.3 1.4e-36 CCDS108.1 NMNAT1 gene_id:64802|Hs109|chr1 ( 279) 421 105.5 3.5e-23 CCDS72698.1 NMNAT1 gene_id:64802|Hs109|chr1 ( 160) 276 71.9 2.6e-13 CCDS1354.1 NMNAT2 gene_id:23057|Hs109|chr1 ( 302) 270 70.6 1.2e-12 CCDS1353.1 NMNAT2 gene_id:23057|Hs109|chr1 ( 307) 270 70.6 1.2e-12 >>CCDS3111.1 NMNAT3 gene_id:349565|Hs109|chr3 (215 aa) initn: 1456 init1: 1456 opt: 1456 Z-score: 1823.0 bits: 344.4 E(33420): 3.3e-95 Smith-Waterman score: 1456; 100.0% identity (100.0% similar) in 215 aa overlap (1-215:1-215) 10 20 30 40 50 60 pF1KE2 MYQVIQGIISPVNDTYGKKDLAASHHRVAMARLALQTSDWIRVDPWESEQAQWMETVKVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 MYQVIQGIISPVNDTYGKKDLAASHHRVAMARLALQTSDWIRVDPWESEQAQWMETVKVL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 RHHHSKLLRSPPQMEGPDHGKALFSTPAAVPELKLLCGADVLKTFQTPNLWKDAHIQEIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 RHHHSKLLRSPPQMEGPDHGKALFSTPAAVPELKLLCGADVLKTFQTPNLWKDAHIQEIV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 EKFGLVCVGRVGHDPKGYIAESPILRMHQHNIHLAKEPVQNEISATYIRRALGQGQSVKY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 EKFGLVCVGRVGHDPKGYIAESPILRMHQHNIHLAKEPVQNEISATYIRRALGQGQSVKY 130 140 150 160 170 180 190 200 210 pF1KE2 LIPDAVITYIKDHGLYTKGSTWKGKSTQSTEGKTS ::::::::::::::::::::::::::::::::::: CCDS31 LIPDAVITYIKDHGLYTKGSTWKGKSTQSTEGKTS 190 200 210 >>CCDS82846.1 NMNAT3 gene_id:349565|Hs109|chr3 (252 aa) initn: 1456 init1: 1456 opt: 1456 Z-score: 1821.9 bits: 344.4 E(33420): 3.8e-95 Smith-Waterman score: 1456; 100.0% identity (100.0% similar) in 215 aa overlap (1-215:38-252) 10 20 30 pF1KE2 MYQVIQGIISPVNDTYGKKDLAASHHRVAM :::::::::::::::::::::::::::::: CCDS82 VLLACGSFNPITNMHLRMFEVARDHLHQTGMYQVIQGIISPVNDTYGKKDLAASHHRVAM 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE2 ARLALQTSDWIRVDPWESEQAQWMETVKVLRHHHSKLLRSPPQMEGPDHGKALFSTPAAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 ARLALQTSDWIRVDPWESEQAQWMETVKVLRHHHSKLLRSPPQMEGPDHGKALFSTPAAV 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE2 PELKLLCGADVLKTFQTPNLWKDAHIQEIVEKFGLVCVGRVGHDPKGYIAESPILRMHQH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 PELKLLCGADVLKTFQTPNLWKDAHIQEIVEKFGLVCVGRVGHDPKGYIAESPILRMHQH 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE2 NIHLAKEPVQNEISATYIRRALGQGQSVKYLIPDAVITYIKDHGLYTKGSTWKGKSTQST :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 NIHLAKEPVQNEISATYIRRALGQGQSVKYLIPDAVITYIKDHGLYTKGSTWKGKSTQST 190 200 210 220 230 240 pF1KE2 EGKTS ::::: CCDS82 EGKTS 250 >>CCDS87147.1 NMNAT3 gene_id:349565|Hs109|chr3 (346 aa) initn: 1456 init1: 1456 opt: 1456 Z-score: 1819.7 bits: 344.5 E(33420): 5e-95 Smith-Waterman score: 1456; 100.0% identity (100.0% similar) in 215 aa overlap (1-215:132-346) 10 20 30 pF1KE2 MYQVIQGIISPVNDTYGKKDLAASHHRVAM :::::::::::::::::::::::::::::: CCDS87 CKGTTLKKIIFIDSTWNQTNKIFTDERLQGMYQVIQGIISPVNDTYGKKDLAASHHRVAM 110 120 130 140 150 160 40 50 60 70 80 90 pF1KE2 ARLALQTSDWIRVDPWESEQAQWMETVKVLRHHHSKLLRSPPQMEGPDHGKALFSTPAAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS87 ARLALQTSDWIRVDPWESEQAQWMETVKVLRHHHSKLLRSPPQMEGPDHGKALFSTPAAV 170 180 190 200 210 220 100 110 120 130 140 150 pF1KE2 PELKLLCGADVLKTFQTPNLWKDAHIQEIVEKFGLVCVGRVGHDPKGYIAESPILRMHQH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS87 PELKLLCGADVLKTFQTPNLWKDAHIQEIVEKFGLVCVGRVGHDPKGYIAESPILRMHQH 230 240 250 260 270 280 160 170 180 190 200 210 pF1KE2 NIHLAKEPVQNEISATYIRRALGQGQSVKYLIPDAVITYIKDHGLYTKGSTWKGKSTQST :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS87 NIHLAKEPVQNEISATYIRRALGQGQSVKYLIPDAVITYIKDHGLYTKGSTWKGKSTQST 290 300 310 320 330 340 pF1KE2 EGKTS ::::: CCDS87 EGKTS >>CCDS56282.1 NMNAT3 gene_id:349565|Hs109|chr3 (163 aa) initn: 849 init1: 849 opt: 849 Z-score: 1067.5 bits: 204.2 E(33420): 4e-53 Smith-Waterman score: 849; 100.0% identity (100.0% similar) in 127 aa overlap (89-215:37-163) 60 70 80 90 100 110 pF1KE2 VLRHHHSKLLRSPPQMEGPDHGKALFSTPAAVPELKLLCGADVLKTFQTPNLWKDAHIQE :::::::::::::::::::::::::::::: CCDS56 VVLLACGSFNPITNMHLRMFEVARDHLHQTAVPELKLLCGADVLKTFQTPNLWKDAHIQE 10 20 30 40 50 60 120 130 140 150 160 170 pF1KE2 IVEKFGLVCVGRVGHDPKGYIAESPILRMHQHNIHLAKEPVQNEISATYIRRALGQGQSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 IVEKFGLVCVGRVGHDPKGYIAESPILRMHQHNIHLAKEPVQNEISATYIRRALGQGQSV 70 80 90 100 110 120 180 190 200 210 pF1KE2 KYLIPDAVITYIKDHGLYTKGSTWKGKSTQSTEGKTS ::::::::::::::::::::::::::::::::::::: CCDS56 KYLIPDAVITYIKDHGLYTKGSTWKGKSTQSTEGKTS 130 140 150 160 >>CCDS87145.1 NMNAT3 gene_id:349565|Hs109|chr3 (170 aa) initn: 611 init1: 611 opt: 611 Z-score: 770.2 bits: 149.3 E(33420): 1.4e-36 Smith-Waterman score: 611; 97.8% identity (100.0% similar) in 90 aa overlap (1-90:38-127) 10 20 30 pF1KE2 MYQVIQGIISPVNDTYGKKDLAASHHRVAM :::::::::::::::::::::::::::::: CCDS87 VLLACGSFNPITNMHLRMFEVARDHLHQTGMYQVIQGIISPVNDTYGKKDLAASHHRVAM 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE2 ARLALQTSDWIRVDPWESEQAQWMETVKVLRHHHSKLLRSPPQMEGPDHGKALFSTPAAV ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::.. CCDS87 ARLALQTSDWIRVDPWESEQAQWMETVKVLRHHHSKLLRSPPQMEGPDHGKALFSTPAGI 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE2 PELKLLCGADVLKTFQTPNLWKDAHIQEIVEKFGLVCVGRVGHDPKGYIAESPILRMHQH CCDS87 LKRILQTLFEHLTGTDSGLRLRTETASPSPHLKLETGYEWSCA 130 140 150 160 170 >>CCDS108.1 NMNAT1 gene_id:64802|Hs109|chr1 (279 aa) initn: 694 init1: 414 opt: 421 Z-score: 529.8 bits: 105.5 E(33420): 3.5e-23 Smith-Waterman score: 660; 47.7% identity (70.2% similar) in 218 aa overlap (2-200:41-258) 10 20 30 pF1KE2 MYQVIQGIISPVNDTYGKKDLAASHHRVAMA : :..::::::.:.: :: : ..::: :: CCDS10 LLACGSFNPITNMHLRLFELAKDYMNGTGRYTVVKGIISPVGDAYKKKGLIPAYHRVIMA 20 30 40 50 60 70 40 50 60 70 pF1KE2 RLALQTSDWIRVDPWESEQAQWMETVKVLRHHHSKLLRSP-------PQMEGP------- .:: ..: :..:: ::: : .: ::.::::::. :: : : .: : CCDS10 ELATKNSKWVEVDTWESLQKEWKETLKVLRHHQEKLEASDCDHQQNSPTLERPGRKRKWT 80 90 100 110 120 130 80 90 100 110 120 130 pF1KE2 -----DHGKALFSTPAAVPELKLLCGADVLKTFQTPNLWKDAHIQEIVEKFGLVCVGRVG .. :.: :::..:::::::.:..: .:::::. : .:: ..::.:: :.: CCDS10 ETQDSSQKKSLEPKTKAVPKVKLLCGADLLESFAVPNLWKSEDITQIVANYGLICVTRAG 140 150 160 170 180 190 140 150 160 170 180 190 pF1KE2 HDPKGYIAESPILRMHQHNIHLAKEPVQNEISATYIRRALGQGQSVKYLIPDAVITYIKD .: . .: :: .: :. :::...: . :.::.: ::::: .:::..::.:: : ::. CCDS10 NDAQKFIYESDVLWKHRSNIHVVNEWIANDISSTKIRRALRRGQSIRYLVPDLVQEYIEK 200 210 220 230 240 250 200 210 pF1KE2 HGLYTKGSTWKGKSTQSTEGKTS :.::.. : CCDS10 HNLYSSESEDRNAGVILAPLQRNTAEAKT 260 270 >>CCDS72698.1 NMNAT1 gene_id:64802|Hs109|chr1 (160 aa) initn: 276 init1: 276 opt: 276 Z-score: 352.6 bits: 71.9 E(33420): 2.6e-13 Smith-Waterman score: 276; 59.1% identity (80.3% similar) in 66 aa overlap (2-67:41-106) 10 20 30 pF1KE2 MYQVIQGIISPVNDTYGKKDLAASHHRVAMA : :..::::::.:.: :: : ..::: :: CCDS72 LLACGSFNPITNMHLRLFELAKDYMNGTGRYTVVKGIISPVGDAYKKKGLIPAYHRVIMA 20 30 40 50 60 70 40 50 60 70 80 90 pF1KE2 RLALQTSDWIRVDPWESEQAQWMETVKVLRHHHSKLLRSPPQMEGPDHGKALFSTPAAVP .:: ..: :..:: ::: : .: ::.::::::. :: CCDS72 ELATKNSKWVEVDTWESLQKEWKETLKVLRHHQEKLEASDCDHQQNSPTLERPGRKRKWT 80 90 100 110 120 130 100 110 120 130 140 150 pF1KE2 ELKLLCGADVLKTFQTPNLWKDAHIQEIVEKFGLVCVGRVGHDPKGYIAESPILRMHQHN CCDS72 ETQDSSQKKSLEPKTKDGVSLYHPGWSAVA 140 150 160 >>CCDS1354.1 NMNAT2 gene_id:23057|Hs109|chr1 (302 aa) initn: 452 init1: 269 opt: 270 Z-score: 340.8 bits: 70.6 E(33420): 1.2e-12 Smith-Waterman score: 350; 32.9% identity (55.0% similar) in 240 aa overlap (2-175:36-275) 10 20 30 pF1KE2 MYQVIQGIISPVNDTYGKKDLAASHHRVAMA . :: ::.:::.:.:::. :..:.::. : CCDS13 LEEIQACQGLWEVFVTLSERARDYLHKTGRFIVIGGIVSPVHDSYGKQGLVSSRHRLIMC 10 20 30 40 50 60 40 50 60 70 pF1KE2 RLALQTSDWIRVDPWESEQAQWMETVKVLRHHHSKLLR----------SP--------PQ .::.:.:::::::::: : :. : .::.::.. . : .: :: CCDS13 QLAVQNSDWIRVDPWECYQDTWQTTCSVLEHHRDLMKRVTGCILSNVNTPSMTPVIGQPQ 70 80 90 100 110 120 80 90 pF1KE2 MEGPD---HGKALFSTPAAV-----------------P---------------------- : :. ... . . :.:. : CCDS13 NETPQPIYQNSNVATKPTAAKILGKVGESLSRICCVRPPVERFTFVDENANLGTVMRYEE 130 140 150 160 170 180 100 110 120 130 140 pF1KE2 -ELK--LLCGADVLKTFQTPNLWKDAHIQEIVEKFGLVCVGRVGHDPKGYIAESPILRMH ::. ::::.:.:..: :.::..: .. :: ::.: : : . : . .: ::: . CCDS13 IELRILLLCGSDLLESFCIPGLWNEADMEVIVGDFGIVVVPRDAADTDRIMNHSSILRKY 190 200 210 220 230 240 150 160 170 180 190 200 pF1KE2 QHNIHLAKEPVQNE---ISATYIRRALGQGQSVKYLIPDAVITYIKDHGLYTKGSTWKGK ..:: ..:. ... .:.: : :: .: CCDS13 KNNIMVVKDDINHPMSVVSSTKSRLALQHGDGHVVDYLSQPVIDYILKSQLYINASG 250 260 270 280 290 300 >>CCDS1353.1 NMNAT2 gene_id:23057|Hs109|chr1 (307 aa) initn: 452 init1: 269 opt: 270 Z-score: 340.7 bits: 70.6 E(33420): 1.2e-12 Smith-Waterman score: 350; 32.9% identity (55.0% similar) in 240 aa overlap (2-175:41-280) 10 20 30 pF1KE2 MYQVIQGIISPVNDTYGKKDLAASHHRVAMA . :: ::.:::.:.:::. :..:.::. : CCDS13 LLACGSFNPITKGHIQMFERARDYLHKTGRFIVIGGIVSPVHDSYGKQGLVSSRHRLIMC 20 30 40 50 60 70 40 50 60 70 pF1KE2 RLALQTSDWIRVDPWESEQAQWMETVKVLRHHHSKLLR----------SP--------PQ .::.:.:::::::::: : :. : .::.::.. . : .: :: CCDS13 QLAVQNSDWIRVDPWECYQDTWQTTCSVLEHHRDLMKRVTGCILSNVNTPSMTPVIGQPQ 80 90 100 110 120 130 80 90 pF1KE2 MEGPD---HGKALFSTPAAV-----------------P---------------------- : :. ... . . :.:. : CCDS13 NETPQPIYQNSNVATKPTAAKILGKVGESLSRICCVRPPVERFTFVDENANLGTVMRYEE 140 150 160 170 180 190 100 110 120 130 140 pF1KE2 -ELK--LLCGADVLKTFQTPNLWKDAHIQEIVEKFGLVCVGRVGHDPKGYIAESPILRMH ::. ::::.:.:..: :.::..: .. :: ::.: : : . : . .: ::: . CCDS13 IELRILLLCGSDLLESFCIPGLWNEADMEVIVGDFGIVVVPRDAADTDRIMNHSSILRKY 200 210 220 230 240 250 150 160 170 180 190 200 pF1KE2 QHNIHLAKEPVQNE---ISATYIRRALGQGQSVKYLIPDAVITYIKDHGLYTKGSTWKGK ..:: ..:. ... .:.: : :: .: CCDS13 KNNIMVVKDDINHPMSVVSSTKSRLALQHGDGHVVDYLSQPVIDYILKSQLYINASG 260 270 280 290 300 215 residues in 1 query sequences 18921897 residues in 33420 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Wed Jun 19 14:29:04 2019 done: Wed Jun 19 14:29:05 2019 Total Scan time: 0.960 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]