FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1195, 828 aa
  1>>>pF1KE1195 828 - 828 aa - 828 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.5241+/-0.000966; mu= 14.9493+/- 0.058
 mean_var=92.4895+/-18.109, 0's: 0 Z-trim(105.8): 59 B-trim: 0 in 0/53
 Lambda= 0.133361
 statistics sampled from 8556 (8609) to 8556 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.624), E-opt: 0.2 (0.264), width: 16
 Scan time: 3.380

The best scores are:                                 opt   bits E(32554)
CCDS41343.1 MYSM1 gene_id:114803|Hs108|chr1  ( 828) 5545 1077.7        0
CCDS74261.1 MPND gene_id:84954|Hs108|chr19   ( 501)  361   80.2  9.8e-15
CCDS42470.1 MPND gene_id:84954|Hs108|chr19   ( 471)  347   77.5    6e-14

>>CCDS41343.1 MYSM1 gene_id:114803|Hs108|chr1 (828 aa)
 initn: 5545 init1: 5545 opt: 5545 Z-score: 5765.0 bits: 1077.7 E(32554): 0
Smith-Waterman score: 5545; 99.9% identity (100.0% similar) in 828 aa overlap (1-828:1-828)

               10        20        30        40        50        60
pF1KE1 MAAEEADVDIEGDVVAAAGAQPGSGENTASVLQKDHYLDSSWRTENGLIPWTLDNTISEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MAAEEADVDIEGDVVAAAGAQPGSGENTASVLQKDHYLDSSWRTENGLIPWTLDNTISEE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 NRAVIEKMLLEEEYYLSKKSQPEKVWLDQKEDDKKYMKSLQKTAKIMVHSPTKPASYSVK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 NRAVIEKMLLEEEYYLSKKSQPEKVWLDQKEDDKKYMKSLQKTAKIMVHSPTKPASYSVK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 WTIEEKELFEQGLAKFGRRWTKISKLIGSRTVLQVKSYARQYFKNKVKCGLDKETPNQKT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 WTIEEKELFEQGLAKFGRRWTKISKLIGSRTVLQVKSYARQYFKNKVKCGLDKETPNQKT
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 GHNLQVKNEDKGTKAWTPSCLRGRADPNLNAVKIEKLSDDEEVDITDEVDELSSQTPQKN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 GHNLQVKNEDKGTKAWTPSCLRGRADPNLNAVKIEKLSDDEEVDITDEVDELSSQTPQKN
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE1 SSSDLLLDFPNSKMHETNQGEFIASDSQEALFSKSSRGCLQNEKQDETLSSSEITLWTEK
       :::::::::::::::::::::::.::::::::::::::::::::::::::::::::::::
CCDS41 SSSDLLLDFPNSKMHETNQGEFITSDSQEALFSKSSRGCLQNEKQDETLSSSEITLWTEK
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE1 QSNGDKKSIELNDQKFNELIKNCNKHDGRGIIVDARQLPSPEPCEIQKNLNDNEMLFHSC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 QSNGDKKSIELNDQKFNELIKNCNKHDGRGIIVDARQLPSPEPCEIQKNLNDNEMLFHSC
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE1 QMVEESHEEEELKPPEQEIEIDRNIIQEEEKQAIPEFFEGRQAKTPERYLKIRNYILDQW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 QMVEESHEEEELKPPEQEIEIDRNIIQEEEKQAIPEFFEGRQAKTPERYLKIRNYILDQW
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE1 EICKPKYLNKTSVRPGLKNCGDVNCIGRIHTYLELIGAINFGCEQAVYNRPQTVDKVRIR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EICKPKYLNKTSVRPGLKNCGDVNCIGRIHTYLELIGAINFGCEQAVYNRPQTVDKVRIR
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KE1 DRKDAVEAYQLAQRLQSMRTRRRRVRDPWGNWCDAKDLEGQTFEHLSAEELAKRREEEKG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 DRKDAVEAYQLAQRLQSMRTRRRRVRDPWGNWCDAKDLEGQTFEHLSAEELAKRREEEKG
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KE1 RPVKSLKVPRPTKSSFDPFQLIPCNFFSEEKQEPFQVKVASEALLIMDLHAHVSMAEVIG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 RPVKSLKVPRPTKSSFDPFQLIPCNFFSEEKQEPFQVKVASEALLIMDLHAHVSMAEVIG
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KE1 LLGGRYSEVDKVVEVCAAEPCNSLSTGLQCEMDPVSQTQASETLAVRGFSVIGWYHSHPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LLGGRYSEVDKVVEVCAAEPCNSLSTGLQCEMDPVSQTQASETLAVRGFSVIGWYHSHPA
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KE1 FDPNPSLRDIDTQAKYQSYFSRGGAKFIGMIVSPYNRNNPLPYSQITCLVISEEISPDGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 
FDPNPSLRDIDTQAKYQSYFSRGGAKFIGMIVSPYNRNNPLPYSQITCLVISEEISPDGS 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE1 YRLPYKFEVQQMLEEPQWGLVFEKTRWIIEKYRLSHSSVPMDKIFRRDSDLTCLQKLLEC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 YRLPYKFEVQQMLEEPQWGLVFEKTRWIIEKYRLSHSSVPMDKIFRRDSDLTCLQKLLEC 730 740 750 760 770 780 790 800 810 820 pF1KE1 MRKTLSKVTNCFMAEEFLTEIENLFLSNYKSNQENGVTEENCTKELLM :::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 MRKTLSKVTNCFMAEEFLTEIENLFLSNYKSNQENGVTEENCTKELLM 790 800 810 820 >>CCDS74261.1 MPND gene_id:84954|Hs108|chr19 (501 aa) initn: 370 init1: 162 opt: 361 Z-score: 378.0 bits: 80.2 E(32554): 9.8e-15 Smith-Waterman score: 361; 29.6% identity (60.3% similar) in 247 aa overlap (557-796:251-492) 530 540 550 560 570 580 pF1KE1 SAEELAKRREEEKGRPVKSLKVPRPTKSSFDPFQLIPCNFFSE-EKQEPFQVKVASEALL .: :. . :. .: .::.: :.:..:. CCDS74 PEATTPGKRVDSKIRVPVRYCMLGSRDLARNPHTLVEVTSFAAINKFQPFNVAVSSNVLF 230 240 250 260 270 280 590 600 610 620 630 640 pF1KE1 IMDLHAHVSMAEVIGLLGGRYSEVDKVVEVCAAEPCNSLSTGLQCEMDPVSQTQASETLA ..:.:.:.. .::.: ::::.. .... : : :: : : . : . . . ..: CCDS74 LLDFHSHLTRSEVVGYLGGRWDVNSQMLTVLRAFPCRS-RLG-DAETAAAIEEEIYQSLF 290 300 310 320 330 650 660 670 680 690 700 pF1KE1 VRGFSVIGWYHSHPAFDPNPSLRDIDTQAKYQSYF---SRGGAKFIGMIVSPYNRNNPLP .::.:..::::::: :::.:::.: :: . : : .... ::: .:: : CCDS74 LRGLSLVGWYHSHPHSPALPSLQDIDAQMDYQLRLQGSSNGFQPCLALLCSPYYSGNPGP 340 350 360 370 380 390 710 720 730 740 750 760 pF1KE1 YSQITCLVI--SEEISPDGSYRLPYKFEVQQMLEEPQWGLVFEKTRWIIEKYRLSHSSVP :.:. . . : :. .: .:. :. . . . .... ..: :. : . : CCDS74 ESKISPFWVMPPPEQRPS-DYGIPMDVEMAYVQDSFLTNDILHEMMLLVEFYKGSPDLVR 400 410 420 430 440 450 770 780 790 800 810 pF1KE1 MDKIFRRDSDLTCLQKL-LECMRKTLSKVTNCFMAEEFLTEIENLFLSNYKSNQENGVTE ... . .. : :.:: . .: . . : . :. 
CCDS74 LQEPWSQEH--TYLDKLKISLASRTPKDQSLCHVLEQVCGVLKQGS 460 470 480 490 500 820 pF1KE1 ENCTKELLM >>CCDS42470.1 MPND gene_id:84954|Hs108|chr19 (471 aa) initn: 370 init1: 162 opt: 347 Z-score: 363.9 bits: 77.5 E(32554): 6e-14 Smith-Waterman score: 347; 33.3% identity (61.5% similar) in 213 aa overlap (557-763:251-454) 530 540 550 560 570 580 pF1KE1 SAEELAKRREEEKGRPVKSLKVPRPTKSSFDPFQLIPCNFFSE-EKQEPFQVKVASEALL .: :. . :. .: .::.: :.:..:. CCDS42 PEATTPGKRVDSKIRVPVRYCMLGSRDLARNPHTLVEVTSFAAINKFQPFNVAVSSNVLF 230 240 250 260 270 280 590 600 610 620 630 640 pF1KE1 IMDLHAHVSMAEVIGLLGGRYSEVDKVVEVCAAEPCNSLSTGLQCEMDPVSQTQASETLA ..:.:.:.. .::.: ::::.. .... : : :: : : . : . . . ..: CCDS42 LLDFHSHLTRSEVVGYLGGRWDVNSQMLTVLRAFPCRS-RLG-DAETAAAIEEEIYQSLF 290 300 310 320 330 650 660 670 680 690 700 pF1KE1 VRGFSVIGWYHSHPAFDPNPSLRDIDTQAKYQSYF---SRGGAKFIGMIVSPYNRNNPLP .::.:..::::::: :::.:::.: :: . : : .... ::: .:: : CCDS42 LRGLSLVGWYHSHPHSPALPSLQDIDAQMDYQLRLQGSSNGFQPCLALLCSPYYSGNPGP 340 350 360 370 380 390 710 720 730 740 750 760 pF1KE1 YSQITCLVISEEISPDGSYRLPYKFEVQQM-LEEPQWGLVFEKTRWIIEKYRLSHSS-VP :.:. . . :. :: . . :.:: :. :.: ..: ..: .: .: CCDS42 ESKISPFWVMPP--PEMLLVEFYKGSPDLVRLQEP-WSQ--EHTY--LDKLKISLASRTP 400 410 420 430 440 450 770 780 790 800 810 820 pF1KE1 MDKIFRRDSDLTCLQKLLECMRKTLSKVTNCFMAEEFLTEIENLFLSNYKSNQENGVTEE :. CCDS42 KDQSLCHVLEQVCGVLKQGS 460 470 828 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:59:55 2016 done: Sun Nov 6 04:59:56 2016 Total Scan time: 3.380 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]
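
The rows under "The best scores are:" follow a fixed pattern: accession, description, subject length in parentheses, then the opt score, bit score, and E() value. A minimal sketch of reading one such row in Python is shown below. The names `Hit` and `parse_best_score_row` are hypothetical and are not part of the FASTA package; the pattern is inferred only from the three rows in this report and may not cover other output modes.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'best scores' table (hypothetical container)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# accession, free-text description, "( len)", opt, bits, E()
# Assumption: whitespace between fields may be one or more spaces.
_ROW = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<e>[\d.eE+-]+)\s*$"
)


def parse_best_score_row(line: str) -> Optional[Hit]:
    """Return a Hit for a best-scores row, or None if the line does not match."""
    m = _ROW.match(line.strip())
    if m is None:
        return None
    return Hit(
        accession=m["acc"],
        description=m["desc"],
        length=int(m["len"]),
        opt=int(m["opt"]),
        bits=float(m["bits"]),
        evalue=float(m["e"]),
    )


if __name__ == "__main__":
    rows = [
        "CCDS41343.1 MYSM1 gene_id:114803|Hs108|chr1 ( 828) 5545 1077.7 0",
        "CCDS74261.1 MPND gene_id:84954|Hs108|chr19 ( 501) 361 80.2 9.8e-15",
        "CCDS42470.1 MPND gene_id:84954|Hs108|chr19 ( 471) 347 77.5 6e-14",
    ]
    for row in rows:
        print(parse_best_score_row(row))
```

Lines that do not fit the pattern, such as the table heading itself, return `None`, so the function can be applied to every line of a report and the results filtered.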