FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5733, 642 aa 1>>>pF1KE5733 642 - 642 aa - 642 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.5729+/-0.00102; mu= -2.5754+/- 0.062 mean_var=316.1232+/-63.191, 0's: 0 Z-trim(114.9): 2 B-trim: 0 in 0/52 Lambda= 0.072135 statistics sampled from 15441 (15443) to 15441 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.783), E-opt: 0.2 (0.474), width: 16 Scan time: 4.020 The best scores are: opt bits E(32554) CCDS12062.1 MUM1 gene_id:84939|Hs108|chr19 ( 711) 4348 466.2 6.6e-131 CCDS55469.1 MUM1L1 gene_id:139221|Hs108|chrX ( 696) 1380 157.4 6.2e-38 >>CCDS12062.1 MUM1 gene_id:84939|Hs108|chr19 (711 aa) initn: 4348 init1: 4348 opt: 4348 Z-score: 2463.7 bits: 466.2 E(32554): 6.6e-131 Smith-Waterman score: 4348; 99.8% identity (99.8% similar) in 639 aa overlap (4-642:73-711) 10 20 30 pF1KE5 MVSASQNEVPAAPLEELAYRRSLRVALDVLSEG :::::::::::::::::::::::::::::: CCDS12 ILSLEEKIKVKSTEVEILEKSQIEAIASSLASQNEVPAAPLEELAYRRSLRVALDVLSEG 50 60 70 80 90 100 40 50 60 70 80 90 pF1KE5 SIWSQESSAGTGRADRSLRGKPMEHVSSPCDSNSSSLPRGDVLGSSRPHRRRPCVQQSLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 SIWSQESSAGTGRADRSLRGKPMEHVSSPCDSNSSSLPRGDVLGSSRPHRRRPCVQQSLS 110 120 130 140 150 160 100 110 120 130 140 150 pF1KE5 SSFTCEKDPECKVDHKKGLRKSENPRGPLVLPAGGGAQDESGSRIHHKNWTLASKRGRNS ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: :: CCDS12 SSFTCEKDPECKVDHKKGLRKSENPRGPLVLPAGGGAQDESGSRIHHKNWTLASKRGGNS 170 180 190 200 210 220 160 170 180 190 200 210 pF1KE5 AQKASLCLNGSSLSEDDTERDMGSKGGSWAAPSLPSGVREDDPCANAEGHDPGLPLGSLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 AQKASLCLNGSSLSEDDTERDMGSKGGSWAAPSLPSGVREDDPCANAEGHDPGLPLGSLT 230 240 250 260 270 280 220 230 240 250 260 270 pF1KE5 APPAPEPSACSEPGECPAKKRPRLDGSQRPPAVQLEPMAAGAAPSPGPGPGPRESVTPRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 APPAPEPSACSEPGECPAKKRPRLDGSQRPPAVQLEPMAAGAAPSPGPGPGPRESVTPRS 290 300 310 320 330 340 280 290 300 310 320 330 pF1KE5 TARLGPPPSHASADATRCLPCPDSQKLEKECQSSEESMGSNSMRSILEEDEEDEEPPRVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 TARLGPPPSHASADATRCLPCPDSQKLEKECQSSEESMGSNSMRSILEEDEEDEEPPRVL 350 360 370 380 390 400 340 350 360 370 380 390 pF1KE5 LYHEPRSFEVGMLVWHKHKKYPFWPAVVKSVRQRDKKASVLYIEGHMNPKMKGFTVSLKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LYHEPRSFEVGMLVWHKHKKYPFWPAVVKSVRQRDKKASVLYIEGHMNPKMKGFTVSLKS 410 420 430 440 450 460 400 410 420 430 440 450 pF1KE5 LKHFDCKEKQTLLNQAREDFNQDIGWCVSLITDYRVRLGCGSFAGSFLEYYAADISYPVR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LKHFDCKEKQTLLNQAREDFNQDIGWCVSLITDYRVRLGCGSFAGSFLEYYAADISYPVR 470 480 490 500 510 520 460 470 480 490 500 510 pF1KE5 KSIQQDVLGTKLPQLSKGSPEEPVVGCPLGQRQPCRKMLPDRSRAARDRANQKLVEYIVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 KSIQQDVLGTKLPQLSKGSPEEPVVGCPLGQRQPCRKMLPDRSRAARDRANQKLVEYIVK 530 540 550 560 570 580 520 530 540 550 560 570 pF1KE5 AKGAESHLRAILKSRKPSRWLQTFLSSSQYVTCVETYLEDEGQLDLVVKYLQGVYQEVGA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 AKGAESHLRAILKSRKPSRWLQTFLSSSQYVTCVETYLEDEGQLDLVVKYLQGVYQEVGA 590 600 610 620 630 640 580 590 600 610 620 630 pF1KE5 KVLQRTNGDRIRFILDVLLPEAIICAISAVDEVDYKTAEEKYIKGPSLSYREKEIFDNQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 KVLQRTNGDRIRFILDVLLPEAIICAISAVDEVDYKTAEEKYIKGPSLSYREKEIFDNQL 650 660 670 680 690 700 640 pF1KE5 LEERNRRRR ::::::::: CCDS12 LEERNRRRR 710 >>CCDS55469.1 MUM1L1 gene_id:139221|Hs108|chrX (696 aa) initn: 1378 init1: 1287 opt: 1380 Z-score: 794.5 bits: 157.4 E(32554): 6.2e-38 Smith-Waterman score: 1522; 41.9% identity (67.1% similar) in 659 aa overlap (6-640:73-689) 10 20 30 pF1KE5 MVSASQNEVPAAPLEELAYRRSLRVALDVLSEGSI :.: : : :: :: :::.::: .:.: . CCDS55 SLDEKIKLDSTETKILNKSQIEAIAASLGLQSEDSAPPTEETAYGRSLKVALGILNERTN 50 60 70 80 90 100 40 50 60 70 80 90 pF1KE5 WSQESSAGTGRADRSLRGKPMEHVSSPCDSNSSSLPRGDVLGSSRPHRRRPCVQQSLSSS :: :.. . .. :... .:: .. .::. : . .. :. :.:: CCDS55 LSQASTSDEEEITMLSQNVPQKQSDSP-PHKKYRKDEGDLPGCLEERENSACL---LASS 110 120 130 140 150 100 110 120 130 140 150 pF1KE5 FTCEKDPECKVDHKKGLRKSENPRGPLVLPAGGGAQDESGSRIHHKNWTLASKRGRNSAQ :.: : ::. : ..: :... : : : CCDS55 ---ESDDSLYDD------KSQAPTMVDTIP----------SEVETK-----------SLQ 160 170 180 160 170 180 190 200 210 pF1KE5 KASLCLNGSSLSEDDTERDMGSKGGSWAAPSLPSGVREDDPCANAEGHDPGL-PLGS--L ..: : . :::::. :.. .: :. :. :.:.:.. :.. : : : ::.: : CCDS55 NSSWCETFPSLSEDNDEKENKNKIDISAVMSVHSAVKEESACVKDEKFAPPLSPLSSDML 190 200 210 220 230 240 220 230 240 250 pF1KE5 TAPPAPE-------------PSACS-------EPGECPAKKRPRLDGSQRPPAVQLEPMA : : . :: :: .::: :.. : :: :: :... : :. CCDS55 IMPKALKEESEDTCLETLAVPSECSAFSENIEDPGEGPSN--PCLDTSQNQPSMESE-MG 250 260 270 280 290 300 260 270 280 290 300 310 pF1KE5 AGAAPSPGPGPGPRESVTPRSTARLGPPPSHASADATRCLPCPDSQKLEKECQSSEESMG :.: : : :: . :.. :: .. : . : ..::.: :.:..:. CCDS55 AAACP----GSCSRECEVSFSASNPVWDYSHL-MSSERNFQRLDFEELEEEGQASDKSLL 310 320 330 340 350 360 320 330 340 350 360 370 pF1KE5 SNSMR-SILEEDEEDEEPPRVLLYHEPRSFEVGMLVWHKHKKYPFWPAVVKSVRQRDKKA . . :.:..:::::: :: .:..: . ::.::.:: :..::::::::.::.:....:: CCDS55 PSRINLSLLDDDEEDEELPRFILHYETHPFETGMIVWFKYQKYPFWPAVIKSIRRKERKA 370 380 390 400 410 420 380 390 400 410 420 430 pF1KE5 SVLYIEGHMNPKMKGFTVSLKSLKHFDCKEKQTLLNQAREDFNQDIGWCVSLITDYRVRL :::..:..:: . ::. :... ::.::::::: :...::::....: ::.::: :::::. CCDS55 SVLFVEANMNSEKKGIRVNFRRLKKFDCKEKQMLVDKAREDYSESIDWCISLICDYRVRI 430 440 450 460 470 480 440 450 460 470 480 490 pF1KE5 GCGSFAGSFLEYYAADISYPVRKSIQQDVLGTKLPQLSKGSPEEPVVGCPLGQRQPCRKM :::::.::.:::::::::::::: .::.. .:.:.: . . .::.. ... .:. CCDS55 GCGSFTGSLLEYYAADISYPVRKETKQDTFRNKFPKLHNEDAREPMAVTSQTKKMSFQKI 490 500 510 520 530 540 500 510 520 530 540 550 pF1KE5 LPDRSRAARDRANQKLVEYIVKAKGAESHLRAILKSRKPSRWLQTFLSSSQYVTCVETYL :::: .:::::::..::..::.:::.:.:: ::... : ::::..::...... :.:::. CCDS55 LPDRMKAARDRANKNLVDFIVNAKGTENHLLAIVNGTKGSRWLKSFLNANRFTPCIETYF 550 560 570 580 590 600 560 570 580 590 600 610 pF1KE5 EDEGQLDLVVKYLQGVYQEVGAKVLQRTNGDRIRFILDVLLPEAIICAISAVDEVDYKTA ::: ::: :::::: : ... . . :.:.:::.:::::::::.::::: .::..: CCDS55 EDEDQLDEVVKYLQEVCNQIDQIMPTWIKDDKIKFILEVLLPEAIICSISAVDGLDYEAA 610 620 630 640 650 660 620 630 640 pF1KE5 EEKYIKGPSLSYREKEIFDNQLLEERNRRRR : ::.::: :.:::.:.:: ... :. :. CCDS55 EAKYLKGPCLGYRERELFDAKIIYEKRRKAPTNEAH 670 680 690 642 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 06:11:26 2016 done: Tue Nov 8 06:11:26 2016 Total Scan time: 4.020 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]