FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5784, 770 aa
  1>>>pF1KB5784 770 - 770 aa - 770 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 11.8002+/-0.00102; mu= -8.8875+/- 0.061
 mean_var=357.1605+/-75.295, 0's: 0 Z-trim(114.5): 35 B-trim: 399 in 1/52
 Lambda= 0.067864
 statistics sampled from 15073 (15100) to 15073 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.761), E-opt: 0.2 (0.464), width: 16
 Scan time: 3.100

The best scores are:                                       opt  bits E(32554)
CCDS34149.1 DAB2 gene_id:1601|Hs108|chr5           ( 770) 5195 522.9 7.3e-148
CCDS58946.1 DAB2 gene_id:1601|Hs108|chr5           ( 749) 3703 376.9 6.7e-104
CCDS607.1 DAB1 gene_id:1600|Hs108|chr1             ( 555)  746  87.3 7.5e-17

>>CCDS34149.1 DAB2 gene_id:1601|Hs108|chr5 (770 aa)
 initn: 5195 init1: 5195 opt: 5195 Z-score: 2768.1 bits: 522.9 E(32554): 7.3e-148
Smith-Waterman score: 5195; 100.0% identity (100.0% similar) in 770 aa overlap (1-770:1-770)

               10        20        30        40        50        60
pF1KB5 MSNEVETSATNGQPDQQAAPKAPSKKEKKKGPEKTDEYLLARFKGDGVKYKAKLIGIDDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MSNEVETSATNGQPDQQAAPKAPSKKEKKKGPEKTDEYLLARFKGDGVKYKAKLIGIDDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 PDARGDKMSQDSMMKLKGMAAAGRSQGQHKQRIWVNISLSGIKIIDEKTGVIEHEHPVNK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 PDARGDKMSQDSMMKLKGMAAAGRSQGQHKQRIWVNISLSGIKIIDEKTGVIEHEHPVNK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 ISFIARDVTDNRAFGYVCGGEGQHQFFAIKTGQQAEPLVVDLKDLFQVIYNVKKKEEEKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ISFIARDVTDNRAFGYVCGGEGQHQFFAIKTGQQAEPLVVDLKDLFQVIYNVKKKEEEKK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 KIEEASKAVENGSEALMILDDQTNKLKSGVDQMDLFGDMSTPPDLNSPTESKDILLVDLN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 KIEEASKAVENGSEALMILDDQTNKLKSGVDQMDLFGDMSTPPDLNSPTESKDILLVDLN
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 SEIDTNQNSLRENPFLTNGITSCSLPRPTPQASFLPENAFSANLNFFPTPNPDPFRDDPF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 SEIDTNQNSLRENPFLTNGITSCSLPRPTPQASFLPENAFSANLNFFPTPNPDPFRDDPF
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 TQPDQSTPSSFDSLKSPDQKKENSSSSSTPLSNGPLNGDVDYFGQQFDQISNRTGKQEAQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 TQPDQSTPSSFDSLKSPDQKKENSSSSSTPLSNGPLNGDVDYFGQQFDQISNRTGKQEAQ
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB5 AGPWPFSSSQTQPAVRTQNGVSEREQNGFSVKSSPNPFVGSPPKGLSIQNGVKQDLESSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 AGPWPFSSSQTQPAVRTQNGVSEREQNGFSVKSSPNPFVGSPPKGLSIQNGVKQDLESSV
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB5 QSSPHDSIAIIPPPQSTKPGRGRRTAKSSANDLLASDIFAPPVSEPSGQASPTGQPTALQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 QSSPHDSIAIIPPPQSTKPGRGRRTAKSSANDLLASDIFAPPVSEPSGQASPTGQPTALQ
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB5 PNPLDLFKTSAPAPVGPLVGLGGVTVTLPQAGPWNTASLVFNQSPSMAPGAMMGGQPSGF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 PNPLDLFKTSAPAPVGPLVGLGGVTVTLPQAGPWNTASLVFNQSPSMAPGAMMGGQPSGF
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KB5 SQPVIFGTSPAVSGWNQPSPFAASTPPPVPVVWGPSASVAPNAWSTTSPLGNPFQSNIFP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 SQPVIFGTSPAVSGWNQPSPFAASTPPPVPVVWGPSASVAPNAWSTTSPLGNPFQSNIFP
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KB5 APAVSTQPPSMHSSLLVTPPQPPPRAGPPKDISSDAFTALDPLGDKEIKDVKEMFKDFQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 APAVSTQPPSMHSSLLVTPPQPPPRAGPPKDISSDAFTALDPLGDKEIKDVKEMFKDFQL
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KB5 RQPPAVPARKGEQTSSGTLSAFASYFNSKVGIPQENADHDDFDANQLLNKINEPPKPAPR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 RQPPAVPARKGEQTSSGTLSAFASYFNSKVGIPQENADHDDFDANQLLNKINEPPKPAPR
              670       680       690       700       710       720

              730       740       750       760       770
pF1KB5 QVSLPVTKSTDNAFENPFFKDSFGSSQASVASSQPVSSEMYRDPFGNPFA
       ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 QVSLPVTKSTDNAFENPFFKDSFGSSQASVASSQPVSSEMYRDPFGNPFA
              730       740       750       760       770

>>CCDS58946.1 DAB2 gene_id:1601|Hs108|chr5 (749 aa)
 initn: 3697 init1: 3697 opt: 3703 Z-score: 1978.8 bits: 376.9 E(32554): 6.7e-104
Smith-Waterman score: 4994; 97.3% identity (97.3% similar) in 770 aa overlap (1-770:1-749)

               10        20        30        40        50        60
pF1KB5 MSNEVETSATNGQPDQQAAPKAPSKKEKKKGPEKTDEYLLARFKGDGVKYKAKLIGIDDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MSNEVETSATNGQPDQQAAPKAPSKKEKKKGPEKTDEYLLARFKGDGVKYKAKLIGIDDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 PDARGDKMSQDSMMKLKGMAAAGRSQGQHKQRIWVNISLSGIKIIDEKTGVIEHEHPVNK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 PDARGDKMSQDSMMKLKGMAAAGRSQGQHKQRIWVNISLSGIKIIDEKTGVIEHEHPVNK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 ISFIARDVTDNRAFGYVCGGEGQHQFFAIKTGQQAEPLVVDLKDLFQVIYNVKKKEEEKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 ISFIARDVTDNRAFGYVCGGEGQHQFFAIKTGQQAEPLVVDLKDLFQVIYNVKKKEEEKK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 KIEEASKAVENGSEALMILDDQTNKLKSGVDQMDLFGDMSTPPDLNSPTESKDILLVDLN
       ::::::::::::::::::::::::::::                     :::::::::::
CCDS58 KIEEASKAVENGSEALMILDDQTNKLKS---------------------ESKDILLVDLN
              190       200                            210

              250       260       270       280       290       300
pF1KB5 SEIDTNQNSLRENPFLTNGITSCSLPRPTPQASFLPENAFSANLNFFPTPNPDPFRDDPF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 SEIDTNQNSLRENPFLTNGITSCSLPRPTPQASFLPENAFSANLNFFPTPNPDPFRDDPF
     220       230       240       250       260       270

              310       320       330       340       350       360
pF1KB5 TQPDQSTPSSFDSLKSPDQKKENSSSSSTPLSNGPLNGDVDYFGQQFDQISNRTGKQEAQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 TQPDQSTPSSFDSLKSPDQKKENSSSSSTPLSNGPLNGDVDYFGQQFDQISNRTGKQEAQ
     280       290       300       310       320       330

              370       380       390       400       410       420
pF1KB5 AGPWPFSSSQTQPAVRTQNGVSEREQNGFSVKSSPNPFVGSPPKGLSIQNGVKQDLESSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 AGPWPFSSSQTQPAVRTQNGVSEREQNGFSVKSSPNPFVGSPPKGLSIQNGVKQDLESSV
     340       350       360       370       380       390

              430       440       450       460       470       480
pF1KB5 QSSPHDSIAIIPPPQSTKPGRGRRTAKSSANDLLASDIFAPPVSEPSGQASPTGQPTALQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 QSSPHDSIAIIPPPQSTKPGRGRRTAKSSANDLLASDIFAPPVSEPSGQASPTGQPTALQ
     400       410       420       430       440       450

              490       500       510       520       530       540
pF1KB5 PNPLDLFKTSAPAPVGPLVGLGGVTVTLPQAGPWNTASLVFNQSPSMAPGAMMGGQPSGF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 PNPLDLFKTSAPAPVGPLVGLGGVTVTLPQAGPWNTASLVFNQSPSMAPGAMMGGQPSGF
     460       470       480       490       500       510

              550       560       570       580       590       600
pF1KB5 SQPVIFGTSPAVSGWNQPSPFAASTPPPVPVVWGPSASVAPNAWSTTSPLGNPFQSNIFP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 SQPVIFGTSPAVSGWNQPSPFAASTPPPVPVVWGPSASVAPNAWSTTSPLGNPFQSNIFP
     520       530       540       550       560       570

              610       620       630       640       650       660
pF1KB5 APAVSTQPPSMHSSLLVTPPQPPPRAGPPKDISSDAFTALDPLGDKEIKDVKEMFKDFQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 APAVSTQPPSMHSSLLVTPPQPPPRAGPPKDISSDAFTALDPLGDKEIKDVKEMFKDFQL
     580       590       600       610       620       630

              670       680       690       700       710       720
pF1KB5 RQPPAVPARKGEQTSSGTLSAFASYFNSKVGIPQENADHDDFDANQLLNKINEPPKPAPR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 RQPPAVPARKGEQTSSGTLSAFASYFNSKVGIPQENADHDDFDANQLLNKINEPPKPAPR
     640       650       660       670       680       690

              730       740       750       760       770
pF1KB5 QVSLPVTKSTDNAFENPFFKDSFGSSQASVASSQPVSSEMYRDPFGNPFA
       ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 QVSLPVTKSTDNAFENPFFKDSFGSSQASVASSQPVSSEMYRDPFGNPFA
     700       710       720       730       740

>>CCDS607.1 DAB1 gene_id:1600|Hs108|chr1 (555 aa)
 initn: 1062 init1: 701 opt: 746 Z-score: 416.0 bits: 87.3 E(32554): 7.5e-17
Smith-Waterman score: 839; 32.9% identity (51.7% similar) in 773 aa overlap (13-768:4-540)

               10        20         30        40        50
pF1KB5 MSNEVETSATNGQPDQQAAPKAPSKKE-KKKGPEKTDEYLLARFKGDGVKYKAKLIGIDD
                   . . :.: :. .::. .::: ....  :. ::::.::.:::::::::.
CCDS60 MSTETELQVAVKTSAKKDSRKKGQDRSEATLIKRFKGEGVRYKAKLIGIDE 10 20 30 40 50 60 70 80 90 100 110 pF1KB5 VPDARGDKMSQDSMMKLKGMAAAGRSQGQHKQRIWVNISLSGIKIIDEKTGVIEHEHPVN : :::::. :::::::::..:..::.:.:::.:...::..::::.:::::...:.: :. CCDS60 VSAARGDKLCQDSMMKLKGVVAGARSKGEHKQKIFLTISFGGIKIFDEKTGALQHHHAVH 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB5 KISFIARDVTDNRAFGYVCGGEGQHQFFAIKTGQQAEPLVVDLKDLFQVIYNVKKKEEEK .::.::.:.::.:::::::: ::.:.: ::::.: :::...::.::::.::..:..:: . CCDS60 EISYIAKDITDHRAFGYVCGKEGNHRFVAIKTAQAAEPVILDLRDLFQLIYELKQREELE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB5 KKIEEASKAVENGSEALMILDDQTNKLKSGVDQMDLFGDMSTPPDLNSPTESKDILLVDL :: .. : : .. .: : : :. CCDS60 KKAQK---------------DKQCEQ---AVYQTILEEDV-------------------- 180 190 240 250 260 270 280 290 pF1KB5 NSEIDTNQNSLRENPFLTNGITSCSLPRPTPQASFLPENAFSANLNFFPTPNPDPFRDDP :.: . : :. .:.:: : CCDS60 ------------EDPVYQYIV-------------------FEAG--------HEPIRD-P 200 210 300 310 320 330 340 350 pF1KB5 FTQPD-QSTPSSFDSLKSPDQKKENSSSSSTPLSNGPLNGDVDYFGQQFDQISNRTGKQE :. . ..:.: :::: : : CCDS60 ETEENIYQVPTS--------QKKE---------------GVYDV---------------- 220 230 360 370 380 390 400 410 pF1KB5 AQAGPWPFSSSQTQPAVRTQNGVSEREQNGFSVKSSPNPFVGSPPKGLSIQNGVKQDLES : ..::. ..:.. : :. :.: : . ::: CCDS60 ----P------KSQPV----SAVTQLEL--FGDMSTP-PDITSPP--------------- 240 250 260 420 430 440 450 460 470 pF1KB5 SVQSSPHDSIAIIPPPQSTKPGRGRRTAKSSANDLLASDIFAPPVSEPSGQASPTGQPTA . ..: : :.:: ..: :. ..:.:. : : : CCDS60 -TPATPGD--AFIPSSSQTLPA--------------SADVFS---SVPFG---------- 270 280 290 480 490 500 510 520 530 pF1KB5 LQPNPLDLFKTSAPAPVGPLVGLGGVTVTLPQAGPWNTASLVFNQSPSMAPGAMMGGQPS .: .: : :..:.: ::. :. :.: . .::.:: CCDS60 -----------TAAVPSG-YVAMGAV---LPSF--WG-------QQPLVQQQMVMGAQPP 300 310 320 540 550 560 570 580 590 pF1KB5 GFSQPVIFGTSPAVSGWNQPSPFAASTPPPVPVVWGPSASVAPNAWSTTSPLGNPFQSNI .: :. :..: . :.::. : : : : :.: : . : :. : :. . 
CCDS60 -VAQ-VMPGAQPIA--WGQPGLFPA-TQQPWPTVAG---QFPPAAFM-------PTQT-V 330 340 350 360 370 600 610 620 630 640 650 pF1KB5 FPAPAVSTQPPSMHSSLLVTPPQPPPRAGPPKDISSDAFTALDPLGDKEIKDV-KEMFKD .: ::. : : .:: : : .::. : .: :: . . :: ::: CCDS60 MPLPAAMFQGP-------LTPLATVP--G-----TSDS-TRSSPQTDKPRQKMGKETFKD 380 390 400 410 660 670 680 690 700 710 pF1KB5 FQLRQPPAVPARKGEQTS-SGTLSAFASYFNSKVGIPQENADHDDFDANQL-LNKI---- ::. ::: ::.:: .: : . : ::.:::: :::. :.. : :::: .:: :. . CCDS60 FQMAQPPPVPSRKPDQPSLTCTSEAFSSYFN-KVGVAQDTDDCDDFDISQLNLTPVTSTT 420 430 440 450 460 470 720 730 740 750 760 pF1KB5 ---NEPPKPAPRQVSLPVTKSTDNAFE---NPFFKDSFGSSQASVASSQPVSSEMYR--D : :: ::::: : : .:...: . . .:...: : . : . : .:. : CCDS60 PSTNSPPTPAPRQSS-PSKSSASHASDPTTDDIFEEGFESPSKSEEQEAPDGSQASSNSD 480 490 500 510 520 530 770 pF1KB5 PFGNPFA :::.: CCDS60 PFGEPSGEPSGDNISPQAGS 540 550 770 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 02:33:30 2016 done: Mon Nov 7 02:33:31 2016 Total Scan time: 3.100 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]