FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2382, 1328 aa 1>>>pF1KE2382 1328 - 1328 aa - 1328 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.3424+/-0.00115; mu= 0.8131+/- 0.069 mean_var=259.5417+/-51.373, 0's: 0 Z-trim(110.9): 6 B-trim: 7 in 1/51 Lambda= 0.079611 statistics sampled from 11972 (11974) to 11972 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.693), E-opt: 0.2 (0.368), width: 16 Scan time: 6.400 The best scores are: opt bits E(32554) CCDS11046.1 MYBBP1A gene_id:10514|Hs108|chr17 (1328) 8677 1011.0 0 CCDS42238.1 MYBBP1A gene_id:10514|Hs108|chr17 (1332) 8630 1005.6 0 >>CCDS11046.1 MYBBP1A gene_id:10514|Hs108|chr17 (1328 aa) initn: 8677 init1: 8677 opt: 8677 Z-score: 5397.1 bits: 1011.0 E(32554): 0 Smith-Waterman score: 8677; 100.0% identity (100.0% similar) in 1328 aa overlap (1-1328:1-1328) 10 20 30 40 50 60 pF1KE2 MESRDPAQPMSPGEATQSGARPADRYGLLKHSREFLDFFWDIAKPEQETRLAATEKLLEY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MESRDPAQPMSPGEATQSGARPADRYGLLKHSREFLDFFWDIAKPEQETRLAATEKLLEY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 LRGRPKGSEMKYALKRLITGLGVGRETARPCYSLALAQLLQSFEDLPLCSILQQIQEKYD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LRGRPKGSEMKYALKRLITGLGVGRETARPCYSLALAQLLQSFEDLPLCSILQQIQEKYD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 LHQVKKAMLRPALFANLFGVLALFQSGRLVKDQEALMKSVKLLQALAQYQNHLQEQPRKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LHQVKKAMLRPALFANLFGVLALFQSGRLVKDQEALMKSVKLLQALAQYQNHLQEQPRKA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 LVDILSEVSKATLQEILPEVLKADLNIILSSPEQLELFLLAQQKVPSKLKKLVGSVNLFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LVDILSEVSKATLQEILPEVLKADLNIILSSPEQLELFLLAQQKVPSKLKKLVGSVNLFS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 DENVPRLVNVLKMAASSVKKDRKLPAIALDLLRLALKEDKFPRFWKEVVEQGLLKMQFWP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 DENVPRLVNVLKMAASSVKKDRKLPAIALDLLRLALKEDKFPRFWKEVVEQGLLKMQFWP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 ASYLCFRLLGAALPLLTKEQLHLVMQGDVIRHYGEHVCTAKLPKQFKFAPEMDDYVGTFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 ASYLCFRLLGAALPLLTKEQLHLVMQGDVIRHYGEHVCTAKLPKQFKFAPEMDDYVGTFL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 EGCQDDPERQLAVLVAFSSVTNQGLPVTPTFWRVVRFLSPPALQGYVAWLRAMFLQPDLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 EGCQDDPERQLAVLVAFSSVTNQGLPVTPTFWRVVRFLSPPALQGYVAWLRAMFLQPDLD 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE2 SLVDFSTNNQKKAQDSSLHMPERAVFRLRKWIIFRLVSIVDSLHLEMEEALTEQVARFCL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 SLVDFSTNNQKKAQDSSLHMPERAVFRLRKWIIFRLVSIVDSLHLEMEEALTEQVARFCL 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE2 FHSFFVTKKPTSQIPETKHPFSFPLENQAREAVSSAFFSLLQTLSTQFKQAPGQTQGGQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 FHSFFVTKKPTSQIPETKHPFSFPLENQAREAVSSAFFSLLQTLSTQFKQAPGQTQGGQP 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE2 WTYHLVQFADLLLNHSHNVTTVTPFTAQQRQAWDRMLQTLKELEAHSAEARAAAFQHLLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 WTYHLVQFADLLLNHSHNVTTVTPFTAQQRQAWDRMLQTLKELEAHSAEARAAAFQHLLL 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE2 LVGIHLLKSPAESCDLLGDIQTCIRKSLGEKPRRSRTKTIDPQEPPWVEVLVEILLALLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LVGIHLLKSPAESCDLLGDIQTCIRKSLGEKPRRSRTKTIDPQEPPWVEVLVEILLALLA 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE2 QPSHLMRQVARSVFGHICSHLTPRALQLILDVLNPETSEDENDRVVVTDDSDERRLKGAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QPSHLMRQVARSVFGHICSHLTPRALQLILDVLNPETSEDENDRVVVTDDSDERRLKGAE 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE2 DKSEEGEDNRSSESEEESEGEESEEEERDGDVDQGFREQLMTVLQAGKALGGEDSENEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 DKSEEGEDNRSSESEEESEGEESEEEERDGDVDQGFREQLMTVLQAGKALGGEDSENEEE 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE2 LGDEAMMALDQSLASLFAEQKLRIQARRDEKNKLQKEKALRRDFQIRVLDLVEVLVTKQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LGDEAMMALDQSLASLFAEQKLRIQARRDEKNKLQKEKALRRDFQIRVLDLVEVLVTKQP 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE2 ENALVLELLEPLLSIIRRSLRSSSSKQEQDLLHKTARIFTHHLCRARRYCHDLGERAGAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 ENALVLELLEPLLSIIRRSLRSSSSKQEQDLLHKTARIFTHHLCRARRYCHDLGERAGAL 850 860 870 880 890 900 910 920 930 940 950 960 pF1KE2 HAQVERLVQQAGRQPDSPTALYHFNASLYLLRVLKGNTAEGCVHETQEKQKAGTDPSHMP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 HAQVERLVQQAGRQPDSPTALYHFNASLYLLRVLKGNTAEGCVHETQEKQKAGTDPSHMP 910 920 930 940 950 960 970 980 990 1000 1010 1020 pF1KE2 TGPQAASCLDLNLVTRVYSTALSSFLTKRNSPLTVPMFLSLFSRHPVLCQSLLPILVQHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 TGPQAASCLDLNLVTRVYSTALSSFLTKRNSPLTVPMFLSLFSRHPVLCQSLLPILVQHI 970 980 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 pF1KE2 TGPVRPRHQACLLLQKTLSMREVRSCFEDPEWKQLMGQVLAKVTENLRVLGEAQTKAQHQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 TGPVRPRHQACLLLQKTLSMREVRSCFEDPEWKQLMGQVLAKVTENLRVLGEAQTKAQHQ 1030 1040 1050 1060 1070 1080 1090 1100 1110 1120 1130 1140 pF1KE2 QALSSLELLNVLFRTCKHEKLTLDLTVLLGVLQGQQQSLQQGAHSTGSSRLHDLYWQAMK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QALSSLELLNVLFRTCKHEKLTLDLTVLLGVLQGQQQSLQQGAHSTGSSRLHDLYWQAMK 1090 1100 1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 pF1KE2 TLGVQRPKLEKKDAKEIPSATQSPISKKRKKKGFLPETKKRKKRKSEDGTPAEDGTPAAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 TLGVQRPKLEKKDAKEIPSATQSPISKKRKKKGFLPETKKRKKRKSEDGTPAEDGTPAAT 1150 1160 1170 1180 1190 1200 1210 1220 1230 1240 1250 1260 pF1KE2 GGSQPPSMGRKKRNRTKAKVPAQANGTPTTKSPAPGAPTRSPSTPAKSPKLQKKNQKPSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GGSQPPSMGRKKRNRTKAKVPAQANGTPTTKSPAPGAPTRSPSTPAKSPKLQKKNQKPSQ 1210 1220 1230 1240 1250 1260 1270 1280 1290 1300 1310 1320 pF1KE2 VNGAPGSPTEPAGQKQHQKALPKKGVLGKSPLSALARKKARLSLVIRSPSLLQSGAKKKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 VNGAPGSPTEPAGQKQHQKALPKKGVLGKSPLSALARKKARLSLVIRSPSLLQSGAKKKA 1270 1280 1290 1300 1310 1320 pF1KE2 QVRKAGKP :::::::: CCDS11 QVRKAGKP >>CCDS42238.1 MYBBP1A gene_id:10514|Hs108|chr17 (1332 aa) initn: 8630 init1: 8630 opt: 8630 Z-score: 5367.9 bits: 1005.6 E(32554): 0 Smith-Waterman score: 8630; 100.0% identity (100.0% similar) in 1321 aa overlap (1-1321:1-1321) 10 20 30 40 50 60 pF1KE2 MESRDPAQPMSPGEATQSGARPADRYGLLKHSREFLDFFWDIAKPEQETRLAATEKLLEY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MESRDPAQPMSPGEATQSGARPADRYGLLKHSREFLDFFWDIAKPEQETRLAATEKLLEY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 LRGRPKGSEMKYALKRLITGLGVGRETARPCYSLALAQLLQSFEDLPLCSILQQIQEKYD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LRGRPKGSEMKYALKRLITGLGVGRETARPCYSLALAQLLQSFEDLPLCSILQQIQEKYD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 LHQVKKAMLRPALFANLFGVLALFQSGRLVKDQEALMKSVKLLQALAQYQNHLQEQPRKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LHQVKKAMLRPALFANLFGVLALFQSGRLVKDQEALMKSVKLLQALAQYQNHLQEQPRKA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 LVDILSEVSKATLQEILPEVLKADLNIILSSPEQLELFLLAQQKVPSKLKKLVGSVNLFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LVDILSEVSKATLQEILPEVLKADLNIILSSPEQLELFLLAQQKVPSKLKKLVGSVNLFS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 DENVPRLVNVLKMAASSVKKDRKLPAIALDLLRLALKEDKFPRFWKEVVEQGLLKMQFWP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 DENVPRLVNVLKMAASSVKKDRKLPAIALDLLRLALKEDKFPRFWKEVVEQGLLKMQFWP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 ASYLCFRLLGAALPLLTKEQLHLVMQGDVIRHYGEHVCTAKLPKQFKFAPEMDDYVGTFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 ASYLCFRLLGAALPLLTKEQLHLVMQGDVIRHYGEHVCTAKLPKQFKFAPEMDDYVGTFL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 EGCQDDPERQLAVLVAFSSVTNQGLPVTPTFWRVVRFLSPPALQGYVAWLRAMFLQPDLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 EGCQDDPERQLAVLVAFSSVTNQGLPVTPTFWRVVRFLSPPALQGYVAWLRAMFLQPDLD 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE2 SLVDFSTNNQKKAQDSSLHMPERAVFRLRKWIIFRLVSIVDSLHLEMEEALTEQVARFCL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SLVDFSTNNQKKAQDSSLHMPERAVFRLRKWIIFRLVSIVDSLHLEMEEALTEQVARFCL 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE2 FHSFFVTKKPTSQIPETKHPFSFPLENQAREAVSSAFFSLLQTLSTQFKQAPGQTQGGQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 FHSFFVTKKPTSQIPETKHPFSFPLENQAREAVSSAFFSLLQTLSTQFKQAPGQTQGGQP 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE2 WTYHLVQFADLLLNHSHNVTTVTPFTAQQRQAWDRMLQTLKELEAHSAEARAAAFQHLLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 WTYHLVQFADLLLNHSHNVTTVTPFTAQQRQAWDRMLQTLKELEAHSAEARAAAFQHLLL 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE2 LVGIHLLKSPAESCDLLGDIQTCIRKSLGEKPRRSRTKTIDPQEPPWVEVLVEILLALLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LVGIHLLKSPAESCDLLGDIQTCIRKSLGEKPRRSRTKTIDPQEPPWVEVLVEILLALLA 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE2 QPSHLMRQVARSVFGHICSHLTPRALQLILDVLNPETSEDENDRVVVTDDSDERRLKGAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 QPSHLMRQVARSVFGHICSHLTPRALQLILDVLNPETSEDENDRVVVTDDSDERRLKGAE 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE2 DKSEEGEDNRSSESEEESEGEESEEEERDGDVDQGFREQLMTVLQAGKALGGEDSENEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 DKSEEGEDNRSSESEEESEGEESEEEERDGDVDQGFREQLMTVLQAGKALGGEDSENEEE 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE2 LGDEAMMALDQSLASLFAEQKLRIQARRDEKNKLQKEKALRRDFQIRVLDLVEVLVTKQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LGDEAMMALDQSLASLFAEQKLRIQARRDEKNKLQKEKALRRDFQIRVLDLVEVLVTKQP 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE2 ENALVLELLEPLLSIIRRSLRSSSSKQEQDLLHKTARIFTHHLCRARRYCHDLGERAGAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 ENALVLELLEPLLSIIRRSLRSSSSKQEQDLLHKTARIFTHHLCRARRYCHDLGERAGAL 850 860 870 880 890 900 910 920 930 940 950 960 pF1KE2 HAQVERLVQQAGRQPDSPTALYHFNASLYLLRVLKGNTAEGCVHETQEKQKAGTDPSHMP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 HAQVERLVQQAGRQPDSPTALYHFNASLYLLRVLKGNTAEGCVHETQEKQKAGTDPSHMP 910 920 930 940 950 960 970 980 990 1000 1010 1020 pF1KE2 TGPQAASCLDLNLVTRVYSTALSSFLTKRNSPLTVPMFLSLFSRHPVLCQSLLPILVQHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 TGPQAASCLDLNLVTRVYSTALSSFLTKRNSPLTVPMFLSLFSRHPVLCQSLLPILVQHI 970 980 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 pF1KE2 TGPVRPRHQACLLLQKTLSMREVRSCFEDPEWKQLMGQVLAKVTENLRVLGEAQTKAQHQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 TGPVRPRHQACLLLQKTLSMREVRSCFEDPEWKQLMGQVLAKVTENLRVLGEAQTKAQHQ 1030 1040 1050 1060 1070 1080 1090 1100 1110 1120 1130 1140 pF1KE2 QALSSLELLNVLFRTCKHEKLTLDLTVLLGVLQGQQQSLQQGAHSTGSSRLHDLYWQAMK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 QALSSLELLNVLFRTCKHEKLTLDLTVLLGVLQGQQQSLQQGAHSTGSSRLHDLYWQAMK 1090 1100 1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 pF1KE2 TLGVQRPKLEKKDAKEIPSATQSPISKKRKKKGFLPETKKRKKRKSEDGTPAEDGTPAAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 TLGVQRPKLEKKDAKEIPSATQSPISKKRKKKGFLPETKKRKKRKSEDGTPAEDGTPAAT 1150 1160 1170 1180 1190 1200 1210 1220 1230 1240 1250 1260 pF1KE2 GGSQPPSMGRKKRNRTKAKVPAQANGTPTTKSPAPGAPTRSPSTPAKSPKLQKKNQKPSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GGSQPPSMGRKKRNRTKAKVPAQANGTPTTKSPAPGAPTRSPSTPAKSPKLQKKNQKPSQ 1210 1220 1230 1240 1250 1260 1270 1280 1290 1300 1310 1320 pF1KE2 VNGAPGSPTEPAGQKQHQKALPKKGVLGKSPLSALARKKARLSLVIRSPSLLQSGAKKKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 VNGAPGSPTEPAGQKQHQKALPKKGVLGKSPLSALARKKARLSLVIRSPSLLQSGAKKKA 1270 1280 1290 1300 1310 1320 pF1KE2 QVRKAGKP : CCDS42 QTLRFTISSSKK 1330 1328 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:00:15 2016 done: Sun Nov 6 14:00:16 2016 Total Scan time: 6.400 Total Display time: 0.100 Function used was FASTA [36.3.4 Apr, 2011]