FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5554, 429 aa 1>>>pF1KE5554 429 - 429 aa - 429 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.0757+/-0.000309; mu= 6.3232+/- 0.019 mean_var=166.8273+/-34.413, 0's: 0 Z-trim(121.7): 18 B-trim: 2168 in 1/57 Lambda= 0.099298 statistics sampled from 38671 (38696) to 38671 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.774), E-opt: 0.2 (0.454), width: 16 Scan time: 9.160 The best scores are: opt bits E(85289) NP_110403 (OMIM: 606750) Z-DNA-binding protein 1 i ( 429) 2934 431.9 1.6e-120 NP_001153889 (OMIM: 606750) Z-DNA-binding protein ( 428) 2916 429.3 9.3e-120 XP_011527359 (OMIM: 606750) PREDICTED: Z-DNA-bindi ( 405) 2483 367.2 4.2e-101 NP_001153890 (OMIM: 606750) Z-DNA-binding protein ( 354) 2362 349.9 6.2e-96 XP_011527360 (OMIM: 606750) PREDICTED: Z-DNA-bindi ( 357) 1992 296.9 5.7e-80 NP_001153891 (OMIM: 606750) Z-DNA-binding protein ( 248) 1507 227.3 3.5e-59 XP_016883575 (OMIM: 606750) PREDICTED: Z-DNA-bindi ( 245) 1134 173.8 4.2e-43 NP_001310895 (OMIM: 606750) Z-DNA-binding protein ( 173) 935 145.2 1.2e-34 >>NP_110403 (OMIM: 606750) Z-DNA-binding protein 1 isofo (429 aa) initn: 2934 init1: 2934 opt: 2934 Z-score: 2284.9 bits: 431.9 E(85289): 1.6e-120 Smith-Waterman score: 2934; 99.5% identity (100.0% similar) in 429 aa overlap (1-429:1-429) 10 20 30 40 50 60 pF1KE5 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_110 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 TSPATWCLGGTDPEGEGPAELALSSPAKRPQQHAATIPETPGPQFSQQREEDIYRFLKDN :::::::::::::::::::::::::::.:::::::::::::::::::::::::::::::: NP_110 
TSPATWCLGGTDPEGEGPAELALSSPAERPQQHAATIPETPGPQFSQQREEDIYRFLKDN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKSASII :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_110 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKSASII 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 YQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAPGDSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_110 YQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAPGDSS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 TWGTLVDPWGPQDIHMERSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVSATAAGPE :::::::::::::::::.:::::::::::::::::::::::::::::::::::::::::: NP_110 TWGTLVDPWGPQDIHMEQSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVSATAAGPE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 ASFEARIPSPGTHPEGEAAQRIHMKSCFLEDATIGNSNKMSISPGVAGPGGVAGSGEGEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_110 ASFEARIPSPGTHPEGEAAQRIHMKSCFLEDATIGNSNKMSISPGVAGPGGVAGSGEGEP 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 GEDAGRRPADTQSRSHFPRDIGQPITPSHSKLTPKLETMTLGNRSHKAAEGSHYVDEASH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_110 GEDAGRRPADTQSRSHFPRDIGQPITPSHSKLTPKLETMTLGNRSHKAAEGSHYVDEASH 370 380 390 400 410 420 pF1KE5 EGSWWGGGI ::::::::: NP_110 EGSWWGGGI >>NP_001153889 (OMIM: 606750) Z-DNA-binding protein 1 is (428 aa) initn: 1811 init1: 1811 opt: 2916 Z-score: 2271.0 bits: 429.3 E(85289): 9.3e-120 Smith-Waterman score: 2916; 99.3% identity (99.8% similar) in 429 aa overlap (1-429:1-428) 10 20 30 40 50 60 pF1KE5 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 TSPATWCLGGTDPEGEGPAELALSSPAKRPQQHAATIPETPGPQFSQQREEDIYRFLKDN :::::::::::::::::::::::::::.:::::::::::::::::::::::::::::::: NP_001 
TSPATWCLGGTDPEGEGPAELALSSPAERPQQHAATIPETPGPQFSQQREEDIYRFLKDN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKSASII ::::::::::::::::::::::::::::::::::::::::::::::: :::::::::::: NP_001 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRP-DSGRRAKSASII 130 140 150 160 170 190 200 210 220 230 240 pF1KE5 YQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAPGDSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAPGDSS 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE5 TWGTLVDPWGPQDIHMERSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVSATAAGPE :::::::::::::::::.:::::::::::::::::::::::::::::::::::::::::: NP_001 TWGTLVDPWGPQDIHMEQSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVSATAAGPE 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE5 ASFEARIPSPGTHPEGEAAQRIHMKSCFLEDATIGNSNKMSISPGVAGPGGVAGSGEGEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 ASFEARIPSPGTHPEGEAAQRIHMKSCFLEDATIGNSNKMSISPGVAGPGGVAGSGEGEP 300 310 320 330 340 350 370 380 390 400 410 420 pF1KE5 GEDAGRRPADTQSRSHFPRDIGQPITPSHSKLTPKLETMTLGNRSHKAAEGSHYVDEASH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GEDAGRRPADTQSRSHFPRDIGQPITPSHSKLTPKLETMTLGNRSHKAAEGSHYVDEASH 360 370 380 390 400 410 pF1KE5 EGSWWGGGI ::::::::: NP_001 EGSWWGGGI 420 >>XP_011527359 (OMIM: 606750) PREDICTED: Z-DNA-binding p (405 aa) initn: 2483 init1: 2483 opt: 2483 Z-score: 1936.1 bits: 367.2 E(85289): 4.2e-101 Smith-Waterman score: 2487; 93.0% identity (94.3% similar) in 402 aa overlap (1-391:1-402) 10 20 30 40 50 60 pF1KE5 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 TSPATWCLGGTDPEGEGPAELALSSPAKRPQQHAATIPETPGPQFSQQREEDIYRFLKDN :::::::::::::::::::::::::::.:::::::::::::::::::::::::::::::: XP_011 
TSPATWCLGGTDPEGEGPAELALSSPAERPQQHAATIPETPGPQFSQQREEDIYRFLKDN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKSASII :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKSASII 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 YQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAPGDSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 YQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAPGDSS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 TWGTLVDPWGPQDIHMERSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVSATAAGPE :::::::::::::::::.:::::::::::::::::::::::::::::::::::::::::: XP_011 TWGTLVDPWGPQDIHMEQSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVSATAAGPE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 ASFEARIPSPGTHPEGEAAQRIHMKSCFLEDATIGNSNKMSISPGVAGPGGVAGSGEGEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 ASFEARIPSPGTHPEGEAAQRIHMKSCFLEDATIGNSNKMSISPGVAGPGGVAGSGEGEP 310 320 330 340 350 360 370 380 390 400 pF1KE5 GEDAGRRPAD-------TQSRSHFPRDIGQPI----TPSHSKLTPKLETMTLGNRSHKAA ::::: : : .:: : : .. 
: :: :: XP_011 GEDAGPLGAPCTCDTCRTGTRSAFLRTQAMSIWEHGLPSSSKGSL 370 380 390 400 410 420 pF1KE5 EGSHYVDEASHEGSWWGGGI >>NP_001153890 (OMIM: 606750) Z-DNA-binding protein 1 is (354 aa) initn: 2362 init1: 2362 opt: 2362 Z-score: 1843.3 bits: 349.9 E(85289): 6.2e-96 Smith-Waterman score: 2362; 99.4% identity (100.0% similar) in 343 aa overlap (87-429:12-354) 60 70 80 90 100 110 pF1KE5 KVSLTSPATWCLGGTDPEGEGPAELALSSPAKRPQQHAATIPETPGPQFSQQREEDIYRF :.:::::::::::::::::::::::::::: NP_001 MAQAPADPGREAERPQQHAATIPETPGPQFSQQREEDIYRF 10 20 30 40 120 130 140 150 160 170 pF1KE5 LKDNGPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LKDNGPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKS 50 60 70 80 90 100 180 190 200 210 220 230 pF1KE5 ASIIYQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 ASIIYQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAP 110 120 130 140 150 160 240 250 260 270 280 290 pF1KE5 GDSSTWGTLVDPWGPQDIHMERSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVSATA :::::::::::::::::::::.:::::::::::::::::::::::::::::::::::::: NP_001 GDSSTWGTLVDPWGPQDIHMEQSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVSATA 170 180 190 200 210 220 300 310 320 330 340 350 pF1KE5 AGPEASFEARIPSPGTHPEGEAAQRIHMKSCFLEDATIGNSNKMSISPGVAGPGGVAGSG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AGPEASFEARIPSPGTHPEGEAAQRIHMKSCFLEDATIGNSNKMSISPGVAGPGGVAGSG 230 240 250 260 270 280 360 370 380 390 400 410 pF1KE5 EGEPGEDAGRRPADTQSRSHFPRDIGQPITPSHSKLTPKLETMTLGNRSHKAAEGSHYVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 EGEPGEDAGRRPADTQSRSHFPRDIGQPITPSHSKLTPKLETMTLGNRSHKAAEGSHYVD 290 300 310 320 330 340 420 pF1KE5 EASHEGSWWGGGI ::::::::::::: NP_001 EASHEGSWWGGGI 350 >>XP_011527360 (OMIM: 606750) PREDICTED: Z-DNA-binding p (357 aa) initn: 1990 init1: 1990 opt: 1992 Z-score: 1556.8 bits: 296.9 E(85289): 5.7e-80 Smith-Waterman score: 1992; 96.4% 
identity (98.0% similar) in 304 aa overlap (1-304:1-303) 10 20 30 40 50 60 pF1KE5 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 TSPATWCLGGTDPEGEGPAELALSSPAKRPQQHAATIPETPGPQFSQQREEDIYRFLKDN :::::::::::::::::::::::::::.:::::::::::::::::::::::::::::::: XP_011 TSPATWCLGGTDPEGEGPAELALSSPAERPQQHAATIPETPGPQFSQQREEDIYRFLKDN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKSASII :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKSASII 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 YQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAPGDSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 YQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAPGDSS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 TWGTLVDPWGPQDIHMERSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVSATAAGPE :::::::::::::::::.:::::::::::::::::::::::::::::::::: . : . XP_011 TWGTLVDPWGPQDIHMEQSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVFQCS-GKQ 250 260 270 280 290 310 320 330 340 350 360 pF1KE5 ASFEARIPSPGTHPEGEAAQRIHMKSCFLEDATIGNSNKMSISPGVAGPGGVAGSGEGEP :. 
: XP_011 AAEERGAESQRKTLTSRSAGRDQWGRPTAGSVEQKDASQTAQQHERSLCHCCRPRSFV 300 310 320 330 340 350 >>NP_001153891 (OMIM: 606750) Z-DNA-binding protein 1 is (248 aa) initn: 1532 init1: 1503 opt: 1507 Z-score: 1183.5 bits: 227.3 E(85289): 3.5e-59 Smith-Waterman score: 1507; 92.0% identity (94.4% similar) in 250 aa overlap (1-250:1-244) 10 20 30 40 50 60 pF1KE5 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 TSPATWCLGGTDPEGEGPAELALSSPAKRPQQHAATIPETPGPQFSQQREEDIYRFLKDN :::::::::::::::::::::::::::.:::::::::::::::::::::::::::::::: NP_001 TSPATWCLGGTDPEGEGPAELALSSPAERPQQHAATIPETPGPQFSQQREEDIYRFLKDN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKSASII :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKSASII 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 YQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAPGDSS ::::::::::::::::::::::::::::::::::::::::::::.. :. : : .. 
NP_001 YQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGKS-----PKRAQG-GD 190 200 210 220 230 250 260 270 280 290 300 pF1KE5 TWGTLVDPWGPQDIHMERSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVSATAAGPE : :: : NP_001 LGGEPPDPLGGGKG 240 >>XP_016883575 (OMIM: 606750) PREDICTED: Z-DNA-binding p (245 aa) initn: 1157 init1: 1134 opt: 1134 Z-score: 894.8 bits: 173.8 E(85289): 4.2e-43 Smith-Waterman score: 1134; 98.8% identity (99.4% similar) in 171 aa overlap (1-171:1-171) 10 20 30 40 50 60 pF1KE5 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 MAQAPADPGREGHLEQRILQVLTEAGSPVKLAQLVKECQAPKRELNQVLYRMKKELKVSL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 TSPATWCLGGTDPEGEGPAELALSSPAKRPQQHAATIPETPGPQFSQQREEDIYRFLKDN :::::::::::::::::::::::::::.:::::::::::::::::::::::::::::::: XP_016 TSPATWCLGGTDPEGEGPAELALSSPAERPQQHAATIPETPGPQFSQQREEDIYRFLKDN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKSASII :::::::::::::::::::::::::::::::::::::::::::::::: :: XP_016 GPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEASGGPTDNSLQG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 YQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAPGDSS XP_016 LPGGRGVLARLTRDSFLEAWPKVCSCRRPVNGTEVAHSQCSQLWGAGISLSLGMVFFEHF 190 200 210 220 230 240 >>NP_001310895 (OMIM: 606750) Z-DNA-binding protein 1 is (173 aa) initn: 1040 init1: 925 opt: 935 Z-score: 742.9 bits: 145.2 E(85289): 1.2e-34 Smith-Waterman score: 935; 87.8% identity (91.5% similar) in 164 aa overlap (87-250:12-169) 60 70 80 90 100 110 pF1KE5 KVSLTSPATWCLGGTDPEGEGPAELALSSPAKRPQQHAATIPETPGPQFSQQREEDIYRF :.:::::::::::::::::::::::::::: NP_001 MAQAPADPGREAERPQQHAATIPETPGPQFSQQREEDIYRF 10 20 30 40 120 130 140 150 160 170 pF1KE5 LKDNGPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LKDNGPQRALVIAQALGMRTAKDVNRDLYRMKSRHLLDMDEQSKAWTIYRPEDSGRRAKS 50 60 70 80 90 100 180 
190 200 210 220 230 pF1KE5 ASIIYQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGSAGPRHLPSMAP ::::::::::::::::::::::::::::::::::::::::::::::::.. :. : NP_001 ASIIYQHNPINMICQNGPNSWISIANSEAIQIGHGNIITRQTVSREDGKS-----PKRAQ 110 120 130 140 150 240 250 260 270 280 290 pF1KE5 GDSSTWGTLVDPWGPQDIHMERSILRRVQLGHSNEMRLHGVPSEGPAHIPPGSPPVSATA : .. : :: : NP_001 G-GDLGGEPPDPLGGGKG 160 170 429 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 01:46:29 2016 done: Tue Nov 8 01:46:30 2016 Total Scan time: 9.160 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]
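The "best scores" summary rows in the report above each carry an accession, a truncated description, the library sequence length in parentheses, and the opt, bits, and E() columns. A minimal sketch of how one such row could be read programmatically follows. The names `Hit`, `HIT_RE`, and `parse_hit` are hypothetical and not part of FASTA, and the pattern assumes each row is supplied on its own line in the layout shown in this report.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'best scores' table (hypothetical container)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed row layout: accession, free-text description, "( length)",
# then the opt, bits, and E() columns separated by whitespace.
HIT_RE = re.compile(
    r"^(\S+)\s+(.*?)\s*\(\s*(\d+)\)\s+(\d+)\s+([\d.]+)\s+(\S+)\s*$"
)


def parse_hit(line: str) -> Optional[Hit]:
    """Return a Hit for a summary row, or None if the line does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    acc, desc, length, opt, bits, evalue = m.groups()
    try:
        return Hit(acc, desc, int(length), int(opt), float(bits), float(evalue))
    except ValueError:
        # A numeric column did not parse; treat the line as a non-row.
        return None


if __name__ == "__main__":
    row = ("NP_110403 (OMIM: 606750) Z-DNA-binding protein 1 i "
           "( 429) 2934 431.9 1.6e-120")
    print(parse_hit(row))
```

The description group is non-greedy so that the parenthesised OMIM identifier stays in the description, while only a parenthesis containing nothing but digits is taken as the sequence length.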