FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7504, 238 aa 1>>>pF1KB7504 238 - 238 aa - 238 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.7500+/-0.00074; mu= 5.3744+/- 0.045 mean_var=186.2576+/-38.809, 0's: 0 Z-trim(116.2): 75 B-trim: 0 in 0/53 Lambda= 0.093976 statistics sampled from 16679 (16754) to 16679 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.812), E-opt: 0.2 (0.515), width: 16 Scan time: 2.650 The best scores are: opt bits E(32554) CCDS2428.1 FEV gene_id:54738|Hs108|chr2 ( 238) 1659 236.0 1.6e-62 CCDS59230.1 FLI1 gene_id:2313|Hs108|chr11 ( 259) 623 95.6 3.3e-20 CCDS59231.1 FLI1 gene_id:2313|Hs108|chr11 ( 386) 623 95.8 4.5e-20 CCDS53725.1 FLI1 gene_id:2313|Hs108|chr11 ( 419) 623 95.8 4.7e-20 CCDS44768.1 FLI1 gene_id:2313|Hs108|chr11 ( 452) 623 95.8 5e-20 CCDS58789.1 ERG gene_id:2078|Hs108|chr21 ( 363) 611 94.1 1.3e-19 CCDS46649.1 ERG gene_id:2078|Hs108|chr21 ( 387) 611 94.1 1.4e-19 CCDS82674.1 ERG gene_id:2078|Hs108|chr21 ( 455) 611 94.2 1.6e-19 CCDS13657.1 ERG gene_id:2078|Hs108|chr21 ( 462) 611 94.2 1.6e-19 CCDS13658.1 ERG gene_id:2078|Hs108|chr21 ( 479) 611 94.2 1.6e-19 CCDS46648.1 ERG gene_id:2078|Hs108|chr21 ( 486) 611 94.2 1.6e-19 CCDS12600.1 ERF gene_id:2077|Hs108|chr19 ( 548) 449 72.3 7.3e-13 CCDS53724.1 ETS1 gene_id:2113|Hs108|chr11 ( 225) 438 70.5 1.1e-12 CCDS81648.1 ETS1 gene_id:2113|Hs108|chr11 ( 354) 438 70.6 1.5e-12 CCDS8475.1 ETS1 gene_id:2113|Hs108|chr11 ( 441) 438 70.7 1.7e-12 CCDS44767.1 ETS1 gene_id:2113|Hs108|chr11 ( 485) 438 70.8 1.9e-12 CCDS13659.1 ETS2 gene_id:2114|Hs108|chr21 ( 469) 437 70.6 2e-12 CCDS1164.1 ETV3 gene_id:2117|Hs108|chr1 ( 143) 398 64.9 3.3e-11 CCDS13575.1 GABPA gene_id:2551|Hs108|chr21 ( 454) 406 66.4 3.6e-11 CCDS30893.1 ETV3L gene_id:440695|Hs108|chr1 ( 361) 398 65.2 6.5e-11 CCDS44250.1 ETV3 gene_id:2117|Hs108|chr1 ( 512) 398 65.4 8.4e-11 CCDS14283.1 ELK1 gene_id:2002|Hs108|chrX ( 428) 395 64.9 9.7e-11 >>CCDS2428.1 FEV gene_id:54738|Hs108|chr2 (238 aa) initn: 1659 init1: 1659 opt: 1659 Z-score: 1235.8 bits: 236.0 E(32554): 1.6e-62 Smith-Waterman score: 1659; 100.0% identity (100.0% similar) in 238 aa overlap (1-238:1-238) 10 20 30 40 50 60 pF1KB7 MRQSGASQPLLINMYLPDPVGDGLFKDGKNPSWGPLSPAVQKGSGQIQLWQFLLELLADR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 MRQSGASQPLLINMYLPDPVGDGLFKDGKNPSWGPLSPAVQKGSGQIQLWQFLLELLADR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 ANAGCIAWEGGHGEFKLTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 ANAGCIAWEGGHGEFKLTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 RYAYRFDFQGLAQACQPPPAHAHAAAAAAAAAAAAQDGALYKLPAGLAPLPFPGLSKLNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 RYAYRFDFQGLAQACQPPPAHAHAAAAAAAAAAAAQDGALYKLPAGLAPLPFPGLSKLNL 130 140 150 160 170 180 190 200 210 220 230 pF1KB7 MAASAGVAPAGFSYWPGPGPAATAAAATAALYPSPSLQPPPGPFGAVAAASHLGGHYH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 MAASAGVAPAGFSYWPGPGPAATAAAATAALYPSPSLQPPPGPFGAVAAASHLGGHYH 190 200 210 220 230 >>CCDS59230.1 FLI1 gene_id:2313|Hs108|chr11 (259 aa) initn: 665 init1: 605 opt: 623 Z-score: 476.2 bits: 95.6 E(32554): 3.3e-20 Smith-Waterman score: 654; 51.2% identity (71.0% similar) in 207 aa overlap (34-237:74-259) 10 20 30 40 50 60 pF1KB7 SGASQPLLINMYLPDPVGDGLFKDGKNPSWGPLSPAVQK-GSGQIQLWQFLLELLADRAN :: : . . :::::::::::::::.: :: CCDS59 GLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSAN 50 60 70 80 90 100 70 80 90 100 110 120 pF1KB7 AGCIAWEGGHGEFKLTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGKRY :.::.::: .::::.:::::::::::::::::::::::::::::::::::::.::::::: CCDS59 ASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRY 110 120 130 140 150 160 130 140 150 160 170 180 pF1KB7 AYRFDFQGLAQACQPPPAHAHAAAAAAAAAAAAQDGALYKLPAGLAPLP--FPGLSKLNL ::.:::.:.::: :: :. ....:: :. .. .: .:.:. CCDS59 AYKFDFHGIAQALQPHPT----------------ESSMYKYPSDISYMPSYHAHQQKVNF 170 180 190 200 190 200 210 220 230 pF1KB7 MAASAGVAPAGFSYWPGPGPAATAAAATAALYPSPSLQPPPGPFGAVAAASHLGGHYH . . :. : . : . . .. :...::.:.. :. . ::::..: CCDS59 VPPHPSSMPVTSSSFFG-AASQYWTSPTGGIYPNPNVPRHPNTH----VPSHLGSYY 210 220 230 240 250 >>CCDS59231.1 FLI1 gene_id:2313|Hs108|chr11 (386 aa) initn: 629 init1: 605 opt: 623 Z-score: 473.9 bits: 95.8 E(32554): 4.5e-20 Smith-Waterman score: 654; 51.2% identity (71.0% similar) in 207 aa overlap (34-237:201-386) 10 20 30 40 50 60 pF1KB7 SGASQPLLINMYLPDPVGDGLFKDGKNPSWGPLSPAVQK-GSGQIQLWQFLLELLADRAN :: : . . :::::::::::::::.: :: CCDS59 GLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSAN 180 190 200 210 220 230 70 80 90 100 110 120 pF1KB7 AGCIAWEGGHGEFKLTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGKRY :.::.::: .::::.:::::::::::::::::::::::::::::::::::::.::::::: CCDS59 ASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRY 240 250 260 270 280 290 130 140 150 160 170 180 pF1KB7 AYRFDFQGLAQACQPPPAHAHAAAAAAAAAAAAQDGALYKLPAGLAPLP--FPGLSKLNL ::.:::.:.::: :: :. ....:: :. .. .: .:.:. CCDS59 AYKFDFHGIAQALQPHPT----------------ESSMYKYPSDISYMPSYHAHQQKVNF 300 310 320 330 190 200 210 220 230 pF1KB7 MAASAGVAPAGFSYWPGPGPAATAAAATAALYPSPSLQPPPGPFGAVAAASHLGGHYH . . :. : . : . . .. :...::.:.. :. . ::::..: CCDS59 VPPHPSSMPVTSSSFFG-AASQYWTSPTGGIYPNPNVPRHPNTH----VPSHLGSYY 340 350 360 370 380 >>CCDS53725.1 FLI1 gene_id:2313|Hs108|chr11 (419 aa) initn: 629 init1: 605 opt: 623 Z-score: 473.4 bits: 95.8 E(32554): 4.7e-20 Smith-Waterman score: 654; 51.2% identity (71.0% similar) in 207 aa overlap (34-237:234-419) 10 20 30 40 50 60 pF1KB7 SGASQPLLINMYLPDPVGDGLFKDGKNPSWGPLSPAVQK-GSGQIQLWQFLLELLADRAN :: : . . :::::::::::::::.: :: CCDS53 GLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSAN 210 220 230 240 250 260 70 80 90 100 110 120 pF1KB7 AGCIAWEGGHGEFKLTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGKRY :.::.::: .::::.:::::::::::::::::::::::::::::::::::::.::::::: CCDS53 ASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRY 270 280 290 300 310 320 130 140 150 160 170 180 pF1KB7 AYRFDFQGLAQACQPPPAHAHAAAAAAAAAAAAQDGALYKLPAGLAPLP--FPGLSKLNL ::.:::.:.::: :: :. ....:: :. .. .: .:.:. CCDS53 AYKFDFHGIAQALQPHPT----------------ESSMYKYPSDISYMPSYHAHQQKVNF 330 340 350 360 190 200 210 220 230 pF1KB7 MAASAGVAPAGFSYWPGPGPAATAAAATAALYPSPSLQPPPGPFGAVAAASHLGGHYH . . :. : . : . . .. :...::.:.. :. . ::::..: CCDS53 VPPHPSSMPVTSSSFFG-AASQYWTSPTGGIYPNPNVPRHPNTH----VPSHLGSYY 370 380 390 400 410 >>CCDS44768.1 FLI1 gene_id:2313|Hs108|chr11 (452 aa) initn: 629 init1: 605 opt: 623 Z-score: 473.0 bits: 95.8 E(32554): 5e-20 Smith-Waterman score: 654; 51.2% identity (71.0% similar) in 207 aa overlap (34-237:267-452) 10 20 30 40 50 60 pF1KB7 SGASQPLLINMYLPDPVGDGLFKDGKNPSWGPLSPAVQK-GSGQIQLWQFLLELLADRAN :: : . . :::::::::::::::.: :: CCDS44 GLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSAN 240 250 260 270 280 290 70 80 90 100 110 120 pF1KB7 AGCIAWEGGHGEFKLTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGKRY :.::.::: .::::.:::::::::::::::::::::::::::::::::::::.::::::: CCDS44 ASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRY 300 310 320 330 340 350 130 140 150 160 170 180 pF1KB7 AYRFDFQGLAQACQPPPAHAHAAAAAAAAAAAAQDGALYKLPAGLAPLP--FPGLSKLNL ::.:::.:.::: :: :. ....:: :. .. .: .:.:. CCDS44 AYKFDFHGIAQALQPHPT----------------ESSMYKYPSDISYMPSYHAHQQKVNF 360 370 380 390 400 190 200 210 220 230 pF1KB7 MAASAGVAPAGFSYWPGPGPAATAAAATAALYPSPSLQPPPGPFGAVAAASHLGGHYH . . :. : . : . . .. :...::.:.. :. . ::::..: CCDS44 VPPHPSSMPVTSSSFFG-AASQYWTSPTGGIYPNPNVPRHPNTH----VPSHLGSYY 410 420 430 440 450 >>CCDS58789.1 ERG gene_id:2078|Hs108|chr21 (363 aa) initn: 637 init1: 596 opt: 611 Z-score: 465.5 bits: 94.1 E(32554): 1.3e-19 Smith-Waterman score: 640; 51.7% identity (68.7% similar) in 211 aa overlap (34-237:181-363) 10 20 30 40 50 60 pF1KB7 SGASQPLLINMYLPDPVGDGLFKDGKNPSWGPLSPAVQK-GSGQIQLWQFLLELLADRAN :: : . . :::::::::::::::.: .: CCDS58 TPQSKAAQPSPSTVPKTEDQRPQLDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSSN 160 170 180 190 200 210 70 80 90 100 110 120 pF1KB7 AGCIAWEGGHGEFKLTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGKRY ..::.::: .::::.:::::::::::::::::::::::::::::::::::::.::::::: CCDS58 SSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRY 220 230 240 250 260 270 130 140 150 160 170 pF1KB7 AYRFDFQGLAQACQPPPAHAHAAAAAAAAAAAAQDGALYKLPAGLAPLPFPGL-----SK ::.:::.:.::: :: : ...::: :. ::. : .: CCDS58 AYKFDFHGIAQALQPHPP----------------ESSLYKYPSD---LPYMGSYHAHPQK 280 290 300 310 180 190 200 210 220 230 pF1KB7 LNLMAASAGVAPA-GFSYWPGPGPAATAAAATAALYPSPSLQPPPGPFGAVAAASHLGGH .:..: . :. . :.. .:.: .. :...::. : : :::: . CCDS58 MNFVAPHPPALPVTSSSFFAAPNPYWNSP--TGGIYPNTRLPTSHMP-------SHLGTY 320 330 340 350 360 pF1KB7 YH : CCDS58 Y >>CCDS46649.1 ERG gene_id:2078|Hs108|chr21 (387 aa) initn: 618 init1: 596 opt: 611 Z-score: 465.1 bits: 94.1 E(32554): 1.4e-19 Smith-Waterman score: 640; 51.7% identity (68.7% similar) in 211 aa overlap (34-237:205-387) 10 20 30 40 50 60 pF1KB7 SGASQPLLINMYLPDPVGDGLFKDGKNPSWGPLSPAVQK-GSGQIQLWQFLLELLADRAN :: : . . :::::::::::::::.: .: CCDS46 TPQSKAAQPSPSTVPKTEDQRPQLDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSSN 180 190 200 210 220 230 70 80 90 100 110 120 pF1KB7 AGCIAWEGGHGEFKLTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGKRY ..::.::: .::::.:::::::::::::::::::::::::::::::::::::.::::::: CCDS46 SSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRY 240 250 260 270 280 290 130 140 150 160 170 pF1KB7 AYRFDFQGLAQACQPPPAHAHAAAAAAAAAAAAQDGALYKLPAGLAPLPFPGL-----SK ::.:::.:.::: :: : ...::: :. ::. : .: CCDS46 AYKFDFHGIAQALQPHPP----------------ESSLYKYPSD---LPYMGSYHAHPQK 300 310 320 330 180 190 200 210 220 230 pF1KB7 LNLMAASAGVAPA-GFSYWPGPGPAATAAAATAALYPSPSLQPPPGPFGAVAAASHLGGH .:..: . :. . :.. .:.: .. :...::. : : :::: . CCDS46 MNFVAPHPPALPVTSSSFFAAPNPYWNSP--TGGIYPNTRLPTSHMP-------SHLGTY 340 350 360 370 380 pF1KB7 YH : CCDS46 Y >>CCDS82674.1 ERG gene_id:2078|Hs108|chr21 (455 aa) initn: 618 init1: 596 opt: 611 Z-score: 464.2 bits: 94.2 E(32554): 1.6e-19 Smith-Waterman score: 640; 51.7% identity (68.7% similar) in 211 aa overlap (34-237:273-455) 10 20 30 40 50 60 pF1KB7 SGASQPLLINMYLPDPVGDGLFKDGKNPSWGPLSPAVQK-GSGQIQLWQFLLELLADRAN :: : . . :::::::::::::::.: .: CCDS82 TPQSKAAQPSPSTVPKTEDQRPQLDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSSN 250 260 270 280 290 300 70 80 90 100 110 120 pF1KB7 AGCIAWEGGHGEFKLTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGKRY ..::.::: .::::.:::::::::::::::::::::::::::::::::::::.::::::: CCDS82 SSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRY 310 320 330 340 350 360 130 140 150 160 170 pF1KB7 AYRFDFQGLAQACQPPPAHAHAAAAAAAAAAAAQDGALYKLPAGLAPLPFPGL-----SK ::.:::.:.::: :: : ...::: :. ::. : .: CCDS82 AYKFDFHGIAQALQPHPP----------------ESSLYKYPSD---LPYMGSYHAHPQK 370 380 390 400 180 190 200 210 220 230 pF1KB7 LNLMAASAGVAPA-GFSYWPGPGPAATAAAATAALYPSPSLQPPPGPFGAVAAASHLGGH .:..: . :. . :.. .:.: .. :...::. : : :::: . CCDS82 MNFVAPHPPALPVTSSSFFAAPNPYWNSP--TGGIYPNTRLPTSHMP-------SHLGTY 410 420 430 440 450 pF1KB7 YH : CCDS82 Y >>CCDS13657.1 ERG gene_id:2078|Hs108|chr21 (462 aa) initn: 618 init1: 596 opt: 611 Z-score: 464.1 bits: 94.2 E(32554): 1.6e-19 Smith-Waterman score: 640; 51.7% identity (68.7% similar) in 211 aa overlap (34-237:280-462) 10 20 30 40 50 60 pF1KB7 SGASQPLLINMYLPDPVGDGLFKDGKNPSWGPLSPAVQK-GSGQIQLWQFLLELLADRAN :: : . . :::::::::::::::.: .: CCDS13 TPQSKAAQPSPSTVPKTEDQRPQLDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSSN 250 260 270 280 290 300 70 80 90 100 110 120 pF1KB7 AGCIAWEGGHGEFKLTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGKRY ..::.::: .::::.:::::::::::::::::::::::::::::::::::::.::::::: CCDS13 SSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRY 310 320 330 340 350 360 130 140 150 160 170 pF1KB7 AYRFDFQGLAQACQPPPAHAHAAAAAAAAAAAAQDGALYKLPAGLAPLPFPGL-----SK ::.:::.:.::: :: : ...::: :. ::. : .: CCDS13 AYKFDFHGIAQALQPHPP----------------ESSLYKYPSD---LPYMGSYHAHPQK 370 380 390 400 410 180 190 200 210 220 230 pF1KB7 LNLMAASAGVAPA-GFSYWPGPGPAATAAAATAALYPSPSLQPPPGPFGAVAAASHLGGH .:..: . :. . :.. .:.: .. :...::. : : :::: . CCDS13 MNFVAPHPPALPVTSSSFFAAPNPYWNSP--TGGIYPNTRLPTSHMP-------SHLGTY 420 430 440 450 460 pF1KB7 YH : CCDS13 Y >>CCDS13658.1 ERG gene_id:2078|Hs108|chr21 (479 aa) initn: 618 init1: 596 opt: 611 Z-score: 463.9 bits: 94.2 E(32554): 1.6e-19 Smith-Waterman score: 640; 51.7% identity (68.7% similar) in 211 aa overlap (34-237:297-479) 10 20 30 40 50 60 pF1KB7 SGASQPLLINMYLPDPVGDGLFKDGKNPSWGPLSPAVQK-GSGQIQLWQFLLELLADRAN :: : . . :::::::::::::::.: .: CCDS13 TPQSKAAQPSPSTVPKTEDQRPQLDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSSN 270 280 290 300 310 320 70 80 90 100 110 120 pF1KB7 AGCIAWEGGHGEFKLTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGKRY ..::.::: .::::.:::::::::::::::::::::::::::::::::::::.::::::: CCDS13 SSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRY 330 340 350 360 370 380 130 140 150 160 170 pF1KB7 AYRFDFQGLAQACQPPPAHAHAAAAAAAAAAAAQDGALYKLPAGLAPLPFPGL-----SK ::.:::.:.::: :: : ...::: :. ::. : .: CCDS13 AYKFDFHGIAQALQPHPP----------------ESSLYKYPSD---LPYMGSYHAHPQK 390 400 410 420 180 190 200 210 220 230 pF1KB7 LNLMAASAGVAPA-GFSYWPGPGPAATAAAATAALYPSPSLQPPPGPFGAVAAASHLGGH .:..: . :. . :.. .:.: .. :...::. : : :::: . CCDS13 MNFVAPHPPALPVTSSSFFAAPNPYWNSP--TGGIYPNTRLPTSHMP-------SHLGTY 430 440 450 460 470 pF1KB7 YH : CCDS13 Y 238 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 00:37:24 2016 done: Mon Nov 7 00:37:24 2016 Total Scan time: 2.650 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]