FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5222, 256 aa
  1>>>pF1KE5222 256 - 256 aa - 256 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6462+/-0.000825; mu= 12.6167+/- 0.049
 mean_var=59.5830+/-11.985, 0's: 0 Z-trim(105.8): 22 B-trim: 151 in 2/47
 Lambda= 0.166155
 statistics sampled from 8641 (8651) to 8641 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.649), E-opt: 0.2 (0.266), width: 16
 Scan time: 2.290

The best scores are:                                   opt  bits E(32554)
CCDS11719.1 MIF4GD gene_id:57409|Hs108|chr17   ( 256) 1715 419.3 1.3e-117
CCDS58598.1 MIF4GD gene_id:57409|Hs108|chr17   ( 263) 1691 413.6 7e-116
CCDS56044.1 MIF4GD gene_id:57409|Hs108|chr17   ( 222) 1306 321.3 3.6e-88
CCDS11935.1 CTIF gene_id:9811|Hs108|chr18      ( 598)  340  89.8 4.6e-18
CCDS45864.1 CTIF gene_id:9811|Hs108|chr18      ( 600)  340  89.8 4.7e-18

>>CCDS11719.1 MIF4GD gene_id:57409|Hs108|chr17 (256 aa)
 initn: 1715 init1: 1715 opt: 1715 Z-score: 2225.2 bits: 419.3 E(32554): 1.3e-117
Smith-Waterman score: 1715; 100.0% identity (100.0% similar) in 256 aa overlap (1-256:1-256)

               10        20        30        40        50        60
pF1KE5 MGEPSREEYKIQSFDAETQQLLKTALKVACFETEDGEYSVCQRSYSNCSRLMPSRCNTQY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MGEPSREEYKIQSFDAETQQLLKTALKVACFETEDGEYSVCQRSYSNCSRLMPSRCNTQY
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 RDPGAVDLEKVANVIVDHSLQDCVFSKEAGRMCYAIIQAESKQAGQSVFRRGLLNRLQQE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 RDPGAVDLEKVANVIVDHSLQDCVFSKEAGRMCYAIIQAESKQAGQSVFRRGLLNRLQQE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 YQAREQLRARSLQGWVCYVTFICNIFDYLRVNNMPMMALVNPVYDCLFRLAQPDSLSKEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 YQAREQLRARSLQGWVCYVTFICNIFDYLRVNNMPMMALVNPVYDCLFRLAQPDSLSKEE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 EVDCLVLQLHRVGEQLEKMNGQRMDELFVLIRDGFLLPTGLSSLAQLLLLEIIEFRAAGW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 EVDCLVLQLHRVGEQLEKMNGQRMDELFVLIRDGFLLPTGLSSLAQLLLLEIIEFRAAGW
              190       200       210       220       230       240

              250
pF1KE5 KTTPAAHKYYYSEVSD
       ::::::::::::::::
CCDS11 KTTPAAHKYYYSEVSD
              250

>>CCDS58598.1 MIF4GD gene_id:57409|Hs108|chr17 (263 aa)
 initn: 1546 init1: 1546 opt: 1691 Z-score: 2193.9 bits: 413.6 E(32554): 7e-116
Smith-Waterman score: 1691; 97.3% identity (97.3% similar) in 263 aa overlap (1-256:1-263)

               10        20               30        40        50
pF1KE5 MGEPSREEYKIQSFDAETQQLLKTALK-------VACFETEDGEYSVCQRSYSNCSRLMP
       :::::::::::::::::::::::::::       ::::::::::::::::::::::::::
CCDS58 MGEPSREEYKIQSFDAETQQLLKTALKAPSLECTVACFETEDGEYSVCQRSYSNCSRLMP
               10        20        30        40        50        60

            60        70        80        90       100       110
pF1KE5 SRCNTQYRDPGAVDLEKVANVIVDHSLQDCVFSKEAGRMCYAIIQAESKQAGQSVFRRGL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 SRCNTQYRDPGAVDLEKVANVIVDHSLQDCVFSKEAGRMCYAIIQAESKQAGQSVFRRGL
               70        80        90       100       110       120

           120       130       140       150       160       170
pF1KE5 LNRLQQEYQAREQLRARSLQGWVCYVTFICNIFDYLRVNNMPMMALVNPVYDCLFRLAQP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 LNRLQQEYQAREQLRARSLQGWVCYVTFICNIFDYLRVNNMPMMALVNPVYDCLFRLAQP
              130       140       150       160       170       180

           180       190       200       210       220       230
pF1KE5 DSLSKEEEVDCLVLQLHRVGEQLEKMNGQRMDELFVLIRDGFLLPTGLSSLAQLLLLEII
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 DSLSKEEEVDCLVLQLHRVGEQLEKMNGQRMDELFVLIRDGFLLPTGLSSLAQLLLLEII
              190       200       210       220       230       240

           240       250
pF1KE5 EFRAAGWKTTPAAHKYYYSEVSD
       :::::::::::::::::::::::
CCDS58 EFRAAGWKTTPAAHKYYYSEVSD
              250       260

>>CCDS56044.1 MIF4GD gene_id:57409|Hs108|chr17 (222 aa)
 initn: 1306 init1: 1306 opt: 1306 Z-score: 1696.4 bits: 321.3 E(32554): 3.6e-88
Smith-Waterman score: 1391; 86.7% identity (86.7% similar) in 256 aa overlap (1-256:1-222)

               10        20        30        40        50        60
pF1KE5 MGEPSREEYKIQSFDAETQQLLKTALKVACFETEDGEYSVCQRSYSNCSRLMPSRCNTQY
       :::::::::::::::::::::::::::
CCDS56 MGEPSREEYKIQSFDAETQQLLKTALK---------------------------------
               10        20

               70        80        90       100       110       120
pF1KE5 RDPGAVDLEKVANVIVDHSLQDCVFSKEAGRMCYAIIQAESKQAGQSVFRRGLLNRLQQE
        :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 -DPGAVDLEKVANVIVDHSLQDCVFSKEAGRMCYAIIQAESKQAGQSVFRRGLLNRLQQE
         30        40        50        60        70        80

              130       140       150       160       170       180
pF1KE5 YQAREQLRARSLQGWVCYVTFICNIFDYLRVNNMPMMALVNPVYDCLFRLAQPDSLSKEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 YQAREQLRARSLQGWVCYVTFICNIFDYLRVNNMPMMALVNPVYDCLFRLAQPDSLSKEE
         90       100       110       120       130       140

              190       200       210       220       230       240
pF1KE5 EVDCLVLQLHRVGEQLEKMNGQRMDELFVLIRDGFLLPTGLSSLAQLLLLEIIEFRAAGW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 EVDCLVLQLHRVGEQLEKMNGQRMDELFVLIRDGFLLPTGLSSLAQLLLLEIIEFRAAGW
        150       160       170       180       190       200

              250
pF1KE5 KTTPAAHKYYYSEVSD
       ::::::::::::::::
CCDS56 KTTPAAHKYYYSEVSD
        210       220

>>CCDS11935.1 CTIF gene_id:9811|Hs108|chr18 (598 aa)
 initn: 247 init1: 199 opt: 340 Z-score: 437.7 bits: 89.8 E(32554): 4.6e-18
Smith-Waterman score: 340; 33.2% identity (69.0% similar) in 187 aa overlap (68-250:410-589)

        40        50        60        70        80        90
pF1KE5 YSVCQRSYSNCSRLMPSRCNTQYRDPGAVDLEKVANVIVDHSLQDCVFSKEAGRMC--YA
                                     : ... .: .....:  :.  :...:  .:
CCDS11 NSMRNNSSDVDTKLTTFMEEAQNSTNSEEMLGEIVRTIYQKAVSDRSFAFTAAKLCDKMA
     380       390       400       410       420       430

         100       110       120       130       140       150
pF1KE5 IIQAESKQAGQSVFRRGLLNRLQQEYQAREQLRARSLQGWVCYVTFICNIFDYLRVNN-M
       ....:. .     ::  ::: ::... .::.:. .... :. ..::.:..:  .: ..
CCDS11 LFMVEGTK-----FRSLLLNMLQKDFTVREELQQQDVERWLGFITFLCEVFGTMRSSTGE
     440            450       460       470       480       490

          160       170       180       190       200       210
pF1KE5 PMMALVNPVYDCLFRLAQPDSLSKEEEVDCLVLQLHRVGEQLEKMNGQRMDELFVLIRDG
       :. .:: :.: :: .: : ... ::. : :  ..:. .:. ::..  . : ::..  ::
CCDS11 PFRVLVCPIYTCLRELLQSQDV-KEDAVLCCSMELQSTGRLLEEQLPEMMTELLASARDK
          500       510        520       530       540       550

          220       230       240        250
pF1KE5 FLLPTGLSSLAQLLLLEIIEFRAAGWKT-TPAAHKYYYSEVSD
       .: :.  : :.. ::::.::..: .:.  ::   .::
CCDS11 MLCPSE-SMLTRSLLLEVIELHANSWNPLTPPITQYYNRTIQKLTA
            560       570       580       590

>>CCDS45864.1 CTIF gene_id:9811|Hs108|chr18 (600 aa)
 initn: 247 init1: 199 opt: 340 Z-score: 437.7 bits: 89.8 E(32554): 4.7e-18
Smith-Waterman score: 340; 33.2% identity (69.0% similar) in 187 aa overlap (68-250:412-591)

        40        50        60        70        80        90
pF1KE5 YSVCQRSYSNCSRLMPSRCNTQYRDPGAVDLEKVANVIVDHSLQDCVFSKEAGRMC--YA
                                     : ... .: .....:  :.  :...:  .:
CCDS45 NSMRNNSSDVDTKLTTFMEEAQNSTNSEEMLGEIVRTIYQKAVSDRSFAFTAAKLCDKMA
             390       400       410       420       430       440

         100       110       120       130       140       150
pF1KE5 IIQAESKQAGQSVFRRGLLNRLQQEYQAREQLRARSLQGWVCYVTFICNIFDYLRVNN-M
       ....:. .     ::  ::: ::... .::.:. .... :. ..::.:..:  .: ..
CCDS45 LFMVEGTK-----FRSLLLNMLQKDFTVREELQQQDVERWLGFITFLCEVFGTMRSSTGE
                  450       460       470       480       490

          160       170       180       190       200       210
pF1KE5 PMMALVNPVYDCLFRLAQPDSLSKEEEVDCLVLQLHRVGEQLEKMNGQRMDELFVLIRDG
       :. .:: :.: :: .: : ... ::. : :  ..:. .:. ::..  . : ::..  ::
CCDS45 PFRVLVCPIYTCLRELLQSQDV-KEDAVLCCSMELQSTGRLLEEQLPEMMTELLASARDK
        500       510        520       530       540       550

          220       230       240        250
pF1KE5 FLLPTGLSSLAQLLLLEIIEFRAAGWKT-TPAAHKYYYSEVSD
       .: :.  : :.. ::::.::..: .:.  ::   .::
CCDS45 MLCPSE-SMLTRSLLLEVIELHANSWNPLTPPITQYYNRTIQKLTA
         560        570       580       590       600

256 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 22:38:56 2016 done: Mon Nov 7 22:38:56 2016
 Total Scan time: 2.290 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]