FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8898, 232 aa
  1>>>pF1KB8898 232 - 232 aa - 232 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.3904+/-0.000367; mu= 0.6704+/- 0.023
 mean_var=391.4021+/-80.234, 0's: 0 Z-trim(125.9): 112 B-trim: 877 in 2/58
 Lambda= 0.064828
 statistics sampled from 50453 (50585) to 50453 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.852), E-opt: 0.2 (0.593), width: 16
 Scan time: 7.160

The best scores are:                                       opt  bits E(85289)
NP_114150 (OMIM: 609852) homeobox protein MIXL1 is ( 232) 1595 161.6 1e-39
NP_001269331 (OMIM: 609852) homeobox protein MIXL1 ( 240) 1569 159.2 5.6e-39
NP_001306003 (OMIM: 605726,610362,610381,613757) r ( 230)  351  45.3 0.00011
NP_038463 (OMIM: 601881,611038) retinal homeobox p ( 346)  326  43.2 0.00069
NP_852126 (OMIM: 122880,148820,193500,268220,60659 ( 403)  301  40.9 0.0038
NP_852125 (OMIM: 122880,148820,193500,268220,60659 ( 407)  301  40.9 0.0039
NP_006252 (OMIM: 262600,601538) homeobox protein p ( 226)  294  39.9 0.0042
NP_852122 (OMIM: 122880,148820,193500,268220,60659 ( 479)  301  41.0 0.0043
NP_116142 (OMIM: 605726,610362,610381,613757) reti ( 184)  292  39.6 0.0043
NP_001120838 (OMIM: 122880,148820,193500,268220,60 ( 483)  301  41.0 0.0043
NP_852123 (OMIM: 122880,148820,193500,268220,60659 ( 484)  301  41.0 0.0043
NP_852124 (OMIM: 122880,148820,193500,268220,60659 ( 505)  301  41.0 0.0044
NP_006483 (OMIM: 136760,606014) homeobox protein a ( 343)  286  39.4 0.0092
NP_115485 (OMIM: 604529) homeobox protein orthoped ( 325)  285  39.3 0.0095
XP_016883326 (OMIM: 122000,148300,605020,614195) P ( 280)  283  39.0 0.0099

>>NP_114150 (OMIM: 609852) homeobox protein MIXL1 isofor  (232 aa)
 initn: 1595 init1: 1595 opt: 1595  Z-score: 834.1  bits: 161.6 E(85289): 1e-39
Smith-Waterman score: 1595; 100.0% identity
(100.0% similar) in 232 aa overlap (1-232:1-232)

               10        20        30        40        50        60
pF1KB8 MATAESRALQFAEGAAFPAYRAPHAGGALLPPPSPAAALLPAPPAGPGPATFAGFLGRDP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_114 MATAESRALQFAEGAAFPAYRAPHAGGALLPPPSPAAALLPAPPAGPGPATFAGFLGRDP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 GPAPPPPASLGSPAPPKGAAAPSASQRRKRTSFSAEQLQLLELVFRRTRYPDIHLRERLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_114 GPAPPPPASLGSPAPPKGAAAPSASQRRKRTSFSAEQLQLLELVFRRTRYPDIHLRERLA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 ALTLLPESRIQVWFQNRRAKSRRQSGKSFQPLARPEIILNHCAPGTETKCLKPQLPLEVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_114 ALTLLPESRIQVWFQNRRAKSRRQSGKSFQPLARPEIILNHCAPGTETKCLKPQLPLEVD
              130       140       150       160       170       180

              190       200       210       220       230
pF1KB8 VNCLPEPNGVGGGISDSSSQGQNFETCSPLSEDIGSKLDSWEEHIFSAFGNF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_114 VNCLPEPNGVGGGISDSSSQGQNFETCSPLSEDIGSKLDSWEEHIFSAFGNF
              190       200       210       220       230

>>NP_001269331 (OMIM: 609852) homeobox protein MIXL1 iso  (240 aa)
 initn: 1058 init1: 891 opt: 1569  Z-score: 820.8  bits: 159.2 E(85289): 5.6e-39
Smith-Waterman score: 1569; 96.7% identity (96.7% similar) in 240 aa overlap (1-232:1-240)

               10        20        30        40        50        60
pF1KB8 MATAESRALQFAEGAAFPAYRAPHAGGALLPPPSPAAALLPAPPAGPGPATFAGFLGRDP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MATAESRALQFAEGAAFPAYRAPHAGGALLPPPSPAAALLPAPPAGPGPATFAGFLGRDP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 GPAPPPPASLGSPAPPKGAAAPSASQRRKRTSFSAEQLQLLELVFRRTRYPDIHLRERLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GPAPPPPASLGSPAPPKGAAAPSASQRRKRTSFSAEQLQLLELVFRRTRYPDIHLRERLA
               70        80        90       100       110       120

              130               140       150       160       170
pF1KB8 ALTLLPESRIQ--------VWFQNRRAKSRRQSGKSFQPLARPEIILNHCAPGTETKCLK
       :::::::::::        :::::::::::::::::::::::::::::::::::::::::
NP_001 ALTLLPESRIQLLFSPLFQVWFQNRRAKSRRQSGKSFQPLARPEIILNHCAPGTETKCLK
              130       140       150       160       170       180

            180       190       200       210       220       230
pF1KB8 
PQLPLEVDVNCLPEPNGVGGGISDSSSQGQNFETCSPLSEDIGSKLDSWEEHIFSAFGNF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PQLPLEVDVNCLPEPNGVGGGISDSSSQGQNFETCSPLSEDIGSKLDSWEEHIFSAFGNF 190 200 210 220 230 240 >>NP_001306003 (OMIM: 605726,610362,610381,613757) retin (230 aa) initn: 308 init1: 256 opt: 351 Z-score: 205.3 bits: 45.3 E(85289): 0.00011 Smith-Waterman score: 351; 40.4% identity (60.6% similar) in 193 aa overlap (13-197:6-191) 10 20 30 40 50 pF1KB8 MATAESRALQFAEGAAFP-AYRAPHAGGALLPPPSPAAALLPAPPAGPGPATFAGFLGRD ::. :: : : .. :: : .: :. .::. : :. .: . . NP_001 MPAPVEGTDFPGAGRQAWGSPALSLPVAPPAV---SPPSVPLPSHQVGAMFLS 10 20 30 40 50 60 70 80 90 100 110 pF1KB8 PGPAPPPPASLGSPAPPKGAAAPSASQRRKRTSFSAEQLQLLELVFRRTRYPDIHLRERL :: . ::. :. : : ::. ..::.::.:.. ::. :: .:. ..:::.. ::.: NP_001 PGEG---PATEGGGLGP-GEEAPKKKHRRNRTTFTTYQLHQLERAFEASHYPDVYSREEL 60 70 80 90 100 120 130 140 150 160 170 pF1KB8 AALTLLPESRIQVWFQNRRAKSRRQ----SGKSFQPLAR-PEI-ILNHCAPGTETKCLKP :: . ::: :.:::::::::: ::: ::.. : :: : : . . :.: NP_001 AAKVHLPEVRVQVWFQNRRAKWRRQERLESGSGAVAAPRLPEAPALPFARPPAMSLPLEP 110 120 130 140 150 160 180 190 200 210 220 230 pF1KB8 QL-PLEVDVNCLPEPNGVGGGISDSSSQGQNFETCSPLSEDIGSKLDSWEEHIFSAFGNF : : : ::. : : :.. : NP_001 WLGPGPPAVPGLPRLLGPGPGLQASFGPHAFAPTFADGFALEEASLRLLAKEHAQALDRA 170 180 190 200 210 220 >>NP_038463 (OMIM: 601881,611038) retinal homeobox prote (346 aa) initn: 251 init1: 251 opt: 326 Z-score: 190.8 bits: 43.2 E(85289): 0.00069 Smith-Waterman score: 326; 40.9% identity (63.1% similar) in 149 aa overlap (4-144:60-194) 10 20 30 pF1KB8 MATAESRALQFAEGAAFPAYRAPHAGGALLPPP :. : ... : : .::. :. ::: NP_038 SRLHSIEAILGFTKDDGILGTFPAERGARGAKERDRRLGARPACP--KAPEEGSEPSPPP 30 40 50 60 70 80 40 50 60 70 80 pF1KB8 SPAAALLPAPP-AGPGPATFAGFLGRDPGPAPPPPASLGSPAPPKGAAA-------PSAS .:: ::: .: : . ..:: : : : : :. : . : :. . 
NP_038 APA----PAPEYEAPRP-----YCPKEPGEARPSP---GLPVGPATGEAKLSEEEQPKKK 90 100 110 120 130 90 100 110 120 130 140 pF1KB8 QRRKRTSFSAEQLQLLELVFRRTRYPDIHLRERLAALTLLPESRIQVWFQNRRAKSRRQS .::.::.:.. ::. :: .:....:::.. ::.::. . ::: :.:::::::::: ::: NP_038 HRRNRTTFTTYQLHELERAFEKSHYPDVYSREELAGKVNLPEVRVQVWFQNRRAKWRRQE 140 150 160 170 180 190 150 160 170 180 190 200 pF1KB8 GKSFQPLARPEIILNHCAPGTETKCLKPQLPLEVDVNCLPEPNGVGGGISDSSSQGQNFE NP_038 KLEVSSMKLQDSPLLSFSRSPPSATLSPLGAGPGSGGGPAGGALPLESWLGPPLPGGGAT 200 210 220 230 240 250 >>NP_852126 (OMIM: 122880,148820,193500,268220,606597) p (403 aa) initn: 302 init1: 251 opt: 301 Z-score: 177.4 bits: 40.9 E(85289): 0.0038 Smith-Waterman score: 301; 40.1% identity (62.8% similar) in 137 aa overlap (63-199:196-326) 40 50 60 70 80 90 pF1KB8 PSPAAALLPAPPAGPGPATFAGFLGRDPGPAPPPPASLGSPAPPKGAAAPSASQRRKRTS : : .. :: . . .:::.::. NP_852 EEEEADLERKEAEESEKKAKHSIDGILSERASAPQSDEGSDIDSEPDLPLKRKQRRSRTT 170 180 190 200 210 220 100 110 120 130 140 150 pF1KB8 FSAEQLQLLELVFRRTRYPDIHLRERLAALTLLPESRIQVWFQNRRAKSRRQSGKSFQPL :.::::. :: .:.::.::::. ::.:: . : :.:.::::.::::. :.:.: . : . NP_852 FTAEQLEELERAFERTHYPDIYTREELAQRAKLTEARVQVWFSNRRARWRKQAGAN-QLM 230 240 250 260 270 280 160 170 180 190 200 210 pF1KB8 ARPEIILNHCAPGTETKCLKPQLPLEVDVNCLPEPNGVGGGISDSSSQGQNFETCSPLSE : .:: :: : :: . .:... ..:: :: NP_852 A-----FNHLIPGGFPPTAMPTLPTYQLSETSYQPTSIPQAVSDPSSTVHRPQPLPPSTV 290 300 310 320 330 220 230 pF1KB8 DIGSKLDSWEEHIFSAFGNF NP_852 HQSTIPSNPDSSSAYCLPSTRHGFSSYTDSFVPPSGPSNPMNPTIGNGLSPQVPFIISSQ 340 350 360 370 380 390 >>NP_852125 (OMIM: 122880,148820,193500,268220,606597) p (407 aa) initn: 302 init1: 251 opt: 301 Z-score: 177.4 bits: 40.9 E(85289): 0.0039 Smith-Waterman score: 301; 40.1% identity (62.8% similar) in 137 aa overlap (63-199:196-326) 40 50 60 70 80 90 pF1KB8 PSPAAALLPAPPAGPGPATFAGFLGRDPGPAPPPPASLGSPAPPKGAAAPSASQRRKRTS : : .. :: . . .:::.::. 
NP_852 EEEEADLERKEAEESEKKAKHSIDGILSERASAPQSDEGSDIDSEPDLPLKRKQRRSRTT 170 180 190 200 210 220 100 110 120 130 140 150 pF1KB8 FSAEQLQLLELVFRRTRYPDIHLRERLAALTLLPESRIQVWFQNRRAKSRRQSGKSFQPL :.::::. :: .:.::.::::. ::.:: . : :.:.::::.::::. :.:.: . : . NP_852 FTAEQLEELERAFERTHYPDIYTREELAQRAKLTEARVQVWFSNRRARWRKQAGAN-QLM 230 240 250 260 270 280 160 170 180 190 200 210 pF1KB8 ARPEIILNHCAPGTETKCLKPQLPLEVDVNCLPEPNGVGGGISDSSSQGQNFETCSPLSE : .:: :: : :: . .:... ..:: :: NP_852 A-----FNHLIPGGFPPTAMPTLPTYQLSETSYQPTSIPQAVSDPSSTVHRPQPLPPSTV 290 300 310 320 330 220 230 pF1KB8 DIGSKLDSWEEHIFSAFGNF NP_852 HQSTIPSNPDSSSAYCLPSTRHGFSSYTDSFVPPSGPSNPMNPTIGNGLSPQVPFIISSQ 340 350 360 370 380 390 >>NP_006252 (OMIM: 262600,601538) homeobox protein proph (226 aa) initn: 254 init1: 254 opt: 294 Z-score: 176.6 bits: 39.9 E(85289): 0.0042 Smith-Waterman score: 294; 38.4% identity (56.2% similar) in 185 aa overlap (37-213:19-193) 10 20 30 40 50 60 pF1KB8 RALQFAEGAAFPAYRAPHAGGALLPPPSPAAALLPA-PPAGPGPATFAGFLGRDPGPAPP ..::: :: :.: . . ::: NP_006 MEAERRRQAEKPKKGRVGSSLLPERHPATGTPTTTVD------SSAPP 10 20 30 40 70 80 90 100 110 pF1KB8 ----PPASLGSP--APPKGAAAPSASQRRKRTSFSAEQLQLLELVFRRTRYPDIHLRERL : :. : .: : . :.::.::.:: ::. :: .: :..:::: :: : NP_006 CRRLPGAGGGRSRFSPQGGQRGRPHSRRRHRTTFSPVQLEQLESAFGRNQYPDIWARESL 50 60 70 80 90 100 120 130 140 150 160 170 pF1KB8 AALTLLPESRIQVWFQNRRAKSRRQSGKSFQPLAR-PEIILNHCAPGTETKCLKPQLPLE : : : :.::::::::::::.:.: . .::::. .. : . : : NP_006 ARDTGLSEARIQVWFQNRRAKQRKQERSLLQPLAHLSPAAFSSFLPES-TACPYSYAAPP 110 120 130 140 150 160 180 190 200 210 220 230 pF1KB8 VDVNCLPEPNGVGGGISDSSSQGQNFETCSPLSEDIGSKLDSWEEHIFSAFGNF :.:.:.: . .. .. : : : . 
: ::: NP_006 PPVTCFPHP--YSHALPSQPSTGGAF-ALSHQSEDWYPTLHPAPAGHLPCPPPPPMLPLS 170 180 190 200 210 NP_006 LEPSKSWN 220 >>NP_852122 (OMIM: 122880,148820,193500,268220,606597) p (479 aa) initn: 302 init1: 251 opt: 301 Z-score: 176.6 bits: 41.0 E(85289): 0.0043 Smith-Waterman score: 301; 40.1% identity (62.8% similar) in 137 aa overlap (63-199:196-326) 40 50 60 70 80 90 pF1KB8 PSPAAALLPAPPAGPGPATFAGFLGRDPGPAPPPPASLGSPAPPKGAAAPSASQRRKRTS : : .. :: . . .:::.::. NP_852 EEEEADLERKEAEESEKKAKHSIDGILSERASAPQSDEGSDIDSEPDLPLKRKQRRSRTT 170 180 190 200 210 220 100 110 120 130 140 150 pF1KB8 FSAEQLQLLELVFRRTRYPDIHLRERLAALTLLPESRIQVWFQNRRAKSRRQSGKSFQPL :.::::. :: .:.::.::::. ::.:: . : :.:.::::.::::. :.:.: . : . NP_852 FTAEQLEELERAFERTHYPDIYTREELAQRAKLTEARVQVWFSNRRARWRKQAGAN-QLM 230 240 250 260 270 280 160 170 180 190 200 210 pF1KB8 ARPEIILNHCAPGTETKCLKPQLPLEVDVNCLPEPNGVGGGISDSSSQGQNFETCSPLSE : .:: :: : :: . .:... ..:: :: NP_852 A-----FNHLIPGGFPPTAMPTLPTYQLSETSYQPTSIPQAVSDPSSTVHRPQPLPPSTV 290 300 310 320 330 220 230 pF1KB8 DIGSKLDSWEEHIFSAFGNF NP_852 HQSTIPSNPDSSSAYCLPSTRHGFSSYTDSFVPPSGPSNPMNPTIGNGLSPQVMGLLTNH 340 350 360 370 380 390 >>NP_116142 (OMIM: 605726,610362,610381,613757) retina a (184 aa) initn: 291 init1: 256 opt: 292 Z-score: 176.6 bits: 39.6 E(85289): 0.0043 Smith-Waterman score: 292; 42.8% identity (62.1% similar) in 145 aa overlap (60-197:5-145) 30 40 50 60 70 80 pF1KB8 LPPPSPAAALLPAPPAGPGPATFAGFLGRDPGPAPPPPASLGSPAPPKGAAAPSASQRRK :: .: :. :. : : ::. ..::. NP_116 MFLSPGEGP---ATEGGGLGP-GEEAPKKKHRRN 10 20 30 90 100 110 120 130 140 pF1KB8 RTSFSAEQLQLLELVFRRTRYPDIHLRERLAALTLLPESRIQVWFQNRRAKSRRQ----S ::.:.. ::. :: .:. ..:::.. ::.::: . ::: :.:::::::::: ::: : NP_116 RTTFTTYQLHQLERAFEASHYPDVYSREELAAKVHLPEVRVQVWFQNRRAKWRRQERLES 40 50 60 70 80 90 150 160 170 180 190 200 pF1KB8 GKSFQPLAR-PEI-ILNHCAPGTETKCLKPQL-PLEVDVNCLPEPNGVGGGISDSSSQGQ :.. : :: : : . . :.: : : : ::. : : :.. 
: NP_116 GSGAVAAPRLPEAPALPFARPPAMSLPLEPWLGPGPPAVPGLPRLLGPGPGLQASFGPHA 100 110 120 130 140 150 210 220 230 pF1KB8 NFETCSPLSEDIGSKLDSWEEHIFSAFGNF NP_116 FAPTFADGFALEEASLRLLAKEHAQALDRAWPPA 160 170 180 >>NP_001120838 (OMIM: 122880,148820,193500,268220,606597 (483 aa) initn: 302 init1: 251 opt: 301 Z-score: 176.5 bits: 41.0 E(85289): 0.0043 Smith-Waterman score: 301; 40.1% identity (62.8% similar) in 137 aa overlap (63-199:195-325) 40 50 60 70 80 90 pF1KB8 PSPAAALLPAPPAGPGPATFAGFLGRDPGPAPPPPASLGSPAPPKGAAAPSASQRRKRTS : : .. :: . . .:::.::. NP_001 EEEEADLERKEAEESEKKAKHSIDGILSERASAPQSDEGSDIDSEPDLPLKRKQRRSRTT 170 180 190 200 210 220 100 110 120 130 140 150 pF1KB8 FSAEQLQLLELVFRRTRYPDIHLRERLAALTLLPESRIQVWFQNRRAKSRRQSGKSFQPL :.::::. :: .:.::.::::. ::.:: . : :.:.::::.::::. :.:.: . : . NP_001 FTAEQLEELERAFERTHYPDIYTREELAQRAKLTEARVQVWFSNRRARWRKQAGAN-QLM 230 240 250 260 270 280 160 170 180 190 200 210 pF1KB8 ARPEIILNHCAPGTETKCLKPQLPLEVDVNCLPEPNGVGGGISDSSSQGQNFETCSPLSE : .:: :: : :: . .:... ..:: :: NP_001 A-----FNHLIPGGFPPTAMPTLPTYQLSETSYQPTSIPQAVSDPSSTVHRPQPLPPSTV 290 300 310 320 330 220 230 pF1KB8 DIGSKLDSWEEHIFSAFGNF NP_001 HQSTIPSNPDSSSAYCLPSTRHGFSSYTDSFVPPSGPSNPMNPTIGNGLSPQVMGLLTNH 340 350 360 370 380 390 232 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 17:46:33 2016 done: Mon Nov 7 17:46:34 2016 Total Scan time: 7.160 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]