FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7678, 406 aa 1>>>pF1KB7678 406 - 406 aa - 406 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 16.1488+/-0.000566; mu= -29.6122+/- 0.036 mean_var=931.7234+/-190.255, 0's: 0 Z-trim(124.3): 192 B-trim: 880 in 1/59 Lambda= 0.042018 statistics sampled from 45480 (45787) to 45480 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.783), E-opt: 0.2 (0.537), width: 16 Scan time: 10.900 The best scores are: opt bits E(85289) NP_703149 (OMIM: 300154) homeobox protein ESX1 [Ho ( 406) 2970 195.0 2.8e-49 NP_038463 (OMIM: 601881,611038) retinal homeobox p ( 346) 424 40.6 0.0072 >>NP_703149 (OMIM: 300154) homeobox protein ESX1 [Homo s (406 aa) initn: 2970 init1: 2970 opt: 2970 Z-score: 1005.8 bits: 195.0 E(85289): 2.8e-49 Smith-Waterman score: 2970; 100.0% identity (100.0% similar) in 406 aa overlap (1-406:1-406) 10 20 30 40 50 60 pF1KB7 MESLRGYTHSDIGYRSLAVGEDIEEVNDEKLTVTSLMARGGEDEENTRSKPEYGTEAENN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_703 MESLRGYTHSDIGYRSLAVGEDIEEVNDEKLTVTSLMARGGEDEENTRSKPEYGTEAENN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 VGTEGSVPSDDQDREGGGGHEPEQQQEEPPLTKPEQQQEEPPLLELKQEQEEPPQTTVEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_703 VGTEGSVPSDDQDREGGGGHEPEQQQEEPPLTKPEQQQEEPPLLELKQEQEEPPQTTVEG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 PQPAEGPQTAEGPQPPERKRRRRTAFTQFQLQELENFFDESQYPDVVARERLAARLNLTE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_703 PQPAEGPQTAEGPQPPERKRRRRTAFTQFQLQELENFFDESQYPDVVARERLAARLNLTE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 DRVQVWFQNRRAKWKRNQRVLMLRNTATADLAHPLDMFLGGAYYAAPALDPALCVHLVPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_703 DRVQVWFQNRRAKWKRNQRVLMLRNTATADLAHPLDMFLGGAYYAAPALDPALCVHLVPQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 LPRPPVLPVPPMPPRPPMVPMPPRPPIAPMPPMAPVPPGSRMAPVPPGPRMAPVPPWPPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_703 LPRPPVLPVPPMPPRPPMVPMPPRPPIAPMPPMAPVPPGSRMAPVPPGPRMAPVPPWPPM 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 APVPPWPPMAPVPTGPPMAPVPPGPPMARVPPGPPMARVPPGPPMAPLPPGPPMAPLPPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_703 APVPPWPPMAPVPTGPPMAPVPPGPPMARVPPGPPMARVPPGPPMAPLPPGPPMAPLPPG 310 320 330 340 350 360 370 380 390 400 pF1KB7 PPMAPLPPGPPMAPLPPRSHVPHTGLAPVHITWAPVINSYYACPFF :::::::::::::::::::::::::::::::::::::::::::::: NP_703 PPMAPLPPGPPMAPLPPRSHVPHTGLAPVHITWAPVINSYYACPFF 370 380 390 400 >>NP_038463 (OMIM: 601881,611038) retinal homeobox prote (346 aa) initn: 431 init1: 296 opt: 424 Z-score: 172.5 bits: 40.6 E(85289): 0.0072 Smith-Waterman score: 440; 31.5% identity (48.9% similar) in 321 aa overlap (65-371:56-320) 40 50 60 70 80 pF1KB7 SLMARGGEDEENTRSKPEYGTEAENNVGTEGSVPSDDQDREGGG----GHEPEQQQEE-- :. . ..::. :. . ::. .: NP_038 GGSTSRLHSIEAILGFTKDDGILGTFPAERGARGAKERDRRLGARPACPKAPEEGSEPSP 30 40 50 60 70 80 90 100 110 120 130 140 pF1KB7 PPLTKPEQQQEEP-PLLELKQEQEEPPQTTVEGPQPAEGPQTAEGPQPPERKRRRRTAFT :: : . : : : . . .: :: .:. . .: :: ...:: ::.:: NP_038 PPAPAPAPEYEAPRPYCPKEPGEARPSPGLPVGPATGEA-KLSEEEQPKKKHRRNRTTFT 90 100 110 120 130 140 150 160 170 180 190 200 pF1KB7 QFQLQELENFFDESQYPDVVARERLAARLNLTEDRVQVWFQNRRAKWKRNQRVLMLRNTA .::.::: :..:.:::: .::.::...:: : :::::::::::::.:... : . . NP_038 TYQLHELERAFEKSHYPDVYSREELAGKVNLPEVRVQVWFQNRRAKWRRQEK-LEVSSMK 150 160 170 180 190 200 210 220 230 240 250 260 pF1KB7 TADLAHPLDMFLGGAYYAAPALDPALCVHLVPQLPRPPVLPVPPMPPRPPMVPMPPRPPI : :.. . :: NP_038 LQD---------------------------------------------SPLLSFSRSPPS 210 270 280 290 300 310 320 pF1KB7 APMPPMAPVPPGSRMAPVPPGPRMAPVPPW--PPMAPVPPWPPMAPVPT-GPPMAPVPPG : . :.. . ::: .:. : . :. : ::. : . .: ::: .: . NP_038 ATLSPLG-AGPGSGGGPA--GGAL-PLESWLGPPL-PGGGATALQSLPGFGPPAQSLPAS 220 230 240 250 260 270 330 340 350 360 370 380 pF1KB7 --PPMARVPPGPPMARVPP-GPPMAPLPPGPPMAPLPPG-PPMAPLPPGPPMAPLPPRSH :: :: ::. :: :: . :: : :: : :: :: . : NP_038 YTPP----PPPPPFLNSPPLGPGLQPLAPPPPSYPCGPGFGDKFPLDEADPRNSSIAALR 280 290 300 310 320 390 400 pF1KB7 VPHTGLAPVHITWAPVINSYYACPFF NP_038 LKAKEHIQAIGKPWQAL 330 340 406 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 07:19:41 2016 done: Thu Nov 3 07:19:42 2016 Total Scan time: 10.900 Total Display time: -0.040 Function used was FASTA [36.3.4 Apr, 2011]