FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8883, 184 aa
  1>>>pF1KB8883 184 - 184 aa - 184 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.0496+/-0.000291; mu= 3.1404+/- 0.018
 mean_var=227.4590+/-46.327, 0's: 0 Z-trim(124.8): 262 B-trim: 21 in 1/61
 Lambda= 0.085040
 statistics sampled from 46944 (47246) to 46944 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.842), E-opt: 0.2 (0.554), width: 16
 Scan time: 6.860

The best scores are: opt bits E(85289)
NP_116142 (OMIM: 605726,610362,610381,613757) reti ( 184) 1287 169.1 3.6e-42
NP_001306003 (OMIM: 605726,610362,610381,613757) r ( 230) 1287 169.2 4.2e-42
NP_038463 (OMIM: 601881,611038) retinal homeobox p ( 346) 535 77.1 3.3e-14
NP_068745 (OMIM: 605420,609597,613451,615529) home ( 411) 380 58.2 2e-08
XP_016870292 (OMIM: 604675) PREDICTED: paired meso ( 193) 365 56.0 4.2e-08
NP_006483 (OMIM: 136760,606014) homeobox protein a ( 343) 366 56.4 5.7e-08
NP_057391 (OMIM: 604675) paired mesoderm homeobox ( 253) 363 55.9 6e-08
NP_620689 (OMIM: 300004,300215,300382,300419,30835 ( 562) 365 56.5 8.7e-08
NP_001290437 (OMIM: 612019) intestine-specific hom ( 245) 358 55.2 9e-08
NP_005160 (OMIM: 602078,602753) paired mesoderm ho ( 284) 348 54.1 2.3e-07
NP_703149 (OMIM: 300154) homeobox protein ESX1 [Ho ( 406) 347 54.1 3.2e-07
NP_001263380 (OMIM: 606701) dorsal root ganglia ho ( 263) 335 52.5 6.7e-07
NP_003915 (OMIM: 209880,603851,613013) paired meso ( 314) 335 52.5 7.5e-07
NP_008833 (OMIM: 167420,202650) paired mesoderm ho ( 217) 324 51.0 1.5e-06
NP_115485 (OMIM: 604529) homeobox protein orthoped ( 325) 327 51.6 1.5e-06
XP_016883326 (OMIM: 122000,148300,605020,614195) P ( 280) 324 51.1 1.8e-06
XP_006711451 (OMIM: 167420,202650) PREDICTED: pair ( 198) 321 50.6 1.8e-06
NP_001243201 (OMIM: 122000,148300,605020,614195) v ( 301) 324 51.2 1.9e-06
NP_073207 (OMIM: 167420,202650) paired mesoderm ho ( 245) 321 50.7 2.1e-06
NP_055403 (OMIM: 122000,148300,605020,614195) visu ( 365) 324 51.3 2.1e-06
NP_878314 (OMIM: 142993,610092,610093) visual syst ( 361) 322 51.0 2.5e-06
NP_852126 (OMIM: 122880,148820,193500,268220,60659 ( 403) 322 51.1 2.7e-06
NP_852125 (OMIM: 122880,148820,193500,268220,60659 ( 407) 322 51.1 2.7e-06
NP_001128726 (OMIM: 167410,268220) paired box prot ( 505) 323 51.3 2.9e-06
NP_039236 (OMIM: 167410,268220) paired box protein ( 518) 323 51.3 2.9e-06
NP_002575 (OMIM: 167410,268220) paired box protein ( 520) 323 51.3 2.9e-06
NP_852122 (OMIM: 122880,148820,193500,268220,60659 ( 479) 322 51.1 3e-06
NP_001120838 (OMIM: 122880,148820,193500,268220,60 ( 483) 322 51.1 3.1e-06
NP_852123 (OMIM: 122880,148820,193500,268220,60659 ( 484) 322 51.1 3.1e-06
NP_852124 (OMIM: 122880,148820,193500,268220,60659 ( 505) 322 51.2 3.1e-06
NP_001297090 (OMIM: 106210,120430,136520,148190,16 ( 286) 314 49.9 4.2e-06
NP_001297089 (OMIM: 106210,120430,136520,148190,16 ( 286) 314 49.9 4.2e-06
NP_001297088 (OMIM: 106210,120430,136520,148190,16 ( 401) 314 50.1 5.3e-06
NP_001245394 (OMIM: 106210,120430,136520,148190,16 ( 422) 314 50.1 5.5e-06
NP_001121084 (OMIM: 106210,120430,136520,148190,16 ( 422) 314 50.1 5.5e-06
NP_000271 (OMIM: 106210,120430,136520,148190,16555 ( 422) 314 50.1 5.5e-06
NP_001245393 (OMIM: 106210,120430,136520,148190,16 ( 422) 314 50.1 5.5e-06
NP_001245391 (OMIM: 106210,120430,136520,148190,16 ( 436) 314 50.1 5.6e-06
NP_001595 (OMIM: 106210,120430,136520,148190,16555 ( 436) 314 50.1 5.6e-06
NP_001297087 (OMIM: 106210,120430,136520,148190,16 ( 436) 314 50.1 5.6e-06
NP_001245392 (OMIM: 106210,120430,136520,148190,16 ( 436) 314 50.1 5.6e-06
NP_006252 (OMIM: 262600,601538) homeobox protein p ( 226) 307 49.0 6.5e-06
XP_011537084 (OMIM: 601527,613456) PREDICTED: ALX ( 231) 303 48.5 9.3e-06
NP_005020 (OMIM: 107250,602669,610623) pituitary h ( 302) 303 48.6
1.1e-05 NP_008913 (OMIM: 601527,613456) ALX homeobox prote ( 326) 303 48.6 1.2e-05 NP_000545 (OMIM: 120970,268000,602225,613829) cone ( 299) 298 48.0 1.7e-05 XP_016862544 (OMIM: 602504) PREDICTED: short statu ( 190) 293 47.2 1.9e-05 XP_006713791 (OMIM: 602504) PREDICTED: short statu ( 190) 293 47.2 1.9e-05 XP_016862542 (OMIM: 602504) PREDICTED: short statu ( 204) 293 47.2 2e-05 XP_016862543 (OMIM: 602504) PREDICTED: short statu ( 204) 293 47.2 2e-05 >>NP_116142 (OMIM: 605726,610362,610381,613757) retina a (184 aa) initn: 1287 init1: 1287 opt: 1287 Z-score: 878.0 bits: 169.1 E(85289): 3.6e-42 Smith-Waterman score: 1287; 100.0% identity (100.0% similar) in 184 aa overlap (1-184:1-184) 10 20 30 40 50 60 pF1KB8 MFLSPGEGPATEGGGLGPGEEAPKKKHRRNRTTFTTYQLHQLERAFEASHYPDVYSREEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_116 MFLSPGEGPATEGGGLGPGEEAPKKKHRRNRTTFTTYQLHQLERAFEASHYPDVYSREEL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 AAKVHLPEVRVQVWFQNRRAKWRRQERLESGSGAVAAPRLPEAPALPFARPPAMSLPLEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_116 AAKVHLPEVRVQVWFQNRRAKWRRQERLESGSGAVAAPRLPEAPALPFARPPAMSLPLEP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 WLGPGPPAVPGLPRLLGPGPGLQASFGPHAFAPTFADGFALEEASLRLLAKEHAQALDRA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_116 WLGPGPPAVPGLPRLLGPGPGLQASFGPHAFAPTFADGFALEEASLRLLAKEHAQALDRA 130 140 150 160 170 180 pF1KB8 WPPA :::: NP_116 WPPA >>NP_001306003 (OMIM: 605726,610362,610381,613757) retin (230 aa) initn: 1287 init1: 1287 opt: 1287 Z-score: 876.8 bits: 169.2 E(85289): 4.2e-42 Smith-Waterman score: 1287; 100.0% identity (100.0% similar) in 184 aa overlap (1-184:47-230) 10 20 30 pF1KB8 MFLSPGEGPATEGGGLGPGEEAPKKKHRRN :::::::::::::::::::::::::::::: NP_001 AWGSPALSLPVAPPAVSPPSVPLPSHQVGAMFLSPGEGPATEGGGLGPGEEAPKKKHRRN 20 30 40 50 60 70 40 50 60 70 80 90 pF1KB8 RTTFTTYQLHQLERAFEASHYPDVYSREELAAKVHLPEVRVQVWFQNRRAKWRRQERLES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 
RTTFTTYQLHQLERAFEASHYPDVYSREELAAKVHLPEVRVQVWFQNRRAKWRRQERLES 80 90 100 110 120 130 100 110 120 130 140 150 pF1KB8 GSGAVAAPRLPEAPALPFARPPAMSLPLEPWLGPGPPAVPGLPRLLGPGPGLQASFGPHA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GSGAVAAPRLPEAPALPFARPPAMSLPLEPWLGPGPPAVPGLPRLLGPGPGLQASFGPHA 140 150 160 170 180 190 160 170 180 pF1KB8 FAPTFADGFALEEASLRLLAKEHAQALDRAWPPA :::::::::::::::::::::::::::::::::: NP_001 FAPTFADGFALEEASLRLLAKEHAQALDRAWPPA 200 210 220 230 >>NP_038463 (OMIM: 601881,611038) retinal homeobox prote (346 aa) initn: 664 init1: 444 opt: 535 Z-score: 376.1 bits: 77.1 E(85289): 3.3e-14 Smith-Waterman score: 550; 55.6% identity (66.9% similar) in 178 aa overlap (4-155:112-283) 10 20 30 pF1KB8 MFLSPGE--GPATEGGGLGPGEEAPKKKHRRNR ::: :::: . :. :: ::::::::: NP_038 EPSPPPAPAPAPEYEAPRPYCPKEPGEARPSPGLPVGPATGEAKLSE-EEQPKKKHRRNR 90 100 110 120 130 140 40 50 60 70 80 90 pF1KB8 TTFTTYQLHQLERAFEASHYPDVYSREELAAKVHLPEVRVQVWFQNRRAKWRRQERLESG :::::::::.:::::: :::::::::::::.::.:::::::::::::::::::::.:: NP_038 TTFTTYQLHELERAFEKSHYPDVYSREELAGKVNLPEVRVQVWFQNRRAKWRRQEKLE-- 150 160 170 180 190 100 110 120 pF1KB8 SGAVAAPRLPEAPALPFAR-PPAMSL------------------PLEPWLGP-----GPP :.. .: ..: : :.: ::. .: ::: :::: : NP_038 ---VSSMKLQDSPLLSFSRSPPSATLSPLGAGPGSGGGPAGGALPLESWLGPPLPGGGAT 200 210 220 230 240 250 130 140 150 160 170 180 pF1KB8 AVPGLPRLLGPGPGLQASFGPHAFAPTFADGFALEEASLRLLAKEHAQALDRAWPPA :. .:: . :. .: ::. : : : NP_038 ALQSLPGFGPPAQSLPASYTPPPPPPPFLNSPPLGPGLQPLAPPPPSYPCGPGFGDKFPL 260 270 280 290 300 310 >>NP_068745 (OMIM: 605420,609597,613451,615529) homeobox (411 aa) initn: 361 init1: 327 opt: 380 Z-score: 272.4 bits: 58.2 E(85289): 2e-08 Smith-Waterman score: 380; 46.7% identity (70.4% similar) in 135 aa overlap (21-153:208-339) 10 20 30 40 50 pF1KB8 MFLSPGEGPATEGGGLGPGEEAPKKKHRRNRTTFTTYQLHQLERAFEASH :. : :.::::::::.:::..::..:. 
.: NP_068 SYLSVKEAGVKGPQDRASSDLPSPLEKADSESNKGKKRRNRTTFTSYQLEELEKVFQKTH 180 190 200 210 220 230 60 70 80 90 100 pF1KB8 YPDVYSREELAAKVHLPEVRVQVWFQNRRAKWRRQERLESGSGAVAAPRLPEAPALPF-A :::::.::.:: .. : :.::::::::::::::..::. :. . .. : ::. . NP_068 YPDVYAREQLAMRTDLTEARVQVWFQNRRAKWRKRERF--GQMQQVRTHFSTAYELPLLT 240 250 260 270 280 290 110 120 130 140 150 160 pF1KB8 RPPAMSLPLEP-WLGPGPPAVPGLPRLLGPGPGLQASFGPHAFAPTFADGFALEEASLRL : .. .: ::: . : : .: . : . : ..::: : NP_068 RAENYAQIQNPSWLGNNGAASP-VPACVVPCDPVPACMSPHAHPPGSGASSVTDFLSVSG 300 310 320 330 340 350 170 180 pF1KB8 LAKEHAQALDRAWPPA NP_068 AGSHVGQTHMGSLFGAASLSPGLNGYELNGEPDRKTSSIAALRMKAKEHSAAISWAT 360 370 380 390 400 410 >>XP_016870292 (OMIM: 604675) PREDICTED: paired mesoderm (193 aa) initn: 392 init1: 307 opt: 365 Z-score: 266.4 bits: 56.0 E(85289): 4.2e-08 Smith-Waterman score: 365; 47.2% identity (64.2% similar) in 176 aa overlap (3-172:22-181) 10 20 30 40 pF1KB8 MFLSPGEGPATEGGGLGPGEEAP-KKKHRRNRTTFTTYQLH :. ::: : . : : : :::.:::::::.. ::. XP_016 MGPLHRETGPERSGHRLKVTELGCGEG---ECPSPGRGSAAKRKKKQRRNRTTFNSSQLQ 10 20 30 40 50 50 60 70 80 90 pF1KB8 QLERAFEASHYPDVYSREELAAKVHLPEVRVQVWFQNRRAKWRRQER--LESGSGAVAAP :::.:: .::::.. ::::: .:.: :.::::::::::::.::.:: : : :... XP_016 ALERVFERTHYPDAFVREELARRVNLSEARVQVWFQNRRAKFRRNERAMLASRSASLLKS 60 70 80 90 100 110 100 110 120 130 140 150 pF1KB8 RLPEAPA-LPFA-RPPAMSLPLEPWLGPGP-PAVPGLPRLLGPGPGLQASFGPHAFAPTF :: : : :: :.: : . .: .:: : .:: : :: . . .. XP_016 YSQEAAIEQPVAPRPTALSPDYLSWTASSPYSTVP--PY--SPG-----SSGPATPGVNM 120 130 140 150 160 160 170 180 pF1KB8 ADGFALEEASLRLLAKEHAQALDRAWPPA :...: :::: ::: XP_016 ANSIA----SLRLKAKEFSLHHSQVPTVN 170 180 190 >>NP_006483 (OMIM: 136760,606014) homeobox protein arist (343 aa) initn: 349 init1: 318 opt: 366 Z-score: 264.0 bits: 56.4 E(85289): 5.7e-08 Smith-Waterman score: 366; 43.4% identity (63.3% similar) in 166 aa overlap (4-155:124-284) 10 20 pF1KB8 MFLSPGEGPATEGGGLGPG-----EEAPKK-KH ::: :. :.:: : : .: :. 
NP_006 EEKTSKAASFPQLPLDCRGGPRDGPSNLQGSPGPCLASLHLPLSPGLPDSMELAKNKSKK 100 110 120 130 140 150 30 40 50 60 70 80 pF1KB8 RRNRTTFTTYQLHQLERAFEASHYPDVYSREELAAKVHLPEVRVQVWFQNRRAKWRRQER :::::::.:.::..::..:. .::::::.::.:: .. : :.::::::::::::::..:: NP_006 RRNRTTFSTFQLEELEKVFQKTHYPDVYAREQLALRTDLTEARVQVWFQNRRAKWRKRER 160 170 180 190 200 210 90 100 110 120 130 pF1KB8 LESGSGAVAAPRLPEAPALPFARPPAM-SLP-LEP--WLGPGPPAVPGLPRLLGP----G : . : : . : .. : : : :. : .:: . :: : :..: . NP_006 Y----GKIQEGRNPFTAAYDISVLPRTDSHPQLQNSLWASPGSGS-PGGPCLVSPEGIPS 220 230 240 250 260 140 150 160 170 180 pF1KB8 PGLQASFGPHAFAPTFADGFALEEASLRLLAKEHAQALDRAWPPA : .. ::. . : NP_006 PCMSPYSHPHGSVAGFMGVPAPSAAHPGIYSIHGFPPTLGGHSFEPSSDGDYKSPSLVSL 270 280 290 300 310 320 >>NP_057391 (OMIM: 604675) paired mesoderm homeobox prot (253 aa) initn: 412 init1: 307 opt: 363 Z-score: 263.7 bits: 55.9 E(85289): 6e-08 Smith-Waterman score: 363; 45.2% identity (64.4% similar) in 177 aa overlap (4-172:78-241) 10 20 30 pF1KB8 MFLSPGEGPATEGGGLGPGEEAP---KKKHRRN : .:. .: .::. . :::.::: NP_057 EVAAAGRLAARPGARAEAREGAAREPSGGSSGSEAAPQDGECPSPGRGSAAKRKKKQRRN 50 60 70 80 90 100 40 50 60 70 80 pF1KB8 RTTFTTYQLHQLERAFEASHYPDVYSREELAAKVHLPEVRVQVWFQNRRAKWRRQER--L ::::.. ::. :::.:: .::::.. ::::: .:.: :.::::::::::::.::.:: : NP_057 RTTFNSSQLQALERVFERTHYPDAFVREELARRVNLSEARVQVWFQNRRAKFRRNERAML 110 120 130 140 150 160 90 100 110 120 130 140 pF1KB8 ESGSGAVAAPRLPEAPA-LPFA-RPPAMSLPLEPWLGPGP-PAVPGLPRLLGPGPGLQAS : :... :: : : :: :.: : . .: .:: : .:: : NP_057 ASRSASLLKSYSQEAAIEQPVAPRPTALSPDYLSWTASSPYSTVP--PY--SPG-----S 170 180 190 200 210 150 160 170 180 pF1KB8 FGPHAFAPTFADGFALEEASLRLLAKEHAQALDRAWPPA :: . . 
..:...: :::: ::: NP_057 SGPATPGVNMANSIA----SLRLKAKEFSLHHSQVPTVN 220 230 240 250 >>NP_620689 (OMIM: 300004,300215,300382,300419,308350,30 (562 aa) initn: 416 init1: 330 opt: 365 Z-score: 260.7 bits: 56.5 E(85289): 8.7e-08 Smith-Waterman score: 397; 50.4% identity (70.5% similar) in 129 aa overlap (6-132:307-426) 10 20 30 pF1KB8 MFLSPGEGPATEGGGLGPGEEAPKKKHRRNRTTFT :: . ..: : :.:.:: ::::: NP_620 AAAAAVATEGGELSPKEELLLHPEDAEGKDGEDSVCLSAGSDSEEGLLKRKQRRYRTTFT 280 290 300 310 320 330 40 50 60 70 80 90 pF1KB8 TYQLHQLERAFEASHYPDVYSREELAAKVHLPEVRVQVWFQNRRAKWRRQERLESGSGAV .:::..:::::. .:::::..::::: .. : :.::::::::::::::..:. .:: NP_620 SYQLEELERAFQKTHYPDVFTREELAMRLDLTEARVQVWFQNRRAKWRKREK----AGAQ 340 350 360 370 380 390 100 110 120 130 140 150 pF1KB8 AAPRLPEAPALPFARPPAMSLPLEPWL--GPGPPAVPGLPRLLGPGPGLQASFGPHAFAP . : :.::: : . . :: :.: .: :: :.: NP_620 THP-----PGLPFPGPLSATHPLSPYLDASPFPPHHPALDSAWTAAAAAAAAAFPSLPPP 400 410 420 430 440 160 170 180 pF1KB8 TFADGFALEEASLRLLAKEHAQALDRAWPPA NP_620 PGSASLPPSGAPLGLSTFLGAAVFRHPAFISPAFGRLFSTMAPLTSASTAAALLRQPTPA 450 460 470 480 490 500 >>NP_001290437 (OMIM: 612019) intestine-specific homeobo (245 aa) initn: 434 init1: 302 opt: 358 Z-score: 260.5 bits: 55.2 E(85289): 9e-08 Smith-Waterman score: 358; 39.2% identity (63.6% similar) in 176 aa overlap (5-169:57-227) 10 20 30 pF1KB8 MFLSPGEGPATEGGGLGPGEEAP---KKKHRRNR :::. :. .: : .. : .:..:: : NP_001 LSLSFSIEAILKRPARRSDMDRPEGPGEEGPGEAAASGSGLEKPPKDQPQEGRKSKRRVR 30 40 50 60 70 80 40 50 60 70 80 90 pF1KB8 TTFTTYQLHQLERAFEASHYPDVYSREELAAKVHLPEVRVQVWFQNRRAKWRRQERLESG ::::: :::.::. :. .:::::. : .:::...:::.:::.::::.:::::.::.. NP_001 TTFTTEQLHELEKIFHFTHYPDVHIRSQLAARINLPEARVQIWFQNQRAKWRKQEKI--- 90 100 110 120 130 140 100 110 120 130 140 pF1KB8 SGAVAAPR-LPEAP-ALP----FARPPAMSLPLEPWLGPGPPAVPGLPRLLGPG--PGLQ : ..::. : :: ::: : : : :. :.: :. :. . :. NP_001 -GNLGAPQQLSEASVALPTNLDVAGPTWTSTALRR-LAPPTSCCPSAQDQLASAWFPAWI 150 160 170 180 190 200 150 160 170 180 pF1KB8 ASFGPHAFAPTFADGFALEEASLRLLAKEHAQALDRAWPPA . . : . . :. 
.... . .: NP_001 TLLPAHPWETQPVPGLPIHQTCIPVLCILPPPHPKWGSICATST 210 220 230 240 >>NP_005160 (OMIM: 602078,602753) paired mesoderm homeob (284 aa) initn: 375 init1: 314 opt: 348 Z-score: 253.1 bits: 54.1 E(85289): 2.3e-07 Smith-Waterman score: 361; 41.2% identity (58.2% similar) in 182 aa overlap (18-184:81-257) 10 20 30 40 pF1KB8 MFLSPGEGPATEGGGLGPGEEAPKKKHRRNRTTFTTYQLHQLERAFE :. :.:.:: :::::. ::..:::.: NP_005 ALGSSNCALGALRDHQPAPYSAVPYKFFPEPSGLHEKRKQRRIRTTFTSAQLKELERVFA 60 70 80 90 100 110 50 60 70 80 90 100 pF1KB8 ASHYPDVYSREELAAKVHLPEVRVQVWFQNRRAKWRRQERLESGSGAVAAPRLPEAPAL- .::::.:.::::: :. : :.::::::::::::.:.::: :..::..: .. : NP_005 ETHYPDIYTREELALKIDLTEARVQVWFQNRRAKFRKQERAASAKGAAGAAGAKKGEARC 120 130 140 150 160 170 110 120 130 140 150 pF1KB8 ---------PFARP-PAMSLPLEPWLGPG---PPAVPG-LPRLLGPGPGLQASFGPHAFA : : . : : .:: : :. :: :: ::: . ::. . NP_005 SSEDDDSKESTCSPTPDSTASLPPPPAPGLASPRLSPSPLPVALGSGPG--PGPGPQPLK 180 190 200 210 220 160 170 180 pF1KB8 PTFADGFALEEASLRLLAKEHAQALDRAWPPA .. : : .. : : .:: :: NP_005 GALWAGVAGGGGG---GPGAGAAELLKAWQPAESGPGPFSGVLSSFHRKPGPALKTNLF 230 240 250 260 270 280 184 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:20:32 2016 done: Tue Nov 8 04:20:33 2016 Total Scan time: 6.860 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]