FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9706, 338 aa 1>>>pF1KB9706 338 - 338 aa - 338 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.7630+/-0.000423; mu= -2.7563+/- 0.027 mean_var=553.7968+/-113.029, 0's: 0 Z-trim(126.0): 126 B-trim: 992 in 1/58 Lambda= 0.054500 statistics sampled from 50785 (50924) to 50785 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.838), E-opt: 0.2 (0.597), width: 16 Scan time: 6.730 The best scores are: opt bits E(85289) NP_067015 (OMIM: 142986) homeobox protein Hox-D11 ( 338) 2388 201.4 2.3e-51 NP_055027 (OMIM: 605559) homeobox protein Hox-C11 ( 304) 643 64.2 4.2e-10 NP_005514 (OMIM: 142958,605432) homeobox protein H ( 313) 547 56.6 8.1e-08 NP_061824 (OMIM: 142957) homeobox protein Hox-A10 ( 410) 400 45.2 0.00029 NP_076922 (OMIM: 142964) homeobox protein Hox-B9 [ ( 250) 368 42.4 0.0012 NP_059105 (OMIM: 605560) homeobox protein Hox-C10 ( 342) 362 42.1 0.002 NP_689952 (OMIM: 142956) homeobox protein Hox-A9 [ ( 272) 350 41.1 0.0034 NP_055028 (OMIM: 142982) homeobox protein Hox-D9 [ ( 352) 343 40.7 0.0058 NP_008828 (OMIM: 142971) homeobox protein Hox-C9 [ ( 260) 339 40.2 0.0061 >>NP_067015 (OMIM: 142986) homeobox protein Hox-D11 [Hom (338 aa) initn: 2388 init1: 2388 opt: 2388 Z-score: 1043.3 bits: 201.4 E(85289): 2.3e-51 Smith-Waterman score: 2388; 100.0% identity (100.0% similar) in 338 aa overlap (1-338:1-338) 10 20 30 40 50 60 pF1KB9 MNDFDECGQSAASMYLPGCAYYVAPSDFASKPSFLSQPSSCQMTFPYSSNLAPHVQPVRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_067 MNDFDECGQSAASMYLPGCAYYVAPSDFASKPSFLSQPSSCQMTFPYSSNLAPHVQPVRE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 VAFRDYGLERAKWPYRGGGGGGSAGGGSSGGGPGGGGGGAGGYAPYYAAAAAAAAAAAAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_067 VAFRDYGLERAKWPYRGGGGGGSAGGGSSGGGPGGGGGGAGGYAPYYAAAAAAAAAAAAA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 EEAAMQRELLPPAGRRPDVLFKAPEPVCAAPGPPHGPAGAASNFYSAVGRNGILPQGFDQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_067 EEAAMQRELLPPAGRRPDVLFKAPEPVCAAPGPPHGPAGAASNFYSAVGRNGILPQGFDQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 FYEAAPGPPFAGPQPPPPPAPPQPEGAADKGDPRTGAGGGGGSPCTKATPGSEPKGAAEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_067 FYEAAPGPPFAGPQPPPPPAPPQPEGAADKGDPRTGAGGGGGSPCTKATPGSEPKGAAEG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 SGGDGEGPPGEAGAEKSSSAVAPQRSRKKRCPYTKYQIRELEREFFFNVYINKEKRLQLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_067 SGGDGEGPPGEAGAEKSSSAVAPQRSRKKRCPYTKYQIRELEREFFFNVYINKEKRLQLS 250 260 270 280 290 300 310 320 330 pF1KB9 RMLNLTDRQVKIWFQNRRMKEKKLNRDRLQYFTGNPLF :::::::::::::::::::::::::::::::::::::: NP_067 RMLNLTDRQVKIWFQNRRMKEKKLNRDRLQYFTGNPLF 310 320 330 >>NP_055027 (OMIM: 605559) homeobox protein Hox-C11 [Hom (304 aa) initn: 833 init1: 466 opt: 643 Z-score: 302.3 bits: 64.2 E(85289): 4.2e-10 Smith-Waterman score: 795; 44.5% identity (66.1% similar) in 339 aa overlap (3-338:21-304) 10 20 30 40 pF1KB9 MNDFDECGQSAASMYLPGCAYYVAPSDFASKPSFLSQPSSCQ :: : :. :...:::.:.::. : .:.. ::: : : : NP_055 MFNSVNLGNFCSPSRKERGADFGERGSCASNLYLPSCTYYM-P-EFSTVSSFLPQAPSRQ 10 20 30 40 50 50 60 70 80 90 100 pF1KB9 MTFPYSSNLAPHVQPVREVAFRDYGLE-RAKWPYRGGGGGGSAGGGSSGGGPGGGGGGAG ...:::. .: :::::. :::: .:: .:.. NP_055 ISYPYSA----QVPPVREVS---YGLEPSGKWHHRNS----------------------- 60 70 80 110 120 130 140 150 160 pF1KB9 GYAPYYAAAAAAAAAAAAAEEAAMQRELLPPAGRRPDVLFKAPEPVCAAPGPPHGPAGAA :. :::: . :.:: :::. ..:.: : .. : .: .. NP_055 -YSSCYAAA-----------DELMHRECLPPS-TVTEILMKN-EGSYGGHHHPSAPHATP 90 100 110 120 130 170 180 190 200 210 220 pF1KB9 SNFYSAVGRNGILPQGFDQFYEAAPGPPFAGPQPPPPPAPPQPEGAADKGDPRTGAGGGG ..:::.:..:..:::.::.:.. : . : :: : . .: : ::.:.. ..: NP_055 AGFYSSVNKNSVLPQAFDRFFDNA----YCGGGDPPAEPPCSGKGEA-KGEPEAPPASGL 140 150 160 170 180 230 240 250 260 270 pF1KB9 GSPCTKATPGSEPKGAAEGSGGDGEGPPGEAGAEKSSSAVAPQ--RSRKKRCPYTKYQIR .: .: :.: .. :... .. : .. : ...: ::. :.:::::::.:.::: NP_055 AS---RAEAGAEAEAEEENTNPSSSGSAHSVAKEPAKGA-APNAPRTRKKRCPYSKFQIR 190 200 210 220 230 240 280 290 300 310 320 330 pF1KB9 ELEREFFFNVYINKEKRLQLSRMLNLTDRQVKIWFQNRRMKEKKLNRDRLQYFTGNPLF :::::::::::::::::::::::::::::::::::::::::::::.:::::::.::::. NP_055 ELEREFFFNVYINKEKRLQLSRMLNLTDRQVKIWFQNRRMKEKKLSRDRLQYFSGNPLL 250 260 270 280 290 300 >>NP_005514 (OMIM: 142958,605432) homeobox protein Hox-A (313 aa) initn: 899 init1: 507 opt: 547 Z-score: 261.4 bits: 56.6 E(85289): 8.1e-08 Smith-Waterman score: 900; 48.3% identity (63.8% similar) in 356 aa overlap (3-338:2-313) 10 20 30 40 50 pF1KB9 MNDFDECGQSAASMYLPGCAYYVAPSDFASKPSFLSQ-PSSCQMTFPYSSNLAPHVQPVR :::: : ...::::.:.:::. ::.: :::: : ::: ::. ::::: :.::::: NP_005 MDFDERGPCSSNMYLPSCTYYVSGPDFSSLPSFLPQTPSSRPMTYSYSSNL-PQVQPVR 10 20 30 40 50 60 70 80 90 100 110 pF1KB9 EVAFRDYGLERA-KWPYRGGGGGGSAGGGSSGGGPGGGGGGAGGYAPYYAAAAAAAAAAA ::.::.:..: : :: ::. : NP_005 EVTFREYAIEPATKWHPRGN-----------------------------------LAHCY 60 70 80 120 130 140 150 160 170 pF1KB9 AAEEAAMQRELLPP--AGRRPDVLFKAPEPVCAAPGPPHGPAGAASNFYSAVGRNGILPQ .::: . . : : :: ::: :. : : : ...:::::.:::::.::: NP_005 SAEELVHRDCLQAPSAAGVPGDVLAKSSANVYHHPTP-----AVSSNFYSTVGRNGVLPQ 90 100 110 120 130 180 190 200 210 220 pF1KB9 GFDQFYEAAPGPP-------FAGPQ-----PPPPPAPPQPEGAADKGDPRTGA---GGGG .::::.:.: : : . : . :: : .:: : : :.. :::: NP_005 AFDQFFETAYGTPENLASSDYPGDKSAEKGPPAATATSAAAAAAATGAPATSSSDSGGGG 140 150 160 170 180 190 230 240 250 260 270 280 pF1KB9 GSPCTKATPGSEPKGAAEGSGGDGEGPPGEAG-AEKSSSAVAPQRSRKKRCPYTKYQIRE : : ... ..: : . .. .: . .: .: .... . ::.:::::::::::::: NP_005 G--CRETAAAAEEK-ERRRRPESSSSPESSSGHTEDKAGGSSGQRTRKKRCPYTKYQIRE 200 210 220 230 240 250 290 300 310 320 330 pF1KB9 LEREFFFNVYINKEKRLQLSRMLNLTDRQVKIWFQNRRMKEKKLNRDRLQYFTGNPLF :::::::.:::::::::::::::::::::::::::::::::::.:::::::...:::. NP_005 LEREFFFSVYINKEKRLQLSRMLNLTDRQVKIWFQNRRMKEKKINRDRLQYYSANPLL 260 270 280 290 300 310 >>NP_061824 (OMIM: 142957) homeobox protein Hox-A10 [Hom (410 aa) initn: 571 init1: 314 opt: 400 Z-score: 197.7 bits: 45.2 E(85289): 0.00029 Smith-Waterman score: 416; 33.6% identity (50.8% similar) in 354 aa overlap (66-335:73-406) 40 50 60 70 80 90 pF1KB9 SQPSSCQMTFPYSSNLAPHVQPVREVAFRDYGLERAK-WPYRGGGGGGSAGGGSSGGGPG :::. .: :: . .:. ::.::: : NP_061 GGGGGGAGGGGGGGYYAHGGVYLPPAADLPYGLQSCGLFPTLGGKRNEAASPGSGGGG-G 50 60 70 80 90 100 100 110 120 130 140 150 pF1KB9 GGGGGAGGYAPYYAAAAAAAAAAAAAEEAAMQRELLPPAGRRPDVLFKAPEPVCAAPGPP : : :: ::.: . .: . .. :: : : . : : : :: NP_061 GLGPGAHGYGP---------SPIDLWLDAPRSCRMEPPDGPPPPPQQQPPPP----PQPP 110 120 130 140 160 170 180 190 pF1KB9 H-GPAGAASNFYSAVGRNG--ILPQGFDQFYE----AAPGPPFAGPQPPPP--------- . .: ... .: . . ... : .. :. . :: :: :. ::: NP_061 QPAPQATSCSFAQNIKEESSYCLYDSADKCPKVSATAAELAPF--PRGPPPDGCALGTSS 150 160 170 180 190 200 200 210 220 230 240 pF1KB9 --PAP-----PQPEGAADKGDPRTGAGGGGGS-----PCTKATPG---SEPKGAAEGSG- :.: : :.: :: :.::::.. : :: . : . : ::. NP_061 GVPVPGYFRLSQAYGTA-KG---YGSGGGGAQQLGAGPFPAQPPGRGFDLPPALASGSAD 210 220 230 240 250 260 250 260 pF1KB9 ------------------GDGEGPPGEAGAEKSSSAV-----APQRS------------- :.: : :. :. ::::. ::..: NP_061 AARKERALDSPPPPTLACGSGGGSQGDEEAHASSSAAEELSPAPSESSKASPEKDSLGNS 270 280 290 300 310 320 270 280 290 300 310 pF1KB9 --------------RKKRCPYTKYQIRELEREFFFNVYINKEKRLQLSRMLNLTDRQVKI :::::::::.: :::.::.::.:...:.::..:: ..:::::::: NP_061 KGENAANWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISRSVHLTDRQVKI 330 340 350 360 370 380 320 330 pF1KB9 WFQNRRMKEKKLNRD-RLQYFTGNPLF :::::::: ::.::. :.. .:.: NP_061 WFQNRRMKLKKMNRENRIRELTANFNFS 390 400 410 >>NP_076922 (OMIM: 142964) homeobox protein Hox-B9 [Homo (250 aa) initn: 369 init1: 312 opt: 368 Z-score: 186.3 bits: 42.4 E(85289): 0.0012 Smith-Waterman score: 388; 38.2% identity (59.3% similar) in 204 aa overlap (137-328:48-247) 110 120 130 140 150 160 pF1KB9 YAAAAAAAAAAAAAEEAAMQRELLPPAGRRPDVLFKAPEPVCAAPGPPHGPAGAASNFYS :. :. :: .: : .: ::. NP_076 HESEDAPPAKFPSGQYASSRQPGHAEHLEFPSCSFQPKAPVFGASWAPLSPH--ASGSLP 20 30 40 50 60 70 170 180 190 200 210 220 pF1KB9 AVGRNGILPQGFDQFYEAAPGPPFAGPQPPPPPAPPQPEGAADKGDPRTGAGG---GGGS .: . : ::: :. . : : :: : . :: :..: :: : :. NP_076 SVYHPYIQPQGVPP-AESRYLRTWLEPAPRGEAAPGQGQ-AAVKAEPLLGAPGELLKQGT 80 90 100 110 120 130 230 240 250 260 270 pF1KB9 P--CTKATPGSEPKGAAEGSG-GDGEGPPGEAGAEK------SSSAVAPQRSRKKRCPYT : ... : : . . : ::.. : :. :.. . . ::::::::: NP_076 PEYSLETSAGREAVLSNQRPGYGDNKICEGSEDKERPDQTNPSANWLHARSSRKKRCPYT 140 150 160 170 180 190 280 290 300 310 320 330 pF1KB9 KYQIRELEREFFFNVYINKEKRLQLSRMLNLTDRQVKIWFQNRRMKEKKLNRDRLQYFTG ::: :::.::.::.:.....: ...:.:::..::::::::::::: ::.:... NP_076 KYQTLELEKEFLFNMYLTRDRRHEVARLLNLSERQVKIWFQNRRMKMKKMNKEQGKE 200 210 220 230 240 250 pF1KB9 NPLF >>NP_059105 (OMIM: 605560) homeobox protein Hox-C10 [Hom (342 aa) initn: 391 init1: 318 opt: 362 Z-score: 182.3 bits: 42.1 E(85289): 0.002 Smith-Waterman score: 362; 34.9% identity (60.0% similar) in 235 aa overlap (112-335:115-338) 90 100 110 120 130 pF1KB9 GSAGGGSSGGGPGGGGGGAGGYAPYYAAAAAAAAAAAAAEEAAMQRELLPPA--GRR--P .: : .. :::. . :: . :.. : NP_059 AYRLEQPVGRPLSSCSYPPSVKEENVCCMYSAEKRAKSGPEAALYSHPLPESCLGEHEVP 90 100 110 120 130 140 140 150 160 170 180 190 pF1KB9 -DVLFKAPEPVCAAPGPPHGPAGAASNFYSAV-GRNGILPQGFDQFYEAAPGPPFAGPQP ..: : :: ..:..: . : .. :.. : .: ..: NP_059 VPSYYRASPSYSALDKTPH--CSGANDFEAPFEQRASLNPRA-----EHLESPQLGGKVS 150 160 170 180 190 200 210 220 230 240 250 pF1KB9 -PPPPAPPQPEGAADKGDPRTGAGGGGGSPCTKATPGSEPKGAAEGSG--GDGEGPPGEA : : . . .. . . .: ::: . .: ::..: .:.:. : NP_059 FPETPKSDSQTPSPNEIKTEQSLAGPKGSPSESE---KERAKAADSSPDTSDNEAKE-EI 200 210 220 230 240 250 260 270 280 290 300 310 pF1KB9 GAEKSSSAVAPQRS-RKKRCPYTKYQIRELEREFFFNVYINKEKRLQLSRMLNLTDRQVK ::.... .: :::::::::.: :::.::.::.:...:.::..:. .:::::::: NP_059 KAENTTGNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISKTINLTDRQVK 260 270 280 290 300 310 320 330 pF1KB9 IWFQNRRMKEKKLNRD-RLQYFTGNPLF ::::::::: ::.::. :.. .:.: NP_059 IWFQNRRMKLKKMNRENRIRELTSNFNFT 320 330 340 >>NP_689952 (OMIM: 142956) homeobox protein Hox-A9 [Homo (272 aa) initn: 394 init1: 319 opt: 350 Z-score: 178.3 bits: 41.1 E(85289): 0.0034 Smith-Waterman score: 368; 40.2% identity (66.9% similar) in 169 aa overlap (175-328:103-268) 150 160 170 180 190 200 pF1KB9 EPVCAAPGPPHGPAGAASNFYSAVGRNGILPQG--FDQFYEAAPGP-PFAG-PQPPPPPA :.: . .. : .:: ::: :. : NP_689 AGANAVPAAVYHHHHHHPYVHPQAPVAAAAPDGRYMRSWLEPTPGALSFAGLPSSRPYGI 80 90 100 110 120 130 210 220 230 240 pF1KB9 PPQPEGAADKGD-PRTGAGG------GGGSPCT----KATPGSEPKGAAEGSGGDGEGPP :.: .: .:: : . . ::: . . . :. .. ::. .: :. :: NP_689 KPEPL-SARRGDCPTLDTHTLSLTDYACGSPPVDREKQPSEGAFSENNAENESG-GDKPP 140 150 160 170 180 190 250 260 270 280 290 300 pF1KB9 GEAGAEKSSSAVAPQRSRKKRCPYTKYQIRELEREFFFNVYINKEKRLQLSRMLNLTDRQ . . . ... . . .:::::::::.: :::.::.::.:.....: ...:.::::.:: NP_689 IDPN-NPAANWLHARSTRKKRCPYTKHQTLELEKEFLFNMYLTRDRRYEVARLLNLTERQ 200 210 220 230 240 310 320 330 pF1KB9 VKIWFQNRRMKEKKLNRDRLQYFTGNPLF ::::::::::: ::.:.:: NP_689 VKIWFQNRRMKMKKINKDRAKDE 250 260 270 >>NP_055028 (OMIM: 142982) homeobox protein Hox-D9 [Homo (352 aa) initn: 467 init1: 310 opt: 343 Z-score: 174.1 bits: 40.7 E(85289): 0.0058 Smith-Waterman score: 404; 29.7% identity (52.3% similar) in 333 aa overlap (8-328:55-347) 10 20 30 pF1KB9 MNDFDECGQSAASMYLPGCAYYVAPSDFASKPSFLSQ : .:.. . .:.. :: . . . :. . NP_055 LIGHEGDEVFAARFGPPGPGAQGRPAGVADGPAATAAEFASCSF--APRSAVFSASWSAV 30 40 50 60 70 80 40 50 60 70 80 90 pF1KB9 PSSCQMTFPYSSNLAPHVQPVREVAF-RDYGLERAKW--PYRGGGGGGSAGGGSSGGGPG ::. . .:. :.: : .: . : .: : : ::...:::..::::: NP_055 PSQPPAAAAMSGLYHPYVPPPPLAASASEPGRYVRSWMEPLPGFPGGAGGGGGGGGGGPG 90 100 110 120 130 140 100 110 120 130 140 150 pF1KB9 -GGGGGAGGYAPYYAAAAAAAAAAAAAEEAAMQRELLPPAGRRPDVLFKAPEPVCAAPGP : . : .: : ::. . :: :::.: NP_055 RGPSPGPSG----------------------------PANGRHYGI---KPETR-AAPAP 150 160 170 160 170 180 190 200 210 pF1KB9 PHGPAGAASNFYSAVGRNGILPQGFDQFYEAAPGPPFAGPQPPPPPAPPQPEGAADKGDP . . ..:. : . . . . ... :: :. . : ..:: : NP_055 ATAASTTSSSSTSLSSSSKRTECSVARESQGSSGPEFSCN------SFLQEKAAAATGGT 180 190 200 210 220 220 230 240 250 260 pF1KB9 RTGAG-----GGGGSPCTKATPGSEPKGAAEGSGGDGEGPPGEAGAEKSSSA---VAPQR ::: : ::: .: : . .. : . . .. : . . NP_055 GPGAGIGAATGTGGSSEPSACSDHPIPGCSLKEEEKQHSQPQQQQLDPNNPAANWIHARS 230 240 250 260 270 280 270 280 290 300 310 320 pF1KB9 SRKKRCPYTKYQIRELEREFFFNVYINKEKRLQLSRMLNLTDRQVKIWFQNRRMKEKKLN .::::::::::: :::.::.::.:.....: ...:.::::.::::::::::::: ::.. NP_055 TRKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARILNLTERQVKIWFQNRRMKMKKMS 290 300 310 320 330 340 330 pF1KB9 RDRLQYFTGNPLF ... NP_055 KEKCPKGD 350 >>NP_008828 (OMIM: 142971) homeobox protein Hox-C9 [Homo (260 aa) initn: 362 init1: 312 opt: 339 Z-score: 173.8 bits: 40.2 E(85289): 0.0061 Smith-Waterman score: 357; 34.5% identity (57.4% similar) in 235 aa overlap (116-328:30-254) 90 100 110 120 130 140 pF1KB9 GGSSGGGPGGGGGGAGGYAPYYAAAAAAAAAAAAAEEAAMQRELLPPAGRRPDVLFKAPE :..: :: :.: . :. : ::. NP_008 MSATGPISNYYVDSLISHDNEDLLASRFPATGAHPAAARPSGLVPDCSDFPSCSF-APK 10 20 30 40 50 150 160 170 180 pF1KB9 PVCA----APGP--------PHGPA---GAASNFYSAVGRNGILP-QGFDQFYEAAPGPP :. :: : :.:: :: . .. :. . : .: .: :. : NP_008 PAVFSTSWAPVPSQSSVVYHPYGPQPHLGADTRYM----RTWLEPLSGAVSF----PSFP 60 70 80 90 100 110 190 200 210 220 230 240 pF1KB9 FAGPQPP-PPPAPPQPEGAADKGDPRTGAGGGGGSP--CTKATPGSEPKGAAEGSGGDGE .: . : : : .. :. :. ::: .: . :. :.. .:. . NP_008 AGGRHYALKPDAYPGRRADCGPGEGRSYPDYMYGSPGELRDRAPQTLPSPEADALAGS-K 120 130 140 150 160 250 260 270 280 290 300 pF1KB9 GPPGEAGAEKSSSA---VAPQRSRKKRCPYTKYQIRELEREFFFNVYINKEKRLQLSRML .: . :. . . . .::::::::::: :::.::.::.:.....: ...:.: NP_008 HKEEKADLDPSNPVANWIHARSTRKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARVL 170 180 190 200 210 220 310 320 330 pF1KB9 NLTDRQVKIWFQNRRMKEKKLNRDRLQYFTGNPLF :::.::::::::::::: ::.:... NP_008 NLTERQVKIWFQNRRMKMKKMNKEKTDKEQS 230 240 250 260 338 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:26:28 2016 done: Tue Nov 8 04:26:29 2016 Total Scan time: 6.730 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]