FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3544, 412 aa 1>>>pF1KE3544 412 - 412 aa - 412 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.9334+/-0.00037; mu= -12.5184+/- 0.023 mean_var=390.3430+/-82.584, 0's: 0 Z-trim(124.8): 13 B-trim: 1587 in 1/60 Lambda= 0.064916 statistics sampled from 47272 (47302) to 47272 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.822), E-opt: 0.2 (0.555), width: 16 Scan time: 8.450 The best scores are: opt bits E(85289) NP_003646 (OMIM: 603079) E3 SUMO-protein ligase CB ( 560) 432 54.1 1e-06 XP_006724240 (OMIM: 608457) PREDICTED: chromobox p ( 192) 415 52.1 1.4e-06 XP_005261470 (OMIM: 608457) PREDICTED: chromobox p ( 158) 392 49.9 5.3e-06 XP_006724239 (OMIM: 608457) PREDICTED: chromobox p ( 200) 393 50.1 5.9e-06 XP_016884168 (OMIM: 608457) PREDICTED: chromobox p ( 250) 395 50.3 6.2e-06 NP_783640 (OMIM: 608457) chromobox protein homolog ( 251) 395 50.3 6.2e-06 NP_005180 (OMIM: 602770,613080) chromobox protein ( 532) 390 50.1 1.5e-05 XP_011523684 (OMIM: 602770,613080) PREDICTED: chro ( 586) 390 50.1 1.7e-05 XP_011528327 (OMIM: 608457) PREDICTED: chromobox p ( 165) 366 47.5 3e-05 XP_006724241 (OMIM: 608457) PREDICTED: chromobox p ( 166) 366 47.5 3e-05 XP_006724238 (OMIM: 608457) PREDICTED: chromobox p ( 258) 368 47.8 3.7e-05 XP_006724237 (OMIM: 608457) PREDICTED: chromobox p ( 259) 368 47.8 3.7e-05 NP_116036 (OMIM: 602770,613080) chromobox protein ( 211) 341 45.2 0.00018 >>NP_003646 (OMIM: 603079) E3 SUMO-protein ligase CBX4 [ (560 aa) initn: 586 init1: 398 opt: 432 Z-score: 241.4 bits: 54.1 E(85289): 1e-06 Smith-Waterman score: 437; 30.2% identity (55.1% similar) in 394 aa overlap (1-385:1-355) 10 20 30 40 50 60 pF1KE3 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQK ::: ::::.:::.::: :.::::::.::::::.::. ::.:::::::::: ::. ::... NP_003 MELPAVGEHVFAVESIEKKRIRKGRVEYLVKWRGWSPKYNTWEPEENILDPRLLIAFQNR 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 ERERELYGPKKRGPKPKTFLLKARAQAEALRISDVHFSVKPSASASSPKLHSSAAV---- ::...:.: .::::::: ..... . : : :.: ... :.. . :: .: NP_003 ERQEQLMGYRKRGPKPKPLVVQVPTFA---RRSNVLTGLQDSSTDNRAKLDLGAQGKGQG 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 HRLKKDIRRCHRMSRRPLPRPDPQGGSPGLRPPISPFSETVRIINRKVKPREPKRNRIIL :. . . .. :... :. ..:.: :: . . .. ..: .: .: NP_003 HQYELNSKKHHQYQ----PHSKERAGKPP--PPGKSGKYYYQLNSKKHHPYQP------- 120 130 140 150 160 180 190 200 210 220 230 pF1KE3 NLKVIDKGAGGGGAGQGAGALARPKVPSRNRVIGKSKKFSESVLRTQIRHMKFGAFALYK . :. : :: . : :. .: ... .. . . .:: NP_003 DPKMYDLQYQGGHKEAPS-----PTCPD----LGAKSHPPDKWAQGAGAKGYLGAVK--- 170 180 190 200 210 240 250 260 270 280 290 pF1KE3 PPPAPLVAPSPGKAEASAPGPGLLLAAPAAPYDARSSGSSGCPSPTPQSSDPDDTPPKLL : .: .:::. ..: :.. :: : .: .... : .... . .. NP_003 --PLAGAAGAPGKGSEKGPPNGMM---PA-PKEAVTGNGIGGKMKIVKNKNKNGRIVIVM 220 230 240 250 260 300 310 320 330 340 350 pF1KE3 PETVSPS--APSWREPEVLDLSLPPESAATSKRAPPEVTAAAGPAPPTAPEPAGASSEP- . . . : . . :: . .: . .::: : :: : . ::: . NP_003 SKYMENGMQAVKIKSGEVAEGE--ARSPSHKKRAADE---RHPPADRTFKKAAGAEEKKV 270 280 290 300 310 320 360 370 380 390 400 410 pF1KE3 EAGDWR--PEMSPCSNVVVTDVTSNLLTVTIKEFCNPEDFEKVAAGVAGAAGGGGSIGAS :: : :.: :. :. : :. : . : NP_003 EAPPKRREEEVSGVSDPQPQDAGSRKLSPTKEAFGEQPLQLTTKPDLLAWDPARNTHPPS 330 340 350 360 370 380 pF1KE3 K NP_003 HHPHPHPHHHHHHHHHHHHAVGLNLSHVRKRCLSETHGEREPCKKRLTARSISTPTCLGG 390 400 410 420 430 440 >>XP_006724240 (OMIM: 608457) PREDICTED: chromobox prote (192 aa) initn: 543 init1: 411 opt: 415 Z-score: 239.2 bits: 52.1 E(85289): 1.4e-06 Smith-Waterman score: 419; 47.7% identity (68.2% similar) in 151 aa overlap (1-149:1-137) 10 20 30 40 50 60 pF1KE3 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQK :::::.::.:::.::: :.:.:::..::::::::: :::::::::.::: ::. :.:.: XP_006 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 ERERELYGPKKRGPKPKTFLLKARAQAEALRISDVHFSVKPSASASSPKLHSSAAVHRLK :.. . : .::::::: .::. : . ::.: .: . .:.:.: . :: XP_006 EERDRASGYRKRGPKPKRLLLQPRKAHKYLRLSRKKFPPR------GPNLESHS--HR-- 70 80 90 100 110 130 140 150 160 170 pF1KE3 KDIRRCHRMSRRPLPRPDPQGGS--PGLRPPISPFSETVRIINRKVKPREPKRNRIILNL : ... : : .: :. .:: XP_006 ----RELFLQEPPAPDVLQAAGEWEPAAQPPEEEADADLAEGPPPWTPALPSSEVTVTDI 120 130 140 150 160 180 190 200 210 220 230 pF1KE3 KVIDKGAGGGGAGQGAGALARPKVPSRNRVIGKSKKFSESVLRTQIRHMKFGAFALYKPP XP_006 TANSITVTFREAQAAEGFFRDRSGKF 170 180 190 >>XP_005261470 (OMIM: 608457) PREDICTED: chromobox prote (158 aa) initn: 497 init1: 391 opt: 392 Z-score: 228.8 bits: 49.9 E(85289): 5.3e-06 Smith-Waterman score: 392; 61.5% identity (83.5% similar) in 91 aa overlap (1-91:1-91) 10 20 30 40 50 60 pF1KE3 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQK :::::.::.:::.::: :.:.:::..::::::::: :::::::::.::: ::. :.:.: XP_005 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 ERERELYGPKKRGPKPKTFLLKARAQAEALRISDVHFSVKPSASASSPKLHSSAAVHRLK :.. . : .::::::: .::. ..:. XP_005 EERDRASGYRKRGPKPKRLLLQEPPAPDVLQAAGEWEPAAQPPEEEADADLAEGPPPWTP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 KDIRRCHRMSRRPLPRPDPQGGSPGLRPPISPFSETVRIINRKVKPREPKRNRIILNLKV XP_005 ALPSSEVTVTDITANSITVTFREAQAAEGFFRDRSGKF 130 140 150 >>XP_006724239 (OMIM: 608457) PREDICTED: chromobox prote (200 aa) initn: 521 init1: 318 opt: 393 Z-score: 227.8 bits: 50.1 E(85289): 5.9e-06 Smith-Waterman score: 398; 46.2% identity (64.4% similar) in 160 aa overlap (1-149:1-145) 10 20 30 40 50 60 pF1KE3 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQK :::::.::.:::.::: :.:.:::..::::::::: :::::::::.::: ::. :.:.: XP_006 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 ---------ERERELYGPKKRGPKPKTFLLKARAQAEALRISDVHFSVKPSASASSPKLH ::.: : .::::::: .::. : . ::.: .: . .:.:. XP_006 YGVLGTAGEERDRA-SGYRKRGPKPKRLLLQPRKAHKYLRLSRKKFPPR------GPNLE 70 80 90 100 110 120 130 140 150 160 pF1KE3 SSAAVHRLKKDIRRCHRMSRRPLPRPDPQGGS--PGLRPPISPFSETVRIINRKVKPREP : . :: : ... : : .: :. .:: XP_006 SHS--HR------RELFLQEPPAPDVLQAAGEWEPAAQPPEEEADADLAEGPPPWTPALP 120 130 140 150 160 170 180 190 200 210 220 pF1KE3 KRNRIILNLKVIDKGAGGGGAGQGAGALARPKVPSRNRVIGKSKKFSESVLRTQIRHMKF XP_006 SSEVTVTDITANSITVTFREAQAAEGFFRDRSGKF 170 180 190 200 >>XP_016884168 (OMIM: 608457) PREDICTED: chromobox prote (250 aa) initn: 507 init1: 391 opt: 395 Z-score: 227.5 bits: 50.3 E(85289): 6.2e-06 Smith-Waterman score: 400; 43.8% identity (65.1% similar) in 169 aa overlap (1-156:1-165) 10 20 30 40 50 60 pF1KE3 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQK :::::.::.:::.::: :.:.:::..::::::::: :::::::::.::: ::. :.:.: XP_016 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 ERERELYGPKKRGPKPKTFLLKARAQAEALRISD-------VHFSVKPSASASSPKLHSS :.. . : .::::::: .::. : . :: : . ::. ...::. . XP_016 EERDRASGYRKRGPKPKRLLLQ-RLYSMDLRSSHKAKGKEKLCFSLTCPLGSGSPEGVVK 70 80 90 100 110 120 130 140 150 160 pF1KE3 AAVHRLKKDIRRCHRMSRRPLPRPDPQGGSPGLR------PPISPFSETVRIINRKVKPR :.. .: . . . :.: :. . :: :: .: :. XP_016 AGAPEL---VDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQE 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE3 EPKRNRIILNLKVIDKGAGGGGAGQGAGALARPKVPSRNRVIGKSKKFSESVLRTQIRHM XP_016 PPAPDVLQAAGEWEPAAQPPEEEDADLAEGPPPWTPALPSSEVTVTDITANSITVTFREA 180 190 200 210 220 230 >>NP_783640 (OMIM: 608457) chromobox protein homolog 7 [ (251 aa) initn: 503 init1: 391 opt: 395 Z-score: 227.5 bits: 50.3 E(85289): 6.2e-06 Smith-Waterman score: 400; 43.8% identity (65.1% similar) in 169 aa overlap (1-156:1-165) 10 20 30 40 50 60 pF1KE3 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQK :::::.::.:::.::: :.:.:::..::::::::: :::::::::.::: ::. :.:.: NP_783 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 ERERELYGPKKRGPKPKTFLLKARAQAEALRISD-------VHFSVKPSASASSPKLHSS :.. . : .::::::: .::. : . :: : . ::. ...::. . NP_783 EERDRASGYRKRGPKPKRLLLQ-RLYSMDLRSSHKAKGKEKLCFSLTCPLGSGSPEGVVK 70 80 90 100 110 120 130 140 150 160 pF1KE3 AAVHRLKKDIRRCHRMSRRPLPRPDPQGGSPGLR------PPISPFSETVRIINRKVKPR :.. .: . . . :.: :. . :: :: .: :. NP_783 AGAPEL---VDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQE 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE3 EPKRNRIILNLKVIDKGAGGGGAGQGAGALARPKVPSRNRVIGKSKKFSESVLRTQIRHM NP_783 PPAPDVLQAAGEWEPAAQPPEEEADADLAEGPPPWTPALPSSEVTVTDITANSITVTFRE 180 190 200 210 220 230 >>NP_005180 (OMIM: 602770,613080) chromobox protein homo (532 aa) initn: 576 init1: 319 opt: 390 Z-score: 220.4 bits: 50.1 E(85289): 1.5e-05 Smith-Waterman score: 399; 32.0% identity (54.8% similar) in 372 aa overlap (2-344:3-347) 10 20 30 40 50 pF1KE3 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQ :::.:::.::::: :...:.:::..::::::.::. :...::::::::: ::. ::.. NP_005 MEELSSVGEQVFAAECILSKRLRKGKLEYLVKWRGWSSKHNSWEPEENILDPRLLLAFQK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 KERERELYGPKKRGPKPKTFLLKARAQAEALRIS-----DVHFSVKPSASASSPKLHSSA ::.:.:. . .::: .:. : :.. : : :. . : :.:.:: ::. NP_005 KEHEKEVQN-RKRGKRPRGRPRKLTAMSSCSRRSKLKEPDAPSKSKSSSSSSSSTSSSSS 70 80 90 100 110 120 130 140 150 160 pF1KE3 AVHRLKKDI--RRCHRMSR-RPLPRPDPQG--GSPGLRPPISPFSETVRIINRKVKPREP . .. .:. .: : . .:.:. : ..: :. :: . .:: : : NP_005 SDEEDDSDLDAKRGPRGRETHPVPQKKAQILVAKPELKDPIR------KKRGRKPLPPEQ 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE3 KRNRIILNLKVIDKGAGGGGAGQGAGALARPKVPSRNRVIGKSKKFSESVLRTQIRHMKF : .: ..: . : : :: : : : : : . .:... .. NP_005 KATRRPVSLAKVLKTAR---KDLGAPASKLPP-PLSAPVAGLA------ALKAHAKEACG 180 190 200 210 220 230 240 250 260 270 pF1KE3 GAFALYKPPP-APLV---APSPGKAEASAPGPGLLLA-------APAAPYDA-RSSGSSG : :. : : :. : :::.. : . . : :: : ...... NP_005 GPSAMATPENLASLMKGMASSPGRGGISWQSSIVHYMNRMTQSQAQAASRLALKAQATNK 230 240 250 260 270 280 280 290 300 310 320 330 pF1KE3 CP-----SPTPQSSDPDDTPPKLLPETVSPSAPSWR--EPEVLDLSLPPESAATSKRAPP : . :... .:: . :.::: : .: . . ::.. ..: :.: NP_005 CGLGLDLKVRTQKGELGMSPPG----SKIPKAPSGGAVEQKVGNTGGPPHTHGAS-RVP- 290 300 310 320 330 340 350 360 370 380 390 pF1KE3 EVTAAAGPAPPTAPEPAGASSEPEAGDWRPEMSPCSNVVVTDVTSNLLTVTIKEFCNPED :. :.: :: NP_005 ----AGCPGPQPAPTQELSLQVLDLQSVKNGMPGVGLLARHATATKGVPATNPAPGKGTG 340 350 360 370 380 390 >>XP_011523684 (OMIM: 602770,613080) PREDICTED: chromobo (586 aa) initn: 576 init1: 319 opt: 390 Z-score: 219.8 bits: 50.1 E(85289): 1.7e-05 Smith-Waterman score: 399; 32.0% identity (54.8% similar) in 372 aa overlap (2-344:3-347) 10 20 30 40 50 pF1KE3 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQ :::.:::.::::: :...:.:::..::::::.::. :...::::::::: ::. ::.. XP_011 MEELSSVGEQVFAAECILSKRLRKGKLEYLVKWRGWSSKHNSWEPEENILDPRLLLAFQK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 KERERELYGPKKRGPKPKTFLLKARAQAEALRIS-----DVHFSVKPSASASSPKLHSSA ::.:.:. . .::: .:. : :.. : : :. . : :.:.:: ::. XP_011 KEHEKEVQN-RKRGKRPRGRPRKLTAMSSCSRRSKLKEPDAPSKSKSSSSSSSSTSSSSS 70 80 90 100 110 120 130 140 150 160 pF1KE3 AVHRLKKDI--RRCHRMSR-RPLPRPDPQG--GSPGLRPPISPFSETVRIINRKVKPREP . .. .:. .: : . .:.:. : ..: :. :: . .:: : : XP_011 SDEEDDSDLDAKRGPRGRETHPVPQKKAQILVAKPELKDPIR------KKRGRKPLPPEQ 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE3 KRNRIILNLKVIDKGAGGGGAGQGAGALARPKVPSRNRVIGKSKKFSESVLRTQIRHMKF : .: ..: . : : :: : : : : : . .:... .. XP_011 KATRRPVSLAKVLKTAR---KDLGAPASKLPP-PLSAPVAGLA------ALKAHAKEACG 180 190 200 210 220 230 240 250 260 270 pF1KE3 GAFALYKPPP-APLV---APSPGKAEASAPGPGLLLA-------APAAPYDA-RSSGSSG : :. : : :. : :::.. : . . : :: : ...... XP_011 GPSAMATPENLASLMKGMASSPGRGGISWQSSIVHYMNRMTQSQAQAASRLALKAQATNK 230 240 250 260 270 280 280 290 300 310 320 330 pF1KE3 CP-----SPTPQSSDPDDTPPKLLPETVSPSAPSWR--EPEVLDLSLPPESAATSKRAPP : . :... .:: . :.::: : .: . . ::.. ..: :.: XP_011 CGLGLDLKVRTQKGELGMSPPG----SKIPKAPSGGAVEQKVGNTGGPPHTHGAS-RVP- 290 300 310 320 330 340 350 360 370 380 390 pF1KE3 EVTAAAGPAPPTAPEPAGASSEPEAGDWRPEMSPCSNVVVTDVTSNLLTVTIKEFCNPED :. :.: :: XP_011 ----AGCPGPQPAPTQELSLQVLDLQSVKNGMPGVGLLARHATATKGVPATNPAPGKGTG 340 350 360 370 380 390 >>XP_011528327 (OMIM: 608457) PREDICTED: chromobox prote (165 aa) initn: 485 init1: 318 opt: 366 Z-score: 215.3 bits: 47.5 E(85289): 3e-05 Smith-Waterman score: 371; 58.0% identity (76.0% similar) in 100 aa overlap (1-91:1-99) 10 20 30 40 50 60 pF1KE3 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQK :::::.::.:::.::: :.:.:::..::::::::: :::::::::.::: ::. :.:.: XP_011 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 ---------ERERELYGPKKRGPKPKTFLLKARAQAEALRISDVHFSVKPSASASSPKLH ::.: : .::::::: .::. ..:. XP_011 YGVLGTAGEERDRA-SGYRKRGPKPKRLLLQEPPAPDVLQAAGEWEPAAQPPEEEDADLA 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 SSAAVHRLKKDIRRCHRMSRRPLPRPDPQGGSPGLRPPISPFSETVRIINRKVKPREPKR XP_011 EGPPPWTPALPSSEVTVTDITANSITVTFREAQAAEGFFRDRSGKF 120 130 140 150 160 >>XP_006724241 (OMIM: 608457) PREDICTED: chromobox prote (166 aa) initn: 475 init1: 318 opt: 366 Z-score: 215.3 bits: 47.5 E(85289): 3e-05 Smith-Waterman score: 371; 58.0% identity (76.0% similar) in 100 aa overlap (1-91:1-99) 10 20 30 40 50 60 pF1KE3 MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQK :::::.::.:::.::: :.:.:::..::::::::: :::::::::.::: ::. :.:.: XP_006 MELSAIGEQVFAVESIRKKRVRKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEK 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 ---------ERERELYGPKKRGPKPKTFLLKARAQAEALRISDVHFSVKPSASASSPKLH ::.: : .::::::: .::. ..:. XP_006 YGVLGTAGEERDRA-SGYRKRGPKPKRLLLQEPPAPDVLQAAGEWEPAAQPPEEEADADL 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 SSAAVHRLKKDIRRCHRMSRRPLPRPDPQGGSPGLRPPISPFSETVRIINRKVKPREPKR XP_006 AEGPPPWTPALPSSEVTVTDITANSITVTFREAQAAEGFFRDRSGKF 120 130 140 150 160 412 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 10:17:08 2016 done: Tue Nov 8 10:17:10 2016 Total Scan time: 8.450 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]