FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8335, 440 aa 1>>>pF1KB8335 440 - 440 aa - 440 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.7964+/-0.000354; mu= 15.6325+/- 0.022 mean_var=88.6486+/-18.433, 0's: 0 Z-trim(115.6): 182 B-trim: 214 in 1/54 Lambda= 0.136219 statistics sampled from 25957 (26148) to 25957 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.677), E-opt: 0.2 (0.307), width: 16 Scan time: 7.040 The best scores are: opt bits E(85289) NP_056980 (OMIM: 607070,615444) zinc finger MYND d ( 440) 2968 593.4 3.9e-169 NP_001295308 (OMIM: 607070,615444) zinc finger MYN ( 435) 2716 543.8 3.2e-154 XP_005265273 (OMIM: 607070,615444) PREDICTED: zinc ( 361) 2467 494.9 1.5e-139 NP_787127 (OMIM: 603870) protein CBFA2T3 isoform 2 ( 567) 160 41.6 0.0063 XP_005256380 (OMIM: 603870) PREDICTED: protein CBF ( 628) 160 41.6 0.0068 NP_005178 (OMIM: 603870) protein CBFA2T3 isoform 1 ( 653) 160 41.7 0.0071 >>NP_056980 (OMIM: 607070,615444) zinc finger MYND domai (440 aa) initn: 2968 init1: 2968 opt: 2968 Z-score: 3157.4 bits: 593.4 E(85289): 3.9e-169 Smith-Waterman score: 2968; 99.3% identity (99.8% similar) in 440 aa overlap (1-440:1-440) 10 20 30 40 50 60 pF1KB8 MGDLELLLPGEAEVLVRGLRSFPLREMGSEGWNQRHENLEKLNMQAILDATVSQGEPIQE ::::::::::::::::::::::::::::::::::.::::::::::::::::::::::::: NP_056 MGDLELLLPGEAEVLVRGLRSFPLREMGSEGWNQQHENLEKLNMQAILDATVSQGEPIQE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 LLVTHGKVPTLVEELIAVEMWKQKVFPVFCRVEDFKPQNTFPIYMVVHHEASIINLLETV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 LLVTHGKVPTLVEELIAVEMWKQKVFPVFCRVEDFKPQNTFPIYMVVHHEASIINLLETV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 FFHKEVCESAEDTVLDLVDYCHRKLTLLVAQSGCGGPPEGEGSQDSNPMQELQKQAELME :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 FFHKEVCESAEDTVLDLVDYCHRKLTLLVAQSGCGGPPEGEGSQDSNPMQELQKQAELME 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 FEIALKALSVLRYITDCVDSLSLSTLSRMLSAHNLPCLLVELLEHSPWSRREGGKLQQFE :::::::::::::::::::::::::::::::.:::::::::::::::::::::::::::: NP_056 FEIALKALSVLRYITDCVDSLSLSTLSRMLSTHNLPCLLVELLEHSPWSRREGGKLQQFE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 GSRWHTVAPSEQQKLSKLDGQVWIALYNLLLSPEAQARYCLTSFAKGRLLKLRAFLTDTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 GSRWHTVAPSEQQKLSKLDGQVWIALYNLLLSPEAQARYCLTSFAKGRLLKLRAFLTDTL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 LDQLPNLAHLQSFLAHLTLTETQPPKKDLVLEQIPEIWERLERENRGKWQAIAKHQLQHV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 LDQLPNLAHLQSFLAHLTLTETQPPKKDLVLEQIPEIWERLERENRGKWQAIAKHQLQHV 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB8 FSPSEQDLRLQARRWAETYRLDVLEAVAPERPRCAYCSAEASKRCSRCQNEWYCCRECQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 FSPSEQDLRLQARRWAETYRLDVLEAVAPERPRCAYCSAEASKRCSRCQNEWYCCRECQV 370 380 390 400 410 420 430 440 pF1KB8 KHWEKHGKTCVPAAQGDRAK ::::::::::: :::::::: NP_056 KHWEKHGKTCVLAAQGDRAK 430 440 >>NP_001295308 (OMIM: 607070,615444) zinc finger MYND do (435 aa) initn: 2735 init1: 1429 opt: 2716 Z-score: 2889.9 bits: 543.8 E(85289): 3.2e-154 Smith-Waterman score: 2722; 92.4% identity (94.2% similar) in 447 aa overlap (1-440:1-435) 10 20 30 40 50 60 pF1KB8 MGDLELLLPGEAEVLVRGLRSFPLREMGSEGWNQRHENLEKLNMQAILDATVSQGEPIQE ::::::::::::::::::::::::::::::::::.::::::::::::::::::::::::: NP_001 MGDLELLLPGEAEVLVRGLRSFPLREMGSEGWNQQHENLEKLNMQAILDATVSQGEPIQE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 LLVTHGKVPTLVEELIAVEMWKQKVFPVFCRVEDFKPQNTFPIYMVVHHEASIINLLETV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LLVTHGKVPTLVEELIAVEMWKQKVFPVFCRVEDFKPQNTFPIYMVVHHEASIINLLETV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 FFHKEVCESAEDTVLDLVDYCHRKLTLLVAQSGCGGPPEGEGSQDSNPMQELQKQAELME :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FFHKEVCESAEDTVLDLVDYCHRKLTLLVAQSGCGGPPEGEGSQDSNPMQELQKQAELME 130 140 150 160 170 180 190 200 210 220 230 pF1KB8 FEIALKALSVLRYITDCVD---SLS----LSTLSRMLSAHNLPCLLVELLEHSPWSRREG ::::::::::::::::::: :.: :. :.:. : . : .. :: NP_001 FEIALKALSVLRYITDCVDRQWSVSQPPQLAHLKRIQRLHPV-CWFL-----SP------ 190 200 210 220 240 250 260 270 280 290 pF1KB8 GKLQQFEGSRWHTVAPSEQQKLSKLDGQVWIALYNLLLSPEAQARYCLTSFAKGRLLKLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GKLQQFEGSRWHTVAPSEQQKLSKLDGQVWIALYNLLLSPEAQARYCLTSFAKGRLLKLR 230 240 250 260 270 280 300 310 320 330 340 350 pF1KB8 AFLTDTLLDQLPNLAHLQSFLAHLTLTETQPPKKDLVLEQIPEIWERLERENRGKWQAIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AFLTDTLLDQLPNLAHLQSFLAHLTLTETQPPKKDLVLEQIPEIWERLERENRGKWQAIA 290 300 310 320 330 340 360 370 380 390 400 410 pF1KB8 KHQLQHVFSPSEQDLRLQARRWAETYRLDVLEAVAPERPRCAYCSAEASKRCSRCQNEWY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KHQLQHVFSPSEQDLRLQARRWAETYRLDVLEAVAPERPRCAYCSAEASKRCSRCQNEWY 350 360 370 380 390 400 420 430 440 pF1KB8 CCRECQVKHWEKHGKTCVPAAQGDRAK :::::::::::::::::: :::::::: NP_001 CCRECQVKHWEKHGKTCVLAAQGDRAK 410 420 430 >>XP_005265273 (OMIM: 607070,615444) PREDICTED: zinc fin (361 aa) initn: 2467 init1: 2467 opt: 2467 Z-score: 2626.6 bits: 494.9 E(85289): 1.5e-139 Smith-Waterman score: 2467; 99.4% identity (99.7% similar) in 361 aa overlap (80-440:1-361) 50 60 70 80 90 100 pF1KB8 ATVSQGEPIQELLVTHGKVPTLVEELIAVEMWKQKVFPVFCRVEDFKPQNTFPIYMVVHH :::::::::::::::::::::::::::::: XP_005 MWKQKVFPVFCRVEDFKPQNTFPIYMVVHH 10 20 30 110 120 130 140 150 160 pF1KB8 EASIINLLETVFFHKEVCESAEDTVLDLVDYCHRKLTLLVAQSGCGGPPEGEGSQDSNPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 EASIINLLETVFFHKEVCESAEDTVLDLVDYCHRKLTLLVAQSGCGGPPEGEGSQDSNPM 40 50 60 70 80 90 170 180 190 200 210 220 pF1KB8 QELQKQAELMEFEIALKALSVLRYITDCVDSLSLSTLSRMLSAHNLPCLLVELLEHSPWS ::::::::::::::::::::::::::::::::::::::::::.::::::::::::::::: XP_005 QELQKQAELMEFEIALKALSVLRYITDCVDSLSLSTLSRMLSTHNLPCLLVELLEHSPWS 100 110 120 130 140 150 230 240 250 260 270 280 pF1KB8 RREGGKLQQFEGSRWHTVAPSEQQKLSKLDGQVWIALYNLLLSPEAQARYCLTSFAKGRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 RREGGKLQQFEGSRWHTVAPSEQQKLSKLDGQVWIALYNLLLSPEAQARYCLTSFAKGRL 160 170 180 190 200 210 290 300 310 320 330 340 pF1KB8 LKLRAFLTDTLLDQLPNLAHLQSFLAHLTLTETQPPKKDLVLEQIPEIWERLERENRGKW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 LKLRAFLTDTLLDQLPNLAHLQSFLAHLTLTETQPPKKDLVLEQIPEIWERLERENRGKW 220 230 240 250 260 270 350 360 370 380 390 400 pF1KB8 QAIAKHQLQHVFSPSEQDLRLQARRWAETYRLDVLEAVAPERPRCAYCSAEASKRCSRCQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 QAIAKHQLQHVFSPSEQDLRLQARRWAETYRLDVLEAVAPERPRCAYCSAEASKRCSRCQ 280 290 300 310 320 330 410 420 430 440 pF1KB8 NEWYCCRECQVKHWEKHGKTCVPAAQGDRAK :::::::::::::::::::::: :::::::: XP_005 NEWYCCRECQVKHWEKHGKTCVLAAQGDRAK 340 350 360 >>NP_787127 (OMIM: 603870) protein CBFA2T3 isoform 2 [Ho (567 aa) initn: 153 init1: 125 opt: 160 Z-score: 173.5 bits: 41.6 E(85289): 0.0063 Smith-Waterman score: 171; 30.3% identity (50.8% similar) in 122 aa overlap (334-439:396-515) 310 320 330 340 350 360 pF1KB8 LPNLAHLQSFLAHLTLTETQPPKKDLVLEQIPE-IWERLERE-NRGKWQAIAKHQLQHVF .:: ::.. :. :. : ::.. .::.. NP_787 AARPRSSSAGPEGPQLDVPREFLPRTLTGYVPEDIWRKAEEAVNEVKRQAMS--ELQKAV 370 380 390 400 410 420 370 380 390 400 pF1KB8 SPSEQDLR--------------LQARRWAETYRLDVLEAVAPERPRCAYCSAEASKRCSR : .:. . .:.: : : :.. : :. .::. :: NP_787 SDAERKAHELITTERAKMERALAEAKRQASEDALTVINQQEDSSESCWNCGRKASETCSG 430 440 450 460 470 480 410 420 430 440 pF1KB8 CQNEWYCCRECQVKHWEKHGKTCVPAAQGDRAK :. :: :: . :::: ..: . :: : NP_787 CNAARYCGSFCQHRDWEKHHHVCGQSLQGPTAVVADPVPGPPEAAHSLGPSLPVGAASPS 490 500 510 520 530 540 NP_787 EAGSAGPSRPGSPSPPGPLDTVPR 550 560 >>XP_005256380 (OMIM: 603870) PREDICTED: protein CBFA2T3 (628 aa) initn: 153 init1: 125 opt: 160 Z-score: 172.9 bits: 41.6 E(85289): 0.0068 Smith-Waterman score: 171; 30.3% identity (50.8% similar) in 122 aa overlap (334-439:457-576) 310 320 330 340 350 360 pF1KB8 LPNLAHLQSFLAHLTLTETQPPKKDLVLEQIPE-IWERLERE-NRGKWQAIAKHQLQHVF .:: ::.. :. :. : ::.. .::.. XP_005 AARPRSSSAGPEGPQLDVPREFLPRTLTGYVPEDIWRKAEEAVNEVKRQAMS--ELQKAV 430 440 450 460 470 480 370 380 390 400 pF1KB8 SPSEQDLR--------------LQARRWAETYRLDVLEAVAPERPRCAYCSAEASKRCSR : .:. . .:.: : : :.. : :. .::. :: XP_005 SDAERKAHELITTERAKMERALAEAKRQASEDALTVINQQEDSSESCWNCGRKASETCSG 490 500 510 520 530 540 410 420 430 440 pF1KB8 CQNEWYCCRECQVKHWEKHGKTCVPAAQGDRAK :. :: :: . :::: ..: . :: : XP_005 CNAARYCGSFCQHRDWEKHHHVCGQSLQGPTAVVADPVPGPPEAAHSLGPSLPVGAASPS 550 560 570 580 590 600 XP_005 EAGSAGPSRPGSPSPPGPLDTVPR 610 620 >>NP_005178 (OMIM: 603870) protein CBFA2T3 isoform 1 [Ho (653 aa) initn: 181 init1: 125 opt: 160 Z-score: 172.6 bits: 41.7 E(85289): 0.0071 Smith-Waterman score: 171; 30.3% identity (50.8% similar) in 122 aa overlap (334-439:482-601) 310 320 330 340 350 360 pF1KB8 LPNLAHLQSFLAHLTLTETQPPKKDLVLEQIPE-IWERLERE-NRGKWQAIAKHQLQHVF .:: ::.. :. :. : ::.. .::.. NP_005 AARPRSSSAGPEGPQLDVPREFLPRTLTGYVPEDIWRKAEEAVNEVKRQAMS--ELQKAV 460 470 480 490 500 370 380 390 400 pF1KB8 SPSEQDLR--------------LQARRWAETYRLDVLEAVAPERPRCAYCSAEASKRCSR : .:. . .:.: : : :.. : :. .::. :: NP_005 SDAERKAHELITTERAKMERALAEAKRQASEDALTVINQQEDSSESCWNCGRKASETCSG 510 520 530 540 550 560 410 420 430 440 pF1KB8 CQNEWYCCRECQVKHWEKHGKTCVPAAQGDRAK :. :: :: . :::: ..: . :: : NP_005 CNAARYCGSFCQHRDWEKHHHVCGQSLQGPTAVVADPVPGPPEAAHSLGPSLPVGAASPS 570 580 590 600 610 620 NP_005 EAGSAGPSRPGSPSPPGPLDTVPR 630 640 650 440 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 11:52:13 2016 done: Fri Nov 4 11:52:14 2016 Total Scan time: 7.040 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]