FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7544, 262 aa 1>>>pF1KB7544 262 - 262 aa - 262 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.9695+/-0.000334; mu= -0.9319+/- 0.021 mean_var=223.1881+/-45.465, 0's: 0 Z-trim(122.3): 118 B-trim: 822 in 2/59 Lambda= 0.085850 statistics sampled from 40177 (40316) to 40177 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.796), E-opt: 0.2 (0.473), width: 16 Scan time: 8.020 The best scores are: opt bits E(85289) NP_003112 (OMIM: 606802) transcription factor Spi- ( 262) 1805 235.5 7.7e-62 NP_001230928 (OMIM: 606802) transcription factor S ( 177) 1138 152.7 4.2e-37 NP_001230927 (OMIM: 606802) transcription factor S ( 171) 994 134.9 9.5e-32 NP_001230929 (OMIM: 606802) transcription factor S ( 231) 921 125.9 6.3e-29 NP_003111 (OMIM: 165170) transcription factor PU.1 ( 270) 617 88.3 1.5e-17 NP_001074016 (OMIM: 165170) transcription factor P ( 271) 615 88.1 1.8e-17 XP_011518609 (OMIM: 165170) PREDICTED: transcripti ( 183) 558 80.9 1.8e-15 XP_016873662 (OMIM: 165170) PREDICTED: transcripti ( 183) 558 80.9 1.8e-15 XP_005268703 (OMIM: 612568) PREDICTED: transcripti ( 248) 380 59.0 9.9e-09 NP_689536 (OMIM: 612568) transcription factor Spi- ( 248) 380 59.0 9.9e-09 XP_006719302 (OMIM: 612568) PREDICTED: transcripti ( 248) 380 59.0 9.9e-09 XP_011527823 (OMIM: 600609) PREDICTED: GA-binding ( 454) 227 40.2 0.008 XP_011527822 (OMIM: 600609) PREDICTED: GA-binding ( 454) 227 40.2 0.008 NP_001184226 (OMIM: 600609) GA-binding protein alp ( 454) 227 40.2 0.008 NP_002031 (OMIM: 600609) GA-binding protein alpha ( 454) 227 40.2 0.008 XP_005260995 (OMIM: 600609) PREDICTED: GA-binding ( 454) 227 40.2 0.008 XP_016883802 (OMIM: 600609) PREDICTED: GA-binding ( 454) 227 40.2 0.008 >>NP_003112 (OMIM: 606802) transcription factor Spi-B is (262 aa) initn: 1805 init1: 1805 opt: 1805 Z-score: 1231.2 bits: 235.5 E(85289): 7.7e-62 Smith-Waterman score: 1805; 100.0% identity (100.0% similar) in 262 aa overlap (1-262:1-262) 10 20 30 40 50 60 pF1KB7 MLALEAAQLDGPHFSCLYPDGVFYDLDSCKHSSYPDSEGAPDSLWDWTVAPPVPATPYEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 MLALEAAQLDGPHFSCLYPDGVFYDLDSCKHSSYPDSEGAPDSLWDWTVAPPVPATPYEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 FDPAAAAFSHPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 FDPAAAAFSHPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 RGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 RGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEI 190 200 210 220 230 240 250 260 pF1KB7 RKVKRKLTYQFDSALLPAVRRA :::::::::::::::::::::: NP_003 RKVKRKLTYQFDSALLPAVRRA 250 260 >>NP_001230928 (OMIM: 606802) transcription factor Spi-B (177 aa) initn: 1170 init1: 1138 opt: 1138 Z-score: 787.1 bits: 152.7 E(85289): 4.2e-37 Smith-Waterman score: 1138; 98.2% identity (99.4% similar) in 166 aa overlap (1-166:1-166) 10 20 30 40 50 60 pF1KB7 MLALEAAQLDGPHFSCLYPDGVFYDLDSCKHSSYPDSEGAPDSLWDWTVAPPVPATPYEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MLALEAAQLDGPHFSCLYPDGVFYDLDSCKHSSYPDSEGAPDSLWDWTVAPPVPATPYEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 FDPAAAAFSHPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FDPAAAAFSHPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLT ::::::::::::::::::::::::::::::::::::::::::. .: NP_001 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEGLARSCACTSSCWGY 130 140 150 160 170 190 200 210 220 230 240 pF1KB7 RGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEI >>NP_001230927 (OMIM: 606802) transcription factor Spi-B (171 aa) initn: 991 init1: 991 opt: 994 Z-score: 690.9 bits: 134.9 E(85289): 9.5e-32 Smith-Waterman score: 994; 89.0% identity (93.0% similar) in 172 aa overlap (91-262:1-171) 70 80 90 100 110 120 pF1KB7 FDPAAAAFSHPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAY .: :. . ..:: . . ::::::: NP_001 MASSMTWTAASIPATLIQR-GLLTLVPPAY 10 20 130 140 150 160 170 180 pF1KB7 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 APYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLT 30 40 50 60 70 80 190 200 210 220 230 240 pF1KB7 RGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEI 90 100 110 120 130 140 250 260 pF1KB7 RKVKRKLTYQFDSALLPAVRRA :::::::::::::::::::::: NP_001 RKVKRKLTYQFDSALLPAVRRA 150 160 170 >>NP_001230929 (OMIM: 606802) transcription factor Spi-B (231 aa) initn: 934 init1: 910 opt: 921 Z-score: 640.3 bits: 125.9 E(85289): 6.3e-29 Smith-Waterman score: 941; 71.7% identity (77.6% similar) in 219 aa overlap (44-262:29-231) 20 30 40 50 60 70 pF1KB7 FSCLYPDGVFYDLDSCKHSSYPDSEGAPDSLWDWTVAPPVPATPYEAFDPAAAAFSHPQA :: :: : : .: :. :.. NP_001 MASSMTWTAASIPATLIQRGLLTPCGTGLWPHLSQPP-PMKPSTRQQPLLATPRLPSS 10 20 30 40 50 80 90 100 110 120 130 pF1KB7 AQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAYAPYPSPVLSEEED : ::: .:.. . ::. ...:: : . ::::::::: NP_001 A--TNPPPT-----------ALQGTSNWPPAWRPRGLASPHT--PRRTSLASPVLSEEED 60 70 80 90 100 140 150 160 170 180 190 pF1KB7 LPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLTRGDMRECVWWVEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLTRGDMRECVWWVEP 110 120 130 140 150 160 200 210 220 230 240 250 pF1KB7 GAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEIRKVKRKLTYQFDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEIRKVKRKLTYQFDS 170 180 190 200 210 220 260 pF1KB7 ALLPAVRRA ::::::::: NP_001 ALLPAVRRA 230 >>NP_003111 (OMIM: 165170) transcription factor PU.1 iso (270 aa) initn: 530 init1: 480 opt: 617 Z-score: 435.9 bits: 88.3 E(85289): 1.5e-17 Smith-Waterman score: 617; 44.4% identity (63.4% similar) in 268 aa overlap (4-256:2-257) 10 20 30 40 50 pF1KB7 MLALEAAQLDGPHFSCLYP---DGVFYDLDSCK---HSSYP----DSEGAPDSLWDWTVA :.: ...: : . : : : :: : . : :: :.:. : ::. NP_003 MLQACKMEG--FPLVPPPSEDLVPYDTDLYQRQTHEYYPYLSSDGESHSDHYWDFH-- 10 20 30 40 50 60 70 80 90 100 pF1KB7 PPVPATPYEAFDPAAAAFSHPQAAQLCYEPPTYSPAG-NLELAPSLEAPGPGLPAYPTEN : . .:.: : :.. :..: :: . ..:: : .: .:. . NP_003 PHHVHSEFESF--AENNFTELQSVQ----PPQLQQLYRHMELEQMHVLDTPMVPPHPSLG 60 70 80 90 100 110 120 130 140 150 160 pF1KB7 FASQTLVPPAYAPYPS--PVL--SEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGT . : : ::: :. :.::. .:: :::::.:.: .: :: .:.:. NP_003 HQVSYL-PRMCLQYPSLSPAQPSSDEEEGERQSPPLEVSDGEAD-GLEPGPGLLPGETGS 110 120 130 140 150 160 170 180 190 200 210 220 pF1KB7 RKKLRLYQFLLGLLTRGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQ .::.::::::: :: :::.. .:::. :.::::::::: ::.::: ::::::.:::: NP_003 KKKIRLYQFLLDLLRSGDMKDSIWWVDKDKGTFQFSSKHKEALAHRWGIQKGNRKKMTYQ 170 180 190 200 210 220 230 240 250 260 pF1KB7 KLARALRNYAKTGEIRKVKRKLTYQFDSALLPAVRRA :.:::::::.::::..:::.::::::.. .: NP_003 KMARALRNYGKTGEVKKVKKKLTYQFSGEVLGRGGLAERRHPPH 230 240 250 260 270 >>NP_001074016 (OMIM: 165170) transcription factor PU.1 (271 aa) initn: 530 init1: 480 opt: 615 Z-score: 434.5 bits: 88.1 E(85289): 1.8e-17 Smith-Waterman score: 615; 44.2% identity (63.2% similar) in 269 aa overlap (4-256:2-258) 10 20 30 40 pF1KB7 MLALEAAQLDGPHFSCLYP----DGVFYDLDSCK---HSSYP----DSEGAPDSLWDWTV :.: ...: : . : : : :: : . : :: :.:. : ::. NP_001 MLQACKMEG--FPLVPPQPSEDLVPYDTDLYQRQTHEYYPYLSSDGESHSDHYWDFH- 10 20 30 40 50 50 60 70 80 90 100 pF1KB7 APPVPATPYEAFDPAAAAFSHPQAAQLCYEPPTYSPAG-NLELAPSLEAPGPGLPAYPTE : . .:.: : :.. :..: :: . ..:: : .: .:. NP_001 -PHHVHSEFESF--AENNFTELQSVQ----PPQLQQLYRHMELEQMHVLDTPMVPPHPSL 60 70 80 90 100 110 120 130 140 150 160 pF1KB7 NFASQTLVPPAYAPYPS--PVL--SEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAG . . : : ::: :. :.::. .:: :::::.:.: .: :: .:.: NP_001 GHQVSYL-PRMCLQYPSLSPAQPSSDEEEGERQSPPLEVSDGEAD-GLEPGPGLLPGETG 110 120 130 140 150 160 170 180 190 200 210 220 pF1KB7 TRKKLRLYQFLLGLLTRGDMRECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTY ..::.::::::: :: :::.. .:::. :.::::::::: ::.::: ::::::.::: NP_001 SKKKIRLYQFLLDLLRSGDMKDSIWWVDKDKGTFQFSSKHKEALAHRWGIQKGNRKKMTY 170 180 190 200 210 220 230 240 250 260 pF1KB7 QKLARALRNYAKTGEIRKVKRKLTYQFDSALLPAVRRA ::.:::::::.::::..:::.::::::.. .: NP_001 QKMARALRNYGKTGEVKKVKKKLTYQFSGEVLGRGGLAERRHPPH 230 240 250 260 270 >>XP_011518609 (OMIM: 165170) PREDICTED: transcription f (183 aa) initn: 530 init1: 480 opt: 558 Z-score: 398.7 bits: 80.9 E(85289): 1.8e-15 Smith-Waterman score: 558; 55.9% identity (75.8% similar) in 161 aa overlap (100-256:12-170) 70 80 90 100 110 120 pF1KB7 HPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAYAPYPS--PV : .: .:. . . : : ::: :. XP_011 MELEQMHVLDTPMVPPHPSLGHQVSYL-PRMCLQYPSLSPA 10 20 30 40 130 140 150 160 170 180 pF1KB7 L--SEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLTRGDMR :.::. .:: :::::.:.: .: :: .:.:..::.::::::: :: :::. XP_011 QPSSDEEEGERQSPPLEVSDGEAD-GLEPGPGLLPGETGSKKKIRLYQFLLDLLRSGDMK 50 60 70 80 90 190 200 210 220 230 240 pF1KB7 ECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEIRKVKR . .:::. :.::::::::: ::.::: ::::::.:::::.:::::::.::::..:::. XP_011 DSIWWVDKDKGTFQFSSKHKEALAHRWGIQKGNRKKMTYQKMARALRNYGKTGEVKKVKK 100 110 120 130 140 150 250 260 pF1KB7 KLTYQFDSALLPAVRRA ::::::.. .: XP_011 KLTYQFSGEVLGRGGLAERRHPPH 160 170 180 >>XP_016873662 (OMIM: 165170) PREDICTED: transcription f (183 aa) initn: 530 init1: 480 opt: 558 Z-score: 398.7 bits: 80.9 E(85289): 1.8e-15 Smith-Waterman score: 558; 55.9% identity (75.8% similar) in 161 aa overlap (100-256:12-170) 70 80 90 100 110 120 pF1KB7 HPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGLPAYPTENFASQTLVPPAYAPYPS--PV : .: .:. . . : : ::: :. XP_016 MELEQMHVLDTPMVPPHPSLGHQVSYL-PRMCLQYPSLSPA 10 20 30 40 130 140 150 160 170 180 pF1KB7 L--SEEEDLPLDSPALEVSDSESDEALVAGPEGKGSEAGTRKKLRLYQFLLGLLTRGDMR :.::. .:: :::::.:.: .: :: .:.:..::.::::::: :: :::. XP_016 QPSSDEEEGERQSPPLEVSDGEAD-GLEPGPGLLPGETGSKKKIRLYQFLLDLLRSGDMK 50 60 70 80 90 190 200 210 220 230 240 pF1KB7 ECVWWVEPGAGVFQFSSKHKELLARRWGQQKGNRKRMTYQKLARALRNYAKTGEIRKVKR . .:::. :.::::::::: ::.::: ::::::.:::::.:::::::.::::..:::. XP_016 DSIWWVDKDKGTFQFSSKHKEALAHRWGIQKGNRKKMTYQKMARALRNYGKTGEVKKVKK 100 110 120 130 140 150 250 260 pF1KB7 KLTYQFDSALLPAVRRA ::::::.. .: XP_016 KLTYQFSGEVLGRGGLAERRHPPH 160 170 180 >>XP_005268703 (OMIM: 612568) PREDICTED: transcription f (248 aa) initn: 397 init1: 375 opt: 380 Z-score: 277.7 bits: 59.0 E(85289): 9.9e-09 Smith-Waterman score: 386; 36.2% identity (63.3% similar) in 199 aa overlap (59-256:11-198) 30 40 50 60 70 80 pF1KB7 CKHSSYPDSEGAPDSLWDWTVAPPVPATPYEAFDPAAAAFSHPQAAQLCYEPPTYSPAGN .::. : .. . ....: : : . . XP_005 MTCVEQDKLGQAFEDAFEVLRQHSTGDLQYSPDYRNYLAL 10 20 30 40 90 100 110 120 130 140 pF1KB7 LELAPSLEAPGPGLPAYPTENFASQTLVPPAYAPYPSPVLSEEEDLPLDSPALEVSDSES .. : ... . . :::. :.: :.. :. ... . .. . XP_005 INHRPHVKGNSSCYGVLPTEE--------PVYNWR--TVINSAADFYFEGNIHQSLQNIT 50 60 70 80 90 150 160 170 180 190 200 pF1KB7 DEALVAGPEGKGSEAGT-RKKLRLYQFLLGLLTRGDMRECVWWVEPGAGVFQFSSKHKEL .. :: : ...: ::::::...: : .: :. ::. :.::: ::.:: XP_005 ENQLVQ-PTLLQQKGGKGRKKLRLFEYLHESLYNPEMASCIQWVDKTKGIFQFVSKNKEK 100 110 120 130 140 210 220 230 240 250 260 pF1KB7 LARRWGQQKGNRKRMTYQKLARALRNYAKTGEIRKVKRKLTYQFDSALLPAVRRA ::. ::..::::: :::::.:::::::...::: :..:::::::. :.: XP_005 LAELWGKRKGNRKTMTYQKMARALRNYGRSGEITKIRRKLTYQFSEAILQRLSPSYFLGK 150 160 170 180 190 200 XP_005 EIFYSQCVQPDQEYLSLNNWNANYNYTYANYHELNHHDC 210 220 230 240 >>NP_689536 (OMIM: 612568) transcription factor Spi-C [H (248 aa) initn: 397 init1: 375 opt: 380 Z-score: 277.7 bits: 59.0 E(85289): 9.9e-09 Smith-Waterman score: 386; 36.2% identity (63.3% similar) in 199 aa overlap (59-256:11-198) 30 40 50 60 70 80 pF1KB7 CKHSSYPDSEGAPDSLWDWTVAPPVPATPYEAFDPAAAAFSHPQAAQLCYEPPTYSPAGN .::. : .. . ....: : : . . NP_689 MTCVEQDKLGQAFEDAFEVLRQHSTGDLQYSPDYRNYLAL 10 20 30 40 90 100 110 120 130 140 pF1KB7 LELAPSLEAPGPGLPAYPTENFASQTLVPPAYAPYPSPVLSEEEDLPLDSPALEVSDSES .. : ... . . :::. :.: :.. :. ... . .. . NP_689 INHRPHVKGNSSCYGVLPTEE--------PVYNWR--TVINSAADFYFEGNIHQSLQNIT 50 60 70 80 90 150 160 170 180 190 200 pF1KB7 DEALVAGPEGKGSEAGT-RKKLRLYQFLLGLLTRGDMRECVWWVEPGAGVFQFSSKHKEL .. :: : ...: ::::::...: : .: :. ::. :.::: ::.:: NP_689 ENQLVQ-PTLLQQKGGKGRKKLRLFEYLHESLYNPEMASCIQWVDKTKGIFQFVSKNKEK 100 110 120 130 140 210 220 230 240 250 260 pF1KB7 LARRWGQQKGNRKRMTYQKLARALRNYAKTGEIRKVKRKLTYQFDSALLPAVRRA ::. ::..::::: :::::.:::::::...::: :..:::::::. :.: NP_689 LAELWGKRKGNRKTMTYQKMARALRNYGRSGEITKIRRKLTYQFSEAILQRLSPSYFLGK 150 160 170 180 190 200 NP_689 EIFYSQCVQPDQEYLSLNNWNANYNYTYANYHELNHHDC 210 220 230 240 262 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 12:09:28 2016 done: Sun Nov 6 12:09:30 2016 Total Scan time: 8.020 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]