FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9686, 199 aa
  1>>>pF1KB9686 199 - 199 aa - 199 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.3130+/-0.000278; mu= 10.3864+/- 0.018
 mean_var=122.0704+/-24.561, 0's: 0 Z-trim(121.7): 69 B-trim: 0 in 0/52
 Lambda= 0.116083
 statistics sampled from 38551 (38623) to 38551 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.784), E-opt: 0.2 (0.453), width: 16
 Scan time: 5.940

The best scores are:                                       opt  bits E(85289)
NP_004600 (OMIM: 601010) transcription factor 15 [ ( 199) 1343 234.8 6.9e-62
NP_001073983 (OMIM: 609067) basic helix-loop-helix ( 201)  520  97.0 2.1e-20
XP_006716679 (OMIM: 609067) PREDICTED: basic helix ( 201)  520  97.0 2.1e-20
NP_068808 (OMIM: 602407) heart- and neural crest d ( 217)  254  52.5 5.8e-07
NP_005089 (OMIM: 603628) musculin [Homo sapiens]   ( 206)  251  52.0 8e-07
XP_011513798 (OMIM: 101400,123100,180750,601622) P ( 202)  250  51.8 8.8e-07
NP_000465 (OMIM: 101400,123100,180750,601622) twis ( 202)  250  51.8 8.8e-07
XP_005268588 (OMIM: 602406) PREDICTED: heart- and  ( 214)  250  51.8 9.2e-07
NP_004812 (OMIM: 602406) heart- and neural crest d ( 215)  250  51.8 9.2e-07
NP_938206 (OMIM: 603306) transcription factor 21 [ ( 179)  237  49.6 3.7e-06
NP_003197 (OMIM: 603306) transcription factor 21 [ ( 179)  237  49.6 3.7e-06
NP_660161 (OMIM: 609875) protein atonal homolog 7  ( 152)  228  48.0 9.2e-06
NP_803238 (OMIM: 608606) class A basic helix-loop- ( 189)  223  47.2 1.9e-05
NP_001277335 (OMIM: 187040,613065) T-cell acute ly ( 172)  221  46.9 2.3e-05
XP_016857682 (OMIM: 187040,613065) PREDICTED: T-ce ( 172)  221  46.9 2.3e-05
NP_003180 (OMIM: 187040,613065) T-cell acute lymph ( 331)  221  47.1 3.7e-05
XP_016857678 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  221  47.1 3.7e-05
NP_001274276 (OMIM: 187040,613065) T-cell acute ly ( 331)  221  47.1 3.7e-05
NP_001277333 (OMIM: 187040,613065) T-cell acute ly ( 331)  221  47.1 3.7e-05
XP_016857677 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  221  47.1 3.7e-05
XP_005271217 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  221  47.1 3.7e-05
NP_001277332 (OMIM: 187040,613065) T-cell acute ly ( 331)  221  47.1 3.7e-05
XP_016857679 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  221  47.1 3.7e-05
XP_016857680 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  221  47.1 3.7e-05
NP_001277334 (OMIM: 187040,613065) T-cell acute ly ( 331)  221  47.1 3.7e-05
XP_016857676 (OMIM: 187040,613065) PREDICTED: T-ce ( 331)  221  47.1 3.7e-05
NP_476527 (OMIM: 200110,209885,227260,607556) twis ( 160)  213  45.5 5.5e-05
NP_001258822 (OMIM: 200110,209885,227260,607556) t ( 160)  213  45.5 5.5e-05
XP_016871769 (OMIM: 604882,610370) PREDICTED: neur ( 214)  213  45.6 6.8e-05
NP_066279 (OMIM: 604882,610370) neurogenin-3 [Homo ( 214)  213  45.6 6.8e-05
NP_835455 (OMIM: 607194,609069,615935) pancreas tr ( 328)  206  44.6 0.00021
NP_005412 (OMIM: 186855,613065) T-cell acute lymph ( 108)  196  42.5 0.00029
NP_076924 (OMIM: 606624) neurogenin-2 [Homo sapien ( 272)  199  43.4 0.00041
XP_005259969 (OMIM: 151440) PREDICTED: protein lyl ( 206)  195  42.6 0.00053
NP_005574 (OMIM: 151440) protein lyl-1 [Homo sapie ( 280)  195  42.7 0.00066
XP_016882306 (OMIM: 151440) PREDICTED: protein lyl ( 302)  195  42.7 0.0007
XP_016882305 (OMIM: 151440) PREDICTED: protein lyl ( 351)  195  42.8 0.00078
NP_005163 (OMIM: 601461) protein atonal homolog 1  ( 354)  195  42.8 0.00079
NP_006152 (OMIM: 601726) neurogenin-1 [Homo sapien ( 237)  191  42.0 0.00094
NP_001035047 (OMIM: 277300,605195,608681) mesoderm ( 397)  194  42.7 0.00096
NP_005589 (OMIM: 162360) helix-loop-helix protein  ( 133)  187  41.1 0.00097
NP_005590 (OMIM: 162361) helix-loop-helix protein  ( 135)  183  40.4 0.0016
NP_001104531 (OMIM: 162361) helix-loop-helix prote ( 135)  183  40.4 0.0016
NP_006151 (OMIM: 601725) neurogenic differentiatio ( 382)  187  41.5 0.0021
NP_073565 (OMIM: 611513) neurogenic differentiatio ( 337)  186  41.3 0.0022
NP_061140 (OMIM: 608689) mesoderm posterior protei ( 268)  181  40.3 0.0033
XP_005264216 (OMIM: 609635) PREDICTED: transcripti ( 210)  179  39.9 0.0034
NP_786951 (OMIM: 609635) transcription factor 23 [ ( 214)  179  39.9 0.0035
NP_002491 (OMIM: 125853,601724,606394) neurogenic  ( 356)  181  40.4 0.004
NP_001004311 (OMIM: 608697,612310) factor in the g ( 219)  176  39.4 0.005

>>NP_004600 (OMIM: 601010) transcription factor 15 [Homo (199 aa)
 initn: 1343 init1: 1343 opt: 1343 Z-score: 1232.1 bits: 234.8 E(85289): 6.9e-62
Smith-Waterman score: 1343; 100.0% identity (100.0% similar) in 199 aa overlap (1-199:1-199)

               10        20        30        40        50        60
pF1KB9 MAFALLRPVGAHVLYPDVRLLSEDEENRSESDASDQSFGCCEGPEAARRGPGPGGGRRAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 MAFALLRPVGAHVLYPDVRLLSEDEENRSESDASDQSFGCCEGPEAARRGPGPGGGRRAG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 GGGGAGPVVVVRQRQAANARERDRTQSVNTAFTALRTLIPTEPVDRKLSKIETLRLASSY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 GGGGAGPVVVVRQRQAANARERDRTQSVNTAFTALRTLIPTEPVDRKLSKIETLRLASSY
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 IAHLANVLLLGDSADDGQPCFRAAGSAKGAVPAAADGGRQPRSICTFCLSNQRKGGGRRD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 IAHLANVLLLGDSADDGQPCFRAAGSAKGAVPAAADGGRQPRSICTFCLSNQRKGGGRRD
              130       140       150       160       170       180

              190
pF1KB9 LGGSCLKVRGVAPLRGPRR
       :::::::::::::::::::
NP_004 LGGSCLKVRGVAPLRGPRR
              190

>>NP_001073983 (OMIM: 609067) basic helix-loop-helix tra (201 aa)
 initn: 513 init1: 413 opt: 520 Z-score: 487.1 bits: 97.0 E(85289): 2.1e-20
Smith-Waterman score: 616; 59.3% identity (73.4% similar) in 199 aa overlap (1-180:1-194)

               10         20        30        40              50
pF1KB9 MAFALLRPVG-AHVLYPDVRLLSEDEENRSESDASDQSFGC------CEGPEAARRGPGP
       :.:: :::.  .. :::.:  :::::.  :.:..::..
: : : ..::: NP_001 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEK-PCRVHAARC-GLQGARRR--- 10 20 30 40 50 60 70 80 90 100 110 pF1KB9 GGGRRAGGGGGAG-PVVVVRQRQAANARERDRTQSVNTAFTALRTLIPTEPVDRKLSKIE .::::::::: .: : :::..:::::::::.:::::::::::::::::.:::::::: NP_001 AGGRRAGGGGPGGRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPTEPADRKLSKIE 60 70 80 90 100 110 120 130 140 150 160 pF1KB9 TLRLASSYIAHLANVLLLGDSADDGQPC------FRAA--GSAKGAVPA--AADG-GRQP :::::::::.::.:::: :.. ::::: :.:: :: : : :: . :: NP_001 TLRLASSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPPPPARDGENTQP 120 130 140 150 160 170 170 180 190 pF1KB9 RSICTFCLSNQRKGGGRRDLGGSCLKVRGVAPLRGPRR ..::::::::::: . :: NP_001 KQICTFCLSNQRKLSKDRDRKTAIRS 180 190 200 >>XP_006716679 (OMIM: 609067) PREDICTED: basic helix-loo (201 aa) initn: 513 init1: 413 opt: 520 Z-score: 487.1 bits: 97.0 E(85289): 2.1e-20 Smith-Waterman score: 616; 59.3% identity (73.4% similar) in 199 aa overlap (1-180:1-194) 10 20 30 40 50 pF1KB9 MAFALLRPVG-AHVLYPDVRLLSEDEENRSESDASDQSFGC------CEGPEAARRGPGP :.:: :::. .. :::.: :::::. :.:..::.. : : : ..::: XP_006 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEK-PCRVHAARC-GLQGARRR--- 10 20 30 40 50 60 70 80 90 100 110 pF1KB9 GGGRRAGGGGGAG-PVVVVRQRQAANARERDRTQSVNTAFTALRTLIPTEPVDRKLSKIE .::::::::: .: : :::..:::::::::.:::::::::::::::::.:::::::: XP_006 AGGRRAGGGGPGGRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPTEPADRKLSKIE 60 70 80 90 100 110 120 130 140 150 160 pF1KB9 TLRLASSYIAHLANVLLLGDSADDGQPC------FRAA--GSAKGAVPA--AADG-GRQP :::::::::.::.:::: :.. ::::: :.:: :: : : :: . :: XP_006 TLRLASSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPPPPARDGENTQP 120 130 140 150 160 170 170 180 190 pF1KB9 RSICTFCLSNQRKGGGRRDLGGSCLKVRGVAPLRGPRR ..::::::::::: . 
:: XP_006 KQICTFCLSNQRKLSKDRDRKTAIRS 180 190 200 >>NP_068808 (OMIM: 602407) heart- and neural crest deriv (217 aa) initn: 252 init1: 236 opt: 254 Z-score: 245.9 bits: 52.5 E(85289): 5.8e-07 Smith-Waterman score: 254; 42.0% identity (65.5% similar) in 119 aa overlap (30-143:52-169) 10 20 30 40 50 pF1KB9 MAFALLRPVGAHVLYPDVRLLSEDEENRSESDASDQSFGCCEGPEAARRGPGPGGGRRA : . : :.. .:: : . : .. . NP_068 AAAAAAAAAAASRCSHEENPYFHGWLIGHPEMSPPDYSMALSYSPEYASGAAGLDHSHYG 30 40 50 60 70 80 60 70 80 90 100 110 pF1KB9 GGGGGAGPVVV-----VRQRQAANARERDRTQSVNTAFTALRTLIPTEPVDRKLSKIETL : :::: . :..: .:: .:: ::::.:.::. :: ::. :.: :::::.:: NP_068 GVPPGAGPPGLGGPRPVKRRGTANRKERRRTQSINSAFAELRECIPNVPADTKLSKIKTL 90 100 110 120 130 140 120 130 140 150 160 170 pF1KB9 RLASSYIAHLANVLLLGDSADDGQPCFRAAGSAKGAVPAAADGGRQPRSICTFCLSNQRK :::.::::.: ..: :. ... :.: NP_068 RLATSYIAYLMDLLAKDDQNGEAEA-FKAEIKKTDVKEEKRKKELNEILKSTVSSNDKKT 150 160 170 180 190 200 >>NP_005089 (OMIM: 603628) musculin [Homo sapiens] (206 aa) initn: 301 init1: 221 opt: 251 Z-score: 243.5 bits: 52.0 E(85289): 8e-07 Smith-Waterman score: 251; 42.6% identity (62.3% similar) in 122 aa overlap (23-137:53-171) 10 20 30 40 50 pF1KB9 MAFALLRPVGAHVLYPDVRLLSEDEENRSESDASDQSFGCCEGPEAARRGPG . ::.: .. .. :: . . : . : NP_005 VPASKRPPLRGVERSYASPSDNSSAEEEDPDGEEERCALGTAGSAEGCKR--KRPRVAGG 30 40 50 60 70 80 60 70 80 90 100 pF1KB9 PGGGRRAGGGG-------GAGPVVVVRQRQAANARERDRTQSVNTAFTALRTLIPTEPVD :.: ::::: :.. ::.::::::: : . .. ::. :.: .: : : NP_005 GGAGGSAGGGGKKPLPAKGSAAECKQSQRNAANARERARMRVLSKAFSRLKTSLPWVPPD 90 100 110 120 130 140 110 120 130 140 150 160 pF1KB9 RKLSKIETLRLASSYIAHLANVLLLGDSADDGQPCFRAAGSAKGAVPAAADGGRQPRSIC ::::..:::::::::::: . 
:: : ..: NP_005 TKLSKLDTLRLASSYIAHLRQ-LLQEDRYENGYVHPVNLTWPFVVSGRPDSDTKEVSAAN 150 160 170 180 190 170 180 190 pF1KB9 TFCLSNQRKGGGRRDLGGSCLKVRGVAPLRGPRR NP_005 RLCGTTA 200 >>XP_011513798 (OMIM: 101400,123100,180750,601622) PREDI (202 aa) initn: 225 init1: 172 opt: 250 Z-score: 242.7 bits: 51.8 E(85289): 8.8e-07 Smith-Waterman score: 250; 51.6% identity (66.7% similar) in 93 aa overlap (36-128:77-163) 10 20 30 40 50 60 pF1KB9 LRPVGAHVLYPDVRLLSEDEENRSESDASDQSFGCCEGPEAARRGPGPGGGRRAGGGGGA .: :: : : : ::: .:::. XP_011 GGGAGPGGAAGGGVGGGDEPGSPAQGKRGKKSAGCGGGG-----GAGGGGGSSSGGGSPQ 50 60 70 80 90 100 70 80 90 100 110 120 pF1KB9 GPVVVVRQRQAANARERDRTQSVNTAFTALRTLIPTEPVDRKLSKIETLRLASSYIAHLA . . :: ::.:::.::::.: ::.::: .::: : : :::::.::.::. :: : XP_011 SYEELQTQRVMANVRERQRTQSLNEAFAALRKIIPTLPSD-KLSKIQTLKLAARYIDFLY 110 120 130 140 150 160 130 140 150 160 170 180 pF1KB9 NVLLLGDSADDGQPCFRAAGSAKGAVPAAADGGRQPRSICTFCLSNQRKGGGRRDLGGSC .:: XP_011 QVLQSDELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH 170 180 190 200 >>NP_000465 (OMIM: 101400,123100,180750,601622) twist-re (202 aa) initn: 225 init1: 172 opt: 250 Z-score: 242.7 bits: 51.8 E(85289): 8.8e-07 Smith-Waterman score: 250; 51.6% identity (66.7% similar) in 93 aa overlap (36-128:77-163) 10 20 30 40 50 60 pF1KB9 LRPVGAHVLYPDVRLLSEDEENRSESDASDQSFGCCEGPEAARRGPGPGGGRRAGGGGGA .: :: : : : ::: .:::. NP_000 GGGAGPGGAAGGGVGGGDEPGSPAQGKRGKKSAGCGGGG-----GAGGGGGSSSGGGSPQ 50 60 70 80 90 100 70 80 90 100 110 120 pF1KB9 GPVVVVRQRQAANARERDRTQSVNTAFTALRTLIPTEPVDRKLSKIETLRLASSYIAHLA . . :: ::.:::.::::.: ::.::: .::: : : :::::.::.::. 
:: : NP_000 SYEELQTQRVMANVRERQRTQSLNEAFAALRKIIPTLPSD-KLSKIQTLKLAARYIDFLY 110 120 130 140 150 160 130 140 150 160 170 180 pF1KB9 NVLLLGDSADDGQPCFRAAGSAKGAVPAAADGGRQPRSICTFCLSNQRKGGGRRDLGGSC .:: NP_000 QVLQSDELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH 170 180 190 200 >>XP_005268588 (OMIM: 602406) PREDICTED: heart- and neur (214 aa) initn: 258 init1: 201 opt: 250 Z-score: 242.4 bits: 51.8 E(85289): 9.2e-07 Smith-Waterman score: 250; 43.0% identity (65.3% similar) in 121 aa overlap (43-162:73-176) 20 30 40 50 60 70 pF1KB9 VLYPDVRLLSEDEENRSESDASDQSFGCCEGPEAARRGPGPGGGRRAGGGGGAGPVVVVR ::.: :: . :: . :: : : XP_005 QSWLLSPADAAPDFPAGGPPPAAAAAATAYGPDAR---PGQSPGRLEALGGRLG-----R 50 60 70 80 90 80 90 100 110 120 130 pF1KB9 QRQAANARERDRTQSVNTAFTALRTLIPTEPVDRKLSKIETLRLASSYIAHLANVLLLGD .. .. .:: ::.:.:.::. :: ::. :.: :::::.:::::.::::.: .:: . XP_005 RKGSGPKKERRRTESINSAFAELRECIPNVPADTKLSKIKTLRLATSYIAYLMDVL--AK 100 110 120 130 140 150 140 150 160 170 180 190 pF1KB9 SADDGQP-CFRAAGSAKGAVPAAADGGRQPRSICTFCLSNQRKGGGRRDLGGSCLKVRGV .:..:.: :.: . :::::. . XP_005 DAQSGDPEAFKAELKK-------ADGGRESKRKRELQHEGFPPALGPVEKRIKGRTGWPQ 160 170 180 190 200 pF1KB9 APLRGPRR XP_005 QVWALELNQ 210 >>NP_004812 (OMIM: 602406) heart- and neural crest deriv (215 aa) initn: 252 init1: 201 opt: 250 Z-score: 242.4 bits: 51.8 E(85289): 9.2e-07 Smith-Waterman score: 250; 43.0% identity (65.3% similar) in 121 aa overlap (43-162:73-176) 20 30 40 50 60 70 pF1KB9 VLYPDVRLLSEDEENRSESDASDQSFGCCEGPEAARRGPGPGGGRRAGGGGGAGPVVVVR ::.: :: . :: . :: : : NP_004 QSWLLSPADAAPDFPAGGPPPAAAAAATAYGPDAR---PGQSPGRLEALGGRLG-----R 50 60 70 80 90 80 90 100 110 120 130 pF1KB9 QRQAANARERDRTQSVNTAFTALRTLIPTEPVDRKLSKIETLRLASSYIAHLANVLLLGD .. .. .:: ::.:.:.::. :: ::. :.: :::::.:::::.::::.: .:: . NP_004 RKGSGPKKERRRTESINSAFAELRECIPNVPADTKLSKIKTLRLATSYIAYLMDVL--AK 100 110 120 130 140 150 140 150 160 170 180 190 pF1KB9 SADDGQP-CFRAAGSAKGAVPAAADGGRQPRSICTFCLSNQRKGGGRRDLGGSCLKVRGV .:..:.: :.: . :::::. . 
NP_004 DAQSGDPEAFKAELKK-------ADGGRESKRKRELQQHEGFPPALGPVEKRIKGRTGWP 160 170 180 190 200 pF1KB9 APLRGPRR NP_004 QQVWALELNQ 210

>>NP_938206 (OMIM: 603306) transcription factor 21 [Homo (179 aa)
 initn: 208 init1: 208 opt: 237 Z-score: 231.6 bits: 49.6 E(85289): 3.7e-06
Smith-Waterman score: 237; 38.2% identity (64.2% similar) in 123 aa overlap (21-137:25-143)

10 20 30 40 50 pF1KB9 MAFALLRPVGAHVLYPDVRLLSEDEENRSESDASDQSFGCCEGPEAARRGPGPGGG .. ..: . ......: .: .: :: : : NP_938 MSTGSLSDVEDLQEVEMLECDGLKMDSNKEFVTSNESTEESSNCENGSPQKGRG---GLG 10 20 30 40 50 60 70 80 90 100 110 pF1KB9 RRAGGGGGAGPVVVVRQ------RQAANARERDRTQSVNTAFTALRTLIPTEPVDRKLSK .: . .:. : : :.::::::: : . .. ::. :.: .: : : :::: NP_938 KRRKAPTKKSPLSGVSQEGKQVQRNAANARERARMRVLSKAFSRLKTTLPWVPPDTKLSK 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB9 IETLRLASSYIAHLANVLLLGDSADDGQPCFRAAGSAKGAVPAAADGGRQPRSICTFCLS ..:::::::::::: ..: .:. ..: NP_938 LDTLRLASSYIAHLRQILA-NDKYENGYIHPVNLTWPFMVAGKPESDLKEVVTASRLCGT 120 130 140 150 160 170

199 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 04:48:13 2016 done: Tue Nov 8 04:48:14 2016
 Total Scan time: 5.940 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]
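
The rows of the "best scores" table in this report can be read mechanically. The following is a minimal Python sketch, not part of FASTA itself: the regular expression, the field names and both helper functions are illustrative assumptions based only on how the rows appear above. The second helper encodes a relationship that seems consistent with the numbers in this report (E is roughly library sequences x query length x subject length x 2^-bits); it is inferred from the reported values rather than taken from the program's documentation, and it agrees only to within the rounding of the printed bit scores.

```python
import re

# One row of the "The best scores are:" table as it appears in this report.
# Group names are hypothetical labels chosen for this sketch.
ROW = re.compile(
    r"^(?P<acc>\S+)\s+\(OMIM:\s*(?P<omim>[\d,]+)\)\s+(?P<desc>.*?)\s*"
    r"\(\s*(?P<length>\d+)\)\s+(?P<opt>\d+)\s+(?P<bits>\d+\.\d+)\s+(?P<evalue>\S+)\s*$"
)


def parse_row(line):
    """Return the fields of one best-scores row as a dict, or None if no match."""
    m = ROW.match(line)
    if m is None:
        return None
    return {
        "acc": m.group("acc"),
        "omim": m.group("omim").split(","),
        "desc": m.group("desc"),
        "length": int(m.group("length")),
        "opt": int(m.group("opt")),
        "bits": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }


def estimate_evalue(bits, query_len, subject_len, library_seqs):
    """Assumed relationship: E ~ D * m * n * 2**(-bits), inferred from this report."""
    return library_seqs * query_len * subject_len * 2.0 ** (-bits)


if __name__ == "__main__":
    sample = [
        "NP_004600 (OMIM: 601010) transcription factor 15 [ ( 199) 1343 234.8 6.9e-62",
        "NP_001073983 (OMIM: 609067) basic helix-loop-helix ( 201) 520 97.0 2.1e-20",
    ]
    for line in sample:
        row = parse_row(line)
        # Query length 199 and library size 85289 are taken from the report header.
        est = estimate_evalue(row["bits"], 199, row["length"], 85289)
        print(row["acc"], "reported", row["evalue"], "estimated", f"{est:.2g}")
```

Because the bit scores are printed to one decimal place, an estimate made this way can differ from the reported E() value by a few percent; the reported column should be treated as authoritative.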