FASTA searches a protein or DNA sequence data bank
36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8888, 201 aa
1>>>pF1KB8888 201 - 201 aa - 201 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.1511+/-0.000269; mu= 3.6195+/- 0.017
mean_var=200.9329+/-40.808, 0's: 0 Z-trim(125.8): 56 B-trim: 2639 in 1/59
Lambda= 0.090479
statistics sampled from 50417 (50481) to 50417 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.842), E-opt: 0.2 (0.592), width: 16
Scan time: 6.550

The best scores are:                                       opt  bits E(85289)
XP_006716679 (OMIM: 609067) PREDICTED: basic helix ( 201) 1405 194.2 1.2e-49
NP_001073983 (OMIM: 609067) basic helix-loop-helix ( 201) 1405 194.2 1.2e-49
NP_004600 (OMIM: 601010) transcription factor 15 [ ( 199)  520  78.7 7.2e-15
XP_011513798 (OMIM: 101400,123100,180750,601622) P ( 202)  261  44.8 0.00011
NP_000465 (OMIM: 101400,123100,180750,601622) twis ( 202)  261  44.8 0.00011
NP_068808 (OMIM: 602407) heart- and neural crest d ( 217)  253  43.8 0.00024
NP_005089 (OMIM: 603628) musculin [Homo sapiens]   ( 206)  242  42.4 0.00062
NP_803238 (OMIM: 608606) class A basic helix-loop- ( 189)  239  42.0 0.00076
NP_001157877 (OMIM: 607539,609432,615416) class A  ( 235)  235  41.5 0.0013
NP_476527 (OMIM: 200110,209885,227260,607556) twis ( 160)  230  40.7 0.0015
NP_001258822 (OMIM: 200110,209885,227260,607556) t ( 160)  230  40.7 0.0015
XP_005268588 (OMIM: 602406) PREDICTED: heart- and  ( 214)  224  40.0 0.0032
NP_004812 (OMIM: 602406) heart- and neural crest d ( 215)  224  40.0 0.0033
NP_938206 (OMIM: 603306) transcription factor 21 [ ( 179)  222  39.7 0.0034
NP_003197 (OMIM: 603306) transcription factor 21 [ ( 179)  222  39.7 0.0034
XP_016871769 (OMIM: 604882,610370) PREDICTED: neur ( 214)  219  39.4 0.0051
NP_066279 (OMIM: 604882,610370) neurogenin-3 [Homo ( 214)  219  39.4 0.0051
NP_076924 (OMIM: 606624)
neurogenin-2 [Homo sapien ( 272) 219 39.5 0.0061
NP_660161 (OMIM: 609875) protein atonal homolog 7  ( 152)  213  38.5 0.0068

>>XP_006716679 (OMIM: 609067) PREDICTED: basic helix-loo (201 aa)
initn: 1405 init1: 1405 opt: 1405 Z-score: 1012.2 bits: 194.2 E(85289): 1.2e-49
Smith-Waterman score: 1405; 100.0% identity (100.0% similar) in 201 aa overlap (1-201:1-201)

10 20 30 40 50 60
pF1KB8 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEKPCRVHAARCGLQGARRRAGGRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEKPCRVHAARCGLQGARRRAGGRR
10 20 30 40 50 60

70 80 90 100 110 120
pF1KB8 AGGGGPGGRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPTEPADRKLSKIETLRLA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 AGGGGPGGRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPTEPADRKLSKIETLRLA
70 80 90 100 110 120

130 140 150 160 170 180
pF1KB8 SSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPPPPARDGENTQPKQICT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 SSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPPPPARDGENTQPKQICT
130 140 150 160 170 180

190 200
pF1KB8 FCLSNQRKLSKDRDRKTAIRS
:::::::::::::::::::::
XP_006 FCLSNQRKLSKDRDRKTAIRS
190 200

>>NP_001073983 (OMIM: 609067) basic helix-loop-helix tra (201 aa)
initn: 1405 init1: 1405 opt: 1405 Z-score: 1012.2 bits: 194.2 E(85289): 1.2e-49
Smith-Waterman score: 1405; 100.0% identity (100.0% similar) in 201 aa overlap (1-201:1-201)

10 20 30 40 50 60
pF1KB8 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEKPCRVHAARCGLQGARRRAGGRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEKPCRVHAARCGLQGARRRAGGRR
10 20 30 40 50 60

70 80 90 100 110 120
pF1KB8 AGGGGPGGRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPTEPADRKLSKIETLRLA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AGGGGPGGRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPTEPADRKLSKIETLRLA
70 80 90 100 110 120

130 140 150 160 170 180
pF1KB8 SSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPPPPARDGENTQPKQICT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPPPPARDGENTQPKQICT
130 140 150 160 170 180

190 200
pF1KB8 FCLSNQRKLSKDRDRKTAIRS
:::::::::::::::::::::
NP_001 FCLSNQRKLSKDRDRKTAIRS
190 200

>>NP_004600 (OMIM: 601010) transcription factor 15 [Homo (199 aa)
initn: 513 init1: 413 opt: 520 Z-score: 387.9 bits: 78.7 E(85289): 7.2e-15
Smith-Waterman score: 616; 59.3% identity (73.4% similar) in 199 aa overlap (1-194:1-180)

10 20 30 40 50
pF1KB8 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEK-PCRVHAARC-GLQGARRR---
:.:: :::. .. :::.: :::::. :.:..::.. : : : ..:::
NP_004 MAFALLRPVG-AHVLYPDVRLLSEDEENRSESDASDQSFGC------CEGPEAARRGPGP
10 20 30 40 50

60 70 80 90 100 110
pF1KB8 AGGRRAGGGGPGGRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPTEPADRKLSKIE
.::::::::: .: : :::..:::::::::.:::::::::::::::::.::::::::
NP_004 GGGRRAGGGGGAG-PVVVVRQRQAANARERDRTQSVNTAFTALRTLIPTEPVDRKLSKIE
60 70 80 90 100 110

120 130 140 150 160 170
pF1KB8 TLRLASSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPPPPARDGENTQP
:::::::::.::.:::: :.. ::::: :.:: :: : : :: . ::
NP_004 TLRLASSYIAHLANVLLLGDSADDGQPC------FRAA--GSAKGAVPA--AADG-GRQP
120 130 140 150 160

180 190 200
pF1KB8 KQICTFCLSNQRKLSKDRDRKTAIRS
..::::::::::: . ::
NP_004 RSICTFCLSNQRKGGGRRDLGGSCLKVRGVAPLRGPRR
170 180 190

>>XP_011513798 (OMIM: 101400,123100,180750,601622) PREDI (202 aa)
initn: 304 init1: 155 opt: 261 Z-score: 205.1 bits: 44.8 E(85289): 0.00011
Smith-Waterman score: 261; 44.8% identity (65.5% similar) in 116 aa overlap (29-143:62-175)

10 20 30 40 50
pF1KB8 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEKPCR-VHAARCGLQGARRRAG
:.: :: . : ..: :: :. .:
XP_011 GKRGGRKRRSSRRSAGGGAGPGGAAGGGVGGGDEPGSPAQGKRGKKSAGCGGGGGAGGGG
40 50 60 70 80 90

60 70 80 90 100 110
pF1KB8 GRRAGGGGPGGRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPTEPADRKLSKIETL
: .:::.: . .
:: ::.:::.::.:.: ::.::: .::: :.: :::::.::
XP_011 GSSSGGGSPQSYEELQT-QRVMANVRERQRTQSLNEAFAALRKIIPTLPSD-KLSKIQTL
100 110 120 130 140

120 130 140 150 160 170
pF1KB8 RLASSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPPPPARDGENTQPKQ
.::. ::. : .:: . : . :
XP_011 KLAARYIDFLYQVLQSDELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH
150 160 170 180 190 200

>>NP_000465 (OMIM: 101400,123100,180750,601622) twist-re (202 aa)
initn: 304 init1: 155 opt: 261 Z-score: 205.1 bits: 44.8 E(85289): 0.00011
Smith-Waterman score: 261; 44.8% identity (65.5% similar) in 116 aa overlap (29-143:62-175)

10 20 30 40 50
pF1KB8 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEKPCR-VHAARCGLQGARRRAG
:.: :: . : ..: :: :. .:
NP_000 GKRGGRKRRSSRRSAGGGAGPGGAAGGGVGGGDEPGSPAQGKRGKKSAGCGGGGGAGGGG
40 50 60 70 80 90

60 70 80 90 100 110
pF1KB8 GRRAGGGGPGGRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPTEPADRKLSKIETL
: .:::.: . . :: ::.:::.::.:.: ::.::: .::: :.: :::::.::
NP_000 GSSSGGGSPQSYEELQT-QRVMANVRERQRTQSLNEAFAALRKIIPTLPSD-KLSKIQTL
100 110 120 130 140

120 130 140 150 160 170
pF1KB8 RLASSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPPPPARDGENTQPKQ
.::. ::. : .:: . : . :
NP_000 KLAARYIDFLYQVLQSDELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH
150 160 170 180 190 200

>>NP_068808 (OMIM: 602407) heart- and neural crest deriv (217 aa)
initn: 244 init1: 244 opt: 253 Z-score: 199.1 bits: 43.8 E(85289): 0.00024
Smith-Waterman score: 254; 42.5% identity (63.8% similar) in 127 aa overlap (16-141:50-165)

10 20 30 40
pF1KB8 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEKPCRVHAA
.::.:: : . : : : ..
NP_068 FAAAAAAAAAAAASRCSHEENPYFHGWLIGHPEMSP----PDYSMALSYSPEYA----SG
20 30 40 50 60 70

50 60 70 80 90 100
pF1KB8 RCGLQGARRRAGGRRAGGGGPG-GRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPT
::. .. :: :.: :: : : : ..: ::: .:: ::.:.:.::. :: ::.
NP_068 AAGLDHSHY--GGVPPGAGPPGLGGP-RPVKRRGTANRKERRRTQSINSAFAELRECIPN
80 90 100 110 120

110 120 130 140 150 160
pF1KB8 EPADRKLSKIETLRLASSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPP
::: :::::.:::::.:::..: ..: . :...
NP_068 VPADTKLSKIKTLRLATSYIAYLMDLLAKDDQNGEAEAFKAEIKKTDVKEEKRKKELNEI
130 140 150 160 170 180

170 180 190 200
pF1KB8 PPARDGENTQPKQICTFCLSNQRKLSKDRDRKTAIRS
NP_068 LKSTVSSNDKKTKGRTGWPQHVWALELKQ
190 200 210

>>NP_005089 (OMIM: 603628) musculin [Homo sapiens] (206 aa)
initn: 309 init1: 197 opt: 242 Z-score: 191.6 bits: 42.4 E(85289): 0.00062
Smith-Waterman score: 252; 37.5% identity (57.6% similar) in 144 aa overlap (8-131:22-163)

10 20 30 40
pF1KB8 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEKPCRVHAAR
:.: .. : . . .. ::.:...:. . :
NP_005 MSTGSVSDPEEMELRGLQREYPVPASKR--PPLRGVERSYASPSDNSSAEEEDPDGEEER
10 20 30 40 50

50 60 70 80
pF1KB8 CGLQGARRRAGGRR-----AGGGGPGGRPG---REP------------RQRHTANARERD
:.: : : .: ::::: :: : ..: ::..::::::
NP_005 CALGTAGSAEGCKRKRPRVAGGGGAGGSAGGGGKKPLPAKGSAAECKQSQRNAANARERA
60 70 80 90 100 110

90 100 110 120 130 140
pF1KB8 RTNSVNTAFTALRTLIPTEPADRKLSKIETLRLASSYISHLGNVLLAGEACGDGQPCHSG
: .. ::. :.: .: : : ::::..:::::::::.:: ..:
NP_005 RMRVLSKAFSRLKTSLPWVPPDTKLSKLDTLRLASSYIAHLRQLLQEDRYENGYVHPVNL
120 130 140 150 160 170

150 160 170 180 190 200
pF1KB8 PAFFHAARAGSPPPPPPPPPARDGENTQPKQICTFCLSNQRKLSKDRDRKTAIRS
NP_005 TWPFVVSGRPDSDTKEVSAANRLCGTTA
180 190 200

>>NP_803238 (OMIM: 608606) class A basic helix-loop-heli (189 aa)
initn: 224 init1: 224 opt: 239 Z-score: 190.0 bits: 42.0 E(85289): 0.00076
Smith-Waterman score: 241; 41.6% identity (61.6% similar) in 125 aa overlap (9-132:19-132)

10 20 30 40
pF1KB8 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSD-SSGSDEKPCRVHAARCGL
: :: : .: . . : . ..: .: :. :::
NP_803 MKTKNRPPRRRAPVQDTEATPG-----EGTPDGSLPNPGPEPAKGLRSRPARA-AARAPG
10 20 30 40 50

50 60 70 80 90 100
pF1KB8 QGARRRAGGRRAGGGGPGGRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPTEPADR
.: ::: : .::::: ..: .: :::.: ...:.:: ::: .:: ::.
NP_803 EGRRRRPG-----PSGPGGRRDSSIQRRLESNERERQRMHKLNNAFQALREVIPHVRADK
60 70 80 90 100

110 120 130 140 150 160
pF1KB8 KLSKIETLRLASSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPPPPARD
:::::::: ::..::.
: ..:
NP_803 KLSKIETLTLAKNYIKSLTATILTMSSSRLPGLEGPGPKLYQHYQQQQQVAGGALGATEA
110 120 130 140 150 160

>>NP_001157877 (OMIM: 607539,609432,615416) class A basi (235 aa)
initn: 194 init1: 156 opt: 235 Z-score: 185.9 bits: 41.5 E(85289): 0.0013
Smith-Waterman score: 240; 34.9% identity (52.9% similar) in 189 aa overlap (6-182:2-179)

10 20 30 40 50
pF1KB8 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSS--GSDEKPCRVHAARCGLQGARRRAGG
:: :: : : . . . :: :. :.: ...: :. :.. ::
NP_001 MLRGAP-GLGLTARKGAEDSAEDLGGPCPEPGGDSGVLGANGASCSRGEAEEPAGR
10 20 30 40 50

60 70 80 90 100 110
pF1KB8 RRAGGGGPGGRPGREPRQRHTANARERDRTNSVNTAFTALRTLIPTEPADRKLSKIETLR
::: :: : .: .::.::: : . : ::.::: . . . ..:::: :::
NP_001 RRA-------RPVRSKARRMAANVRERKRILDYNEAFNALRRALRHDLGGKRLSKIATLR
60 70 80 90 100

120 130 140 150 160
pF1KB8 LASSYISHLGNVLLAGEA----CGDGQPCHSGPAFFHAARAGSPPPPP--P----PPPAR
: :. :. :: :. : :: . ::. : .. .:. :::: : : ::
NP_001 RAIHRIAALSLVLRASPAPRGPCGHLE-CHGPAARGDTGDTGASPPPPAGPSLARPDAAR
110 120 130 140 150 160

170 180 190 200
pF1KB8 DGENTQPKQICTFCLSNQRKLSKDRDRKTAIRS
. . :. :. :
NP_001 PSVPSAPR--CASCPPHAPLARPSAVAEGPGLAQASGGSWRRCPGASSAGPPPWPRGYLR
170 180 190 200 210 220

>>NP_476527 (OMIM: 200110,209885,227260,607556) twist-re (160 aa)
initn: 201 init1: 135 opt: 230 Z-score: 184.6 bits: 40.7 E(85289): 0.0015
Smith-Waterman score: 230; 38.3% identity (64.8% similar) in 128 aa overlap (20-143:7-133)

10 20 30 40 50
pF1KB8 MSFATLRPAPPGRYLYPEVSPLSEDEDRGSDSSGSDEKPCRVHAARCGLQGARRRAG---
::.: .. :.. ...: : : . . . ..
NP_476 MEEGSSSPVSPVDSLGTSEEELERQPKRFGRKRRYSKKSSEDGSPTP
10 20 30 40

60 70 80 90 100 110
pF1KB8 GRRAGGGGPGGRPGRE-PRQRHTANARERDRTNSVNTAFTALRTLIPTEPADRKLSKIET
:.:. :.:... .: :: ::.:::.::.:.: ::.::: .::: :.: :::::.:
NP_476 GKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSD-KLSKIQT
50 60 70 80 90 100

120 130 140 150 160 170
pF1KB8 LRLASSYISHLGNVLLAGEACGDGQPCHSGPAFFHAARAGSPPPPPPPPPARDGENTQPK
:.::. ::. : .:: . : .
:
NP_476 LKLAARYIDFLYQVLQSDEMDNKMTSCSYVAHERLSYAFSVWRMEGAWSMSASH
110 120 130 140 150 160

201 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 04:48:50 2016 done: Tue Nov 8 04:48:51 2016
Total Scan time: 6.550 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]