FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7508, 242 aa
  1>>>pF1KB7508 242 - 242 aa - 242 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.5765+/-0.000285; mu= 10.9906+/- 0.018
 mean_var=125.9864+/-25.309, 0's: 0 Z-trim(122.7): 46 B-trim: 2707 in 2/55
 Lambda= 0.114265
 statistics sampled from 41161 (41218) to 41161 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.816), E-opt: 0.2 (0.483), width: 16
 Scan time: 8.060

The best scores are:                                      opt  bits E(85289)
NP_002460 (OMIM: 159991,614408) myogenic factor 6  ( 242) 1633 279.2 4.6e-75
NP_002470 (OMIM: 159980) myogenin [Homo sapiens]   ( 224)  527  96.8 3.3e-20
NP_005584 (OMIM: 159990) myogenic factor 5 [Homo s ( 255)  499  92.2   9e-19
NP_002469 (OMIM: 159970) myoblast determination pr ( 320)  420  79.3 8.9e-15

>>NP_002460 (OMIM: 159991,614408) myogenic factor 6 [Hom (242 aa)
 initn: 1633 init1: 1633 opt: 1633 Z-score: 1468.6 bits: 279.2 E(85289): 4.6e-75
Smith-Waterman score: 1633; 100.0% identity (100.0% similar) in 242 aa overlap (1-242:1-242)

               10        20        30        40        50        60
pF1KB7 MMMDLFETGSYFFYLDGENVTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEAGSDSSGEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MMMDLFETGSYFFYLDGENVTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEAGSDSSGEE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 HVLAPPGLQPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEAFEALKRRT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 HVLAPPGLQPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEAFEALKRRT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 VANPNQRLPKVEILRSAISYIERLQDLLHRLDQQEKMQELGVDPFSYRPKQENLEGADFL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 VANPNQRLPKVEILRSAISYIERLQDLLHRLDQQEKMQELGVDPFSYRPKQENLEGADFL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 RTCSSQWPSVSDHSRGLVITAKEGGASIDSSASSSLRCLSSIVDSISSEERKLPCVEEVV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 RTCSSQWPSVSDHSRGLVITAKEGGASIDSSASSSLRCLSSIVDSISSEERKLPCVEEVV
              190       200       210       220       230       240

pF1KB7 EK
       ::
NP_002 EK

>>NP_002470 (OMIM: 159980) myogenin [Homo sapiens] (224 aa)
 initn: 542 init1: 463 opt: 527 Z-score: 483.7 bits: 96.8 E(85289): 3.3e-20
Smith-Waterman score: 570; 45.6% identity (64.0% similar) in 250 aa overlap (3-240:1-222)

               10               20        30        40        50
pF1KB7 MMMDLFETGSYFF----YLDGEN---VTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEAG
         :.:.::. ::.    . ::::   : :: .:     : :  .. ::::
NP_002   MELYETSPYFYQEPRFYDGENYLPVHLQGFE----PPGYERTELTLSP----------
                 10        20        30            40

            60         70        80        90       100       110
pF1KB7 SDSSGEEHVLAPPGL-QPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEA
        .. :    :   ::  : ::::::: ::::.:::::. .:::.::::::.:::::.:::
NP_002 -EAPGP---LEDKGLGTPEHCPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEA
               50        60        70        80        90       100

            120       130       140       150       160         170
pF1KB7 FEALKRRTVANPNQRLPKVEILRSAISYIERLQDLLHRLDQQEKMQEL--GVDPFSYRPK
       :::::: :. ::::::::::::::::.:::::: ::  :.:.:.  .   :  :    :.
NP_002 FEALKRSTLLNPNQRLPKVEILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVPS
              110       120       130       140       150       160

              180       190       200         210       220
pF1KB7 QENLEGADFLRTCSSQWPSVSDHSRGLVITAKEGGA--SIDSSASSSLRCLSSIVDSISS
       . . ..:    .:: .: :      .: ..:. :    . : . . .:. :.::::::.
NP_002 ECSSHSA----SCSPEWGS------ALEFSANPGDHLLTADPTDAHNLHSLTSIVDSITV
                  170             180       190       200       210

      230       240
pF1KB7 EERKLPCVEEVVEK
       :. ..   .:..
NP_002 EDVSVAFPDETMPN
              220

>>NP_005584 (OMIM: 159990) myogenic factor 5 [Homo sapie (255 aa)
 initn: 484 init1: 396 opt: 499 Z-score: 458.0 bits: 92.2 E(85289): 9e-19
Smith-Waterman score: 499; 50.5% identity (69.6% similar) in 184 aa overlap (53-234:48-223)

             30        40        50        60        70        80
pF1KB7 QPLEVAEGSPLYPGSDGTLSPCQDQMPPEAGSDSSGEEHVLAPPGLQPPHCPGQCLIWAC
                                     :::   .::: :: :    :  :.::.:::
NP_005 GSCIPSPEGEFGDEFVPRVAAFGAHKAELQGSDE--DEHVRAPTG---HHQAGHCLMWAC
        20        30        40        50          60           70

             90       100       110       120       130       140
pF1KB7 KTCKRKSAPTDRRKAATLRERRRLKKINEAFEALKRRTVANPNQRLPKVEILRSAISYIE
       :.:::::.  :::::::.::::::::.:.:::.::: :..:::::::::::::.:: :::
NP_005 KACKRKSTTMDRRKAATMRERRRLKKVNQAFETLKRCTTTNPNQRLPKVEILRNAIRYIE
             80        90       100       110       120       130

            150       160       170       180        190       200
pF1KB7 RLQDLLHRLDQQEKMQELGVDPFSYRPKQENLEGADFLRTCSSQ-WPSVSDHSRGLVITA
        ::.::.  .: :..  :  .  : .: . . . .: .  :.:  :   :.   ..
NP_005 SLQELLR--EQVENYYSLPGQSCS-EPTSPTSNCSDGMPECNSPVWSRKSSTFDSIYCPD
              140       150        160       170       180

             210       220        230       240
pF1KB7 KEGGASIDSSASSSLRCLSSIVDSI-SSEERKLPCVEEVVEK
         .  . :... ::: :::.::: : :::.  ::
NP_005 VSNVYATDKNSLSSLDCLSNIVDRITSSEQPGLPLQDLASLSPVASTDSQPATPGASSSR
     190       200       210       220       230       240

NP_005 LIYHVL
     250

>>NP_002469 (OMIM: 159970) myoblast determination protei (320 aa)
 initn: 472 init1: 388 opt: 420 Z-score: 386.3 bits: 79.3 E(85289): 8.9e-15
Smith-Waterman score: 446; 44.9% identity (58.4% similar) in 214 aa overlap (41-234:57-267)

               20        30        40        50           60
pF1KB7 YFFYLDGENVTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEAGSDSSG---EEHVLAPPG
                                     :.: . .  : :   . :   .::: :: :
NP_002 DDFYDDPCFDSPDLRFFEDLDPRLMHVGALLKPEEHSHFPAAVHPAPGAREDEHVRAPSG
         30        40        50        60        70        80

        70        80        90       100       110       120
pF1KB7 LQPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEAFEALKRRTVANPNQR
           :  :.::.::::.::::.. .:::::::.::::::.:.:::::.::: : .:::::
NP_002 ---HHQAGRCLLWACKACKRKTTNADRRKAATMRERRRLSKVNEAFETLKRCTSSNPNQR
            90       100       110       120       130       140

       130       140       150       160        170            180
pF1KB7 LPKVEILRSAISYIERLQDLLHRLDQQEKMQELGV-DPFSYRPKQ--ENLEG---ADFLR
       ::::::::.:: ::: :: ::.  :        .   :    : .  :.  :   :.  :
NP_002 LPKVEILRNAIRYIEGLQALLRDQDAAPPGAAAAFYAPGPLPPGRGGEHYSGDSDASSPR
           150       160       170       180       190       200

              190       200                 210       220       230
pF1KB7 T-CSSQWPSVSDHSRGLVITAKEGGASID----------SSASSSLRCLSSIVDSISSEE
       . ::.   . :    :        ::  .          :.: ::: ::::::. ::.:
NP_002 SNCSDGMMDYSGPPSGARRRNCYEGAYYNEAPSEPRPGKSAAVSSLDCLSSIVERISTES
           210       220       230       240       250       260

              240
pF1KB7 RKLPCVEEVVEK
          :
NP_002 PAAPALLLADVPSESPPRRQEAAAPSEGESSGDPTQSPDAAPQCPAGANPNPIYQVL
           270       280       290       300       310       320

242 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 21:00:40 2016 done: Sat Nov 5 21:00:41 2016
 Total Scan time: 8.060 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]