FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7508, 242 aa 1>>>pF1KB7508 242 - 242 aa - 242 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.3837+/-0.000656; mu= 12.1177+/- 0.040 mean_var=119.4412+/-23.687, 0's: 0 Z-trim(115.1): 38 B-trim: 8 in 1/51 Lambda= 0.117354 statistics sampled from 15608 (15646) to 15608 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.815), E-opt: 0.2 (0.481), width: 16 Scan time: 2.690 The best scores are: opt bits E(32554) CCDS9019.1 MYF6 gene_id:4618|Hs108|chr12 ( 242) 1633 286.2 1.3e-77 CCDS1433.1 MYOG gene_id:4656|Hs108|chr1 ( 224) 527 98.9 2.9e-21 CCDS9020.1 MYF5 gene_id:4617|Hs108|chr12 ( 255) 499 94.2 8.6e-20 CCDS7826.1 MYOD1 gene_id:4654|Hs108|chr11 ( 320) 420 81.0 1.1e-15 >>CCDS9019.1 MYF6 gene_id:4618|Hs108|chr12 (242 aa) initn: 1633 init1: 1633 opt: 1633 Z-score: 1506.8 bits: 286.2 E(32554): 1.3e-77 Smith-Waterman score: 1633; 100.0% identity (100.0% similar) in 242 aa overlap (1-242:1-242) 10 20 30 40 50 60 pF1KB7 MMMDLFETGSYFFYLDGENVTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEAGSDSSGEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS90 MMMDLFETGSYFFYLDGENVTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEAGSDSSGEE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 HVLAPPGLQPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEAFEALKRRT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS90 HVLAPPGLQPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEAFEALKRRT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 VANPNQRLPKVEILRSAISYIERLQDLLHRLDQQEKMQELGVDPFSYRPKQENLEGADFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS90 VANPNQRLPKVEILRSAISYIERLQDLLHRLDQQEKMQELGVDPFSYRPKQENLEGADFL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 RTCSSQWPSVSDHSRGLVITAKEGGASIDSSASSSLRCLSSIVDSISSEERKLPCVEEVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS90 RTCSSQWPSVSDHSRGLVITAKEGGASIDSSASSSLRCLSSIVDSISSEERKLPCVEEVV 190 200 210 220 230 240 pF1KB7 EK :: CCDS90 EK >>CCDS1433.1 MYOG gene_id:4656|Hs108|chr1 (224 aa) initn: 542 init1: 463 opt: 527 Z-score: 495.2 bits: 98.9 E(32554): 2.9e-21 Smith-Waterman score: 570; 45.6% identity (64.0% similar) in 250 aa overlap (3-240:1-222) 10 20 30 40 50 pF1KB7 MMMDLFETGSYFF----YLDGEN---VTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEAG :.:.::. ::. . :::: : :: .: : : .. :::: CCDS14 MELYETSPYFYQEPRFYDGENYLPVHLQGFE----PPGYERTELTLSP---------- 10 20 30 40 60 70 80 90 100 110 pF1KB7 SDSSGEEHVLAPPGL-QPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEA .. : : :: : ::::::: ::::.:::::. .:::.::::::.:::::.::: CCDS14 -EAPGP---LEDKGLGTPEHCPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEA 50 60 70 80 90 100 120 130 140 150 160 170 pF1KB7 FEALKRRTVANPNQRLPKVEILRSAISYIERLQDLLHRLDQQEKMQEL--GVDPFSYRPK :::::: :. ::::::::::::::::.:::::: :: :.:.:. . : : :. CCDS14 FEALKRSTLLNPNQRLPKVEILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVPS 110 120 130 140 150 160 180 190 200 210 220 pF1KB7 QENLEGADFLRTCSSQWPSVSDHSRGLVITAKEGGA--SIDSSASSSLRCLSSIVDSISS . . ..: .:: .: : .: ..:. : . : . . .:. :.::::::. CCDS14 ECSSHSA----SCSPEWGS------ALEFSANPGDHLLTADPTDAHNLHSLTSIVDSITV 170 180 190 200 210 230 240 pF1KB7 EERKLPCVEEVVEK :. .. .:.. CCDS14 EDVSVAFPDETMPN 220 >>CCDS9020.1 MYF5 gene_id:4617|Hs108|chr12 (255 aa) initn: 484 init1: 396 opt: 499 Z-score: 468.8 bits: 94.2 E(32554): 8.6e-20 Smith-Waterman score: 499; 50.5% identity (69.6% similar) in 184 aa overlap (53-234:48-223) 30 40 50 60 70 80 pF1KB7 QPLEVAEGSPLYPGSDGTLSPCQDQMPPEAGSDSSGEEHVLAPPGLQPPHCPGQCLIWAC ::: .::: :: : : :.::.::: CCDS90 GSCIPSPEGEFGDEFVPRVAAFGAHKAELQGSDE--DEHVRAPTG---HHQAGHCLMWAC 20 30 40 50 60 70 90 100 110 120 130 140 pF1KB7 KTCKRKSAPTDRRKAATLRERRRLKKINEAFEALKRRTVANPNQRLPKVEILRSAISYIE :.:::::. :::::::.::::::::.:.:::.::: :..:::::::::::::.:: ::: CCDS90 KACKRKSTTMDRRKAATMRERRRLKKVNQAFETLKRCTTTNPNQRLPKVEILRNAIRYIE 80 90 100 110 120 130 150 160 170 180 190 200 pF1KB7 RLQDLLHRLDQQEKMQELGVDPFSYRPKQENLEGADFLRTCSSQ-WPSVSDHSRGLVITA ::.::. .: :.. : . : .: . . . .: . :.: : :. .. CCDS90 SLQELLR--EQVENYYSLPGQSCS-EPTSPTSNCSDGMPECNSPVWSRKSSTFDSIYCPD 140 150 160 170 180 210 220 230 240 pF1KB7 KEGGASIDSSASSSLRCLSSIVDSI-SSEERKLPCVEEVVEK . . :... ::: :::.::: : :::. :: CCDS90 VSNVYATDKNSLSSLDCLSNIVDRITSSEQPGLPLQDLASLSPVASTDSQPATPGASSSR 190 200 210 220 230 240 CCDS90 LIYHVL 250 >>CCDS7826.1 MYOD1 gene_id:4654|Hs108|chr11 (320 aa) initn: 472 init1: 388 opt: 420 Z-score: 395.2 bits: 81.0 E(32554): 1.1e-15 Smith-Waterman score: 446; 44.9% identity (58.4% similar) in 214 aa overlap (41-234:57-267) 20 30 40 50 60 pF1KB7 YFFYLDGENVTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEAGSDSSG---EEHVLAPPG :.: . . : : . : .::: :: : CCDS78 DDFYDDPCFDSPDLRFFEDLDPRLMHVGALLKPEEHSHFPAAVHPAPGAREDEHVRAPSG 30 40 50 60 70 80 70 80 90 100 110 120 pF1KB7 LQPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEAFEALKRRTVANPNQR : :.::.::::.::::.. .:::::::.::::::.:.:::::.::: : .::::: CCDS78 ---HHQAGRCLLWACKACKRKTTNADRRKAATMRERRRLSKVNEAFETLKRCTSSNPNQR 90 100 110 120 130 140 130 140 150 160 170 180 pF1KB7 LPKVEILRSAISYIERLQDLLHRLDQQEKMQELGV-DPFSYRPKQ--ENLEG---ADFLR ::::::::.:: ::: :: ::. : . : : . :. : :. : CCDS78 LPKVEILRNAIRYIEGLQALLRDQDAAPPGAAAAFYAPGPLPPGRGGEHYSGDSDASSPR 150 160 170 180 190 200 190 200 210 220 230 pF1KB7 T-CSSQWPSVSDHSRGLVITAKEGGASID----------SSASSSLRCLSSIVDSISSEE . ::. . : : :: . :.: ::: ::::::. ::.: CCDS78 SNCSDGMMDYSGPPSGARRRNCYEGAYYNEAPSEPRPGKSAAVSSLDCLSSIVERISTES 210 220 230 240 250 260 240 pF1KB7 RKLPCVEEVVEK : CCDS78 PAAPALLLADVPSESPPRRQEAAAPSEGESSGDPTQSPDAAPQCPAGANPNPIYQVL 270 280 290 300 310 320 242 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 21:00:40 2016 done: Sat Nov 5 21:00:40 2016 Total Scan time: 2.690 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]