FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5007, 344 aa
  1>>>pF1KE5007 344 - 344 aa - 344 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 9.1628+/-0.00111; mu= 1.5456+/- 0.067
 mean_var=303.5251+/-62.573, 0's: 0 Z-trim(113.3): 160  B-trim: 676 in 1/52
 Lambda= 0.073617
 statistics sampled from 13814 (13981) to 13814 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.736), E-opt: 0.2 (0.429), width:  16
 Scan time:  2.420

The best scores are:                                opt  bits E(32554)
CCDS13318.1 SRSF6 gene_id:6431|Hs108|chr20  ( 344) 2254 252.4 3.9e-67
CCDS333.1 SRSF4 gene_id:6429|Hs108|chr1     ( 494) 1385 160.3   3e-39
CCDS32109.1 SRSF5 gene_id:6430|Hs108|chr14  ( 272) 1036 123.0 2.9e-28
CCDS11600.1 SRSF1 gene_id:6426|Hs108|chr17  ( 248)  598  76.4 2.8e-14

>>CCDS13318.1 SRSF6 gene_id:6431|Hs108|chr20  (344 aa)
 initn: 2254 init1: 2254 opt: 2254  Z-score: 1318.8  bits: 252.4 E(32554): 3.9e-67
Smith-Waterman score: 2254; 100.0% identity (100.0% similar) in 344 aa overlap (1-344:1-344)

               10        20        30        40        50        60
pF1KE5 MPRVYIGRLSYNVREKDIQRFFSGYGRLLEVDLKNGYGFVEFEDSRDADDAVYELNGKEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MPRVYIGRLSYNVREKDIQRFFSGYGRLLEVDLKNGYGFVEFEDSRDADDAVYELNGKEL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 CGERVIVEHARGPRRDRDGYSYGSRSGGGGYSSRRTSGRDKYGPPVRTEYRLIVENLSSR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 CGERVIVEHARGPRRDRDGYSYGSRSGGGGYSSRRTSGRDKYGPPVRTEYRLIVENLSSR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 CSWQDLKDFMRQAGEVTYADAHKERTNEGVIEFRSYSDMKRALDKLDGTEINGRNIRLIE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 CSWQDLKDFMRQAGEVTYADAHKERTNEGVIEFRSYSDMKRALDKLDGTEINGRNIRLIE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 DKPRTSHRRSYSGSRSRSRSRRRSRSRSRRSSRSRSRSISKSRSRSRSRSKGRSRSRSKG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 DKPRTSHRRSYSGSRSRSRSRRRSRSRSRRSSRSRSRSISKSRSRSRSRSKGRSRSRSKG
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE5 RKSRSKSKSKPKSDRGSHSHSRSRSKDEYEKSRSRSRSRSPKENGKGDIKSKSRSRSQSR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 RKSRSKSKSKPKSDRGSHSHSRSRSKDEYEKSRSRSRSRSPKENGKGDIKSKSRSRSQSR
              250       260       270       280       290       300

              310       320       330       340
pF1KE5 SNSPLPVPPSKARSVSPPPKRATSRSRSRSRSKSRSRSRSSSRD
       ::::::::::::::::::::::::::::::::::::::::::::
CCDS13 SNSPLPVPPSKARSVSPPPKRATSRSRSRSRSKSRSRSRSSSRD
              310       320       330       340

>>CCDS333.1 SRSF4 gene_id:6429|Hs108|chr1  (494 aa)
 initn: 1340 init1: 840 opt: 1385  Z-score: 818.1  bits: 160.3 E(32554): 3e-39
Smith-Waterman score: 1427; 66.5% identity (82.0% similar) in 361 aa overlap (1-344:1-352)

               10        20        30        40        50        60
pF1KE5 MPRVYIGRLSYNVREKDIQRFFSGYGRLLEVDLKNGYGFVEFEDSRDADDAVYELNGKEL
       :::::::::::..::.:..:::.:::..::::::::::::::.: :::::::::::::.:
CCDS33 MPRVYIGRLSYQARERDVERFFKGYGKILEVDLKNGYGFVEFDDLRDADDAVYELNGKDL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 CGERVIVEHARGPRRDRDGYSYGSRSGGGGYSSRRTSGRDKYGPPVRTEYRLIVENLSSR
       :::::::::::::::  :: ::::  : .::. :: :::::::::.::::::::::::::
CCDS33 CGERVIVEHARGPRR--DG-SYGS--GRSGYGYRR-SGRDKYGPPTRTEYRLIVENLSSR
               70           80          90        100       110

              130       140       150       160       170       180
pF1KE5 CSWQDLKDFMRQAGEVTYADAHKERTNEGVIEFRSYSDMKRALDKLDGTEINGRNIRLIE
       ::::::::.:::::::::::::: : ::::::: :::::::::.::::::.:::.:::.:
CCDS33 CSWQDLKDYMRQAGEVTYADAHKGRKNEGVIEFVSYSDMKRALEKLDGTEVNGRKIRLVE
          120       130       140       150       160       170

              190       200               210       220       230
pF1KE5 DKPRTSHRRSYSGSRSRSRSRRRSR----SRSR----RSSRSRSRSISKSRSRSRSRSKG
       ::: . .::::: :::.:::: :::    ::::    .::.:.::: :.: :::::.:..
CCDS33 DKPGSRRRRSYSRSRSHSRSRSRSRHSRKSRSRSGSSKSSHSKSRSRSRSGSRSRSKSRS
          180       190       200       210       220       230

            240       250         260       270             280
pF1KE5 RSRSRSKGRKSRSKSKSKPKSDRGSHS--HSRSRSKDEYEK------SRSRSRSRSPKEN
       ::.:::...: .:.: :: ::   :::  .:::.:::. :.      . .. .::::...
CCDS33 RSQSRSRSKKEKSRSPSKEKSRSRSHSAGKSRSKSKDQAEEKIQNNDNVGKPKSRSPSRH
          240       250       260       270       280       290

          290       300        310       320       330       340
pF1KE5 GKGDIKSKSRSRSQSRSNSPLPVPP-SKARSVSPPPKRATSRSRSRSRSKSRSRSRSSSR
        .   ::::::::: :          :..::     ... :::::.. :.:::::::.:.
CCDS33 KS---KSKSRSRSQERRVEEEKRGSVSRGRSQEKSLRQSRSRSRSKGGSRSRSRSRSKSK
             300       310       320       330       340       350


pF1KE5 D
       :
CCDS33 DKRKGRKRSREESRSRSRSRSKSERSRKRGSKRDSKAGSSKKKKKEDTDRSQSRSPSRSV
             360       370       380       390       400       410

>>CCDS32109.1 SRSF5 gene_id:6430|Hs108|chr14  (272 aa)
 initn: 1608 init1: 522 opt: 1036  Z-score: 620.9  bits: 123.0 E(32554): 2.9e-28
Smith-Waterman score: 1048; 62.5% identity (79.3% similar) in 280 aa overlap (3-280:5-267)

                 10        20        30        40        50
pF1KE5   MPRVYIGRLSYNVREKDIQRFFSGYGRLLEVDLKNGYGFVEFEDSRDADDAVYELNGK
           ::.::::.  .::::..:::.::::. ..::: :.::::::: ::::::::::.::
CCDS32 MSGCRVFIGRLNPAAREKDVERFFKGYGRIRDIDLKRGFGFVEFEDPRDADDAVYELDGK
               10        20        30        40        50        60

       60        70        80        90       100       110
pF1KE5 ELCGERVIVEHARGPRRDRDGYSYGSRSGGGGYSSRRTSGRDKYGPPVRTEYRLIVENLS
       :::.::: .::::.  :.: : . :  :    .::::  .  . .:::::: ::::::::
CCDS32 ELCSERVTIEHARA--RSRGGRGRGRYSDR--FSSRRPRNDRRNAPPVRTENRLIVENLS
               70          80          90       100       110

      120       130       140       150       160       170
pF1KE5 SRCSWQDLKDFMRQAGEVTYADAHKERTNEGVIEFRSYSDMKRALDKLDGTEINGRNIRL
       :: ::::::::::::::::.::::. . ::::.:: ::.:.: :..::.: :::::.:.:
CCDS32 SRVSWQDLKDFMRQAGEVTFADAHRPKLNEGVVEFASYGDLKNAIEKLSGKEINGRKIKL
        120       130       140       150       160       170

      180       190       200       210       220        230
pF1KE5 IEDKPRTSHRRSYSGSRSRSRSRRRSRSRSRRSSRSRSRSISKSR-SRSRSRSKGRSRSR
       ::            ::. .:::: :::::.: :::::::: :.:: : :::::..:::::
CCDS32 IE------------GSKRHSRSRSRSRSRTRSSSRSRSRSRSRSRKSYSRSRSRSRSRSR
                    180       190       200       210       220

       240       250        260       270       280       290
pF1KE5 SKGRKSRSKSKSKPKSD-RGSHSHSRSRSKDEYEKSRSRSRSRSPKENGKGDIKSKSRSR
       ::.: : :.:    ::. ::: :.:.: .. . ..:::::::::
CCDS32 SKSR-SVSRSPVPEKSQKRGSSSRSKSPASVDRQRSRSRSRSRSVDSGN
           230       240       250       260       270

        300       310       320       330       340
pF1KE5 SQSRSNSPLPVPPSKARSVSPPPKRATSRSRSRSRSKSRSRSRSSSRD

>>CCDS11600.1 SRSF1 gene_id:6426|Hs108|chr17  (248 aa)
 initn: 1046 init1: 210 opt: 598  Z-score: 370.0  bits: 76.4 E(32554): 2.8e-14
Smith-Waterman score: 598; 46.9% identity (65.8% similar) in 243 aa overlap (3-238:17-248)

                             10        20        30             40
pF1KE5               MPRVYIGRLSYNVREKDIQRFFSGYGRLLEVDLKNGYG-----FVE
                       :.:.: :  ..: :::.  :  :: . ..::::  :     :::
CCDS11 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE
               10        20        30        40        50        60

              50        60        70        80        90       100
pF1KE5 FEDSRDADDAVYELNGKELCGERVIVEHARGPRRDRDGYSYGSRSGGGGYSSRRTSGRDK
       ::: :::.::::  .: .  : :. ::    ::  : : . :. .:::: . :   ::
CCDS11 FEDPRDAEDAVYGRDGYDYDGYRLRVEF---PRSGR-GTGRGGGGGGGGGAPR---GR--
               70        80           90        100          110

              110       120       130       140       150       160
pF1KE5 YGPPVR-TEYRLIVENLSSRCSWQDLKDFMRQAGEVTYADAHKERTNEGVIEFRSYSDMK
       :::: : .: :..: .:    ::::::: ::.::.: :::.... :  ::.::    ::
CCDS11 YGPPSRRSENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGT--GVVEFVRKEDMT
             120       130       140       150         160

              170       180       190       200       210       220
pF1KE5 RALDKLDGTEINGRNIRLIEDKPRTSHRRSYSGSRSRSRSRRRSRSRSRRSSRSRSRSIS
        :. :::.:.. ... .    . ...  :: : .::::::: ::::::: .::::: :
CCDS11 YAVRKLDNTKFRSHEGETAYIRVKVDGPRSPSYGRSRSRSRSRSRSRSRSNSRSRSYSPR
     170       180       190       200       210       220

               230       240       250       260       270
pF1KE5 KSRSRSR-SRSKGRSRSRSKGRKSRSKSKSKPKSDRGSHSHSRSRSKDEYEKSRSRSRSR
       .::.  : :  ..:::::.
CCDS11 RSRGSPRYSPRHSRSRSRT
     230       240


344 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov  8 04:06:09 2016 done: Tue Nov  8 04:06:10 2016
 Total Scan time:  2.420 Total Display time:  0.000

Function used was FASTA [36.3.4 Apr, 2011]