FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5280, 248 aa 1>>>pF1KE5280 248 - 248 aa - 248 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.4758+/-0.000919; mu= 7.7469+/- 0.056 mean_var=263.5957+/-56.139, 0's: 0 Z-trim(114.4): 46 B-trim: 0 in 0/53 Lambda= 0.078996 statistics sampled from 14957 (15000) to 14957 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.778), E-opt: 0.2 (0.461), width: 16 Scan time: 1.910 The best scores are: opt bits E(32554) CCDS11600.1 SRSF1 gene_id:6426|Hs108|chr17 ( 248) 1724 208.8 2.9e-54 CCDS58580.1 SRSF1 gene_id:6426|Hs108|chr17 ( 201) 1307 161.1 5.1e-40 CCDS9199.1 SRSF9 gene_id:8683|Hs108|chr12 ( 221) 977 123.6 1.1e-28 CCDS13318.1 SRSF6 gene_id:6431|Hs108|chr20 ( 344) 598 80.6 1.5e-15 CCDS333.1 SRSF4 gene_id:6429|Hs108|chr1 ( 494) 599 81.0 1.7e-15 CCDS32109.1 SRSF5 gene_id:6430|Hs108|chr14 ( 272) 593 79.9 1.9e-15 >>CCDS11600.1 SRSF1 gene_id:6426|Hs108|chr17 (248 aa) initn: 1724 init1: 1724 opt: 1724 Z-score: 1087.7 bits: 208.8 E(32554): 2.9e-54 Smith-Waterman score: 1724; 100.0% identity (100.0% similar) in 248 aa overlap (1-248:1-248) 10 20 30 40 50 60 pF1KE5 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 RSHEGETAYIRVKVDGPRSPSYGRSRSRSRSRSRSRSRSNSRSRSYSPRRSRGSPRYSPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 RSHEGETAYIRVKVDGPRSPSYGRSRSRSRSRSRSRSRSNSRSRSYSPRRSRGSPRYSPR 190 200 210 220 230 240 pF1KE5 HSRSRSRT :::::::: CCDS11 HSRSRSRT >>CCDS58580.1 SRSF1 gene_id:6426|Hs108|chr17 (201 aa) initn: 1303 init1: 1303 opt: 1307 Z-score: 831.9 bits: 161.1 E(32554): 5.1e-40 Smith-Waterman score: 1307; 96.9% identity (98.4% similar) in 192 aa overlap (1-192:1-190) 10 20 30 40 50 60 pF1KE5 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 RSHEGETAYIRVKVDGPRSPSYGRSRSRSRSRSRSRSRSNSRSRSYSPRRSRGSPRYSPR :::: ..: :. CCDS58 RSHE--VGYTRILFFDQNWIQWS 190 200 >>CCDS9199.1 SRSF9 gene_id:8683|Hs108|chr12 (221 aa) initn: 1019 init1: 549 opt: 977 Z-score: 628.2 bits: 123.6 E(32554): 1.1e-28 Smith-Waterman score: 1004; 66.8% identity (78.8% similar) in 241 aa overlap (1-239:1-217) 10 20 30 40 50 60 pF1KE5 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE ::: . :: :..: :::::::: :.: ::.::.::::: ::.:.::::.: :::::. CCDS91 MSGWADERG--GEGDGRIYVGNLPTDVREKDLEDLFYKYGRIREIELKNRHGLVPFAFVR 10 20 30 40 50 70 80 90 100 110 pF1KE5 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRG-RYGPPSRRS ::::::::::.:::.:::: ::::::::. :: :: ::: : :::.::: CCDS91 FEDPRDAEDAIYGRNGYDYGQCRLRVEFPRTY---------GGRGGWPRGGRNGPPTRRS 60 70 80 90 100 120 130 140 150 160 170 pF1KE5 ENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTK . ::.:::::::::::::::::::::::::::: .::.:.::..::::: ::.::::.:: CCDS91 DFRVLVSGLPPSGSWQDLKDHMREAGDVCYADVQKDGVGMVEYLRKEDMEYALRKLDDTK 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE5 FRSHEGETAYIRVKVDGPRSPSYGRSRSRSRSRSRSRSRSNSRSRSYSPRRSRGSPRY-S ::::::::.:::: . :: ::: ::::: ::.:. :: .:::::.: : CCDS91 FRSHEGETSYIRVYPE--RSTSYGYSRSRSGSRGRD-----------SPYQSRGSPHYFS 170 180 190 200 210 240 pF1KE5 PRHSRSRSRT : CCDS91 PFRPY 220 >>CCDS13318.1 SRSF6 gene_id:6431|Hs108|chr20 (344 aa) initn: 1280 init1: 210 opt: 598 Z-score: 392.7 bits: 80.6 E(32554): 1.5e-15 Smith-Waterman score: 598; 46.9% identity (65.8% similar) in 243 aa overlap (17-248:3-238) 10 20 30 40 50 60 pF1KE5 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE :.:.: : ..: :::. : :: . ..:::: : ::: CCDS13 MPRVYIGRLSYNVREKDIQRFFSGYGRLLEVDLKNGYG-----FVE 10 20 30 40 70 80 90 100 110 pF1KE5 FEDPRDAEDAVYGRDGYDYDGYRLRVEF---PRSGR-GTGRGGGGGGGGGAPR---GR-- ::: :::.:::: .: . : :. :: :: : : . :. .:::: . : :: CCDS13 FEDSRDADDAVYELNGKELCGERVIVEHARGPRRDRDGYSYGSRSGGGGYSSRRTSGRDK 50 60 70 80 90 100 120 130 140 150 160 pF1KE5 YGPPSRRSENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGT--GVVEFVRKEDMT :::: : .: :..: .: ::::::: ::.::.: :::.... : ::.:: :: CCDS13 YGPPVR-TEYRLIVENLSSRCSWQDLKDFMRQAGEVTYADAHKERTNEGVIEFRSYSDMK 110 120 130 140 150 160 170 180 190 200 210 220 pF1KE5 YAVRKLDNTKFRSHEGETAYIRVKVDGPRSPSYGRSRSRSRSRSRSRSRSNSRSRSYSPR :. :::.:.. ... . . ... :: : .::::::: ::::::: .::::: : CCDS13 RALDKLDGTEINGRNIRLIEDKPRTSHRRSYSGSRSRSRSRRRSRSRSRRSSRSRSRSIS 170 180 190 200 210 220 230 240 pF1KE5 RSRGSPRYSPRHSRSRSRT .::. : : ..:::::. CCDS13 KSRSRSR-SRSKGRSRSRSKGRKSRSKSKSKPKSDRGSHSHSRSRSKDEYEKSRSRSRSR 230 240 250 260 270 >>CCDS333.1 SRSF4 gene_id:6429|Hs108|chr1 (494 aa) initn: 712 init1: 184 opt: 599 Z-score: 391.7 bits: 81.0 E(32554): 1.7e-15 Smith-Waterman score: 599; 46.5% identity (65.1% similar) in 241 aa overlap (17-248:3-232) 10 20 30 40 50 60 pF1KE5 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE :.:.: : . : .:.: : :: : ..:::: : ::: CCDS33 MPRVYIGRLSYQARERDVERFFKGYGKILEVDLKNGYG-----FVE 10 20 30 40 70 80 90 100 110 pF1KE5 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGG---APRGRYGPPSR :.: :::.:::: .: : : :. :: :. : : :.: .: : . : .::::.: CCDS33 FDDLRDADDAVYELNGKDLCGERVIVEHARGPRRDGSYGSGRSGYGYRRSGRDKYGPPTR 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE5 RSENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVY--RDGTGVVEFVRKEDMTYAVRKL .: :..: .: :::::::.::.::.: :::.. : . ::.::: :: :..:: CCDS33 -TEYRLIVENLSSRCSWQDLKDYMRQAGEVTYADAHKGRKNEGVIEFVSYSDMKRALEKL 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE5 DNTKFRSHEGETAYIRVKVDGP---RSPSYGRSRSRSRSRSRSRSRSNSRSRSYSPRRSR :.:. ... ::. : : : ::.::::.:::::::: .::::: : . :. CCDS33 DGTEVNGRK-----IRLVEDKPGSRRRRSYSRSRSHSRSRSRSRHSRKSRSRSGSSKSSH 170 180 190 200 210 240 pF1KE5 GSPRYSPRH-SRSRSRT .. : : :::::.. CCDS33 SKSRSRSRSGSRSRSKSRSRSQSRSRSKKEKSRSPSKEKSRSRSHSAGKSRSKSKDQAEE 220 230 240 250 260 270 >>CCDS32109.1 SRSF5 gene_id:6430|Hs108|chr14 (272 aa) initn: 304 init1: 238 opt: 593 Z-score: 390.7 bits: 79.9 E(32554): 1.9e-15 Smith-Waterman score: 593; 47.3% identity (67.1% similar) in 243 aa overlap (16-248:4-233) 10 20 30 40 50 60 pF1KE5 MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE ::...: : : : ::.: : :: ::::::: :: :.::: CCDS32 MSGCRVFIGRLNPAAREKDVERFFKGYGRIRDIDLK--RG---FGFVE 10 20 30 40 70 80 90 100 110 pF1KE5 FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRS-GRGTGRGGGGGG---GGGAPRG-RYGPP :::::::.:::: :: . . :. .: :. .:: ::: : . .. ::. : . : CCDS32 FEDPRDADDAVYELDGKELCSERVTIEHARARSRG-GRGRGRYSDRFSSRRPRNDRRNAP 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE5 SRRSENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRD--GTGVVEFVRKEDMTYAVR :.:::..: .: ::::::: ::.::.: .::..: . :::::. :. :.. CCDS32 PVRTENRLIVENLSSRVSWQDLKDFMRQAGEVTFADAHRPKLNEGVVEFASYGDLKNAIE 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE5 KLDNTKFRSHEGETAYIRVKVDGPRSPSYGRSRSRSRSRSRSRSRSNSRSRS---YSPRR ::.. .. ... :.. ..: . : .:::::::.:: ::::: ::::: :: : CCDS32 KLSGKEINGRK-----IKL-IEGSKRHSRSRSRSRSRTRSSSRSRSRSRSRSRKSYSRSR 170 180 190 200 210 240 pF1KE5 SRGSPRYSPRHSRSRSRT ::. : : .::: ::. CCDS32 SRSRSR-SRSKSRSVSRSPVPEKSQKRGSSSRSKSPASVDRQRSRSRSRSRSVDSGN 220 230 240 250 260 270 248 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 23:22:33 2016 done: Mon Nov 7 23:22:33 2016 Total Scan time: 1.910 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]