FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3525, 249 aa 1>>>pF1KE3525 249 - 249 aa - 249 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8845+/-0.00109; mu= 11.9248+/- 0.065 mean_var=106.5547+/-22.320, 0's: 0 Z-trim(106.0): 175 B-trim: 181 in 1/49 Lambda= 0.124248 statistics sampled from 8508 (8724) to 8508 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.63), E-opt: 0.2 (0.268), width: 16 Scan time: 1.980 The best scores are: opt bits E(32554) CCDS5397.1 HNRNPA2B1 gene_id:3181|Hs108|chr7 ( 341) 1215 228.6 4.2e-60 CCDS43557.1 HNRNPA2B1 gene_id:3181|Hs108|chr7 ( 353) 1203 226.5 1.9e-59 CCDS41793.1 HNRNPA1 gene_id:3178|Hs108|chr12 ( 320) 1012 192.2 3.6e-49 CCDS44909.1 HNRNPA1 gene_id:3178|Hs108|chr12 ( 372) 1012 192.3 4e-49 CCDS82536.1 HNRNPA3 gene_id:220988|Hs108|chr2 ( 356) 996 189.4 2.8e-48 CCDS2273.1 HNRNPA3 gene_id:220988|Hs108|chr2 ( 378) 996 189.4 3e-48 CCDS31980.1 HNRNPA1L2 gene_id:144983|Hs108|chr13 ( 320) 986 187.5 9.1e-48 CCDS4193.1 HNRNPA0 gene_id:10949|Hs108|chr5 ( 305) 692 134.8 6.4e-32 CCDS9196.1 MSI1 gene_id:4440|Hs108|chr12 ( 362) 528 105.5 5.2e-23 CCDS3591.1 HNRNPD gene_id:3184|Hs108|chr4 ( 336) 527 105.3 5.5e-23 CCDS3590.1 HNRNPD gene_id:3184|Hs108|chr4 ( 306) 525 104.9 6.6e-23 CCDS3592.1 HNRNPD gene_id:3184|Hs108|chr4 ( 355) 525 104.9 7.4e-23 CCDS11597.1 MSI2 gene_id:124540|Hs108|chr17 ( 251) 517 103.4 1.6e-22 CCDS34310.1 HNRNPAB gene_id:3182|Hs108|chr5 ( 285) 517 103.4 1.7e-22 CCDS11596.1 MSI2 gene_id:124540|Hs108|chr17 ( 328) 517 103.5 1.9e-22 CCDS34309.1 HNRNPAB gene_id:3182|Hs108|chr5 ( 332) 517 103.5 1.9e-22 CCDS82168.1 MSI2 gene_id:124540|Hs108|chr17 ( 324) 511 102.4 3.9e-22 CCDS75153.1 HNRNPDL gene_id:9987|Hs108|chr4 ( 363) 490 98.7 5.8e-21 CCDS3593.1 HNRNPDL gene_id:9987|Hs108|chr4 ( 420) 490 98.7 6.5e-21 >>CCDS5397.1 HNRNPA2B1 gene_id:3181|Hs108|chr7 (341 aa) initn: 1215 init1: 1215 opt: 1215 Z-score: 1192.5 bits: 228.6 E(32554): 4.2e-60 Smith-Waterman score: 1215; 100.0% identity (100.0% similar) in 183 aa overlap (1-183:1-183) 10 20 30 40 50 60 pF1KE3 MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKRSRGFGFVTFSSM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKRSRGFGFVTFSSM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 AEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIKEDTEEHHLRDYF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 AEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIKEDTEEHHLRDYF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 EEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGHNAEVRKALSRQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 EEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGHNAEVRKALSRQE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 MQEDLEVAILEVAPVMEEEEEDMVVEDLDMATRVGATEVVMTTMEEEIMEVEITMILEII ::: CCDS53 MQEVQSSRSGRGGNFGFGDSRGGGGNFGPGPGSNFRGGSDGYGSGRGFGDGYNGYGGGPG 190 200 210 220 230 240 >>CCDS43557.1 HNRNPA2B1 gene_id:3181|Hs108|chr7 (353 aa) initn: 1203 init1: 1203 opt: 1203 Z-score: 1180.7 bits: 226.5 E(32554): 1.9e-59 Smith-Waterman score: 1203; 99.5% identity (100.0% similar) in 182 aa overlap (2-183:14-195) 10 20 30 40 pF1KE3 MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKR .:::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MEKTLETVPLERKKREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKR 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 SRGFGFVTFSSMAEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 SRGFGFVTFSSMAEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIK 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE3 EDTEEHHLRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 EDTEEHHLRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGH 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE3 NAEVRKALSRQEMQEDLEVAILEVAPVMEEEEEDMVVEDLDMATRVGATEVVMTTMEEEI ::::::::::::::: CCDS43 NAEVRKALSRQEMQEVQSSRSGRGGNFGFGDSRGGGGNFGPGPGSNFRGGSDGYGSGRGF 190 200 210 220 230 240 >>CCDS41793.1 HNRNPA1 gene_id:3178|Hs108|chr12 (320 aa) initn: 1012 init1: 1012 opt: 1012 Z-score: 996.2 bits: 192.2 E(32554): 3.6e-49 Smith-Waterman score: 1012; 80.4% identity (95.0% similar) in 179 aa overlap (3-181:8-186) 10 20 30 40 50 pF1KE3 MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKRSRGFGFV .: ::.:::::::::::::.::::...:::: :::::::::: .::::::::: CCDS41 MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVVMRDPNTKRSRGFGFV 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 TFSSMAEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIKEDTEEHH :.... :::::: ::::..:::::::::::.::.: .::::.::::.::::::::::::: CCDS41 TYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPGAHLTVKKIFVGGIKEDTEEHH 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE3 LRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGHNAEVRKA ::::::.::::..:::.::: :::::::.:::::::: :::::.:::::.:::: ::::: CCDS41 LRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIVIQKYHTVNGHNCEVRKA 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE3 LSRQEMQEDLEVAILEVAPVMEEEEEDMVVEDLDMATRVGATEVVMTTMEEEIMEVEITM ::.::: CCDS41 LSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGRGGNFSGRGGFGGSRGGGGYGGS 190 200 210 220 230 240 >>CCDS44909.1 HNRNPA1 gene_id:3178|Hs108|chr12 (372 aa) initn: 1012 init1: 1012 opt: 1012 Z-score: 995.4 bits: 192.3 E(32554): 4e-49 Smith-Waterman score: 1012; 80.4% identity (95.0% similar) in 179 aa overlap (3-181:8-186) 10 20 30 40 50 pF1KE3 MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKRSRGFGFV .: ::.:::::::::::::.::::...:::: :::::::::: .::::::::: CCDS44 MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVVMRDPNTKRSRGFGFV 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 TFSSMAEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIKEDTEEHH :.... :::::: ::::..:::::::::::.::.: .::::.::::.::::::::::::: CCDS44 TYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPGAHLTVKKIFVGGIKEDTEEHH 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE3 LRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGHNAEVRKA ::::::.::::..:::.::: :::::::.:::::::: :::::.:::::.:::: ::::: CCDS44 LRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIVIQKYHTVNGHNCEVRKA 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE3 LSRQEMQEDLEVAILEVAPVMEEEEEDMVVEDLDMATRVGATEVVMTTMEEEIMEVEITM ::.::: CCDS44 LSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGRGGNFSGRGGFGGSRGGGGYGGS 190 200 210 220 230 240 >>CCDS82536.1 HNRNPA3 gene_id:220988|Hs108|chr2 (356 aa) initn: 996 init1: 996 opt: 996 Z-score: 980.1 bits: 189.4 E(32554): 2.8e-48 Smith-Waterman score: 996; 79.4% identity (94.4% similar) in 180 aa overlap (3-182:7-186) 10 20 30 40 50 pF1KE3 MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKRSRGFGFVT .: ::.:::::::::::::..:::...:.:: :::::::::: .:::::::::: CCDS82 MEGHDPKEPEQLRKLFIGGLSFETTDDSLREHFEKWGTLTDCVVMRDPQTKRSRGFGFVT 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 FSSMAEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIKEDTEEHHL .: . :::::: ::::..:::::::::::.::.: :::::.::::.:::::::::::..: CCDS82 YSCVEEVDAAMCARPHKVDGRVVEPKRAVSREDSVKPGAHLTVKKIFVGGIKEDTEEYNL 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE3 RDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGHNAEVRKAL :::::.::::.:::.. ::::::::::.:::::::: :::::.:::::::::: ::.::: CCDS82 RDYFEKYGKIETIEVMEDRQSGKKRGFAFVTFDDHDTVDKIVVQKYHTINGHNCEVKKAL 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE3 SRQEMQEDLEVAILEVAPVMEEEEEDMVVEDLDMATRVGATEVVMTTMEEEIMEVEITMI :.:::: CCDS82 SKQEMQSAGSQRGRGGGSGNFMGRGGNFGGGGGNFGRGGNFGGRGGYGGGGGGSRGSYGG 190 200 210 220 230 240 >>CCDS2273.1 HNRNPA3 gene_id:220988|Hs108|chr2 (378 aa) initn: 996 init1: 996 opt: 996 Z-score: 979.8 bits: 189.4 E(32554): 3e-48 Smith-Waterman score: 996; 79.4% identity (94.4% similar) in 180 aa overlap (3-182:29-208) 10 20 30 pF1KE3 MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGK .: ::.:::::::::::::..:::...:.:: CCDS22 MEVKPPPGRPQPDSGRRRRRRGEEGHDPKEPEQLRKLFIGGLSFETTDDSLREHFEKWGT 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE3 LTDCVVMRDPASKRSRGFGFVTFSSMAEVDAAMAARPHSIDGRVVEPKRAVAREESGKPG :::::::::: .::::::::::.: . :::::: ::::..:::::::::::.::.: ::: CCDS22 LTDCVVMRDPQTKRSRGFGFVTYSCVEEVDAAMCARPHKVDGRVVEPKRAVSREDSVKPG 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE3 AHVTVKKLFVGGIKEDTEEHHLRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPV ::.::::.:::::::::::..::::::.::::.:::.. ::::::::::.:::::::: : CCDS22 AHLTVKKIFVGGIKEDTEEYNLRDYFEKYGKIETIEVMEDRQSGKKRGFAFVTFDDHDTV 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE3 DKIVLQKYHTINGHNAEVRKALSRQEMQEDLEVAILEVAPVMEEEEEDMVVEDLDMATRV ::::.:::::::::: ::.::::.:::: CCDS22 DKIVVQKYHTINGHNCEVKKALSKQEMQSAGSQRGRGGGSGNFMGRGGNFGGGGGNFGRG 190 200 210 220 230 240 >>CCDS31980.1 HNRNPA1L2 gene_id:144983|Hs108|chr13 (320 aa) initn: 986 init1: 986 opt: 986 Z-score: 971.0 bits: 187.5 E(32554): 9.1e-48 Smith-Waterman score: 986; 78.2% identity (93.9% similar) in 179 aa overlap (3-181:8-186) 10 20 30 40 50 pF1KE3 MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKRSRGFGFV .: ::.:::::::::::::.::::...:::: :::::::::: .::::::::: CCDS31 MSKSASPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVVMRDPNTKRSRGFGFV 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 TFSSMAEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIKEDTEEHH :.... :::::: . ::..:::::::::::.::.: .::::.::::.::::::::::::: CCDS31 TYATVEEVDAAMNTTPHKVDGRVVEPKRAVSREDSQRPGAHLTVKKIFVGGIKEDTEEHH 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE3 LRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGHNAEVRKA ::::::.::::..:::.::: :::::::.:::::::: :::::.:::::..::: ::::: CCDS31 LRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIVIQKYHTVKGHNCEVRKA 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE3 LSRQEMQEDLEVAILEVAPVMEEEEEDMVVEDLDMATRVGATEVVMTTMEEEIMEVEITM : .::: CCDS31 LPKQEMASASSSQRGRRGSGNFGGGRGDGFGGNDNFGRGGNFSGRGGFGGSCGGGGYGGS 190 200 210 220 230 240 >>CCDS4193.1 HNRNPA0 gene_id:10949|Hs108|chr5 (305 aa) initn: 865 init1: 683 opt: 692 Z-score: 686.5 bits: 134.8 E(32554): 6.4e-32 Smith-Waterman score: 692; 55.6% identity (83.1% similar) in 178 aa overlap (4-181:2-179) 10 20 30 40 50 60 pF1KE3 MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKRSRGFGFVTFSSM :. :. :::::::. .:.: .::...: .: ::::::. .: .:::: :::::.:.. CCDS41 MENSQLCKLFIGGLNVQTSESGLRGHFEAFGTLTDCVVVVNPQTKRSRCFGFVTYSNV 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 AEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIKEDTEEHHLRDYF :.:::::: ::..:: .:: ::::.::.:..::::. ::::::::.: :. : : ..: CCDS41 EEADAAMAASPHAVDGNTVELKRAVSREDSARPGAHAKVKKLFVGGLKGDVAEGDLIEHF 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE3 EEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGHNAEVRKALSRQE ..: .. :::.:.::::::::::: :..:: .:: .. :.: :.:: .::.::. ... CCDS41 SQFGTVEKAEIIADKQSGKKRGFGFVYFQNHDAADKAAVVKFHPIQGHRVEVKKAVPKED 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE3 MQEDLEVAILEVAPVMEEEEEDMVVEDLDMATRVGATEVVMTTMEEEIMEVEITMILEII . CCDS41 IYSGGGGGGSRSSRGGRGGRGRGGGRDQNGLSKGGGGGYNSYGGYGGGGGGGYNAYGGGG 180 190 200 210 220 230 >>CCDS9196.1 MSI1 gene_id:4440|Hs108|chr12 (362 aa) initn: 555 init1: 304 opt: 528 Z-score: 526.6 bits: 105.5 E(32554): 5.2e-23 Smith-Waterman score: 528; 43.6% identity (77.9% similar) in 172 aa overlap (10-181:21-190) 10 20 30 40 pF1KE3 MEREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVMRDPASKRS :.::::::..::.:.::.:. :.:.. .:.::::: .::: CCDS91 METDAPQPGLASPDSPHDPCKMFIGGLSWQTTQEGLREYFGQFGEVKECLVMRDPLTKRS 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 RGFGFVTFSSMAEVDAAMAARPHSIDGRVVEPKRAVAREESGKPGAHVTVKKLFVGGIKE :::::::: ..: :: ..: : .:.....:: : :. ..: . .::.::::.. CCDS91 RGFGFVTFMDQAGVDKVLAQSRHELDSKTIDPKVAFPRR--AQPKMVTRTKKIFVGGLSV 70 80 90 100 110 110 120 130 140 150 160 pF1KE3 DTEEHHLRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKIVLQKYHTINGHN .: . ...:::..::.: .. :. ....::::::::...: :.:. ..: ::.. CCDS91 NTTVEDVKQYFEQFGKVDDAMLMFDKTTNRHRGFGFVTFESEDIVEKVCEIHFHEINNKM 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE3 AEVRKALSRQEMQEDLEVAILEVAPVMEEEEEDMVVEDLDMATRVGATEVVMTTMEEEIM .: .:: .. : CCDS91 VECKKAQPKEVMSPTGSARGRSRVMPYGMDAFMLGIGMLGYPGFQATTYASRSYTGLAPG 180 190 200 210 220 230 >>CCDS3591.1 HNRNPD gene_id:3184|Hs108|chr4 (336 aa) initn: 489 init1: 264 opt: 527 Z-score: 526.1 bits: 105.3 E(32554): 5.5e-23 Smith-Waterman score: 527; 40.3% identity (77.9% similar) in 181 aa overlap (3-183:72-246) 10 20 30 pF1KE3 MEREKEQFRKLFIGGLSFETTEESLRNYYEQW ...:. :.::::::..::...:..:. .. CCDS35 GSGAGTGGGTASGGTEGGSAESEGAKIDASKNEEDEGKMFIGGLSWDTTKKDLKDYFSKF 50 60 70 80 90 100 40 50 60 70 80 90 pF1KE3 GKLTDCVVMRDPASKRSRGFGFVTFSSMAEVDAAMAARPHSIDGRVVEPKRAVAREESGK :...::.. :: . :::::::: :. :: .: . :...:.:..:::: : . . . CCDS35 GEVVDCTLKLDPITGRSRGFGFVLFKESESVDKVMDQKEHKLNGKVIDPKRAKAMK-TKE 110 120 130 140 150 160 100 110 120 130 140 150 pF1KE3 PGAHVTVKKLFVGGIKEDTEEHHLRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHD : :::.::::.. :: :...:.:: .:....::. : ...:.::: :.:: ... CCDS35 P-----VKKIFVGGLSPDTPEEKIREYFGGFGEVESIELPMDNKTNKRRGFCFITFKEEE 170 180 190 200 210 160 170 180 190 200 210 pF1KE3 PVDKIVLQKYHTINGHNAEVRKALSRQEMQEDLEVAILEVAPVMEEEEEDMVVEDLDMAT :: ::. .:::... . :.. :.:....:. CCDS35 PVKKIMEKKYHNVGLSKCEIKVAMSKEQYQQQQQWGSRGGFAGRARGRGGGPSQNWNQGY 220 230 240 250 260 270 220 230 240 pF1KE3 RVGATEVVMTTMEEEIMEVEITMILEIITSNLLTTVQ CCDS35 SNYWNQGYGNYGYNSQGYGGYGGYDYTGYNNYYGYGDYSNQQSGYGKVSRRGGHQNSYKP 280 290 300 310 320 330 249 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 22:31:33 2016 done: Sat Nov 5 22:31:33 2016 Total Scan time: 1.980 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]