FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2201, 711 aa 1>>>pF1KE2201 711 - 711 aa - 711 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.0405+/-0.00113; mu= 0.2268+/- 0.069 mean_var=592.3961+/-120.815, 0's: 0 Z-trim(117.4): 51 B-trim: 0 in 0/54 Lambda= 0.052695 statistics sampled from 18046 (18092) to 18046 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.793), E-opt: 0.2 (0.556), width: 16 Scan time: 3.500 The best scores are: opt bits E(32554) CCDS45936.1 KHSRP gene_id:8570|Hs108|chr19 ( 711) 5155 407.0 5e-113 CCDS683.1 FUBP1 gene_id:8880|Hs108|chr1 ( 644) 2330 192.2 2.1e-48 CCDS43893.1 FUBP3 gene_id:8939|Hs108|chr9 ( 572) 1591 135.9 1.6e-31 >>CCDS45936.1 KHSRP gene_id:8570|Hs108|chr19 (711 aa) initn: 5155 init1: 5155 opt: 5155 Z-score: 2142.7 bits: 407.0 E(32554): 5e-113 Smith-Waterman score: 5155; 100.0% identity (100.0% similar) in 711 aa overlap (1-711:1-711) 10 20 30 40 50 60 pF1KE2 MSDYSTGGPPPGPPPPAGGGGGAGGAGGGPPPGPPGAGDRGGGGPGGGGPGGGSAGGPSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MSDYSTGGPPPGPPPPAGGGGGAGGAGGGPPPGPPGAGDRGGGGPGGGGPGGGSAGGPSQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 PPGGGGPGIRKDAFADAVQRARQIAAKIGGDAATTVNNSTPDFGFGGQKRQLEDGDQPES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PPGGGGPGIRKDAFADAVQRARQIAAKIGGDAATTVNNSTPDFGFGGQKRQLEDGDQPES 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 KKLASQGDSISSQLGPIHPPPRTSMTEEYRVPDGMVGLIIGRGGEQINKIQQDSGCKVQI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 KKLASQGDSISSQLGPIHPPPRTSMTEEYRVPDGMVGLIIGRGGEQINKIQQDSGCKVQI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 SPDSGGLPERSVSLTGAPESVQKAKMMLDDIVSRGRGGPPGQFHDNANGGQNGTVQEIMI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 SPDSGGLPERSVSLTGAPESVQKAKMMLDDIVSRGRGGPPGQFHDNANGGQNGTVQEIMI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 PAGKAGLVIGKGGETIKQLQERAGVKMILIQDGSQNTNVDKPLRIIGDPYKVQQACEMVM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PAGKAGLVIGKGGETIKQLQERAGVKMILIQDGSQNTNVDKPLRIIGDPYKVQQACEMVM 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 DILRERDQGGFGDRNEYGSRIGGGIDVPVPRHSVGVVIGRSGEMIKKIQNDAGVRIQFKQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 DILRERDQGGFGDRNEYGSRIGGGIDVPVPRHSVGVVIGRSGEMIKKIQNDAGVRIQFKQ 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 DDGTGPEKIAHIMGPPDRCEHAARIINDLLQSLRSGPPGPPGGPGMPPGGRGRGRGQGNW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 DDGTGPEKIAHIMGPPDRCEHAARIINDLLQSLRSGPPGPPGGPGMPPGGRGRGRGQGNW 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE2 GPPGGEMTFSIPTHKCGLVIGRGGENVKAINQQTGAFVEISRQLPPNGDPNFKLFIIRGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GPPGGEMTFSIPTHKCGLVIGRGGENVKAINQQTGAFVEISRQLPPNGDPNFKLFIIRGS 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE2 PQQIDHAKQLIEEKIEGPLCPVGPGPGGPGPAGPMGPFNPGPFNQGPPGAPPHAGGPPPH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PQQIDHAKQLIEEKIEGPLCPVGPGPGGPGPAGPMGPFNPGPFNQGPPGAPPHAGGPPPH 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE2 QYPPQGWGNTYPQWQPPAPHDPSKAAAAAADPNAAWAAYYSHYYQQPPGPVPGPAPAPAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 QYPPQGWGNTYPQWQPPAPHDPSKAAAAAADPNAAWAAYYSHYYQQPPGPVPGPAPAPAA 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE2 PPAQGEPPQPPPTGQSDYTKAWEEYYKKIGQQPQQPGAPPQQDYTKAWEEYYKKQAQVAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PPAQGEPPQPPPTGQSDYTKAWEEYYKKIGQQPQQPGAPPQQDYTKAWEEYYKKQAQVAT 610 620 630 640 650 660 670 680 690 700 710 pF1KE2 GGGPGAPPGSQPDYSAAWAEYYRQQAAYYGQTPGPGGPQPPPTQQGQQQAQ ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GGGPGAPPGSQPDYSAAWAEYYRQQAAYYGQTPGPGGPQPPPTQQGQQQAQ 670 680 690 700 710 >>CCDS683.1 FUBP1 gene_id:8880|Hs108|chr1 (644 aa) initn: 1098 init1: 673 opt: 2330 Z-score: 982.5 bits: 192.2 E(32554): 2.1e-48 Smith-Waterman score: 2831; 63.3% identity (78.7% similar) in 684 aa overlap (50-707:10-644) 20 30 40 50 60 70 pF1KE2 GGGAGGAGGGPPPGPPGAGDRGGGGPGGGGPGGGSAGGPSQPPGGGGPGIRKDAFADAVQ :..::::: . :::: : .::: ::.: CCDS68 MADYSTVPPPSSGSAGGGG---GGGGGGGVNDAFKDALQ 10 20 30 80 90 100 110 120 130 pF1KE2 RARQIAAKIGGDAATTVNNSTPDFGFGGQKRQLEDGDQPESKKLASQGDSISSQLGPIHP :::::::::::::.:..:.. :.:.::::: :::::::..::.: :.::...:: :.: CCDS68 RARQIAAKIGGDAGTSLNSN--DYGYGGQKRPLEDGDQPDAKKVAPQNDSFGTQLPPMHQ 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE2 PP-RTSMTEEYRVPDGMVGLIIGRGGEQINKIQQDSGCKVQISPDSGGLPERSVSLTGAP :. :::::.:::::::.:::::::::..:::.::::.::.:::::::::: :::.: CCDS68 QQSRSVMTEEYKVPDGMVGFIIGRGGEQISRIQQESGCKIQIAPDSGGLPERSCMLTGTP 100 110 120 130 140 150 200 210 220 230 240 250 pF1KE2 ESVQKAKMMLDDIVSRGRGGPPGQFHDNANGGQNGTVQEIMIPAGKAGLVIGKGGETIKQ ::::.:: .::.:: .:: .: :: . : ...::::::::.::::::::::::::: CCDS68 ESVQSAKRLLDQIVEKGRPAPG--FHHG--DGPGNAVQEIMIPASKAGLVIGKGGETIKQ 160 170 180 190 200 210 260 270 280 290 300 310 pF1KE2 LQERAGVKMILIQDGSQNTNVDKPLRIIGDPYKVQQACEMVMDILRERDQGGFGD-RNEY :::::::::..:::: :::..:::::: ::::::::: :::.... :::::: . :::: CCDS68 LQERAGVKMVMIQDGPQNTGADKPLRITGDPYKVQQAKEMVLELI--RDQGGFREVRNEY 220 230 240 250 260 320 330 340 350 360 370 pF1KE2 GSRIGG--GIDVPVPRHSVGVVIGRSGEMIKKIQNDAGVRIQFKQDDGTGPEKIAHIMGP :::::: :::::.:: .::.::::.:::::::::::::::::: :::: ::.::.: :: CCDS68 GSRIGGNEGIDVPIPRFAVGIVIGRNGEMIKKIQNDAGVRIQFKPDDGTTPERIAQITGP 270 280 290 300 310 320 380 390 400 410 420 430 pF1KE2 PDRCEHAARIINDLLQSLRSGPPGPPGGPGMPPGGRGRGRGQGNW--GPPGG--EMTFSI ::::.:::.::.:::.:...: :: ::: ::::::::::::: ::::: :..: . CCDS68 PDRCQHAAEIITDLLRSVQAGNPG---GPG--PGGRGRGRGQGNWNMGPPGGLQEFNFIV 330 340 350 360 370 380 440 450 460 470 480 490 pF1KE2 PTHKCGLVIGRGGENVKAINQQTGAFVEISRQLPPNGDPNFKLFIIRGSPQQIDHAKQLI :: : ::.::.:::..:.:.::.:: .:..:. :::.:::.::: :::.:::::.:.::: CCDS68 PTGKTGLIIGKGGETIKSISQQSGARIELQRNPPPNADPNMKLFTIRGTPQQIDYARQLI 390 400 410 420 430 440 500 510 520 530 pF1KE2 EEKIEGPLCPVGP----GPGG----------PGPAGPMGPFNPGPFNQGPPGAPPHAGGP :::: ::. :.:: :: : :::. ::::.::.:.: :::: ::. CCDS68 EEKIGGPVNPLGPPVPHGPHGVPGPHGPPGPPGPGTPMGPYNPAPYNPGPPGPAPHG--- 450 460 470 480 490 500 540 550 560 570 580 590 pF1KE2 PPHQYPPQGWGNTYPQWQPPAPHDPSKAAAAAADPN-AAWAAYYSHYYQQ---PPGPVPG :: : ::::::.::.:: :: ::.::.. ::: :::::::.::::: :: .:. CCDS68 PPAPYAPQGWGNAYPHWQQQAPPDPAKAGT---DPNSAAWAAYYAHYYQQQAQPPPAAPA 510 520 530 540 550 600 610 620 630 640 650 pF1KE2 PAPAPAAPPAQGEPPQPPPTGQSDYTKAWEEYYKKIGQQPQQPGAPPQQDYTKAWEEYYK ::. . .::. .: :.:: ::::::::::::.:: : CCDS68 GAPTTTQTNGQGDQQNPAPAGQVDYTKAWEEYYKKMGQAVPAP----------------- 560 570 580 590 600 660 670 680 690 700 710 pF1KE2 KQAQVATGGGPGAPPGSQPDYSAAWAEYYRQQAAYYGQTPGPGGPQPPPTQQGQQQAQ :::::.:::::::::::::::::::.:: : :: ::. ::: CCDS68 ----------TGAPPGGQPDYSAAWAEYYRQQAAYYAQTSPQGMPQHPPAPQGQ 610 620 630 640 >>CCDS43893.1 FUBP3 gene_id:8939|Hs108|chr9 (572 aa) initn: 1613 init1: 381 opt: 1591 Z-score: 679.4 bits: 135.9 E(32554): 1.6e-31 Smith-Waterman score: 1872; 49.1% identity (71.5% similar) in 645 aa overlap (63-694:7-563) 40 50 60 70 80 90 pF1KE2 GPPGAGDRGGGGPGGGGPGGGSAGGPSQPPGGGGP-GIRKDAFADAVQRARQIAAKIGGD : ..: :.. ..:.::..:.::::::: : CCDS43 MAELVQGQSAPVGMKAEGFVDALHRVRQIAAKI--D 10 20 30 100 110 120 130 140 pF1KE2 AATTVNNSTP--D---FGFGGQKRQLEDGDQPESKKLASQGDSISSQLGP-IHPPPRTSM . .::::: : .:.: ::: :.:: ...::: .: :: . CCDS43 SIPHLNNSTPLVDPSVYGYGVQKRPLDDG--------------VGNQLGALVHQ--RTVI 40 50 60 70 150 160 170 180 190 200 pF1KE2 TEEYRVPDGMVGLIIGRGGEQINKIQQDSGCKVQISPDSGGLPERSVSLTGAPESVQKAK :::..::: :::.:::::::::..:: .::::.::. .:.:.::: :::.:::...:: CCDS43 TEEFKVPDKMVGFIIGRGGEQISRIQAESGCKIQIASESSGIPERPCVLTGTPESIEQAK 80 90 100 110 120 130 210 220 230 240 250 260 pF1KE2 MMLDDIVSRGRGGPPGQFHDNANGGQNGTVQEIMIPAGKAGLVIGKGGETIKQLQERAGV .: .::.: :.:: ::.. .. :.:.:::.:::.:.:::::.:::::::::::.:: CCDS43 RLLGQIVDRCRNGPG--FHNDIDS--NSTIQEILIPASKVGLVIGRGGETIKQLQERTGV 140 150 160 170 180 190 270 280 290 300 310 320 pF1KE2 KMILIQDGSQNTNVDKPLRIIGDPYKVQQACEMVMDILRERDQGGF-GDRNEYGSRIGGG ::..:::: :..:::::: :: .::::: :::..:.::.::. : : :....::.::: CCDS43 KMVMIQDGPLPTGADKPLRITGDAFKVQQAREMVLEIIREKDQADFRGVRGDFNSRMGGG 200 210 220 230 240 250 330 340 350 360 370 380 pF1KE2 -IDVPVPRHSVGVVIGRSGEMIKKIQNDAGVRIQFKQDDGTGPEKIAHIMGPPDRCEHAA :.: ::: .::.::::.:::::::::::::::::: ::: .::. :..:::::::.::: CCDS43 SIEVSVPRFAVGIVIGRNGEMIKKIQNDAGVRIQFKPDDGISPERAAQVMGPPDRCQHAA 260 270 280 290 300 310 390 400 410 420 430 pF1KE2 RIINDLLQSLRSGPPGPPGGPGMPPGGRGRGRGQGNW--GPPGG--EMTFSIPTHKCGLV .::..:. . . : : ..::::::.:.: : ::: :.:...:. ::::: CCDS43 HIISELILTAQERD-----GFGGLAAARGRGRGRGDWSVGAPGGVQEITYTVPADKCGLV 320 330 340 350 360 440 450 460 470 480 490 pF1KE2 IGRGGENVKAINQQTGAFVEISRQLPPNGDPNFKLFIIRGSPQQIDHAKQLIEEKIEGPL ::.::::.:.::::.:: ::..:. :::.:::.. : ::: ::::. :.:::.::. CCDS43 IGKGGENIKSINQQSGAHVELQRNPPPNSDPNLRRFTIRGVPQQIEVARQLIDEKV---- 370 380 390 400 410 420 500 510 520 530 540 550 pF1KE2 CPVGPGPGGPGPAGPMGPFNPGPFNQGPPGAPPHAGGPPPHQYPPQGWGNTYPQWQPPAP :: . ..: : :. .::.: :: :::: . .::.. : CCDS43 -------GGTNLGAP-GAFGQSPFSQ-PP-APPHQ-----NTFPPRSSGCF--------- 430 440 450 460 560 570 580 590 600 610 pF1KE2 HDPSKAAAAAADPNAAWAAYYSHYYQQPPGPVPGPAPAPAAPPAQGEPPQPPPTGQSDYT :. :: . ..:... :: :: : : .. CCDS43 --PNMAAKVNGNPHST--------------PVSGP-------------PAFLTQGWGSTY 470 480 490 620 630 640 650 660 670 pF1KE2 KAWEEYYKKIGQQPQQPGAPPQQDYTKAWEEYYKKQAQVATGGGPGAPPGSQPDYSAAWA .::.. ... .: .:: . : .:.::::.:::::...:... : : .: :::. ::: CCDS43 QAWQQPTQQVPSQQSQPQSS-QPNYSKAWEDYYKKQSHAASAA-PQA--SSPPDYTMAWA 500 510 520 530 540 680 690 700 710 pF1KE2 EYYRQQAAYYGQTPGPGGPQPPPTQQGQQQAQ ::::::.:.:::: : CCDS43 EYYRQQVAFYGQTLGQAQAHSQEQ 550 560 570 711 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 19:46:14 2016 done: Mon Nov 7 19:46:14 2016 Total Scan time: 3.500 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]