FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7892, 653 aa 1>>>pF1KB7892 653 - 653 aa - 653 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.4601+/-0.00115; mu= -3.7374+/- 0.070 mean_var=548.3162+/-113.555, 0's: 0 Z-trim(116.0): 72 B-trim: 427 in 1/54 Lambda= 0.054772 statistics sampled from 16534 (16599) to 16534 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.774), E-opt: 0.2 (0.51), width: 16 Scan time: 3.540 The best scores are: opt bits E(32554) CCDS683.1 FUBP1 gene_id:8880|Hs108|chr1 ( 644) 4546 374.1 3.2e-103 CCDS45936.1 KHSRP gene_id:8570|Hs108|chr19 ( 711) 2317 198.1 3.6e-50 CCDS43893.1 FUBP3 gene_id:8939|Hs108|chr9 ( 572) 1929 167.3 5.3e-41 >>CCDS683.1 FUBP1 gene_id:8880|Hs108|chr1 (644 aa) initn: 3958 init1: 3958 opt: 4546 Z-score: 1966.6 bits: 374.1 E(32554): 3.2e-103 Smith-Waterman score: 4546; 99.8% identity (99.8% similar) in 642 aa overlap (1-641:1-642) 10 20 30 40 50 60 pF1KB7 MADYSTVPPPSSGSAGGGGGGGGGGGVNDAFKDALQRARQIAAKIGGDAGTSLNSNDYGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS68 MADYSTVPPPSSGSAGGGGGGGGGGGVNDAFKDALQRARQIAAKIGGDAGTSLNSNDYGY 10 20 30 40 50 60 70 80 90 100 110 pF1KB7 GGQKRPLEDGDQPDAKKVAPQNDSFGTQLPPMHQQQ-RSVMTEEYKVPDGMVGFIIGRGG :::::::::::::::::::::::::::::::::::: ::::::::::::::::::::::: CCDS68 GGQKRPLEDGDQPDAKKVAPQNDSFGTQLPPMHQQQSRSVMTEEYKVPDGMVGFIIGRGG 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB7 EQISRIQQESGCKIQIAPDSGGLPERSCMLTGTPESVQSAKRLLDQIVEKGRPAPGFHHG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS68 EQISRIQQESGCKIQIAPDSGGLPERSCMLTGTPESVQSAKRLLDQIVEKGRPAPGFHHG 130 140 150 160 170 180 180 190 200 210 220 230 pF1KB7 DGPGNAVQEIMIPASKAGLVIGKGGETIKQLQERAGVKMVMIQDGPQNTGADKPLRITGD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS68 DGPGNAVQEIMIPASKAGLVIGKGGETIKQLQERAGVKMVMIQDGPQNTGADKPLRITGD 190 200 210 220 230 240 240 250 260 270 280 290 pF1KB7 PYKVQQAKEMVLELIRDQGGFREVRNEYGSRIGGNEGIDVPIPRFAVGIVIGRNGEMIKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS68 PYKVQQAKEMVLELIRDQGGFREVRNEYGSRIGGNEGIDVPIPRFAVGIVIGRNGEMIKK 250 260 270 280 290 300 300 310 320 330 340 350 pF1KB7 IQNDAGVRIQFKPDDGTTPERIAQITGPPDRCQHAAEIITDLLRSVQAGNPGGPGPGGRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS68 IQNDAGVRIQFKPDDGTTPERIAQITGPPDRCQHAAEIITDLLRSVQAGNPGGPGPGGRG 310 320 330 340 350 360 360 370 380 390 400 410 pF1KB7 RGRGQGNWNMGPPGGLQEFNFIVPTGKTGLIIGKGGETIKSISQQSGARIELQRNPPPNA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS68 RGRGQGNWNMGPPGGLQEFNFIVPTGKTGLIIGKGGETIKSISQQSGARIELQRNPPPNA 370 380 390 400 410 420 420 430 440 450 460 470 pF1KB7 DPNMKLFTIRGTPQQIDYARQLIEEKIGGPVNPLGPPVPHGPHGVPGPHGPPGPPGPGTP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS68 DPNMKLFTIRGTPQQIDYARQLIEEKIGGPVNPLGPPVPHGPHGVPGPHGPPGPPGPGTP 430 440 450 460 470 480 480 490 500 510 520 530 pF1KB7 MGPYNPAPYNPGPPGPAPHGPPAPYAPQGWGNAYPHWQQQAPPDPAKAGTDPNSAAWAAY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS68 MGPYNPAPYNPGPPGPAPHGPPAPYAPQGWGNAYPHWQQQAPPDPAKAGTDPNSAAWAAY 490 500 510 520 530 540 540 550 560 570 580 590 pF1KB7 YAHYYQQQAQPPPAAPAGAPTTTQTNGQGDQQNPAPAGQVDYTKAWEEYYKKMGQAVPAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS68 YAHYYQQQAQPPPAAPAGAPTTTQTNGQGDQQNPAPAGQVDYTKAWEEYYKKMGQAVPAP 550 560 570 580 590 600 600 610 620 630 640 650 pF1KB7 TGAPPGGQPDYSAAWAEYYRQQAAYYAQTSPQGMPQHPPAPQCRFDPASIELAL :::::::::::::::::::::::::::::::::::::::::: CCDS68 TGAPPGGQPDYSAAWAEYYRQQAAYYAQTSPQGMPQHPPAPQGQ 610 620 630 640 >>CCDS45936.1 KHSRP gene_id:8570|Hs108|chr19 (711 aa) initn: 1735 init1: 673 opt: 2317 Z-score: 1014.2 bits: 198.1 E(32554): 3.6e-50 Smith-Waterman score: 2828; 63.1% identity (78.6% similar) in 681 aa overlap (10-641:50-705) 10 20 30 pF1KB7 MADYSTVPPPSSGSAGGGG---GGGGGGGVNDAFKDALQ :..::::: . :::: : .::: ::.: CCDS45 GGGAGGAGGGPPPGPPGAGDRGGGGPGGGGPGGGSAGGPSQPPGGGGPGIRKDAFADAVQ 20 30 40 50 60 70 40 50 60 70 80 90 pF1KB7 RARQIAAKIGGDAGTSLNSN--DYGYGGQKRPLEDGDQPDAKKVAPQNDSFGTQLPPMHQ :::::::::::::.:..:.. :.:.::::: :::::::..::.: :.::...:: :.: CCDS45 RARQIAAKIGGDAATTVNNSTPDFGFGGQKRQLEDGDQPESKKLASQGDSISSQLGPIHP 80 90 100 110 120 130 100 110 120 130 140 150 pF1KB7 QQRSVMTEEYKVPDGMVGFIIGRGGEQISRIQQESGCKIQIAPDSGGLPERSCMLTGTPE :. :::::.:::::::.:::::::::..:::.::::.::.:::::::::: :::.:: CCDS45 PPRTSMTEEYRVPDGMVGLIIGRGGEQINKIQQDSGCKVQISPDSGGLPERSVSLTGAPE 140 150 160 170 180 190 160 170 180 190 200 210 pF1KB7 SVQSAKRLLDQIVEKGRPAP-GFHHGD---GPGNAVQEIMIPASKAGLVIGKGGETIKQL :::.:: .::.:: .:: .: : : . : ...::::::::.:::::::::::::::: CCDS45 SVQKAKMMLDDIVSRGRGGPPGQFHDNANGGQNGTVQEIMIPAGKAGLVIGKGGETIKQL 200 210 220 230 240 250 220 230 240 250 260 pF1KB7 QERAGVKMVMIQDGPQNTGADKPLRITGDPYKVQQAKEMVLELIR--DQGGFREVRNEYG ::::::::..:::: :::..:::::: ::::::::: :::....: ::::: . ::::: CCDS45 QERAGVKMILIQDGSQNTNVDKPLRIIGDPYKVQQACEMVMDILRERDQGGFGD-RNEYG 260 270 280 290 300 310 270 280 290 300 310 320 pF1KB7 SRIGGNEGIDVPIPRFAVGIVIGRNGEMIKKIQNDAGVRIQFKPDDGTTPERIAQITGPP ::::: :::::.:: .::.::::.:::::::::::::::::: :::: ::.::.: ::: CCDS45 SRIGG--GIDVPVPRHSVGVVIGRSGEMIKKIQNDAGVRIQFKQDDGTGPEKIAHIMGPP 320 330 340 350 360 370 330 340 350 360 370 380 pF1KB7 DRCQHAAEIITDLLRSVQAGNPGGPG-----PGGRGRGRGQGNWNMGPPGGLQEFNFIVP :::.:::.::.:::.:...: :: :: ::::::::::::: ::::: :..: .: CCDS45 DRCEHAARIINDLLQSLRSGPPGPPGGPGMPPGGRGRGRGQGNW--GPPGG--EMTFSIP 380 390 400 410 420 430 390 400 410 420 430 440 pF1KB7 TGKTGLIIGKGGETIKSISQQSGARIELQRNPPPNADPNMKLFTIRGTPQQIDYARQLIE : : ::.::.:::..:.:.::.:: .:..:. :::.:::.::: :::.:::::.:.:::: CCDS45 THKCGLVIGRGGENVKAINQQTGAFVEISRQLPPNGDPNFKLFIIRGSPQQIDHAKQLIE 440 450 460 470 480 490 450 460 470 480 490 500 pF1KB7 EKIGGPVNPLGPPVPHGPHGVPGPHGPPGPPGPGTPMGPYNPAPYNPGPPGPAPHG---P ::: ::. :.:: :: : :::. ::::.::.:.: :::: ::. : CCDS45 EKIEGPLCPVGP----GPGG----------PGPAGPMGPFNPGPFNQGPPGAPPHAGGPP 500 510 520 530 510 520 530 540 550 pF1KB7 PAPYAPQGWGNAYPHWQQQAPPDPAKAGT---DPNSAAWAAYYAHYYQQQAQPPPAAPAG : : ::::::.::.:: :: ::.::.. ::: :::::::.::::: :: .:. CCDS45 PHQYPPQGWGNTYPQWQPPAPHDPSKAAAAAADPN-AAWAAYYSHYYQQ---PPGPVPGP 540 550 560 570 580 590 560 570 580 590 pF1KB7 APTTTQTNGQGDQQNPAPAGQVDYTKAWEEYYKKMGQAVPAP------------------ ::. . .::. .: :.:: ::::::::::::.:: : CCDS45 APAPAAPPAQGEPPQPPPTGQSDYTKAWEEYYKKIGQQPQQPGAPPQQDYTKAWEEYYKK 600 610 620 630 640 650 600 610 620 630 640 650 pF1KB7 ---------TGAPPGGQPDYSAAWAEYYRQQAAYYAQTSPQGMPQHPPAPQCRFDPASIE :::::.:::::::::::::::::::.:: : :: ::. : CCDS45 QAQVATGGGPGAPPGSQPDYSAAWAEYYRQQAAYYGQTPGPGGPQPPPTQQGQQQAQ 660 670 680 690 700 710 pF1KB7 LAL >>CCDS43893.1 FUBP3 gene_id:8939|Hs108|chr9 (572 aa) initn: 2020 init1: 948 opt: 1929 Z-score: 849.6 bits: 167.3 E(32554): 5.3e-41 Smith-Waterman score: 2088; 55.2% identity (71.9% similar) in 623 aa overlap (25-631:13-564) 10 20 30 40 50 pF1KB7 MADYSTVPPPSSGSAGGGGGGGGGGGVNDAFKDALQRARQIAAKIGGDAGTSLNSND--- : ..: :::.:.::::::: :. ::.. CCDS43 MAELVQGQSAPVGMKAEGFVDALHRVRQIAAKI--DSIPHLNNSTPLV 10 20 30 40 60 70 80 90 100 110 pF1KB7 ----YGYGGQKRPLEDGDQPDAKKVAPQNDSFGTQLPPMHQQQRSVMTEEYKVPDGMVGF :::: :::::.:: :.:: . .::.:.:::.:::: :::: CCDS43 DPSVYGYGVQKRPLDDG--------------VGNQLGAL-VHQRTVITEEFKVPDKMVGF 50 60 70 80 90 120 130 140 150 160 170 pF1KB7 IIGRGGEQISRIQQESGCKIQIAPDSGGLPERSCMLTGTPESVQSAKRLLDQIVEKGRPA ::::::::::::: ::::::::: .:.:.::: :.:::::::...::::: :::.. : . CCDS43 IIGRGGEQISRIQAESGCKIQIASESSGIPERPCVLTGTPESIEQAKRLLGQIVDRCRNG 100 110 120 130 140 150 180 190 200 210 220 230 pF1KB7 PGFHHGDGPGNAVQEIMIPASKAGLVIGKGGETIKQLQERAGVKMVMIQDGPQNTGADKP ::::. ....:::.:::::.:::::.:::::::::::.::::::::::: :::::: CCDS43 PGFHNDIDSNSTIQEILIPASKVGLVIGRGGETIKQLQERTGVKMVMIQDGPLPTGADKP 160 170 180 190 200 210 240 250 260 270 280 290 pF1KB7 LRITGDPYKVQQAKEMVLELIR--DQGGFREVRNEYGSRIGGNEGIDVPIPRFAVGIVIG :::::: .:::::.:::::.:: ::. :: ::....::.::. .:.: .:::::::::: CCDS43 LRITGDAFKVQQAREMVLEIIREKDQADFRGVRGDFNSRMGGG-SIEVSVPRFAVGIVIG 220 230 240 250 260 270 300 310 320 330 340 350 pF1KB7 RNGEMIKKIQNDAGVRIQFKPDDGTTPERIAQITGPPDRCQHAAEIITDLLRSVQAGNPG :::::::::::::::::::::::: .::: ::. ::::::::::.::..:. ..: . CCDS43 RNGEMIKKIQNDAGVRIQFKPDDGISPERAAQVMGPPDRCQHAAHIISELILTAQERDGF 280 290 300 310 320 330 360 370 380 390 400 410 pF1KB7 GPGPGGRGRGRGQGNWNMGPPGGLQEFNFIVPTGKTGLIIGKGGETIKSISQQSGARIEL : ..::::::.:.:..: :::.::... ::. : ::.::::::.::::.:::::..:: CCDS43 GGLAAARGRGRGRGDWSVGAPGGVQEITYTVPADKCGLVIGKGGENIKSINQQSGAHVEL 340 350 360 370 380 390 420 430 440 450 460 470 pF1KB7 QRNPPPNADPNMKLFTIRGTPQQIDYARQLIEEKIGGPVNPLGPPVPHGPHGVPGPHGPP :::::::.:::.. :::::.::::. :::::.::.:: . :: : : : :: CCDS43 QRNPPPNSDPNLRRFTIRGVPQQIEVARQLIDEKVGG--TNLGAP---GAFGQSPFSQPP 400 410 420 430 440 480 490 500 510 520 pF1KB7 GPPGPGT--PMG----PYNPAPYNPGPPGPAPHGPPAPYAPQGWGNAYPHWQQQAPPDPA .:: .: : . : : : .: . :::: . ::::..: ::: . :. CCDS43 APPHQNTFPPRSSGCFPNMAAKVNGNPHSTPVSGPPA-FLTQGWGSTYQAWQQPTQQVPS 450 460 470 480 490 500 530 540 550 560 570 580 pF1KB7 KAGTDPNSAAWAAYYAHYYQQQAQPPPAAPAGAPTTTQTNGQGDQQNPAPAGQVDYTKAW ::.:: . : .:.::: CCDS43 --------------------QQSQPQSSQP------------------------NYSKAW 510 520 590 600 610 620 630 640 pF1KB7 EEYYKKMGQAVPA-PTGAPPGGQPDYSAAWAEYYRQQAAYYAQTSPQGMPQHPPAPQCRF :.::::...:. : : .. : :::. :::::::::.:.:.:: : CCDS43 EDYYKKQSHAASAAPQASSP---PDYTMAWAEYYRQQVAFYGQTLGQAQAHSQEQ 530 540 550 560 570 650 pF1KB7 DPASIELAL 653 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 15:51:33 2016 done: Mon Nov 7 15:51:34 2016 Total Scan time: 3.540 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]