FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7892, 653 aa
1>>>pF1KB7892 653 - 653 aa - 653 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 11.4601+/-0.00115; mu= -3.7374+/- 0.070
mean_var=548.3162+/-113.555, 0's: 0 Z-trim(116.0): 72 B-trim: 427 in 1/54
Lambda= 0.054772
statistics sampled from 16534 (16599) to 16534 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.774), E-opt: 0.2 (0.51), width: 16
Scan time: 3.540
The best scores are:                                 opt  bits E(32554)
CCDS683.1 FUBP1 gene_id:8880|Hs108|chr1     ( 644)  4546 374.1 3.2e-103
CCDS45936.1 KHSRP gene_id:8570|Hs108|chr19  ( 711)  2317 198.1 3.6e-50
CCDS43893.1 FUBP3 gene_id:8939|Hs108|chr9   ( 572)  1929 167.3 5.3e-41
>>CCDS683.1 FUBP1 gene_id:8880|Hs108|chr1 (644 aa)
initn: 3958 init1: 3958 opt: 4546 Z-score: 1966.6 bits: 374.1 E(32554): 3.2e-103
Smith-Waterman score: 4546; 99.8% identity (99.8% similar) in 642 aa overlap (1-641:1-642)
10 20 30 40 50 60
pF1KB7 MADYSTVPPPSSGSAGGGGGGGGGGGVNDAFKDALQRARQIAAKIGGDAGTSLNSNDYGY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 MADYSTVPPPSSGSAGGGGGGGGGGGVNDAFKDALQRARQIAAKIGGDAGTSLNSNDYGY
10 20 30 40 50 60
70 80 90 100 110
pF1KB7 GGQKRPLEDGDQPDAKKVAPQNDSFGTQLPPMHQQQ-RSVMTEEYKVPDGMVGFIIGRGG
:::::::::::::::::::::::::::::::::::: :::::::::::::::::::::::
CCDS68 GGQKRPLEDGDQPDAKKVAPQNDSFGTQLPPMHQQQSRSVMTEEYKVPDGMVGFIIGRGG
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB7 EQISRIQQESGCKIQIAPDSGGLPERSCMLTGTPESVQSAKRLLDQIVEKGRPAPGFHHG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 EQISRIQQESGCKIQIAPDSGGLPERSCMLTGTPESVQSAKRLLDQIVEKGRPAPGFHHG
130 140 150 160 170 180
180 190 200 210 220 230
pF1KB7 DGPGNAVQEIMIPASKAGLVIGKGGETIKQLQERAGVKMVMIQDGPQNTGADKPLRITGD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 DGPGNAVQEIMIPASKAGLVIGKGGETIKQLQERAGVKMVMIQDGPQNTGADKPLRITGD
190 200 210 220 230 240
240 250 260 270 280 290
pF1KB7 PYKVQQAKEMVLELIRDQGGFREVRNEYGSRIGGNEGIDVPIPRFAVGIVIGRNGEMIKK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 PYKVQQAKEMVLELIRDQGGFREVRNEYGSRIGGNEGIDVPIPRFAVGIVIGRNGEMIKK
250 260 270 280 290 300
300 310 320 330 340 350
pF1KB7 IQNDAGVRIQFKPDDGTTPERIAQITGPPDRCQHAAEIITDLLRSVQAGNPGGPGPGGRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 IQNDAGVRIQFKPDDGTTPERIAQITGPPDRCQHAAEIITDLLRSVQAGNPGGPGPGGRG
310 320 330 340 350 360
360 370 380 390 400 410
pF1KB7 RGRGQGNWNMGPPGGLQEFNFIVPTGKTGLIIGKGGETIKSISQQSGARIELQRNPPPNA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 RGRGQGNWNMGPPGGLQEFNFIVPTGKTGLIIGKGGETIKSISQQSGARIELQRNPPPNA
370 380 390 400 410 420
420 430 440 450 460 470
pF1KB7 DPNMKLFTIRGTPQQIDYARQLIEEKIGGPVNPLGPPVPHGPHGVPGPHGPPGPPGPGTP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 DPNMKLFTIRGTPQQIDYARQLIEEKIGGPVNPLGPPVPHGPHGVPGPHGPPGPPGPGTP
430 440 450 460 470 480
480 490 500 510 520 530
pF1KB7 MGPYNPAPYNPGPPGPAPHGPPAPYAPQGWGNAYPHWQQQAPPDPAKAGTDPNSAAWAAY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 MGPYNPAPYNPGPPGPAPHGPPAPYAPQGWGNAYPHWQQQAPPDPAKAGTDPNSAAWAAY
490 500 510 520 530 540
540 550 560 570 580 590
pF1KB7 YAHYYQQQAQPPPAAPAGAPTTTQTNGQGDQQNPAPAGQVDYTKAWEEYYKKMGQAVPAP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 YAHYYQQQAQPPPAAPAGAPTTTQTNGQGDQQNPAPAGQVDYTKAWEEYYKKMGQAVPAP
550 560 570 580 590 600
600 610 620 630 640 650
pF1KB7 TGAPPGGQPDYSAAWAEYYRQQAAYYAQTSPQGMPQHPPAPQCRFDPASIELAL
::::::::::::::::::::::::::::::::::::::::::
CCDS68 TGAPPGGQPDYSAAWAEYYRQQAAYYAQTSPQGMPQHPPAPQGQ
610 620 630 640
>>CCDS45936.1 KHSRP gene_id:8570|Hs108|chr19 (711 aa)
initn: 1735 init1: 673 opt: 2317 Z-score: 1014.2 bits: 198.1 E(32554): 3.6e-50
Smith-Waterman score: 2828; 63.1% identity (78.6% similar) in 681 aa overlap (10-641:50-705)
10 20 30
pF1KB7 MADYSTVPPPSSGSAGGGG---GGGGGGGVNDAFKDALQ
:..::::: . :::: : .::: ::.:
CCDS45 GGGAGGAGGGPPPGPPGAGDRGGGGPGGGGPGGGSAGGPSQPPGGGGPGIRKDAFADAVQ
20 30 40 50 60 70
40 50 60 70 80 90
pF1KB7 RARQIAAKIGGDAGTSLNSN--DYGYGGQKRPLEDGDQPDAKKVAPQNDSFGTQLPPMHQ
:::::::::::::.:..:.. :.:.::::: :::::::..::.: :.::...:: :.:
CCDS45 RARQIAAKIGGDAATTVNNSTPDFGFGGQKRQLEDGDQPESKKLASQGDSISSQLGPIHP
80 90 100 110 120 130
100 110 120 130 140 150
pF1KB7 QQRSVMTEEYKVPDGMVGFIIGRGGEQISRIQQESGCKIQIAPDSGGLPERSCMLTGTPE
:. :::::.:::::::.:::::::::..:::.::::.::.:::::::::: :::.::
CCDS45 PPRTSMTEEYRVPDGMVGLIIGRGGEQINKIQQDSGCKVQISPDSGGLPERSVSLTGAPE
140 150 160 170 180 190
160 170 180 190 200 210
pF1KB7 SVQSAKRLLDQIVEKGRPAP-GFHHGD---GPGNAVQEIMIPASKAGLVIGKGGETIKQL
:::.:: .::.:: .:: .: : : . : ...::::::::.::::::::::::::::
CCDS45 SVQKAKMMLDDIVSRGRGGPPGQFHDNANGGQNGTVQEIMIPAGKAGLVIGKGGETIKQL
200 210 220 230 240 250
220 230 240 250 260
pF1KB7 QERAGVKMVMIQDGPQNTGADKPLRITGDPYKVQQAKEMVLELIR--DQGGFREVRNEYG
::::::::..:::: :::..:::::: ::::::::: :::....: ::::: . :::::
CCDS45 QERAGVKMILIQDGSQNTNVDKPLRIIGDPYKVQQACEMVMDILRERDQGGFGD-RNEYG
260 270 280 290 300 310
270 280 290 300 310 320
pF1KB7 SRIGGNEGIDVPIPRFAVGIVIGRNGEMIKKIQNDAGVRIQFKPDDGTTPERIAQITGPP
::::: :::::.:: .::.::::.:::::::::::::::::: :::: ::.::.: :::
CCDS45 SRIGG--GIDVPVPRHSVGVVIGRSGEMIKKIQNDAGVRIQFKQDDGTGPEKIAHIMGPP
320 330 340 350 360 370
330 340 350 360 370 380
pF1KB7 DRCQHAAEIITDLLRSVQAGNPGGPG-----PGGRGRGRGQGNWNMGPPGGLQEFNFIVP
:::.:::.::.:::.:...: :: :: ::::::::::::: ::::: :..: .:
CCDS45 DRCEHAARIINDLLQSLRSGPPGPPGGPGMPPGGRGRGRGQGNW--GPPGG--EMTFSIP
380 390 400 410 420 430
390 400 410 420 430 440
pF1KB7 TGKTGLIIGKGGETIKSISQQSGARIELQRNPPPNADPNMKLFTIRGTPQQIDYARQLIE
: : ::.::.:::..:.:.::.:: .:..:. :::.:::.::: :::.:::::.:.::::
CCDS45 THKCGLVIGRGGENVKAINQQTGAFVEISRQLPPNGDPNFKLFIIRGSPQQIDHAKQLIE
440 450 460 470 480 490
450 460 470 480 490 500
pF1KB7 EKIGGPVNPLGPPVPHGPHGVPGPHGPPGPPGPGTPMGPYNPAPYNPGPPGPAPHG---P
::: ::. :.:: :: : :::. ::::.::.:.: :::: ::. :
CCDS45 EKIEGPLCPVGP----GPGG----------PGPAGPMGPFNPGPFNQGPPGAPPHAGGPP
500 510 520 530
510 520 530 540 550
pF1KB7 PAPYAPQGWGNAYPHWQQQAPPDPAKAGT---DPNSAAWAAYYAHYYQQQAQPPPAAPAG
: : ::::::.::.:: :: ::.::.. ::: :::::::.::::: :: .:.
CCDS45 PHQYPPQGWGNTYPQWQPPAPHDPSKAAAAAADPN-AAWAAYYSHYYQQ---PPGPVPGP
540 550 560 570 580 590
560 570 580 590
pF1KB7 APTTTQTNGQGDQQNPAPAGQVDYTKAWEEYYKKMGQAVPAP------------------
::. . .::. .: :.:: ::::::::::::.:: :
CCDS45 APAPAAPPAQGEPPQPPPTGQSDYTKAWEEYYKKIGQQPQQPGAPPQQDYTKAWEEYYKK
600 610 620 630 640 650
600 610 620 630 640 650
pF1KB7 ---------TGAPPGGQPDYSAAWAEYYRQQAAYYAQTSPQGMPQHPPAPQCRFDPASIE
:::::.:::::::::::::::::::.:: : :: ::. :
CCDS45 QAQVATGGGPGAPPGSQPDYSAAWAEYYRQQAAYYGQTPGPGGPQPPPTQQGQQQAQ
660 670 680 690 700 710
pF1KB7 LAL
>>CCDS43893.1 FUBP3 gene_id:8939|Hs108|chr9 (572 aa)
initn: 2020 init1: 948 opt: 1929 Z-score: 849.6 bits: 167.3 E(32554): 5.3e-41
Smith-Waterman score: 2088; 55.2% identity (71.9% similar) in 623 aa overlap (25-631:13-564)
10 20 30 40 50
pF1KB7 MADYSTVPPPSSGSAGGGGGGGGGGGVNDAFKDALQRARQIAAKIGGDAGTSLNSND---
: ..: :::.:.::::::: :. ::..
CCDS43 MAELVQGQSAPVGMKAEGFVDALHRVRQIAAKI--DSIPHLNNSTPLV
10 20 30 40
60 70 80 90 100 110
pF1KB7 ----YGYGGQKRPLEDGDQPDAKKVAPQNDSFGTQLPPMHQQQRSVMTEEYKVPDGMVGF
:::: :::::.:: :.:: . .::.:.:::.:::: ::::
CCDS43 DPSVYGYGVQKRPLDDG--------------VGNQLGAL-VHQRTVITEEFKVPDKMVGF
50 60 70 80 90
120 130 140 150 160 170
pF1KB7 IIGRGGEQISRIQQESGCKIQIAPDSGGLPERSCMLTGTPESVQSAKRLLDQIVEKGRPA
::::::::::::: ::::::::: .:.:.::: :.:::::::...::::: :::.. : .
CCDS43 IIGRGGEQISRIQAESGCKIQIASESSGIPERPCVLTGTPESIEQAKRLLGQIVDRCRNG
100 110 120 130 140 150
180 190 200 210 220 230
pF1KB7 PGFHHGDGPGNAVQEIMIPASKAGLVIGKGGETIKQLQERAGVKMVMIQDGPQNTGADKP
::::. ....:::.:::::.:::::.:::::::::::.::::::::::: ::::::
CCDS43 PGFHNDIDSNSTIQEILIPASKVGLVIGRGGETIKQLQERTGVKMVMIQDGPLPTGADKP
160 170 180 190 200 210
240 250 260 270 280 290
pF1KB7 LRITGDPYKVQQAKEMVLELIR--DQGGFREVRNEYGSRIGGNEGIDVPIPRFAVGIVIG
:::::: .:::::.:::::.:: ::. :: ::....::.::. .:.: .::::::::::
CCDS43 LRITGDAFKVQQAREMVLEIIREKDQADFRGVRGDFNSRMGGG-SIEVSVPRFAVGIVIG
220 230 240 250 260 270
300 310 320 330 340 350
pF1KB7 RNGEMIKKIQNDAGVRIQFKPDDGTTPERIAQITGPPDRCQHAAEIITDLLRSVQAGNPG
:::::::::::::::::::::::: .::: ::. ::::::::::.::..:. ..: .
CCDS43 RNGEMIKKIQNDAGVRIQFKPDDGISPERAAQVMGPPDRCQHAAHIISELILTAQERDGF
280 290 300 310 320 330
360 370 380 390 400 410
pF1KB7 GPGPGGRGRGRGQGNWNMGPPGGLQEFNFIVPTGKTGLIIGKGGETIKSISQQSGARIEL
: ..::::::.:.:..: :::.::... ::. : ::.::::::.::::.:::::..::
CCDS43 GGLAAARGRGRGRGDWSVGAPGGVQEITYTVPADKCGLVIGKGGENIKSINQQSGAHVEL
340 350 360 370 380 390
420 430 440 450 460 470
pF1KB7 QRNPPPNADPNMKLFTIRGTPQQIDYARQLIEEKIGGPVNPLGPPVPHGPHGVPGPHGPP
:::::::.:::.. :::::.::::. :::::.::.:: . :: : : : ::
CCDS43 QRNPPPNSDPNLRRFTIRGVPQQIEVARQLIDEKVGG--TNLGAP---GAFGQSPFSQPP
400 410 420 430 440
480 490 500 510 520
pF1KB7 GPPGPGT--PMG----PYNPAPYNPGPPGPAPHGPPAPYAPQGWGNAYPHWQQQAPPDPA
.:: .: : . : : : .: . :::: . ::::..: ::: . :.
CCDS43 APPHQNTFPPRSSGCFPNMAAKVNGNPHSTPVSGPPA-FLTQGWGSTYQAWQQPTQQVPS
450 460 470 480 490 500
530 540 550 560 570 580
pF1KB7 KAGTDPNSAAWAAYYAHYYQQQAQPPPAAPAGAPTTTQTNGQGDQQNPAPAGQVDYTKAW
::.:: . : .:.:::
CCDS43 --------------------QQSQPQSSQP------------------------NYSKAW
510 520
590 600 610 620 630 640
pF1KB7 EEYYKKMGQAVPA-PTGAPPGGQPDYSAAWAEYYRQQAAYYAQTSPQGMPQHPPAPQCRF
:.::::...:. : : .. : :::. :::::::::.:.:.:: :
CCDS43 EDYYKKQSHAASAAPQASSP---PDYTMAWAEYYRQQVAFYGQTLGQAQAHSQEQ
530 540 550 560 570
650
pF1KB7 DPASIELAL
653 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 15:51:33 2016 done: Mon Nov 7 15:51:34 2016
Total Scan time: 3.540 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]