FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9976, 707 aa
1>>>pF1KB9976 707 - 707 aa - 707 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 17.2750+/-0.00136; mu= -28.0967+/- 0.083
mean_var=968.6740+/-198.181, 0's: 0 Z-trim(119.2): 129 B-trim: 57 in 2/53
Lambda= 0.041208
statistics sampled from 20185 (20311) to 20185 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.825), E-opt: 0.2 (0.624), width: 16
Scan time: 5.330
The best scores are: opt bits E(32554)
CCDS388.1 SFPQ gene_id:6421|Hs108|chr1 ( 707) 5111 319.2 1.3e-86
CCDS41870.1 PSPC1 gene_id:55269|Hs108|chr13 ( 523) 1783 121.3 3.8e-27
CCDS14410.1 NONO gene_id:4841|Hs108|chrX ( 471) 1716 117.2 5.6e-26
CCDS55445.1 NONO gene_id:4841|Hs108|chrX ( 382) 1463 102.1 1.6e-21
>>CCDS388.1 SFPQ gene_id:6421|Hs108|chr1 (707 aa)
initn: 5111 init1: 5111 opt: 5111 Z-score: 1668.5 bits: 319.2 E(32554): 1.3e-86
Smith-Waterman score: 5111; 100.0% identity (100.0% similar) in 707 aa overlap (1-707:1-707)
10 20 30 40 50 60
pF1KB9 MSRDRFRSRGGGGGGFHRRGGGGGRGGLHDFRSPPPGMGLNQNRGPMGPGPGQSGPKPPI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 MSRDRFRSRGGGGGGFHRRGGGGGRGGLHDFRSPPPGMGLNQNRGPMGPGPGQSGPKPPI
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 PPPPPHQQQQQPPPQQPPPQQPPPHQPPPHPQPHQQQQPPPPPQDSSKPVVAQGPGPAPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 PPPPPHQQQQQPPPQQPPPQQPPPHQPPPHPQPHQQQQPPPPPQDSSKPVVAQGPGPAPG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 VGSAPPASSSAPPATPPTSGAPPGSGPGPTPTPPPAVTSAPPGAPPPTPPSSGVPTTPPQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 VGSAPPASSSAPPATPPTSGAPPGSGPGPTPTPPPAVTSAPPGAPPPTPPSSGVPTTPPQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 AGGPPPPPAAVPGPGPGPKQGPGPGGPKGGKMPGGPKPGGGPGLSTPGGHPKPPHRGGGE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 AGGPPPPPAAVPGPGPGPKQGPGPGGPKGGKMPGGPKPGGGPGLSTPGGHPKPPHRGGGE
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 PRGGRQHHPPYHQQHHQGPPPGGPGGRSEEKISDSEGFKANLSLLRRPGEKTYTQRCRLF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 PRGGRQHHPPYHQQHHQGPPPGGPGGRSEEKISDSEGFKANLSLLRRPGEKTYTQRCRLF
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB9 VGNLPADITEDEFKRLFAKYGEPGEVFINKGKGFGFIKLESRALAEIAKAELDDTPMRGR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 VGNLPADITEDEFKRLFAKYGEPGEVFINKGKGFGFIKLESRALAEIAKAELDDTPMRGR
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB9 QLRVRFATHAAALSVRNLSPYVSNELLEEAFSQFGPIERAVVIVDDRGRSTGKGIVEFAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 QLRVRFATHAAALSVRNLSPYVSNELLEEAFSQFGPIERAVVIVDDRGRSTGKGIVEFAS
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB9 KPAARKAFERCSEGVFLLTTTPRPVIVEPLEQLDDEDGLPEKLAQKNPMYQKERETPPRF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 KPAARKAFERCSEGVFLLTTTPRPVIVEPLEQLDDEDGLPEKLAQKNPMYQKERETPPRF
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB9 AQHGTFEYEYSQRWKSLDEMEKQQREQVEKNMKDAKDKLESEMEDAYHEHQANLLRQDLM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 AQHGTFEYEYSQRWKSLDEMEKQQREQVEKNMKDAKDKLESEMEDAYHEHQANLLRQDLM
490 500 510 520 530 540
550 560 570 580 590 600
pF1KB9 RRQEELRRMEELHNQEMQKRKEMQLRQEEERRRREEEMMIRQREMEEQMRRQREESYSRM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 RRQEELRRMEELHNQEMQKRKEMQLRQEEERRRREEEMMIRQREMEEQMRRQREESYSRM
550 560 570 580 590 600
610 620 630 640 650 660
pF1KB9 GYMDPRERDMRMGGGGAMNMGDPYGSGGQKFPPLGGGGGIGYEANPGVPPATMSGSMMGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 GYMDPRERDMRMGGGGAMNMGDPYGSGGQKFPPLGGGGGIGYEANPGVPPATMSGSMMGS
610 620 630 640 650 660
670 680 690 700
pF1KB9 DMRTERFGQGGAGPVGGQGPRGMGPGTPAGYGRGREEYEGPNKKPRF
:::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 DMRTERFGQGGAGPVGGQGPRGMGPGTPAGYGRGREEYEGPNKKPRF
670 680 690 700
>>CCDS41870.1 PSPC1 gene_id:55269|Hs108|chr13 (523 aa)
initn: 1710 init1: 1586 opt: 1783 Z-score: 600.8 bits: 121.3 E(32554): 3.8e-27
Smith-Waterman score: 1836; 60.5% identity (81.0% similar) in 484 aa overlap (259-707:45-523)
230 240 250 260 270 280
pF1KB9 GHPKPPHRGGGEPRGGRQHHPPYHQQHHQGPPPGGPGGRSEEKISDSEGFKANLSLLRRP
: : .:. :.. .. :: ... . .:
CCDS41 NPARLRALESAVGESEPAAAAAMALALAGEPAPPAPAP-PEDHPDEEMGFTIDIKSFLKP
20 30 40 50 60 70
290 300 310 320 330 340
pF1KB9 GEKTYTQRCRLFVGNLPADITEDEFKRLFAKYGEPGEVFINKGKGFGFIKLESRALAEIA
:::::::::::::::::.::::..::::: .::::.:::::. .:::::.::::.:::::
CCDS41 GEKTYTQRCRLFVGNLPTDITEEDFKRLFERYGEPSEVFINRDRGFGFIRLESRTLAEIA
80 90 100 110 120 130
350 360 370 380 390 400
pF1KB9 KAELDDTPMRGRQLRVRFATHAAALSVRNLSPYVSNELLEEAFSQFGPIERAVVIVDDRG
::::: : ...: ::.:::::.:::.:.:::: :::::::.:::::::.:.:::.:::::
CCDS41 KAELDGTILKSRPLRIRFATHGAALTVKNLSPVVSNELLEQAFSQFGPVEKAVVVVDDRG
140 150 160 170 180 190
410 420 430 440 450 460
pF1KB9 RSTGKGIVEFASKPAARKAFERCSEGVFLLTTTPRPVIVEPLEQLDDEDGLPEKLAQKNP
:.::::.::::.:: ::::.:::..:.::::::::::::::.::.:::::::::: ::.
CCDS41 RATGKGFVEFAAKPPARKALERCGDGAFLLTTTPRPVIVEPMEQFDDEDGLPEKLMQKTQ
200 210 220 230 240 250
470 480 490 500 510 520
pF1KB9 MYQKERETPPRFAQHGTFEYEYSQRWKSLDEMEKQQREQVEKNMKDAKDKLESEMEDAYH
.:.:::: :::::: ::::.::..:::.::::::::::::..:...::.:::.::: : :
CCDS41 QYHKEREQPPRFAQPGTFEFEYASRWKALDEMEKQQREQVDRNIREAKEKLEAEMEAARH
260 270 280 290 300 310
530 540 550 560 570 580
pF1KB9 EHQANLLRQDLMRRQEELRRMEELHNQEMQKRKEMQLRQEEERRRREEEMMIRQREMEEQ
::: :.:::::::::::::.:::.:::.::::..:::.:::.::::::: ::.::.::
CCDS41 EHQLMLMRQDLMRRQEELRRLEELRNQELQKRKQIQLRHEEEHRRREEEM-IRHREQEE-
320 330 340 350 360 370
590 600 610 620 630 640
pF1KB9 MRRQREESYSRMGYMDPRERDMRMGG-G--GAMNMGD---PYGSGGQKFPPLGGGGGIGY
.::: .:.. . .::. ::..:::: : ::.:::: : .:.: ::. : . .
CCDS41 LRRQ-QEGF-KPNYMENREQEMRMGDMGPRGAINMGDAFSPAPAGNQGPPPMMGMNMNNR
380 390 400 410 420
650 660 670
pF1KB9 EANPGVP--PATMSG----SMMGSDM-------RTERFGQGG----AGPVGG-------Q
. :: : :. : . ::. : ...:: :: ..:.:. :
CCDS41 ATIPGPPMGPGPAMGPEGAANMGTPMMPDNGAVHNDRFPQGPPSQMGSPMGSRTGSETPQ
430 440 450 460 470 480
680 690 700
pF1KB9 GP-RGMGP--GTPAGYGRGRE--EYEGPNKKPRF
.: :.:: : :.:.::: . ..:::::. :.
CCDS41 APMSGVGPVSGGPGGFGRGSQGGNFEGPNKRRRY
490 500 510 520
>>CCDS14410.1 NONO gene_id:4841|Hs108|chrX (471 aa)
initn: 1765 init1: 1622 opt: 1716 Z-score: 579.9 bits: 117.2 E(32554): 5.6e-26
Smith-Waterman score: 1778; 60.0% identity (79.9% similar) in 473 aa overlap (241-707:16-471)
220 230 240 250 260
pF1KB9 KMPGGPKPGGGPGLSTPGGHPKPPHRGGGEPRGGRQHH--PPYHQQHHQGPPPGGPGGRS
:: .::: .:::..: ::: . .
CCDS14 MQSNKTFNLEKQNHTPRKHHQHHHQQQHHQQQQQQPPPPPIPANG
10 20 30 40
270 280 290 300 310 320
pF1KB9 EEKISDSEGFKANLSLLRRPGEKTYTQRCRLFVGNLPADITEDEFKRLFAKYGEPGEVFI
.. :..::. .:. .:.:::::.::: :::::::: ::::.:...:: :::. :::::
CCDS14 QQASSQNEGLTIDLKNFRKPGEKTFTQRSRLFVGNLPPDITEEEMRKLFEKYGKAGEVFI
50 60 70 80 90 100
330 340 350 360 370 380
pF1KB9 NKGKGFGFIKLESRALAEIAKAELDDTPMRGRQLRVRFATHAAALSVRNLSPYVSNELLE
.: ::::::.::.:.::::::.:::. :.::.::::::: :.:.:.:::: ::::::::
CCDS14 HKDKGFGFIRLETRTLAEIAKVELDNMPLRGKQLRVRFACHSASLTVRNLPQYVSNELLE
110 120 130 140 150 160
390 400 410 420 430 440
pF1KB9 EAFSQFGPIERAVVIVDDRGRSTGKGIVEFASKPAARKAFERCSEGVFLLTTTPRPVIVE
:::: :: .:::::::::::: .:::::::..:::::::..::::: ::::: :::: ::
CCDS14 EAFSVFGQVERAVVIVDDRGRPSGKGIVEFSGKPAARKALDRCSEGSFLLTTFPRPVTVE
170 180 190 200 210 220
450 460 470 480 490 500
pF1KB9 PLEQLDDEDGLPEKLAQKNPMYQKERETPPRFAQHGTFEYEYSQRWKSLDEMEKQQREQV
:..:::::.::::::. :: ...:::: :::::: :.:::::..:::.: ::::::..::
CCDS14 PMDQLDDEEGLPEKLVIKNQQFHKEREQPPRFAQPGSFEYEYAMRWKALIEMEKQQQDQV
230 240 250 260 270 280
510 520 530 540 550 560
pF1KB9 EKNMKDAKDKLESEMEDAYHEHQANLLRQDLMRRQEELRRMEELHNQEMQKRKEMQLRQE
..:.:.:..::: ::: : ::::. :.:::::::::::::::::::::.::::...::::
CCDS14 DRNIKEAREKLEMEMEAARHEHQVMLMRQDLMRRQEELRRMEELHNQEVQKRKQLELRQE
290 300 310 320 330 340
570 580 590 600 610 620
pF1KB9 EERRRREEEMMIRQREMEEQMRRQREESYSRMGYMDPRERDMRMGGGGAMNMGDPYGSGG
:::::::::: .:..::.:::: .:.. . . : ::...::: : :: .: ..
CCDS14 EERRRREEEM---RRQQEEMMRRQ-QEGF-KGTFPDAREQEIRMG---QMAMGGAMGINN
350 360 370 380 390
630 640 650 660 670 680
pF1KB9 Q-KFPPLGGGGGIGYEANPGVPPATM--SGSMMGSDMRTERFGQGGAGPVGGQGPRGMGP
. .:: . : : :: :::: .:.. . :::::: :. . : : :
CCDS14 RGAMPP--APVPAGTPAPPG--PATMMPDGTLGLTPPTTERFGQ--AATMEGIGAIG---
400 410 420 430 440
690 700
pF1KB9 GTPAGYGRGREEYE-GPNKKPRF
::: ...:. : .:::. :.
CCDS14 GTPPAFNRAAPGAEFAPNKRRRY
450 460 470
>>CCDS55445.1 NONO gene_id:4841|Hs108|chrX (382 aa)
initn: 1574 init1: 1389 opt: 1463 Z-score: 499.8 bits: 102.1 E(32554): 1.6e-21
Smith-Waterman score: 1525; 61.8% identity (80.9% similar) in 398 aa overlap (314-707:2-382)
290 300 310 320 330 340
pF1KB9 LLRRPGEKTYTQRCRLFVGNLPADITEDEFKRLFAKYGEPGEVFINKGKGFGFIKLESRA
..:: :::. :::::.: ::::::.::.:.
CCDS55 MRKLFEKYGKAGEVFIHKDKGFGFIRLETRT
10 20 30
350 360 370 380 390 400
pF1KB9 LAEIAKAELDDTPMRGRQLRVRFATHAAALSVRNLSPYVSNELLEEAFSQFGPIERAVVI
::::::.:::. :.::.::::::: :.:.:.:::: :::::::::::: :: .::::::
CCDS55 LAEIAKVELDNMPLRGKQLRVRFACHSASLTVRNLPQYVSNELLEEAFSVFGQVERAVVI
40 50 60 70 80 90
410 420 430 440 450 460
pF1KB9 VDDRGRSTGKGIVEFASKPAARKAFERCSEGVFLLTTTPRPVIVEPLEQLDDEDGLPEKL
:::::: .:::::::..:::::::..::::: ::::: :::: :::..:::::.::::::
CCDS55 VDDRGRPSGKGIVEFSGKPAARKALDRCSEGSFLLTTFPRPVTVEPMDQLDDEEGLPEKL
100 110 120 130 140 150
470 480 490 500 510 520
pF1KB9 AQKNPMYQKERETPPRFAQHGTFEYEYSQRWKSLDEMEKQQREQVEKNMKDAKDKLESEM
. :: ...:::: :::::: :.:::::..:::.: ::::::..::..:.:.:..::: ::
CCDS55 VIKNQQFHKEREQPPRFAQPGSFEYEYAMRWKALIEMEKQQQDQVDRNIKEAREKLEMEM
160 170 180 190 200 210
530 540 550 560 570 580
pF1KB9 EDAYHEHQANLLRQDLMRRQEELRRMEELHNQEMQKRKEMQLRQEEERRRREEEMMIRQR
: : ::::. :.:::::::::::::::::::::.::::...:::::::::::::: .:
CCDS55 EAARHEHQVMLMRQDLMRRQEELRRMEELHNQEVQKRKQLELRQEEERRRREEEM---RR
220 230 240 250 260
590 600 610 620 630 640
pF1KB9 EMEEQMRRQREESYSRMGYMDPRERDMRMGGGGAMNMGDPYGSGGQ-KFPPLGGGGGIGY
..::.:::: .:.. . . : ::...::: : :: .: ... .:: . :
CCDS55 QQEEMMRRQ-QEGF-KGTFPDAREQEIRMG---QMAMGGAMGINNRGAMPP--APVPAGT
270 280 290 300 310 320
650 660 670 680 690
pF1KB9 EANPGVPPATM--SGSMMGSDMRTERFGQGGAGPVGGQGPRGMGPGTPAGYGRGREEYE-
: :: :::: .:.. . :::::: :. . : : : ::: ...:. :
CCDS55 PAPPG--PATMMPDGTLGLTPPTTERFGQ--AATMEGIGAIG---GTPPAFNRAAPGAEF
330 340 350 360 370
700
pF1KB9 GPNKKPRF
.:::. :.
CCDS55 APNKRRRY
380
707 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 14:38:06 2016 done: Mon Nov 7 14:38:06 2016
Total Scan time: 5.330 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]