FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7926, 350 aa
1>>>pF1KB7926 350 - 350 aa - 350 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.9192+/-0.000836; mu= -1.0371+/- 0.051
mean_var=289.5351+/-58.436, 0's: 0 Z-trim(117.6): 10 B-trim: 0 in 0/53
Lambda= 0.075374
statistics sampled from 18368 (18378) to 18368 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.832), E-opt: 0.2 (0.565), width: 16
Scan time: 3.420
The best scores are:                                   opt  bits E(32554)
CCDS31275.1 PCGF6 gene_id:84108|Hs108|chr10    ( 350) 2353 268.2 7.2e-72
CCDS7546.1 PCGF6 gene_id:84108|Hs108|chr10     ( 275) 1264 149.7 2.7e-36
CCDS1946.2 PCGF1 gene_id:84759|Hs108|chr2      ( 259)  524  69.2 4.3e-12
CCDS7413.1 PCGF5 gene_id:84333|Hs108|chr10     ( 256)  485  65.0   8e-11
>>CCDS31275.1 PCGF6 gene_id:84108|Hs108|chr10 (350 aa)
initn: 2353 init1: 2353 opt: 2353 Z-score: 1403.8 bits: 268.2 E(32554): 7.2e-72
Smith-Waterman score: 2353; 100.0% identity (100.0% similar) in 350 aa overlap (1-350:1-350)
10 20 30 40 50 60
pF1KB7 MEGVAVVTAGSVGAAKTEGAAALPPPPPVSPPALTPAPAAGEEGPAPLSETGAPGCSGSR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MEGVAVVTAGSVGAAKTEGAAALPPPPPVSPPALTPAPAAGEEGPAPLSETGAPGCSGSR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 PPELEPERSLGRFRGRFEDEDEELEEEEELEEEEEEEEEDMSHFSLRLEGGRQDSEDEEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 PPELEPERSLGRFRGRFEDEDEELEEEEELEEEEEEEEEDMSHFSLRLEGGRQDSEDEEE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 RLINLSELTPYILCSICKGYLIDATTITECLHTFCKSCIVRHFYYSNRCPKCNIVVHQTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 RLINLSELTPYILCSICKGYLIDATTITECLHTFCKSCIVRHFYYSNRCPKCNIVVHQTQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 PLYNIRLDRQLQDIVYKLVINLEEREKKQMHDFYKERGLEVPKPAVPQPVPSSKGRSKKV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 PLYNIRLDRQLQDIVYKLVINLEEREKKQMHDFYKERGLEVPKPAVPQPVPSSKGRSKKV
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 LESVFRIPPELDMSLLLEFIGANEGTGHFKPLEKKFVRVSGEATIGHVEKFLRRKMGLDP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 LESVFRIPPELDMSLLLEFIGANEGTGHFKPLEKKFVRVSGEATIGHVEKFLRRKMGLDP
250 260 270 280 290 300
310 320 330 340 350
pF1KB7 ACQVDIICGDHLLEQYQTLREIRRAIGDAAMQDGLLVLHYGLVVSPLKIT
::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 ACQVDIICGDHLLEQYQTLREIRRAIGDAAMQDGLLVLHYGLVVSPLKIT
310 320 330 340 350
>>CCDS7546.1 PCGF6 gene_id:84108|Hs108|chr10 (275 aa)
initn: 1264 init1: 1264 opt: 1264 Z-score: 765.2 bits: 149.7 E(32554): 2.7e-36
Smith-Waterman score: 1695; 78.3% identity (78.6% similar) in 350 aa overlap (1-350:1-275)
10 20 30 40 50 60
pF1KB7 MEGVAVVTAGSVGAAKTEGAAALPPPPPVSPPALTPAPAAGEEGPAPLSETGAPGCSGSR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 MEGVAVVTAGSVGAAKTEGAAALPPPPPVSPPALTPAPAAGEEGPAPLSETGAPGCSGSR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 PPELEPERSLGRFRGRFEDEDEELEEEEELEEEEEEEEEDMSHFSLRLEGGRQDSEDEEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 PPELEPERSLGRFRGRFEDEDEELEEEEELEEEEEEEEEDMSHFSLRLEGGRQDSEDEEE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 RLINLSELTPYILCSICKGYLIDATTITECLHTFCKSCIVRHFYYSNRCPKCNIVVHQTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 RLINLSELTPYILCSICKGYLIDATTITECLHTFCKSCIVRHFYYSNRCPKCNIVVHQTQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 PLYNIRLDRQLQDIVYKLVINLEEREKKQMHDFYKERGLEVPKPAVPQPVPSSKGRSKKV
:::::
CCDS75 PLYNI-------------------------------------------------------
250 260 270 280 290 300
pF1KB7 LESVFRIPPELDMSLLLEFIGANEGTGHFKPLEKKFVRVSGEATIGHVEKFLRRKMGLDP
.:::::::::::::::::::::::::::::::::::::::
CCDS75 --------------------SANEGTGHFKPLEKKFVRVSGEATIGHVEKFLRRKMGLDP
190 200 210 220
310 320 330 340 350
pF1KB7 ACQVDIICGDHLLEQYQTLREIRRAIGDAAMQDGLLVLHYGLVVSPLKIT
::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 ACQVDIICGDHLLEQYQTLREIRRAIGDAAMQDGLLVLHYGLVVSPLKIT
230 240 250 260 270
>>CCDS1946.2 PCGF1 gene_id:84759|Hs108|chr2 (259 aa)
initn: 443 init1: 443 opt: 524 Z-score: 330.6 bits: 69.2 E(32554): 4.3e-12
Smith-Waterman score: 524; 38.9% identity (74.4% similar) in 211 aa overlap (117-322:30-236)
90 100 110 120 130 140
pF1KB7 EEELEEEEEEEEEDMSHFSLRLEGGRQDSEDEEERLINLSELTPYILCSICKGYLIDATT
.::: .....:. .:.: .: ::..::::
CCDS19 MASPQGGQIAIAMRLRNQLQSVYKMDPLRNEEEVRVKIKDLNEHIVCCLCAGYFVDATT
10 20 30 40 50
150 160 170 180 190 200
pF1KB7 ITECLHTFCKSCIVRHFYYSNRCPKCNIVVHQTQPLYNIRLDRQLQDIVYKLVINLEERE
::::::::::::::... :. :: ::: .:.:::: :..::: .:::::::: .:.. :
CCDS19 ITECLHTFCKSCIVKYLQTSKYCPMCNIKIHETQPLLNLKLDRVMQDIVYKLVPGLQDSE
60 70 80 90 100 110
210 220 230 240 250 260
pF1KB7 KKQMHDFYKERGLE-VPKPAVPQPVPSSKGRSKKVLES----VFRIPPELDMSLLLEFIG
.:....::. :::. : .:. .:. :. : . .. .: .:. : :: ..
CCDS19 EKRIREFYQSRGLDRVTQPTGEEPALSNLGLPFSSFDHSKAHYYRYDEQLN--LCLERLS
120 130 140 150 160 170
270 280 290 300 310 320
pF1KB7 ANEGTGHFKPLEKKFVRVSGEATIGHVEKFLRRKMGLDPACQVDIICGDHLLEQYQTLRE
... .. . :..:.:: : .: . :... : ... :.: .:... ...: ...:...
CCDS19 SGKDKNK-SVLQNKYVRCSVRAEVRHLRRVLCHRLMLNPQ-HVQLLFDNEVLPDHMTMKQ
180 190 200 210 220 230
330 340 350
pF1KB7 IRRAIGDAAMQDGLLVLHYGLVVSPLKIT
:
CCDS19 IWLSRWFGKPSPLLLQYSVKEKRR
240 250
>>CCDS7413.1 PCGF5 gene_id:84333|Hs108|chr10 (256 aa)
initn: 505 init1: 369 opt: 485 Z-score: 307.8 bits: 65.0 E(32554): 8e-11
Smith-Waterman score: 486; 37.1% identity (62.9% similar) in 240 aa overlap (125-344:9-247)
100 110 120 130 140 150
pF1KB7 EEEEEDMSHFSLRLEGGRQDSEDEEERLINLSELTPYILCSICKGYLIDATTITECLHTF
.....::: : ::::::: ::.:::::::
CCDS74 MATQRKHLVKDFNPYITCYICKGYLIKPTTVTECLHTF
10 20 30
160 170 180 190 200 210
pF1KB7 CKSCIVRHFYYSNRCPKCNIVVHQTQPLYNIRLDRQLQDIVYKLVINLEEREKKQMHDFY
::.:::.:: :: ::.:. ::.:.:: .::: :..:..::: .:.:.: .. .:.
CCDS74 CKTCIVQHFEDSNDCPRCGNQVHETNPLEMLRLDNTLEEIIFKLVPGLREQELERESEFW
40 50 60 70 80 90
220 230 240 250 260
pF1KB7 K-----ERGLEVPKPAVPQPVPSSKGRSKKVLESVFRIPPELDMSLLLEFIGANEGTGHF
: : : . . : .: . .: .. .. : :.. . : ... : .
CCDS74 KKNKPQENGQDDTSKA-DKPKVDEEGDENEDDKDYHRSDPQIAICLDCLRNNGQSGDNVV
100 110 120 130 140 150
270 280 290 300 310 320
pF1KB7 KPLEKKFVRVSGEATIGHVEKFLRRKMGLDPACQVDIICG------DHLLE-QYQTLREI
: : :::.: : ..:.: ..::: :. : . ..:..:. :: .: :.: ..
CCDS74 KGLMKKFIRCSTRVTVGTIKKFLSLKLKLPSSYELDVLCNGEIMGKDHTMEFIYMTRWRL
160 170 180 190 200 210
330 340 350
pF1KB7 R----RAIGDAAMQ----DGLLVLHYGLVVSPLKIT
: : .. .: : :: : : .:.
CCDS74 RGENFRCLNCSASQVCSQDGPLYQSYPMVLQYRPRIDFG
220 230 240 250
350 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 19:49:17 2016 done: Sat Nov 5 19:49:17 2016
Total Scan time: 3.420 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]