FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KSDA0140, 422 aa
1>>>pF1KSDA0140 422 - 422 aa - 422 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.7606+/-0.000683; mu= 12.1299+/- 0.042
mean_var=116.0212+/-22.948, 0's: 0 Z-trim(114.3): 10 B-trim: 120 in 1/52
Lambda= 0.119071
statistics sampled from 14904 (14911) to 14904 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.792), E-opt: 0.2 (0.458), width: 16
Scan time: 2.710
The best scores are:                                 opt  bits E(32554)
CCDS7641.1 FAM53B gene_id:9679|Hs108|chr10   ( 422) 2970 520.5 1.2e-147
CCDS4204.1 FAM53C gene_id:51307|Hs108|chr5   ( 392)  432  84.5    2e-16
CCDS75091.1 FAM53A gene_id:152877|Hs108|chr4 ( 360)  323  65.7    8e-11
CCDS33939.1 FAM53A gene_id:152877|Hs108|chr4 ( 398)  323  65.8  8.7e-11
>>CCDS7641.1 FAM53B gene_id:9679|Hs108|chr10 (422 aa)
initn: 2970 init1: 2970 opt: 2970 Z-score: 2764.2 bits: 520.5 E(32554): 1.2e-147
Smith-Waterman score: 2970; 99.8% identity (100.0% similar) in 422 aa overlap (1-422:1-422)
               10        20        30        40        50        60
pF1KSD MVMVLSESLSTRGADSIACGTFSRELHTPKKMSQGPTLFSCGIMENDRWRDLDRKCPLQI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 MVMVLSESLSTRGADSIACGTFSRELHTPKKMSQGPTLFSCGIMENDRWRDLDRKCPLQI
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KSD DQPSTSIWECLPEKDSSLWHREAVTACAVTSLIKDLSISDHNGNPSAPPSKRQCRSLSFS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 DQPSTSIWECLPEKDSSLWHREAVTACAVTSLIKDLSISDHNGNPSAPPSKRQCRSLSFS
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KSD DEMSSCRTSWRPLGSKVWTPVEKRRCYSGGSVQRYSNGFSTMQRSSSFSLPSRANVLSSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 DEMSSCRTSWRPLGSKVWTPVEKRRCYSGGSVQRYSNGFSTMQRSSSFSLPSRANVLSSP
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KSD CDQAGLHHRFGGQPCQGVPGSAPCGQAGDTWSPDLHPVGGGRLDLQRSLSCSHEQFSFVE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 CDQAGLHHRFGGQPCQGVPGSAPCGQAGDTWSPDLHPVGGGRLDLQRSLSCSHEQFSFVE
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KSD YCPPSANSTPASTPELARRSSGLSRSRSQPCVLNDKKVGVKRRRPEEVQEQRPSLDLAKM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 YCPPSANSTPASTPELARRSSGLSRSRSQPCVLNDKKVGVKRRRPEEVQEQRPSLDLAKM
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KSD AQNCQTFSSLSCLSAGTEDCGPQSPFARHVSNTRAWTALLSASGPGGRTPAGTPVPEPLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 AQNCQTFSSLSCLSAGTEDCGPQSPFARHVSNTRAWTALLSASGPGGRTPAGTPVPEPLP
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KSD PSFDDHLVCQEDLSCEESDSCALDEDCGRRAEPAAAWRDRGAPGNSLCSLDGELDIEQIE
       :::::::.::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 PSFDDHLACQEDLSCEESDSCALDEDCGRRAEPAAAWRDRGAPGNSLCSLDGELDIEQIE
              370       380       390       400       410       420
pF1KSD KN
       ::
CCDS76 KN
>>CCDS4204.1 FAM53C gene_id:51307|Hs108|chr5 (392 aa)
initn: 391 init1: 179 opt: 432 Z-score: 408.4 bits: 84.5 E(32554): 2e-16
Smith-Waterman score: 443; 30.4% identity (50.4% similar) in 450 aa overlap (1-422:1-392)
10 20 30 40 50
pF1KSD MVMVLSESLSTRGADSIACGTFSRELHTPKKMSQGPTLFSCG-----IMENDRWRDLDR-
:. ...:.:. . : . : :: : : . . . .:: . :. :: : .
CCDS42 MITLITEQLQKQTLDELKCTRFSISLPLPDHAD----ISNCGNSFQLVSEGASWRGLPHC
10 20 30 40 50
60 70 80 90 100
pF1KSD KCPLQID------QPS-TSIWECLPEKDSSLWHREAVTACAVTSLIKDLSISDHNGNPSA
.: : .:: :. : . .: .: . .. : .. : :
CCDS42 SCAEFQDSLNFSYHPSGLSLHLRPPSRGNS--PKEQPFSQVLRPEPPD---PEKLPVPPA
60 70 80 90 100 110
110 120 130 140 150 160
pF1KSD PPSKRQCRSLSFSDEMSSCRTSWRPLGSKVWTPVEKRRCYSGGSVQRYSNGFSTMQRSSS
:::::.::::: ..: . ::: ::.:::...: .::. : . : .: ::
CCDS42 PPSKRHCRSLSVPVDLSRWQPVWRPAPSKLWTPIKHRGSGGGGGPQVPHQ--SPPKRVSS
120 130 140 150 160
170 180 190 200 210
pF1KSD FSLPSRANVLSSPCDQAGLHHRFGGQPCQGVP----GSAPCG---QAGDTWSPD---LHP
. . .: :: : : :: . : .. .: ::. :.: .: : : :
CCDS42 LRF-LQAPSASSQCAPA---HRPYSPPFFSLALAQDSSRPCAASPQSG-SWESDAESLSP
170 180 190 200 210 220
220 230 240 250 260 270
pF1KSD VGGGR-LDLQRSLSCSHEQFSFVEYCPPSANSTPASTPELARRSSGLS---RSRSQPCVL
: ..:. ::. . .: ::: :.:::.::: : :: ::::::: :
CCDS42 CPPQRRFSLSPSLGPQASRFL------PSARSSPASSPELPWRPRGLRNLPRSRSQPCDL
230 240 250 260 270
280 290 300 310 320 330
pF1KSD NDKKVGVKRRRPEEVQEQRPSLDLAKMAQNCQTFSSLSCLSAGTEDCGPQSPFARHVSNT
. .:.:::::. :. .. :::::. :: : . .:. ::. ... . ::
CCDS42 DARKTGVKRRHEEDPRRLRPSLDFDKMNQ--KPYSGGLCLQETAREGSSISP--------
280 290 300 310 320
340 350 360 370 380 390
pF1KSD RAWTALLSASGPGGRTPAGTPVPEPLPPSFDDHLVCQEDLSCEESDSCALDEDCGRRAEP
: ... : : :: : :. . . : . .:. :
CCDS42 -PW--FMACS------------PPPLSAS------CSPTGGSSQVLSESEEEEEG-----
330 340 350 360
400 410 420
pF1KSD AAAWRDRGAPGNSLCSLD-GELDIEQIEKN
:. : .. .::. : :.::.. ::.:
CCDS42 AVRWGRQALSKRTLCQRDFGDLDLNLIEEN
370 380 390
>>CCDS75091.1 FAM53A gene_id:152877|Hs108|chr4 (360 aa)
initn: 426 init1: 255 opt: 323 Z-score: 307.8 bits: 65.7 E(32554): 8e-11
Smith-Waterman score: 494; 29.8% identity (57.7% similar) in 359 aa overlap (1-345:1-344)
10 20 30 40 50
pF1KSD MVMVLSESLSTRGADSIACGTFSREL-HTPKKMSQGPTLFSCGIMENDRWRDLDRKCPLQ
:: ...:.:.... :...: . . : .. . .... :: . ... :. .. :..
CCDS75 MVTLITEKLQSQSLDDLTCKAEAGPLQYSAETLNKSGRLFPLELNDQSPWKVFSGGPPVR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KSD IDQPSTSIWECLPEKDSSL------WHREAVTACAVTSLIKDLSISDHNGNPSAPPSKRQ
. . . :: ... :. .. : . . .. :. .:. .:::.::.
CCDS75 SQAATGPDFSFLPGLSAAAHTMGLQWQPQSPRPGAGLGAASTVDPSESTGSSTAPPTKRH
70 80 90 100 110 120
120 130 140 150 160 170
pF1KSD CRSLSFSDEMSSCRTSWRPLGSKVWTPVEKRRCYSGGSVQRYSNGFSTMQRSSSFSLPSR
::::: .:. ::. ::: .::::::: :::: ::::. : .. ... ::. .:
CCDS75 CRSLSEPEELVRCRSPWRPGSSKVWTPVSKRRCDSGGSATRQGSPGAVLPRSAVWSTGPT
130 140 150 160 170 180
180 190 200 210 220 230
pF1KSD ANVLSSPCDQAGLHHRFGGQPCQGVPGSAPCGQAGDTWSPDLHPVGGGRLDLQRSLSCSH
. . : . .: : . .: ::.: .... :. .: : :.
CCDS75 SPATPRPSSASG---GFVDSS-EGSAGSGPLWCSAESCLPST----------RRRPSLSQ
190 200 210 220
240 250 260 270 280 290
pF1KSD EQFSFVEYCPPSANSTPASTPELARRSSGLSRSRSQPCVLNDKKVGVKRRRPEEVQEQRP
:... . : :.:.:.::: :. : :: : :::::::. :. :::: :... ::
CCDS75 ERLAGAGTPLPWASSSPTSTPALGGRR-GLLRCRSQPCVLSGKRSRRKRRREEDARWTRP
230 240 250 260 270 280
300 310 320 330 340
pF1KSD SLDLAKMAQ--NC--QTFSSLSCLSAGTED-CGP--QSPFARHVSNTRAWTALLSASGPG
:::. ::.: .: . : . :... . :: :: . : : . . .:
CCDS75 SLDFLKMTQPHSCARECESRVRGLGVSLQHLSGPSSQSRGSTLNENKTPWFEMEGNLAPE
290 300 310 320 330 340
350 360 370 380 390 400
pF1KSD GRTPAGTPVPEPLPPSFDDHLVCQEDLSCEESDSCALDEDCGRRAEPAAAWRDRGAPGNS
CCDS75 DFKKFKKPLLPQLRR
350 360
>>CCDS33939.1 FAM53A gene_id:152877|Hs108|chr4 (398 aa)
initn: 529 init1: 279 opt: 323 Z-score: 307.1 bits: 65.8 E(32554): 8.7e-11
Smith-Waterman score: 516; 29.2% identity (55.3% similar) in 432 aa overlap (1-422:1-398)
10 20 30 40 50
pF1KSD MVMVLSESLSTRGADSIACGTFSREL-HTPKKMSQGPTLFSCGIMENDRWRDLDRKCPLQ
:: ...:.:.... :...: . . : .. . .... :: . ... :. .. :..
CCDS33 MVTLITEKLQSQSLDDLTCKAEAGPLQYSAETLNKSGRLFPLELNDQSPWKVFSGGPPVR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KSD IDQPSTSIWECLPEKDSSL------WHREAVTACAVTSLIKDLSISDHNGNPSAPPSKRQ
. . . :: ... :. .. : . . .. :. .:. .:::.::.
CCDS33 SQAATGPDFSFLPGLSAAAHTMGLQWQPQSPRPGAGLGAASTVDPSESTGSSTAPPTKRH
70 80 90 100 110 120
120 130 140 150 160 170
pF1KSD CRSLSFSDEMSSCRTSWRPLGSKVWTPVEKRRCYSGGSVQRYSNGFSTMQRSSSFSLPSR
::::: .:. ::. ::: .::::::: :::: ::::. : .. ... ::. .:
CCDS33 CRSLSEPEELVRCRSPWRPGSSKVWTPVSKRRCDSGGSATRQGSPGAVLPRSAVWSTGPT
130 140 150 160 170 180
180 190 200 210 220 230
pF1KSD ANVLSSPCDQAGLHHRFGGQPCQGVPGSAPCGQAGDTWSPDLHPVGGGRLDLQRSLSCSH
. . : . .: : . .: ::.: .... :. .: : :.
CCDS33 SPATPRPSSASG---GFVDSS-EGSAGSGPLWCSAESCLPS----------TRRRPSLSQ
190 200 210 220
240 250 260 270 280 290
pF1KSD EQFSFVEYCPPSANSTPASTPELARRSSGLSRSRSQPCVLNDKKVGVKRRRPEEVQEQRP
:... . : :.:.:.::: :. : :: : :::::::. :. :::: :... ::
CCDS33 ERLAGAGTPLPWASSSPTSTPALGGRR-GLLRCRSQPCVLSGKRSRRKRRREEDARWTRP
230 240 250 260 270 280
300 310 320 330 340 350
pF1KSD SLDLAKMAQNCQTFSSLSCLSAGTEDCGPQSPFARHVSNTRAWTALLSASGPG--GRTPA
:::. ::.:. .. .:: : .: ..: .:. .: . . :: :
CCDS33 SLDFLKMTQTLKNSKSL-CSLNYEDDDEDDTPVKTVLSSPCDSRGLPGITMPGCSQRGLR
290 300 310 320 330 340
360 370 380 390 400 410
pF1KSD GTPVPEPLPPSFDDHLVCQEDLSCEESDSCALDEDCGRRAE-PAAAWRDRGAPGNSLCSL
.:: : : .. : : . : . .. :... : : :
CCDS33 TSPVHPNLWASRES---VTSDGSRRSSGDPRDGDSVGEEGVFPRARW-------------
350 360 370 380
420
pF1KSD DGELDIEQIEKN
:::.::::.:
CCDS33 --ELDLEQIENN
390
422 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 00:12:23 2016 done: Thu Nov 3 00:12:24 2016
Total Scan time: 2.710 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]