FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5801, 371 aa
1>>>pF1KB5801 371 - 371 aa - 371 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.2081+/-0.000774; mu= 14.5380+/- 0.047
mean_var=121.1684+/-23.403, 0's: 0 Z-trim(112.1): 15 B-trim: 38 in 1/52
Lambda= 0.116514
statistics sampled from 12895 (12901) to 12895 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.752), E-opt: 0.2 (0.396), width: 16
Scan time: 2.870
The best scores are: opt bits E(32554)
CCDS2253.1 CDCA7 gene_id:83879|Hs108|chr2 ( 371) 2548 438.9 3.3e-123
CCDS2252.1 CDCA7 gene_id:83879|Hs108|chr2 ( 450) 2246 388.2 7.3e-108
CCDS47558.1 CDCA7L gene_id:55536|Hs108|chr7 ( 408) 739 134.9 1.2e-31
CCDS47559.1 CDCA7L gene_id:55536|Hs108|chr7 ( 420) 739 134.9 1.3e-31
CCDS5374.1 CDCA7L gene_id:55536|Hs108|chr7 ( 454) 739 134.9 1.3e-31
>>CCDS2253.1 CDCA7 gene_id:83879|Hs108|chr2 (371 aa)
initn: 2548 init1: 2548 opt: 2548 Z-score: 2325.4 bits: 438.9 E(32554): 3.3e-123
Smith-Waterman score: 2548; 100.0% identity (100.0% similar) in 371 aa overlap (1-371:1-371)
               10        20        30        40        50        60
pF1KB5 MDARRVPQKDLRVKKNLKKFRYVKLISMETSSSSDDSCDSFASDNFANTRLQSVREGCRT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 MDARRVPQKDLRVKKNLKKFRYVKLISMETSSSSDDSCDSFASDNFANTRLQSVREGCRT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB5 RSQCRHSGPLRVAMKFPARSTRGATNKKAESRQPSENSVTDSNSDSEDESGMNFLEKRAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 RSQCRHSGPLRVAMKFPARSTRGATNKKAESRQPSENSVTDSNSDSEDESGMNFLEKRAL
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB5 NIKQNKAMLAKLMSELESFPGSFRGRHPLPGSDSQSRRPRRRTFPGVASRRNPERRARPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 NIKQNKAMLAKLMSELESFPGSFRGRHPLPGSDSQSRRPRRRTFPGVASRRNPERRARPL
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB5 TRSRSRILGSLDALPMEEEEEEDKYMLVRKRKTVDGYMNEDDLPRSRRSRSSVTLPHIIR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 TRSRSRILGSLDALPMEEEEEEDKYMLVRKRKTVDGYMNEDDLPRSRRSRSSVTLPHIIR
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB5 PVEEITEEELENVCSNSREKIYNRSLGSTCHQCRQKTIDTKTNCRNPDCWGVRGQFCGPC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 PVEEITEEELENVCSNSREKIYNRSLGSTCHQCRQKTIDTKTNCRNPDCWGVRGQFCGPC
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB5 LRNRYGEEVRDALLDPNWHCPPCRGICNCSFCRQRDGRCATGVLVYLAKYHGFGNVHAYL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 LRNRYGEEVRDALLDPNWHCPPCRGICNCSFCRQRDGRCATGVLVYLAKYHGFGNVHAYL
              310       320       330       340       350       360
              370
pF1KB5 KSLKQEFEMQA
       :::::::::::
CCDS22 KSLKQEFEMQA
              370
>>CCDS2252.1 CDCA7 gene_id:83879|Hs108|chr2 (450 aa)
initn: 2233 init1: 2233 opt: 2246 Z-score: 2050.0 bits: 388.2 E(32554): 7.3e-108
Smith-Waterman score: 2246; 93.7% identity (97.4% similar) in 349 aa overlap (24-371:102-450)
10 20 30 40 50
pF1KB5 MDARRVPQKDLRVKKNLKKFRYVKLISMETSSSSDDSCDSFASDNFAN-TRLQ
.: .. ..:.:.: .:. ... . :::
CCDS22 ESFCGFSESEVQDVLDHCGFLQKPRPDVTNELAGIFHADSDDESFCGFSESEIQDGMRLQ
80 90 100 110 120 130
             60        70        80        90       100       110
pF1KB5 SVREGCRTRSQCRHSGPLRVAMKFPARSTRGATNKKAESRQPSENSVTDSNSDSEDESGM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 SVREGCRTRSQCRHSGPLRVAMKFPARSTRGATNKKAESRQPSENSVTDSNSDSEDESGM
             140       150       160       170       180       190
            120       130       140       150       160       170
pF1KB5 NFLEKRALNIKQNKAMLAKLMSELESFPGSFRGRHPLPGSDSQSRRPRRRTFPGVASRRN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 NFLEKRALNIKQNKAMLAKLMSELESFPGSFRGRHPLPGSDSQSRRPRRRTFPGVASRRN
             200       210       220       230       240       250
            180       190       200       210       220       230
pF1KB5 PERRARPLTRSRSRILGSLDALPMEEEEEEDKYMLVRKRKTVDGYMNEDDLPRSRRSRSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 PERRARPLTRSRSRILGSLDALPMEEEEEEDKYMLVRKRKTVDGYMNEDDLPRSRRSRSS
             260       270       280       290       300       310
            240       250       260       270       280       290
pF1KB5 VTLPHIIRPVEEITEEELENVCSNSREKIYNRSLGSTCHQCRQKTIDTKTNCRNPDCWGV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 VTLPHIIRPVEEITEEELENVCSNSREKIYNRSLGSTCHQCRQKTIDTKTNCRNPDCWGV
             320       330       340       350       360       370
            300       310       320       330       340       350
pF1KB5 RGQFCGPCLRNRYGEEVRDALLDPNWHCPPCRGICNCSFCRQRDGRCATGVLVYLAKYHG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 RGQFCGPCLRNRYGEEVRDALLDPNWHCPPCRGICNCSFCRQRDGRCATGVLVYLAKYHG
             380       390       400       410       420       430
            360       370
pF1KB5 FGNVHAYLKSLKQEFEMQA
       :::::::::::::::::::
CCDS22 FGNVHAYLKSLKQEFEMQA
             440       450
>>CCDS47558.1 CDCA7L gene_id:55536|Hs108|chr7 (408 aa)
initn: 937 init1: 728 opt: 739 Z-score: 681.5 bits: 134.9 E(32554): 1.2e-31
Smith-Waterman score: 960; 43.5% identity (66.8% similar) in 391 aa overlap (6-367:33-404)
10 20 30
pF1KB5 MDARRVPQKDLRVKKNLKKFRYVKLISMETSSS-S
::.. : ... .: .. .. . :. :
CCDS47 LATRYQIPKEVADIFNAPSDDEEFVGFRDDVPMETLSSEESCDSFDSLESGKQVVESDLS
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB5 DDSCDSFASDNFANTRLQSVREGCRTRSQCRHSGPLRVAMKFPARSTRGATNKKAESRQ-
::. :..:.. . . ... :.::. : : ::::..::... . .:.. :.:
CCDS47 DDGKASLVSEEEEDEEEDKATPR-RSRSR-RSSIGLRVAFQFPTKKLANKPDKNSSSEQL
70 80 90 100 110 120
100 110 120
pF1KB5 ------------------------PSENSVTDSNSDSEDES--GMNFLEKRALNIKQNKA
:.:...:..::.::: . . : ::..:::.:::
CCDS47 FSSARLQNEKKTILERKKDCRQVIQREDSTSESEDDSRDESQESSDALLKRTMNIKENKA
130 140 150 160 170 180
130 140 150 160 170 180
pF1KB5 MLAKLMSELESFPGSFRGRHPLPGSDSQSRRPRRRTFPGVASRR-NPERRARPLTRSRSR
:::.:..::.:.: : : : : :... :: : .:: :: : ::: .
CCDS47 MLAQLLAELNSMPDFFPVR--TPTSASRKKTVRRAFSEGQITRRMNPTRSARPPEKF---
190 200 210 220 230
190 200 210 220 230 240
pF1KB5 ILGSLDALPMEEEEEEDKYMLVRKRKTVDGYMNEDDLPRSRRSRSSVTLPHIIRPVEEIT
.:. . . . .... :.:::. : : :: : : .::::.::
CCDS47 ---ALENFTVSAAKFAEEFYSFRRRKTIGGKCRE----YRRRHRISS-----FRPVEDIT
240 250 260 270 280
250 260 270 280 290 300
pF1KB5 EEELENVCSNSREKIYNRSLGSTCHQCRQKTIDTKTNCRNPDCWGVRGQFCGPCLRNRYG
::.:::: . :.:::.. ::.:::::::::::::: ::: : ::::::::::::::::
CCDS47 EEDLENVAITVRDKIYDKVLGNTCHQCRQKTIDTKTVCRNQGCCGVRGQFCGPCLRNRYG
290 300 310 320 330 340
310 320 330 340 350 360
pF1KB5 EEVRDALLDPNWHCPPCRGICNCSFCRQRDGRCATGVLVYLAKYHGFGNVHAYLKSLKQE
:.::.:::::.: :::::::::::.::.::::::::.:..:::..:. ::. ::.::..:
CCDS47 EDVRSALLDPDWVCPPCRGICNCSYCRKRDGRCATGILIHLAKFYGYDNVKEYLESLQKE
350 360 370 380 390 400
370
pF1KB5 FEMQA
.
CCDS47 LVEDN
>>CCDS47559.1 CDCA7L gene_id:55536|Hs108|chr7 (420 aa)
initn: 937 init1: 728 opt: 739 Z-score: 681.3 bits: 134.9 E(32554): 1.3e-31
Smith-Waterman score: 957; 45.5% identity (67.4% similar) in 365 aa overlap (31-367:71-416)
10 20 30 40 50 60
pF1KB5 MDARRVPQKDLRVKKNLKKFRYVKLISMETSSSSDDSCDSFASDNFANTRLQSVREGCRT
:. :::. :..:.. . . ... :.
CCDS47 EDTDSETEDFAGFTQSDLNGKTNPEVMVVESDLSDDGKASLVSEEEEDEEEDKATPR-RS
50 60 70 80 90
70 80 90
pF1KB5 RSQCRHSGPLRVAMKFPARSTRGATNKKAESRQ-------------------------PS
::. : : ::::..::... . .:.. :.:
CCDS47 RSR-RSSIGLRVAFQFPTKKLANKPDKNSSSEQLFSSARLQNEKKTILERKKDCRQVIQR
100 110 120 130 140 150
100 110 120 130 140 150
pF1KB5 ENSVTDSNSDSEDES--GMNFLEKRALNIKQNKAMLAKLMSELESFPGSFRGRHPLPGSD
:.:...:..::.::: . . : ::..:::.::::::.:..::.:.: : : : :
CCDS47 EDSTSESEDDSRDESQESSDALLKRTMNIKENKAMLAQLLAELNSMPDFFPVR--TPTSA
160 170 180 190 200 210
160 170 180 190 200 210
pF1KB5 SQSRRPRRRTFPGVASRR-NPERRARPLTRSRSRILGSLDALPMEEEEEEDKYMLVRKRK
:... :: : .:: :: : ::: . .:. . . . .... :.::
CCDS47 SRKKTVRRAFSEGQITRRMNPTRSARPPEKF------ALENFTVSAAKFAEEFYSFRRRK
220 230 240 250 260 270
220 230 240 250 260 270
pF1KB5 TVDGYMNEDDLPRSRRSRSSVTLPHIIRPVEEITEEELENVCSNSREKIYNRSLGSTCHQ
:. : : :: : : .::::.::::.:::: . :.:::.. ::.::::
CCDS47 TIGGKCRE----YRRRHRISS-----FRPVEDITEEDLENVAITVRDKIYDKVLGNTCHQ
280 290 300 310 320
280 290 300 310 320 330
pF1KB5 CRQKTIDTKTNCRNPDCWGVRGQFCGPCLRNRYGEEVRDALLDPNWHCPPCRGICNCSFC
:::::::::: ::: : :::::::::::::::::.::.:::::.: :::::::::::.:
CCDS47 CRQKTIDTKTVCRNQGCCGVRGQFCGPCLRNRYGEDVRSALLDPDWVCPPCRGICNCSYC
330 340 350 360 370 380
340 350 360 370
pF1KB5 RQRDGRCATGVLVYLAKYHGFGNVHAYLKSLKQEFEMQA
:.::::::::.:..:::..:. ::. ::.::..:.
CCDS47 RKRDGRCATGILIHLAKFYGYDNVKEYLESLQKELVEDN
390 400 410 420
>>CCDS5374.1 CDCA7L gene_id:55536|Hs108|chr7 (454 aa)
initn: 937 init1: 728 opt: 739 Z-score: 680.9 bits: 134.9 E(32554): 1.3e-31
Smith-Waterman score: 957; 45.5% identity (67.4% similar) in 365 aa overlap (31-367:105-450)
10 20 30 40 50 60
pF1KB5 MDARRVPQKDLRVKKNLKKFRYVKLISMETSSSSDDSCDSFASDNFANTRLQSVREGCRT
:. :::. :..:.. . . ... :.
CCDS53 EDTDSETEDFAGFTQSDLNGKTNPEVMVVESDLSDDGKASLVSEEEEDEEEDKATPR-RS
80 90 100 110 120 130
70 80 90
pF1KB5 RSQCRHSGPLRVAMKFPARSTRGATNKKAESRQ-------------------------PS
::. : : ::::..::... . .:.. :.:
CCDS53 RSR-RSSIGLRVAFQFPTKKLANKPDKNSSSEQLFSSARLQNEKKTILERKKDCRQVIQR
140 150 160 170 180 190
100 110 120 130 140 150
pF1KB5 ENSVTDSNSDSEDES--GMNFLEKRALNIKQNKAMLAKLMSELESFPGSFRGRHPLPGSD
:.:...:..::.::: . . : ::..:::.::::::.:..::.:.: : : : :
CCDS53 EDSTSESEDDSRDESQESSDALLKRTMNIKENKAMLAQLLAELNSMPDFFPVR--TPTSA
200 210 220 230 240 250
160 170 180 190 200 210
pF1KB5 SQSRRPRRRTFPGVASRR-NPERRARPLTRSRSRILGSLDALPMEEEEEEDKYMLVRKRK
:... :: : .:: :: : ::: . .:. . . . .... :.::
CCDS53 SRKKTVRRAFSEGQITRRMNPTRSARPPEKF------ALENFTVSAAKFAEEFYSFRRRK
260 270 280 290 300
220 230 240 250 260 270
pF1KB5 TVDGYMNEDDLPRSRRSRSSVTLPHIIRPVEEITEEELENVCSNSREKIYNRSLGSTCHQ
:. : : :: : : .::::.::::.:::: . :.:::.. ::.::::
CCDS53 TIGGKCRE----YRRRHRISS-----FRPVEDITEEDLENVAITVRDKIYDKVLGNTCHQ
310 320 330 340 350
280 290 300 310 320 330
pF1KB5 CRQKTIDTKTNCRNPDCWGVRGQFCGPCLRNRYGEEVRDALLDPNWHCPPCRGICNCSFC
:::::::::: ::: : :::::::::::::::::.::.:::::.: :::::::::::.:
CCDS53 CRQKTIDTKTVCRNQGCCGVRGQFCGPCLRNRYGEDVRSALLDPDWVCPPCRGICNCSYC
360 370 380 390 400 410
340 350 360 370
pF1KB5 RQRDGRCATGVLVYLAKYHGFGNVHAYLKSLKQEFEMQA
:.::::::::.:..:::..:. ::. ::.::..:.
CCDS53 RKRDGRCATGILIHLAKFYGYDNVKEYLESLQKELVEDN
420 430 440 450
371 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 14:55:37 2016 done: Sat Nov 5 14:55:38 2016
Total Scan time: 2.870 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]