FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7739, 436 aa
1>>>pF1KB7739 436 - 436 aa - 436 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.9346+/-0.000773; mu= 10.5381+/- 0.047
mean_var=109.9984+/-21.847, 0's: 0 Z-trim(112.0): 15 B-trim: 65 in 1/51
Lambda= 0.122287
statistics sampled from 12869 (12875) to 12869 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.753), E-opt: 0.2 (0.395), width: 16
Scan time: 3.350
The best scores are:                                      opt bits E(32554)
CCDS4950.1 GCM1 gene_id:8521|Hs108|chr6            ( 436) 3090 555.6 3.6e-158
CCDS4517.1 GCM2 gene_id:9247|Hs108|chr6            ( 506)  824 155.8 8.9e-38
>>CCDS4950.1 GCM1 gene_id:8521|Hs108|chr6 (436 aa)
initn: 3090 init1: 3090 opt: 3090 Z-score: 2953.2 bits: 555.6 E(32554): 3.6e-158
Smith-Waterman score: 3090; 100.0% identity (100.0% similar) in 436 aa overlap (1-436:1-436)
               10        20        30        40        50        60
pF1KB7 MEPDDFDSEDKEILSWDINDVKLPQNVKKTDWFQEWPDSYAKHIYSSEDKNAQRHLSSWA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 MEPDDFDSEDKEILSWDINDVKLPQNVKKTDWFQEWPDSYAKHIYSSEDKNAQRHLSSWA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 MRNTNNHNSRILKKSCLGVVVCGRDCLAEEGRKIYLRPAICDKARQKQQRKRCPNCDGPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 MRNTNNHNSRILKKSCLGVVVCGRDCLAEEGRKIYLRPAICDKARQKQQRKRCPNCDGPL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 KLIPCRGHGGFPVTNFWRHDGRFIFFQSKGEHDHPKPETKLEAEARRAMKKVNTAPSSVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 KLIPCRGHGGFPVTNFWRHDGRFIFFQSKGEHDHPKPETKLEAEARRAMKKVNTAPSSVS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 LSLKGSTETRSLPGETQSQGSLPLTWSFQEGVQLPGSYSGHLIANTPQQNSLNDCFSFSK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 LSLKGSTETRSLPGETQSQGSLPLTWSFQEGVQLPGSYSGHLIANTPQQNSLNDCFSFSK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 SYGLGGITDLTDQTSTVDPMKLYEKRKLSSSRTYSSGDLLPPSASGVYSDHGDLQAWSKN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 SYGLGGITDLTDQTSTVDPMKLYEKRKLSSSRTYSSGDLLPPSASGVYSDHGDLQAWSKN
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 AALGRNHLADNCYSNYPFPLTSWPCSFSPSQNSSEPFYQQLPLEPPAAKTGCPPLWPNPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 AALGRNHLADNCYSNYPFPLTSWPCSFSPSQNSSEPFYQQLPLEPPAAKTGCPPLWPNPA
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB7 GNLYEEKVHVDFNSYVQSPAYHSPQEDPFLFTYASHPHQQYSLPSKSSKWDFEEEMTYLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 GNLYEEKVHVDFNSYVQSPAYHSPQEDPFLFTYASHPHQQYSLPSKSSKWDFEEEMTYLG
              370       380       390       400       410       420

              430
pF1KB7 LDHCNNDMLLNLCPLR
       ::::::::::::::::
CCDS49 LDHCNNDMLLNLCPLR
              430
>>CCDS4517.1 GCM2 gene_id:9247|Hs108|chr6 (506 aa)
initn: 824 init1: 797 opt: 824 Z-score: 791.7 bits: 155.8 E(32554): 8.9e-38
Smith-Waterman score: 838; 39.2% identity (61.1% similar) in 388 aa overlap (14-384:19-397)
10 20 30 40 50
pF1KB7 MEPDDFDSEDKEILSWDINDVKLPQNVKKTDWFQEWPDSYAKHIYSSEDKNAQRH
::::::: ..::.. : :.::::.:.. ::::..:.::::
CCDS45 MPAAAVQEAVGVCSYGMQLSWDINDPQMPQELALFDQFREWPDGYVRFIYSSDEKKAQRH
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 LSSWAMRNTNNHNSRILKKSCLGVVVCGRDCLAEEGRKIYLRPAICDKARQKQQRKRCPN
::.::::::::::..:::::::::::: . : .: .. :::::::::: :::.: :::
CCDS45 LSGWAMRNTNNHNGHILKKSCLGVVVCTQACTLPDGSRLQLRPAICDKARLKQQKKACPN
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB7 CDGPLKLIPCRGHGGFPVTNFWRHDGRFIFFQSKGEHDHPKPETKLEAEARRAMKKVNTA
: . :.:::::::.:.::::::: :: ::::.:: ::::.::.: :.::::. : . :
CCDS45 CHSALELIPCRGHSGYPVTNFWRLDGNAIFFQAKGVHDHPRPESKSETEARRSAIKRQMA
130 140 150 160 170 180
180 190 200 210 220 230
pF1KB7 PSSVSLSLKGSTETRSLPGETQSQGSLPLTWSFQEGVQLPGSYSGHLIANTPQQNSLNDC
:. . . : :.... . . :.. : . . ....: . :
CCDS45 ----SFYQPQKKRIR----ESEAEENQDSSGHFSNIPPLENPEDFDIVTETSFPIPGQPC
190 200 210 220 230
240 250 260 270 280
pF1KB7 FSFSKSYGLGGITDL-TDQTSTVDPMKLYEKRKLSSSRTYSSGDLLPPSASG------VY
:: :: . :: : : . . :.. : . .. : : .: :. .. .:
CCDS45 PSFPKSDVYKATCDLATFQGDKMPPFQKYSSPRIYLPRPPCSYELANPGYTNSSPYPTLY
240 250 260 270 280 290
290 300 310 320 330
pF1KB7 SDHGDLQAWSKNAALGRNHLADNCYSNYP--FPLTS----W-PCSFSPS--QNSSEPFYQ
.: .. . . :. . : ::.: : .:. : : .:: . ... .:
CCDS45 KDSTSIPNDTDWVHLNTLQCNVNSYSSYERSFDFTNKQHGWKPALGKPSLVERTNHGQFQ
300 310 320 330 340 350
340 350 360 370 380 390
pF1KB7 QLPLEPP-AAKTGCPPLWPNPAGNLYEEKVHVDFNSYVQSPAYHSPQEDPFLFTYASHPH
. .: . : : : : . : . .. :. ::. :
CCDS45 AMATRPYYNPELPCRYLTTPPPGAPALQTV-ITTTTKVSYQAYQPPAMKYSDSVREVKSL
360 370 380 390 400 410
400 410 420 430
pF1KB7 QQYSLPSKSSKWDFEEEMTYLGLDHCNNDMLLNLCPLR
CCDS45 SSCNYAPEDTGMSVYPEPWGPPVTVTRAASPSGPPPMKIAGDCRAIRPTVAIPHEPVSSR
420 430 440 450 460 470
436 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 05:40:19 2016 done: Sun Nov 6 05:40:20 2016
Total Scan time: 3.350 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]