FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6691, 318 aa
1>>>pF1KE6691 318 - 318 aa - 318 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.7219+/-0.000983; mu= 13.4872+/- 0.059
mean_var=67.9965+/-13.803, 0's: 0 Z-trim(104.1): 16 B-trim: 0 in 0/49
Lambda= 0.155536
statistics sampled from 7708 (7716) to 7708 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.618), E-opt: 0.2 (0.237), width: 16
Scan time: 1.690
The best scores are:                                       opt bits E(32554)
CCDS14602.1 C1GALT1C1 gene_id:29071|Hs108|chrX     ( 318) 2168 495.6 2.1e-140
CCDS82442.1 C1GALT1C1L gene_id:728819|Hs108|chr2   ( 315) 1391 321.2 6.5e-88
CCDS5355.1 C1GALT1 gene_id:56913|Hs108|chr7        ( 363)  414 102.0 7.3e-22
>>CCDS14602.1 C1GALT1C1 gene_id:29071|Hs108|chrX (318 aa)
initn: 2168 init1: 2168 opt: 2168 Z-score: 2634.0 bits: 495.6 E(32554): 2.1e-140
Smith-Waterman score: 2168; 100.0% identity (100.0% similar) in 318 aa overlap (1-318:1-318)
               10        20        30        40        50        60
pF1KE6 MLSESSSFLKGVMLGSIFCALITMLGHIRIGHGNRMHHHEHHHLQAPNKEDILKISEDER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MLSESSSFLKGVMLGSIFCALITMLGHIRIGHGNRMHHHEHHHLQAPNKEDILKISEDER
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 MELSKSFRVYCIILVKPKDVSLWAAVKETWTKHCDKAEFFSSENVKVFESINMDTNDMWL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MELSKSFRVYCIILVKPKDVSLWAAVKETWTKHCDKAEFFSSENVKVFESINMDTNDMWL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 MMRKAYKYAFDKYRDQYNWFFLARPTTFAIIENLKYFLLKKDPSQPFYLGHTIKSGDLEY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MMRKAYKYAFDKYRDQYNWFFLARPTTFAIIENLKYFLLKKDPSQPFYLGHTIKSGDLEY
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 VGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGMIWKISEDKQLAVCLKYAGVFAENAEDA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 VGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGMIWKISEDKQLAVCLKYAGVFAENAEDA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 DGKDVFNTKSVGLSIKEAMTYHPNQVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAFG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 DGKDVFNTKSVGLSIKEAMTYHPNQVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAFG
              250       260       270       280       290       300

              310
pF1KE6 HIFNDALVFLPPNGSDND
       ::::::::::::::::::
CCDS14 HIFNDALVFLPPNGSDND
              310

>>CCDS82442.1 C1GALT1C1L gene_id:728819|Hs108|chr2 (315 aa)
initn: 1373 init1: 569 opt: 1391 Z-score: 1691.8 bits: 321.2 E(32554): 6.5e-88
Smith-Waterman score: 1391; 63.9% identity (85.6% similar) in 319 aa overlap (1-318:1-315)
                10        20        30        40        50
pF1KE6 MLSES-SSFLKGVMLGSIFCALITMLGHIRIGHGNRMHHHEHHHLQAPNKEDILKISEDE
       :.: : .::.::..::::  .::::.:.:.: : .. . ::::::. ::..:.:. :.
CCDS82 MVSASGTSFFKGMLLGSISWVLITMFGQIHIRHRGQTQDHEHHHLRPPNRNDFLNTSKVI
               10        20        30        40        50        60

      60        70        80        90       100       110
pF1KE6 RMELSKSFRVYCIILVKPKDVSLWAAVKETWTKHCDKAEFFSSENVKVFESINMDTNDMW
        .:::::.::.:::. . .: : ::..::::::::::::.....: ..:   :...:: :
CCDS82 LLELSKSIRVFCIIFGESEDESYWAVLKETWTKHCDKAELYDTKNDNLF---NIESNDRW
               70        80        90       100          110

     120       130       140       150       160       170
pF1KE6 LMMRKAYKYAFDKYRDQYNWFFLARPTTFAIIENLKYFLLKKDPSQPFYLGHTIKSGDLE
       ..:: ::::.:.:: :.::::::: :::::.::::::.:. .: :::::::::.  ::::
CCDS82 VQMRTAYKYVFEKYGDNYNWFFLALPTTFAVIENLKYLLFTRDASQPFYLGHTVIFGDLE
       120       130       140       150       160       170

     180       190       200       210       220       230
pF1KE6 YVGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGMIWKISEDKQLAVCLKYAGVFAENAED
       :: .::::::: : ::::: ::.  : : .:. .:::.:::::::.::::::: ::::::
CCDS82 YVTVEGGIVLSRELMKRLNRLLDNSETCADQS-VIWKLSEDKQLAICLKYAGVHAENAED
       180       190       200        210       220       230

     240       250       260       270       280       290
pF1KE6 ADGKDVFNTKSVGLSIKEAMTYHPNQVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAF
        .:.:::::: ..  :.::.. .:.:::::::::::.:::::::..:.:::::.::::::
CCDS82 YEGRDVFNTKPIAQLIEEALSNNPQQVVEGCCSDMAITFNGLTPQKMEVMMYGLYRLRAF
        240       250       260       270       280       290

     300       310
pF1KE6 GHIFNDALVFLPPNGSDND
       :: :::.:::::: ::.::
CCDS82 GHYFNDTLVFLPPVGSEND
        300       310

>>CCDS5355.1 C1GALT1 gene_id:56913|Hs108|chr7 (363 aa)
initn: 277 init1: 148 opt: 414 Z-score: 506.0 bits: 102.0 E(32554): 7.3e-22
Smith-Waterman score: 429; 26.7% identity (60.6% similar) in 330 aa overlap (7-303:11-333)
10 20 30 40 50
pF1KE6 MLSESSSFLKGVMLGSIFCALI--TMLGHIRIGHGNRMHHHEHHHLQAPNKEDILK
.:: : .: ..:. . .::. . : .:. : . . : .. :.
CCDS53 MASKSWLNFLTFLCGSAIGFLLCSQLFSILLGEKVDTQPNVLHNDPHARHSDDNGQNHLE
10 20 30 40 50 60
60 70 80 90
pF1KE6 IS----------EDERMELSKSF----RVYCIILVKPKDVSLWAA-VKETWTKHCDKAEF
. .:: ...... :. : ... :... : :: ::...:.:. :
CCDS53 GQMNFNADSSQHKDENTDIAENLYQKVRILCWVMTGPQNLEKKAKHVKATWAQRCNKVLF
70 80 90 100 110 120
100 110 120 130 140 150
pF1KE6 FSSENVKVFESINMDTND-----MWLMMRKAYKYAFDKYRDQYNWFFLARPTTFAIIENL
.:::. : : .... :.. .: . ::..:. ..: .. .::. : :..:..::
CCDS53 MSSEENKDFPAVGLKTKEGRDQLYWKTI-KAFQYVHEHYLEDADWFLKADDDTYVILDNL
130 140 150 160 170
160 170 180 190 200 210
pF1KE6 KYFLLKKDPSQPFYLGHTIKSGDLE-YVGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGM
...: : :: .:.:.:. .: . :.. .: ::: :..::. . .. .:: .....
CCDS53 RWLLSKYDPEEPIYFGRRFKPYVKQGYMSGGAGYVLSKEALKRFVDAFKT-DKCTHSSSI
180 190 200 210 220 230
220 230 240 250 260
pF1KE6 IWKISEDKQLAVCLKYAGVFAENAEDADGKDVFNTKSVGLSIKEAM----------TYHP
:: :. :.. .: : ...:. ::..:. . ... .:.:
CCDS53 -----EDLALGRCMEIMNVEAGDSRDTIGKETFHPFVPEHHLIKGYLPRTFWYWNYNYYP
240 250 260 270 280 290
270 280 290 300 310
pF1KE6 NQVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAFGHIFNDALVFLPPNGSDND
:::::.::.:. . . :. . : ::.:: .:...
CCDS53 PVEGPGCCSDLAVSFHYVDSTTMYELEYLVYHLRPYGYLYRYQPTLPERILKEISQANKN
300 310 320 330 340 350
CCDS53 EDTKVKLGNP
360
318 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 15:26:54 2016 done: Tue Nov 8 15:26:55 2016
Total Scan time: 1.690 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]