FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6691, 318 aa 1>>>pF1KE6691 318 - 318 aa - 318 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.7219+/-0.000983; mu= 13.4872+/- 0.059 mean_var=67.9965+/-13.803, 0's: 0 Z-trim(104.1): 16 B-trim: 0 in 0/49 Lambda= 0.155536 statistics sampled from 7708 (7716) to 7708 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.618), E-opt: 0.2 (0.237), width: 16 Scan time: 1.690 The best scores are: opt bits E(32554) CCDS14602.1 C1GALT1C1 gene_id:29071|Hs108|chrX ( 318) 2168 495.6 2.1e-140 CCDS82442.1 C1GALT1C1L gene_id:728819|Hs108|chr2 ( 315) 1391 321.2 6.5e-88 CCDS5355.1 C1GALT1 gene_id:56913|Hs108|chr7 ( 363) 414 102.0 7.3e-22 >>CCDS14602.1 C1GALT1C1 gene_id:29071|Hs108|chrX (318 aa) initn: 2168 init1: 2168 opt: 2168 Z-score: 2634.0 bits: 495.6 E(32554): 2.1e-140 Smith-Waterman score: 2168; 100.0% identity (100.0% similar) in 318 aa overlap (1-318:1-318) 10 20 30 40 50 60 pF1KE6 MLSESSSFLKGVMLGSIFCALITMLGHIRIGHGNRMHHHEHHHLQAPNKEDILKISEDER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MLSESSSFLKGVMLGSIFCALITMLGHIRIGHGNRMHHHEHHHLQAPNKEDILKISEDER 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 MELSKSFRVYCIILVKPKDVSLWAAVKETWTKHCDKAEFFSSENVKVFESINMDTNDMWL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MELSKSFRVYCIILVKPKDVSLWAAVKETWTKHCDKAEFFSSENVKVFESINMDTNDMWL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 MMRKAYKYAFDKYRDQYNWFFLARPTTFAIIENLKYFLLKKDPSQPFYLGHTIKSGDLEY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MMRKAYKYAFDKYRDQYNWFFLARPTTFAIIENLKYFLLKKDPSQPFYLGHTIKSGDLEY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 VGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGMIWKISEDKQLAVCLKYAGVFAENAEDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 VGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGMIWKISEDKQLAVCLKYAGVFAENAEDA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 DGKDVFNTKSVGLSIKEAMTYHPNQVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 DGKDVFNTKSVGLSIKEAMTYHPNQVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAFG 250 260 270 280 290 300 310 pF1KE6 HIFNDALVFLPPNGSDND :::::::::::::::::: CCDS14 HIFNDALVFLPPNGSDND 310 >>CCDS82442.1 C1GALT1C1L gene_id:728819|Hs108|chr2 (315 aa) initn: 1373 init1: 569 opt: 1391 Z-score: 1691.8 bits: 321.2 E(32554): 6.5e-88 Smith-Waterman score: 1391; 63.9% identity (85.6% similar) in 319 aa overlap (1-318:1-315) 10 20 30 40 50 pF1KE6 MLSES-SSFLKGVMLGSIFCALITMLGHIRIGHGNRMHHHEHHHLQAPNKEDILKISEDE :.: : .::.::..:::: .::::.:.:.: : .. . ::::::. ::..:.:. :. CCDS82 MVSASGTSFFKGMLLGSISWVLITMFGQIHIRHRGQTQDHEHHHLRPPNRNDFLNTSKVI 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 RMELSKSFRVYCIILVKPKDVSLWAAVKETWTKHCDKAEFFSSENVKVFESINMDTNDMW .:::::.::.:::. . .: : ::..::::::::::::.....: ..: :...:: : CCDS82 LLELSKSIRVFCIIFGESEDESYWAVLKETWTKHCDKAELYDTKNDNLF---NIESNDRW 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 LMMRKAYKYAFDKYRDQYNWFFLARPTTFAIIENLKYFLLKKDPSQPFYLGHTIKSGDLE ..:: ::::.:.:: :.::::::: :::::.::::::.:. .: :::::::::. :::: CCDS82 VQMRTAYKYVFEKYGDNYNWFFLALPTTFAVIENLKYLLFTRDASQPFYLGHTVIFGDLE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 YVGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGMIWKISEDKQLAVCLKYAGVFAENAED :: .::::::: : ::::: ::. : : .:. .:::.:::::::.::::::: :::::: CCDS82 YVTVEGGIVLSRELMKRLNRLLDNSETCADQS-VIWKLSEDKQLAICLKYAGVHAENAED 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 ADGKDVFNTKSVGLSIKEAMTYHPNQVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAF .:.:::::: .. :.::.. .:.:::::::::::.:::::::..:.:::::.:::::: CCDS82 YEGRDVFNTKPIAQLIEEALSNNPQQVVEGCCSDMAITFNGLTPQKMEVMMYGLYRLRAF 240 250 260 270 280 290 300 310 pF1KE6 GHIFNDALVFLPPNGSDND :: :::.:::::: ::.:: CCDS82 GHYFNDTLVFLPPVGSEND 300 310 >>CCDS5355.1 C1GALT1 gene_id:56913|Hs108|chr7 (363 aa) initn: 277 init1: 148 opt: 414 Z-score: 506.0 bits: 102.0 E(32554): 7.3e-22 Smith-Waterman score: 429; 26.7% identity (60.6% similar) in 330 aa overlap (7-303:11-333) 10 20 30 40 50 pF1KE6 MLSESSSFLKGVMLGSIFCALI--TMLGHIRIGHGNRMHHHEHHHLQAPNKEDILK .:: : .: ..:. . .::. . : .:. : . . : .. :. CCDS53 MASKSWLNFLTFLCGSAIGFLLCSQLFSILLGEKVDTQPNVLHNDPHARHSDDNGQNHLE 10 20 30 40 50 60 60 70 80 90 pF1KE6 IS----------EDERMELSKSF----RVYCIILVKPKDVSLWAA-VKETWTKHCDKAEF . .:: ...... :. : ... :... : :: ::...:.:. : CCDS53 GQMNFNADSSQHKDENTDIAENLYQKVRILCWVMTGPQNLEKKAKHVKATWAQRCNKVLF 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE6 FSSENVKVFESINMDTND-----MWLMMRKAYKYAFDKYRDQYNWFFLARPTTFAIIENL .:::. : : .... :.. .: . ::..:. ..: .. .::. : :..:..:: CCDS53 MSSEENKDFPAVGLKTKEGRDQLYWKTI-KAFQYVHEHYLEDADWFLKADDDTYVILDNL 130 140 150 160 170 160 170 180 190 200 210 pF1KE6 KYFLLKKDPSQPFYLGHTIKSGDLE-YVGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGM ...: : :: .:.:.:. .: . :.. .: ::: :..::. . .. .:: ..... CCDS53 RWLLSKYDPEEPIYFGRRFKPYVKQGYMSGGAGYVLSKEALKRFVDAFKT-DKCTHSSSI 180 190 200 210 220 230 220 230 240 250 260 pF1KE6 IWKISEDKQLAVCLKYAGVFAENAEDADGKDVFNTKSVGLSIKEAM----------TYHP :: :. :.. .: : ...:. ::..:. . ... .:.: CCDS53 -----EDLALGRCMEIMNVEAGDSRDTIGKETFHPFVPEHHLIKGYLPRTFWYWNYNYYP 240 250 260 270 280 290 270 280 290 300 310 pF1KE6 NQVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAFGHIFNDALVFLPPNGSDND :::::.::.:. . . :. . : ::.:: .:... CCDS53 PVEGPGCCSDLAVSFHYVDSTTMYELEYLVYHLRPYGYLYRYQPTLPERILKEISQANKN 300 310 320 330 340 350 CCDS53 EDTKVKLGNP 360 318 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 15:26:54 2016 done: Tue Nov 8 15:26:55 2016 Total Scan time: 1.690 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]