FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4466, 488 aa
1>>>pF1KE4466 488 - 488 aa - 488 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.2053+/-0.00089; mu= 14.8637+/- 0.054
mean_var=98.6910+/-19.963, 0's: 0 Z-trim(108.6): 12 B-trim: 53 in 1/51
Lambda= 0.129103
statistics sampled from 10290 (10298) to 10290 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.669), E-opt: 0.2 (0.316), width: 16
Scan time: 3.240
The best scores are: opt bits E(32554)
CCDS6420.1 DGAT1 gene_id:8694|Hs108|chr8 ( 488) 3345 633.5 1.6e-181
CCDS8847.1 SOAT2 gene_id:8435|Hs108|chr12 ( 522) 432 90.9 3.6e-18
CCDS58048.1 SOAT1 gene_id:6646|Hs108|chr1 ( 485) 405 85.9 1.1e-16
CCDS58047.1 SOAT1 gene_id:6646|Hs108|chr1 ( 492) 405 85.9 1.1e-16
CCDS1330.1 SOAT1 gene_id:6646|Hs108|chr1 ( 550) 405 85.9 1.2e-16
>>CCDS6420.1 DGAT1 gene_id:8694|Hs108|chr8 (488 aa)
initn: 3345 init1: 3345 opt: 3345 Z-score: 3372.5 bits: 633.5 E(32554): 1.6e-181
Smith-Waterman score: 3345; 100.0% identity (100.0% similar) in 488 aa overlap (1-488:1-488)
10 20 30 40 50 60
pF1KE4 MGDRGSSRRRRTGSRPSSHGGGGPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 MGDRGSSRRRRTGSRPSSHGGGGPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 SGHWELRCHRLQDSLFSSDSGFSNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 SGHWELRCHRLQDSLFSSDSGFSNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 VSLFLKDPYSWPAPCLVIAANVFAVAAFQVEKRLAVGALTEQAGLLLHVANLATILCFPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 VSLFLKDPYSWPAPCLVIAANVFAVAAFQVEKRLAVGALTEQAGLLLHVANLATILCFPA
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 AVVLLVESITPVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 AVVLLVESITPVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHT
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 VSYPDNLTYRDLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEMLFFTQLQVGLIQQWM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 VSYPDNLTYRDLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEMLFFTQLQVGLIQQWM
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 VPTIQNSMKPFKDMDYSRIIERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 VPTIQNSMKPFKDMDYSRIIERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREF
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 YRDWWNSESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKWMARTGVFLASAFFHEYLVS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 YRDWWNSESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKWMARTGVFLASAFFHEYLVS
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 VPLRMFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 VPLRMFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVLN
430 440 450 460 470 480
pF1KE4 YEAPAAEA
::::::::
CCDS64 YEAPAAEA
>>CCDS8847.1 SOAT2 gene_id:8435|Hs108|chr12 (522 aa)
initn: 306 init1: 213 opt: 432 Z-score: 439.8 bits: 90.9 E(32554): 3.6e-18
Smith-Waterman score: 479; 27.9% identity (54.8% similar) in 451 aa overlap (45-477:79-497)
20 30 40 50 60
pF1KE4 RPSSHGGGGPAAAEEEVRDAAAGPDVGAAGDAPAPAP-----NKDGDAGVGSGHWELRCH
: : : : .. . ..:. . . .
CCDS88 KAQLLEQAQGQLRELLDRAMREAIQSYPSQDKPLPPPPPGSLSRTQEPSLGKQKVFIIRK
50 60 70 80 90 100
70 80 90 100 110 120
pF1KE4 RLQDSLFSSDSGFSNYRGILNWCVVMLILSNARLFLEN---LIKYGILVDPIQVVSLFLK
: : :. . . :. .. :..: . : :... :... .:. . . : :
CCDS88 SLLDELMEVQHFRTIYHMFIAGLCVFIISTLAIDFIDEGRLLLEFDLLIFSFGQLPLAL-
110 120 130 140 150 160
130 140 150 160 170 180
pF1KE4 DPYSWPAPCLVIAANVFAVAAFQVEKRLAVGALTEQAGL-LLHVANLATILC-FPAAVVL
.: .: .. . .: .:. . : :. :. .:: .: :..:: .:. :.
CCDS88 --VTW-VPMFLSTL----LAPYQALRLWARGTWTQATGLGCALLAAHAVVLCALPVHVA-
170 180 190 200 210
190 200 210 220 230 240
pF1KE4 LVESITPVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHTVSYP
:: : .: .:. . . :: . :: :. :.: . : ... . :: ::
CCDS88 -VEHQLPPASRCVLVFEQVRFL-MKSY----SFLREAVPGTLRA-RRGEGIQAPSFSSY-
220 230 240 250 260 270
250 260 270 280 290 300
pF1KE4 DNLTYRDLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEMLFFTQLQVGLIQQWMVPTI
:::: ::: :. ..::.: .: .. . . . : . .. . ::..
CCDS88 --------LYFLFCPTLIYRETYPRTPYVRWNYVAKNFAQALGCVLYACFILGRLCVPVF
280 290 300 310 320
310 320 330 340 350 360
pF1KE4 QN-SMKPFKDMDYSRIIERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREFYRD
: : .::. .. .:. ..:. .. :..:. ..: ::: ::...:::: ::::
CCDS88 ANMSREPFST---RALVLSILHATLPGIFMLLLIFFAFLHCWLNAFAEMLRFGDRMFYRD
330 340 350 360 370 380
370 380 390 400 410 420
pF1KE4 WWNSESVTYFWQNWNIPVHKWCIRHFYKPMLRR-GS-SKWMARTGVFLASAFFHEYLVSV
:::: : . ....::. :: : . :. :: :. .. .: ::::.:: :::.
CCDS88 WWNSTSFSNYYRTWNVVVHDWLYSYVYQDGLRLLGARARGVAMLGVFLVSAVAHEYIFCF
390 400 410 420 430 440
430 440 450 460 470
pF1KE4 PLRMFR-----LWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDY
: .: :. : : : ... : :. .: :..:: : : .: ...
CCDS88 VLGFFYPVMLILFLVIGGM----LNFMMHDQRTGPAWNVLMWTMLFLGQGIQVSLYCQEW
450 460 470 480 490
480
pF1KE4 YVLNYEAPAAEA
:
CCDS88 YARRHCPLPQATFWGLVTPRSWSCHT
500 510 520
>>CCDS58048.1 SOAT1 gene_id:6646|Hs108|chr1 (485 aa)
initn: 358 init1: 231 opt: 405 Z-score: 413.1 bits: 85.9 E(32554): 1.1e-16
Smith-Waterman score: 446; 26.6% identity (56.0% similar) in 448 aa overlap (53-477:45-458)
30 40 50 60 70 80
pF1KE4 GPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVGSGHWELRCHRLQDSLFSSDSGF
:: : .:. . . : : :. :
CCDS58 LIEKSASLDNGGCALTTFSVLEGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVD---
20 30 40 50 60 70
90 100 110 120 130
pF1KE4 SNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQVVSL-FLKDP---YSWPAPCLVI
. : : . ...::: .. . : : :: ....: : : : ..: ..
CCDS58 -HIRTIYHMFIALLILFILSTLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTW----WIM
80 90 100 110 120
140 150 160 170 180 190
pF1KE4 AANVFAVAAFQVEKRLAVGALTEQA--------GLLLHVANLATILCFPAAVVLLVESIT
..:.: : . .. :.: . :.:. . ... .: : . :.:. ..
CCDS58 FLSTFSVPYF-LFQHWATGYSKSSHPLIRSLFHGFLFMIFQIG-VLGFGPTYVVLAYTLP
130 140 150 160 170 180
200 210 220 230 240 250
pF1KE4 PVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHTVSYPDNLTYR
:.. .. :.: .. ..:. :. .. ...:. :: :: : :
CCDS58 PASRFI------IIFEQIRFVMKAHSFVRENVPRVLNSAKEKSS-----TVPIP---TVN
190 200 210 220 230
260 270 280 290 300
pF1KE4 DLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEML--FFTQLQVGLIQQWMVPTIQN-S
. :::::::: :. ..::.: .: .. .. ... :: . ... .: ..: .
CCDS58 QYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCFFYVYYI--FERLCAPLFRNIK
240 250 260 270 280
310 320 330 340 350 360
pF1KE4 MKPFKDMDYSRI-IERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREFYRDWWN
..::. .:. . ... .:. :: .. :. ..: ::: ::...:::: ::.::::
CCDS58 QEPFS----ARVLVLCVFNSILPGVLILFLTFFAFLHCWLNAFAEMLRFGDRMFYKDWWN
290 300 310 320 330 340
370 380 390 400 410 420
pF1KE4 SESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKW--MARTGVFLASAFFHEYLVSVPLR
: : . ....::. :: : . :: .: :... : .:: .:: ::: ..: :
CCDS58 STSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAMLAVFAVSAVVHEYALAVCLS
350 360 370 380 390 400
430 440 450 460 470
pF1KE4 -----MFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVL
.: :. : :: . ..:. . :. .: ::..:. . . .: ...:
CCDS58 FFYPVLFVLFMFFGMA----FNFIVNDSRKKPIWNVLMWTSLFLGNGVLLCFYSQEWYAR
410 420 430 440 450 460
480
pF1KE4 NYEAPAAEA
CCDS58 QHCPLKNPTFLDYVRPRSWTCRYVF
470 480
>>CCDS58047.1 SOAT1 gene_id:6646|Hs108|chr1 (492 aa)
initn: 358 init1: 231 opt: 405 Z-score: 413.0 bits: 85.9 E(32554): 1.1e-16
Smith-Waterman score: 446; 26.6% identity (56.0% similar) in 448 aa overlap (53-477:52-465)
30 40 50 60 70 80
pF1KE4 GPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVGSGHWELRCHRLQDSLFSSDSGF
:: : .:. . . : : :. :
CCDS58 LIEKSASLDNGGCALTTFSVLEGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVD---
30 40 50 60 70
90 100 110 120 130
pF1KE4 SNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQVVSL-FLKDP---YSWPAPCLVI
. : : . ...::: .. . : : :: ....: : : : ..: ..
CCDS58 -HIRTIYHMFIALLILFILSTLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTW----WIM
80 90 100 110 120 130
140 150 160 170 180 190
pF1KE4 AANVFAVAAFQVEKRLAVGALTEQA--------GLLLHVANLATILCFPAAVVLLVESIT
..:.: : . .. :.: . :.:. . ... .: : . :.:. ..
CCDS58 FLSTFSVPYF-LFQHWATGYSKSSHPLIRSLFHGFLFMIFQIG-VLGFGPTYVVLAYTLP
140 150 160 170 180 190
200 210 220 230 240 250
pF1KE4 PVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHTVSYPDNLTYR
:.. .. :.: .. ..:. :. .. ...:. :: :: : :
CCDS58 PASRFI------IIFEQIRFVMKAHSFVRENVPRVLNSAKEKSS-----TVPIP---TVN
200 210 220 230
260 270 280 290 300
pF1KE4 DLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEML--FFTQLQVGLIQQWMVPTIQN-S
. :::::::: :. ..::.: .: .. .. ... :: . ... .: ..: .
CCDS58 QYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCFFYVYYI--FERLCAPLFRNIK
240 250 260 270 280 290
310 320 330 340 350 360
pF1KE4 MKPFKDMDYSRI-IERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREFYRDWWN
..::. .:. . ... .:. :: .. :. ..: ::: ::...:::: ::.::::
CCDS58 QEPFS----ARVLVLCVFNSILPGVLILFLTFFAFLHCWLNAFAEMLRFGDRMFYKDWWN
300 310 320 330 340 350
370 380 390 400 410 420
pF1KE4 SESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKW--MARTGVFLASAFFHEYLVSVPLR
: : . ....::. :: : . :: .: :... : .:: .:: ::: ..: :
CCDS58 STSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAMLAVFAVSAVVHEYALAVCLS
360 370 380 390 400 410
430 440 450 460 470
pF1KE4 -----MFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVL
.: :. : :: . ..:. . :. .: ::..:. . . .: ...:
CCDS58 FFYPVLFVLFMFFGMA----FNFIVNDSRKKPIWNVLMWTSLFLGNGVLLCFYSQEWYAR
420 430 440 450 460
480
pF1KE4 NYEAPAAEA
CCDS58 QHCPLKNPTFLDYVRPRSWTCRYVF
470 480 490
>>CCDS1330.1 SOAT1 gene_id:6646|Hs108|chr1 (550 aa)
initn: 358 init1: 231 opt: 405 Z-score: 412.3 bits: 85.9 E(32554): 1.2e-16
Smith-Waterman score: 446; 26.6% identity (56.0% similar) in 448 aa overlap (53-477:110-523)
30 40 50 60 70 80
pF1KE4 GPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVGSGHWELRCHRLQDSLFSSDSGF
:: : .:. . . : : :. :
CCDS13 LIEKSASLDNGGCALTTFSVLEGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVD---
80 90 100 110 120 130
90 100 110 120 130
pF1KE4 SNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQVVSL-FLKDP---YSWPAPCLVI
. : : . ...::: .. . : : :: ....: : : : ..: ..
CCDS13 -HIRTIYHMFIALLILFILSTLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTW----WIM
140 150 160 170 180 190
140 150 160 170 180 190
pF1KE4 AANVFAVAAFQVEKRLAVGALTEQA--------GLLLHVANLATILCFPAAVVLLVESIT
..:.: : . .. :.: . :.:. . ... .: : . :.:. ..
CCDS13 FLSTFSVPYF-LFQHWATGYSKSSHPLIRSLFHGFLFMIFQIG-VLGFGPTYVVLAYTLP
200 210 220 230 240
200 210 220 230 240 250
pF1KE4 PVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHTVSYPDNLTYR
:.. .. :.: .. ..:. :. .. ...:. :: :: : :
CCDS13 PASRFI------IIFEQIRFVMKAHSFVRENVPRVLNSAKEKSS-----TVPIP---TVN
250 260 270 280 290
260 270 280 290 300
pF1KE4 DLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEML--FFTQLQVGLIQQWMVPTIQN-S
. :::::::: :. ..::.: .: .. .. ... :: . ... .: ..: .
CCDS13 QYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCFFYVYYI--FERLCAPLFRNIK
300 310 320 330 340 350
310 320 330 340 350 360
pF1KE4 MKPFKDMDYSRI-IERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREFYRDWWN
..::. .:. . ... .:. :: .. :. ..: ::: ::...:::: ::.::::
CCDS13 QEPFS----ARVLVLCVFNSILPGVLILFLTFFAFLHCWLNAFAEMLRFGDRMFYKDWWN
360 370 380 390 400
370 380 390 400 410 420
pF1KE4 SESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKW--MARTGVFLASAFFHEYLVSVPLR
: : . ....::. :: : . :: .: :... : .:: .:: ::: ..: :
CCDS13 STSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAMLAVFAVSAVVHEYALAVCLS
410 420 430 440 450 460
430 440 450 460 470
pF1KE4 -----MFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVL
.: :. : :: . ..:. . :. .: ::..:. . . .: ...:
CCDS13 FFYPVLFVLFMFFGMA----FNFIVNDSRKKPIWNVLMWTSLFLGNGVLLCFYSQEWYAR
470 480 490 500 510 520
480
pF1KE4 NYEAPAAEA
CCDS13 QHCPLKNPTFLDYVRPRSWTCRYVF
530 540 550
488 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 00:31:36 2016 done: Sun Nov 6 00:31:36 2016
Total Scan time: 3.240 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]