FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4466, 488 aa 1>>>pF1KE4466 488 - 488 aa - 488 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.2053+/-0.00089; mu= 14.8637+/- 0.054 mean_var=98.6910+/-19.963, 0's: 0 Z-trim(108.6): 12 B-trim: 53 in 1/51 Lambda= 0.129103 statistics sampled from 10290 (10298) to 10290 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.669), E-opt: 0.2 (0.316), width: 16 Scan time: 3.240 The best scores are: opt bits E(32554) CCDS6420.1 DGAT1 gene_id:8694|Hs108|chr8 ( 488) 3345 633.5 1.6e-181 CCDS8847.1 SOAT2 gene_id:8435|Hs108|chr12 ( 522) 432 90.9 3.6e-18 CCDS58048.1 SOAT1 gene_id:6646|Hs108|chr1 ( 485) 405 85.9 1.1e-16 CCDS58047.1 SOAT1 gene_id:6646|Hs108|chr1 ( 492) 405 85.9 1.1e-16 CCDS1330.1 SOAT1 gene_id:6646|Hs108|chr1 ( 550) 405 85.9 1.2e-16 >>CCDS6420.1 DGAT1 gene_id:8694|Hs108|chr8 (488 aa) initn: 3345 init1: 3345 opt: 3345 Z-score: 3372.5 bits: 633.5 E(32554): 1.6e-181 Smith-Waterman score: 3345; 100.0% identity (100.0% similar) in 488 aa overlap (1-488:1-488) 10 20 30 40 50 60 pF1KE4 MGDRGSSRRRRTGSRPSSHGGGGPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 MGDRGSSRRRRTGSRPSSHGGGGPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 SGHWELRCHRLQDSLFSSDSGFSNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 SGHWELRCHRLQDSLFSSDSGFSNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 VSLFLKDPYSWPAPCLVIAANVFAVAAFQVEKRLAVGALTEQAGLLLHVANLATILCFPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 VSLFLKDPYSWPAPCLVIAANVFAVAAFQVEKRLAVGALTEQAGLLLHVANLATILCFPA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 AVVLLVESITPVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 AVVLLVESITPVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 VSYPDNLTYRDLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEMLFFTQLQVGLIQQWM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 VSYPDNLTYRDLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEMLFFTQLQVGLIQQWM 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 VPTIQNSMKPFKDMDYSRIIERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 VPTIQNSMKPFKDMDYSRIIERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREF 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 YRDWWNSESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKWMARTGVFLASAFFHEYLVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 YRDWWNSESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKWMARTGVFLASAFFHEYLVS 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 VPLRMFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVLN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 VPLRMFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVLN 430 440 450 460 470 480 pF1KE4 YEAPAAEA :::::::: CCDS64 YEAPAAEA >>CCDS8847.1 SOAT2 gene_id:8435|Hs108|chr12 (522 aa) initn: 306 init1: 213 opt: 432 Z-score: 439.8 bits: 90.9 E(32554): 3.6e-18 Smith-Waterman score: 479; 27.9% identity (54.8% similar) in 451 aa overlap (45-477:79-497) 20 30 40 50 60 pF1KE4 RPSSHGGGGPAAAEEEVRDAAAGPDVGAAGDAPAPAP-----NKDGDAGVGSGHWELRCH : : : : .. . ..:. . . . CCDS88 KAQLLEQAQGQLRELLDRAMREAIQSYPSQDKPLPPPPPGSLSRTQEPSLGKQKVFIIRK 50 60 70 80 90 100 70 80 90 100 110 120 pF1KE4 RLQDSLFSSDSGFSNYRGILNWCVVMLILSNARLFLEN---LIKYGILVDPIQVVSLFLK : : :. . . :. .. :..: . : :... :... .:. . . : : CCDS88 SLLDELMEVQHFRTIYHMFIAGLCVFIISTLAIDFIDEGRLLLEFDLLIFSFGQLPLAL- 110 120 130 140 150 160 130 140 150 160 170 180 pF1KE4 DPYSWPAPCLVIAANVFAVAAFQVEKRLAVGALTEQAGL-LLHVANLATILC-FPAAVVL .: .: .. . .: .:. . : :. :. .:: .: :..:: .:. :. CCDS88 --VTW-VPMFLSTL----LAPYQALRLWARGTWTQATGLGCALLAAHAVVLCALPVHVA- 170 180 190 200 210 190 200 210 220 230 240 pF1KE4 LVESITPVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHTVSYP :: : .: .:. . . :: . :: :. :.: . : ... . :: :: CCDS88 -VEHQLPPASRCVLVFEQVRFL-MKSY----SFLREAVPGTLRA-RRGEGIQAPSFSSY- 220 230 240 250 260 270 250 260 270 280 290 300 pF1KE4 DNLTYRDLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEMLFFTQLQVGLIQQWMVPTI :::: ::: :. ..::.: .: .. . . . : . .. . ::.. CCDS88 --------LYFLFCPTLIYRETYPRTPYVRWNYVAKNFAQALGCVLYACFILGRLCVPVF 280 290 300 310 320 310 320 330 340 350 360 pF1KE4 QN-SMKPFKDMDYSRIIERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREFYRD : : .::. .. .:. ..:. .. :..:. ..: ::: ::...:::: :::: CCDS88 ANMSREPFST---RALVLSILHATLPGIFMLLLIFFAFLHCWLNAFAEMLRFGDRMFYRD 330 340 350 360 370 380 370 380 390 400 410 420 pF1KE4 WWNSESVTYFWQNWNIPVHKWCIRHFYKPMLRR-GS-SKWMARTGVFLASAFFHEYLVSV :::: : . ....::. :: : . :. :: :. .. .: ::::.:: :::. CCDS88 WWNSTSFSNYYRTWNVVVHDWLYSYVYQDGLRLLGARARGVAMLGVFLVSAVAHEYIFCF 390 400 410 420 430 440 430 440 450 460 470 pF1KE4 PLRMFR-----LWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDY : .: :. : : : ... : :. .: :..:: : : .: ... CCDS88 VLGFFYPVMLILFLVIGGM----LNFMMHDQRTGPAWNVLMWTMLFLGQGIQVSLYCQEW 450 460 470 480 490 480 pF1KE4 YVLNYEAPAAEA : CCDS88 YARRHCPLPQATFWGLVTPRSWSCHT 500 510 520 >>CCDS58048.1 SOAT1 gene_id:6646|Hs108|chr1 (485 aa) initn: 358 init1: 231 opt: 405 Z-score: 413.1 bits: 85.9 E(32554): 1.1e-16 Smith-Waterman score: 446; 26.6% identity (56.0% similar) in 448 aa overlap (53-477:45-458) 30 40 50 60 70 80 pF1KE4 GPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVGSGHWELRCHRLQDSLFSSDSGF :: : .:. . . : : :. : CCDS58 LIEKSASLDNGGCALTTFSVLEGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVD--- 20 30 40 50 60 70 90 100 110 120 130 pF1KE4 SNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQVVSL-FLKDP---YSWPAPCLVI . : : . ...::: .. . : : :: ....: : : : ..: .. CCDS58 -HIRTIYHMFIALLILFILSTLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTW----WIM 80 90 100 110 120 140 150 160 170 180 190 pF1KE4 AANVFAVAAFQVEKRLAVGALTEQA--------GLLLHVANLATILCFPAAVVLLVESIT ..:.: : . .. :.: . :.:. . ... .: : . :.:. .. CCDS58 FLSTFSVPYF-LFQHWATGYSKSSHPLIRSLFHGFLFMIFQIG-VLGFGPTYVVLAYTLP 130 140 150 160 170 180 200 210 220 230 240 250 pF1KE4 PVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHTVSYPDNLTYR :.. .. :.: .. ..:. :. .. ...:. :: :: : : CCDS58 PASRFI------IIFEQIRFVMKAHSFVRENVPRVLNSAKEKSS-----TVPIP---TVN 190 200 210 220 230 260 270 280 290 300 pF1KE4 DLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEML--FFTQLQVGLIQQWMVPTIQN-S . :::::::: :. ..::.: .: .. .. ... :: . ... .: ..: . CCDS58 QYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCFFYVYYI--FERLCAPLFRNIK 240 250 260 270 280 310 320 330 340 350 360 pF1KE4 MKPFKDMDYSRI-IERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREFYRDWWN ..::. .:. . ... .:. :: .. :. ..: ::: ::...:::: ::.:::: CCDS58 QEPFS----ARVLVLCVFNSILPGVLILFLTFFAFLHCWLNAFAEMLRFGDRMFYKDWWN 290 300 310 320 330 340 370 380 390 400 410 420 pF1KE4 SESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKW--MARTGVFLASAFFHEYLVSVPLR : : . ....::. :: : . :: .: :... : .:: .:: ::: ..: : CCDS58 STSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAMLAVFAVSAVVHEYALAVCLS 350 360 370 380 390 400 430 440 450 460 470 pF1KE4 -----MFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVL .: :. : :: . ..:. . :. .: ::..:. . . .: ...: CCDS58 FFYPVLFVLFMFFGMA----FNFIVNDSRKKPIWNVLMWTSLFLGNGVLLCFYSQEWYAR 410 420 430 440 450 460 480 pF1KE4 NYEAPAAEA CCDS58 QHCPLKNPTFLDYVRPRSWTCRYVF 470 480 >>CCDS58047.1 SOAT1 gene_id:6646|Hs108|chr1 (492 aa) initn: 358 init1: 231 opt: 405 Z-score: 413.0 bits: 85.9 E(32554): 1.1e-16 Smith-Waterman score: 446; 26.6% identity (56.0% similar) in 448 aa overlap (53-477:52-465) 30 40 50 60 70 80 pF1KE4 GPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVGSGHWELRCHRLQDSLFSSDSGF :: : .:. . . : : :. : CCDS58 LIEKSASLDNGGCALTTFSVLEGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVD--- 30 40 50 60 70 90 100 110 120 130 pF1KE4 SNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQVVSL-FLKDP---YSWPAPCLVI . : : . ...::: .. . : : :: ....: : : : ..: .. CCDS58 -HIRTIYHMFIALLILFILSTLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTW----WIM 80 90 100 110 120 130 140 150 160 170 180 190 pF1KE4 AANVFAVAAFQVEKRLAVGALTEQA--------GLLLHVANLATILCFPAAVVLLVESIT ..:.: : . .. :.: . :.:. . ... .: : . :.:. .. CCDS58 FLSTFSVPYF-LFQHWATGYSKSSHPLIRSLFHGFLFMIFQIG-VLGFGPTYVVLAYTLP 140 150 160 170 180 190 200 210 220 230 240 250 pF1KE4 PVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHTVSYPDNLTYR :.. .. :.: .. ..:. :. .. ...:. :: :: : : CCDS58 PASRFI------IIFEQIRFVMKAHSFVRENVPRVLNSAKEKSS-----TVPIP---TVN 200 210 220 230 260 270 280 290 300 pF1KE4 DLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEML--FFTQLQVGLIQQWMVPTIQN-S . :::::::: :. ..::.: .: .. .. ... :: . ... .: ..: . CCDS58 QYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCFFYVYYI--FERLCAPLFRNIK 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE4 MKPFKDMDYSRI-IERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREFYRDWWN ..::. .:. . ... .:. :: .. :. ..: ::: ::...:::: ::.:::: CCDS58 QEPFS----ARVLVLCVFNSILPGVLILFLTFFAFLHCWLNAFAEMLRFGDRMFYKDWWN 300 310 320 330 340 350 370 380 390 400 410 420 pF1KE4 SESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKW--MARTGVFLASAFFHEYLVSVPLR : : . ....::. :: : . :: .: :... : .:: .:: ::: ..: : CCDS58 STSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAMLAVFAVSAVVHEYALAVCLS 360 370 380 390 400 410 430 440 450 460 470 pF1KE4 -----MFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVL .: :. : :: . ..:. . :. .: ::..:. . . .: ...: CCDS58 FFYPVLFVLFMFFGMA----FNFIVNDSRKKPIWNVLMWTSLFLGNGVLLCFYSQEWYAR 420 430 440 450 460 480 pF1KE4 NYEAPAAEA CCDS58 QHCPLKNPTFLDYVRPRSWTCRYVF 470 480 490 >>CCDS1330.1 SOAT1 gene_id:6646|Hs108|chr1 (550 aa) initn: 358 init1: 231 opt: 405 Z-score: 412.3 bits: 85.9 E(32554): 1.2e-16 Smith-Waterman score: 446; 26.6% identity (56.0% similar) in 448 aa overlap (53-477:110-523) 30 40 50 60 70 80 pF1KE4 GPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVGSGHWELRCHRLQDSLFSSDSGF :: : .:. . . : : :. : CCDS13 LIEKSASLDNGGCALTTFSVLEGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVD--- 80 90 100 110 120 130 90 100 110 120 130 pF1KE4 SNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQVVSL-FLKDP---YSWPAPCLVI . : : . ...::: .. . : : :: ....: : : : ..: .. CCDS13 -HIRTIYHMFIALLILFILSTLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTW----WIM 140 150 160 170 180 190 140 150 160 170 180 190 pF1KE4 AANVFAVAAFQVEKRLAVGALTEQA--------GLLLHVANLATILCFPAAVVLLVESIT ..:.: : . .. :.: . :.:. . ... .: : . :.:. .. CCDS13 FLSTFSVPYF-LFQHWATGYSKSSHPLIRSLFHGFLFMIFQIG-VLGFGPTYVVLAYTLP 200 210 220 230 240 200 210 220 230 240 250 pF1KE4 PVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHTVSYPDNLTYR :.. .. :.: .. ..:. :. .. ...:. :: :: : : CCDS13 PASRFI------IIFEQIRFVMKAHSFVRENVPRVLNSAKEKSS-----TVPIP---TVN 250 260 270 280 290 260 270 280 290 300 pF1KE4 DLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEML--FFTQLQVGLIQQWMVPTIQN-S . :::::::: :. ..::.: .: .. .. ... :: . ... .: ..: . CCDS13 QYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCFFYVYYI--FERLCAPLFRNIK 300 310 320 330 340 350 310 320 330 340 350 360 pF1KE4 MKPFKDMDYSRI-IERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREFYRDWWN ..::. .:. . ... .:. :: .. :. ..: ::: ::...:::: ::.:::: CCDS13 QEPFS----ARVLVLCVFNSILPGVLILFLTFFAFLHCWLNAFAEMLRFGDRMFYKDWWN 360 370 380 390 400 370 380 390 400 410 420 pF1KE4 SESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKW--MARTGVFLASAFFHEYLVSVPLR : : . ....::. :: : . :: .: :... : .:: .:: ::: ..: : CCDS13 STSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAMLAVFAVSAVVHEYALAVCLS 410 420 430 440 450 460 430 440 450 460 470 pF1KE4 -----MFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVL .: :. : :: . ..:. . :. .: ::..:. . . .: ...: CCDS13 FFYPVLFVLFMFFGMA----FNFIVNDSRKKPIWNVLMWTSLFLGNGVLLCFYSQEWYAR 470 480 490 500 510 520 480 pF1KE4 NYEAPAAEA CCDS13 QHCPLKNPTFLDYVRPRSWTCRYVF 530 540 550 488 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:31:36 2016 done: Sun Nov 6 00:31:36 2016 Total Scan time: 3.240 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]