FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5449, 335 aa 1>>>pF1KE5449 335 - 335 aa - 335 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.1620+/-0.000954; mu= 4.4595+/- 0.058 mean_var=189.9829+/-37.985, 0's: 0 Z-trim(112.4): 9 B-trim: 0 in 0/52 Lambda= 0.093050 statistics sampled from 13174 (13181) to 13174 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.748), E-opt: 0.2 (0.405), width: 16 Scan time: 2.630 The best scores are: opt bits E(32554) CCDS59273.1 IST1 gene_id:9798|Hs108|chr16 ( 335) 2197 306.9 1.5e-83 CCDS59272.1 IST1 gene_id:9798|Hs108|chr16 ( 366) 1685 238.2 8e-63 CCDS59271.1 IST1 gene_id:9798|Hs108|chr16 ( 379) 1685 238.2 8.2e-63 CCDS10905.1 IST1 gene_id:9798|Hs108|chr16 ( 360) 1673 236.6 2.4e-62 CCDS59274.1 IST1 gene_id:9798|Hs108|chr16 ( 218) 760 113.8 1.3e-25 >>CCDS59273.1 IST1 gene_id:9798|Hs108|chr16 (335 aa) initn: 2197 init1: 2197 opt: 2197 Z-score: 1613.3 bits: 306.9 E(32554): 1.5e-83 Smith-Waterman score: 2197; 100.0% identity (100.0% similar) in 335 aa overlap (1-335:1-335) 10 20 30 40 50 60 pF1KE5 MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPMPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPMPS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 ANTPFSYPLPKGPVDDINADKNISSAQIVGPGPKPEASAKLPSRPADNYDNFVLPELPSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 ANTPFSYPLPKGPVDDINADKNISSAQIVGPGPKPEASAKLPSRPADNYDNFVLPELPSV 250 260 270 280 290 300 310 320 330 pF1KE5 PDTLPTASAGASTSASEDIDFDDLSRRFEELKKKT ::::::::::::::::::::::::::::::::::: CCDS59 PDTLPTASAGASTSASEDIDFDDLSRRFEELKKKT 310 320 330 >>CCDS59272.1 IST1 gene_id:9798|Hs108|chr16 (366 aa) initn: 1762 init1: 1670 opt: 1685 Z-score: 1241.3 bits: 238.2 E(32554): 8e-63 Smith-Waterman score: 2125; 91.5% identity (91.5% similar) in 366 aa overlap (1-335:1-366) 10 20 30 40 50 60 pF1KE5 MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPMPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPMPS 190 200 210 220 230 240 250 260 pF1KE5 ANTPFSYPLPKGP-------------------------------VDDINADKNISSAQIV ::::::::::::: :::::::::::::::: CCDS59 ANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDDINADKNISSAQIV 250 260 270 280 290 300 270 280 290 300 310 320 pF1KE5 GPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRRFE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 GPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRRFE 310 320 330 340 350 360 330 pF1KE5 ELKKKT :::::: CCDS59 ELKKKT >>CCDS59271.1 IST1 gene_id:9798|Hs108|chr16 (379 aa) initn: 1762 init1: 1670 opt: 1685 Z-score: 1241.1 bits: 238.2 E(32554): 8.2e-63 Smith-Waterman score: 2125; 91.5% identity (91.5% similar) in 366 aa overlap (1-335:14-379) 10 20 30 40 pF1KE5 MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAG ::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MVFKLKTKEEQHSMLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAG 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE5 KDERARIRVEHIIREDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 KDERARIRVEHIIREDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAA 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE5 PRLQSEVAELKIVADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 PRLQSEVAELKIVADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLI 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE5 EIAKNYNVPYEPDSVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 EIAKNYNVPYEPDSVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGT 190 200 210 220 230 240 230 240 250 pF1KE5 VPMPMPMPMPMPSANTPFSYPLPKGP-------------------------------VDD :::::::::::::::::::::::::: ::: CCDS59 VPMPMPMPMPMPSANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDD 250 260 270 280 290 300 260 270 280 290 300 310 pF1KE5 INADKNISSAQIVGPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 INADKNISSAQIVGPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSAS 310 320 330 340 350 360 320 330 pF1KE5 EDIDFDDLSRRFEELKKKT ::::::::::::::::::: CCDS59 EDIDFDDLSRRFEELKKKT 370 >>CCDS10905.1 IST1 gene_id:9798|Hs108|chr16 (360 aa) initn: 1762 init1: 1670 opt: 1673 Z-score: 1232.7 bits: 236.6 E(32554): 2.4e-62 Smith-Waterman score: 1673; 89.1% identity (94.6% similar) in 294 aa overlap (1-294:1-287) 10 20 30 40 50 60 pF1KE5 MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 ADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEPD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPMPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 SVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPMPS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 ANTPFSYPLPKGPVDDINADKNISSAQIVGPGPKPEASAKLPSRPADNYDNFVLPELPSV ::::::::::::: .:.:. ... : :. .: ..:. : .:....: CCDS10 ANTPFSYPLPKGP-SDFNGLP-MGTYQAF-PNIHPP---QIPATPP-SYESMTLMLIRIS 250 260 270 280 290 310 320 330 pF1KE5 PDTLPTASAGASTSASEDIDFDDLSRRFEELKKKT CCDS10 LLHRLLVLDPSQKPLQSFLPDLQITMTTLSYQSCHLCQTHYQLHLLVPAPQHLKTLTLMI 300 310 320 330 340 350 >>CCDS59274.1 IST1 gene_id:9798|Hs108|chr16 (218 aa) initn: 837 init1: 745 opt: 760 Z-score: 573.3 bits: 113.8 E(32554): 1.3e-25 Smith-Waterman score: 1200; 85.8% identity (85.8% similar) in 218 aa overlap (149-335:1-218) 120 130 140 150 160 170 pF1KE5 IVADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYE :::::::::::::::::::::::::::::: CCDS59 MHKLSVEAPPKILVERYLIEIAKNYNVPYE 10 20 30 180 190 200 210 220 230 pF1KE5 PDSVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 PDSVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFTAPVGGPDGTVPMPMPMPMPM 40 50 60 70 80 90 240 250 260 pF1KE5 PSANTPFSYPLPKGP-------------------------------VDDINADKNISSAQ ::::::::::::::: :::::::::::::: CCDS59 PSANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPATPPSYESVDDINADKNISSAQ 100 110 120 130 140 150 270 280 290 300 310 320 pF1KE5 IVGPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 IVGPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLPTASAGASTSASEDIDFDDLSRR 160 170 180 190 200 210 330 pF1KE5 FEELKKKT :::::::: CCDS59 FEELKKKT 335 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 00:52:40 2016 done: Tue Nov 8 00:52:41 2016 Total Scan time: 2.630 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]