FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4304, 301 aa 1>>>pF1KE4304 301 - 301 aa - 301 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6815+/-0.000808; mu= 13.6122+/- 0.048 mean_var=84.2546+/-17.257, 0's: 0 Z-trim(108.3): 63 B-trim: 0 in 0/51 Lambda= 0.139726 statistics sampled from 10074 (10134) to 10074 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.69), E-opt: 0.2 (0.311), width: 16 Scan time: 2.410 The best scores are: opt bits E(32554) CCDS4333.1 HAVCR2 gene_id:84868|Hs108|chr5 ( 301) 2020 416.9 9.3e-117 CCDS43392.1 HAVCR1 gene_id:26762|Hs108|chr5 ( 364) 344 79.1 5.5e-15 CCDS78076.1 HAVCR1 gene_id:26762|Hs108|chr5 ( 401) 344 79.1 5.9e-15 CCDS54943.1 TIMD4 gene_id:91937|Hs108|chr5 ( 350) 333 76.9 2.5e-14 CCDS4332.1 TIMD4 gene_id:91937|Hs108|chr5 ( 378) 333 76.9 2.6e-14 >>CCDS4333.1 HAVCR2 gene_id:84868|Hs108|chr5 (301 aa) initn: 2020 init1: 2020 opt: 2020 Z-score: 2209.6 bits: 416.9 E(32554): 9.3e-117 Smith-Waterman score: 2020; 99.7% identity (99.7% similar) in 301 aa overlap (1-301:1-301) 10 20 30 40 50 60 pF1KE4 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGACPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGACPV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 FECGNVVLRTDERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGIMND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 FECGNVVLRTDERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGIMND 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 EKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQISTLA ::::::::::::::::::: :::::::::::::::::::::::::::::::::::::::: CCDS43 EKFNLKLVIKPAKVTPAPTRQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQISTLA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 NELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNLSLI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 NELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNLSLI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 SLANLPPSGLANAVAEGIRSEENIYTIEENVYEVEEPNEYYCYVSSRQQPSQPLGCRFAM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 SLANLPPSGLANAVAEGIRSEENIYTIEENVYEVEEPNEYYCYVSSRQQPSQPLGCRFAM 250 260 270 280 290 300 pF1KE4 P : CCDS43 P >>CCDS43392.1 HAVCR1 gene_id:26762|Hs108|chr5 (364 aa) initn: 364 init1: 210 opt: 344 Z-score: 382.5 bits: 79.1 E(32554): 5.5e-15 Smith-Waterman score: 344; 36.0% identity (62.8% similar) in 172 aa overlap (10-178:6-168) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRS--SEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC :.: :.: :. : . :. .:.: .. ::: :. : .. .::..:.: CCDS43 MHPQVVILSLILHLADSVAGSVKVGGEAGPSVTLPCHYS----GAVTSMCWNRGSC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTDERDVNYWT-SRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI .: : : .. :. :.: .:: : ::. . ::::::::....:::.::::.. : CCDS43 SLFTCQNGIVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQIS .:: :....: : : ::: .: . .. : . :.: . : : . . : . CCDS43 FNDMKITVSLEIVPPKVTTTPIV-----TTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 TLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNL : CCDS43 TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMPLPRQNHEPVAT 170 180 190 200 210 220 >>CCDS78076.1 HAVCR1 gene_id:26762|Hs108|chr5 (401 aa) initn: 340 init1: 210 opt: 344 Z-score: 381.9 bits: 79.1 E(32554): 5.9e-15 Smith-Waterman score: 344; 36.0% identity (62.8% similar) in 172 aa overlap (10-178:6-168) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRS--SEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC :.: :.: :. : . :. .:.: .. ::: :. : .. .::..:.: CCDS78 MHPQVVILSLILHLADSVAGSVKVGGEAGPSVTLPCHYS----GAVTSMCWNRGSC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTDERDVNYWT-SRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI .: : : .. :. :.: .:: : ::. . ::::::::....:::.::::.. : CCDS78 SLFTCQNGIVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQIS .:: :....: : : ::: .: . .. : . :.: . : : . . : . CCDS78 FNDMKITVSLEIVPPKVTTTPIV-----TTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 TLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNL : CCDS78 TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMPLPRQNHEPVAT 170 180 190 200 210 220 >>CCDS54943.1 TIMD4 gene_id:91937|Hs108|chr5 (350 aa) initn: 276 init1: 199 opt: 333 Z-score: 370.8 bits: 76.9 E(32554): 2.5e-14 Smith-Waterman score: 333; 35.6% identity (60.3% similar) in 174 aa overlap (15-180:17-189) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC : : .::. .:. . :::.:. . : .:::: : CCDS54 MSKEPLILWLMIEFWWLYLTPVTSETVVTEVLGHRVTLPCLYSSWSH-NSNSMCWGKDQC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTD-ERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI : : ....::: : .. ...: :.: . .::::::: : . .:::.:::::..:: CCDS54 PYSGCKEALIRTDGMRVTSRKSAKYRLQGTIPRGDVSLTILNPSESDSGVYCCRIEVPGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTP---APTLQRDFTAAFPRMLTTRGHGPAETQT-LGSLPDINL .:: :.:..: .. :..: : : : :.. : :: : . . ::.. CCDS54 FNDVKINVRLNLQRASTTTHRTATTTTRRTTTTSPTTTRQMTTTPAALPTTVVTTPDLTT 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE4 ---TQISTLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHS :..:.: CCDS54 GTPLQMTTIAVFTTANTCLSLTPSTLPEEATGLLTPEPSKEGPILTAESETVLPSDSWSS 180 190 200 210 220 230 >>CCDS4332.1 TIMD4 gene_id:91937|Hs108|chr5 (378 aa) initn: 276 init1: 199 opt: 333 Z-score: 370.3 bits: 76.9 E(32554): 2.6e-14 Smith-Waterman score: 333; 35.6% identity (60.3% similar) in 174 aa overlap (15-180:17-189) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC : : .::. .:. . :::.:. . : .:::: : CCDS43 MSKEPLILWLMIEFWWLYLTPVTSETVVTEVLGHRVTLPCLYSSWSH-NSNSMCWGKDQC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTD-ERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI : : ....::: : .. ...: :.: . .::::::: : . .:::.:::::..:: CCDS43 PYSGCKEALIRTDGMRVTSRKSAKYRLQGTIPRGDVSLTILNPSESDSGVYCCRIEVPGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTP---APTLQRDFTAAFPRMLTTRGHGPAETQT-LGSLPDINL .:: :.:..: .. :..: : : : :.. : :: : . . ::.. CCDS43 FNDVKINVRLNLQRASTTTHRTATTTTRRTTTTSPTTTRQMTTTPAALPTTVVTTPDLTT 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE4 ---TQISTLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHS :..:.: CCDS43 GTPLQMTTIAVFTTANTCLSLTPSTLPEEATGLLTPEPSKEGPILTAESETVLPSDSWSS 180 190 200 210 220 230 301 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 23:29:09 2016 done: Sat Nov 5 23:29:10 2016 Total Scan time: 2.410 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]