FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4304, 301 aa
1>>>pF1KE4304 301 - 301 aa - 301 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6815+/-0.000808; mu= 13.6122+/- 0.048
mean_var=84.2546+/-17.257, 0's: 0 Z-trim(108.3): 63 B-trim: 0 in 0/51
Lambda= 0.139726
statistics sampled from 10074 (10134) to 10074 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.69), E-opt: 0.2 (0.311), width: 16
Scan time: 2.410
The best scores are: opt bits E(32554)
CCDS4333.1 HAVCR2 gene_id:84868|Hs108|chr5 ( 301) 2020 416.9 9.3e-117
CCDS43392.1 HAVCR1 gene_id:26762|Hs108|chr5 ( 364) 344 79.1 5.5e-15
CCDS78076.1 HAVCR1 gene_id:26762|Hs108|chr5 ( 401) 344 79.1 5.9e-15
CCDS54943.1 TIMD4 gene_id:91937|Hs108|chr5 ( 350) 333 76.9 2.5e-14
CCDS4332.1 TIMD4 gene_id:91937|Hs108|chr5 ( 378) 333 76.9 2.6e-14
>>CCDS4333.1 HAVCR2 gene_id:84868|Hs108|chr5 (301 aa)
initn: 2020 init1: 2020 opt: 2020 Z-score: 2209.6 bits: 416.9 E(32554): 9.3e-117
Smith-Waterman score: 2020; 99.7% identity (99.7% similar) in 301 aa overlap (1-301:1-301)
10 20 30 40 50 60
pF1KE4 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGACPV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGACPV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 FECGNVVLRTDERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGIMND
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 FECGNVVLRTDERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGIMND
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 EKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQISTLA
::::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::
CCDS43 EKFNLKLVIKPAKVTPAPTRQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQISTLA
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 NELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNLSLI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 NELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNLSLI
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 SLANLPPSGLANAVAEGIRSEENIYTIEENVYEVEEPNEYYCYVSSRQQPSQPLGCRFAM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 SLANLPPSGLANAVAEGIRSEENIYTIEENVYEVEEPNEYYCYVSSRQQPSQPLGCRFAM
250 260 270 280 290 300
pF1KE4 P
:
CCDS43 P
>>CCDS43392.1 HAVCR1 gene_id:26762|Hs108|chr5 (364 aa)
initn: 364 init1: 210 opt: 344 Z-score: 382.5 bits: 79.1 E(32554): 5.5e-15
Smith-Waterman score: 344; 36.0% identity (62.8% similar) in 172 aa overlap (10-178:6-168)
10 20 30 40 50
pF1KE4 MFSHLPFDCVLLLLLLLLTRS--SEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC
:.: :.: :. : . :. .:.: .. ::: :. : .. .::..:.:
CCDS43 MHPQVVILSLILHLADSVAGSVKVGGEAGPSVTLPCHYS----GAVTSMCWNRGSC
10 20 30 40 50
60 70 80 90 100 110
pF1KE4 PVFECGNVVLRTDERDVNYWT-SRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI
.: : : .. :. :.: .:: : ::. . ::::::::....:::.::::.. :
CCDS43 SLFTCQNGIVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGW
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE4 MNDEKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQIS
.:: :....: : : ::: .: . .. : . :.: . : : . . : .
CCDS43 FNDMKITVSLEIVPPKVTTTPIV-----TTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP
120 130 140 150 160
180 190 200 210 220 230
pF1KE4 TLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNL
:
CCDS43 TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMPLPRQNHEPVAT
170 180 190 200 210 220
>>CCDS78076.1 HAVCR1 gene_id:26762|Hs108|chr5 (401 aa)
initn: 340 init1: 210 opt: 344 Z-score: 381.9 bits: 79.1 E(32554): 5.9e-15
Smith-Waterman score: 344; 36.0% identity (62.8% similar) in 172 aa overlap (10-178:6-168)
10 20 30 40 50
pF1KE4 MFSHLPFDCVLLLLLLLLTRS--SEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC
:.: :.: :. : . :. .:.: .. ::: :. : .. .::..:.:
CCDS78 MHPQVVILSLILHLADSVAGSVKVGGEAGPSVTLPCHYS----GAVTSMCWNRGSC
10 20 30 40 50
60 70 80 90 100 110
pF1KE4 PVFECGNVVLRTDERDVNYWT-SRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI
.: : : .. :. :.: .:: : ::. . ::::::::....:::.::::.. :
CCDS78 SLFTCQNGIVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGW
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE4 MNDEKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQIS
.:: :....: : : ::: .: . .. : . :.: . : : . . : .
CCDS78 FNDMKITVSLEIVPPKVTTTPIV-----TTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP
120 130 140 150 160
180 190 200 210 220 230
pF1KE4 TLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNL
:
CCDS78 TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMPLPRQNHEPVAT
170 180 190 200 210 220
>>CCDS54943.1 TIMD4 gene_id:91937|Hs108|chr5 (350 aa)
initn: 276 init1: 199 opt: 333 Z-score: 370.8 bits: 76.9 E(32554): 2.5e-14
Smith-Waterman score: 333; 35.6% identity (60.3% similar) in 174 aa overlap (15-180:17-189)
10 20 30 40 50
pF1KE4 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC
: : .::. .:. . :::.:. . : .:::: :
CCDS54 MSKEPLILWLMIEFWWLYLTPVTSETVVTEVLGHRVTLPCLYSSWSH-NSNSMCWGKDQC
10 20 30 40 50
60 70 80 90 100 110
pF1KE4 PVFECGNVVLRTD-ERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI
: : ....::: : .. ...: :.: . .::::::: : . .:::.:::::..::
CCDS54 PYSGCKEALIRTDGMRVTSRKSAKYRLQGTIPRGDVSLTILNPSESDSGVYCCRIEVPGW
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE4 MNDEKFNLKLVIKPAKVTP---APTLQRDFTAAFPRMLTTRGHGPAETQT-LGSLPDINL
.:: :.:..: .. :..: : : : :.. : :: : . . ::..
CCDS54 FNDVKINVRLNLQRASTTTHRTATTTTRRTTTTSPTTTRQMTTTPAALPTTVVTTPDLTT
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE4 ---TQISTLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHS
:..:.:
CCDS54 GTPLQMTTIAVFTTANTCLSLTPSTLPEEATGLLTPEPSKEGPILTAESETVLPSDSWSS
180 190 200 210 220 230
>>CCDS4332.1 TIMD4 gene_id:91937|Hs108|chr5 (378 aa)
initn: 276 init1: 199 opt: 333 Z-score: 370.3 bits: 76.9 E(32554): 2.6e-14
Smith-Waterman score: 333; 35.6% identity (60.3% similar) in 174 aa overlap (15-180:17-189)
10 20 30 40 50
pF1KE4 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC
: : .::. .:. . :::.:. . : .:::: :
CCDS43 MSKEPLILWLMIEFWWLYLTPVTSETVVTEVLGHRVTLPCLYSSWSH-NSNSMCWGKDQC
10 20 30 40 50
60 70 80 90 100 110
pF1KE4 PVFECGNVVLRTD-ERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI
: : ....::: : .. ...: :.: . .::::::: : . .:::.:::::..::
CCDS43 PYSGCKEALIRTDGMRVTSRKSAKYRLQGTIPRGDVSLTILNPSESDSGVYCCRIEVPGW
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE4 MNDEKFNLKLVIKPAKVTP---APTLQRDFTAAFPRMLTTRGHGPAETQT-LGSLPDINL
.:: :.:..: .. :..: : : : :.. : :: : . . ::..
CCDS43 FNDVKINVRLNLQRASTTTHRTATTTTRRTTTTSPTTTRQMTTTPAALPTTVVTTPDLTT
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE4 ---TQISTLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHS
:..:.:
CCDS43 GTPLQMTTIAVFTTANTCLSLTPSTLPEEATGLLTPEPSKEGPILTAESETVLPSDSWSS
180 190 200 210 220 230
301 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 23:29:09 2016 done: Sat Nov 5 23:29:10 2016
Total Scan time: 2.410 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]