FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4304, 301 aa 1>>>pF1KE4304 301 - 301 aa - 301 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3652+/-0.000361; mu= 15.4826+/- 0.022 mean_var=87.7397+/-17.636, 0's: 0 Z-trim(114.6): 114 B-trim: 0 in 0/57 Lambda= 0.136923 statistics sampled from 24377 (24505) to 24377 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.671), E-opt: 0.2 (0.287), width: 16 Scan time: 7.510 The best scores are: opt bits E(85289) NP_116171 (OMIM: 606652) hepatitis A virus cellula ( 301) 2020 409.0 5.8e-114 XP_016864829 (OMIM: 147050,606518) PREDICTED: hepa ( 352) 344 78.0 3e-14 NP_001166864 (OMIM: 147050,606518) hepatitis A vir ( 364) 344 78.0 3.1e-14 NP_036338 (OMIM: 147050,606518) hepatitis A virus ( 364) 344 78.0 3.1e-14 XP_016864828 (OMIM: 147050,606518) PREDICTED: hepa ( 400) 344 78.1 3.3e-14 NP_001295085 (OMIM: 147050,606518) hepatitis A vir ( 401) 344 78.1 3.3e-14 XP_011532996 (OMIM: 610096) PREDICTED: T-cell immu ( 299) 333 75.8 1.2e-13 XP_016865510 (OMIM: 610096) PREDICTED: T-cell immu ( 323) 333 75.8 1.3e-13 NP_001140198 (OMIM: 610096) T-cell immunoglobulin ( 350) 333 75.8 1.3e-13 NP_612388 (OMIM: 610096) T-cell immunoglobulin and ( 378) 333 75.9 1.4e-13 NP_001186438 (OMIM: 604531) natural cytotoxicity t ( 270) 170 43.5 0.00054 NP_001193993 (OMIM: 602621) coxsackievirus and ade ( 200) 166 42.6 0.00076 NP_001186439 (OMIM: 604531) natural cytotoxicity t ( 258) 164 42.3 0.0012 XP_016866989 (OMIM: 604531) PREDICTED: natural cyt ( 274) 164 42.4 0.0013 NP_004819 (OMIM: 604531) natural cytotoxicity trig ( 276) 164 42.4 0.0013 XP_011527781 (OMIM: 602621) PREDICTED: coxsackievi ( 252) 159 41.3 0.0023 NP_001193992 (OMIM: 602621) coxsackievirus and ade ( 252) 159 41.3 0.0023 XP_011527780 (OMIM: 602621) PREDICTED: coxsackievi ( 258) 159 41.3 0.0024 XP_011527779 (OMIM: 602621) PREDICTED: coxsackievi ( 278) 159 41.4 0.0025 >>NP_116171 (OMIM: 606652) hepatitis A virus cellular re (301 aa) initn: 2020 init1: 2020 opt: 2020 Z-score: 2167.0 bits: 409.0 E(85289): 5.8e-114 Smith-Waterman score: 2020; 99.7% identity (99.7% similar) in 301 aa overlap (1-301:1-301) 10 20 30 40 50 60 pF1KE4 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGACPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_116 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGACPV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 FECGNVVLRTDERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGIMND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_116 FECGNVVLRTDERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGIMND 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 EKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQISTLA ::::::::::::::::::: :::::::::::::::::::::::::::::::::::::::: NP_116 EKFNLKLVIKPAKVTPAPTRQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQISTLA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 NELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNLSLI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_116 NELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNLSLI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 SLANLPPSGLANAVAEGIRSEENIYTIEENVYEVEEPNEYYCYVSSRQQPSQPLGCRFAM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_116 SLANLPPSGLANAVAEGIRSEENIYTIEENVYEVEEPNEYYCYVSSRQQPSQPLGCRFAM 250 260 270 280 290 300 pF1KE4 P : NP_116 P >>XP_016864829 (OMIM: 147050,606518) PREDICTED: hepatiti (352 aa) initn: 340 init1: 210 opt: 344 Z-score: 376.9 bits: 78.0 E(85289): 3e-14 Smith-Waterman score: 344; 36.0% identity (62.8% similar) in 172 aa overlap (10-178:6-168) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRS--SEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC :.: :.: :. : . :. .:.: .. ::: :. : .. .::..:.: XP_016 MHPQVVILSLILHLADSVAGSVKVGGEAGPSVTLPCHYS----GAVTSMCWNRGSC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTDERDVNYWT-SRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI .: : : .. :. :.: .:: : ::. . ::::::::....:::.::::.. : XP_016 SLFTCQNGIVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQIS .:: :....: : : ::: .: . .. : . :.: . : : . . : . XP_016 FNDMKITVSLEIVPPKVTTTPIV-----TTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 TLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNL : XP_016 TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMPLPRQNHEPVAT 170 180 190 200 210 220 >>NP_001166864 (OMIM: 147050,606518) hepatitis A virus c (364 aa) initn: 364 init1: 210 opt: 344 Z-score: 376.7 bits: 78.0 E(85289): 3.1e-14 Smith-Waterman score: 344; 36.0% identity (62.8% similar) in 172 aa overlap (10-178:6-168) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRS--SEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC :.: :.: :. : . :. .:.: .. ::: :. : .. .::..:.: NP_001 MHPQVVILSLILHLADSVAGSVKVGGEAGPSVTLPCHYS----GAVTSMCWNRGSC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTDERDVNYWT-SRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI .: : : .. :. :.: .:: : ::. . ::::::::....:::.::::.. : NP_001 SLFTCQNGIVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQIS .:: :....: : : ::: .: . .. : . :.: . : : . . : . NP_001 FNDMKITVSLEIVPPKVTTTPIV-----TTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 TLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNL : NP_001 TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMPLPRQNHEPVAT 170 180 190 200 210 220 >>NP_036338 (OMIM: 147050,606518) hepatitis A virus cell (364 aa) initn: 364 init1: 210 opt: 344 Z-score: 376.7 bits: 78.0 E(85289): 3.1e-14 Smith-Waterman score: 344; 36.0% identity (62.8% similar) in 172 aa overlap (10-178:6-168) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRS--SEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC :.: :.: :. : . :. .:.: .. ::: :. : .. .::..:.: NP_036 MHPQVVILSLILHLADSVAGSVKVGGEAGPSVTLPCHYS----GAVTSMCWNRGSC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTDERDVNYWT-SRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI .: : : .. :. :.: .:: : ::. . ::::::::....:::.::::.. : NP_036 SLFTCQNGIVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQIS .:: :....: : : ::: .: . .. : . :.: . : : . . : . NP_036 FNDMKITVSLEIVPPKVTTTPIV-----TTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 TLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNL : NP_036 TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMPLPRQNHEPVAT 170 180 190 200 210 220 >>XP_016864828 (OMIM: 147050,606518) PREDICTED: hepatiti (400 aa) initn: 340 init1: 210 opt: 344 Z-score: 376.1 bits: 78.1 E(85289): 3.3e-14 Smith-Waterman score: 344; 36.0% identity (62.8% similar) in 172 aa overlap (10-178:6-168) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRS--SEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC :.: :.: :. : . :. .:.: .. ::: :. : .. .::..:.: XP_016 MHPQVVILSLILHLADSVAGSVKVGGEAGPSVTLPCHYS----GAVTSMCWNRGSC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTDERDVNYWT-SRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI .: : : .. :. :.: .:: : ::. . ::::::::....:::.::::.. : XP_016 SLFTCQNGIVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQIS .:: :....: : : ::: .: . .. : . :.: . : : . . : . XP_016 FNDMKITVSLEIVPPKVTTTPIV-----TTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 TLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNL : XP_016 TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMPLPRQNHEPATS 170 180 190 200 210 220 >>NP_001295085 (OMIM: 147050,606518) hepatitis A virus c (401 aa) initn: 340 init1: 210 opt: 344 Z-score: 376.1 bits: 78.1 E(85289): 3.3e-14 Smith-Waterman score: 344; 36.0% identity (62.8% similar) in 172 aa overlap (10-178:6-168) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRS--SEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC :.: :.: :. : . :. .:.: .. ::: :. : .. .::..:.: NP_001 MHPQVVILSLILHLADSVAGSVKVGGEAGPSVTLPCHYS----GAVTSMCWNRGSC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTDERDVNYWT-SRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI .: : : .. :. :.: .:: : ::. . ::::::::....:::.::::.. : NP_001 SLFTCQNGIVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTPAPTLQRDFTAAFPRMLTTRGHGPAETQTLGSLPDINLTQIS .:: :....: : : ::: .: . .. : . :.: . : : . . : . NP_001 FNDMKITVSLEIVPPKVTTTPIV-----TTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 TLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHSKEKIQNL : NP_001 TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMPLPRQNHEPVAT 170 180 190 200 210 220 >>XP_011532996 (OMIM: 610096) PREDICTED: T-cell immunogl (299 aa) initn: 276 init1: 199 opt: 333 Z-score: 366.0 bits: 75.8 E(85289): 1.2e-13 Smith-Waterman score: 333; 35.6% identity (60.3% similar) in 174 aa overlap (15-180:17-189) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC : : .::. .:. . :::.:. . : .:::: : XP_011 MSKEPLILWLMIEFWWLYLTPVTSETVVTEVLGHRVTLPCLYSSWSH-NSNSMCWGKDQC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTD-ERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI : : ....::: : .. ...: :.: . .::::::: : . .:::.:::::..:: XP_011 PYSGCKEALIRTDGMRVTSRKSAKYRLQGTIPRGDVSLTILNPSESDSGVYCCRIEVPGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTP---APTLQRDFTAAFPRMLTTRGHGPAETQT-LGSLPDINL .:: :.:..: .. :..: : : : :.. : :: : . . ::.. XP_011 FNDVKINVRLNLQRASTTTHRTATTTTRRTTTTSPTTTRQMTTTPAALPTTVVTTPDLTT 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE4 ---TQISTLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHS :..:.: XP_011 GTPLQMTTIAVFTTANTCLSLTPSTLPEEATGLLTPEPSKEGPILTAESETVLPSDSWSS 180 190 200 210 220 230 >>XP_016865510 (OMIM: 610096) PREDICTED: T-cell immunogl (323 aa) initn: 276 init1: 199 opt: 333 Z-score: 365.6 bits: 75.8 E(85289): 1.3e-13 Smith-Waterman score: 333; 35.6% identity (60.3% similar) in 174 aa overlap (15-180:17-189) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC : : .::. .:. . :::.:. . : .:::: : XP_016 MSKEPLILWLMIEFWWLYLTPVTSETVVTEVLGHRVTLPCLYSSWSH-NSNSMCWGKDQC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTD-ERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI : : ....::: : .. ...: :.: . .::::::: : . .:::.:::::..:: XP_016 PYSGCKEALIRTDGMRVTSRKSAKYRLQGTIPRGDVSLTILNPSESDSGVYCCRIEVPGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTP---APTLQRDFTAAFPRMLTTRGHGPAETQT-LGSLPDINL .:: :.:..: .. :..: : : : :.. : :: : . . ::.. XP_016 FNDVKINVRLNLQRASTTTHRTATTTTRRTTTTSPTTTRQMTTTPAALPTTVVTTPDLTT 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE4 ---TQISTLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHS :..:.: XP_016 GTPLQMTTIAVFTTANTCLSLTPSTLPEEATGLLTPEPSKEGPILTAASDTAVPEQNKTT 180 190 200 210 220 230 >>NP_001140198 (OMIM: 610096) T-cell immunoglobulin and (350 aa) initn: 276 init1: 199 opt: 333 Z-score: 365.1 bits: 75.8 E(85289): 1.3e-13 Smith-Waterman score: 333; 35.6% identity (60.3% similar) in 174 aa overlap (15-180:17-189) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC : : .::. .:. . :::.:. . : .:::: : NP_001 MSKEPLILWLMIEFWWLYLTPVTSETVVTEVLGHRVTLPCLYSSWSH-NSNSMCWGKDQC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTD-ERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI : : ....::: : .. ...: :.: . .::::::: : . .:::.:::::..:: NP_001 PYSGCKEALIRTDGMRVTSRKSAKYRLQGTIPRGDVSLTILNPSESDSGVYCCRIEVPGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTP---APTLQRDFTAAFPRMLTTRGHGPAETQT-LGSLPDINL .:: :.:..: .. :..: : : : :.. : :: : . . ::.. NP_001 FNDVKINVRLNLQRASTTTHRTATTTTRRTTTTSPTTTRQMTTTPAALPTTVVTTPDLTT 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE4 ---TQISTLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHS :..:.: NP_001 GTPLQMTTIAVFTTANTCLSLTPSTLPEEATGLLTPEPSKEGPILTAESETVLPSDSWSS 180 190 200 210 220 230 >>NP_612388 (OMIM: 610096) T-cell immunoglobulin and muc (378 aa) initn: 276 init1: 199 opt: 333 Z-score: 364.7 bits: 75.9 E(85289): 1.4e-13 Smith-Waterman score: 333; 35.6% identity (60.3% similar) in 174 aa overlap (15-180:17-189) 10 20 30 40 50 pF1KE4 MFSHLPFDCVLLLLLLLLTRSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGAC : : .::. .:. . :::.:. . : .:::: : NP_612 MSKEPLILWLMIEFWWLYLTPVTSETVVTEVLGHRVTLPCLYSSWSH-NSNSMCWGKDQC 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 PVFECGNVVLRTD-ERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGI : : ....::: : .. ...: :.: . .::::::: : . .:::.:::::..:: NP_612 PYSGCKEALIRTDGMRVTSRKSAKYRLQGTIPRGDVSLTILNPSESDSGVYCCRIEVPGW 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 MNDEKFNLKLVIKPAKVTP---APTLQRDFTAAFPRMLTTRGHGPAETQT-LGSLPDINL .:: :.:..: .. :..: : : : :.. : :: : . . ::.. NP_612 FNDVKINVRLNLQRASTTTHRTATTTTRRTTTTSPTTTRQMTTTPAALPTTVVTTPDLTT 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE4 ---TQISTLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALALIFGALIFKWYSHS :..:.: NP_612 GTPLQMTTIAVFTTANTCLSLTPSTLPEEATGLLTPEPSKEGPILTAESETVLPSDSWSS 180 190 200 210 220 230 301 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 23:29:10 2016 done: Sat Nov 5 23:29:11 2016 Total Scan time: 7.510 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]