FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6212, 238 aa
1>>>pF1KE6212 238 - 238 aa - 238 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.9690+/-0.00034; mu= 16.0060+/- 0.021
mean_var=63.8178+/-12.678, 0's: 0 Z-trim(113.7): 68 B-trim: 238 in 1/54
Lambda= 0.160547
statistics sampled from 23061 (23131) to 23061 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.659), E-opt: 0.2 (0.271), width: 16
Scan time: 5.930
The best scores are: opt bits E(85289)
NP_006395 (OMIM: 600646) endothelial protein C rec ( 238) 1592 377.2 1.3e-104
XP_011526798 (OMIM: 600646) PREDICTED: endothelial ( 203) 1353 321.8 5.4e-88
NP_001306074 (OMIM: 188410) antigen-presenting gly ( 242) 226 60.9 2.4e-09
XP_011508427 (OMIM: 188410) PREDICTED: antigen-pre ( 335) 207 56.5 6.5e-08
NP_001757 (OMIM: 188410) antigen-presenting glycop ( 335) 207 56.5 6.5e-08
NP_001754 (OMIM: 188370) T-cell surface glycoprote ( 327) 180 50.3 4.8e-06
NP_001181929 (OMIM: 600764) major histocompatibili ( 249) 161 45.8 8.2e-05
XP_011508436 (OMIM: 188411) PREDICTED: T-cell surf ( 290) 157 44.9 0.00018
NP_001036049 (OMIM: 188411) T-cell surface glycopr ( 290) 157 44.9 0.00018
NP_001307581 (OMIM: 188370) T-cell surface glycopr ( 316) 157 44.9 0.00019
NP_001036050 (OMIM: 188411) T-cell surface glycopr ( 321) 157 45.0 0.00019
XP_011508435 (OMIM: 188411) PREDICTED: T-cell surf ( 333) 157 45.0 0.0002
NP_001036048 (OMIM: 188411) T-cell surface glycopr ( 376) 157 45.0 0.00022
NP_112155 (OMIM: 188411) T-cell surface glycoprote ( 388) 157 45.0 0.00022
NP_001297142 (OMIM: 600764) major histocompatibili ( 178) 153 43.9 0.00023
XP_006711684 (OMIM: 188410) PREDICTED: antigen-pre ( 231) 147 42.5 0.00073
XP_011508429 (OMIM: 188410) PREDICTED: antigen-pre ( 231) 147 42.5 0.00073
XP_005245640 (OMIM: 188410) PREDICTED: antigen-pre ( 231) 147 42.5 0.00073
NP_001756 (OMIM: 188340) T-cell surface glycoprote ( 333) 142 41.5 0.0022
XP_005245636 (OMIM: 188340) PREDICTED: T-cell surf ( 346) 142 41.5 0.0023
NP_001181964 (OMIM: 600764) major histocompatibili ( 214) 136 40.0 0.004
XP_006711349 (OMIM: 600764) PREDICTED: major histo ( 312) 136 40.1 0.0055
NP_001522 (OMIM: 600764) major histocompatibility ( 341) 136 40.1 0.0059
>>NP_006395 (OMIM: 600646) endothelial protein C recepto (238 aa)
initn: 1592 init1: 1592 opt: 1592 Z-score: 1998.9 bits: 377.2 E(85289): 1.3e-104
Smith-Waterman score: 1592; 99.6% identity (100.0% similar) in 238 aa overlap (1-238:1-238)
10 20 30 40 50 60
pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVLEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_006 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVLEG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 PDTNTTIIQLQPLQEPESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRCFLGCEL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_006 PDTNTTIIQLQPLQEPESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRCFLGCEL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 PPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRTRYELRE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_006 PPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRTRYELRE
130 140 150 160 170 180
190 200 210 220 230
pF1KE6 FLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLCTGGRRC
::::::::::::::::::::::::::::::::::::::.:::::::::::::::::::
NP_006 FLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGSFIIAGVAVGIFLCTGGRRC
190 200 210 220 230
>>XP_011526798 (OMIM: 600646) PREDICTED: endothelial pro (203 aa)
initn: 1353 init1: 1353 opt: 1353 Z-score: 1700.8 bits: 321.8 E(85289): 5.4e-88
Smith-Waterman score: 1353; 100.0% identity (100.0% similar) in 200 aa overlap (1-200:1-200)
10 20 30 40 50 60
pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVLEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVLEG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 PDTNTTIIQLQPLQEPESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRCFLGCEL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 PDTNTTIIQLQPLQEPESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRCFLGCEL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 PPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRTRYELRE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 PPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRTRYELRE
130 140 150 160 170 180
190 200 210 220 230
pF1KE6 FLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLCTGGRRC
::::::::::::::::::::
XP_011 FLEDTCVQYVQKHISAENTKDLY
190 200
>>NP_001306074 (OMIM: 188410) antigen-presenting glycopr (242 aa)
initn: 183 init1: 81 opt: 226 Z-score: 288.9 bits: 60.9 E(85289): 2.4e-09
Smith-Waterman score: 226; 29.2% identity (58.0% similar) in 219 aa overlap (2-216:4-214)
10 20 30 40 50
pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVL
: :: ::..:. : .. . : :. :::: : . . .: : :: ::
NP_001 MGCLLFLLLWALLQAWG--SAEVPQRLFPLRCLQISSFANSSWTRTDGLAWLGELQTHSW
10 20 30 40 50
60 70 80 90 100 110
pF1KE6 EGPDTNTTIIQLQPLQE----PESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRC
. :..: . .:.: .. ..: : .. : .: :. . :..:: ..
NP_001 SN-DSDT-VRSLKPWSQGTFSDQQWETLQHIFRVYRSSFTRDVKEFAKMLRLSYPLELQV
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE6 FLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRT
:::. : :. .. ::.:: .:....::. :. :: . :....: :: . :
NP_001 SAGCEVHP-GNASNNFFHVAFQGKDILSFQGTS--WEP-TQEAPLWVNLAIQVLNQDKWT
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE6 RYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLCTG
: .. .:. :: :.:. . . ... .. . ::::. : .:
NP_001 RETVQWLLNGTCPQFVSGLLESGKSELKKQGGSYTSMGLIALAVLACLLFLLIVGFTSRF
180 190 200 210 220 230
pF1KE6 GRRC
NP_001 KRQTSYQGVL
240
>>XP_011508427 (OMIM: 188410) PREDICTED: antigen-present (335 aa)
initn: 164 init1: 62 opt: 207 Z-score: 263.1 bits: 56.5 E(85289): 6.5e-08
Smith-Waterman score: 207; 30.1% identity (57.0% similar) in 193 aa overlap (2-190:4-188)
10 20 30 40 50
pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVL
: :: ::..:. : .. . : :. :::: : . . .: : :: ::
XP_011 MGCLLFLLLWALLQAWG--SAEVPQRLFPLRCLQISSFANSSWTRTDGLAWLGELQTHSW
10 20 30 40 50
60 70 80 90 100 110
pF1KE6 EGPDTNTTIIQLQPLQE----PESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRC
. :..: . .:.: .. ..: : .. : .: :. . :..:: ..
XP_011 SN-DSDT-VRSLKPWSQGTFSDQQWETLQHIFRVYRSSFTRDVKEFAKMLRLSYPLELQV
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE6 FLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRT
:::. : :. .. ::.:: .:....::. :. :: . :....: :: . :
XP_011 SAGCEVHP-GNASNNFFHVAFQGKDILSFQGTS--WEP-TQEAPLWVNLAIQVLNQDKWT
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE6 RYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLCTG
: .. .:. :: :.:
XP_011 RETVQWLLNGTCPQFVSGLLESGKSELKKQVKPKAWLSRGPSPGPGRLLLVCHVSGFYPK
180 190 200 210 220 230
>>NP_001757 (OMIM: 188410) antigen-presenting glycoprote (335 aa)
initn: 164 init1: 62 opt: 207 Z-score: 263.1 bits: 56.5 E(85289): 6.5e-08
Smith-Waterman score: 207; 30.1% identity (57.0% similar) in 193 aa overlap (2-190:4-188)
10 20 30 40 50
pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVL
: :: ::..:. : .. . : :. :::: : . . .: : :: ::
NP_001 MGCLLFLLLWALLQAWG--SAEVPQRLFPLRCLQISSFANSSWTRTDGLAWLGELQTHSW
10 20 30 40 50
60 70 80 90 100 110
pF1KE6 EGPDTNTTIIQLQPLQE----PESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRC
. :..: . .:.: .. ..: : .. : .: :. . :..:: ..
NP_001 SN-DSDT-VRSLKPWSQGTFSDQQWETLQHIFRVYRSSFTRDVKEFAKMLRLSYPLELQV
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE6 FLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRT
:::. : :. .. ::.:: .:....::. :. :: . :....: :: . :
NP_001 SAGCEVHP-GNASNNFFHVAFQGKDILSFQGTS--WEP-TQEAPLWVNLAIQVLNQDKWT
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE6 RYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLCTG
: .. .:. :: :.:
NP_001 RETVQWLLNGTCPQFVSGLLESGKSELKKQVKPKAWLSRGPSPGPGRLLLVCHVSGFYPK
180 190 200 210 220 230
>>NP_001754 (OMIM: 188370) T-cell surface glycoprotein C (327 aa)
initn: 123 init1: 40 opt: 180 Z-score: 229.4 bits: 50.3 E(85289): 4.8e-06
Smith-Waterman score: 180; 27.4% identity (56.7% similar) in 201 aa overlap (1-190:1-187)
10 20 30 40 50
pF1KE6 MLTTLLPIL-LLSGWAFCSQDASDGLQR---LHMLQISYFRDPYHVWYQGNAS--LGGHL
:: :::.: .: : . .:::.. .:. :. : . : : :. .: :.
NP_001 MLFLLLPLLAVLPG-----DGNADGLKEPLSFHVTWIASFYN--HSWKQNLVSGWLSDLQ
10 20 30 40 50
60 70 80 90 100
pF1KE6 THVLEGPDTNTTIIQLQPLQEP----ESWARTQSGLQSYLLQ-FHGLVRLVHQERTLAFP
::. .. ...::. : : .. : : . .. .. .. :.:. : .: : . .:
NP_001 THTWDS--NSSTIVFLCPWSRGNFSNEEWKELETLFRIRTIRSFEGIRRYAH-ELQFEYP
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE6 LTIRCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLN
. :. :::: :. . :...: .::.::::. . : :..... . ::
NP_001 FEIQVTGGCELHS-GKVSGSFLQLAYQGSDFVSFQNNS--WLP-YPVAGNMAKHFCKVLN
120 130 140 150 160
170 180 190 200 210 220
pF1KE6 AYNRTRYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGI
.. ...: ::: ...
NP_001 QNQHENDITHNLLSDTCPRFILGLLDAGKAHLQRQVKPEAWLSHGPSPGPGHLQLVCHVS
170 180 190 200 210 220
>>NP_001181929 (OMIM: 600764) major histocompatibility c (249 aa)
initn: 63 init1: 45 opt: 161 Z-score: 207.4 bits: 45.8 E(85289): 8.2e-05
Smith-Waterman score: 161; 23.6% identity (54.2% similar) in 225 aa overlap (27-237:23-234)
10 20 30 40 50
pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFR----DPYHVWYQGNASLGGHLTH
: : :. ::: :: : . :.: .:
NP_001 MGELMAFLLPLIIVLMVKHSDSRTHSLR--YFRLGVSDPIHGVPEF-ISVGYVDSH
10 20 30 40 50
60 70 80 90 100
pF1KE6 VLEGPDTNTTIIQLQP----LQE---PESWARTQSGLQSYLLQFHGLVRLVHQERTLAFP
. :. : : .: . : :. : : . :... .:. .. .... . .
NP_001 PITTYDSVTR--QKEPRAPWMAENLAPDHWERYTQLLRGWQQMFKVELKRLQRHYNHSGS
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE6 LTIRCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLN
: . ..:::: .:: . :.. : .:..:. : . : : .:. :..:
NP_001 HTYQRMIGCELLEDGSTTG-FLQYAYDGQDFLIFNKDTLSWLAVDNVAH-----TIKQ--
120 130 140 150 160
170 180 190 200 210 220
pF1KE6 AYNRTRYEL---REFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVA
:.. ...:: ...::. :. .... . . ..: ::. .. :..... :
NP_001 AWEANQHELLYQKNWLEEECIAWLKRFLEYGKDTLQRTESETIPLVMKAVSGSIVLVIVL
170 180 190 200 210 220
230
pF1KE6 VGIFLCTGGRRC
.:. . . ::
NP_001 AGVGVLVWRRRPREQNGAIYLPTPDR
230 240
>>XP_011508436 (OMIM: 188411) PREDICTED: T-cell surface (290 aa)
initn: 69 init1: 47 opt: 157 Z-score: 201.4 bits: 44.9 E(85289): 0.00018
Smith-Waterman score: 157; 26.9% identity (57.3% similar) in 171 aa overlap (28-190:37-194)
10 20 30 40 50
pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWY--QGNASLGGHLT
..::: : : . : : .:.. :: :
XP_011 LFEGLCCPGENTAAPQALQSYHLAAEEQLSFRMLQTSSFAN--HSWAHSEGSGWLGDLQT
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE6 HVLEGPDTNTTIIQ-LQPLQEPESWARTQSGLQS-YLLQFHGLVRLVHQ---ERTLAFPL
: : :: :. :.: .. . . ..::: . : ::.....:. . : .:.
XP_011 H---GWDTVLGTIRFLKPWSHGNFSKQELKNLQSLFQLYFHSFIQIVQASAGQFQLEYPF
70 80 90 100 110 120
120 130 140 150 160
pF1KE6 TIRCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQ-LN
:. . ::.. . ..:...: .::.:.:: . :. . .:. . .. . ::
XP_011 EIQILAGCRM----NAPQIFLNMAYQGSDFLSF--QGISWEPSP--GAGIRAQNICKVLN
130 140 150 160 170
170 180 190 200 210 220
pF1KE6 AYNRTRYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGI
: . :. .: :: ...
XP_011 RYLDIKEILQSLLGHTCPRFLAGLMEAGESELKRKVKPEAWLSCGPSPGPGRLQLVCHVS
180 190 200 210 220 230
>>NP_001036049 (OMIM: 188411) T-cell surface glycoprotei (290 aa)
initn: 69 init1: 47 opt: 157 Z-score: 201.4 bits: 44.9 E(85289): 0.00018
Smith-Waterman score: 157; 26.9% identity (57.3% similar) in 171 aa overlap (28-190:37-194)
10 20 30 40 50
pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWY--QGNASLGGHLT
..::: : : . : : .:.. :: :
NP_001 LFEGLCCPGENTAAPQALQSYHLAAEEQLSFRMLQTSSFAN--HSWAHSEGSGWLGDLQT
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE6 HVLEGPDTNTTIIQ-LQPLQEPESWARTQSGLQS-YLLQFHGLVRLVHQ---ERTLAFPL
: : :: :. :.: .. . . ..::: . : ::.....:. . : .:.
NP_001 H---GWDTVLGTIRFLKPWSHGNFSKQELKNLQSLFQLYFHSFIQIVQASAGQFQLEYPF
70 80 90 100 110 120
120 130 140 150 160
pF1KE6 TIRCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQ-LN
:. . ::.. . ..:...: .::.:.:: . :. . .:. . .. . ::
NP_001 EIQILAGCRM----NAPQIFLNMAYQGSDFLSF--QGISWEPSP--GAGIRAQNICKVLN
130 140 150 160 170
170 180 190 200 210 220
pF1KE6 AYNRTRYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGI
: . :. .: :: ...
NP_001 RYLDIKEILQSLLGHTCPRFLAGLMEAGESELKRKVKPEAWLSCGPSPGPGRLQLVCHVS
180 190 200 210 220 230
>>NP_001307581 (OMIM: 188370) T-cell surface glycoprotei (316 aa)
initn: 101 init1: 40 opt: 157 Z-score: 200.9 bits: 44.9 E(85289): 0.00019
Smith-Waterman score: 157; 26.9% identity (56.1% similar) in 171 aa overlap (28-190:16-176)
10 20 30 40 50
pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNAS--LGGHLTHVL
.:. :. : . : : :. .: :. ::.
NP_001 MFAFGGATGLKEPLSFHVTWIASFYN--HSWKQNLVSGWLSDLQTHTW
10 20 30 40
60 70 80 90 100 110
pF1KE6 EGPDTNT-TIIQLQPLQEP----ESWARTQSGLQSYLLQ-FHGLVRLVHQERTLAFPLTI
:.:. ::. : : .. : : . .. .. .. :.:. : .: : . .:. :
NP_001 ---DSNSSTIVFLCPWSRGNFSNEEWKELETLFRIRTIRSFEGIRRYAH-ELQFEYPFEI
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE6 RCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYN
. :::: :. . :...: .::.::::. . : :..... . :: .
NP_001 QVTGGCELHS-GKVSGSFLQLAYQGSDFVSFQNNS--WLP-YPVAGNMAKHFCKVLNQNQ
110 120 130 140 150
180 190 200 210 220 230
pF1KE6 RTRYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLC
. ...: ::: ...
NP_001 HENDITHNLLSDTCPRFILGLLDAGKAHLQRQVKPEAWLSHGPSPGPGHLQLVCHVSGFY
160 170 180 190 200 210
238 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 11:08:04 2016 done: Tue Nov 8 11:08:05 2016
Total Scan time: 5.930 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]