FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6212, 238 aa 1>>>pF1KE6212 238 - 238 aa - 238 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9690+/-0.00034; mu= 16.0060+/- 0.021 mean_var=63.8178+/-12.678, 0's: 0 Z-trim(113.7): 68 B-trim: 238 in 1/54 Lambda= 0.160547 statistics sampled from 23061 (23131) to 23061 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.659), E-opt: 0.2 (0.271), width: 16 Scan time: 5.930 The best scores are: opt bits E(85289) NP_006395 (OMIM: 600646) endothelial protein C rec ( 238) 1592 377.2 1.3e-104 XP_011526798 (OMIM: 600646) PREDICTED: endothelial ( 203) 1353 321.8 5.4e-88 NP_001306074 (OMIM: 188410) antigen-presenting gly ( 242) 226 60.9 2.4e-09 XP_011508427 (OMIM: 188410) PREDICTED: antigen-pre ( 335) 207 56.5 6.5e-08 NP_001757 (OMIM: 188410) antigen-presenting glycop ( 335) 207 56.5 6.5e-08 NP_001754 (OMIM: 188370) T-cell surface glycoprote ( 327) 180 50.3 4.8e-06 NP_001181929 (OMIM: 600764) major histocompatibili ( 249) 161 45.8 8.2e-05 XP_011508436 (OMIM: 188411) PREDICTED: T-cell surf ( 290) 157 44.9 0.00018 NP_001036049 (OMIM: 188411) T-cell surface glycopr ( 290) 157 44.9 0.00018 NP_001307581 (OMIM: 188370) T-cell surface glycopr ( 316) 157 44.9 0.00019 NP_001036050 (OMIM: 188411) T-cell surface glycopr ( 321) 157 45.0 0.00019 XP_011508435 (OMIM: 188411) PREDICTED: T-cell surf ( 333) 157 45.0 0.0002 NP_001036048 (OMIM: 188411) T-cell surface glycopr ( 376) 157 45.0 0.00022 NP_112155 (OMIM: 188411) T-cell surface glycoprote ( 388) 157 45.0 0.00022 NP_001297142 (OMIM: 600764) major histocompatibili ( 178) 153 43.9 0.00023 XP_006711684 (OMIM: 188410) PREDICTED: antigen-pre ( 231) 147 42.5 0.00073 XP_011508429 (OMIM: 188410) PREDICTED: antigen-pre ( 231) 147 42.5 0.00073 XP_005245640 (OMIM: 188410) PREDICTED: antigen-pre ( 231) 147 42.5 0.00073 NP_001756 (OMIM: 188340) T-cell surface glycoprote ( 333) 142 41.5 0.0022 XP_005245636 (OMIM: 188340) PREDICTED: T-cell surf ( 346) 142 41.5 0.0023 NP_001181964 (OMIM: 600764) major histocompatibili ( 214) 136 40.0 0.004 XP_006711349 (OMIM: 600764) PREDICTED: major histo ( 312) 136 40.1 0.0055 NP_001522 (OMIM: 600764) major histocompatibility ( 341) 136 40.1 0.0059 >>NP_006395 (OMIM: 600646) endothelial protein C recepto (238 aa) initn: 1592 init1: 1592 opt: 1592 Z-score: 1998.9 bits: 377.2 E(85289): 1.3e-104 Smith-Waterman score: 1592; 99.6% identity (100.0% similar) in 238 aa overlap (1-238:1-238) 10 20 30 40 50 60 pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVLEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVLEG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PDTNTTIIQLQPLQEPESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRCFLGCEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 PDTNTTIIQLQPLQEPESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRCFLGCEL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 PPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRTRYELRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 PPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRTRYELRE 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 FLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLCTGGRRC ::::::::::::::::::::::::::::::::::::::.::::::::::::::::::: NP_006 FLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGSFIIAGVAVGIFLCTGGRRC 190 200 210 220 230 >>XP_011526798 (OMIM: 600646) PREDICTED: endothelial pro (203 aa) initn: 1353 init1: 1353 opt: 1353 Z-score: 1700.8 bits: 321.8 E(85289): 5.4e-88 Smith-Waterman score: 1353; 100.0% identity (100.0% similar) in 200 aa overlap (1-200:1-200) 10 20 30 40 50 60 pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVLEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVLEG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PDTNTTIIQLQPLQEPESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRCFLGCEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 PDTNTTIIQLQPLQEPESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRCFLGCEL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 PPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRTRYELRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 PPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRTRYELRE 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 FLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLCTGGRRC :::::::::::::::::::: XP_011 FLEDTCVQYVQKHISAENTKDLY 190 200 >>NP_001306074 (OMIM: 188410) antigen-presenting glycopr (242 aa) initn: 183 init1: 81 opt: 226 Z-score: 288.9 bits: 60.9 E(85289): 2.4e-09 Smith-Waterman score: 226; 29.2% identity (58.0% similar) in 219 aa overlap (2-216:4-214) 10 20 30 40 50 pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVL : :: ::..:. : .. . : :. :::: : . . .: : :: :: NP_001 MGCLLFLLLWALLQAWG--SAEVPQRLFPLRCLQISSFANSSWTRTDGLAWLGELQTHSW 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 EGPDTNTTIIQLQPLQE----PESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRC . :..: . .:.: .. ..: : .. : .: :. . :..:: .. NP_001 SN-DSDT-VRSLKPWSQGTFSDQQWETLQHIFRVYRSSFTRDVKEFAKMLRLSYPLELQV 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 FLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRT :::. : :. .. ::.:: .:....::. :. :: . :....: :: . : NP_001 SAGCEVHP-GNASNNFFHVAFQGKDILSFQGTS--WEP-TQEAPLWVNLAIQVLNQDKWT 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 RYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLCTG : .. .:. :: :.:. . . ... .. . ::::. : .: NP_001 RETVQWLLNGTCPQFVSGLLESGKSELKKQGGSYTSMGLIALAVLACLLFLLIVGFTSRF 180 190 200 210 220 230 pF1KE6 GRRC NP_001 KRQTSYQGVL 240 >>XP_011508427 (OMIM: 188410) PREDICTED: antigen-present (335 aa) initn: 164 init1: 62 opt: 207 Z-score: 263.1 bits: 56.5 E(85289): 6.5e-08 Smith-Waterman score: 207; 30.1% identity (57.0% similar) in 193 aa overlap (2-190:4-188) 10 20 30 40 50 pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVL : :: ::..:. : .. . : :. :::: : . . .: : :: :: XP_011 MGCLLFLLLWALLQAWG--SAEVPQRLFPLRCLQISSFANSSWTRTDGLAWLGELQTHSW 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 EGPDTNTTIIQLQPLQE----PESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRC . :..: . .:.: .. ..: : .. : .: :. . :..:: .. XP_011 SN-DSDT-VRSLKPWSQGTFSDQQWETLQHIFRVYRSSFTRDVKEFAKMLRLSYPLELQV 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 FLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRT :::. : :. .. ::.:: .:....::. :. :: . :....: :: . : XP_011 SAGCEVHP-GNASNNFFHVAFQGKDILSFQGTS--WEP-TQEAPLWVNLAIQVLNQDKWT 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 RYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLCTG : .. .:. :: :.: XP_011 RETVQWLLNGTCPQFVSGLLESGKSELKKQVKPKAWLSRGPSPGPGRLLLVCHVSGFYPK 180 190 200 210 220 230 >>NP_001757 (OMIM: 188410) antigen-presenting glycoprote (335 aa) initn: 164 init1: 62 opt: 207 Z-score: 263.1 bits: 56.5 E(85289): 6.5e-08 Smith-Waterman score: 207; 30.1% identity (57.0% similar) in 193 aa overlap (2-190:4-188) 10 20 30 40 50 pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVL : :: ::..:. : .. . : :. :::: : . . .: : :: :: NP_001 MGCLLFLLLWALLQAWG--SAEVPQRLFPLRCLQISSFANSSWTRTDGLAWLGELQTHSW 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 EGPDTNTTIIQLQPLQE----PESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPLTIRC . :..: . .:.: .. ..: : .. : .: :. . :..:: .. NP_001 SN-DSDT-VRSLKPWSQGTFSDQQWETLQHIFRVYRSSFTRDVKEFAKMLRLSYPLELQV 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 FLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYNRT :::. : :. .. ::.:: .:....::. :. :: . :....: :: . : NP_001 SAGCEVHP-GNASNNFFHVAFQGKDILSFQGTS--WEP-TQEAPLWVNLAIQVLNQDKWT 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 RYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLCTG : .. .:. :: :.: NP_001 RETVQWLLNGTCPQFVSGLLESGKSELKKQVKPKAWLSRGPSPGPGRLLLVCHVSGFYPK 180 190 200 210 220 230 >>NP_001754 (OMIM: 188370) T-cell surface glycoprotein C (327 aa) initn: 123 init1: 40 opt: 180 Z-score: 229.4 bits: 50.3 E(85289): 4.8e-06 Smith-Waterman score: 180; 27.4% identity (56.7% similar) in 201 aa overlap (1-190:1-187) 10 20 30 40 50 pF1KE6 MLTTLLPIL-LLSGWAFCSQDASDGLQR---LHMLQISYFRDPYHVWYQGNAS--LGGHL :: :::.: .: : . .:::.. .:. :. : . : : :. .: :. NP_001 MLFLLLPLLAVLPG-----DGNADGLKEPLSFHVTWIASFYN--HSWKQNLVSGWLSDLQ 10 20 30 40 50 60 70 80 90 100 pF1KE6 THVLEGPDTNTTIIQLQPLQEP----ESWARTQSGLQSYLLQ-FHGLVRLVHQERTLAFP ::. .. ...::. : : .. : : . .. .. .. :.:. : .: : . .: NP_001 THTWDS--NSSTIVFLCPWSRGNFSNEEWKELETLFRIRTIRSFEGIRRYAH-ELQFEYP 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE6 LTIRCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLN . :. :::: :. . :...: .::.::::. . : :..... . :: NP_001 FEIQVTGGCELHS-GKVSGSFLQLAYQGSDFVSFQNNS--WLP-YPVAGNMAKHFCKVLN 120 130 140 150 160 170 180 190 200 210 220 pF1KE6 AYNRTRYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGI .. ...: ::: ... NP_001 QNQHENDITHNLLSDTCPRFILGLLDAGKAHLQRQVKPEAWLSHGPSPGPGHLQLVCHVS 170 180 190 200 210 220 >>NP_001181929 (OMIM: 600764) major histocompatibility c (249 aa) initn: 63 init1: 45 opt: 161 Z-score: 207.4 bits: 45.8 E(85289): 8.2e-05 Smith-Waterman score: 161; 23.6% identity (54.2% similar) in 225 aa overlap (27-237:23-234) 10 20 30 40 50 pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFR----DPYHVWYQGNASLGGHLTH : : :. ::: :: : . :.: .: NP_001 MGELMAFLLPLIIVLMVKHSDSRTHSLR--YFRLGVSDPIHGVPEF-ISVGYVDSH 10 20 30 40 50 60 70 80 90 100 pF1KE6 VLEGPDTNTTIIQLQP----LQE---PESWARTQSGLQSYLLQFHGLVRLVHQERTLAFP . :. : : .: . : :. : : . :... .:. .. .... . . NP_001 PITTYDSVTR--QKEPRAPWMAENLAPDHWERYTQLLRGWQQMFKVELKRLQRHYNHSGS 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE6 LTIRCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLN : . ..:::: .:: . :.. : .:..:. : . : : .:. :..: NP_001 HTYQRMIGCELLEDGSTTG-FLQYAYDGQDFLIFNKDTLSWLAVDNVAH-----TIKQ-- 120 130 140 150 160 170 180 190 200 210 220 pF1KE6 AYNRTRYEL---REFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVA :.. ...:: ...::. :. .... . . ..: ::. .. :..... : NP_001 AWEANQHELLYQKNWLEEECIAWLKRFLEYGKDTLQRTESETIPLVMKAVSGSIVLVIVL 170 180 190 200 210 220 230 pF1KE6 VGIFLCTGGRRC .:. . . :: NP_001 AGVGVLVWRRRPREQNGAIYLPTPDR 230 240 >>XP_011508436 (OMIM: 188411) PREDICTED: T-cell surface (290 aa) initn: 69 init1: 47 opt: 157 Z-score: 201.4 bits: 44.9 E(85289): 0.00018 Smith-Waterman score: 157; 26.9% identity (57.3% similar) in 171 aa overlap (28-190:37-194) 10 20 30 40 50 pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWY--QGNASLGGHLT ..::: : : . : : .:.. :: : XP_011 LFEGLCCPGENTAAPQALQSYHLAAEEQLSFRMLQTSSFAN--HSWAHSEGSGWLGDLQT 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 HVLEGPDTNTTIIQ-LQPLQEPESWARTQSGLQS-YLLQFHGLVRLVHQ---ERTLAFPL : : :: :. :.: .. . . ..::: . : ::.....:. . : .:. XP_011 H---GWDTVLGTIRFLKPWSHGNFSKQELKNLQSLFQLYFHSFIQIVQASAGQFQLEYPF 70 80 90 100 110 120 120 130 140 150 160 pF1KE6 TIRCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQ-LN :. . ::.. . ..:...: .::.:.:: . :. . .:. . .. . :: XP_011 EIQILAGCRM----NAPQIFLNMAYQGSDFLSF--QGISWEPSP--GAGIRAQNICKVLN 130 140 150 160 170 170 180 190 200 210 220 pF1KE6 AYNRTRYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGI : . :. .: :: ... XP_011 RYLDIKEILQSLLGHTCPRFLAGLMEAGESELKRKVKPEAWLSCGPSPGPGRLQLVCHVS 180 190 200 210 220 230 >>NP_001036049 (OMIM: 188411) T-cell surface glycoprotei (290 aa) initn: 69 init1: 47 opt: 157 Z-score: 201.4 bits: 44.9 E(85289): 0.00018 Smith-Waterman score: 157; 26.9% identity (57.3% similar) in 171 aa overlap (28-190:37-194) 10 20 30 40 50 pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWY--QGNASLGGHLT ..::: : : . : : .:.. :: : NP_001 LFEGLCCPGENTAAPQALQSYHLAAEEQLSFRMLQTSSFAN--HSWAHSEGSGWLGDLQT 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 HVLEGPDTNTTIIQ-LQPLQEPESWARTQSGLQS-YLLQFHGLVRLVHQ---ERTLAFPL : : :: :. :.: .. . . ..::: . : ::.....:. . : .:. NP_001 H---GWDTVLGTIRFLKPWSHGNFSKQELKNLQSLFQLYFHSFIQIVQASAGQFQLEYPF 70 80 90 100 110 120 120 130 140 150 160 pF1KE6 TIRCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQ-LN :. . ::.. . ..:...: .::.:.:: . :. . .:. . .. . :: NP_001 EIQILAGCRM----NAPQIFLNMAYQGSDFLSF--QGISWEPSP--GAGIRAQNICKVLN 130 140 150 160 170 170 180 190 200 210 220 pF1KE6 AYNRTRYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGI : . :. .: :: ... NP_001 RYLDIKEILQSLLGHTCPRFLAGLMEAGESELKRKVKPEAWLSCGPSPGPGRLQLVCHVS 180 190 200 210 220 230 >>NP_001307581 (OMIM: 188370) T-cell surface glycoprotei (316 aa) initn: 101 init1: 40 opt: 157 Z-score: 200.9 bits: 44.9 E(85289): 0.00019 Smith-Waterman score: 157; 26.9% identity (56.1% similar) in 171 aa overlap (28-190:16-176) 10 20 30 40 50 pF1KE6 MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNAS--LGGHLTHVL .:. :. : . : : :. .: :. ::. NP_001 MFAFGGATGLKEPLSFHVTWIASFYN--HSWKQNLVSGWLSDLQTHTW 10 20 30 40 60 70 80 90 100 110 pF1KE6 EGPDTNT-TIIQLQPLQEP----ESWARTQSGLQSYLLQ-FHGLVRLVHQERTLAFPLTI :.:. ::. : : .. : : . .. .. .. :.:. : .: : . .:. : NP_001 ---DSNSSTIVFLCPWSRGNFSNEEWKELETLFRIRTIRSFEGIRRYAH-ELQFEYPFEI 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE6 RCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNAYN . :::: :. . :...: .::.::::. . : :..... . :: . NP_001 QVTGGCELHS-GKVSGSFLQLAYQGSDFVSFQNNS--WLP-YPVAGNMAKHFCKVLNQNQ 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 RTRYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGGFIIAGVAVGIFLC . ...: ::: ... NP_001 HENDITHNLLSDTCPRFILGLLDAGKAHLQRQVKPEAWLSHGPSPGPGHLQLVCHVSGFY 160 170 180 190 200 210 238 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:08:04 2016 done: Tue Nov 8 11:08:05 2016 Total Scan time: 5.930 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]