FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1665, 184 aa 1>>>pF1KE1665 184 - 184 aa - 184 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.7767+/-0.000385; mu= 11.6765+/- 0.024 mean_var=130.9799+/-25.366, 0's: 0 Z-trim(117.9): 126 B-trim: 391 in 2/53 Lambda= 0.112065 statistics sampled from 30079 (30241) to 30079 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.724), E-opt: 0.2 (0.355), width: 16 Scan time: 5.850 The best scores are: opt bits E(85289) NP_008967 (OMIM: 601521) endothelial cell-specific ( 184) 1349 228.7 4.1e-60 NP_001129076 (OMIM: 601521) endothelial cell-speci ( 134) 746 131.1 7.4e-31 XP_016859749 (OMIM: 606189) PREDICTED: cysteine-ri ( 955) 217 46.5 0.00015 XP_016859748 (OMIM: 606189) PREDICTED: cysteine-ri ( 971) 217 46.5 0.00015 NP_057525 (OMIM: 606189) cysteine-rich motor neuro (1036) 217 46.6 0.00015 XP_011542734 (OMIM: 610700) PREDICTED: serine prot ( 343) 203 43.8 0.00036 XP_011542733 (OMIM: 610700) PREDICTED: serine prot ( 380) 203 43.8 0.00039 NP_710159 (OMIM: 610700) serine protease HTRA4 pre ( 476) 203 43.9 0.00045 XP_005264414 (OMIM: 606189) PREDICTED: cysteine-ri ( 978) 207 44.9 0.00046 XP_016859747 (OMIM: 606189) PREDICTED: cysteine-ri ( 996) 205 44.6 0.00058 XP_011531203 (OMIM: 606189) PREDICTED: cysteine-ri (1003) 205 44.6 0.00058 XP_011531201 (OMIM: 606189) PREDICTED: cysteine-ri (1012) 205 44.6 0.00058 XP_011531200 (OMIM: 606189) PREDICTED: cysteine-ri (1077) 205 44.7 0.00061 NP_002505 (OMIM: 164958) protein NOV homolog precu ( 357) 198 43.0 0.00065 XP_016857696 (OMIM: 600222) PREDICTED: tyrosine-pr ( 830) 202 44.0 0.00072 NP_001240286 (OMIM: 600222) tyrosine-protein kinas (1093) 202 44.2 0.00086 NP_005415 (OMIM: 600222) tyrosine-protein kinase r (1138) 202 44.2 0.00088 NP_001295048 (OMIM: 612453,614399) multiple epider ( 567) 196 42.9 0.0011 NP_001295050 (OMIM: 612453,614399) multiple epider ( 567) 196 42.9 0.0011 XP_005251618 (OMIM: 600195,600221) PREDICTED: angi (1123) 196 43.2 0.0017 NP_000450 (OMIM: 600195,600221) angiopoietin-1 rec (1124) 196 43.2 0.0017 NP_001243474 (OMIM: 612453,614399) multiple epider (1140) 196 43.2 0.0017 XP_011541996 (OMIM: 612453,614399) PREDICTED: mult (1140) 196 43.2 0.0017 NP_115822 (OMIM: 612453,614399) multiple epidermal (1140) 196 43.2 0.0017 XP_016865476 (OMIM: 612453,614399) PREDICTED: mult (1195) 196 43.3 0.0018 NP_001284488 (OMIM: 608785) serine protease HTRA3 ( 357) 189 41.5 0.0018 XP_011511898 (OMIM: 608785) PREDICTED: serine prot ( 373) 189 41.5 0.0018 NP_444272 (OMIM: 608785) serine protease HTRA3 iso ( 453) 189 41.6 0.0021 NP_001892 (OMIM: 121009) connective tissue growth ( 349) 184 40.7 0.0031 NP_002766 (OMIM: 600142,602194,610149,616779) seri ( 480) 183 40.7 0.0042 NP_001543 (OMIM: 146733) insulin-like growth facto ( 258) 179 39.7 0.0044 NP_001310298 (OMIM: 603399) WNT1-inducible-signali ( 218) 178 39.5 0.0045 NP_001310299 (OMIM: 603399) WNT1-inducible-signali ( 250) 174 38.9 0.0076 NP_003872 (OMIM: 603399) WNT1-inducible-signaling ( 250) 174 38.9 0.0076 XP_016870188 (OMIM: 610413) PREDICTED: insulin-lik ( 174) 171 38.2 0.0084 >>NP_008967 (OMIM: 601521) endothelial cell-specific mol (184 aa) initn: 1349 init1: 1349 opt: 1349 Z-score: 1200.3 bits: 228.7 E(85289): 4.1e-60 Smith-Waterman score: 1349; 100.0% identity (100.0% similar) in 184 aa overlap (1-184:1-184) 10 20 30 40 50 60 pF1KE1 MKSVLLLTTLLVPAHLVAAWSNNYAVDCPQHCDSSECKSSPRCKRTVLDDCGCCRVCAAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_008 MKSVLLLTTLLVPAHLVAAWSNNYAVDCPQHCDSSECKSSPRCKRTVLDDCGCCRVCAAG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RGETCYRTVSGMDGMKCGPGLRCQPSNGEDPFGEEFGICKDCPYGTFGMDCRETCNCQSG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_008 RGETCYRTVSGMDGMKCGPGLRCQPSNGEDPFGEEFGICKDCPYGTFGMDCRETCNCQSG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 ICDRGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASGDGNIVREEVVKENAAGSPVMRKW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_008 ICDRGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASGDGNIVREEVVKENAAGSPVMRKW 130 140 150 160 170 180 pF1KE1 LNPR :::: NP_008 LNPR >>NP_001129076 (OMIM: 601521) endothelial cell-specific (134 aa) initn: 982 init1: 746 opt: 746 Z-score: 675.0 bits: 131.1 E(85289): 7.4e-31 Smith-Waterman score: 866; 72.8% identity (72.8% similar) in 184 aa overlap (1-184:1-134) 10 20 30 40 50 60 pF1KE1 MKSVLLLTTLLVPAHLVAAWSNNYAVDCPQHCDSSECKSSPRCKRTVLDDCGCCRVCAAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MKSVLLLTTLLVPAHLVAAWSNNYAVDCPQHCDSSECKSSPRCKRTVLDDCGCCRVCAAG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RGETCYRTVSGMDGMKCGPGLRCQPSNGEDPFGEEFGICKDCPYGTFGMDCRETCNCQSG :::::::::::::::::::::::::::::::::::::::: NP_001 RGETCYRTVSGMDGMKCGPGLRCQPSNGEDPFGEEFGICK-------------------- 70 80 90 100 130 140 150 160 170 180 pF1KE1 ICDRGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASGDGNIVREEVVKENAAGSPVMRKW :::::::::::::::::::::::::::::: NP_001 ------------------------------EHDMASGDGNIVREEVVKENAAGSPVMRKW 110 120 130 pF1KE1 LNPR :::: NP_001 LNPR >>XP_016859749 (OMIM: 606189) PREDICTED: cysteine-rich m (955 aa) initn: 122 init1: 91 opt: 217 Z-score: 202.9 bits: 46.5 E(85289): 0.00015 Smith-Waterman score: 224; 34.5% identity (55.6% similar) in 142 aa overlap (6-134:16-152) 10 20 30 40 pF1KE1 MKSVLLLTTLLVPAHLVAAWSNNYAVDC-PQHCDSSECKSSPRCKRTVLD ::..:: :. : :.. :. : : :: :.:. : .... XP_016 MYLVAGDRGLAGCGHLLVSLL-GLLLLLARSGTRALVCLP--CDESKCEEPRNCPGSIVQ 10 20 30 40 50 50 60 70 80 90 100 pF1KE1 D-CGCCRVCAAGRGETCYRTVSGMDGMKCGPGLRC---QPSNGEDPFGEEFGICKDCPYG :::: .::. :.:.: : :. : : :::: : ::.. : :.:.: . XP_016 GVCGCCYTCASQRNESCGGTF-GIYGT-CDRGLRCVIRPPLNGDSLTEYEAGVCEDENWT 60 70 80 90 100 110 110 120 130 140 150 pF1KE1 T---FGMD-CRET----CNCQSGICDRGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASG .:. : :. :: .: :. .: . . :: XP_016 DDQLLGFKPCNENLIAGCNIINGKCECNTIRTCSNPFEFPSQDMCLSALKRIEVFGVDCR 120 130 140 150 160 170 160 170 180 pF1KE1 DGNIVREEVVKENAAGSPVMRKWLNPR XP_016 TVECPPVQQTACPPDSYETQVRLTADGCCTLPTRCECLSGLCGFPVCEVGSTPRIVSRGD 180 190 200 210 220 230 >>XP_016859748 (OMIM: 606189) PREDICTED: cysteine-rich m (971 aa) initn: 122 init1: 91 opt: 217 Z-score: 202.8 bits: 46.5 E(85289): 0.00015 Smith-Waterman score: 224; 34.5% identity (55.6% similar) in 142 aa overlap (6-134:16-152) 10 20 30 40 pF1KE1 MKSVLLLTTLLVPAHLVAAWSNNYAVDC-PQHCDSSECKSSPRCKRTVLD ::..:: :. : :.. :. : : :: :.:. : .... XP_016 MYLVAGDRGLAGCGHLLVSLL-GLLLLLARSGTRALVCLP--CDESKCEEPRNCPGSIVQ 10 20 30 40 50 50 60 70 80 90 100 pF1KE1 D-CGCCRVCAAGRGETCYRTVSGMDGMKCGPGLRC---QPSNGEDPFGEEFGICKDCPYG :::: .::. :.:.: : :. : : :::: : ::.. : :.:.: . XP_016 GVCGCCYTCASQRNESCGGTF-GIYGT-CDRGLRCVIRPPLNGDSLTEYEAGVCEDENWT 60 70 80 90 100 110 110 120 130 140 150 pF1KE1 T---FGMD-CRET----CNCQSGICDRGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASG .:. : :. :: .: :. .: . . :: XP_016 DDQLLGFKPCNENLIAGCNIINGKCECNTIRTCSNPFEFPSQDMCLSALKRIEEEKPDCS 120 130 140 150 160 170 160 170 180 pF1KE1 DGNIVREEVVKENAAGSPVMRKWLNPR XP_016 KARCEVQFSPRCPEDSVLIEGYAPPGECCPLPSRCVCNPAGCLRKVCQPGNLNILVSKAS 180 190 200 210 220 230 >>NP_057525 (OMIM: 606189) cysteine-rich motor neuron 1 (1036 aa) initn: 122 init1: 91 opt: 217 Z-score: 202.4 bits: 46.6 E(85289): 0.00015 Smith-Waterman score: 224; 34.5% identity (55.6% similar) in 142 aa overlap (6-134:16-152) 10 20 30 40 pF1KE1 MKSVLLLTTLLVPAHLVAAWSNNYAVDC-PQHCDSSECKSSPRCKRTVLD ::..:: :. : :.. :. : : :: :.:. : .... NP_057 MYLVAGDRGLAGCGHLLVSLL-GLLLLLARSGTRALVCLP--CDESKCEEPRNCPGSIVQ 10 20 30 40 50 50 60 70 80 90 100 pF1KE1 D-CGCCRVCAAGRGETCYRTVSGMDGMKCGPGLRC---QPSNGEDPFGEEFGICKDCPYG :::: .::. :.:.: : :. : : :::: : ::.. : :.:.: . NP_057 GVCGCCYTCASQRNESCGGTF-GIYGT-CDRGLRCVIRPPLNGDSLTEYEAGVCEDENWT 60 70 80 90 100 110 110 120 130 140 150 pF1KE1 T---FGMD-CRET----CNCQSGICDRGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASG .:. : :. :: .: :. .: . . :: NP_057 DDQLLGFKPCNENLIAGCNIINGKCECNTIRTCSNPFEFPSQDMCLSALKRIEEEKPDCS 120 130 140 150 160 170 160 170 180 pF1KE1 DGNIVREEVVKENAAGSPVMRKWLNPR NP_057 KARCEVQFSPRCPEDSVLIEGYAPPGECCPLPSRCVCNPAGCLRKVCQPGNLNILVSKAS 180 190 200 210 220 230 >>XP_011542734 (OMIM: 610700) PREDICTED: serine protease (343 aa) initn: 180 init1: 95 opt: 203 Z-score: 195.8 bits: 43.8 E(85289): 0.00036 Smith-Waterman score: 203; 38.6% identity (61.4% similar) in 83 aa overlap (7-85:19-97) 10 20 30 40 pF1KE1 MKSVLLLTTLLVPAHLVAAWSNNYAVDCPQHCDSSECKSSPRCK---R : ::::. ..: . . .:: :. ..: . : : XP_011 MIRPQLRTAGLGRCLLPGLLLLLVPVLWAGAEKLHTQPSCPAVCQPTRCPALPTCALGTT 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE1 TVLDDCGCCRVCAAGRGETCYRTVSGMDGMKCGPGLRC-QPSNGEDPFGEEFGICKDCPY :.: : ::::: :.. :.: .: .:. :.:::.: :: XP_011 PVFDLCRCCRVCPAAEREVC----GGAQGQPCAPGLQCLQPLRPGFPSTCGCPTLGGAVC 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 GTFGMDCRETCNCQSGICDRGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASGDGNIVRE XP_011 GSDRRTYPSMCALRAENRAARRLGKVPAVPVQWGNCGDTGTRSAGPLRRNYNFIAAVVEK 120 130 140 150 160 170 >>XP_011542733 (OMIM: 610700) PREDICTED: serine protease (380 aa) initn: 180 init1: 95 opt: 203 Z-score: 195.3 bits: 43.8 E(85289): 0.00039 Smith-Waterman score: 203; 38.6% identity (61.4% similar) in 83 aa overlap (7-85:19-97) 10 20 30 40 pF1KE1 MKSVLLLTTLLVPAHLVAAWSNNYAVDCPQHCDSSECKSSPRCK---R : ::::. ..: . . .:: :. ..: . : : XP_011 MIRPQLRTAGLGRCLLPGLLLLLVPVLWAGAEKLHTQPSCPAVCQPTRCPALPTCALGTT 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE1 TVLDDCGCCRVCAAGRGETCYRTVSGMDGMKCGPGLRC-QPSNGEDPFGEEFGICKDCPY :.: : ::::: :.. :.: .: .:. :.:::.: :: XP_011 PVFDLCRCCRVCPAAEREVC----GGAQGQPCAPGLQCLQPLRPGFPSTCGCPTLGGAVC 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 GTFGMDCRETCNCQSGICDRGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASGDGNIVRE XP_011 GSDRRTYPSMCALRAENRAARRLGKVPAVPVQWGNCGDTGTRSAGPLRRNYNFIAAVVEK 120 130 140 150 160 170 >>NP_710159 (OMIM: 610700) serine protease HTRA4 precurs (476 aa) initn: 180 init1: 95 opt: 203 Z-score: 194.1 bits: 43.9 E(85289): 0.00045 Smith-Waterman score: 203; 38.6% identity (61.4% similar) in 83 aa overlap (7-85:19-97) 10 20 30 40 pF1KE1 MKSVLLLTTLLVPAHLVAAWSNNYAVDCPQHCDSSECKSSPRCK---R : ::::. ..: . . .:: :. ..: . : : NP_710 MIRPQLRTAGLGRCLLPGLLLLLVPVLWAGAEKLHTQPSCPAVCQPTRCPALPTCALGTT 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE1 TVLDDCGCCRVCAAGRGETCYRTVSGMDGMKCGPGLRC-QPSNGEDPFGEEFGICKDCPY :.: : ::::: :.. :.: .: .:. :.:::.: :: NP_710 PVFDLCRCCRVCPAAEREVC----GGAQGQPCAPGLQCLQPLRPGFPSTCGCPTLGGAVC 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 GTFGMDCRETCNCQSGICDRGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASGDGNIVRE NP_710 GSDRRTYPSMCALRAENRAARRLGKVPAVPVQWGNCGDTGTRSAGPLRRNYNFIAAVVEK 120 130 140 150 160 170 >>XP_005264414 (OMIM: 606189) PREDICTED: cysteine-rich m (978 aa) initn: 122 init1: 91 opt: 207 Z-score: 194.0 bits: 44.9 E(85289): 0.00046 Smith-Waterman score: 207; 32.4% identity (53.2% similar) in 139 aa overlap (6-133:16-149) 10 20 30 40 pF1KE1 MKSVLLLTTLLVPAHLVAAWSNNYAVDC-PQHCDSSECKSSPRCKRTVLD ::..:: :. : :.. :. : : :: :.:. : .... XP_005 MYLVAGDRGLAGCGHLLVSLL-GLLLLLARSGTRALVCLP--CDESKCEEPRNCPGSIVQ 10 20 30 40 50 50 60 70 80 90 100 pF1KE1 D-CGCCRVCAAGRGETCYRTVSGMDGMKCGPGLRC---QPSNGEDPFGEEFGICK----D :::: .::. :.:.: : :. : : :::: : ::.. : :.:. : XP_005 GVCGCCYTCASQRNESCGGTF-GIYGT-CDRGLRCVIRPPLNGDSLTEYEAGVCEEEKPD 60 70 80 90 100 110 110 120 130 140 150 pF1KE1 CPYGTFGMDCRETCNCQSGICD--RGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASGDG : . .. : .: . . :.: .: XP_005 CSKARCEVQFSPRCPEDSVLIEGYAPPGECCPLPSRCVCNPAGCLRKVCQPGNLNILVSK 120 130 140 150 160 170 160 170 180 pF1KE1 NIVREEVVKENAAGSPVMRKWLNPR XP_005 ASGKPGECCDLYECKPVFGVDCRTVECPPVQQTACPPDSYETQVRLTADGCCTLPTRCEC 180 190 200 210 220 230 >>XP_016859747 (OMIM: 606189) PREDICTED: cysteine-rich m (996 aa) initn: 122 init1: 91 opt: 205 Z-score: 192.2 bits: 44.6 E(85289): 0.00058 Smith-Waterman score: 209; 30.3% identity (54.5% similar) in 178 aa overlap (6-177:16-177) 10 20 30 40 pF1KE1 MKSVLLLTTLLVPAHLVAAWSNNYAVDC-PQHCDSSECKSSPRCKRTVLD ::..:: :. : :.. :. : : :: :.:. : .... XP_016 MYLVAGDRGLAGCGHLLVSLL-GLLLLLARSGTRALVCLP--CDESKCEEPRNCPGSIVQ 10 20 30 40 50 50 60 70 80 90 100 pF1KE1 D-CGCCRVCAAGRGETCYRTVSGMDGMKCGPGLRC---QPSNGEDPFGEEFGICKDCPYG :::: .::. :.:.: : :. : : :::: : ::.. : :.:. XP_016 GVCGCCYTCASQRNESCGGTF-GIYGT-CDRGLRCVIRPPLNGDSLTEYEAGVCE----- 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 TFGMDCRETCNCQSGICDRGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASGDGNIVREE .:... . . :: : :. ..::.. . . :. :: .: : ... . XP_016 VFSLN--DKIYGKHGISDTPTAP--RLPFLKKELEEPSD--VSSYLEDENWTDDQLLGFK 120 130 140 150 160 170 180 pF1KE1 VVKEN-AAGSPVMRKWLNPR .:: :: .. XP_016 PCNENLIAGCNIINGKCECNTIRTCSNPFEFPSQDMCLSALKRIEVFGVDCRTVECPPVQ 170 180 190 200 210 220 184 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 17:14:16 2016 done: Sun Nov 6 17:14:17 2016 Total Scan time: 5.850 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]