FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6585, 159 aa
  1>>>pF1KE6585 159 - 159 aa - 159 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.2738+/-0.00052; mu= 11.0581+/- 0.032
 mean_var=262.0150+/-50.694, 0's: 0 Z-trim(118.3): 137 B-trim: 54 in 1/48
 Lambda= 0.079234
 statistics sampled from 30908 (31061) to 30908 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.663), E-opt: 0.2 (0.364), width: 16
 Scan time: 4.480

The best scores are:                                       opt bits E(85289)
NP_114163 (OMIM: 608822) keratin-associated protei ( 174)  429 61.1  9.4e-10
NP_112228 (OMIM: 608820) keratin-associated protei ( 167)  389 56.5  2.2e-08
NP_005544 (OMIM: 148021) keratin-associated protei ( 169)  378 55.3  5.3e-08
NP_112229 (OMIM: 608819) keratin-associated protei ( 177)  336 50.5  1.5e-06
NP_001244234 (OMIM: 608821) keratin-associated pro ( 121)  309 47.1  1.1e-05
NP_001005922 (OMIM: 148022) keratin-associated pro ( 278)  272 43.5   0.0003
NP_853630 (OMIM: 608718) keratin-associated protei ( 172)  231 38.5   0.0061
NP_848525 (OMIM: 612619) late cornified envelope p ( 118)  226 37.6   0.0075
NP_848516 (OMIM: 612611) late cornified envelope p ( 110)  222 37.1   0.0099

>>NP_114163 (OMIM: 608822) keratin-associated protein 1- (174 aa)
 initn: 519 init1: 225 opt: 429 Z-score: 296.1 bits: 61.1 E(85289): 9.4e-10
Smith-Waterman score: 519; 42.3% identity (56.0% similar) in 182 aa overlap (12-158:2-174)

       10 20 30 40 50
pF1KE6 MTHCCSPCCQPTCCRTT-CWQPT-TVT-TCSSTPCCQPSCCVSSCCQP-------CCHPT
       :::.:. : :. ... ::.:. ::::::: .::::: : :.
NP_114 MTCCQTSFCGYPSFSISGTCGSS-CCQPSCCETSCCQPRSCQTSFCGFPS
       10 20 30 40

       60 70 80 90
pF1KE6 CCQNTCCRTTCCQPICV-TSCCQPSCCSTPCCQPTCC-----GSSCG--------QSSSC
       . : ..:::: : ::::::::: : ::::.:: :..:: : .:
NP_114 FSTSGTCSSSCCQPSCCETSCCQPSCCETSCCQPSCCQISSCGTGCGIGGGISYGQEGSS
       50 60 70 80 90 100

       100 110 120 130 140
pF1KE6 APVYCR-RTCYHPTSVC---LPGCLNQSCGS-NCCQ------PCCRPACCETTCCRTTCF
       . : : : : . : :: : :: .::: ::::. : .:::
NP_114 GAVSTRIRWCRPDSRVEGTYLPPCCVVSCTPPSCCQLHHAQASCCRPSYCGQSCCR----
       110 120 130 140 150 160

       150
pF1KE6 QPTCVYSCCQPSCC
       :.: ::.:.:
NP_114 -PVC---CCEPTC
       170

>>NP_112228 (OMIM: 608820) keratin-associated protein 1- (167 aa)
 initn: 345 init1: 224 opt: 389 Z-score: 271.5 bits: 56.5 E(85289): 2.2e-08
Smith-Waterman score: 514; 42.4% identity (57.6% similar) in 177 aa overlap (12-158:2-167)

       10 20 30 40 50
pF1KE6 MTHCCSPCCQPTCCRTT-CWQPTTVT--TCSSTPCCQPSCCVSSCCQPCCHPTCCQNTCC
       :::.:. : :. : ::.:. ::::::: .::::: .:::.. :
NP_112 MTCCQTSFCGYPSCSTSGTCGSS-CCQPSCCETSCCQP----SCCQTSFC
       10 20 30 40

       60 70 80 90
pF1KE6 R--TTCCQPICVTSCCQPSCCSTPCCQPTCC-----GSSCG--------QSSSCAPV---
       . . : .:::::::: : ::::.:: :..:: : .: . :
NP_112 GFPSFSTSGTCSSSCCQPSCCETSCCQPSCCQTSSCGTGCGIGGGIGYGQEGSSGAVSTR
       50 60 70 80 90 100

       100 110 120 130 140 150
pF1KE6 --YCRRTCYHPTSVCLPGCLNQSCGS-NCCQ------PCCRPACCETTCCRTTCFQPTCV
       .:: : . ..::: : :: .::: ::::. : .::: .: :
NP_112 IRWCRPDC-RVEGTCLPPCCVVSCTPPTCCQLHHAEASCCRPSYCGQSCCRPVC----CC
       110 120 130 140 150 160

pF1KE6 YSCCQPSCC
       ::: .:.:
NP_112 YSC-EPTC

>>NP_005544 (OMIM: 148021) keratin-associated protein 5- (169 aa)
 initn: 1210 init1: 336 opt: 378 Z-score: 264.7 bits: 55.3 E(85289): 5.3e-08
Smith-Waterman score: 461; 39.5% identity (61.9% similar) in 147 aa overlap (3-146:40-166)

       10 20 30
pF1KE6 MTHCCSP--CCQPTCCRTTCWQPTTVTTCSST
       .::.: :: :.: ..: . .:...
NP_005 CGSSCGGCDSSCGSCGSGCRGCGPSCCAPVYCCKPVCCCVPACSCSSCGK-RGCGSCGGS
       10 20 30 40 50 60

       40 50 60 70 80 90
pF1KE6 PCCQPSCCVSSCCQPCCHPTCCQNTCCRTTCCQPICVTSCCQPSCCSTPCCQPTCCGSSC
       :: :.: ::.: :: .. : ..::: : :::.: : . ::.: ::.::
NP_005 KGGCGSCGCSQC--SCCKPCCC-SSGCGSSCCQ--C--SCCKPYCSQCSCCKP-CCSSS-
       70 80 90 100 110

       100 110 120 130 140
pF1KE6 GQSSSCAPVYCRRTCYHPTSVCLPGCLNQSCGSNCCQP-CCRPACCETTCCRTTCFQPTC
       :..::: :. .: : : : ...:::.::: ::.: : .. :: .:.:
NP_005 GRGSSC----CQSSC------CKPCCSSSGCGSSCCQSSCCKPCCSQSRCCVPVCYQCKI
       120 130 140 150 160

       150
pF1KE6 VYSCCQPSCC

>>NP_112229 (OMIM: 608819) keratin-associated protein 1- (177 aa)
 initn: 444 init1: 225 opt: 336 Z-score: 238.6 bits: 50.5 E(85289): 1.5e-06
Smith-Waterman score: 451; 42.2% identity (55.9% similar) in 161 aa overlap (8-159:3-144)

       10 20 30 40 50
pF1KE6 MTHCCSPCCQPTCCR-TTCWQPTTVTTCSSTPCCQPSCCVSSCCQP-CCHPTCCQNTCCR
       ::: . : .: .: ::.:. ::::::: .: ::: ::. .::: .::.
NP_112 MACCQTSFCGFPSC---STSGTCGSS-CCQPSCCETSSCQPRCCETSCCQPSCCQ
       10 20 30 40 50

       60 70 80 90 100 110
pF1KE6 TTCCQ-P------ICVTSCCQPSCCSTPCCQPTCCGSSCGQSSSCAPVYCRRTCYHPTSV
       :. : : : .:::::::: : :::: :: :.:::. : ..
NP_112 TSFCGFPSFSTGGTCDSSCCQPSCCETSCCQP-----SCYQTSSCG-----TGCGIGGGI
       60 70 80 90 100

       120 130 140 150
pF1KE6 CLPGCLNQSCGSNCCQPCCRPACCETTCCRTTCFQPTCVYSCCQPSCC
       : ..: . . ::: : . ::. : :: :: ::::
NP_112 GY-GQEGSSGAVSTRIRWCRPDCR----VEGTCLPPCCVVSCTPPSCCQLHHAEASCCRP
       110 120 130 140 150

NP_112 SYCGQSCCRPVCCCYCSEPTC
       160 170

>>NP_001244234 (OMIM: 608821) keratin-associated protein (121 aa)
 initn: 328 init1: 211 opt: 309 Z-score: 223.4 bits: 47.1 E(85289): 1.1e-05
Smith-Waterman score: 358; 41.1% identity (54.8% similar) in 124 aa overlap (50-158:3-121)

       20 30 40 50 60 70
pF1KE6 QPTTVTTCSSTPCCQPSCCVSSCCQPCCHPTCCQNTCCRTTCCQPICV-TSCCQPSCCST
       .: . : ..:::: : :::::::::.:
NP_001 MASCSTSGTCGSSCCQPSCCETSCCQPSCCQT
       10 20 30

       80 90 100 110 120
pF1KE6 PCCQPTCC---GSSCGQSSSCAPVYCR-RTCY---HPTSVCLPGCLNQSCGS-NCCQ---
       : : : . :: .: . : : : :. : ..::: : :: .:::
NP_001 SSCGTGCGIGGGIGYGQEGSGGSVSTRIRWCHPDCHVEGTCLPPCYLVSCTPPSCCQLHH
       40 50 60 70 80 90

       130 140 150
pF1KE6 ---PCCRPACCETTCCRTTCFQPTCVYSCCQPSCC
       ::::. : .::: :.: ::.:.:
NP_001 AEASCCRPSYCGQSCCR-----PACCCHCCEPTC
       100 110 120

>>NP_001005922 (OMIM: 148022) keratin-associated protein (278 aa)
 initn: 382 init1: 224 opt: 272 Z-score: 197.3 bits: 43.5 E(85289): 0.0003
Smith-Waterman score: 318; 34.9% identity (48.5% similar) in 169 aa overlap (1-154:1-152)

       10 20 30 40 50
pF1KE6 MTHC-CSPCCQPTC--CRTTCWQPTTVTTCSSTPC--CQPSC--CVSSCCQP--CCHPTC
       : : :: : .: : . : :.: : : .: :::: : ::.:.:
NP_001 MGCCGCSGGCGSSCGGCGSGC------GGCGSG-CGGCGSGCGGSGSSCCVPVCCCKPVC
       10 20 30 40 50

       60 70 80 90 100
pF1KE6 CQ-NTCCRTTCCQPICVTSC-CQPSCCSTPCCQPTC--CGSSCGQSSSCAPVY--CRRTC
       :. :: ..: . : .: . .: : :. : ::.: : .::. : .:
NP_001 CRVPTCSCSSCGKGGCGSSGGSKGGCGSCGGCKGGCGSCGGSKGGCGSCGGSKGGCG-SC
       60 70 80 90 100 110

       110 120 130 140 150
pF1KE6 YHPTSVCLPGCLNQSCGSNCCQPCCRPACCETTCCRTTCFQPTCVYSCCQPSCC
       . : :: .:::.:: : : ::. :: : :.: : :
NP_001 GGSKGGCGSGC--GGCGSSCCVPVC---CCKPMCC---CV-PACSCSSCGKGGCGSCGCS
       120 130 140 150 160

NP_001 KGACGSCGGSKGGCGSCGGCKGGCGSCGGSKGGCGSGCGGCGSGCGVPVCCCSCSSCGSC
       170 180 190 200 210 220

>--
 initn: 382 init1: 224 opt: 277 Z-score: 200.4 bits: 44.1 E(85289): 0.0002
Smith-Waterman score: 293; 32.0% identity (55.2% similar) in 125 aa overlap (27-146:157-275)

       10 20 30 40 50
pF1KE6 MTHCCSPCCQPTCCRTTCWQPTTVTTCSSTPCCQPSCCVSSCCQPCCHPTC--CQN
       :.: : . .: . . : .: :..
NP_001 GSSCCVPVCCCKPMCCCVPACSCSSCGKGGCGSCGCSKGACGSCGGSKGGCG-SCGGCKG
       130 140 150 160 170 180

       60 70 80 90 100 110
pF1KE6 TCCRTTCCQPICVTSCCQPSCCSTPCCQPTCCGSSCGQSSSCAPVY--CRRTCYHPTSVC
       : . : ..: . :.. : :.:: ::.. .::: : .: . : :
NP_001 GCGSCGGSKGGCGSGC---GGCGSGCGVPVCC-CSCSSCGSCAGSKGGCGSSCSQ-CSCC
       190 200 210 220 230 240

       120 130 140 150
pF1KE6 LPGCLNQSCGSNCCQP-CCRPACCETTCCRTTCFQPTCVYSCCQPSCC
       : : ...:::.::: ::.: : ...:: .: :
NP_001 KPCCCSSGCGSSCCQSSCCKPCCSQSSCCVPVCCQCKI
       250 260 270

>>NP_853630 (OMIM: 608718) keratin-associated protein 13 (172 aa)
 initn: 289 init1: 109 opt: 231 Z-score: 173.8 bits: 38.5 E(85289): 0.0061
Smith-Waterman score: 238; 31.0% identity (45.6% similar) in 171 aa overlap (3-149:4-166)

       10 20 30 40
pF1KE6 MTHCCSPCCQPTCC-------RTTCWQPTTVTTCSSTPCCQPSCC------VSSCCQPC
       .::: . : ..: . :: :.:: : .: : :
NP_853 MSYNCCSGNFSSRSCGGYLHYPASSCGFSYPSNQVYSTDLCSPSTCQLGSSLYRGCQQTC
       10 20 30 40 50 60

       50 60 70 80 90 100
pF1KE6 CHPTCCQNTCCRTTCCQPICVTSCCQPSC---CSTPCCQPTCCGSSCGQSSSCAPV-YCR
       .:: ::.. ... :: ::: .: :: :: : : :: :::: . :
NP_853 WEPTSCQTSYVESSPCQ----TSCYRPRTSLLCS-PC-QTTYSGSLGFGSSSCRSLGYGS
       70 80 90 100 110

       110 120 130 140 150
pF1KE6 RTCYHP-------TSVCLPGCLNQSCGSNCCQPCCRPACCETTCCRTTCFQPTCVYSCCQ
       :.:: :. :: : : . :::. . :...:..:::
NP_853 RSCYSVGCGSSGFRSLGYGGCGFPSLGYGV--GFCRPTYLASRSCQSSCYRPTCGSGFYY
       120 130 140 150 160 170

pF1KE6 PSCC

>>NP_848525 (OMIM: 612619) late cornified envelope prote (118 aa)
 initn: 239 init1: 92 opt: 226 Z-score: 172.2 bits: 37.6 E(85289): 0.0075
Smith-Waterman score: 234; 33.6% identity (44.8% similar) in 125 aa overlap (33-140:3-118)

       10 20 30 40 50
pF1KE6 HCCSPCCQPTCCRTTCWQPTTVTTCSSTPCCQPS---CCVSSCCQPCCHPTCCQNTCCRT
       :: : : : : : : : :
NP_848 MSCQQSQQQCQPPPKCTPKCPPKC-------T
       10 20

       60 70 80 90 100
pF1KE6 TCCQPICVTSCCQPSCCSTPCCQP--TCCGSSCGQSSSCAPVYCRRTCYHP---------
       : : : .: : ::.:: : .::::: : : : . ..:
NP_848 PKCPPKCPPKC--PPQCSAPCPPPVSSCCGSSSGGCCSSEGGGCCLSHHRPRQSLRRRPQ
       30 40 50 60 70 80

       110 120 130 140 150
pF1KE6 -TSVCLPGCLNQSCGSNCCQPCCRPACCETT--CCRTTCFQPTCVYSCCQPSCC
       .: : : .:: ::.::. .::... ::
NP_848 SSSCCGSGSGQQSGGSSCCHSSGGSGCCHSSGGCC
       90 100 110

>>NP_848516 (OMIM: 612611) late cornified envelope prote (110 aa)
 initn: 131 init1: 91 opt: 222 Z-score: 170.0 bits: 37.1 E(85289): 0.0099
Smith-Waterman score: 222; 33.6% identity (48.4% similar) in 122 aa overlap (17-130:2-110)

       10 20 30 40 50 60
pF1KE6 MTHCCSPCCQPTCCRTTCWQPTTVTTCSSTPCCQPSCCVSSCCQPCCHPTCCQNTCCRTT
       .: : :. : : :.: . : : : : : . : .
NP_848 MSCQQNQQ--QCQPPPKCPPKC--TPKCPPKCPPKCPPQ--CPAP
       10 20 30

       70 80 90 100 110
pF1KE6 CCQPICVTSCCQPSCCSTPCCQPT---CCGSSCGQSSSCAPVYCRRTCYHPTSVCLPGCL
       : : :.::: :: : :: :. ::.:. : .:. . : .: : :
NP_848 CF-PA-VSSCCGPS--SGSCCGPSSGGCCSSGAG---GCSLSHHRPRLFHRRRHQSPDCC
       40 50 60 70 80 90

       120 130 140 150
pF1KE6 NQ--SCGSNCCQP---CCRPACCETTCCRTTCFQPTCVYSCCQPSCC
       .. : ::.::. ::
NP_848 ESEPSGGSGCCHSSGGCC
       100 110

159 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 14:35:38 2016 done: Tue Nov 8 14:35:39 2016
 Total Scan time: 4.480 Total Display time: -0.040

Function used was FASTA [36.3.4 Apr, 2011]