FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1219, 223 aa 1>>>pF1KE1219 223 - 223 aa - 223 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.7943+/-0.000352; mu= 17.2275+/- 0.022 mean_var=70.8937+/-14.240, 0's: 0 Z-trim(114.2): 32 B-trim: 0 in 0/49 Lambda= 0.152325 statistics sampled from 23841 (23871) to 23841 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.663), E-opt: 0.2 (0.28), width: 16 Scan time: 6.240 The best scores are: opt bits E(85289) NP_001139808 (OMIM: 611579) transmembrane protein ( 223) 1426 322.2 4.4e-88 NP_001277026 (OMIM: 611579) transmembrane protein ( 142) 880 202.0 4.2e-52 NP_001277024 (OMIM: 611579) transmembrane protein ( 177) 683 158.8 5.3e-39 NP_001277027 (OMIM: 611579) transmembrane protein ( 103) 668 155.3 3.5e-38 NP_055220 (OMIM: 606404) voltage-dependent calcium ( 327) 216 56.4 6.5e-08 NP_006530 (OMIM: 606403) voltage-dependent calcium ( 315) 205 54.0 3.4e-07 NP_006069 (OMIM: 602911,614256) voltage-dependent ( 323) 205 54.0 3.4e-07 NP_114101 (OMIM: 606900) voltage-dependent calcium ( 425) 179 48.4 2.2e-05 NP_665810 (OMIM: 606405) voltage-dependent calcium ( 275) 151 42.1 0.0011 NP_057453 (OMIM: 609210) claudin-18 isoform 1 prec ( 261) 137 39.0 0.0093 >>NP_001139808 (OMIM: 611579) transmembrane protein 114 (223 aa) initn: 1426 init1: 1426 opt: 1426 Z-score: 1702.3 bits: 322.2 E(85289): 4.4e-88 Smith-Waterman score: 1426; 100.0% identity (100.0% similar) in 223 aa overlap (1-223:1-223) 10 20 30 40 50 60 pF1KE1 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLILMVFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLILMVFG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEEKALLDQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEEKALLDQ 130 140 150 160 170 180 190 200 210 220 pF1KE1 VDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI ::::::::::::::::::::::::::::::::::::::::::: NP_001 VDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI 190 200 210 220 >>NP_001277026 (OMIM: 611579) transmembrane protein 114 (142 aa) initn: 880 init1: 880 opt: 880 Z-score: 1056.4 bits: 202.0 E(85289): 4.2e-52 Smith-Waterman score: 880; 100.0% identity (100.0% similar) in 142 aa overlap (82-223:1-142) 60 70 80 90 100 110 pF1KE1 GSINRSQPEPLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLP :::::::::::::::::::::::::::::: NP_001 MNPFRLENVTVSESSRQLLTMHGTFVILLP 10 20 30 120 130 140 150 160 170 pF1KE1 LSLILMVFGGMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LSLILMVFGGMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCL 40 50 60 70 80 90 180 190 200 210 220 pF1KE1 LEEKALLDQVDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI :::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LEEKALLDQVDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI 100 110 120 130 140 >>NP_001277024 (OMIM: 611579) transmembrane protein 114 (177 aa) initn: 696 init1: 675 opt: 683 Z-score: 821.2 bits: 158.8 E(85289): 5.3e-39 Smith-Waterman score: 1037; 79.4% identity (79.4% similar) in 223 aa overlap (1-223:1-177) 10 20 30 40 50 60 pF1KE1 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLILMVFG :::::::::::::::::::::::::::::::::::::::::: NP_001 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTM------------------ 70 80 90 100 130 140 150 160 170 180 pF1KE1 GMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEEKALLDQ :::::::::::::::::::::::::::::::: NP_001 ----------------------------VTLAGISVYIAYSAAAFREALCLLEEKALLDQ 110 120 130 190 200 210 220 pF1KE1 VDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI ::::::::::::::::::::::::::::::::::::::::::: NP_001 VDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI 140 150 160 170 >>NP_001277027 (OMIM: 611579) transmembrane protein 114 (103 aa) initn: 668 init1: 668 opt: 668 Z-score: 806.4 bits: 155.3 E(85289): 3.5e-38 Smith-Waterman score: 668; 100.0% identity (100.0% similar) in 101 aa overlap (1-101:1-101) 10 20 30 40 50 60 pF1KE1 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLILMVFG ::::::::::::::::::::::::::::::::::::::::: NP_001 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTAF 70 80 90 100 130 140 150 160 170 180 pF1KE1 GMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEEKALLDQ >>NP_055220 (OMIM: 606404) voltage-dependent calcium cha (327 aa) initn: 226 init1: 65 opt: 216 Z-score: 263.0 bits: 56.4 E(85289): 6.5e-08 Smith-Waterman score: 216; 27.7% identity (55.8% similar) in 224 aa overlap (7-220:7-220) 10 20 30 40 50 pF1KE1 MRVHLGGLAGAAALTGALS-FVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQP :: . .::.. : :.: :::::.: ... . :. .: : : NP_055 MVRCDRGLQMLLTTAGAFAAFSLMAIAIGTDYWLYSSAHICNGTNLTMDD--GPPPRRAR 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 EPLSSHSGLWRTCRVQS----PCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLI :. ::::::.: ... : . : : .: .::. :: . . .. :: : NP_055 GDLT-HSGLWRVCCIEGIYKGHCFRI-NHFPEDNDYDHDSSEYLLRIVRASSVFPILSTI 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 LMVFGGMT-GFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEE :...::. : . . ..: .::::. ... .. :: :::. ... . :. NP_055 LLLLGGLCIGAGRIYSRKNNIVLSAGILFVAAGLSNIIGIIVYISSNTGDPSDKRD--ED 120 130 140 150 160 170 180 190 200 210 220 pF1KE1 KALLDQVDISFGWSLALGWISFIAELLTGA----AFLAAARELSLRRRQDQAI : . ..:::. .: .:::. .:. .. .:: .. ... NP_055 K----KNHYNYGWSFYFGALSFIVAETVGVLAVNIYIEKNKELRFKTKREFLKASSSSPY 180 190 200 210 220 230 NP_055 ARMPSYRYRRRRSRSSSRSTEASPSRDVSPMGLKITGAIPMGELSMYTLSREPLKVTTAA 240 250 260 270 280 290 >>NP_006530 (OMIM: 606403) voltage-dependent calcium cha (315 aa) initn: 191 init1: 74 opt: 205 Z-score: 250.2 bits: 54.0 E(85289): 3.4e-07 Smith-Waterman score: 205; 25.7% identity (55.8% similar) in 226 aa overlap (1-218:1-210) 10 20 30 40 50 pF1KE1 MRVHLGGLAGAAALTGALS-FVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQP ::. :. . .::.. : :.. :.:::.: . .. . :: ... . : NP_006 MRMCDRGIQMLITTVGAFAAFSLMTIAVGTDYW--LYSRGVCRTKSTSDNETSRKN---- 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 EPLSSHSGLWRTCRVQSP----CTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLI : . .:::::::: ... : . . : . ..... :: . .. ::. NP_006 EEVMTHSGLWRTCCLEGAFRGVCKKI-DHFPEDADYEQDTAEYLLRAVRASSVFPILSVT 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 LMVFGGMTGFLS-FLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEE :. :::. : : . . ..: .::.:. ... .. :: :::. .:. . NP_006 LLFFGGLCVAASEFHRSRHNVILSAGIFFVSAGLSNIIGIIVYISANAG---------DP 120 130 140 150 160 180 190 200 210 220 pF1KE1 KALLDQVDISFGWSLALGWISFIAELLTG--AAFLAAARELSLRRRQDQAI .. . :.:::. .: .::: ..: :. . .. .:: . NP_006 GQRDSKKSYSYGWSFYFGAFSFIIAEIVGVVAVHIYIEKHQQLRAKSHSEFLKKSTFARL 170 180 190 200 210 220 NP_006 PPYRYRFRRRSSSRSTEPRSRDLSPISKGFHTIPSTDISMFTLSRDPSKITMGTLLNSDR 230 240 250 260 270 280 >>NP_006069 (OMIM: 602911,614256) voltage-dependent calc (323 aa) initn: 217 init1: 74 opt: 205 Z-score: 250.0 bits: 54.0 E(85289): 3.4e-07 Smith-Waterman score: 205; 26.9% identity (57.2% similar) in 208 aa overlap (16-216:16-209) 10 20 30 40 50 pF1KE1 MRVHLGGLAGAAALTGALS-FVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQP ::.. : :.. :.:::.: . .. . .: ... .. : NP_006 MGLFDRGVQMLLTTVGAFAAFSLMTIAVGTDYW--LYSRGVCKTKSVSENETSKKN---- 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 EPLSSHSGLWRTCRVQSPCTPL---MNPFRLENVTVSESSRQLLTMHGTFVILLPLSLIL : . .:::::::: ... : .. : . ..... .: . :. ::.:: NP_006 EEVMTHSGLWRTCCLEGNFKGLCKQIDHFPEDADYEADTAEYFLRAVRASSIFPILSVIL 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 MVFGGMTGFLS-FLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEEK . .::. : : . ..: .::.:. ... .. :: :::. .:. : NP_006 LFMGGLCIAASEFYKTRHNIILSAGIFFVSAGLSNIIGIIVYISANAGD--------PSK 120 130 140 150 160 180 190 200 210 220 pF1KE1 ALLDQVDISFGWSLALGWISFIAELLTG--AAFLAAARELSLRRRQDQAI . . . :.:::. .: .::: ..: :. . :. .:: NP_006 SDSKKNSYSYGWSFYFGALSFIIAEMVGVLAVHMFIDRHKQLRATARATDYLQASAITRI 170 180 190 200 210 220 NP_006 PSYRYRYQRRSRSSSRSTEPSHSRDASPVGIKGFNTLPSTEISMYTLSRDPLKAATTPTA 230 240 250 260 270 280 >>NP_114101 (OMIM: 606900) voltage-dependent calcium cha (425 aa) initn: 185 init1: 63 opt: 179 Z-score: 217.6 bits: 48.4 E(85289): 2.2e-05 Smith-Waterman score: 207; 29.3% identity (55.0% similar) in 222 aa overlap (16-220:27-241) 10 20 30 40 pF1KE1 MRVHLGGLAGAAALTGALS-FVLLAAAIGTDFWY-----IIDTERLERT ::.. : :.. ::.::.: : .: : NP_114 MESLKRWNEERGLWCEKGVQVLLTTVGAFAAFGLMTIAISTDYWLYTRALICNTTNLTAG 10 20 30 40 50 60 50 60 70 80 90 pF1KE1 GP-GAQDLLGSINRSQPEPLS-SHSGLWRTCRVQS----PCTPLMNPFRLENVTVSESSR : :. :. . .: . .:::::: : ... :. . : : .. .:.. NP_114 GDDGTPHRGGGGASEKKDPGGLTHSGLWRICCLEGLKRGVCVKI-NHFPEDTDYDHDSAE 70 80 90 100 110 100 110 120 130 140 150 pF1KE1 QLLTMHGTFVILLPLSLILMVFGGMTGFLSFLLQAYLLLLL-TGILFLFGAMVTLAGISV :: . . :. :: ::...::. : . .. ..: .::::. ... .. :. : NP_114 YLLRVVRASSIFPILSAILLLLGGVCVAASRVYKSKRNIILGAGILFVAAGLSNIIGVIV 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE1 YIAYSAAAFREALCLLEEKALLDQVDISFGWSLALGWISFIAELLTGA----AFLAAARE :: :: : . . ::: . :.:::. .: .::: . :. .. .:: NP_114 YI--SANAGEPGPKRDEEK----KNHYSYGWSFYFGGLSFILAEVIGVLAVNIYIERSRE 180 190 200 210 220 230 220 pF1KE1 LSLRRRQDQAI . :.: NP_114 AHCQSRSDLLKAGGGAGGSGGSGPSAILRLPSYRFRYRRRSRSSSRSSEPSPSRDASPGG 240 250 260 270 280 290 >>NP_665810 (OMIM: 606405) voltage-dependent calcium cha (275 aa) initn: 106 init1: 68 opt: 151 Z-score: 186.8 bits: 42.1 E(85289): 0.0011 Smith-Waterman score: 167; 23.2% identity (57.5% similar) in 181 aa overlap (22-197:23-187) 10 20 30 40 50 pF1KE1 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQP ::. :..::.: .. . .. :.: NP_665 MSACGRKALTLLSSVFAVCGLGLLGIAVSTDYWLYLE-----------EGVIVPQNQSTE 10 20 30 40 60 70 80 90 100 110 pF1KE1 EPLSSHSGLWRTCRVQSP----CTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLI .: ::::::.: . . : . . ... .:::. ..: : . . . .::. NP_665 IKMSLHSGLWRVCFLAGEERGRCFTIEYVMPMNTQLTSESTVNVLKMIRSATPFPLVSLF 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE1 LMVFGGMTGFLSFLLQAYLLL-LLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEE .: .: . . .. . .: ...::.:..... ..:. .::. .. . . : . NP_665 FMFIGFILNNIGHIRPHRTILAFVSGIFFILSGLSLVVGLVLYIS----SINDEM-LNRT 110 120 130 140 150 160 180 190 200 210 220 pF1KE1 KALLDQVDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI : . ..:::.:.. :::. NP_665 KDAETYFNYKYGWSFAFAAISFLLTESAGVMSVYLFMKRYTAEDMYRPHPGFYRPRLSNC 170 180 190 200 210 220 >>NP_057453 (OMIM: 609210) claudin-18 isoform 1 precurso (261 aa) initn: 183 init1: 60 opt: 137 Z-score: 170.5 bits: 39.0 E(85289): 0.0093 Smith-Waterman score: 139; 21.8% identity (52.0% similar) in 225 aa overlap (12-223:10-209) 10 20 30 40 50 60 pF1KE1 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE : : . :... :: : :.: ..::: . NP_057 MSTTTCQVVAFLLSILGLAGCIAATGMDMW-------------STQDLYDN------- 10 20 30 70 80 90 100 110 pF1KE1 PLSS---HSGLWRTCRVQSP----CTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLS :..: . ::::.: :: : : .. . : . .. : :. . ..: .. NP_057 PVTSVFQYEGLWRSCVRQSSGFTECRPYFTILGLPAML--QAVRALMIVG---IVLGAIG 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE1 LILMVFGGMTGFLSFLLQAYL--LLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCL :.. .:. .. . .. . : .::.:. ... ..::.::. . .. : . NP_057 LLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGLCAIAGVSVFANMLVTNFWMSTAN 100 110 120 130 140 150 180 190 200 210 220 pF1KE1 LEEK--ALLD--QVDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI . .... :. .:: .: .::.. :. :. . : : :. .. . .:. NP_057 MYTGMGGMVQTVQTRYTFGAALFVGWVAGGLTLIGGVMMCIACRGLAPEETNYKAVSYHA 160 170 180 190 200 210 NP_057 SGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDYV 220 230 240 250 260 223 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 21:34:52 2016 done: Sun Nov 6 21:34:52 2016 Total Scan time: 6.240 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]