FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4305, 305 aa 1>>>pF1KE4305 305 - 305 aa - 305 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1655+/-0.000847; mu= 16.0925+/- 0.050 mean_var=64.9184+/-13.017, 0's: 0 Z-trim(106.7): 33 B-trim: 0 in 0/48 Lambda= 0.159181 statistics sampled from 9134 (9163) to 9134 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.655), E-opt: 0.2 (0.281), width: 16 Scan time: 2.440 The best scores are: opt bits E(32554) CCDS3296.1 CLDN16 gene_id:10686|Hs108|chr3 ( 305) 2111 493.5 8.5e-140 CCDS11096.1 CLDN7 gene_id:1366|Hs108|chr17 ( 211) 292 75.6 3.5e-14 CCDS3295.1 CLDN1 gene_id:9076|Hs108|chr3 ( 211) 280 72.9 2.4e-13 CCDS5717.1 CLDN15 gene_id:24146|Hs108|chr7 ( 228) 259 68.1 7.2e-12 >>CCDS3296.1 CLDN16 gene_id:10686|Hs108|chr3 (305 aa) initn: 2111 init1: 2111 opt: 2111 Z-score: 2623.3 bits: 493.5 E(32554): 8.5e-140 Smith-Waterman score: 2111; 100.0% identity (100.0% similar) in 305 aa overlap (1-305:1-305) 10 20 30 40 50 60 pF1KE4 MTSRTPLLVTACLYYSYCNSRHLQQGVRKSKRPVFSHCQVPETQKTDTRHLSGARAGVCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MTSRTPLLVTACLYYSYCNSRHLQQGVRKSKRPVFSHCQVPETQKTDTRHLSGARAGVCP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 CCHPDGLLATMRDLLQYIACFFAFFSAGFLIVATWTDCWMVNADDSLEVSTKCRGLWWEC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 CCHPDGLLATMRDLLQYIACFFAFFSAGFLIVATWTDCWMVNADDSLEVSTKCRGLWWEC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 VTNAFDGIRTCDEYDSILAEHPLKLVVTRALMITADILAGFGFLTLLLGLDCVKFLPDEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 VTNAFDGIRTCDEYDSILAEHPLKLVVTRALMITADILAGFGFLTLLLGLDCVKFLPDEP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 YIKVRICFVAGATLLIAGTPGIIGSVWYAVDVYVERSTLVLHNIFLGIQYKFGWSCWLGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 YIKVRICFVAGATLLIAGTPGIIGSVWYAVDVYVERSTLVLHNIFLGIQYKFGWSCWLGM 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 AGSLGCFLAGAVLTCCLYLFKDVGPERNYPYSLRKAYSAAGVSMAKSYSAPRTETAKMYA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 AGSLGCFLAGAVLTCCLYLFKDVGPERNYPYSLRKAYSAAGVSMAKSYSAPRTETAKMYA 250 260 270 280 290 300 pF1KE4 VDTRV ::::: CCDS32 VDTRV >>CCDS11096.1 CLDN7 gene_id:1366|Hs108|chr17 (211 aa) initn: 280 init1: 126 opt: 292 Z-score: 368.0 bits: 75.6 E(32554): 3.5e-14 Smith-Waterman score: 292; 28.8% identity (62.9% similar) in 205 aa overlap (75-272:6-204) 50 60 70 80 90 100 pF1KE4 KTDTRHLSGARAGVCPCCHPDGLLATMRDLLQYIACFFAFFSAGFLIVATWTDCWMVNA- :: .. .:... :.. : :.... CCDS11 MANSGLQLLGFSMALLGWVGLVACTAIPQWQMSSY 10 20 30 110 120 130 140 150 160 pF1KE4 --DDSLEVSTKCRGLWWECVTNAFDGIRTCDEYDSILAEHPLKLVVTRALMITADILAGF :. . ... .::: .:::.. :. .: :::.:: : .:::::... .:. . CCDS11 AGDNIITAQAMYKGLWMDCVTQS-TGMMSCKMYDSVLALSA-ALQATRALMVVSLVLGFL 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE4 GFLTLLLGLDCVKFLPDEPYIKVRICFVAGATLLIAGTPGIIGSVWYAVDVYVE-RSTLV .... .:. :.. :. :.:: . .: ...:: .... ::. .. .. . :. CCDS11 AMFVATMGMKCTRCGGDDKVKKARIAMGGGIIFIVAGLAALVACSWYGHQIVTDFYNPLI 100 110 120 130 140 150 230 240 250 260 270 pF1KE4 LHNIFLGIQYKFGWSCWLGMAGSLGCFLAGAVLTC-CLYLFKDVGPE--RNYPYSLRKAY :: .:.:: . ..: ::: .:.::.:.: : . .: . :.:: : CCDS11 PTNI----KYEFGPAIFIGWAGSALVILGGALLSCSCPGNESKAGYRVPRSYPKSNSSKE 160 170 180 190 200 280 290 300 pF1KE4 SAAGVSMAKSYSAPRTETAKMYAVDTRV CCDS11 YV 210 >>CCDS3295.1 CLDN1 gene_id:9076|Hs108|chr3 (211 aa) initn: 208 init1: 146 opt: 280 Z-score: 353.1 bits: 72.9 E(32554): 2.4e-13 Smith-Waterman score: 281; 27.4% identity (58.4% similar) in 219 aa overlap (74-288:8-210) 50 60 70 80 90 100 pF1KE4 QKTDTRHLSGARAGVCPCCHPDGLLATMRDLLQYIACFFAFFSAGFLIVATWTDCWMVNA :: .: :.....: ::.: : . . CCDS32 MANAGLQLLGFILAFLGWIGA---IVSTALPQWRIYS 10 20 30 110 120 130 140 150 160 pF1KE4 ---DDSLEVSTKCRGLWWECVTNAFDGIRTCDEYDSILAEHPLKLVVTRALMITADILAG :. . ... .::: ::... :. : .::.: : .:::::... .:. CCDS32 YAGDNIVTAQAMYEGLWMSCVSQSTGQIQ-CKVFDSLLNLSST-LQATRALMVVGILLGV 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE4 FGFLTLLLGLDCVKFLPDEPYIKVRICFVAGATLLIAGTPGIIGSVWYAVDVYVERSTLV ..... .:. :.: : :. :.:. ..:: .:.:: .....::. . : . CCDS32 IAIFVATVGMKCMKCLEDDEVQKMRMAVIGGAIFLLAGLAILVATAWYGNRIVQEFYDPM 100 110 120 130 140 150 230 240 250 260 270 pF1KE4 LHNIFLGIQYKFGWSCWLGMAGSLGCFLAGAVLTC-CLYLFKDVGPERNYPYSLRKAYSA .. .:.:: . . : :.. :.:.::.: : : :... : . : CCDS32 TP---VNARYEFGQALFTGWAAASLCLLGGALLCCSC--------PRKTTSYPTPRPYPK 160 170 180 190 200 280 290 300 pF1KE4 AGVSMAKSYSAPRTETAKMYAVDTRV . : .:.: CCDS32 PAPSSGKDYV 210 >>CCDS5717.1 CLDN15 gene_id:24146|Hs108|chr7 (228 aa) initn: 171 init1: 133 opt: 259 Z-score: 326.6 bits: 68.1 E(32554): 7.2e-12 Smith-Waterman score: 259; 30.9% identity (59.9% similar) in 207 aa overlap (82-279:8-201) 60 70 80 90 100 pF1KE4 SGARAGVCPCCHPDGLLATMRDLLQYIACFFAFFSA--GFLI--VATWTDCWMVNA--DD :.:: : :.:. :. .. : :.. . CCDS57 MSMAVETFGFFMATVGLLMLGVTLPNSYWRVSTVHGN 10 20 30 110 120 130 140 150 160 pF1KE4 SLEVSTKCRGLWWECVTNAFDGIRTCDEYDSILAEHPLKLVVTRALMITADILAGF-GFL . ..: ..::. :.:... :. .: :. :.:: . . ::::::: :: :: :.: CCDS57 VITTNTIFENLWFSCATDSL-GVYNCWEFPSMLALSGY-IQACRALMITA-ILLGFLGLL 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE4 TLLLGLDCVKFLPDEPYIKVRICFVAGATLLIAGTPGIIGSVWYAVDVYVERSTLVLHNI . :: :... : :... .::: ..:: :... ::: .. . . . CCDS57 LGIAGLRCTNIGGLELSRKAKLAATAGALHILAGICGMVAISWYAFNI----TRDFFDPL 100 110 120 130 140 150 230 240 250 260 270 280 pF1KE4 FLGIQYKFGWSCWLGMAGSLGCFLAGAVL--TCCLYLFKDVGPERNYPYSLRKAYSAAGV . : .:..: . .:: ..:: .:.: : .:: : ... : :. :.: CCDS57 YPGTKYELGPALYLGWSASLISILGGLCLCSACC------CGSDEDPAASARRPYQAPVS 160 170 180 190 200 290 300 pF1KE4 SMAKSYSAPRTETAKMYAVDTRV CCDS57 VMPVATSDQEGDSSFGKYGRNAYV 210 220 305 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 23:28:37 2016 done: Sat Nov 5 23:28:38 2016 Total Scan time: 2.440 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]