FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1219, 223 aa
  1>>>pF1KE1219 223 - 223 aa - 223 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2888+/-0.000799; mu= 14.2846+/- 0.048
 mean_var=70.4824+/-14.678, 0's: 0 Z-trim(108.1): 30 B-trim: 342 in 1/49
 Lambda= 0.152769
 statistics sampled from 9955 (9981) to 9955 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.687), E-opt: 0.2 (0.307), width: 16
 Scan time: 2.280

The best scores are:                                       opt bits E(32554)
CCDS73825.1 TMEM114 gene_id:283953|Hs108|chr16     ( 223) 1426 323.0 9.6e-89
CCDS76819.1 TMEM114 gene_id:283953|Hs108|chr16     ( 142)  880 202.5 1.1e-52
CCDS76820.1 TMEM114 gene_id:283953|Hs108|chr16     ( 177)  683 159.2 1.6e-39
CCDS56046.1 TMEM235 gene_id:283999|Hs108|chr17     ( 223)  472 112.7 1.9e-25
CCDS56047.1 TMEM235 gene_id:283999|Hs108|chr17     ( 196)  315  78.1 4.5e-15

>>CCDS73825.1 TMEM114 gene_id:283953|Hs108|chr16 (223 aa)
 initn: 1426 init1: 1426 opt: 1426 Z-score: 1706.7 bits: 323.0 E(32554): 9.6e-89
Smith-Waterman score: 1426; 100.0% identity (100.0% similar) in 223 aa overlap (1-223:1-223)

               10        20        30        40        50        60
pF1KE1 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLILMVFG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLILMVFG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 GMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEEKALLDQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 GMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEEKALLDQ
              130       140       150       160       170       180

              190       200       210       220
pF1KE1 VDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI
       :::::::::::::::::::::::::::::::::::::::::::
CCDS73 VDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI
              190       200       210       220

>>CCDS76819.1 TMEM114 gene_id:283953|Hs108|chr16 (142 aa)
 initn: 880 init1: 880 opt: 880 Z-score: 1059.2 bits: 202.5 E(32554): 1.1e-52
Smith-Waterman score: 880; 100.0% identity (100.0% similar) in 142 aa overlap (82-223:1-142)

              60        70        80        90       100       110
pF1KE1 GSINRSQPEPLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLP
                                     ::::::::::::::::::::::::::::::
CCDS76                               MNPFRLENVTVSESSRQLLTMHGTFVILLP
                                             10        20        30

             120       130       140       150       160       170
pF1KE1 LSLILMVFGGMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 LSLILMVFGGMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCL
               40        50        60        70        80        90

             180       190       200       210       220
pF1KE1 LEEKALLDQVDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 LEEKALLDQVDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI
              100       110       120       130       140

>>CCDS76820.1 TMEM114 gene_id:283953|Hs108|chr16 (177 aa)
 initn: 696 init1: 675 opt: 683 Z-score: 823.1 bits: 159.2 E(32554): 1.6e-39
Smith-Waterman score: 1037; 79.4% identity (79.4% similar) in 223 aa overlap (1-223:1-177)

               10        20        30        40        50        60
pF1KE1 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLILMVFG
       ::::::::::::::::::::::::::::::::::::::::::
CCDS76 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTM------------------
               70        80        90       100

              130       140       150       160       170       180
pF1KE1 GMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEEKALLDQ
                                   ::::::::::::::::::::::::::::::::
CCDS76 ----------------------------VTLAGISVYIAYSAAAFREALCLLEEKALLDQ
                                        110       120       130

              190       200       210       220
pF1KE1 VDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI
       :::::::::::::::::::::::::::::::::::::::::::
CCDS76 VDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI
          140       150       160       170

>>CCDS56046.1 TMEM235 gene_id:283999|Hs108|chr17 (223 aa)
 initn: 583 init1: 376 opt: 472 Z-score: 570.4 bits: 112.7 E(32554): 1.9e-25
Smith-Waterman score: 560; 46.4% identity (70.1% similar) in 211 aa overlap (5-215:4-204)

               10        20        30        40        50        60
pF1KE1 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE
           ::.:  :::: . :::.:::::...:.:::     :: .  :     ::   .. :
CCDS56  MARLGALLLAAALGALLSFALLAAAVASDYWYI-----LEVADAGN----GSAWPGRAE
                10        20        30             40            50

               70        80        90       100       110       120
pF1KE1 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLILMVFG
        ::::::::: :. :. : ::..::  :.. :: : ..:. .: . ...:::::.:.: :
CCDS56 LLSSHSGLWRICEGQNGCIPLVDPFASESLDVSTSVQHLILLHRAVIVVLPLSLVLLVCG
               60        70        80        90       100       110

              130       140       150       160       170       180
pF1KE1 GMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEEKALLDQ
        . :.:: : :.  :::.::  ::.:...::::.:.::.::  :: :..     . . .
CCDS56 WICGLLSSLAQSVSLLLFTGCYFLLGSVLTLAGVSIYISYSHLAFAETVQQYGPQHM-QG
              120       130       140       150       160

              190       200       210       220
pF1KE1 VDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI
       : .:::::.::.: :   : ..:. .:.::  :::
CCDS56 VRVSFGWSMALAWGSCALEAFSGTLLLSAAWTLSLSPPICGHLSPQQVGGRGGD
     170       180       190       200       210       220

>>CCDS56047.1 TMEM235 gene_id:283999|Hs108|chr17 (196 aa)
 initn: 445 init1: 220 opt: 315 Z-score: 384.2 bits: 78.1 E(32554): 4.5e-15
Smith-Waterman score: 429; 41.7% identity (61.6% similar) in 211 aa overlap (5-215:4-177)

               10        20        30        40        50        60
pF1KE1 MRVHLGGLAGAAALTGALSFVLLAAAIGTDFWYIIDTERLERTGPGAQDLLGSINRSQPE
           ::.:  :::: . :::.:::::...:.:::     :: .  :     ::   .. :
CCDS56  MARLGALLLAAALGALLSFALLAAAVASDYWYI-----LEVADAGN----GSAWPGRAE
                10        20        30             40            50

               70        80        90       100       110       120
pF1KE1 PLSSHSGLWRTCRVQSPCTPLMNPFRLENVTVSESSRQLLTMHGTFVILLPLSLILMVFG
        ::::::::: :.:                           .: . ...:::::.:.: :
CCDS56 LLSSHSGLWRICEV---------------------------LHRAVIVVLPLSLVLLVCG
               60                                   70        80

              130       140       150       160       170       180
pF1KE1 GMTGFLSFLLQAYLLLLLTGILFLFGAMVTLAGISVYIAYSAAAFREALCLLEEKALLDQ
        . :.:: : :.  :::.::  ::.:...::::.:.::.::  :: :..     . . .
CCDS56 WICGLLSSLAQSVSLLLFTGCYFLLGSVLTLAGVSIYISYSHLAFAETVQQYGPQHM-QG
            90       100       110       120       130       140

              190       200       210       220
pF1KE1 VDISFGWSLALGWISFIAELLTGAAFLAAARELSLRRRQDQAI
       : .:::::.::.: :   : ..:. .:.::  :::
CCDS56 VRVSFGWSMALAWGSCALEAFSGTLLLSAAWTLSLSPPICGHLSPQQVGGRGGD
            150       160       170       180       190

223 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 21:34:51 2016 done: Sun Nov 6 21:34:51 2016
 Total Scan time: 2.280 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]