FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3031, 156 aa 1>>>pF1KE3031 156 - 156 aa - 156 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.8009+/-0.000888; mu= 13.4569+/- 0.053 mean_var=50.8157+/-10.528, 0's: 0 Z-trim(103.7): 31 B-trim: 473 in 1/47 Lambda= 0.179918 statistics sampled from 7520 (7545) to 7520 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.605), E-opt: 0.2 (0.232), width: 16 Scan time: 1.470 The best scores are: opt bits E(32554) CCDS9598.1 CMTM5 gene_id:116173|Hs108|chr14 ( 156) 998 266.8 3.9e-72 CCDS32050.1 CMTM5 gene_id:116173|Hs108|chr14 ( 125) 607 165.2 1.1e-41 CCDS73617.1 CMTM5 gene_id:116173|Hs108|chr14 ( 223) 603 164.3 3.9e-41 CCDS73618.1 CMTM5 gene_id:116173|Hs108|chr14 ( 105) 400 111.5 1.5e-25 CCDS10815.1 CMTM3 gene_id:123920|Hs108|chr16 ( 182) 336 95.0 2.4e-20 CCDS73619.1 CMTM5 gene_id:116173|Hs108|chr14 ( 74) 278 79.7 3.7e-16 CCDS14319.1 PLP2 gene_id:5355|Hs108|chrX ( 152) 255 73.9 4.3e-14 >>CCDS9598.1 CMTM5 gene_id:116173|Hs108|chr14 (156 aa) initn: 998 init1: 998 opt: 998 Z-score: 1408.5 bits: 266.8 E(32554): 3.9e-72 Smith-Waterman score: 998; 100.0% identity (100.0% similar) in 156 aa overlap (1-156:1-156) 10 20 30 40 50 60 pF1KE3 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETELALTLIIFICFTASISAYM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETELALTLIIFICFTASISAYM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 AAALLEFFITLAFLFLYATQYYQRFDRINWPCLDFLRCVSAIIIFLVVSFAAVTSRDGAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 AAALLEFFITLAFLFLYATQYYQRFDRINWPCLDFLRCVSAIIIFLVVSFAAVTSRDGAA 70 80 90 100 110 120 130 140 150 pF1KE3 IAAFVFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ :::::::::::::::::::::::::::::::::::: CCDS95 IAAFVFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ 130 140 150 >>CCDS32050.1 CMTM5 gene_id:116173|Hs108|chr14 (125 aa) initn: 607 init1: 607 opt: 607 Z-score: 861.5 bits: 165.2 E(32554): 1.1e-41 Smith-Waterman score: 738; 80.1% identity (80.1% similar) in 156 aa overlap (1-156:1-125) 10 20 30 40 50 60 pF1KE3 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETELALTLIIFICFTASISAYM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETELALTLIIFICFTASISAYM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 AAALLEFFITLAFLFLYATQYYQRFDRINWPCLDFLRCVSAIIIFLVVSFAAVTSRDGAA ::::::::::::::::::::::::::::::::: CCDS32 AAALLEFFITLAFLFLYATQYYQRFDRINWPCL--------------------------- 70 80 90 130 140 150 pF1KE3 IAAFVFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ :::::::::::::::::::::::::::::::: CCDS32 ----VFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ 100 110 120 >>CCDS73617.1 CMTM5 gene_id:116173|Hs108|chr14 (223 aa) initn: 984 init1: 603 opt: 603 Z-score: 852.0 bits: 164.3 E(32554): 3.9e-41 Smith-Waterman score: 629; 64.4% identity (64.4% similar) in 188 aa overlap (1-121:1-188) 10 20 30 40 50 60 pF1KE3 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETELALTLIIFICFTASISAYM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETELALTLIIFICFTASISAYM 10 20 30 40 50 60 70 80 90 pF1KE3 AAALLEFFITLAFLFLYATQYYQRFDRINWPCL--------------------------- ::::::::::::::::::::::::::::::::: CCDS73 AAALLEFFITLAFLFLYATQYYQRFDRINWPCLLQGHGQSGGPHPLDLLSHSAKVQPQPW 70 80 90 100 110 120 100 110 pF1KE3 ----------------------------------------DFLRCVSAIIIFLVVSFAAV :::::::::::::::::::: CCDS73 PGLTPPGWHTPAAVPWVPAPAPGFWSWLLWFICFHSLGSSDFLRCVSAIIIFLVVSFAAV 130 140 150 160 170 180 120 130 140 150 pF1KE3 TSRDGAAIAAFVFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ :::::::: CCDS73 TSRDGAAIAAFVFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ 190 200 210 220 >>CCDS73618.1 CMTM5 gene_id:116173|Hs108|chr14 (105 aa) initn: 400 init1: 400 opt: 400 Z-score: 572.3 bits: 111.5 E(32554): 1.5e-25 Smith-Waterman score: 551; 67.3% identity (67.3% similar) in 156 aa overlap (1-156:1-105) 10 20 30 40 50 60 pF1KE3 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETELALTLIIFICFTASISAYM :::::::::::::::::::::::::::::::::::::::::: CCDS73 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETEL------------------ 10 20 30 40 70 80 90 100 110 120 pF1KE3 AAALLEFFITLAFLFLYATQYYQRFDRINWPCLDFLRCVSAIIIFLVVSFAAVTSRDGAA ::::::::::::::::::::::::::: CCDS73 ---------------------------------DFLRCVSAIIIFLVVSFAAVTSRDGAA 50 60 130 140 150 pF1KE3 IAAFVFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ :::::::::::::::::::::::::::::::::::: CCDS73 IAAFVFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ 70 80 90 100 >>CCDS10815.1 CMTM3 gene_id:123920|Hs108|chr16 (182 aa) initn: 204 init1: 204 opt: 336 Z-score: 478.8 bits: 95.0 E(32554): 2.4e-20 Smith-Waterman score: 336; 42.1% identity (73.6% similar) in 140 aa overlap (17-154:24-162) 10 20 30 40 50 pF1KE3 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETELALTLIIFICFT : :... .::: : :: :: .: .:..: :::.. CCDS10 MWPPDPDPDPDPEPAGGSRPGPAVPGLRALLPARAFLCSLKGRLLLAESGLSFITFICYV 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 AS-ISAYMAAALLEFFITLAFLFLYATQYYQRFDRINWPCLDFLRCVSAIIIFLVVSFAA :: ::...: ::::...: ::: : : .... . :: .::::::.: .:....:..: CCDS10 ASSASAFLTAPLLEFLLALYFLFADAMQLNDKWQGLCWPMMDFLRCVTAALIYFAISITA 70 80 90 100 110 120 120 130 140 150 pF1KE3 VTSR-DGAAIAAFVFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ ... :::. :: :::.. . .:: : . :. ...: .::: CCDS10 IAKYSDGASKAAGVFGFFATIVFATDFYLIF-NDVAKFLKQGDSADETTAHKTEEENSDS 130 140 150 160 170 CCDS10 DSD 180 >>CCDS73619.1 CMTM5 gene_id:116173|Hs108|chr14 (74 aa) initn: 284 init1: 276 opt: 278 Z-score: 403.5 bits: 79.7 E(32554): 3.7e-16 Smith-Waterman score: 301; 47.4% identity (47.4% similar) in 156 aa overlap (1-156:1-74) 10 20 30 40 50 60 pF1KE3 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETELALTLIIFICFTASISAYM :::::::::::::::::::::::::::::::::::::::::: CCDS73 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETEL------------------ 10 20 30 40 70 80 90 100 110 120 pF1KE3 AAALLEFFITLAFLFLYATQYYQRFDRINWPCLDFLRCVSAIIIFLVVSFAAVTSRDGAA CCDS73 ------------------------------------------------------------ 130 140 150 pF1KE3 IAAFVFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ :::::::::::::::::::::::::::::::: CCDS73 ----VFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ 50 60 70 >>CCDS14319.1 PLP2 gene_id:5355|Hs108|chrX (152 aa) initn: 225 init1: 202 opt: 255 Z-score: 366.4 bits: 73.9 E(32554): 4.3e-14 Smith-Waterman score: 255; 35.7% identity (73.2% similar) in 112 aa overlap (29-139:19-130) 10 20 30 40 50 60 pF1KE3 MLSARDRRDRHPEEGVVAELQGFAVDKAFLTSHKGILLETELALTLIIFICFTASISAYM : ..::::: .:. : :.:.:::.:: .: CCDS14 MADSERLSAPGCWAACTNFSRTRKGILLFAEIILCLVILICFSASTPGYS 10 20 30 40 50 70 80 90 100 110 pF1KE3 AAALLEFFITLAFLFLYATQYYQRFDRINWPCLDFLRCVSAIIIFLVVSFAAVTSR-DGA . ...:.... :. .: . . .. :::: ::.: . : :..:..:..... : . . CCDS14 SLSVIEMILAAIFFVVYMCDLHTKIPFINWPWSDFFRTLIAAILYLITSIVVLVERGNHS 60 70 80 90 100 110 120 130 140 150 pF1KE3 AIAAFVFGIILVSIFAYDAFKIYRTEMAPGASQGDQQ :.: :.:.: . .:.:::. CCDS14 KIVAGVLGLIATCLFGYDAYVTFPVRQPRHTAAPTDPADGPV 120 130 140 150 156 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:29:52 2016 done: Sun Nov 6 14:29:52 2016 Total Scan time: 1.470 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]