FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5172, 162 aa
  1>>>pF1KE5172 162 - 162 aa - 162 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.0473+/-0.000649; mu= 13.0888+/- 0.039
 mean_var=57.9214+/-11.795, 0's: 0 Z-trim(110.3): 37 B-trim: 292 in 2/49
 Lambda= 0.168521
 statistics sampled from 11436 (11475) to 11436 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.752), E-opt: 0.2 (0.352), width: 16
 Scan time: 1.710

The best scores are:                                 opt  bits E(32554)
CCDS5064.1 SNX3 gene_id:8724|Hs108|chr6      ( 162) 1084 271.1   2e-73
CCDS14405.1 SNX12 gene_id:29934|Hs108|chrX   ( 162)  910 228.8 1.1e-60
CCDS59169.1 SNX12 gene_id:29934|Hs108|chrX   ( 158)  877 220.8 2.8e-58
CCDS75501.1 SNX3 gene_id:8724|Hs108|chr6     ( 140)  751 190.1 4.2e-49
CCDS5065.1 SNX3 gene_id:8724|Hs108|chr6      ( 130)  531 136.6   5e-33
CCDS43865.1 SNX30 gene_id:401548|Hs108|chr9  ( 437)  247  67.9 8.6e-12
CCDS755.2 SNX7 gene_id:51375|Hs108|chr1      ( 451)  245  67.4 1.2e-11
CCDS32266.1 SNX1 gene_id:6642|Hs108|chr15    ( 522)  240  66.2 3.3e-11
CCDS58371.1 SNX1 gene_id:6642|Hs108|chr15    ( 557)  240  66.2 3.4e-11
CCDS82152.1 SNX11 gene_id:29916|Hs108|chr17  ( 262)  232  64.1   7e-11
CCDS11526.1 SNX11 gene_id:29916|Hs108|chr17  ( 270)  232  64.1 7.2e-11

>>CCDS5064.1 SNX3 gene_id:8724|Hs108|chr6 (162 aa)
 initn: 1084 init1: 1084 opt: 1084 Z-score: 1431.5 bits: 271.1 E(32554): 2e-73
Smith-Waterman score: 1084; 100.0% identity (100.0% similar) in 162 aa overlap (1-162:1-162)

               10        20        30        40        50        60
pF1KE5 MAETVADTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGRGRFTTYEIRVKTNLPIF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 MAETVADTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGRGRFTTYEIRVKTNLPIF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 KLKESTVRRRYSDFEWLRSELERESKVVVPPLPGKAFLRQLPFRGDDGIFDDNFIEERKQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 KLKESTVRRRYSDFEWLRSELERESKVVVPPLPGKAFLRQLPFRGDDGIFDDNFIEERKQ
               70        80        90       100       110       120

              130       140       150       160
pF1KE5 GLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA
       ::::::::::::::::::::::::::::::::::::::::::
CCDS50 GLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA
              130       140       150       160

>>CCDS14405.1 SNX12 gene_id:29934|Hs108|chrX (162 aa)
 initn: 907 init1: 907 opt: 910 Z-score: 1202.9 bits: 228.8 E(32554): 1.1e-60
Smith-Waterman score: 910; 79.0% identity (96.3% similar) in 162 aa overlap (1-161:1-162)

                10        20        30        40        50
pF1KE5 MAET-VADTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGRGRFTTYEIRVKTNLPI
       :..: ::::::: .:::.:.:::::::::::::. :::::::::.::::::.:..:::::
CCDS14 MSDTAVADTRRLNSKPQDLTDAYGPPSNFLEIDIFNPQTVGVGRARFTTYEVRMRTNLPI
               10        20        30        40        50        60

      60        70        80        90       100       110
pF1KE5 FKLKESTVRRRYSDFEWLRSELERESKVVVPPLPGKAFLRQLPFRGDDGIFDDNFIEERK
       :::::: :::::::::::..::::.::.:::::::::. ::::::::.:::...:::::.
CCDS14 FKLKESCVRRRYSDFEWLKNELERDSKIVVPPLPGKALKRQLPFRGDEGIFEESFIEERR
               70        80        90       100       110       120

     120       130       140       150       160
pF1KE5 QGLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA
       :::::::::.:::::::::::::::::.: ::..:.:.:.:.
CCDS14 QGLEQFINKIAGHPLAQNERCLHMFLQEEAIDRNYVPGKVRQ
              130       140       150       160

>>CCDS59169.1 SNX12 gene_id:29934|Hs108|chrX (158 aa)
 initn: 878 init1: 626 opt: 877 Z-score: 1159.7 bits: 220.8 E(32554): 2.8e-58
Smith-Waterman score: 877; 78.4% identity (93.8% similar) in 162 aa overlap (1-161:1-158)

                10        20        30        40        50
pF1KE5 MAET-VADTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGRGRFTTYEIRVKTNLPI
       :..: ::::::: .:::.:.:::::::::::::. :::::::::.::::::    :::::
CCDS59 MSDTAVADTRRLNSKPQDLTDAYGPPSNFLEIDIFNPQTVGVGRARFTTYE----TNLPI
               10        20        30        40        50

      60        70        80        90       100       110
pF1KE5 FKLKESTVRRRYSDFEWLRSELERESKVVVPPLPGKAFLRQLPFRGDDGIFDDNFIEERK
       :::::: :::::::::::..::::.::.:::::::::. ::::::::.:::...:::::.
CCDS59 FKLKESCVRRRYSDFEWLKNELERDSKIVVPPLPGKALKRQLPFRGDEGIFEESFIEERR
         60        70        80        90       100       110

     120       130       140       150       160
pF1KE5 QGLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA
       :::::::::.:::::::::::::::::.: ::..:.:.:.:.
CCDS59 QGLEQFINKIAGHPLAQNERCLHMFLQEEAIDRNYVPGKVRQ
        120       130       140       150

>>CCDS75501.1 SNX3 gene_id:8724|Hs108|chr6 (140 aa)
 initn: 750 init1: 737 opt: 751 Z-score: 994.9 bits: 190.1 E(32554): 4.2e-49
Smith-Waterman score: 888; 86.4% identity (86.4% similar) in 162 aa overlap (1-162:1-140)

               10        20        30        40        50        60
pF1KE5 MAETVADTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGRGRFTTYEIRVKTNLPIF
       ::::::::::::::::::::::::::::::::                      ::::::
CCDS75 MAETVADTRRLITKPQNLNDAYGPPSNFLEID----------------------TNLPIF
               10        20        30

               70        80        90       100       110       120
pF1KE5 KLKESTVRRRYSDFEWLRSELERESKVVVPPLPGKAFLRQLPFRGDDGIFDDNFIEERKQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 KLKESTVRRRYSDFEWLRSELERESKVVVPPLPGKAFLRQLPFRGDDGIFDDNFIEERKQ
       40        50        60        70        80        90

              130       140       150       160
pF1KE5 GLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA
       ::::::::::::::::::::::::::::::::::::::::::
CCDS75 GLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA
      100       110       120       130       140

>>CCDS5065.1 SNX3 gene_id:8724|Hs108|chr6 (130 aa)
 initn: 531 init1: 531 opt: 531 Z-score: 706.4 bits: 136.6 E(32554): 5e-33
Smith-Waterman score: 803; 80.2% identity (80.2% similar) in 162 aa overlap (1-162:1-130)

               10        20        30        40        50        60
pF1KE5 MAETVADTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGRGRFTTYEIRVKTNLPIF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS50 MAETVADTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGRGRFTTYEIRVK------
               10        20        30        40        50

               70        80        90       100       110       120
pF1KE5 KLKESTVRRRYSDFEWLRSELERESKVVVPPLPGKAFLRQLPFRGDDGIFDDNFIEERKQ
                                 ::::::::::::::::::::::::::::::::::
CCDS50 --------------------------VVVPPLPGKAFLRQLPFRGDDGIFDDNFIEERKQ
                                     60        70        80

              130       140       150       160
pF1KE5 GLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA
       ::::::::::::::::::::::::::::::::::::::::::
CCDS50 GLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA
       90       100       110       120       130

>>CCDS43865.1 SNX30 gene_id:401548|Hs108|chr9 (437 aa)
 initn: 240 init1: 131 opt: 247 Z-score: 325.2 bits: 67.9 E(32554): 8.6e-12
Smith-Waterman score: 247; 31.2% identity (66.7% similar) in 141 aa overlap (6-145:69-203)

                                        10        20        30
pF1KE5                          MAETVADTRRLITKPQNLNDAYGPPSNFLEIDVSN
                                     :..
:... : .: : ... : :.. CCDS43 PSPDLLMARSFGDKDLILPNGGTPAGTSSPASSSSLLNRLQLDDDIDGETRDLFVI-VDD 40 50 60 70 80 90 40 50 60 70 80 90 pF1KE5 PQTVGVGRGRFTTYEIRVKTNLPIFKLKESTVRRRYSDFEWLRSELERESKV-VVPPLPG :. . ::.: .:.. : : : .:::::.::.::::.::. . . ..:::: CCDS43 PKKHVCTMETYITYRITTKSTRVEFDLPEYSVRRRYQDFDWLRSKLEESQPTHLIPPLPE 100 110 120 130 140 150 100 110 120 130 140 150 pF1KE5 KAFLRQLPFRGDDGIFDDNFIEERKQGLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSY : .. . : :...:.: :...:..:..... ::. . .. ...:: CCDS43 KFVVKGVVDR-----FSEEFVETRRKALDKFLKRITDHPVLSFNEHFNIFLTAKDLNAYK 160 170 180 190 200 210 160 pF1KE5 TPSKIRHA CCDS43 KQGIALLTRMGESVKHVTGGYKLRTRPLEFAAIGDYLDTFALKLGTIDRIAQRIIKEEIE 220 230 240 250 260 270 >>CCDS755.2 SNX7 gene_id:51375|Hs108|chr1 (451 aa) initn: 246 init1: 114 opt: 245 Z-score: 322.3 bits: 67.4 E(32554): 1.2e-11 Smith-Waterman score: 245; 35.8% identity (66.7% similar) in 123 aa overlap (24-145:91-208) 10 20 30 40 50 pF1KE5 MAETVADTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGRGRFTTYEIRV : . : : :..:.. . : ::.: . CCDS75 DASLMDMNSFSPMMPTSPLSMINQIKFEDEPDLKDLFITVDEPESHVTTIETFITYRIIT 70 80 90 100 110 120 60 70 80 90 100 110 pF1KE5 KTNLPIFKLKESTVRRRYSDFEWLRSELER-ESKVVVPPLPGKAFLRQLPFRGDDGIFDD ::. : .: :::::.:: ::...::. . ...:::: : ... . : :.: CCDS75 KTSRGEFDSSEFEVRRRYQDFLWLKGKLEEAHPTLIIPPLPEKFIVKGMVER-----FND 130 140 150 160 170 120 130 140 150 160 pF1KE5 NFIEERKQGLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA .::: :...:..:.:..: :: .. ...:: CCDS75 DFIETRRKALHKFLNRIADHPTLTFNEDFKIFLTAQAWELSSHKKQGPGLLSRMGQTVRA 180 190 200 210 220 230 CCDS75 VASSMRGVKNRPEEFMEMNNFIELFSQKINLIDKISQRIYKEEREYFDEMKEYGPIHILW 240 250 260 270 280 290 >>CCDS32266.1 SNX1 gene_id:6642|Hs108|chr15 (522 aa) initn: 242 init1: 139 opt: 240 Z-score: 314.8 bits: 66.2 E(32554): 3.3e-11 Smith-Waterman score: 240; 27.9% identity (69.8% similar) in 129 aa overlap (29-153:145-273) 10 20 30 40 50 pF1KE5 MAETVADTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGRGRFTTYEIRVKTNLP : . ...:. .: : . ...:.. 
..:.:: CCDS32 SLPPQEATNSSKPQPTYEELEEEEQEDQFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLP 120 130 140 150 160 170 60 70 80 90 100 110 pF1KE5 IFKLKESTVRRRYSDFEWLRSEL-ERESK--VVVPPLPGKAFLRQLPFR-GDDGIFDDNF .:. :. .:.::.::: : .: :..:. .::: : :... . . : . . .: CCDS32 LFRSKQFAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGMTKVKVGKEDSSSAEF 180 190 200 210 220 230 120 130 140 150 160 pF1KE5 IEERKQGLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA .:.:. .::........:: .. .. ::. : . .. CCDS32 LEKRRAALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTLSGAGLLKMFNKATDA 240 250 260 270 280 290 CCDS32 VSKMTIKMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKELALNTAQFAKSLAML 300 310 320 330 340 350 >>CCDS58371.1 SNX1 gene_id:6642|Hs108|chr15 (557 aa) initn: 242 init1: 139 opt: 240 Z-score: 314.4 bits: 66.2 E(32554): 3.4e-11 Smith-Waterman score: 240; 27.9% identity (69.8% similar) in 129 aa overlap (29-153:145-273) 10 20 30 40 50 pF1KE5 MAETVADTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGRGRFTTYEIRVKTNLP : . ...:. .: : . ...:.. ..:.:: CCDS58 SLPPQEATNSSKPQPTYEELEEEEQEDQFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLP 120 130 140 150 160 170 60 70 80 90 100 110 pF1KE5 IFKLKESTVRRRYSDFEWLRSEL-ERESK--VVVPPLPGKAFLRQLPFR-GDDGIFDDNF .:. :. .:.::.::: : .: :..:. .::: : :... . . : . . .: CCDS58 LFRSKQFAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGMTKVKVGKEDSSSAEF 180 190 200 210 220 230 120 130 140 150 160 pF1KE5 IEERKQGLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA .:.:. .::........:: .. .. ::. : . .. CCDS58 LEKRRAALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTLSGAGLLKMFNKATDA 240 250 260 270 280 290 CCDS58 VSKMTIKMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKELALNTAQFAKSLAML 300 310 320 330 340 350 >>CCDS82152.1 SNX11 gene_id:29916|Hs108|chr17 (262 aa) initn: 239 init1: 112 opt: 232 Z-score: 308.8 bits: 64.1 E(32554): 7e-11 Smith-Waterman score: 250; 37.4% identity (66.7% similar) in 123 aa overlap (29-149:9-122) 10 20 30 40 50 pF1KE5 MAETVADTRRLITKPQNLNDAYGPPSNFLEIDVSNPQTVGVGR-GRFTTYEIRVKTNLPI . . :..:.. . : . .. 
:.: ..:: CCDS82 MVCREQEVITVRVQDPRVQNEGSWNSYVDYKIFLHTNSKA 10 20 30 40 60 70 80 90 100 110 pF1KE5 FKLKESTVRRRYSDFEWLRSELERESKVV-VPPLPGKAFLRQLPFRGDDGIFDDNFIEER : : : ::::: .: :::..:.:.. .: :: ::::. : : .:.:::.: CCDS82 FTAKTSCVRRRYREFVWLRKQLQRNAGLVPVPELPGKS-----TFFGT----SDEFIEKR 50 60 70 80 90 120 130 140 150 160 pF1KE5 KQGLEQFINKVAGHPLAQNERCLHMFLQDEIIDKSYTPSKIRHA .:::..:..:: . .. ::.:::... CCDS82 RQGLQHFLEKVLQSVVLLSDSQLHLFLQSQLSVPEIEACVQGRSTMTVSDAILRYAMSNC 100 110 120 130 140 150 CCDS82 GWAQEERQSSSHLAKGDQPKSCCFLPRSGRRSSPSPPPSEEKDHLEVWAPVVDSEVPSLE 160 170 180 190 200 210 162 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:16:27 2016 done: Mon Nov 7 22:16:28 2016 Total Scan time: 1.710 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]