FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6441, 464 aa
  1>>>pF1KE6441 464 - 464 aa - 464 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.3356+/-0.000759; mu= 17.2851+/- 0.045
 mean_var=64.0525+/-12.788, 0's: 0 Z-trim(107.8): 9 B-trim: 11 in 2/49
 Lambda= 0.160253
 statistics sampled from 9804 (9810) to 9804 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.678), E-opt: 0.2 (0.301), width: 16
 Scan time: 3.170

The best scores are:                                      opt bits E(32554)
CCDS10528.1 ALG1 gene_id:56052|Hs108|chr16         ( 464) 3197 747.8 5.3e-216
CCDS81946.1 ALG1 gene_id:56052|Hs108|chr16         ( 353) 2460 577.4 8.2e-165
CCDS33840.1 ALG1L gene_id:200810|Hs108|chr3        ( 187)  707 172.0 4.8e-43
CCDS74998.1 ALG1L gene_id:200810|Hs108|chr3        ( 207)  707 172.0 5.2e-43

>>CCDS10528.1 ALG1 gene_id:56052|Hs108|chr16 (464 aa)
 initn: 3197 init1: 3197 opt: 3197 Z-score: 3991.4 bits: 747.8 E(32554): 5.3e-216
Smith-Waterman score: 3197; 99.8% identity (100.0% similar) in 464 aa overlap (1-464:1-464)

               10        20        30        40        50        60
pF1KE6 MAASCLVLLALCLLLPLLLLGGWKRWRRGRAARHVVAVVLGDVGRSPRMQYHALSLAMHG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MAASCLVLLALCLLLPLLLLGGWKRWRRGRAARHVVAVVLGDVGRSPRMQYHALSLAMHG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 FSVTLLGFCNSKPHDELLQNNRIQIVGLTELQSLAVGPRVFQYGVKVVLQAMYLLWKLMW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FSVTLLGFCNSKPHDELLQNNRIQIVGLTELQSLAVGPRVFQYGVKVVLQAMYLLWKLMW
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 REPGAYIFLQNPPGLPSIAVCWFVGCLCGSKLVIDWHNYGYSIMGLVHGPNHPLVLLAKW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 REPGAYIFLQNPPGLPSIAVCWFVGCLCGSKLVIDWHNYGYSIMGLVHGPNHPLVLLAKW
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 YEKFFGRLSHLNLCVTNAMREDLADNWHIRAVTVYDKPASFFKETPLDLQHRLFMKLGSM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YEKFFGRLSHLNLCVTNAMREDLADNWHIRAVTVYDKPASFFKETPLDLQHRLFMKLGSM
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 HSPFRARSEPEDPVTERSAFTERDAGNGLVTRLRERPALLVSSTSWTEDEDFSILLAALE
       ::::::::::::::::::::::::::.:::::::::::::::::::::::::::::::::
CCDS10 HSPFRARSEPEDPVTERSAFTERDAGSGLVTRLRERPALLVSSTSWTEDEDFSILLAALE
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE6 KFEQLTLDGHNLPSLVCVITGKGPLREYYSRLIHQKHFQHIQVCTPWLEAEDYPLLLGSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 KFEQLTLDGHNLPSLVCVITGKGPLREYYSRLIHQKHFQHIQVCTPWLEAEDYPLLLGSA
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE6 DLGVCLHTSSSGLDLPMKVVDMFGCCLPVCAVNFKCLHELVKHEENGLVFEDSEELAAQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 DLGVCLHTSSSGLDLPMKVVDMFGCCLPVCAVNFKCLHELVKHEENGLVFEDSEELAAQL
              370       380       390       400       410       420

              430       440       450       460
pF1KE6 QMLFSNFPDPAGKLNQFRKNLRESQQLRWDESWVQTVLPLVMDT
       ::::::::::::::::::::::::::::::::::::::::::::
CCDS10 QMLFSNFPDPAGKLNQFRKNLRESQQLRWDESWVQTVLPLVMDT
              430       440       450       460

>>CCDS81946.1 ALG1 gene_id:56052|Hs108|chr16 (353 aa)
 initn: 2460 init1: 2460 opt: 2460 Z-score: 3072.4 bits: 577.4 E(32554): 8.2e-165
Smith-Waterman score: 2460; 99.7% identity (100.0% similar) in 353 aa overlap (112-464:1-353)

              90       100       110       120       130       140
pF1KE6 RIQIVGLTELQSLAVGPRVFQYGVKVVLQAMYLLWKLMWREPGAYIFLQNPPGLPSIAVC
                                     ::::::::::::::::::::::::::::::
CCDS81                               MYLLWKLMWREPGAYIFLQNPPGLPSIAVC
                                             10        20        30

             150       160       170       180       190       200
pF1KE6 WFVGCLCGSKLVIDWHNYGYSIMGLVHGPNHPLVLLAKWYEKFFGRLSHLNLCVTNAMRE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 WFVGCLCGSKLVIDWHNYGYSIMGLVHGPNHPLVLLAKWYEKFFGRLSHLNLCVTNAMRE
               40        50        60        70        80        90

             210       220       230       240       250       260
pF1KE6 DLADNWHIRAVTVYDKPASFFKETPLDLQHRLFMKLGSMHSPFRARSEPEDPVTERSAFT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 DLADNWHIRAVTVYDKPASFFKETPLDLQHRLFMKLGSMHSPFRARSEPEDPVTERSAFT
              100       110       120       130       140       150

             270       280       290       300       310       320
pF1KE6 ERDAGNGLVTRLRERPALLVSSTSWTEDEDFSILLAALEKFEQLTLDGHNLPSLVCVITG
       :::::.::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 ERDAGSGLVTRLRERPALLVSSTSWTEDEDFSILLAALEKFEQLTLDGHNLPSLVCVITG
              160       170       180       190       200       210

             330       340       350       360       370       380
pF1KE6 KGPLREYYSRLIHQKHFQHIQVCTPWLEAEDYPLLLGSADLGVCLHTSSSGLDLPMKVVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 KGPLREYYSRLIHQKHFQHIQVCTPWLEAEDYPLLLGSADLGVCLHTSSSGLDLPMKVVD
              220       230       240       250       260       270

             390       400       410       420       430       440
pF1KE6 MFGCCLPVCAVNFKCLHELVKHEENGLVFEDSEELAAQLQMLFSNFPDPAGKLNQFRKNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MFGCCLPVCAVNFKCLHELVKHEENGLVFEDSEELAAQLQMLFSNFPDPAGKLNQFRKNL
              280       290       300       310       320       330

             450       460
pF1KE6 RESQQLRWDESWVQTVLPLVMDT
       :::::::::::::::::::::::
CCDS81 RESQQLRWDESWVQTVLPLVMDT
              340       350

>>CCDS33840.1 ALG1L gene_id:200810|Hs108|chr3 (187 aa)
 initn: 547 init1: 414 opt: 707 Z-score: 886.3 bits: 172.0 E(32554): 4.8e-43
Smith-Waterman score: 850; 70.2% identity (72.1% similar) in 208 aa overlap (256-463:2-158)

         230       240       250       260       270       280
pF1KE6 PLDLQHRLFMKLGSMHSPFRARSEPEDPVTERSAFTERDAGNGLVTRLRERPALLVSSTS
                                     ::::: : :::. :: .::: ::::::::.
CCDS33                              MERSAFMELDAGSRLVMHLREWPALLVSSTG
                                            10        20        30

         290       300       310       320       330       340
pF1KE6 WTEDEDFSILLAALEKFEQLTLDGHNLPSLVCVITGKGPLREYYSRLIHQKHFQHIQVCT
       :::             ::::::::::::::::::::
CCDS33 WTE-------------FEQLTLDGHNLPSLVCVITG------------------------
                           40        50

         350       360       370       380       390       400
pF1KE6 PWLEAEDYPLLLGSADLGVCLHTSSSGLDLPMKVVDMFGCCLPVCAVNFKCLHELVKHEE
                    :.::::::: :::::::::::::::::::::::::::::::::::::
CCDS33 -------------SVDLGVCLHMSSSGLDLPMKVVDMFGCCLPVCAVNFKCLHELVKHEE
                        60        70        80        90       100

         410       420       430       440       450       460
pF1KE6 NGLVFEDSEELAAQLQMLFSNFPDPAGKLNQFRKNLRESQQLRWDESWVQTVLPLVMDT
       ::::::::::::: :::::::::::::::::: :::::::::::::::::::::::::
CCDS33 NGLVFEDSEELAA-LQMLFSNFPDPAGKLNQFWKNLRESQQLRWDESWVQTVLPLVMDIQ
             110        120       130       140       150       160

CCDS33 LLGQRLKPRDPCCPSRSFFSESQGKPF
              170       180

>>CCDS74998.1 ALG1L gene_id:200810|Hs108|chr3 (207 aa)
 initn: 632 init1: 414 opt: 707 Z-score: 885.6 bits: 172.0 E(32554): 5.2e-43
Smith-Waterman score: 935; 69.9% identity (72.5% similar) in 229 aa overlap (235-463:1-178)

          210       220       230       240       250       260
pF1KE6 DNWHIRAVTVYDKPASFFKETPLDLQHRLFMKLGSMHSPFRARSEPEDPVTERSAFTERD
                                     ::::. :: ::: ::::: . ::::: : :
CCDS74                               MKLGGTHSLFRACSEPEDAAMERSAFMELD
                                             10        20        30

          270       280       290       300       310       320
pF1KE6 AGNGLVTRLRERPALLVSSTSWTEDEDFSILLAALEKFEQLTLDGHNLPSLVCVITGKGP
       ::. :: .::: ::::::::.:::             ::::::::::::::::::::
CCDS74 AGSRLVMHLREWPALLVSSTGWTE-------------FEQLTLDGHNLPSLVCVITG---
               40        50                     60        70

          330       340       350       360       370       380
pF1KE6 LREYYSRLIHQKHFQHIQVCTPWLEAEDYPLLLGSADLGVCLHTSSSGLDLPMKVVDMFG
                                         :.::::::: ::::::::::::::::
CCDS74 ----------------------------------SVDLGVCLHMSSSGLDLPMKVVDMFG
                                             80        90       100

          390       400       410       420       430       440
pF1KE6 CCLPVCAVNFKCLHELVKHEENGLVFEDSEELAAQLQMLFSNFPDPAGKLNQFRKNLRES
       :::::::::::::::::::::::::::::::::: :::::::::::::::::: ::::::
CCDS74 CCLPVCAVNFKCLHELVKHEENGLVFEDSEELAA-LQMLFSNFPDPAGKLNQFWKNLRES
              110       120       130        140       150

          450       460
pF1KE6 QQLRWDESWVQTVLPLVMDT
       :::::::::::::::::::
CCDS74 QQLRWDESWVQTVLPLVMDIQLLGQRLKPRDPCCPSRSFFSESQGKPF
     160       170       180       190       200

464 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 13:20:06 2016 done: Tue Nov 8 13:20:07 2016
 Total Scan time: 3.170 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]