FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6367, 360 aa 1>>>pF1KE6367 360 - 360 aa - 360 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8617+/-0.000841; mu= 13.3334+/- 0.051 mean_var=68.4462+/-13.416, 0's: 0 Z-trim(106.5): 16 B-trim: 2 in 1/50 Lambda= 0.155024 statistics sampled from 9018 (9023) to 9018 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.657), E-opt: 0.2 (0.277), width: 16 Scan time: 1.800 The best scores are: opt bits E(32554) CCDS11674.1 AMZ2 gene_id:51321|Hs108|chr17 ( 360) 2464 560.0 1.1e-159 CCDS32714.1 AMZ2 gene_id:51321|Hs108|chr17 ( 302) 1452 333.7 1.3e-91 CCDS34589.1 AMZ1 gene_id:155185|Hs108|chr7 ( 498) 727 171.6 1.3e-42 CCDS64582.1 AMZ1 gene_id:155185|Hs108|chr7 ( 297) 422 103.3 2.7e-22 >>CCDS11674.1 AMZ2 gene_id:51321|Hs108|chr17 (360 aa) initn: 2464 init1: 2464 opt: 2464 Z-score: 2980.5 bits: 560.0 E(32554): 1.1e-159 Smith-Waterman score: 2464; 99.7% identity (100.0% similar) in 360 aa overlap (1-360:1-360) 10 20 30 40 50 60 pF1KE6 MQIIRHSEQTLKTALISKNPVLVSQYEKLDAGEQRLMNEAFQPASDLFGPITLHSPSDWI :::::::::::::::::::::::::::::.:::::::::::::::::::::::::::::: CCDS11 MQIIRHSEQTLKTALISKNPVLVSQYEKLNAGEQRLMNEAFQPASDLFGPITLHSPSDWI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 TSHPEAPQDFEQFFSDPYRKTPSPNKRSIYIQSIGSLGNTRIISEEYIKWLTGYCKAYFY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 TSHPEAPQDFEQFFSDPYRKTPSPNKRSIYIQSIGSLGNTRIISEEYIKWLTGYCKAYFY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 GLRVKLLEPVPVSVTRCSFRVNENTHNLQIHAGDILKFLKKKKPEDAFCVVGITMIDLYP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GLRVKLLEPVPVSVTRCSFRVNENTHNLQIHAGDILKFLKKKKPEDAFCVVGITMIDLYP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 RDSWNFVFGQASLTDGVGIFSFARYGSDFYSMHYKGKVKKLKKTSSSDYSIFDNYYIPEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 RDSWNFVFGQASLTDGVGIFSFARYGSDFYSMHYKGKVKKLKKTSSSDYSIFDNYYIPEI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 TSVLLLRSCKTLTHEIGHIFGLRHCQWLACLMQGSNHLEEADRRPLNLCPICLHKLQCAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 TSVLLLRSCKTLTHEIGHIFGLRHCQWLACLMQGSNHLEEADRRPLNLCPICLHKLQCAV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 GFSIVERYKALVRWIDDESSDTPGATPEHSHEDNGNLPKPVEAFKEWKEWIIKCLAVLQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GFSIVERYKALVRWIDDESSDTPGATPEHSHEDNGNLPKPVEAFKEWKEWIIKCLAVLQK 310 320 330 340 350 360 >>CCDS32714.1 AMZ2 gene_id:51321|Hs108|chr17 (302 aa) initn: 1449 init1: 1449 opt: 1452 Z-score: 1758.5 bits: 333.7 E(32554): 1.3e-91 Smith-Waterman score: 1947; 83.6% identity (83.9% similar) in 360 aa overlap (1-360:1-302) 10 20 30 40 50 60 pF1KE6 MQIIRHSEQTLKTALISKNPVLVSQYEKLDAGEQRLMNEAFQPASDLFGPITLHSPSDWI :::::::::::::::::::::::::::::.:::::::::::::::::::::::::::::: CCDS32 MQIIRHSEQTLKTALISKNPVLVSQYEKLNAGEQRLMNEAFQPASDLFGPITLHSPSDWI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 TSHPEAPQDFEQFFSDPYRKTPSPNKRSIYIQSIGSLGNTRIISEEYIKWLTGYCKAYFY ::::::::::::::::::::::::::::::::::: CCDS32 TSHPEAPQDFEQFFSDPYRKTPSPNKRSIYIQSIG------------------------- 70 80 90 130 140 150 160 170 180 pF1KE6 GLRVKLLEPVPVSVTRCSFRVNENTHNLQIHAGDILKFLKKKKPEDAFCVVGITMIDLYP ::::::::::::::::::::::::::: CCDS32 ---------------------------------DILKFLKKKKPEDAFCVVGITMIDLYP 100 110 120 190 200 210 220 230 240 pF1KE6 RDSWNFVFGQASLTDGVGIFSFARYGSDFYSMHYKGKVKKLKKTSSSDYSIFDNYYIPEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 RDSWNFVFGQASLTDGVGIFSFARYGSDFYSMHYKGKVKKLKKTSSSDYSIFDNYYIPEI 130 140 150 160 170 180 250 260 270 280 290 300 pF1KE6 TSVLLLRSCKTLTHEIGHIFGLRHCQWLACLMQGSNHLEEADRRPLNLCPICLHKLQCAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 TSVLLLRSCKTLTHEIGHIFGLRHCQWLACLMQGSNHLEEADRRPLNLCPICLHKLQCAV 190 200 210 220 230 240 310 320 330 340 350 360 pF1KE6 GFSIVERYKALVRWIDDESSDTPGATPEHSHEDNGNLPKPVEAFKEWKEWIIKCLAVLQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 GFSIVERYKALVRWIDDESSDTPGATPEHSHEDNGNLPKPVEAFKEWKEWIIKCLAVLQK 250 260 270 280 290 300 >>CCDS34589.1 AMZ1 gene_id:155185|Hs108|chr7 (498 aa) initn: 610 init1: 300 opt: 727 Z-score: 878.6 bits: 171.6 E(32554): 1.3e-42 Smith-Waterman score: 727; 36.2% identity (68.0% similar) in 309 aa overlap (9-314:15-321) 10 20 30 40 50 pF1KE6 MQIIRHSEQTLKTALISKNPVLVSQY-EKLDAGEQRLMNEAFQPASDLFGPITL ..:: ::.: . .: . : .. .:. .. ::..: :: . . CCDS34 MLQCRPAQEFSFGPRALKDALVSTDAALQQLYVSAFSPAERLFLAEAYNPQRTLFCTLLI 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 HSPSDWITSHPEAPQDFEQFFSDPYRKTPSPNKRSIYIQSIGSLGNTRIISEEYIKWLTG .. ::. :.::::.::. : .. .. : .. ::.: : . . .. .. : . CCDS34 RTGFDWLLSRPEAPEDFQTFHASLQHRKPRLARKHIYLQPIDL--SEEPVGSSLLHQLCS 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 YCKAYFYGLRVKLLEPVPVSVTRCSFRVNENTHNLQIHAGDILKFLKKKKPEDAFCVVGI .:.: ::::: : : .. ::: : .... ::.:. ::.:::..:: ::.::.:. CCDS34 CTEAFFLGLRVKCLPSVAAASIRCSSRPSRDSDRLQLHTDGILSFLKNNKPGDALCVLGL 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 TMIDLYPRDSWNFVFGQASLTDGVGIFSFARYGSDFYSMHYKGKVKKLKKTSSS--DYSI :. ::::...:.:.:.. ::. ::::....: . .. : ..... . . CCDS34 TLSDLYPHEAWSFTFSKFLPGHEVGVCSFARFSGEFPKSGPSAPDLALVEAAADGPEAPL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 FDNYYIPEITSVLLLRSCKTLTHEIGHIFGLRHCQWLACLMQGSNHLEEADRRPLNLCPI : . .... ... ::. ::. :..:: .:.:: :::::. :.:: ::::.:::: CCDS34 QDRGWALCFSALGMVQCCKVTCHELCHLLGLGNCRWLRCLMQGALSLDEALRRPLDLCPI 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE6 CLHKLQCAVGFSIVERYKALVRWIDDESSDTPGATPEHSHEDNGNLPKPVEAFKEWKEWI ::.::: ..:: ..:::. : : CCDS34 CLRKLQHVLGFRLIERYQRLYTWTQAVVGTWPSQEAGEPSVWEDTPPASADSGMCCESDS 300 310 320 330 340 350 >>CCDS64582.1 AMZ1 gene_id:155185|Hs108|chr7 (297 aa) initn: 439 init1: 260 opt: 422 Z-score: 513.6 bits: 103.3 E(32554): 2.7e-22 Smith-Waterman score: 422; 35.5% identity (68.3% similar) in 183 aa overlap (9-190:15-195) 10 20 30 40 50 pF1KE6 MQIIRHSEQTLKTALISKNPVLVSQY-EKLDAGEQRLMNEAFQPASDLFGPITL ..:: ::.: . .: . : .. .:. .. ::..: :: . . CCDS64 MLQCRPAQEFSFGPRALKDALVSTDAALQQLYVSAFSPAERLFLAEAYNPQRTLFCTLLI 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 HSPSDWITSHPEAPQDFEQFFSDPYRKTPSPNKRSIYIQSIGSLGNTRIISEEYIKWLTG .. ::. :.::::.::. : .. .. : .. ::.: : . . .. .. : . CCDS64 RTGFDWLLSRPEAPEDFQTFHASLQHRKPRLARKHIYLQPIDL--SEEPVGSSLLHQLCS 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 YCKAYFYGLRVKLLEPVPVSVTRCSFRVNENTHNLQIHAGDILKFLKKKKPEDAFCVVGI .:.: ::::: : : .. ::: : .... ::.:. ::.:::..:: ::.::.:. CCDS64 CTEAFFLGLRVKCLPSVAAASIRCSSRPSRDSDRLQLHTDGILSFLKNNKPGDALCVLGL 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 TMIDLYPRDSWNFVFGQASLTDGVGIFSFARYGSDFYSMHYKGKVKKLKKTSSSDYSIFD :. ::::...:.:.:.. CCDS64 TLSDLYPHEAWSFTFSKFLPGHGHVPRALPPSGPGELPLAPLPHAGCAQPGRGPAAAPGP 180 190 200 210 220 230 360 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:31:27 2016 done: Tue Nov 8 12:31:28 2016 Total Scan time: 1.800 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]