FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KA0005, 451 aa 1>>>pF1KA0005 451 - 451 aa - 451 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.1996+/-0.00087; mu= 12.5311+/- 0.052 mean_var=71.6913+/-14.148, 0's: 0 Z-trim(106.5): 21 B-trim: 0 in 0/52 Lambda= 0.151475 statistics sampled from 9015 (9034) to 9015 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.638), E-opt: 0.2 (0.278), width: 16 Scan time: 3.190 The best scores are: opt bits E(32554) CCDS56155.1 BZW1 gene_id:9689|Hs108|chr2 ( 451) 2898 642.6 2.4e-184 CCDS56154.1 BZW1 gene_id:9689|Hs108|chr2 ( 423) 2704 600.2 1.3e-171 CCDS56156.1 BZW1 gene_id:9689|Hs108|chr2 ( 419) 2686 596.3 2e-170 CCDS5362.1 BZW2 gene_id:28969|Hs108|chr7 ( 419) 2054 458.2 7.4e-129 >>CCDS56155.1 BZW1 gene_id:9689|Hs108|chr2 (451 aa) initn: 2898 init1: 2898 opt: 2898 Z-score: 3423.1 bits: 642.6 E(32554): 2.4e-184 Smith-Waterman score: 2898; 100.0% identity (100.0% similar) in 451 aa overlap (1-451:1-451) 10 20 30 40 50 60 pF1KA0 MYGAPGAPAQSASVTVVRSLRRPPPQATGVSFMNNQKQQKPTLSGQRFKTRKRDEKERFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 MYGAPGAPAQSASVTVVRSLRRPPPQATGVSFMNNQKQQKPTLSGQRFKTRKRDEKERFD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KA0 PTQFQDCIIQGLTETGTDLEAVAKFLDASGAKLDYRRYAETLFDILVAGGMLAPGGTLAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 PTQFQDCIIQGLTETGTDLEAVAKFLDASGAKLDYRRYAETLFDILVAGGMLAPGGTLAD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KA0 DMMRTDVCVFAAQEDLETMQAFAQVFNKLIRRYKYLEKGFEDEVKKLLLFLKGFSESERN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 DMMRTDVCVFAAQEDLETMQAFAQVFNKLIRRYKYLEKGFEDEVKKLLLFLKGFSESERN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KA0 KLAMLTGVLLANGTLNASILNSLYNENLVKEGVSAAFAVKLFKSWINEKDINAVAASLRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 KLAMLTGVLLANGTLNASILNSLYNENLVKEGVSAAFAVKLFKSWINEKDINAVAASLRK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KA0 VSMDNRLMELFPANKQSVEHFTKYFTEAGLKELSEYVRNQQTIGARKELQKELQEQMSRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 VSMDNRLMELFPANKQSVEHFTKYFTEAGLKELSEYVRNQQTIGARKELQKELQEQMSRG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KA0 DPFKDIILYVKEEMKKNNIPEPVVIGIVWSSVMSTVEWNKKEELVAEQAIKHLKQYSPLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 DPFKDIILYVKEEMKKNNIPEPVVIGIVWSSVMSTVEWNKKEELVAEQAIKHLKQYSPLL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KA0 AAFTTQGQSELTLLLKIQEYCYDNIHFMKAFQKIVVLFYKAEVLSEEPILKWYKDAHVAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 AAFTTQGQSELTLLLKIQEYCYDNIHFMKAFQKIVVLFYKAEVLSEEPILKWYKDAHVAK 370 380 390 400 410 420 430 440 450 pF1KA0 GKSVFLEQMKKFVEWLKNAEEESESEAEEGD ::::::::::::::::::::::::::::::: CCDS56 GKSVFLEQMKKFVEWLKNAEEESESEAEEGD 430 440 450 >>CCDS56154.1 BZW1 gene_id:9689|Hs108|chr2 (423 aa) initn: 2704 init1: 2704 opt: 2704 Z-score: 3194.5 bits: 600.2 E(32554): 1.3e-171 Smith-Waterman score: 2704; 100.0% identity (100.0% similar) in 422 aa overlap (30-451:2-423) 10 20 30 40 50 60 pF1KA0 MYGAPGAPAQSASVTVVRSLRRPPPQATGVSFMNNQKQQKPTLSGQRFKTRKRDEKERFD ::::::::::::::::::::::::::::::: CCDS56 MVSFMNNQKQQKPTLSGQRFKTRKRDEKERFD 10 20 30 70 80 90 100 110 120 pF1KA0 PTQFQDCIIQGLTETGTDLEAVAKFLDASGAKLDYRRYAETLFDILVAGGMLAPGGTLAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 PTQFQDCIIQGLTETGTDLEAVAKFLDASGAKLDYRRYAETLFDILVAGGMLAPGGTLAD 40 50 60 70 80 90 130 140 150 160 170 180 pF1KA0 DMMRTDVCVFAAQEDLETMQAFAQVFNKLIRRYKYLEKGFEDEVKKLLLFLKGFSESERN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 DMMRTDVCVFAAQEDLETMQAFAQVFNKLIRRYKYLEKGFEDEVKKLLLFLKGFSESERN 100 110 120 130 140 150 190 200 210 220 230 240 pF1KA0 KLAMLTGVLLANGTLNASILNSLYNENLVKEGVSAAFAVKLFKSWINEKDINAVAASLRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 KLAMLTGVLLANGTLNASILNSLYNENLVKEGVSAAFAVKLFKSWINEKDINAVAASLRK 160 170 180 190 200 210 250 260 270 280 290 300 pF1KA0 VSMDNRLMELFPANKQSVEHFTKYFTEAGLKELSEYVRNQQTIGARKELQKELQEQMSRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 VSMDNRLMELFPANKQSVEHFTKYFTEAGLKELSEYVRNQQTIGARKELQKELQEQMSRG 220 230 240 250 260 270 310 320 330 340 350 360 pF1KA0 DPFKDIILYVKEEMKKNNIPEPVVIGIVWSSVMSTVEWNKKEELVAEQAIKHLKQYSPLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 DPFKDIILYVKEEMKKNNIPEPVVIGIVWSSVMSTVEWNKKEELVAEQAIKHLKQYSPLL 280 290 300 310 320 330 370 380 390 400 410 420 pF1KA0 AAFTTQGQSELTLLLKIQEYCYDNIHFMKAFQKIVVLFYKAEVLSEEPILKWYKDAHVAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 AAFTTQGQSELTLLLKIQEYCYDNIHFMKAFQKIVVLFYKAEVLSEEPILKWYKDAHVAK 340 350 360 370 380 390 430 440 450 pF1KA0 GKSVFLEQMKKFVEWLKNAEEESESEAEEGD ::::::::::::::::::::::::::::::: CCDS56 GKSVFLEQMKKFVEWLKNAEEESESEAEEGD 400 410 420 >>CCDS56156.1 BZW1 gene_id:9689|Hs108|chr2 (419 aa) initn: 2686 init1: 2686 opt: 2686 Z-score: 3173.3 bits: 596.3 E(32554): 2e-170 Smith-Waterman score: 2686; 100.0% identity (100.0% similar) in 419 aa overlap (33-451:1-419) 10 20 30 40 50 60 pF1KA0 GAPGAPAQSASVTVVRSLRRPPPQATGVSFMNNQKQQKPTLSGQRFKTRKRDEKERFDPT :::::::::::::::::::::::::::::: CCDS56 MNNQKQQKPTLSGQRFKTRKRDEKERFDPT 10 20 30 70 80 90 100 110 120 pF1KA0 QFQDCIIQGLTETGTDLEAVAKFLDASGAKLDYRRYAETLFDILVAGGMLAPGGTLADDM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 QFQDCIIQGLTETGTDLEAVAKFLDASGAKLDYRRYAETLFDILVAGGMLAPGGTLADDM 40 50 60 70 80 90 130 140 150 160 170 180 pF1KA0 MRTDVCVFAAQEDLETMQAFAQVFNKLIRRYKYLEKGFEDEVKKLLLFLKGFSESERNKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 MRTDVCVFAAQEDLETMQAFAQVFNKLIRRYKYLEKGFEDEVKKLLLFLKGFSESERNKL 100 110 120 130 140 150 190 200 210 220 230 240 pF1KA0 AMLTGVLLANGTLNASILNSLYNENLVKEGVSAAFAVKLFKSWINEKDINAVAASLRKVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 AMLTGVLLANGTLNASILNSLYNENLVKEGVSAAFAVKLFKSWINEKDINAVAASLRKVS 160 170 180 190 200 210 250 260 270 280 290 300 pF1KA0 MDNRLMELFPANKQSVEHFTKYFTEAGLKELSEYVRNQQTIGARKELQKELQEQMSRGDP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 MDNRLMELFPANKQSVEHFTKYFTEAGLKELSEYVRNQQTIGARKELQKELQEQMSRGDP 220 230 240 250 260 270 310 320 330 340 350 360 pF1KA0 FKDIILYVKEEMKKNNIPEPVVIGIVWSSVMSTVEWNKKEELVAEQAIKHLKQYSPLLAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 FKDIILYVKEEMKKNNIPEPVVIGIVWSSVMSTVEWNKKEELVAEQAIKHLKQYSPLLAA 280 290 300 310 320 330 370 380 390 400 410 420 pF1KA0 FTTQGQSELTLLLKIQEYCYDNIHFMKAFQKIVVLFYKAEVLSEEPILKWYKDAHVAKGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 FTTQGQSELTLLLKIQEYCYDNIHFMKAFQKIVVLFYKAEVLSEEPILKWYKDAHVAKGK 340 350 360 370 380 390 430 440 450 pF1KA0 SVFLEQMKKFVEWLKNAEEESESEAEEGD ::::::::::::::::::::::::::::: CCDS56 SVFLEQMKKFVEWLKNAEEESESEAEEGD 400 410 >>CCDS5362.1 BZW2 gene_id:28969|Hs108|chr7 (419 aa) initn: 1628 init1: 1628 opt: 2054 Z-score: 2426.9 bits: 458.2 E(32554): 7.4e-129 Smith-Waterman score: 2054; 72.6% identity (94.2% similar) in 416 aa overlap (37-449:3-418) 10 20 30 40 50 60 pF1KA0 APAQSASVTVVRSLRRPPPQATGVSFMNNQKQQKPTLSGQRFKTRKRDEKERFDPTQFQD :.:::.:.:::::::::::::.:.:: :.: CCDS53 MNKHQKPVLTGQRFKTRKRDEKEKFEPTVFRD 10 20 30 70 80 90 100 110 120 pF1KA0 CIIQGLTETGTDLEAVAKFLDASGAKLDYRRYAETLFDILVAGGMLAPGGTLADDMMRTD ..:::.:.: ::::::::::..:..:::::::.:::::::::.::::::: :: .: CCDS53 TLVQGLNEAGDDLEAVAKFLDSTGSRLDYRRYADTLFDILVAGSMLAPGGTRIDDGDKTK 40 50 60 70 80 90 130 140 150 160 170 180 pF1KA0 V---CVFAAQEDLETMQAFAQVFNKLIRRYKYLEKGFEDEVKKLLLFLKGFSESERNKLA . :::.:.:: ::.. .::::::::::::::::.::::.::::::::.:::.:..::: CCDS53 MTNHCVFSANEDHETIRNYAQVFNKLIRRYKYLEKAFEDEMKKLLLFLKAFSETEQTKLA 100 110 120 130 140 150 190 200 210 220 230 240 pF1KA0 MLTGVLLANGTLNASILNSLYNENLVKEGVSAAFAVKLFKSWINEKDINAVAASLRKVSM ::.:.::.:::: :.::.::....:::::..:.:::::::.:. ::: :.:..::::... CCDS53 MLSGILLGNGTLPATILTSLFTDSLVKEGIAASFAVKLFKAWMAEKDANSVTSSLRKANL 160 170 180 190 200 210 250 260 270 280 290 300 pF1KA0 DNRLMELFPANKQSVEHFTKYFTEAGLKELSEYVRNQQTIGARKELQKELQEQMSRGDPF :.::.::::.:.:::.::.::::.:::::::...: ::..:.::::::::::..:. :. CCDS53 DKRLLELFPVNRQSVDHFAKYFTDAGLKELSDFLRVQQSLGTRKELQKELQERLSQECPI 220 230 240 250 260 270 310 320 330 340 350 360 pF1KA0 KDIILYVKEEMKKNNIPEPVVIGIVWSSVMSTVEWNKKEELVAEQAIKHLKQYSPLLAAF :...::::::::.:..:: .:::..:. .:..::::::::::::::.::::::.::::.: CCDS53 KEVVLYVKEEMKRNDLPETAVIGLLWTCIMNAVEWNKKEELVAEQALKHLKQYAPLLAVF 280 290 300 310 320 330 370 380 390 400 410 420 pF1KA0 TTQGQSELTLLLKIQEYCYDNIHFMKAFQKIVVLFYKAEVLSEEPILKWYKDAHVAKGKS ..:::::: :: :.::::::::::::::::::::::::.::::: ::::::.:::::::: CCDS53 SSQGQSELILLQKVQEYCYDNIHFMKAFQKIVVLFYKADVLSEEAILKWYKEAHVAKGKS 340 350 360 370 380 390 430 440 450 pF1KA0 VFLEQMKKFVEWLKNAEEESESEAEEGD :::.:::::::::.:::::::::.:: CCDS53 VFLDQMKKFVEWLQNAEEESESEGEEN 400 410 451 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 04:50:58 2016 done: Mon Nov 7 04:50:58 2016 Total Scan time: 3.190 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]