FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6685, 229 aa 1>>>pF1KE6685 229 - 229 aa - 229 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1529+/-0.000844; mu= 15.1006+/- 0.051 mean_var=69.4024+/-13.883, 0's: 0 Z-trim(107.4): 36 B-trim: 431 in 2/49 Lambda= 0.153953 statistics sampled from 9542 (9571) to 9542 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.675), E-opt: 0.2 (0.294), width: 16 Scan time: 2.060 The best scores are: opt bits E(32554) CCDS11655.1 CD79B gene_id:974|Hs108|chr17 ( 229) 1537 350.1 7e-97 CCDS42372.1 CD79B gene_id:974|Hs108|chr17 ( 230) 1525 347.4 4.5e-96 CCDS11656.1 CD79B gene_id:974|Hs108|chr17 ( 125) 552 131.1 3.1e-31 >>CCDS11655.1 CD79B gene_id:974|Hs108|chr17 (229 aa) initn: 1537 init1: 1537 opt: 1537 Z-score: 1852.8 bits: 350.1 E(32554): 7e-97 Smith-Waterman score: 1537; 100.0% identity (100.0% similar) in 229 aa overlap (1-229:1-229) 10 20 30 40 50 60 pF1KE6 MARLALSPVPSHWMVALLLLLSAEPVPAARSEDRYRNPKGSACSRIWQSPRFIARKRGFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MARLALSPVPSHWMVALLLLLSAEPVPAARSEDRYRNPKGSACSRIWQSPRFIARKRGFT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VKMHCYMNSASGNVSWLWKQEMDENPQQLKLEKGRMEESQNESLATLTIQGIRFEDNGIY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 VKMHCYMNSASGNVSWLWKQEMDENPQQLKLEKGRMEESQNESLATLTIQGIRFEDNGIY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 FCQQKCNNTSEVYQGCGTELRVMGFSTLAQLKQRNTLKDGIIMIQTLLIILFIIVPIFLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 FCQQKCNNTSEVYQGCGTELRVMGFSTLAQLKQRNTLKDGIIMIQTLLIILFIIVPIFLL 130 140 150 160 170 180 190 200 210 220 pF1KE6 LDKDDSKAGMEEDHTYEGLDIDQTATYEDIVTLRTGEVKWSVGEHPGQE ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LDKDDSKAGMEEDHTYEGLDIDQTATYEDIVTLRTGEVKWSVGEHPGQE 190 200 210 220 >>CCDS42372.1 CD79B gene_id:974|Hs108|chr17 (230 aa) initn: 1414 init1: 1414 opt: 1525 Z-score: 1838.3 bits: 347.4 E(32554): 4.5e-96 Smith-Waterman score: 1525; 99.6% identity (99.6% similar) in 230 aa overlap (1-229:1-230) 10 20 30 40 50 pF1KE6 MARLALSPVPSHWMVALLLLLSA-EPVPAARSEDRYRNPKGSACSRIWQSPRFIARKRGF ::::::::::::::::::::::: :::::::::::::::::::::::::::::::::::: CCDS42 MARLALSPVPSHWMVALLLLLSAAEPVPAARSEDRYRNPKGSACSRIWQSPRFIARKRGF 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 TVKMHCYMNSASGNVSWLWKQEMDENPQQLKLEKGRMEESQNESLATLTIQGIRFEDNGI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 TVKMHCYMNSASGNVSWLWKQEMDENPQQLKLEKGRMEESQNESLATLTIQGIRFEDNGI 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE6 YFCQQKCNNTSEVYQGCGTELRVMGFSTLAQLKQRNTLKDGIIMIQTLLIILFIIVPIFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 YFCQQKCNNTSEVYQGCGTELRVMGFSTLAQLKQRNTLKDGIIMIQTLLIILFIIVPIFL 130 140 150 160 170 180 180 190 200 210 220 pF1KE6 LLDKDDSKAGMEEDHTYEGLDIDQTATYEDIVTLRTGEVKWSVGEHPGQE :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LLDKDDSKAGMEEDHTYEGLDIDQTATYEDIVTLRTGEVKWSVGEHPGQE 190 200 210 220 230 >>CCDS11656.1 CD79B gene_id:974|Hs108|chr17 (125 aa) initn: 552 init1: 552 opt: 552 Z-score: 674.2 bits: 131.1 E(32554): 3.1e-31 Smith-Waterman score: 592; 54.6% identity (54.6% similar) in 229 aa overlap (1-229:1-125) 10 20 30 40 50 60 pF1KE6 MARLALSPVPSHWMVALLLLLSAEPVPAARSEDRYRNPKGSACSRIWQSPRFIARKRGFT :::::::::::::::::::::::::::::::::::::::: CCDS11 MARLALSPVPSHWMVALLLLLSAEPVPAARSEDRYRNPKG-------------------- 10 20 30 40 70 80 90 100 110 120 pF1KE6 VKMHCYMNSASGNVSWLWKQEMDENPQQLKLEKGRMEESQNESLATLTIQGIRFEDNGIY CCDS11 ------------------------------------------------------------ 130 140 150 160 170 180 pF1KE6 FCQQKCNNTSEVYQGCGTELRVMGFSTLAQLKQRNTLKDGIIMIQTLLIILFIIVPIFLL :::::::::::::::::::::::::::::::::::: CCDS11 ------------------------FSTLAQLKQRNTLKDGIIMIQTLLIILFIIVPIFLL 50 60 70 190 200 210 220 pF1KE6 LDKDDSKAGMEEDHTYEGLDIDQTATYEDIVTLRTGEVKWSVGEHPGQE ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LDKDDSKAGMEEDHTYEGLDIDQTATYEDIVTLRTGEVKWSVGEHPGQE 80 90 100 110 120 229 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 15:24:17 2016 done: Tue Nov 8 15:24:17 2016 Total Scan time: 2.060 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]