FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1335, 232 aa 1>>>pF1KE1335 232 - 232 aa - 232 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6110+/-0.000626; mu= 14.0676+/- 0.038 mean_var=81.7974+/-16.610, 0's: 0 Z-trim(113.3): 9 B-trim: 321 in 1/51 Lambda= 0.141809 statistics sampled from 13903 (13910) to 13903 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.784), E-opt: 0.2 (0.427), width: 16 Scan time: 2.470 The best scores are: opt bits E(32554) CCDS47308.1 CD74 gene_id:972|Hs108|chr5 ( 232) 1587 333.3 8e-92 CCDS47309.1 CD74 gene_id:972|Hs108|chr5 ( 296) 1429 301.1 5.2e-82 CCDS34276.1 CD74 gene_id:972|Hs108|chr5 ( 160) 987 210.5 5.3e-55 >>CCDS47308.1 CD74 gene_id:972|Hs108|chr5 (232 aa) initn: 1587 init1: 1587 opt: 1587 Z-score: 1762.0 bits: 333.3 E(32554): 8e-92 Smith-Waterman score: 1587; 100.0% identity (100.0% similar) in 232 aa overlap (1-232:1-232) 10 20 30 40 50 60 pF1KE1 MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 LAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPKPPKPVSKMRMATPLLMQALPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 LAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPKPPKPVSKMRMATPLLMQALPM 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 GALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 FESWMHHWLLFEMSRHSLEQKPTDAPPKESLELEDPSSGLGVTKQDLGPVPM :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 FESWMHHWLLFEMSRHSLEQKPTDAPPKESLELEDPSSGLGVTKQDLGPVPM 190 200 210 220 230 >>CCDS47309.1 CD74 gene_id:972|Hs108|chr5 (296 aa) initn: 1573 init1: 1429 opt: 1429 Z-score: 1585.8 bits: 301.1 E(32554): 5.2e-82 Smith-Waterman score: 1429; 100.0% identity (100.0% similar) in 208 aa overlap (1-208:1-208) 10 20 30 40 50 60 pF1KE1 MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 LAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPKPPKPVSKMRMATPLLMQALPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 LAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPKPPKPVSKMRMATPLLMQALPM 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 GALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 FESWMHHWLLFEMSRHSLEQKPTDAPPKESLELEDPSSGLGVTKQDLGPVPM :::::::::::::::::::::::::::: CCDS47 FESWMHHWLLFEMSRHSLEQKPTDAPPKVLTKCQEEVSHIPAVHPGSFRPKCDENGNYLP 190 200 210 220 230 240 >>CCDS34276.1 CD74 gene_id:972|Hs108|chr5 (160 aa) initn: 1001 init1: 987 opt: 987 Z-score: 1100.9 bits: 210.5 E(32554): 5.3e-55 Smith-Waterman score: 987; 99.3% identity (100.0% similar) in 148 aa overlap (1-148:1-148) 10 20 30 40 50 60 pF1KE1 MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 LAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPKPPKPVSKMRMATPLLMQALPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 LAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPKPPKPVSKMRMATPLLMQALPM 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV :::::::::::::::::::::::::::. CCDS34 GALPQGPMQNATKYGNMTEDHVMHLLQSHWNWRTRLLGWV 130 140 150 160 232 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 00:54:53 2016 done: Mon Nov 7 00:54:54 2016 Total Scan time: 2.470 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]