FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3895, 123 aa 1>>>pF1KE3895 123 - 123 aa - 123 aa Library: human.CCDS.faa 18921897 residues in 33420 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.7000+/-0.000528; mu= 13.5384+/- 0.032 mean_var=53.0945+/-10.472, 0's: 0 Z-trim(111.8): 11 B-trim: 0 in 0/52 Lambda= 0.176015 statistics sampled from 12846 (12856) to 12846 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.779), E-opt: 0.2 (0.385), width: 16 Scan time: 0.820 The best scores are: opt bits E(33420) CCDS30654.1 THEMIS2 gene_id:9473|Hs109|chr1 ( 123) 830 217.7 1.5e-57 CCDS30653.1 THEMIS2 gene_id:9473|Hs109|chr1 ( 260) 526 140.7 4.8e-34 CCDS65461.1 THEMIS2 gene_id:9473|Hs109|chr1 ( 514) 526 140.8 8.5e-34 CCDS41290.1 THEMIS2 gene_id:9473|Hs109|chr1 ( 643) 526 140.9 1e-33 CCDS34534.1 THEMIS gene_id:387357|Hs109|chr6 ( 641) 258 72.8 3.1e-13 CCDS55056.1 THEMIS gene_id:387357|Hs109|chr6 ( 680) 258 72.9 3.3e-13 >>CCDS30654.1 THEMIS2 gene_id:9473|Hs109|chr1 (123 aa) initn: 830 init1: 830 opt: 830 Z-score: 1146.9 bits: 217.7 E(33420): 1.5e-57 Smith-Waterman score: 830; 100.0% identity (100.0% similar) in 123 aa overlap (1-123:1-123) 10 20 30 40 50 60 pF1KE3 MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 VCENPKTSQTMELAPNFQVFSSLRIAATRSAAQTQGEDLARVHQGWLQYVQQDSCPQEGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 VCENPKTSQTMELAPNFQVFSSLRIAATRSAAQTQGEDLARVHQGWLQYVQQDSCPQEGP 70 80 90 100 110 120 pF1KE3 QAR ::: CCDS30 QAR >>CCDS30653.1 THEMIS2 gene_id:9473|Hs109|chr1 (260 aa) initn: 525 init1: 525 opt: 526 Z-score: 724.9 bits: 140.7 E(33420): 4.8e-34 Smith-Waterman score: 526; 95.2% identity (96.4% similar) in 84 aa overlap (1-83:1-84) 10 20 30 40 50 60 pF1KE3 MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 VCENPKTSQTMELAPNFQ-VFSSLRIAATRSAAQTQGEDLARVHQGWLQYVQQDSCPQEG :::::::::::::::::: :. : CCDS30 VCENPKTSQTMELAPNFQGYFTPLNTPQSYETLEELVSATTQSSKQLPTCFMSTHRIVTE 70 80 90 100 110 120 >-- initn: 319 init1: 305 opt: 308 Z-score: 425.7 bits: 85.3 E(33420): 2.2e-17 Smith-Waterman score: 308; 86.5% identity (94.2% similar) in 52 aa overlap (72-123:209-260) 50 60 70 80 90 100 pF1KE3 CCLSTGDLIKVTQVRLQKVVCENPKTSQTMELAPNFQVFSSLRIAATRSAAQTQGEDLAR :. ...:::::::::::::::::::::: CCDS30 LQVLQDPALKDLVLTCPTLPWHSLILRPQYEIQAIMHIFSSLRIAATRSAAQTQGEDLAR 180 190 200 210 220 230 110 120 pF1KE3 VHQGWLQYVQQDSCPQEGPQAR :::::::::::::::::::::: CCDS30 VHQGWLQYVQQDSCPQEGPQAR 240 250 260 >>CCDS65461.1 THEMIS2 gene_id:9473|Hs109|chr1 (514 aa) initn: 525 init1: 525 opt: 526 Z-score: 720.5 bits: 140.8 E(33420): 8.5e-34 Smith-Waterman score: 526; 95.2% identity (96.4% similar) in 84 aa overlap (1-83:1-84) 10 20 30 40 50 60 pF1KE3 MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 VCENPKTSQTMELAPNFQ-VFSSLRIAATRSAAQTQGEDLARVHQGWLQYVQQDSCPQEG :::::::::::::::::: :. : CCDS65 VCENPKTSQTMELAPNFQGYFTPLNTPQSYETLEELVSATTQSSKQLPTCFMSTHRIVTE 70 80 90 100 110 120 >>CCDS41290.1 THEMIS2 gene_id:9473|Hs109|chr1 (643 aa) initn: 546 init1: 525 opt: 526 Z-score: 719.0 bits: 140.9 E(33420): 1e-33 Smith-Waterman score: 526; 95.2% identity (96.4% similar) in 84 aa overlap (1-83:1-84) 10 20 30 40 50 60 pF1KE3 MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 VCENPKTSQTMELAPNFQ-VFSSLRIAATRSAAQTQGEDLARVHQGWLQYVQQDSCPQEG :::::::::::::::::: :. : CCDS41 VCENPKTSQTMELAPNFQGYFTPLNTPQSYETLEELVSATTQSSKQLPTCFMSTHRIVTE 70 80 90 100 110 120 >>CCDS34534.1 THEMIS gene_id:387357|Hs109|chr6 (641 aa) initn: 278 init1: 228 opt: 258 Z-score: 351.3 bits: 72.8 E(33420): 3.1e-13 Smith-Waterman score: 258; 40.6% identity (71.7% similar) in 106 aa overlap (6-105:5-108) 10 20 30 40 50 60 pF1KE3 MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV :..::..:: .:::::.. .:.:.::::::. :::::.:::..::.: ....:. CCDS34 MALSLEEFVHSLDLRTLPRVLEIQAGIYLEGSIYEMFGNECCFSTGEVIKITGLKVKKI 10 20 30 40 50 70 80 90 100 110 pF1KE3 V---CENPK---TSQTMELAPNFQVFSSLRIAATRSAAQTQGEDLARVHQGWLQYVQQDS . ::. . . : .:: :: . ..:.: .. :. : .: : CCDS34 IAEICEQIEGCESLQPFELPMNFPGL--FKIVADKTPYLTMEEITRTIHIGPSRLGHPCF 60 70 80 90 100 110 120 pF1KE3 CPQEGPQAR CCDS34 YHQKDIKLENLIIKQGEQIMLNSVEEIDGEIMVSCAVARNHQTHSFNLPLSQEGEFYECE 120 130 140 150 160 170 >>CCDS55056.1 THEMIS gene_id:387357|Hs109|chr6 (680 aa) initn: 278 init1: 228 opt: 258 Z-score: 350.9 bits: 72.9 E(33420): 3.3e-13 Smith-Waterman score: 258; 40.6% identity (71.7% similar) in 106 aa overlap (6-105:5-108) 10 20 30 40 50 60 pF1KE3 MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV :..::..:: .:::::.. .:.:.::::::. :::::.:::..::.: ....:. CCDS55 MALSLEEFVHSLDLRTLPRVLEIQAGIYLEGSIYEMFGNECCFSTGEVIKITGLKVKKI 10 20 30 40 50 70 80 90 100 110 pF1KE3 V---CENPK---TSQTMELAPNFQVFSSLRIAATRSAAQTQGEDLARVHQGWLQYVQQDS . ::. . . : .:: :: . ..:.: .. :. : .: : CCDS55 IAEICEQIEGCESLQPFELPMNFPGL--FKIVADKTPYLTMEEITRTIHIGPSRLGHPCF 60 70 80 90 100 110 120 pF1KE3 CPQEGPQAR CCDS55 YHQKDIKLENLIIKQGEQIMLNSVEEIDGEIMVSCAVARNHQTHSFNLPLSQEGEFYECE 120 130 140 150 160 170 123 residues in 1 query sequences 18921897 residues in 33420 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Wed Aug 4 20:31:13 2021 done: Wed Aug 4 20:31:14 2021 Total Scan time: 0.820 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]