FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5154, 100 aa 1>>>pF1KE5154 100 - 100 aa - 100 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5119+/-0.000556; mu= 10.6378+/- 0.034 mean_var=104.6881+/-21.975, 0's: 0 Z-trim(115.8): 121 B-trim: 852 in 1/51 Lambda= 0.125350 statistics sampled from 16254 (16415) to 16254 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.843), E-opt: 0.2 (0.504), width: 16 Scan time: 1.350 The best scores are: opt bits E(32554) CCDS35070.2 BARX1 gene_id:56033|Hs108|chr9 ( 254) 667 129.6 7.8e-31 CCDS8481.1 BARX2 gene_id:8538|Hs108|chr11 ( 279) 447 89.9 7.9e-19 >>CCDS35070.2 BARX1 gene_id:56033|Hs108|chr9 (254 aa) initn: 667 init1: 667 opt: 667 Z-score: 667.0 bits: 129.6 E(32554): 7.8e-31 Smith-Waterman score: 667; 100.0% identity (100.0% similar) in 100 aa overlap (1-100:155-254) 10 20 30 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQL :::::::::::::::::::::::::::::: CCDS35 GKLEAAGPGEPGTKAKKGRRSRTVFTELQLMGLEKRFEKQKYLSTPDRIDLAESLGLSQL 130 140 150 160 170 180 40 50 60 70 80 90 pF1KE5 QVKTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 QVKTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVP 190 200 210 220 230 240 100 pF1KE5 GEPSDRSRED :::::::::: CCDS35 GEPSDRSRED 250 >>CCDS8481.1 BARX2 gene_id:8538|Hs108|chr11 (279 aa) initn: 454 init1: 315 opt: 447 Z-score: 451.5 bits: 89.9 E(32554): 7.9e-19 Smith-Waterman score: 447; 68.7% identity (86.9% similar) in 99 aa overlap (1-99:146-242) 10 20 30 pF1KE5 MGLEKRFEKQKYLSTPDRIDLAESLGLSQL :::::.:.::::::::::.:::.::::.:: CCDS84 LASSESETEQPTPRQKKPRRSRTIFTELQLMGLEKKFQKQKYLSTPDRLDLAQSLGLTQL 120 130 140 150 160 170 40 50 60 70 80 90 pF1KE5 QVKTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKPAEVP :::::::::::::::.::.:: :.:::::::::::::::::.. .:. .. . : CCDS84 QVKTWYQNRRMKWKKMVLKGGQ-EAPTKPKGRPKKNSIPTSEEIEAEEKMNSQAQGQE-Q 180 190 200 210 220 230 100 pF1KE5 GEPSDRSRED :::. ..: CCDS84 LEPSQGQEELCEAQEPKARDVPLEMAEPPDPPQELPIPSSEPPPLS 240 250 260 270 100 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:01:57 2016 done: Mon Nov 7 22:01:57 2016 Total Scan time: 1.350 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]