FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2740, 504 aa 1>>>pF1KE2740 504 - 504 aa - 504 aa Library: human.CCDS.faa 18921897 residues in 33420 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.8667+/-0.000784; mu= 3.9269+/- 0.047 mean_var=213.9887+/-43.450, 0's: 0 Z-trim(116.3): 9 B-trim: 229 in 2/50 Lambda= 0.087676 statistics sampled from 17116 (17124) to 17116 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.81), E-opt: 0.2 (0.512), width: 16 Scan time: 1.820 The best scores are: opt bits E(33420) CCDS3370.2 DOK7 gene_id:285489|Hs109|chr4 ( 504) 3455 449.3 4.8e-126 CCDS87205.1 DOK7 gene_id:285489|Hs109|chr4 ( 464) 2194 289.8 4.6e-78 CCDS87206.1 DOK7 gene_id:285489|Hs109|chr4 ( 194) 1377 186.1 3e-47 CCDS54717.1 DOK7 gene_id:285489|Hs109|chr4 ( 255) 1189 162.5 5.4e-40 >>CCDS3370.2 DOK7 gene_id:285489|Hs109|chr4 (504 aa) initn: 3455 init1: 3455 opt: 3455 Z-score: 2376.7 bits: 449.3 E(33420): 4.8e-126 Smith-Waterman score: 3455; 100.0% identity (100.0% similar) in 504 aa overlap (1-504:1-504) 10 20 30 40 50 60 pF1KE2 MTEAALVEGQVKLRDGKKWKSRWLVLRKPSPVADCLLMLVYKDKSERIKGLRERSSLTLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MTEAALVEGQVKLRDGKKWKSRWLVLRKPSPVADCLLMLVYKDKSERIKGLRERSSLTLE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 DICGLEPGLPYEGLVHTLAIVCLSQAIMLGFDSHEAMCAWDARIRYALGEVHRFHVTVAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 DICGLEPGLPYEGLVHTLAIVCLSQAIMLGFDSHEAMCAWDARIRYALGEVHRFHVTVAP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 GTKLESGPATLHLCNDVLVLARDIPPAVTGQWKLSDLRRYGAVPSGFIFEGGTRCGYWAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 GTKLESGPATLHLCNDVLVLARDIPPAVTGQWKLSDLRRYGAVPSGFIFEGGTRCGYWAG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 VFFLSSAEGEQISFLFDCIVRGISPTKGPFGLRPVLPDPSPPGPSTVEERVAQEALETLQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 VFFLSSAEGEQISFLFDCIVRGISPTKGPFGLRPVLPDPSPPGPSTVEERVAQEALETLQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 LEKRLSLLSHAGRPGSGGDDRSLSSSSSEASHLDVSASSRLTAWPEQSSSSASTSQEGPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LEKRLSLLSHAGRPGSGGDDRSLSSSSSEASHLDVSASSRLTAWPEQSSSSASTSQEGPR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 PAAAQAAGEAMVGASRPPPKPLRPRQLQEVGRQSSSDSGIATGSHSSYSSSLSSYAGSSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 PAAAQAAGEAMVGASRPPPKPLRPRQLQEVGRQSSSDSGIATGSHSSYSSSLSSYAGSSL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 DVWRATDELGSLLSLPAAGAPEPSLCTCLPGTVEYQVPTSLRAHYDTPRSLCLAPRDHSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 DVWRATDELGSLLSLPAAGAPEPSLCTCLPGTVEYQVPTSLRAHYDTPRSLCLAPRDHSP 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE2 PSQGSPGNSAARDSGGQTSAGCPSGWLGTRRRGLVMEAPQGSEATLPGPAPGEPWEAGGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 PSQGSPGNSAARDSGGQTSAGCPSGWLGTRRRGLVMEAPQGSEATLPGPAPGEPWEAGGP 430 440 450 460 470 480 490 500 pF1KE2 HAGPPPAFFSACPVCGGLKVNPPP :::::::::::::::::::::::: CCDS33 HAGPPPAFFSACPVCGGLKVNPPP 490 500 >>CCDS87205.1 DOK7 gene_id:285489|Hs109|chr4 (464 aa) initn: 2190 init1: 2190 opt: 2194 Z-score: 1515.1 bits: 289.8 E(33420): 4.6e-78 Smith-Waterman score: 2194; 92.9% identity (95.7% similar) in 352 aa overlap (148-499:7-355) 120 130 140 150 160 170 pF1KE2 VAPGTKLESGPATLHLCNDVLVLARDIPPAVTGQWKLSDLRRYGAVPSGFIFEGGTRCGY : :: :: : ... . .... . . CCDS87 MTEAALVEGQVKLRDGKKWKS--RWLVLRKPSPVAG 10 20 30 180 190 200 210 220 230 pF1KE2 WAGVFFLSSAEGEQISFLFDCIVRGISPTKGPFGLRPVLPDPSPPGPSTVEERVAQEALE ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS87 -AGVFFLSSAEGEQISFLFDCIVRGISPTKGPFGLRPVLPDPSPPGPSTVEERVAQEALE 40 50 60 70 80 90 240 250 260 270 280 290 pF1KE2 TLQLEKRLSLLSHAGRPGSGGDDRSLSSSSSEASHLDVSASSRLTAWPEQSSSSASTSQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS87 TLQLEKRLSLLSHAGRPGSGGDDRSLSSSSSEASHLDVSASSRLTAWPEQSSSSASTSQE 100 110 120 130 140 150 300 310 320 330 340 350 pF1KE2 GPRPAAAQAAGEAMVGASRPPPKPLRPRQLQEVGRQSSSDSGIATGSHSSYSSSLSSYAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS87 GPRPAAAQAAGEAMVGASRPPPKPLRPRQLQEVGRQSSSDSGIATGSHSSYSSSLSSYAG 160 170 180 190 200 210 360 370 380 390 400 410 pF1KE2 SSLDVWRATDELGSLLSLPAAGAPEPSLCTCLPGTVEYQVPTSLRAHYDTPRSLCLAPRD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS87 SSLDVWRATDELGSLLSLPAAGAPEPSLCTCLPGTVEYQVPTSLRAHYDTPRSLCLAPRD 220 230 240 250 260 270 420 430 440 450 460 470 pF1KE2 HSPPSQGSPGNSAARDSGGQTSAGCPSGWLGTRRRGLVMEAPQGSEATLPGPAPGEPWEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS87 HSPPSQGSPGNSAARDSGGQTSAGCPSGWLGTRRRGLVMEAPQGSEATLPGPAPGEPWEA 280 290 300 310 320 330 480 490 500 pF1KE2 GGPHAGPPPAFFSACPVCGGLKVNPPP :::::::::::::::::::::: CCDS87 GGPHAGPPPAFFSACPVCGGLKGAAASAPGPATAHSGSPGPVAVDSPGPERPRGESPTYV 340 350 360 370 380 390 >>CCDS87206.1 DOK7 gene_id:285489|Hs109|chr4 (194 aa) initn: 1377 init1: 1377 opt: 1377 Z-score: 961.9 bits: 186.1 E(33420): 3e-47 Smith-Waterman score: 1377; 100.0% identity (100.0% similar) in 194 aa overlap (311-504:1-194) 290 300 310 320 330 340 pF1KE2 LTAWPEQSSSSASTSQEGPRPAAAQAAGEAMVGASRPPPKPLRPRQLQEVGRQSSSDSGI :::::::::::::::::::::::::::::: CCDS87 MVGASRPPPKPLRPRQLQEVGRQSSSDSGI 10 20 30 350 360 370 380 390 400 pF1KE2 ATGSHSSYSSSLSSYAGSSLDVWRATDELGSLLSLPAAGAPEPSLCTCLPGTVEYQVPTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS87 ATGSHSSYSSSLSSYAGSSLDVWRATDELGSLLSLPAAGAPEPSLCTCLPGTVEYQVPTS 40 50 60 70 80 90 410 420 430 440 450 460 pF1KE2 LRAHYDTPRSLCLAPRDHSPPSQGSPGNSAARDSGGQTSAGCPSGWLGTRRRGLVMEAPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS87 LRAHYDTPRSLCLAPRDHSPPSQGSPGNSAARDSGGQTSAGCPSGWLGTRRRGLVMEAPQ 100 110 120 130 140 150 470 480 490 500 pF1KE2 GSEATLPGPAPGEPWEAGGPHAGPPPAFFSACPVCGGLKVNPPP :::::::::::::::::::::::::::::::::::::::::::: CCDS87 GSEATLPGPAPGEPWEAGGPHAGPPPAFFSACPVCGGLKVNPPP 160 170 180 190 >>CCDS54717.1 DOK7 gene_id:285489|Hs109|chr4 (255 aa) initn: 1240 init1: 1179 opt: 1189 Z-score: 831.7 bits: 162.5 E(33420): 5.4e-40 Smith-Waterman score: 1189; 83.0% identity (87.9% similar) in 223 aa overlap (1-223:1-219) 10 20 30 40 50 60 pF1KE2 MTEAALVEGQVKLRDGKKWKSRWLVLRKPSPVADCLLMLVYKDKSERIKGLRERSSLTLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MTEAALVEGQVKLRDGKKWKSRWLVLRKPSPVADCLLMLVYKDKSERIKGLRERSSLTLE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 DICGLEPGLPYEGLVHTLAIVCLSQAIMLGFDSHEAMCAWDARIRYALGEVHRFHVTVAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 DICGLEPGLPYEGLVHTLAIVCLSQAIMLGFDSHEAMCAWDARIRYALGEVHRFHVTVAP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 GTKLESGPATLHLCNDVLVLARDIPPAVTGQWKLSDLRRYGAVPSGFIFEGGTRCGYWAG :::::::::::::::::::::::::::::::::::::::::::::::::::::: : : CCDS54 GTKLESGPATLHLCNDVLVLARDIPPAVTGQWKLSDLRRYGAVPSGFIFEGGTR-G-WRL 130 140 150 160 170 190 200 210 220 230 240 pF1KE2 VFFLSSAEGEQISFLFDCIVRGISPTKGPFGLRPVLPDPSPPGPSTVEERVAQEALETLQ . :. . ..:. . : . : .::. :. :: CCDS54 LPVLGRGGADQLPVRLHRP-RHL-PHQGPLWAAAGSTRPKSPGTLDCGGACGPGSPGNPT 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE2 LEKRLSLLSHAGRPGSGGDDRSLSSSSSEASHLDVSASSRLTAWPEQSSSSASTSQEGPR CCDS54 AGEAAEPPLTCGQAGQWRG 240 250 504 residues in 1 query sequences 18921897 residues in 33420 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Dec 17 11:06:48 2018 done: Mon Dec 17 11:06:49 2018 Total Scan time: 1.820 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]