FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6334, 316 aa 1>>>pF1KE6334 316 - 316 aa - 316 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.0040+/-0.000814; mu= 17.8550+/- 0.049 mean_var=67.5706+/-13.354, 0's: 0 Z-trim(107.6): 22 B-trim: 0 in 0/51 Lambda= 0.156025 statistics sampled from 9690 (9708) to 9690 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.683), E-opt: 0.2 (0.298), width: 16 Scan time: 2.010 The best scores are: opt bits E(32554) CCDS43268.1 NUDT6 gene_id:11162|Hs108|chr4 ( 316) 2164 495.8 1.8e-140 CCDS3729.1 NUDT6 gene_id:11162|Hs108|chr4 ( 147) 1000 233.5 7.5e-62 >>CCDS43268.1 NUDT6 gene_id:11162|Hs108|chr4 (316 aa) initn: 2164 init1: 2164 opt: 2164 Z-score: 2635.2 bits: 495.8 E(32554): 1.8e-140 Smith-Waterman score: 2164; 100.0% identity (100.0% similar) in 316 aa overlap (1-316:1-316) 10 20 30 40 50 60 pF1KE6 MRQPLSWGRWRAMLARTYGPGPSAGYRWASGAQGYVRNPPVGACDLQGELDRFGGISVRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MRQPLSWGRWRAMLARTYGPGPSAGYRWASGAQGYVRNPPVGACDLQGELDRFGGISVRL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 ARLDALDRLDAAAFQKGLQAAVQQWRSEGRTAVWLHIPILQSRFIAPAASLGFCFHHAES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 ARLDALDRLDAAAFQKGLQAAVQQWRSEGRTAVWLHIPILQSRFIAPAASLGFCFHHAES 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 DSSTLTLWLREGPSRLPGYASHQVGVAGAVFDESTRKILVVQDRNKLKNMWKFPGGLSEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 DSSTLTLWLREGPSRLPGYASHQVGVAGAVFDESTRKILVVQDRNKLKNMWKFPGGLSEP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 EEDIGDTAVREVFEETGIKSEFRSVLSIRQQHTNPGAFGKSDMYIICRLKPYSFTINFCQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 EEDIGDTAVREVFEETGIKSEFRSVLSIRQQHTNPGAFGKSDMYIICRLKPYSFTINFCQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 EECLRCEWMDLNDLAKTENTTPITSRVARLLLYGYREGFDKIDLTVEELPAVYTGLFYKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 EECLRCEWMDLNDLAKTENTTPITSRVARLLLYGYREGFDKIDLTVEELPAVYTGLFYKL 250 260 270 280 290 300 310 pF1KE6 YHKELPENYKTMKGID :::::::::::::::: CCDS43 YHKELPENYKTMKGID 310 >>CCDS3729.1 NUDT6 gene_id:11162|Hs108|chr4 (147 aa) initn: 1000 init1: 1000 opt: 1000 Z-score: 1223.9 bits: 233.5 E(32554): 7.5e-62 Smith-Waterman score: 1000; 100.0% identity (100.0% similar) in 147 aa overlap (170-316:1-147) 140 150 160 170 180 190 pF1KE6 ASHQVGVAGAVFDESTRKILVVQDRNKLKNMWKFPGGLSEPEEDIGDTAVREVFEETGIK :::::::::::::::::::::::::::::: CCDS37 MWKFPGGLSEPEEDIGDTAVREVFEETGIK 10 20 30 200 210 220 230 240 250 pF1KE6 SEFRSVLSIRQQHTNPGAFGKSDMYIICRLKPYSFTINFCQEECLRCEWMDLNDLAKTEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 SEFRSVLSIRQQHTNPGAFGKSDMYIICRLKPYSFTINFCQEECLRCEWMDLNDLAKTEN 40 50 60 70 80 90 260 270 280 290 300 310 pF1KE6 TTPITSRVARLLLYGYREGFDKIDLTVEELPAVYTGLFYKLYHKELPENYKTMKGID ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 TTPITSRVARLLLYGYREGFDKIDLTVEELPAVYTGLFYKLYHKELPENYKTMKGID 100 110 120 130 140 316 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:12:36 2016 done: Tue Nov 8 12:12:36 2016 Total Scan time: 2.010 Total Display time: -0.040 Function used was FASTA [36.3.4 Apr, 2011]