# /hgtech/tools/fasta-34.26.5_v890/fasta34_t -T 8 -b50 -d10 -E0.01 -H -O./tmp/hk01743.fasta.nr -Q ../query/KIAA1166.ptfa /cdna2/lib/nr/nr 2
 FASTA searches a protein or DNA sequence data bank
 version 34.26.5 April 26, 2007
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

KIAA1166, 165 aa
 vs /cdna2/lib/nr/nr library

2693465022 residues in 7827732 sequences
 statistics sampled from 60000 to 7825945 sequences
 Expectation_n fit: rho(ln(x))= 5.4368+/-0.00019; mu= 5.2507+/- 0.011
 mean_var=90.4678+/-17.317, 0's: 42 Z-trim: 48 B-trim: 130 in 2/64
 Lambda= 0.134843

FASTA (3.5 Sept 2006) function [optimized, BL50 matrix (15:-5)] ktup: 2
 join: 36, opt: 24, open/ext: -10/-2, width: 16

The best scores are:                                       opt bits E(7827732)
gi|149042276|gb|EDL95983.1| similar to cDNA sequen ( 133)  840 172.5 2.1e-41
gi|194374291|dbj|BAG57041.1| unnamed protein produ ( 176)  840 172.6 2.5e-41
gi|116283958|gb|AAH50850.1| AK129302 protein [Mus  ( 196)  840 172.6 2.8e-41
gi|74206933|dbj|BAE33268.1| unnamed protein produc ( 224)  836 171.9 5.2e-41
gi|126342160|ref|XP_001378910.1| PREDICTED: hypoth ( 224)  834 171.5 6.8e-41
gi|63102314|gb|AAH94942.1| CDNA sequence AK129302  ( 224)  830 170.7 1.2e-40
gi|10434587|dbj|BAB14308.1| unnamed protein produc ( 224)  821 169.0 3.9e-40
gi|52354764|gb|AAH82892.1| LOC494779 protein [Xeno ( 224)  816 168.0 7.7e-40
gi|49118579|gb|AAH73590.1| MGC82889 protein [Xenop ( 224)  814 167.6 1e-39
gi|213625683|gb|AAI71113.1| Hypothetical protein L ( 224)  813 167.4 1.2e-39
gi|31419167|gb|AAH53110.1| Zgc:63849 [Danio rerio] ( 224)  786 162.2 4.4e-38
gi|122889504|emb|CAM14599.1| novel protein [Mus mu ( 217)  769 158.8 4.3e-37
gi|209732682|gb|ACI67210.1| Hepatocellular carcino ( 224)  759 156.9 1.7e-36
gi|209735668|gb|ACI68703.1| Hepatocellular carcino ( 233)  759 156.9 1.7e-36
gi|47226235|emb|CAG08382.1| unnamed protein produc ( 224)  757 156.5 2.2e-36
gi|118089258|ref|XP_420173.2| PREDICTED: hypotheti ( 242)  754 156.0 3.5e-36
gi|193784804|dbj|BAG53957.1| unnamed protein produ ( 219)  719 149.1 3.6e-34
gi|215502243|gb|EEC11737.1| hepatocellular carcino ( 226)  537 113.7 1.7e-23
gi|210111088|gb|EEA58901.1| hypothetical protein B ( 173)  534 113.0 2.1e-23
gi|210111090|gb|EEA58903.1| hypothetical protein B ( 213)  534 113.1 2.4e-23
gi|149272035|ref|XP_001473240.1| PREDICTED: simila (  96)  488 103.9 6.7e-21
gi|156210430|gb|EDO31605.1| predicted protein [Nem ( 232)  471 100.9 1.3e-19
gi|189234221|ref|XP_972511.2| PREDICTED: similar t ( 266)  460  98.8 6.2e-19
gi|148682283|gb|EDL14230.1| mCG115432 [Mus musculu (  78)  451  96.6 8.4e-19
gi|212515588|gb|EEB17712.1| conserved hypothetical ( 263)  453  97.4 1.6e-18
gi|66517924|ref|XP_395763.2| PREDICTED: similar to ( 247)  437  94.3 1.3e-17
gi|108882292|gb|EAT46517.1| conserved hypothetical ( 271)  434  93.8 2.1e-17
gi|108882293|gb|EAT46518.1| conserved hypothetical ( 272)  434  93.8 2.1e-17
gi|193624978|ref|XP_001946538.1| PREDICTED: simila ( 290)  426  92.2 6.4e-17
gi|167862660|gb|EDS26043.1| conserved hypothetical ( 272)  425  92.0 7e-17
gi|156547885|ref|XP_001607797.1| PREDICTED: simila ( 246)  419  90.8 1.5e-16
gi|198423134|ref|XP_002131537.1| PREDICTED: simila ( 227)  396  86.3 3.1e-15
gi|194188966|gb|EDX02550.1| GE15631 [Drosophila ya ( 334)  379  83.1 4e-14
gi|7293304|gb|AAF48684.1| CG13001 [Drosophila mela ( 335)  379  83.1 4e-14
gi|190649164|gb|EDV46442.1| GG18214 [Drosophila er ( 337)  379  83.1 4.1e-14
gi|198149420|gb|EAL31383.2| GA11967 [Drosophila ps ( 326)  378  82.9 4.5e-14
gi|190622796|gb|EDV38320.1| GF21819 [Drosophila an ( 329)  378  82.9 4.6e-14
gi|190585054|gb|EDV25123.1| hypothetical protein T ( 232)  376  82.4 4.6e-14
gi|194171607|gb|EDW86508.1| GK18524 [Drosophila wi ( 353)  378  83.0 4.8e-14
gi|194149862|gb|EDW65553.1| GJ19320 [Drosophila vi ( 328)  376  82.6 6e-14
gi|193907904|gb|EDW06771.1| GI15222 [Drosophila mo ( 328)  376  82.6 6e-14
gi|193893282|gb|EDV92148.1| GH24201 [Drosophila gr ( 346)  376  82.6 6.2e-14
gi|157018501|gb|EAA07059.3| AGAP010728-PA [Anophel ( 251)  374  82.1 6.4e-14
gi|149441823|ref|XP_001517803.1| PREDICTED: hypoth ( 201)  370  81.2 9.4e-14
gi|149450157|ref|XP_001520769.1| PREDICTED: simila (  57)  346  76.0 9.5e-13
gi|149439497|ref|XP_001520139.1| PREDICTED: hypoth (  67)  336  74.2 4.1e-12
gi|194114658|gb|EDW36701.1| GL22723 [Drosophila pe ( 183)  282  64.0 1.2e-08
gi|221110225|ref|XP_002166308.1| PREDICTED: simila ( 138)  212  50.3 0.00013
gi|167866651|gb|EDS30034.1| conserved hypothetical ( 128)  208  49.5 0.00021

>>gi|149042276|gb|EDL95983.1| similar to cDNA sequence A (133 aa)
 initn: 840 init1: 840 opt: 840 Z-score: 899.7 bits: 172.5 E(): 2.1e-41
Smith-Waterman score: 840; 100.000% identity (100.000% similar) in 133 aa overlap (33-165:1-133)

             10        20        30        40        50        60
KIAA11 ARRRWLHCVYCIPWLVYLYISRDVKLTVKSMADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                     ::::::::::::::::::::::::::::::
gi|149                               MADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                             10        20        30

             70        80        90       100       110       120
KIAA11 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|149 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
               40        50        60        70        80        90

            130       140       150       160
KIAA11 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLE
       :::::::::::::::::::::::::::::::::::::::::::
gi|149 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLE
              100       110       120       130

>>gi|194374291|dbj|BAG57041.1| unnamed protein product [ (176 aa)
 initn: 840 init1: 840 opt: 840 Z-score: 898.1 bits: 172.6 E(): 2.5e-41
Smith-Waterman score: 840; 100.000% identity (100.000% similar) in 133 aa overlap (33-165:1-133)

             10        20        30        40        50        60
KIAA11 ARRRWLHCVYCIPWLVYLYISRDVKLTVKSMADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                     ::::::::::::::::::::::::::::::
gi|194                               MADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                             10        20        30

             70        80        90       100       110       120
KIAA11 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|194 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
               40        50        60        70        80        90

            130       140       150       160
KIAA11 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLE
       :::::::::::::::::::::::::::::::::::::::::::
gi|194 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLEPACHVTSKFTGMHLYAL
              100       110       120       130       140       150

gi|194 FARPRVGPGTPKSRNGSRMNKEREST
              160       170

>>gi|116283958|gb|AAH50850.1| AK129302 protein [Mus musc (196 aa)
 initn: 840 init1: 840 opt: 840 Z-score: 897.5 bits: 172.6 E(): 2.8e-41
Smith-Waterman score: 840; 100.000% identity (100.000% similar) in 133 aa overlap (33-165:1-133)

             10        20        30        40        50        60
KIAA11 ARRRWLHCVYCIPWLVYLYISRDVKLTVKSMADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                     ::::::::::::::::::::::::::::::
gi|116                               MADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                             10        20        30

             70        80        90       100       110       120
KIAA11 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|116 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
               40        50        60        70        80        90

            130       140       150       160
KIAA11 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLE
       :::::::::::::::::::::::::::::::::::::::::::
gi|116 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLEMTHPMYVIPKVTLRSRK
              100       110       120       130       140       150

gi|116 QNGRLNLKNPPYLNPLLLQLLLLNNSRWPGSRTQGRQPPSGSSLHL
              160       170       180       190

>>gi|74206933|dbj|BAE33268.1| unnamed protein product [M (224 aa)
 initn: 836 init1: 836 opt: 836 Z-score: 892.5 bits: 171.9 E(): 5.2e-41
Smith-Waterman score: 836; 99.248% identity (100.000% similar) in 133 aa overlap (33-165:1-133)

             10        20        30        40        50        60
KIAA11 ARRRWLHCVYCIPWLVYLYISRDVKLTVKSMADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                     ::::::::::::::::::::::::::::::
gi|742                               MADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                             10        20        30

             70        80        90       100       110       120
KIAA11 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|742 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
               40        50        60        70        80        90

            130       140       150       160
KIAA11 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLE
       ::::::::::::::::::::::::::::::::::::::::::.
gi|742 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLDYFEKQKAEWQTEPQEPP
              100       110       120       130       140       150

gi|742 IPESLAAAAAAAQQLQVARKQDTRQTATFRQQPPPMKACLSCHQQIHRNAPICPLCKAKS
              160       170       180       190       200       210

>>gi|126342160|ref|XP_001378910.1| PREDICTED: hypothetic (224 aa)
 initn: 834 init1: 834 opt: 834 Z-score: 890.4 bits: 171.5 E(): 6.8e-41
Smith-Waterman score: 834; 98.496% identity (100.000% similar) in 133 aa overlap (33-165:1-133)

             10        20        30        40        50        60
KIAA11 ARRRWLHCVYCIPWLVYLYISRDVKLTVKSMADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                     :::::::::::::::::::::.::::::::
gi|126                               MADEQEIMCKLESIKEIRNKTMQMEKIKAR
                                             10        20        30

             70        80        90       100       110       120
KIAA11 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|126 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
               40        50        60        70        80        90

            130       140       150       160
KIAA11 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLE
       ::::::::::::::::::::::::::::::::::::::::::.
gi|126 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLDYFEKQKAEWQTEPQEPP
              100       110       120       130       140       150

gi|126 IPESLAAAAAAAQQLQVARKQDTRQTATFRQQPPPMKACLSCHQQIHRNAPICPLCKAKS
              160       170       180       190       200       210

>>gi|63102314|gb|AAH94942.1| CDNA sequence AK129302 [Mus (224 aa)
 initn: 830 init1: 830 opt: 830 Z-score: 886.2 bits: 170.7 E(): 1.2e-40
Smith-Waterman score: 830; 98.496% identity (100.000% similar) in 133 aa overlap (33-165:1-133)

             10        20        30        40        50        60
KIAA11 ARRRWLHCVYCIPWLVYLYISRDVKLTVKSMADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                     ::::::::::::::::::::::::::::::
gi|631                               MADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                             10        20        30

             70        80        90       100       110       120
KIAA11 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
       :::::::::::::::::::.::::::::::::::::::::::::::::::::::::::::
gi|631 LKAEFEALESEERHLKEYKREMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
               40        50        60        70        80        90

            130       140       150       160
KIAA11 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLE
       ::::::::::::::::::::::::::::::::::::::::::.
gi|631 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLDYFEKQKAEWQTEPQEPP
              100       110       120       130       140       150

gi|631 IPESLAAAAAAAQQLQVARKQDTRQTATFRQQPPPMKACLSCHQQIHRNAPICPLCKAKS
              160       170       180       190       200       210

>>gi|10434587|dbj|BAB14308.1| unnamed protein product [H (224 aa)
 initn: 854 init1: 821 opt: 821 Z-score: 876.7 bits: 169.0 E(): 3.9e-40
Smith-Waterman score: 821; 97.744% identity (100.000% similar) in 133 aa overlap (33-165:1-133)

             10        20        30        40        50        60
KIAA11 ARRRWLHCVYCIPWLVYLYISRDVKLTVKSMADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                     ::::::::::::::::::::::::::::::
gi|104                               MADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                             10        20        30

             70        80        90       100       110       120
KIAA11 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
       :::::::::::::::::::::::::::::::::::::::.::::::::::::::::::::
gi|104 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIRADINVMENTIKQSENDLNKL
               40        50        60        70        80        90

            130       140       150       160
KIAA11 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLE
       :::::::::.::::::::::::::::::::::::::::::::.
gi|104 LESTRRLHDKYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLDYFEKQKAEWQTEPQEPP
              100       110       120       130       140       150

gi|104 IPESLAAAAAAAQQLQVARKQDTRQTATFRQQPPPMKACLSCHQQIHRNAPICPLCKAKS
              160       170       180       190       200       210

>>gi|52354764|gb|AAH82892.1| LOC494779 protein [Xenopus (224 aa)
 initn: 822 init1: 799 opt: 816 Z-score: 871.5 bits: 168.0 E(): 7.7e-40
Smith-Waterman score: 816; 96.241% identity (100.000% similar) in 133 aa overlap (33-165:1-133)

             10        20        30        40        50        60
KIAA11 ARRRWLHCVYCIPWLVYLYISRDVKLTVKSMADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                     :.:.::::::::::::::::::::::::::
gi|523                               MGDDQEIMCKLESIKEIRNKTLQMEKIKAR
                                             10        20        30

             70        80        90       100       110       120
KIAA11 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
       ::::::.:::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|523 LKAEFETLESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
               40        50        60        70        80        90

            130       140       150       160
KIAA11 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLE
       ::::::::.:::::::::::::::::::::::::::::::::.
gi|523 LESTRRLHEEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLDYFEKQKAEWQTEPQDPP
              100       110       120       130       140       150

gi|523 IPESLAAAAAAAQQLQVARKQDNRQTATFRQQPPPMKACLSCHQQIHRNAPICPLCKAKS
              160       170       180       190       200       210

>>gi|49118579|gb|AAH73590.1| MGC82889 protein [Xenopus l (224 aa)
 initn: 830 init1: 807 opt: 814 Z-score: 869.4 bits: 167.6 E(): 1e-39
Smith-Waterman score: 814; 96.241% identity (100.000% similar) in 133 aa overlap (33-165:1-133)

             10        20        30        40        50        60
KIAA11 ARRRWLHCVYCIPWLVYLYISRDVKLTVKSMADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                     :.::::::::::::::::::::::::::::
gi|491                               MGDEQEIMCKLESIKEIRNKTLQMEKIKAR
                                             10        20        30

             70        80        90       100       110       120
KIAA11 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
       ::::::.:::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|491 LKAEFETLESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
               40        50        60        70        80        90

            130       140       150       160
KIAA11 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLE
       ::::::::.::::::::::::::::::::::.::::::::::.
gi|491 LESTRRLHEEYKPLKEHVDALRMTLGLQRLPNLCEEEEKLSLDYFEKQKAEWQTEPQEPP
              100       110       120       130       140       150

gi|491 IPESLAAAAAAAQQLQVARKQDNRQTATFRQQPPPMKACLSCHQQIHRNAPICPLCKAKS
              160       170       180       190       200       210

>>gi|213625683|gb|AAI71113.1| Hypothetical protein LOC54 (224 aa)
 initn: 829 init1: 806 opt: 813 Z-score: 868.3 bits: 167.4 E(): 1.2e-39
Smith-Waterman score: 813; 96.241% identity (99.248% similar) in 133 aa overlap (33-165:1-133)

             10        20        30        40        50        60
KIAA11 ARRRWLHCVYCIPWLVYLYISRDVKLTVKSMADEQEIMCKLESIKEIRNKTLQMEKIKAR
                                     :.::::::::::::::::::: ::::::::
gi|213                               MGDEQEIMCKLESIKEIRNKTHQMEKIKAR
                                             10        20        30

             70        80        90       100       110       120
KIAA11 LKAEFEALESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
       ::::::.:::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|213 LKAEFESLESEERHLKEYKQEMDLLLQEKMAHVEELRLIHADINVMENTIKQSENDLNKL
               40        50        60        70        80        90

            130       140       150       160
KIAA11 LESTRRLHDEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLE
       ::::::::.:::::::::::::::::::::::::::::::::.
gi|213 LESTRRLHEEYKPLKEHVDALRMTLGLQRLPDLCEEEEKLSLDYFEKQKAEWQTEPQEPP
              100       110       120       130       140       150

gi|213 IPESLAAAAAAAQQLQVARKQDNRQTATFRQQPPPMKACLSCHQQIHRNAPICPLCKAKS
              160       170       180       190       200       210


165 residues in 1 query sequences
2693465022 residues in 7827732 library sequences
 Tcomplib [34.26] (8 proc)
 start: Tue Mar 3 23:53:49 2009 done: Tue Mar 3 23:59:56 2009
 Total Scan time: 1110.370 Total Display time: 0.020

Function used was FASTA [version 34.26.5 April 26, 2007]