# /usr/local/bin/fasta34_t -T 4 -b50 -d10 -E0.01 -H -O./tmp/mbj00492.fasta.nr -Q ../query/mKIAA1068.ptfa /cdna4/rodent/rouge_util/new.rouge/nfasta/nr 2
FASTA searches a protein or DNA sequence data bank
version 34.26.5 April 26, 2007
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

mKIAA1068, 107 aa
vs /cdna4/rodent/rouge_util/new.rouge/nfasta/nr library

2727779818 residues in 7921681 sequences
statistics sampled from 60000 to 7921314 sequences
Expectation_n fit: rho(ln(x))= 5.0667+/-0.000177; mu= 5.3484+/- 0.010
mean_var=60.6454+/-11.755, 0's: 37 Z-trim: 41 B-trim: 29 in 1/66
Lambda= 0.164693

FASTA (3.5 Sept 2006) function [optimized, BL50 matrix (15:-5)] ktup: 2
join: 36, opt: 24, open/ext: -10/-2, width: 16
The best scores are:                                       opt  bits E(7921681)
gi|123245877|emb|CAM19835.1| NudC domain containin ( 112)  735 182.1 1.5e-44
gi|56205673|emb|CAI24961.1| NudC domain containing ( 251)  685 170.4 1.1e-40
gi|26341230|dbj|BAC34277.1| unnamed protein produc ( 315)  685 170.4 1.3e-40
gi|149047665|gb|EDM00335.1| rCG35703, isoform CRA_ ( 315)  685 170.4 1.3e-40
gi|34784256|gb|AAH57603.1| Nudcd3 protein [Mus mus ( 291)  679 169.0 3.3e-40
gi|78100747|sp|Q8R1N4.3|NUDC3_MOUSE RecName: Full= ( 363)  679 169.0 4e-40
gi|149047664|gb|EDM00334.1| rCG35703, isoform CRA_ ( 363)  679 169.0 4e-40
gi|90079665|dbj|BAE89512.1| unnamed protein produc ( 220)  675 168.0 5.1e-40
gi|10436552|dbj|BAB14855.1| unnamed protein produc ( 256)  675 168.0 5.8e-40
gi|119581504|gb|EAW61100.1| NudC domain containing ( 290)  675 168.0 6.4e-40
gi|211826085|gb|AAH11673.2| NUDCD3 protein [Homo s ( 296)  675 168.0 6.5e-40
gi|90110782|sp|Q5RB75.1|NUDC3_PONAB RecName: Full= ( 361)  675 168.1 7.7e-40
gi|23273927|gb|AAH35014.1| NudC domain containing ( 361)  675 168.1 7.7e-40
gi|145559509|sp|Q8IVD9.3|NUDC3_HUMAN RecName: Full ( 361)  675 168.1 7.7e-40
gi|109066663|ref|XP_001092225.1| PREDICTED: NudC d ( 364)  675 168.1 7.7e-40
gi|74010953|ref|XP_850094.1| PREDICTED: similar to ( 199)  650 162.0 2.9e-38
gi|126302969|ref|XP_001370227.1| PREDICTED: simila ( 364)  653 162.9 2.9e-38
gi|89994047|gb|AAI14137.1| NudC domain containing ( 359)  652 162.6 3.4e-38
gi|194209528|ref|XP_001495826.2| PREDICTED: simila ( 398)  642 160.3 1.9e-37
gi|118113655|ref|XP_001233318.1| PREDICTED: simila (  99)  621 155.0 1.9e-36
gi|224080704|ref|XP_002195998.1| PREDICTED: simila ( 353)  616 154.1 1.3e-35
gi|59808725|gb|AAH89719.1| MGC108341 protein [Xeno ( 346)  605 151.4 7.5e-35
gi|49257406|gb|AAH73360.1| MGC80778 protein [Xenop ( 346)  596 149.3 3.3e-34
gi|120538431|gb|AAI29582.1| LOC100036878 protein [ ( 347)  594 148.8 4.6e-34
gi|125828613|ref|XP_690667.2| PREDICTED: hypotheti ( 344)  567 142.4 3.9e-32
gi|41946864|gb|AAH65992.1| Zgc:77067 [Danio rerio] ( 344)  567 142.4 3.9e-32
gi|47228546|emb|CAG05366.1| unnamed protein produc ( 353)  551 138.6 5.6e-31
gi|210083698|gb|EEA32282.1| hypothetical protein B ( 344)  459 116.8 2.1e-24
gi|210129222|gb|EEA76897.1| hypothetical protein B ( 364)  459 116.8 2.2e-24
gi|198413868|ref|XP_002128027.1| PREDICTED: simila ( 326)  451 114.8 7.4e-24
gi|221117580|ref|XP_002160902.1| PREDICTED: simila ( 300)  310  81.3 8.4e-14
gi|124392199|emb|CAK57733.1| unnamed protein produ ( 329)  307  80.6 1.5e-13
gi|156217078|gb|EDO38002.1| predicted protein [Nem ( 182)  297  78.1 4.7e-13
gi|115669539|ref|XP_001202258.1| PREDICTED: hypoth ( 136)  290  76.4 1.2e-12
gi|110760682|ref|XP_395833.3| PREDICTED: similar t ( 301)  292  77.0 1.6e-12
gi|211964906|gb|EEB00102.1| nuclear movement domai ( 384)  278  73.8 2e-11
gi|221484864|gb|EEE23154.1| nuclear movement domai ( 384)  278  73.8 2e-11
gi|221506082|gb|EEE31717.1| nuclear movement domai ( 384)  278  73.8 2e-11
gi|215505165|gb|EEC14659.1| nuclear distribution p ( 370)  265  70.7 1.7e-10
gi|89297339|gb|EAR95327.1| Nuclear movement protei (1380)  270  72.2 2.2e-10
gi|56758148|gb|AAW27214.1| SJCHGC02542 protein [Sc ( 334)  251  67.3 1.5e-09
gi|156549630|ref|XP_001604092.1| PREDICTED: simila ( 300)  247  66.4 2.7e-09
gi|209557898|gb|EEA07943.1| hypothetical protein,  ( 207)  245  65.8 2.8e-09
gi|194168877|gb|EDW83778.1| GK13789 [Drosophila wi ( 297)  235  63.5 1.9e-08
gi|194112161|gb|EDW34204.1| GL21703 [Drosophila pe ( 297)  234  63.3 2.3e-08
gi|190626590|gb|EDV42114.1| GF17174 [Drosophila an ( 303)  231  62.6 3.8e-08
gi|198132481|gb|EDY68284.1| GA26376 [Drosophila ps ( 294)  230  62.3 4.4e-08
gi|194154963|gb|EDW70147.1| GJ11725 [Drosophila vi ( 304)  228  61.8 6.2e-08
gi|45446520|gb|AAN13748.2| CG31251 [Drosophila mel ( 306)  228  61.8 6.3e-08
gi|91084479|ref|XP_971343.1| PREDICTED: similar to ( 271)  227  61.6 6.7e-08

>>gi|123245877|emb|CAM19835.1| NudC domain containing 3 (112 aa)
initn: 735 init1: 735 opt: 735 Z-score: 956.3 bits: 182.1 E(): 1.5e-44
Smith-Waterman score: 735; 100.000% identity (100.000% similar) in 107 aa overlap (1-107:6-112)

                    10        20        30        40        50
mKIAA1      RHRSGDAEVNLSKVGEYWWSAILEGEEPIDIDKINKERSMATVDEEEQAVLDRLT
            :::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|123 MLVSHRHRSGDAEVNLSKVGEYWWSAILEGEEPIDIDKINKERSMATVDEEEQAVLDRLT
               10        20        30        40        50        60

          60        70        80        90       100
mKIAA1 FDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMFNISPGAVQF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|123 FDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMFNISPGAVQF
               70        80        90       100       110

>>gi|56205673|emb|CAI24961.1| NudC domain containing 3 [ (251 aa)
initn: 681 init1: 681 opt: 685 Z-score: 886.8 bits: 170.4 E(): 1.1e-40
Smith-Waterman score: 685; 93.458% identity (95.327% similar) in 107 aa overlap (1-107:145-251)

                                             10        20        30
mKIAA1                               RHRSGDAEVNLSKVGEYWWSAILEGEEPID
                                     .:     .::::::::::::::::::::::
gi|562 KNPDSYNGAIRENYIWSQDYTDLEVRVPVPKHVMKGKQVNLSKVGEYWWSAILEGEEPID
          120       130       140       150       160       170

               40        50        60        70        80        90
mKIAA1 IDKINKERSMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|562 IDKINKERSMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRG
          180       190       200       210       220       230

              100
mKIAA1 QRFDPAMFNISPGAVQF
       :::::::::::::::::
gi|562 QRFDPAMFNISPGAVQF
          240       250

>>gi|26341230|dbj|BAC34277.1| unnamed protein product [M (315 aa)
initn: 681 init1: 681 opt: 685 Z-score: 885.3 bits: 170.4 E(): 1.3e-40
Smith-Waterman score: 685; 93.458% identity (95.327% similar) in 107 aa overlap (1-107:209-315)

                                             10        20        30
mKIAA1                               RHRSGDAEVNLSKVGEYWWSAILEGEEPID
                                     .:     .::::::::::::::::::::::
gi|263 KNPDSYNGAIRENYIWSQDYTDLEVRVPVPKHVMKGKQVNLSKVGEYWWSAILEGEEPID
      180       190       200       210       220       230

               40        50        60        70        80        90
mKIAA1 IDKINKERSMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|263 IDKINKERSMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRG
      240       250       260       270       280       290

              100
mKIAA1 QRFDPAMFNISPGAVQF
       :::::::::::::::::
gi|263 QRFDPAMFNISPGAVQF
      300       310

>>gi|149047665|gb|EDM00335.1| rCG35703, isoform CRA_b [R (315 aa)
initn: 681 init1: 681 opt: 685 Z-score: 885.3 bits: 170.4 E(): 1.3e-40
Smith-Waterman score: 685; 93.458% identity (95.327% similar) in 107 aa overlap (1-107:209-315)

                                             10        20        30
mKIAA1                               RHRSGDAEVNLSKVGEYWWSAILEGEEPID
                                     .:     .::::::::::::::::::::::
gi|149 KNPDSYNGAIRENYTWSQDYTDLEVRVPVPKHVVKGKQVNLSKVGEYWWSAILEGEEPID
      180       190       200       210       220       230

               40        50        60        70        80        90
mKIAA1 IDKINKERSMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|149 IDKINKERSMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRG
      240       250       260       270       280       290

              100
mKIAA1 QRFDPAMFNISPGAVQF
       :::::::::::::::::
gi|149 QRFDPAMFNISPGAVQF
      300       310

>>gi|34784256|gb|AAH57603.1| Nudcd3 protein [Mus musculu (291 aa)
initn: 679 init1: 679 opt: 679 Z-score: 878.1 bits: 169.0 E(): 3.3e-40
Smith-Waterman score: 679; 100.000% identity (100.000% similar) in 99 aa overlap (9-107:193-291)

                                     10        20        30
mKIAA1                       RHRSGDAEVNLSKVGEYWWSAILEGEEPIDIDKINKER
                                     ::::::::::::::::::::::::::::::
gi|347 GERVLMEGKLTHKINTESSLWSLEPGRCVLVNLSKVGEYWWSAILEGEEPIDIDKINKER
            170       180       190       200       210       220

       40        50        60        70        80        90
mKIAA1 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|347 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
            230       240       250       260       270       280

      100
mKIAA1 NISPGAVQF
       :::::::::
gi|347 NISPGAVQF
            290

>>gi|78100747|sp|Q8R1N4.3|NUDC3_MOUSE RecName: Full=NudC (363 aa)
initn: 679 init1: 679 opt: 679 Z-score: 876.7 bits: 169.0 E(): 4e-40
Smith-Waterman score: 679; 100.000% identity (100.000% similar) in 99 aa overlap (9-107:265-363)

                                     10        20        30
mKIAA1                       RHRSGDAEVNLSKVGEYWWSAILEGEEPIDIDKINKER
                                     ::::::::::::::::::::::::::::::
gi|781 GERVLMEGKLTHKINTESSLWSLEPGRCVLVNLSKVGEYWWSAILEGEEPIDIDKINKER
          240       250       260       270       280       290

       40        50        60        70        80        90
mKIAA1 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|781 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
          300       310       320       330       340       350

      100
mKIAA1 NISPGAVQF
       :::::::::
gi|781 NISPGAVQF
          360

>>gi|149047664|gb|EDM00334.1| rCG35703, isoform CRA_a [R (363 aa)
initn: 679 init1: 679 opt: 679 Z-score: 876.7 bits: 169.0 E(): 4e-40
Smith-Waterman score: 679; 100.000% identity (100.000% similar) in 99 aa overlap (9-107:265-363)

                                     10        20        30
mKIAA1                       RHRSGDAEVNLSKVGEYWWSAILEGEEPIDIDKINKER
                                     ::::::::::::::::::::::::::::::
gi|149 GERVLMEGKLTHKINTESSLWSLEPGKCVLVNLSKVGEYWWSAILEGEEPIDIDKINKER
          240       250       260       270       280       290

       40        50        60        70        80        90
mKIAA1 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|149 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
          300       310       320       330       340       350

      100
mKIAA1 NISPGAVQF
       :::::::::
gi|149 NISPGAVQF
          360

>>gi|90079665|dbj|BAE89512.1| unnamed protein product [M (220 aa)
initn: 675 init1: 675 opt: 675 Z-score: 874.8 bits: 168.0 E(): 5.1e-40
Smith-Waterman score: 675; 98.990% identity (100.000% similar) in 99 aa overlap (9-107:122-220)

                                     10        20        30
mKIAA1                       RHRSGDAEVNLSKVGEYWWSAILEGEEPIDIDKINKER
                                     :::::::::::.::::::::::::::::::
gi|900 GERVLMEGKLTHKINTESSLWSLEPGKCVLVNLSKVGEYWWNAILEGEEPIDIDKINKER
             100       110       120       130       140       150

       40        50        60        70        80        90
mKIAA1 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|900 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
             160       170       180       190       200       210

      100
mKIAA1 NISPGAVQF
       :::::::::
gi|900 NISPGAVQF
             220

>>gi|10436552|dbj|BAB14855.1| unnamed protein product [H (256 aa)
initn: 675 init1: 675 opt: 675 Z-score: 873.8 bits: 168.0 E(): 5.8e-40
Smith-Waterman score: 675; 98.990% identity (100.000% similar) in 99 aa overlap (9-107:122-220)

                                     10        20        30
mKIAA1                       RHRSGDAEVNLSKVGEYWWSAILEGEEPIDIDKINKER
                                     :::::::::::.::::::::::::::::::
gi|104 GERVLMEGKLTHKINTESSLWSLEPGKCVLVNLSKVGEYWWNAILEGEEPIDIDKINKER
             100       110       120       130       140       150

       40        50        60        70        80        90
mKIAA1 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|104 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
             160       170       180       190       200       210

      100
mKIAA1 NISPGAVQF
       :::::::::
gi|104 NISPGAVQFNDQKERKPSPVGRQSLILGCPSWLPAFQGLARLVYP
             220       230       240       250

>>gi|119581504|gb|EAW61100.1| NudC domain containing 3, (290 aa)
initn: 675 init1: 675 opt: 675 Z-score: 873.0 bits: 168.0 E(): 6.4e-40
Smith-Waterman score: 675; 98.990% identity (100.000% similar) in 99 aa overlap (9-107:192-290)

                                     10        20        30
mKIAA1                       RHRSGDAEVNLSKVGEYWWSAILEGEEPIDIDKINKER
                                     :::::::::::.::::::::::::::::::
gi|119 GERVLMEGKLTHKINTESSLWSLEPGKCVLVNLSKVGEYWWNAILEGEEPIDIDKINKER
             170       180       190       200       210       220

       40        50        60        70        80        90
mKIAA1 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|119 SMATVDEEEQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFRGQRFDPAMF
             230       240       250       260       270       280

      100
mKIAA1 NISPGAVQF
       :::::::::
gi|119 NISPGAVQF
             290

107 residues in 1 query sequences
2727779818 residues in 7921681 library sequences
Tcomplib [34.26] (2 proc)
start: Thu Mar 12 12:37:23 2009 done: Thu Mar 12 12:41:05 2009
Total Scan time: 517.750 Total Display time: 0.010

Function used was FASTA [version 34.26.5 April 26, 2007]