# /hgtech/tools/fasta-34.26.5_v890/fasta34_t -T 8 -b50 -d10 -E0.01 -H -O./tmp/ha00771.fasta.nr -Q ../query/KIAA0101.ptfa /cdna2/lib/nr/nr 2
 FASTA searches a protein or DNA sequence data bank
 version 34.26.5 April 26, 2007
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

KIAA0101, 130 aa
 vs /cdna2/lib/nr/nr library

2693465022 residues in 7827732 sequences
 statistics sampled from 60000 to 7826513 sequences
 Expectation_n fit: rho(ln(x))= 4.6278+/-0.000185; mu= 8.8222+/- 0.010
 mean_var=64.7396+/-12.499, 0's: 31 Z-trim: 32 B-trim: 1476 in 1/64
 Lambda= 0.159400

FASTA (3.5 Sept 2006) function [optimized, BL50 matrix (15:-5)] ktup: 2
 join: 36, opt: 24, open/ext: -10/-2, width: 16
The best scores are:                                       opt bits E(7827732)
gi|148694161|gb|EDL26108.1| mCG131663, isoform CRA ( 140)  667 161.1 4.6e-38
gi|52783100|sp|Q6RIA2.1|PAF_RAT RecName: Full=PCNA ( 110)  649 156.9 6.8e-37
gi|119598089|gb|EAW77683.1| hCG2039386, isoform CR ( 107)  645 155.9 1.3e-36
gi|119598086|gb|EAW77680.1| hCG2039386, isoform CR ( 115)  637 154.1 4.7e-36
gi|52783147|sp|Q9CQX4.1|PAF_MOUSE RecName: Full=PC ( 110)  634 153.4 7.4e-36
gi|12850259|dbj|BAB28650.1| unnamed protein produc ( 110)  624 151.1 3.6e-35
gi|148707428|gb|EDL39375.1| mCG1047258 [Mus muscul ( 110)  615 149.0 1.5e-34
gi|74000861|ref|XP_853346.1| PREDICTED: similar to ( 100)  602 146.0 1.1e-33
gi|149245224|ref|XP_001472289.1| PREDICTED: hypoth ( 160)  602 146.2 1.6e-33
gi|194035027|ref|XP_001927039.1| PREDICTED: simila ( 163)  583 141.8 3.4e-32
gi|126277352|ref|XP_001375098.1| PREDICTED: simila ( 117)  549 133.9 5.9e-30
gi|149632219|ref|XP_001509145.1| PREDICTED: simila ( 200)  454 112.2 3.3e-23
gi|47213233|emb|CAF89754.1| unnamed protein produc ( 114)  449 110.9 4.9e-23
gi|50603813|gb|AAH77667.1| MGC89765 protein [Xenop ( 125)  446 110.2 8.4e-23
gi|57032577|gb|AAH88968.1| LOC496365 protein [Xeno ( 123)  430 106.5 1.1e-21
gi|221220744|gb|ACM09033.1| PCNA-associated factor ( 113)  429 106.3 1.2e-21
gi|209735044|gb|ACI68391.1| PCNA-associated factor ( 113)  408 101.4 3.3e-20
gi|148703932|gb|EDL35879.1| mCG49332 [Mus musculus (  87)  300  76.5 8.2e-13
gi|210104307|gb|EEA52332.1| hypothetical protein B ( 105)  268  69.2 1.6e-10
gi|119598088|gb|EAW77682.1| hCG2039386, isoform CR (  58)  250  64.9 1.8e-09
gi|71773819|ref|NP_001025160.1| hypothetical prote (  65)  250  64.9 1.9e-09
gi|198424847|ref|XP_002131342.1| PREDICTED: simila ( 213)  204  54.8 7.1e-06
gi|156544534|ref|XP_001607698.1| PREDICTED: simila ( 140)  168  46.3  0.0016
gi|221130338|ref|XP_002163831.1| PREDICTED: simila ( 161)  168  46.4  0.0018

>>gi|148694161|gb|EDL26108.1| mCG131663, isoform CRA_b [ (140 aa)
 initn: 668 init1: 473 opt: 667 Z-score: 839.6 bits: 161.1 E(): 4.6e-38
Smith-Waterman score: 667; 83.051% identity (95.763% similar) in 118 aa overlap (13-130:24-140)

                          10        20        30        40
KIAA01            NTLGWEVSSFSPLLSSCLNMVRTKADSVPGTYRKVVAARAPRKVLGSST
                              ..:::.:::::::. :::.:::.::..::::::::::
gi|148 TVAGTSFSGSSLIETSAVGVTVKVVSSCVNMVRTKANYVPGAYRKAVASQAPRKVLGSST
               10        20        30        40        50        60

      50        60        70        80        90       100
KIAA01 SATNSTSVSSRKAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSG
        .:::.: ::::::::::::::::::::::::::::::::::::.:.:::: :::::.::
gi|148 FVTNSSS-SSRKAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKESKKENQAPEEAGTSG
                70        80        90       100       110

     110       120       130
KIAA01 LGKAKRKACPLQPDHTNDEKE
       ::::::::::::::: .::.:
gi|148 LGKAKRKACPLQPDHRDDENE
     120       130       140

>>gi|52783100|sp|Q6RIA2.1|PAF_RAT RecName: Full=PCNA-ass (110 aa)
 initn: 652 init1: 488 opt: 649 Z-score: 818.6 bits: 156.9 E(): 6.8e-37
Smith-Waterman score: 649; 87.387% identity (96.396% similar) in 111 aa overlap (20-130:1-110)

               10        20        30        40        50        60
KIAA01 NTLGWEVSSFSPLLSSCLNMVRTKADSVPGTYRKVVAARAPRKVLGSSTSATNSTSVSSR
                          ::::::. :::.::::::..:::::::::: .:::.. :::
gi|527                    MVRTKANYVPGAYRKVVASQAPRKVLGSSTFVTNSSG-SSR
                                  10        20        30         40

               70        80        90       100       110       120
KIAA01 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRKACPL
       :::::::::::::::::::::::::::::::::::.::::::::::::::::::::::::
gi|527 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSKKENQIPEEAGSSGLGKAKRKACPL
               50        60        70        80        90       100

              130
KIAA01 QPDHTNDEKE
       :::: .::.:
gi|527 QPDHRDDENE
              110

>>gi|119598089|gb|EAW77683.1| hCG2039386, isoform CRA_d (107 aa)
 initn: 642 init1: 642 opt: 645 Z-score: 813.8 bits: 155.9 E(): 1.3e-36
Smith-Waterman score: 645; 95.098% identity (98.039% similar) in 102 aa overlap (30-130:6-107)

               10        20        30         40        50
KIAA01 NTLGWEVSSFSPLLSSCLNMVRTKADSVPGTY-RKVVAARAPRKVLGSSTSATNSTSVSS
                                    : : ...::::::::::::::::::::::::
gi|119                         NKTARGRYCKRLVAARAPRKVLGSSTSATNSTSVSS
                                       10        20        30

      60        70        80        90       100       110
KIAA01 RKAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRKACP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|119 RKAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRKACP
         40        50        60        70        80        90

     120       130
KIAA01 LQPDHTNDEKE
       :::::::::::
gi|119 LQPDHTNDEKE
        100

>>gi|119598086|gb|EAW77680.1| hCG2039386, isoform CRA_b (115 aa)
 initn: 637 init1: 637 opt: 637 Z-score: 803.4 bits: 154.1 E(): 4.7e-36
Smith-Waterman score: 637; 90.826% identity (95.413% similar) in 109 aa overlap (20-128:1-109)

               10        20        30        40        50        60
KIAA01 NTLGWEVSSFSPLLSSCLNMVRTKADSVPGTYRKVVAARAPRKVLGSSTSATNSTSVSSR
                          :::::::::::::::::::::::::::::::::::::::::
gi|119                    MVRTKADSVPGTYRKVVAARAPRKVLGSSTSATNSTSVSSR
                                  10        20        30        40

               70        80        90       100       110       120
KIAA01 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRKACPL
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::.  :
gi|119 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRNQEPP
              50        60        70        80        90       100

              130
KIAA01 QPDHTNDEKE
         .::...
gi|119 ILQHTEQQAGHTAH
             110

>>gi|52783147|sp|Q9CQX4.1|PAF_MOUSE RecName: Full=PCNA-a (110 aa)
 initn: 637 init1: 473 opt: 634 Z-score: 800.0 bits: 153.4 E(): 7.4e-36
Smith-Waterman score: 634; 84.685% identity (95.495% similar) in 111 aa overlap (20-130:1-110)

               10        20        30        40        50        60
KIAA01 NTLGWEVSSFSPLLSSCLNMVRTKADSVPGTYRKVVAARAPRKVLGSSTSATNSTSVSSR
                          ::::::. :::.:::.::..:::::::::: .:::.: :::
gi|527                    MVRTKANYVPGAYRKAVASQAPRKVLGSSTFVTNSSS-SSR
                                  10        20        30         40

               70        80        90       100       110       120
KIAA01 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRKACPL
       :::::::::::::::::::::::::::::::::.:.:::: :::::.:::::::::::::
gi|527 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKESKKENQAPEEAGTSGLGKAKRKACPL
               50        60        70        80        90       100

              130
KIAA01 QPDHTNDEKE
       :::: .::.:
gi|527 QPDHRDDENE
              110

>>gi|12850259|dbj|BAB28650.1| unnamed protein product [M (110 aa)
 initn: 628 init1: 463 opt: 624 Z-score: 787.5 bits: 151.1 E(): 3.6e-35
Smith-Waterman score: 624; 83.784% identity (94.595% similar) in 111 aa overlap (20-130:1-110)

               10        20        30        40        50        60
KIAA01 NTLGWEVSSFSPLLSSCLNMVRTKADSVPGTYRKVVAARAPRKVLGSSTSATNSTSVSSR
                          ::::::. :::.:::.::..:::::::::: .:::.: ::
gi|128                    MVRTKANYVPGAYRKAVASQAPRKVLGSSTFVTNSSS-SSG
                                  10        20        30         40

               70        80        90       100       110       120
KIAA01 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRKACPL
       :::::::::::::::::::::::::::::::::.:.:::: :::::.:::::::::::::
gi|128 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKESKKENQAPEEAGTSGLGKAKRKACPL
               50        60        70        80        90       100

              130
KIAA01 QPDHTNDEKE
       :::: .::.:
gi|128 QPDHRDDENE
              110

>>gi|148707428|gb|EDL39375.1| mCG1047258 [Mus musculus] (110 aa)
 initn: 618 init1: 454 opt: 615 Z-score: 776.3 bits: 149.0 E(): 1.5e-34
Smith-Waterman score: 615; 82.883% identity (94.595% similar) in 111 aa overlap (20-130:1-110)

               10        20        30        40        50        60
KIAA01 NTLGWEVSSFSPLLSSCLNMVRTKADSVPGTYRKVVAARAPRKVLGSSTSATNSTSVSSR
                          ::::::. :::.:::.::..:::::::::: .:::.: :::
gi|148                    MVRTKANYVPGAYRKAVASQAPRKVLGSSTFVTNSSS-SSR
                                  10        20        30         40

               70        80        90       100       110       120
KIAA01 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRKACPL
       ::::::::::::::::::::::::::::::: :.:..:.: :::::::::::::::::::
gi|148 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSAKESKEESQAPEEAGSSGLGKAKRKACPL
               50        60        70        80        90       100

              130
KIAA01 QPDHTNDEKE
       :::: .::.:
gi|148 QPDHRDDENE
              110

>>gi|74000861|ref|XP_853346.1| PREDICTED: similar to HCV (100 aa)
 initn: 602 init1: 602 opt: 602 Z-score: 760.7 bits: 146.0 E(): 1.1e-33
Smith-Waterman score: 602; 93.814% identity (98.969% similar) in 97 aa overlap (20-116:1-97)

               10        20        30        40        50        60
KIAA01 NTLGWEVSSFSPLLSSCLNMVRTKADSVPGTYRKVVAARAPRKVLGSSTSATNSTSVSSR
                          ::::::::::::::::::.::::::::::::::::: .:::
gi|740                    MVRTKADSVPGTYRKVVASRAPRKVLGSSTSATNSTPLSSR
                                  10        20        30        40

               70        80        90       100       110       120
KIAA01 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRKACPL
       :.:::::::::::::::::::::::::::::::::::::.::::::::::::::.:
gi|740 KVENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENRIPEEAGSSGLGKAKKKKIQ
              50        60        70        80        90       100

              130
KIAA01 QPDHTNDEKE

>>gi|149245224|ref|XP_001472289.1| PREDICTED: hypothetic (160 aa)
 initn: 620 init1: 473 opt: 602 Z-score: 758.0 bits: 146.2 E(): 1.6e-33
Smith-Waterman score: 602; 77.119% identity (90.678% similar) in 118 aa overlap (13-130:51-160)

                                 10        20        30        40
KIAA01                   NTLGWEVSSFSPLLSSCLNMVRTKADSVPGTYRKVVAARAPR
                                     ..:::.:.:       ::.:::.::..:::
gi|149 SLRLFAQDRTKQKRRGSSLIETSAVGVSVKVVSSCVNYV-------PGAYRKAVASQAPR
               30        40        50               60        70

             50        60        70        80        90       100
KIAA01 KVLGSSTSATNSTSVSSRKAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIP
       ::::::: .:.:.: ::::::::::::::::::::::::::::::::::::.:.:::: :
gi|149 KVLGSSTFVTSSSS-SSRKAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKESKKENQAP
            80         90       100       110       120       130

            110       120       130
KIAA01 EEAGSSGLGKAKRKACPLQPDHTNDEKE
       ::::.::::::::::::::::: .::.:
gi|149 EEAGTSGLGKAKRKACPLQPDHRDDENE
            140       150       160

>>gi|194035027|ref|XP_001927039.1| PREDICTED: similar to (163 aa)
 initn: 414 init1: 414 opt: 583 Z-score: 734.3 bits: 141.8 E(): 3.4e-32
Smith-Waterman score: 583; 94.792% identity (97.917% similar) in 96 aa overlap (20-115:1-95)

               10        20        30        40        50        60
KIAA01 NTLGWEVSSFSPLLSSCLNMVRTKADSVPGTYRKVVAARAPRKVLGSSTSATNSTSVSSR
                          ::::::::::::::::::.::::::::::::: ::::.:::
gi|194                    MVRTKADSVPGTYRKVVASRAPRKVLGSSTSA-NSTSLSSR
                                  10        20        30         40

               70        80        90       100       110       120
KIAA01 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRKACPL
       ::::::::::::::::::::::::::::::: :::::::.:::::::::::::::
gi|194 KAENKYAGGNPVCVRPTPKWQKGIGEFFRLSAKDSEKENRIPEEAGSSGLGKAKRNQSHG
               50        60        70        80        90       100

              130
KIAA01 QPDHTNDEKE

gi|194 LSFTPSTQPTQGAVISMDKEMVARLSLGLLLLVLLLPGQIYSSIPLTSTEDTPHETTTSS
              110       120       130       140       150       160

130 residues in 1 query sequences
2693465022 residues in 7827732 library sequences
 Tcomplib [34.26] (8 proc)
 start: Tue Mar 3 20:47:33 2009 done: Tue Mar 3 20:54:06 2009
 Total Scan time: 1056.460 Total Display time: 0.020

Function used was FASTA [version 34.26.5 April 26, 2007]