# /hgtech/tools/fasta-34.26.5_v890/fasta34_t -T 8 -b50 -d10 -E0.01 -H -O./tmp/ha01551.fasta.nr -Q ../query/KIAA0047.ptfa /cdna2/lib/nr/nr 2
FASTA searches a protein or DNA sequence data bank
 version 34.26.5 April 26, 2007
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

KIAA0047, 268 aa
 vs /cdna2/lib/nr/nr library

2693465022 residues in 7827732 sequences
 statistics sampled from 60000 to 7819697 sequences
 Expectation_n fit: rho(ln(x))= 5.4933+/-0.000194; mu= 8.9187+/- 0.011
 mean_var=104.8600+/-20.065, 0's: 35 Z-trim: 57 B-trim: 9 in 1/66
 Lambda= 0.125248

FASTA (3.5 Sept 2006) function [optimized, BL50 matrix (15:-5)] ktup: 2
 join: 36, opt: 24, open/ext: -10/-2, width: 16

The best scores are:                                       opt  bits E(7827732)
gi|119587119|gb|EAW66715.1| procollagen (type III) ( 332) 1988 369.1 5.3e-100
gi|1354931|gb|AAC50775.1| PRSM1                    ( 318) 1974 366.6 3e-99
gi|134019487|ref|NP_001076783.1| chromatin modifyi ( 240) 1746 325.2 6.2e-87
gi|119587120|gb|EAW66716.1| procollagen (type III) ( 135)  881 168.7 4.7e-40
gi|117589|sp|P04922.1|CSP_PLAKU RecName: Full=Circ ( 351)  212  48.2 0.0022
gi|2088834|gb|AAB54250.1| Blistered cuticle protei ( 311)  211  48.0 0.0023
gi|187037393|emb|CAP24059.1| C. briggsae CBR-BLI-2 ( 311)  210  47.8 0.0026
gi|15077111|gb|AAK83075.1| collagen [Meloidogyne j ( 345)  210  47.9 0.0028
gi|187029342|emb|CAP31685.1| C. briggsae CBR-DPY-1 ( 320)  209  47.6 0.003
gi|7206749|gb|AAF39908.1| Dumpy : shorter than wil ( 322)  204  46.7 0.0057
gi|188988363|gb|ACD67726.1| circumsporozoite prote ( 351)  201  46.2 0.0088
gi|187021044|emb|CAP39626.1| C.
briggsae CBR-COL-3 ( 316) 200 46.0 0.0093 >>gi|119587119|gb|EAW66715.1| procollagen (type III) N-e (332 aa) initn: 1988 init1: 1988 opt: 1988 Z-score: 1951.5 bits: 369.1 E(): 5.3e-100 Smith-Waterman score: 1988; 99.627% identity (99.627% similar) in 268 aa overlap (1-268:65-332) 10 20 30 KIAA00 PDTARSWRPFSLSECCSCHCGHGRYPVPVE :::::::::::::::::::::::::::::: gi|119 SREHPVRPEVGRRPRKSPPGAAWSVRSPPGPDTARSWRPFSLSECCSCHCGHGRYPVPVE 40 50 60 70 80 90 40 50 60 70 80 90 KIAA00 VHGEAAGEAGQEGGEGLQGGAGQSEEGPSAEKCRVCPCVCRERHPQEERRCELASDGVPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|119 VHGEAAGEAGQEGGEGLQGGAGQSEEGPSAEKCRVCPCVCRERHPQEERRCELASDGVPR 100 110 120 130 140 150 100 110 120 130 140 150 KIAA00 RRSGLQGADSCDYEGGDQEYGPGDQSPGQGPEHHGPAEGLLSDGQVRAAGAEPGRPYIGD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|119 RRSGLQGADSCDYEGGDQEYGPGDQSPGQGPEHHGPAEGLLSDGQVRAAGAEPGRPYIGD 160 170 180 190 200 210 160 170 180 190 200 210 KIAA00 GGLHELGHHPDHAAGAGGQPHHADRRGEWPGGAGPAQPAARGRLCRGRELCAQPGGPAVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|119 GGLHELGHHPDHAAGAGGQPHHADRRGEWPGGAGPAQPAARGRLCRGRELCAQPGGPAVT 220 230 240 250 260 270 220 230 240 250 260 KIAA00 EVGRLEELAVPRRFAPPLPRDVLEGSCPLPTASCLCADPAGLRPAATLRLSPARPAWP ::::::::::::: :::::::::::::::::::::::::::::::::::::::::::: gi|119 EVGRLEELAVPRRCAPPLPRDVLEGSCPLPTASCLCADPAGLRPAATLRLSPARPAWP 280 290 300 310 320 330 >>gi|1354931|gb|AAC50775.1| PRSM1 (318 aa) initn: 1974 init1: 1974 opt: 1974 Z-score: 1938.1 bits: 366.6 E(): 3e-99 Smith-Waterman score: 1974; 98.881% identity (99.254% similar) in 268 aa overlap (1-268:51-318) 10 20 30 KIAA00 PDTARSWRPFSLSECCSCHCGHGRYPVPVE :::::::::::::::::::::::::::::: gi|135 REEHPVRPEVGRRPRKSPPGAAWSVRSPPGPDTARSWRPFSLSECCSCHCGHGRYPVPVE 30 40 50 60 70 80 40 50 60 70 80 90 KIAA00 VHGEAAGEAGQEGGEGLQGGAGQSEEGPSAEKCRVCPCVCRERHPQEERRCELASDGVPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|135 VHGEAAGEAGQEGGEGLQGGAGQSEEGPSAEKCRVCPCVCRERHPQEERRCELASDGVPR 
90 100 110 120 130 140 100 110 120 130 140 150 KIAA00 RRSGLQGADSCDYEGGDQEYGPGDQSPGQGPEHHGPAEGLLSDGQVRAAGAEPGRPYIGD :::::::. ::::::::::::::::::::::::::::::::::::::::::::::::::: gi|135 RRSGLQGGHSCDYEGGDQEYGPGDQSPGQGPEHHGPAEGLLSDGQVRAAGAEPGRPYIGD 150 160 170 180 190 200 160 170 180 190 200 210 KIAA00 GGLHELGHHPDHAAGAGGQPHHADRRGEWPGGAGPAQPAARGRLCRGRELCAQPGGPAVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|135 GGLHELGHHPDHAAGAGGQPHHADRRGEWPGGAGPAQPAARGRLCRGRELCAQPGGPAVT 210 220 230 240 250 260 220 230 240 250 260 KIAA00 EVGRLEELAVPRRFAPPLPRDVLEGSCPLPTASCLCADPAGLRPAATLRLSPARPAWP ::::::::::::: :::::::::::::::::::::::::::::::::::::::::::: gi|135 EVGRLEELAVPRRCAPPLPRDVLEGSCPLPTASCLCADPAGLRPAATLRLSPARPAWP 270 280 290 300 310 >>gi|134019487|ref|NP_001076783.1| chromatin modifying p (240 aa) initn: 1746 init1: 1746 opt: 1746 Z-score: 1716.9 bits: 325.2 E(): 6.2e-87 Smith-Waterman score: 1746; 98.750% identity (99.583% similar) in 240 aa overlap (29-268:1-240) 10 20 30 40 50 60 KIAA00 PDTARSWRPFSLSECCSCHCGHGRYPVPVEVHGEAAGEAGQEGGEGLQGGAGQSEEGPSA ..:::::::::::::::::::::::::::::: gi|134 MDVHGEAAGEAGQEGGEGLQGGAGQSEEGPSA 10 20 30 70 80 90 100 110 120 KIAA00 EKCRVCPCVCRERHPQEERRCELASDGVPRRRSGLQGADSCDYEGGDQEYGPGDQSPGQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|134 EKCRVCPCVCRERHPQEERRCELASDGVPRRRSGLQGADSCDYEGGDQEYGPGDQSPGQG 40 50 60 70 80 90 130 140 150 160 170 180 KIAA00 PEHHGPAEGLLSDGQVRAAGAEPGRPYIGDGGLHELGHHPDHAAGAGGQPHHADRRGEWP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|134 PEHHGPAEGLLSDGQVRAAGAEPGRPYIGDGGLHELGHHPDHAAGAGGQPHHADRRGEWP 100 110 120 130 140 150 190 200 210 220 230 240 KIAA00 GGAGPAQPAARGRLCRGRELCAQPGGPAVTEVGRLEELAVPRRFAPPLPRDVLEGSCPLP ::::::::::::::::::::::::::::::::::::::::::: :::::::::::::::: gi|134 GGAGPAQPAARGRLCRGRELCAQPGGPAVTEVGRLEELAVPRRCAPPLPRDVLEGSCPLP 160 170 180 190 200 210 250 260 KIAA00 TASCLCADPAGLRPAATLRLSPARPAWP :::::::::::::::::::::::::::: gi|134 TASCLCADPAGLRPAATLRLSPARPAWP 220 230 240 
>>gi|119587120|gb|EAW66716.1| procollagen (type III) N-e (135 aa) initn: 881 init1: 881 opt: 881 Z-score: 875.3 bits: 168.7 E(): 4.7e-40 Smith-Waterman score: 881; 99.167% identity (99.167% similar) in 120 aa overlap (149-268:16-135) 120 130 140 150 160 170 KIAA00 QGPEHHGPAEGLLSDGQVRAAGAEPGRPYIGDGGLHELGHHPDHAAGAGGQPHHADRRGE :::::::::::::::::::::::::::::: gi|119 MGLAPPVVQSMPLPPGDGGLHELGHHPDHAAGAGGQPHHADRRGE 10 20 30 40 180 190 200 210 220 230 KIAA00 WPGGAGPAQPAARGRLCRGRELCAQPGGPAVTEVGRLEELAVPRRFAPPLPRDVLEGSCP ::::::::::::::::::::::::::::::::::::::::::::: :::::::::::::: gi|119 WPGGAGPAQPAARGRLCRGRELCAQPGGPAVTEVGRLEELAVPRRCAPPLPRDVLEGSCP 50 60 70 80 90 100 240 250 260 KIAA00 LPTASCLCADPAGLRPAATLRLSPARPAWP :::::::::::::::::::::::::::::: gi|119 LPTASCLCADPAGLRPAATLRLSPARPAWP 110 120 130 >>gi|117589|sp|P04922.1|CSP_PLAKU RecName: Full=Circumsp (351 aa) initn: 83 init1: 83 opt: 212 Z-score: 216.9 bits: 48.2 E(): 0.0022 Smith-Waterman score: 212; 31.553% identity (53.883% similar) in 206 aa overlap (29-226:42-235) 10 20 30 40 50 KIAA00 PDTARSWRPFSLSECCSCHCGHGRYPVPVEVHGEAAGEAGQEGGEGLQGGAGQSEEGP :.. . .:... : ...: .: . . .:: gi|117 ILLVDLLPTHFEHNVDLSRAINVNGVSFNNVDTSSLGAAQVRQSASRG-RGLGEKPKEGA 20 30 40 50 60 70 60 70 80 90 100 110 KIAA00 SAEKCRVCPCVCRERHPQEERRCELASDGVPRRRSGLQGADSCDYEGGDQEY-GPGDQSP . :: . .:..:.. . .: . : .: : . ::.: : : ..: gi|117 DKEKKKEKEKE-KEEEPKKPNENKLKQPEQPA--AGAGGEQPAAGAGGEQPAAGAGGEQP 80 90 100 110 120 120 130 140 150 160 170 KIAA00 GQGPEHHGPAEGLLSDGQVRAAGAEPGRPYIGDGGLHEL----GHHPDHAAGAGGQPHHA . : . . :: : . :. :::: .: : :: . :..: ::::::. : gi|117 AAGARGEQPAAG--AGGEQPAAGAGGEQPAAGAGGEQPAAGAGGEQP--AAGAGGEQPAA 130 140 150 160 170 180 180 190 200 210 220 230 KIAA00 DRRGEWPG-GAGPAQPAARGRLCRGRELCAQPGG--PAVTEVGRLEELAVPRRFAPPLPR ::: :. ::: :::: : :.. : : ::. :. . 
.::: : gi|117 GARGEQPAAGAGGEQPAA-G--AGGEQPAAGARGEQPAAGAGGE-QPAPAPRREQPAPGA 190 200 210 220 230 240 250 260 KIAA00 DVLEGSCPLPTASCLCADPAGLRPAATLRLSPARPAWP gi|117 VAGDGARGGNAGAGKGQGQNNQGANVPNEKVVNDYLHKIRSSVTTEWTPCSVTCGNGVRI 240 250 260 270 280 290 >>gi|2088834|gb|AAB54250.1| Blistered cuticle protein 2 (311 aa) initn: 103 init1: 61 opt: 211 Z-score: 216.6 bits: 48.0 E(): 0.0023 Smith-Waterman score: 211; 29.268% identity (47.805% similar) in 205 aa overlap (14-204:114-305) 10 20 30 40 KIAA00 PDTARSWRPFSLSECCSCHCGHGRYPVPVEVHGEAAGEAGQEG .::::. : . : : :: :. :..: gi|208 KKQAGYDFAESNTNAESGFSSSKSSLAPGGQCCSCKTGPSGPPGP---PGED-GRDGRDG 90 100 110 120 130 50 60 70 80 90 100 KIAA00 GEGLQGGAGQSEEGPSAEKCRVCPCV-CRERHPQEERRCELASDGVPRR--RSGLQGADS ::.: : . . . .. . :: : : ...: : : : ..:: :. . gi|208 KPGLNGEDGTDAKDSAPRRDAAAPCYDCPVGPPGPPG--NIGSKGQPGRNGKDGLPGVPG 140 150 160 170 180 190 110 120 130 140 150 KIAA00 CDYEGGDQ-EYG-PG-DQSPGQGPEHHGPAEGLLSDGQVRAAGAEPGRP----YIGDGGL . :. . : :: : .::: .. :.. : .: .: . :: : : :: gi|208 LPGQPGEPGDDGEPGEDGDPGQPGDNGEPGK---CD-EVNVAQGPPGSPGPPGLPGPDGL 200 210 220 230 240 250 160 170 180 190 200 KIAA00 HELGHHP--DHAAGAGGQPHHADRRGE--WPGGAGPAQPAARGRLCRGRELCAQPGGPAV .: : : .:.: . . :. :: :: . : : : : : gi|208 PGTPGNPGQDGEQGPAGEPGRDGKDGQPGRPGQPGPPGEPGTGGGC---EHCPTPRTAPG 260 270 280 290 300 310 210 220 230 240 250 260 KIAA00 TEVGRLEELAVPRRFAPPLPRDVLEGSCPLPTASCLCADPAGLRPAATLRLSPARPAWP gi|208 Y >>gi|187037393|emb|CAP24059.1| C. briggsae CBR-BLI-2 pro (311 aa) initn: 106 init1: 51 opt: 210 Z-score: 215.6 bits: 47.8 E(): 0.0026 Smith-Waterman score: 210; 29.756% identity (47.317% similar) in 205 aa overlap (14-204:114-305) 10 20 30 40 KIAA00 PDTARSWRPFSLSECCSCHCGHGRYPVPVEVHGEAAGEAGQEG .::::. : . : : :: :. :..: gi|187 KKQAGYDFSESNTNSESGFSSSKSSLSPGGQCCSCKTGPSGPPGP---PGED-GRDGRDG 90 100 110 120 130 50 60 70 80 90 100 KIAA00 GEGLQGGAGQSEEGPSAEKCRVCPCV-CRERHPQEERRCELASDGVPRR--RSGLQGADS ::.: : . . .:.. :: : : ... : : ..:: :. . 
gi|187 KPGLNGEDGTDAKDSGARRDTSAPCYDCPVGPPGPPG--NIGPKGQSGRNGKDGLPGVPG 140 150 160 170 180 190 110 120 130 140 150 KIAA00 CDYEGGDQ-EYG-PG-DQSPGQGPEHHGPAEGLLSDGQVRAAGAEPGRP----YIGDGGL . :. : : :: : .::: .. :.. : .: .: . :::: : :: gi|187 LPGQPGEPGEDGQPGEDGDPGQPGDNGEPGK---CD-EVNVAQGPPGRPGPPGLPGPDGL 200 210 220 230 240 250 160 170 180 190 200 KIAA00 HELGHHP--DHAAGAGGQPHHADRRGE--WPGGAGPAQPAARGRLCRGRELCAQPGGPAV .: : : .:.: . . :. :: :: . : : : : : gi|187 PGTPGNPGQDGEQGPAGEPGRDGKDGQPGRPGQPGPPGEPGTGGGC---EHCPTPRTAPG 260 270 280 290 300 310 210 220 230 240 250 260 KIAA00 TEVGRLEELAVPRRFAPPLPRDVLEGSCPLPTASCLCADPAGLRPAATLRLSPARPAWP gi|187 Y >>gi|15077111|gb|AAK83075.1| collagen [Meloidogyne javan (345 aa) initn: 125 init1: 56 opt: 210 Z-score: 215.0 bits: 47.9 E(): 0.0028 Smith-Waterman score: 215; 30.244% identity (46.341% similar) in 205 aa overlap (15-207:143-330) 10 20 30 40 KIAA00 PDTARSWRPFSLSECCSCHCGHGRYPVPVEVHGEAA--GEAGQE ::.:: : : :. .:: . :: : . gi|150 SAISQGPSGATYGQGAAGYQPVVAPKPAPVCCTCHQGP---PGPIGPEGEPGPDGEDGPN 120 130 140 150 160 50 60 70 80 90 KIAA00 GGEGLQGGAGQSEEGPSAEKCRVCP-CVCRERHPQEERR-----CELASDGVPRRRS--G : .: .: .. .: : .:: . : . : .:::: ... : gi|150 GKDGTSGKDARILPAPLEPPCIICPPGPAGPQGPAGAKGPPGSLGEPPKDGVPGEQGMVG 170 180 190 200 210 220 100 110 120 130 140 150 KIAA00 LQGADSCD-YEGGDQEYG-PGDQSPGQGPEHHGPAEGLLSDGQVRAAGAEPGRPYIGDGG .: . :: : :: : ::. ::: : ..: :: : : .: gi|150 QHGPPGRPGREGPRGAPGSPGRLIPVPGPQ--GPA------GPPGVVGP-PGAP--GAAG 230 240 250 260 270 160 170 180 190 200 210 KIAA00 LHELGHHPDHAAGAGGQPHHADRRGEWPGGAGPAQPAARGRLCRGRELCAQPGGPAVTEV :. . : :.: . :.:. ::: ::: : .. . : : .: : gi|150 --PPGQSFEGPPGPPGEPGRPGREGR-PGGPGPAGPPGQDGEKGSCEHCPEPRTPPGYFA 280 290 300 310 320 330 220 230 240 250 260 KIAA00 GRLEELAVPRRFAPPLPRDVLEGSCPLPTASCLCADPAGLRPAATLRLSPARPAWP gi|150 EASAKSGGYH 340 >>gi|187029342|emb|CAP31685.1| C. 
briggsae CBR-DPY-14 pr (320 aa) initn: 91 init1: 45 opt: 209 Z-score: 214.4 bits: 47.6 E(): 0.003 Smith-Waterman score: 209; 29.612% identity (49.029% similar) in 206 aa overlap (14-207:125-317) 10 20 30 40 KIAA00 PDTARSWRPFSLSECCSCHCGHGRYPVPV--EVHGEAAGEAGQ .::.:. :.. : : . : : :: gi|187 GYGSGPADSGYGAQGTNNGYGPVVNAEPEPQCCTCQQGKAGPPGPPGDDGHDGKDGSAGA 100 110 120 130 140 150 50 60 70 80 90 100 KIAA00 EGGEGLQGGAGQSEEGPSAEKCRVCPCVCRERHPQEERRCELASDGVPRRRSGLQGADSC .. .: .::.: :. : ..: : .:: . . . .: :: ::.:.:. gi|187 DAKNGKDGGVGPSD-GLQSEPCVICPPGAQGLGGAPGAK---GPQG-PRGSPGLSGVDGR 160 170 180 190 200 110 120 130 140 150 KIAA00 DYEGGDQEYGPGDQSPGQGPEHHGPAEGLLSDGQVRAAG--AEPGRP----YIGDGGLHE : : . ::. . ::. :: .::.: . : . :: : :. : . gi|187 RGEPGMS--GPAGTQGEPGPQ--GPPGKKGDDGRVNVNGPPGPPGAPGPQGRKGERGPKG 210 220 230 240 250 260 160 170 180 190 200 210 KIAA00 L-GH-HP--DHAAGAGGQPHHADRRGEWPGGAGPAQPAARGRLCRGRELCAQPGGPAVTE . : :: . .: :. .: :.:: :: :: . . : : : : gi|187 VPGSVHPGVQGPTGDQGRKGRAGRKGE-TGGQGPIGSKGPNGDCFH---CPTPRTPPGY 270 280 290 300 310 320 220 230 240 250 260 KIAA00 VGRLEELAVPRRFAPPLPRDVLEGSCPLPTASCLCADPAGLRPAATLRLSPARPAWP >>gi|7206749|gb|AAF39908.1| Dumpy : shorter than wild-ty (322 aa) initn: 71 init1: 45 opt: 204 Z-score: 209.5 bits: 46.7 E(): 0.0057 Smith-Waterman score: 204; 30.244% identity (52.195% similar) in 205 aa overlap (14-207:126-319) 10 20 30 40 KIAA00 PDTARSWRPFSLSECCSCHCGHGRYPVPVEVHGEAA--GEAGQ .::.:. :.. : : :. . : ::. gi|720 GYGSGPADSGYGAQGTNNGYGPVVNAEPEPQCCTCQQGKAGPPGPPGDDGKDGNDGSAGN 100 110 120 130 140 150 50 60 70 80 90 100 KIAA00 EGGEGLQGGAGQSEEGPSAEKCRVCPCVCRERHPQEERRCELASDGVPRRRSGLQGADSC .. .: .::.: :. : ..: : .:: . . . .: :: ::.:.:. gi|720 DAKNGKDGGVGPSD-GLQSEPCMICPPGAQGLGGAPGAK---GPQG-PRGSPGLSGVDGR 160 170 180 190 200 210 110 120 130 140 150 KIAA00 DYEGGDQEYGP-GDQS-PG-QGPEHHGPAEG-LLSDGQVRAAGAEPG-RPYIGDGGLHEL : : . :: : :. :: ::: . .: ... . .::. :: . :. : . . 
gi|720 RGEPGMS--GPAGTQGEPGPQGPPGKKGDDGRVINVNGPPGAGGAPGPQGRKGERGPKGV 220 230 240 250 260 160 170 180 190 200 210 KIAA00 -GH-HP--DHAAGAGGQPHHADRRGEWPGGAGPAQPAARGRLCRGRELCAQPGGPAVTEV : :: . .: :. .: :.:: :: :: . . : : : : gi|720 PGSVHPGVQGPTGDQGRKGRAGRKGE-TGGQGPIGSKGPNGDCFH---CPTPRTPPGY 270 280 290 300 310 320 220 230 240 250 260 KIAA00 GRLEELAVPRRFAPPLPRDVLEGSCPLPTASCLCADPAGLRPAATLRLSPARPAWP

268 residues in 1 query sequences
2693465022 residues in 7827732 library sequences
 Tcomplib [34.26] (8 proc)
 start: Tue Mar 3 16:56:43 2009 done: Tue Mar 3 17:02:31 2009
 Total Scan time: 1374.890 Total Display time: 0.050

Function used was FASTA [version 34.26.5 April 26, 2007]