FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6350, 351 aa
  1>>>pF1KE6350 351 - 351 aa - 351 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.3923+/-0.000891; mu= 16.5685+/- 0.054
 mean_var=81.2250+/-15.882, 0's: 0 Z-trim(107.2): 11 B-trim: 0 in 0/51
 Lambda= 0.142308
 statistics sampled from 9431 (9436) to 9431 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.657), E-opt: 0.2 (0.29), width: 16
 Scan time: 2.470

The best scores are:                                       opt  bits E(32554)
CCDS5509.1 ZPBP gene_id:11055|Hs108|chr7           ( 351) 2377 497.7 6.2e-141
CCDS55110.1 ZPBP gene_id:11055|Hs108|chr7          ( 350) 2360 494.2 7e-140
CCDS11352.1 ZPBP2 gene_id:124626|Hs108|chr17       ( 338)  730 159.5 3.7e-39
CCDS11353.2 ZPBP2 gene_id:124626|Hs108|chr17       ( 316)  701 153.5 2.2e-37

>>CCDS5509.1 ZPBP gene_id:11055|Hs108|chr7 (351 aa)
 initn: 2377 init1: 2377 opt: 2377 Z-score: 2643.7 bits: 497.7 E(32554): 6.2e-141
Smith-Waterman score: 2377; 100.0% identity (100.0% similar) in 351 aa overlap (1-351:1-351)

               10        20        30        40        50        60
pF1KE6 MEAFALGPARRGRRRTRAAGSLLSRAAILLFISAFLVRVPSSVGHLVRLPRAFRLTKDSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 MEAFALGPARRGRRRTRAAGSLLSRAAILLFISAFLVRVPSSVGHLVRLPRAFRLTKDSV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 KIVGSTSFPVKAYVMLHQKSPHVLCVTQQLRNAELIDPSFQWYGPKGKVVSVENRTAQIT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 KIVGSTSFPVKAYVMLHQKSPHVLCVTQQLRNAELIDPSFQWYGPKGKVVSVENRTAQIT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 STGSLVFQNFEESMSGIYTCFLEYKPTVEEIVKRLQLKYAIYAYREPHYYYQFTARYHAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 STGSLVFQNFEESMSGIYTCFLEYKPTVEEIVKRLQLKYAIYAYREPHYYYQFTARYHAA
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 PCNSIYNISFEKKLLQILSKLLLDLSCEISLLKSECHRVKMQRAGLQNELFFAFSVSSLD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 PCNSIYNISFEKKLLQILSKLLLDLSCEISLLKSECHRVKMQRAGLQNELFFAFSVSSLD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 TEKGPKRCTDHNCEPYKRLFKAKNLIERFFNQQVEILGRRAEQLPQIYYIEGTLQMVWIN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 TEKGPKRCTDHNCEPYKRLFKAKNLIERFFNQQVEILGRRAEQLPQIYYIEGTLQMVWIN
              250       260       270       280       290       300

              310       320       330       340       350
pF1KE6 RCFPGYGMNVQQHPKCPECCVICSPGSYNPRDGIHCLQCNSSLVYGAKTCL
       :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 RCFPGYGMNVQQHPKCPECCVICSPGSYNPRDGIHCLQCNSSLVYGAKTCL
              310       320       330       340       350

>>CCDS55110.1 ZPBP gene_id:11055|Hs108|chr7 (350 aa)
 initn: 1653 init1: 1653 opt: 2360 Z-score: 2624.8 bits: 494.2 E(32554): 7e-140
Smith-Waterman score: 2360; 99.7% identity (99.7% similar) in 351 aa overlap (1-351:1-350)

               10        20        30        40        50        60
pF1KE6 MEAFALGPARRGRRRTRAAGSLLSRAAILLFISAFLVRVPSSVGHLVRLPRAFRLTKDSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 MEAFALGPARRGRRRTRAAGSLLSRAAILLFISAFLVRVPSSVGHLVRLPRAFRLTKDSV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 KIVGSTSFPVKAYVMLHQKSPHVLCVTQQLRNAELIDPSFQWYGPKGKVVSVENRTAQIT
       ::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::::
CCDS55 KIVGSTSFPVKAYVMLHQKSPHVLCVTQQLRNAELIDPSFQWYGPKGKVVS-ENRTAQIT
               70        80        90       100       110

              130       140       150       160       170       180
pF1KE6 STGSLVFQNFEESMSGIYTCFLEYKPTVEEIVKRLQLKYAIYAYREPHYYYQFTARYHAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 STGSLVFQNFEESMSGIYTCFLEYKPTVEEIVKRLQLKYAIYAYREPHYYYQFTARYHAA
     120       130       140       150       160       170

              190       200       210       220       230       240
pF1KE6 PCNSIYNISFEKKLLQILSKLLLDLSCEISLLKSECHRVKMQRAGLQNELFFAFSVSSLD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 PCNSIYNISFEKKLLQILSKLLLDLSCEISLLKSECHRVKMQRAGLQNELFFAFSVSSLD
     180       190       200       210       220       230

              250       260       270       280       290       300
pF1KE6 TEKGPKRCTDHNCEPYKRLFKAKNLIERFFNQQVEILGRRAEQLPQIYYIEGTLQMVWIN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55
TEKGPKRCTDHNCEPYKRLFKAKNLIERFFNQQVEILGRRAEQLPQIYYIEGTLQMVWIN 240 250 260 270 280 290 310 320 330 340 350 pF1KE6 RCFPGYGMNVQQHPKCPECCVICSPGSYNPRDGIHCLQCNSSLVYGAKTCL ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 RCFPGYGMNVQQHPKCPECCVICSPGSYNPRDGIHCLQCNSSLVYGAKTCL 300 310 320 330 340 350 >>CCDS11352.1 ZPBP2 gene_id:124626|Hs108|chr17 (338 aa) initn: 631 init1: 253 opt: 730 Z-score: 816.4 bits: 159.5 E(32554): 3.7e-39 Smith-Waterman score: 730; 37.7% identity (66.8% similar) in 313 aa overlap (47-350:18-326) 20 30 40 50 60 70 pF1KE6 RAAGSLLSRAAILLFISAFLVRVPSSVGHLVRLPRAFRLTKDSVKIVGSTSFPVKAYVML :. :: : : . . : :.:. : : :: : CCDS11 MMRTCVLLSAVLWCLTGVQCPR-FTLFNKKGFIYGKTGQPDKIYVEL 10 20 30 40 80 90 100 110 120 130 pF1KE6 HQKSPHVLCVTQQLRNAELIDPSFQWYGPKGKVVSVENRTAQITSTGSLVFQNFEESMSG ::.:: ..:. .: . :..::.. : ::. :... .:: .:: ::.:. ..: : .:: CCDS11 HQNSPVLICMDFKLSKKEIVDPTYLWIGPNEKTLTGNNRI-NITETGQLMVKDFLEPLSG 50 60 70 80 90 100 140 150 160 170 180 190 pF1KE6 IYTCFLEYKP----TVEEIVKRLQLKYAIYAYREPHYYYQFTARYHAAPCNSIYNISFEK .::: : :: : :: . . . . ..::::: : ::...:. . : . :: : . CCDS11 LYTCTLSYKTVKAETQEEKTVKKRYDFMVFAYREPDYSYQMAVRFTTRSCIGRYNDVFFR 110 120 130 140 150 160 200 210 220 230 240 250 pF1KE6 KLLQILSKLLLDLSCEISLLKSECHRVKMQRAGLQNELFFAFSVSSLDTE-KGPKRCTDH : .::..:. ::::.. . .:: :.. . :: .:::.::.:. . :: :. CCDS11 VLKKILDSLISDLSCHVIEPSYKCHSVEIPEHGLIHELFIAFQVNPFAPGWKGA--CNGS 170 180 190 200 210 220 260 270 280 290 300 pF1KE6 -NCEPY--KRLFKAKNLIERFFNQQVEILGRRAEQ-LPQIYYIEGTLQMVWINRCFPGYG .:: . ...:.. :: :: .:. :. . .. :: ..... .::.: .. : ::.: CCDS11 VDCEDTTNHNILQARDRIEDFFRSQAYIFYHNFNKTLPAMHFVDHSLQVVRLDSCRPGFG 230 240 250 260 270 280 310 320 330 340 350 pF1KE6 MNVQQHPKCPECCVICSPGSYNPRDGIHCLQCNSSLVYGAKTCL : . : .: :::.:::....: .. 
: : : :.::::.: CCDS11 KNERLHSNCASCCVVCSPATFSPDVNVTCQTCVSVLTYGAKSCPQTSNKNQQYED 290 300 310 320 330 >>CCDS11353.2 ZPBP2 gene_id:124626|Hs108|chr17 (316 aa) initn: 612 init1: 253 opt: 701 Z-score: 784.6 bits: 153.5 E(32554): 2.2e-37 Smith-Waterman score: 701; 37.7% identity (67.5% similar) in 289 aa overlap (71-350:19-304) 50 60 70 80 90 100 pF1KE6 SSVGHLVRLPRAFRLTKDSVKIVGSTSFPVKAYVMLHQKSPHVLCVTQQLRNAELIDPSF : :: :::.:: ..:. .: . :..::.. CCDS11 MMRTCVLLSAVLWCLTGDKIYVELHQNSPVLICMDFKLSKKEIVDPTY 10 20 30 40 110 120 130 140 150 pF1KE6 QWYGPKGKVVSVENRTAQITSTGSLVFQNFEESMSGIYTCFLEYKP----TVEEIVKRLQ : ::. :... .:: .:: ::.:. ..: : .::.::: : :: : :: . . . CCDS11 LWIGPNEKTLTGNNRI-NITETGQLMVKDFLEPLSGLYTCTLSYKTVKAETQEEKTVKKR 50 60 70 80 90 100 160 170 180 190 200 210 pF1KE6 LKYAIYAYREPHYYYQFTARYHAAPCNSIYNISFEKKLLQILSKLLLDLSCEISLLKSEC . ..::::: : ::...:. . : . :: : . : .::..:. ::::.. . .: CCDS11 YDFMVFAYREPDYSYQMAVRFTTRSCIGRYNDVFFRVLKKILDSLISDLSCHVIEPSYKC 110 120 130 140 150 160 220 230 240 250 260 270 pF1KE6 HRVKMQRAGLQNELFFAFSVSSLDTE-KGPKRCTDH-NCEPY--KRLFKAKNLIERFFNQ : :.. . :: .:::.::.:. . :: :. .:: . ...:.. :: :: . CCDS11 HSVEIPEHGLIHELFIAFQVNPFAPGWKGA--CNGSVDCEDTTNHNILQARDRIEDFFRS 170 180 190 200 210 220 280 290 300 310 320 330 pF1KE6 QVEILGRRAEQ-LPQIYYIEGTLQMVWINRCFPGYGMNVQQHPKCPECCVICSPGSYNPR :. :. . .. :: ..... .::.: .. : ::.: : . : .: :::.:::....: CCDS11 QAYIFYHNFNKTLPAMHFVDHSLQVVRLDSCRPGFGKNERLHSNCASCCVVCSPATFSPD 230 240 250 260 270 280 340 350 pF1KE6 DGIHCLQCNSSLVYGAKTCL .. : : : :.::::.: CCDS11 VNVTCQTCVSVLTYGAKSCPQTSNKNQQYED 290 300 310 351 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:22:07 2016 done: Tue Nov 8 12:22:08 2016 Total Scan time: 2.470 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]