FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1304, 289 aa 1>>>pF1KE1304 289 - 289 aa - 289 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2774+/-0.000774; mu= 15.3498+/- 0.047 mean_var=62.8535+/-12.518, 0's: 0 Z-trim(107.7): 16 B-trim: 0 in 0/53 Lambda= 0.161774 statistics sampled from 9717 (9722) to 9717 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.686), E-opt: 0.2 (0.299), width: 16 Scan time: 2.300 The best scores are: opt bits E(32554) CCDS7299.1 PPA1 gene_id:5464|Hs108|chr10 ( 289) 2036 483.5 7.4e-137 CCDS3667.1 PPA2 gene_id:27068|Hs108|chr4 ( 334) 1162 279.6 2.1e-75 CCDS3669.2 PPA2 gene_id:27068|Hs108|chr4 ( 232) 644 158.6 3.9e-39 CCDS3668.2 PPA2 gene_id:27068|Hs108|chr4 ( 305) 642 158.2 6.8e-39 CCDS34043.1 PPA2 gene_id:27068|Hs108|chr4 ( 168) 449 113.0 1.5e-25 >>CCDS7299.1 PPA1 gene_id:5464|Hs108|chr10 (289 aa) initn: 2036 init1: 2036 opt: 2036 Z-score: 2570.4 bits: 483.5 E(32554): 7.4e-137 Smith-Waterman score: 2036; 100.0% identity (100.0% similar) in 289 aa overlap (1-289:1-289) 10 20 30 40 50 60 pF1KE1 MSGFSTEERAAPFSLEYRVFLKNEKGQYISPFHDIPIYADKDVFHMVVEVPRWSNAKMEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 MSGFSTEERAAPFSLEYRVFLKNEKGQYISPFHDIPIYADKDVFHMVVEVPRWSNAKMEI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 ATKDPLNPIKQDVKKGKLRYVANLFPYKGYIWNYGAIPQTWEDPGHNDKHTGCCGDNDPI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 ATKDPLNPIKQDVKKGKLRYVANLFPYKGYIWNYGAIPQTWEDPGHNDKHTGCCGDNDPI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 DVCEIGSKVCARGEIIGVKVLGILAMIDEGETDWKVIAINVDDPDAANYNDINDVKRLKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 DVCEIGSKVCARGEIIGVKVLGILAMIDEGETDWKVIAINVDDPDAANYNDINDVKRLKP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 GYLEATVDWFRRYKVPDGKPENEFAFNAEFKDKDFAIDIIKSTHDHWKALVTKKTNGKGI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 GYLEATVDWFRRYKVPDGKPENEFAFNAEFKDKDFAIDIIKSTHDHWKALVTKKTNGKGI 190 200 210 220 230 240 250 260 270 280 pF1KE1 SCMNTTLSESPFKCDPDAARAIVDALPPPCESACTVPTDVDKWFHHQKN ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 SCMNTTLSESPFKCDPDAARAIVDALPPPCESACTVPTDVDKWFHHQKN 250 260 270 280 >>CCDS3667.1 PPA2 gene_id:27068|Hs108|chr4 (334 aa) initn: 1274 init1: 1133 opt: 1162 Z-score: 1467.1 bits: 279.6 E(32554): 2.1e-75 Smith-Waterman score: 1269; 61.5% identity (83.2% similar) in 291 aa overlap (1-272:32-322) 10 20 30 pF1KE1 MSGFSTEERAAPFSLEYRVFLKNEKGQYIS :. . ::::. : : .::.:.:: :.::: CCDS36 SALLRLLRTGAPAAACLRLGTSAGTGSRRAMALYHTEERGQPCSQNYRLFFKNVTGHYIS 10 20 30 40 50 60 40 50 60 70 pF1KE1 PFHDIPIYAD-----------------KDVFHMVVEVPRWSNAKMEIATKDPLNPIKQDV ::::::. .. ...:.:.::.:::.:::::::::.:.::::: : CCDS36 PFHDIPLKVNSKEENGIPMKKARNDEYENLFNMIVEIPRWTNAKMEIATKEPMNPIKQYV 70 80 90 100 110 120 80 90 100 110 120 130 pF1KE1 KKGKLRYVANLFPYKGYIWNYGAIPQTWEDPGHNDKHTGCCGDNDPIDVCEIGSKVCARG : ::::::::.:::::::::::..::::::: ..:: :.: ::::::::::::::. . : CCDS36 KDGKLRYVANIFPYKGYIWNYGTLPQTWEDPHEKDKSTNCFGDNDPIDVCEIGSKILSCG 130 140 150 160 170 180 140 150 160 170 180 190 pF1KE1 EIIGVKVLGILAMIDEGETDWKVIAINVDDPDAANYNDINDVKRLKPGYLEATVDWFRRY :.: ::.:::::.:::::::::.::::..::.:....::.:::..::::::::..::: : CCDS36 EVIHVKILGILALIDEGETDWKLIAINANDPEASKFHDIDDVKKFKPGYLEATLNWFRLY 190 200 210 220 230 240 200 210 220 230 240 250 pF1KE1 KVPDGKPENEFAFNAEFKDKDFAIDIIKSTHDHWKALVTKKTNGKGISCMNTTLSESPFK :::::::::.::::.:::.: ::...:::::. ::::. :: :: .:.: :. .:.:::. CCDS36 KVPDGKPENQFAFNGEFKNKAFALEVIKSTHQCWKALLMKKCNGGAINCTNVQISDSPFR 250 260 270 280 290 300 260 270 280 pF1KE1 CDPDAARAIVDALP--PPCESACTVPTDVDKWFHHQKN : . ::..:... : :: CCDS36 CTQEEARSLVESVSSSPNKESNEEEQVWHFLGK 310 320 330 >>CCDS3669.2 PPA2 gene_id:27068|Hs108|chr4 (232 aa) initn: 781 init1: 640 opt: 644 Z-score: 816.1 bits: 158.6 E(32554): 3.9e-39 Smith-Waterman score: 644; 59.9% identity (86.8% similar) in 152 aa overlap (124-272:69-220) 100 110 120 130 140 150 pF1KE1 YGAIPQTWEDPGHNDKHTGCCGDNDPIDVCEIGSK-VCARGEIIGVKVLGILAMIDEGET ...:: . . ::.: ::.:::::.:::::: CCDS36 ERGQPCSQNYRLFFKNVTGHYISPFHDIPLKVNSKEILSCGEVIHVKILGILALIDEGET 40 50 60 70 80 90 160 170 180 190 200 210 pF1KE1 DWKVIAINVDDPDAANYNDINDVKRLKPGYLEATVDWFRRYKVPDGKPENEFAFNAEFKD :::.::::..::.:....::.:::..::::::::..::: ::::::::::.::::.:::. CCDS36 DWKLIAINANDPEASKFHDIDDVKKFKPGYLEATLNWFRLYKVPDGKPENQFAFNGEFKN 100 110 120 130 140 150 220 230 240 250 260 270 pF1KE1 KDFAIDIIKSTHDHWKALVTKKTNGKGISCMNTTLSESPFKCDPDAARAIVDALP--PPC : ::...:::::. ::::. :: :: .:.: :. .:.:::.: . ::..:... : CCDS36 KAFALEVIKSTHQCWKALLMKKCNGGAINCTNVQISDSPFRCTQEEARSLVESVSSSPNK 160 170 180 190 200 210 280 pF1KE1 ESACTVPTDVDKWFHHQKN :: CCDS36 ESNEEEQVWHFLGK 220 230 >>CCDS3668.2 PPA2 gene_id:27068|Hs108|chr4 (305 aa) initn: 1105 init1: 640 opt: 642 Z-score: 811.8 bits: 158.2 E(32554): 6.8e-39 Smith-Waterman score: 1031; 54.3% identity (74.2% similar) in 291 aa overlap (1-272:32-293) 10 20 30 pF1KE1 MSGFSTEERAAPFSLEYRVFLKNEKGQYIS :. . ::::. : : .::.:.:: :.::: CCDS36 SALLRLLRTGAPAAACLRLGTSAGTGSRRAMALYHTEERGQPCSQNYRLFFKNVTGHYIS 10 20 30 40 50 60 40 50 60 70 pF1KE1 PFHDIPIYAD-----------------KDVFHMVVEVPRWSNAKMEIATKDPLNPIKQDV ::::::. .. ...:.:.::.:::.:::::::::.:.::::: : CCDS36 PFHDIPLKVNSKEENGIPMKKARNDEYENLFNMIVEIPRWTNAKMEIATKEPMNPIKQYV 70 80 90 100 110 120 80 90 100 110 120 130 pF1KE1 KKGKLRYVANLFPYKGYIWNYGAIPQTWEDPGHNDKHTGCCGDNDPIDVCEIGSKVCARG : ::::::::.:::::::::::..:: : : : CCDS36 KDGKLRYVANIFPYKGYIWNYGTLPQ--------------------ILSC---------G 130 140 150 140 150 160 170 180 190 pF1KE1 EIIGVKVLGILAMIDEGETDWKVIAINVDDPDAANYNDINDVKRLKPGYLEATVDWFRRY :.: ::.:::::.:::::::::.::::..::.:....::.:::..::::::::..::: : CCDS36 EVIHVKILGILALIDEGETDWKLIAINANDPEASKFHDIDDVKKFKPGYLEATLNWFRLY 160 170 180 190 200 210 200 210 220 230 240 250 pF1KE1 KVPDGKPENEFAFNAEFKDKDFAIDIIKSTHDHWKALVTKKTNGKGISCMNTTLSESPFK :::::::::.::::.:::.: ::...:::::. ::::. :: :: .:.: :. .:.:::. CCDS36 KVPDGKPENQFAFNGEFKNKAFALEVIKSTHQCWKALLMKKCNGGAINCTNVQISDSPFR 220 230 240 250 260 270 260 270 280 pF1KE1 CDPDAARAIVDALP--PPCESACTVPTDVDKWFHHQKN : . ::..:... : :: CCDS36 CTQEEARSLVESVSSSPNKESNEEEQVWHFLGK 280 290 300 >>CCDS34043.1 PPA2 gene_id:27068|Hs108|chr4 (168 aa) initn: 480 init1: 435 opt: 449 Z-score: 572.3 bits: 113.0 E(32554): 1.5e-25 Smith-Waterman score: 449; 57.0% identity (80.7% similar) in 114 aa overlap (164-272:43-156) 140 150 160 170 180 190 pF1KE1 EIIGVKVLGILAMIDEGETDWKVIAINVDDPDAANYN---DINDVKRLKPGYLEATVDWF : . :: .:.:::..::::::::..:: CCDS34 PAAACLRLGTSAGTGSRRAMALYHTEERGQPCSQNYRLFFNIDDVKKFKPGYLEATLNWF 20 30 40 50 60 70 200 210 220 230 240 250 pF1KE1 RRYKVPDGKPENEFAFNAEFKDKDFAIDIIKSTHDHWKALVTKKTNGKGISCMNTTLSES : ::::::::::.::::.:::.: ::...:::::. ::::. :: :: .:.: :. .:.: CCDS34 RLYKVPDGKPENQFAFNGEFKNKAFALEVIKSTHQCWKALLMKKCNGGAINCTNVQISDS 80 90 100 110 120 130 260 270 280 pF1KE1 PFKCDPDAARAIVDALP--PPCESACTVPTDVDKWFHHQKN ::.: . ::..:... : :: CCDS34 PFRCTQEEARSLVESVSSSPNKESNEEEQVWHFLGK 140 150 160 289 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 02:56:32 2016 done: Mon Nov 7 02:56:33 2016 Total Scan time: 2.300 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]