FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4306, 306 aa
  1>>>pF1KE4306 306 - 306 aa - 306 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.0103+/-0.000693; mu= 17.5596+/- 0.042
 mean_var=63.1138+/-12.626, 0's: 0 Z-trim(109.7): 10 B-trim: 0 in 0/51
 Lambda= 0.161440
 statistics sampled from 11075 (11080) to 11075 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.715), E-opt: 0.2 (0.34), width: 16
 Scan time: 2.380

The best scores are:                                opt  bits E(32554)
CCDS447.1 PPT1 gene_id:5538|Hs108|chr1      ( 306) 2100 497.2 6.3e-141
CCDS44119.1 PPT1 gene_id:5538|Hs108|chr1    ( 203) 1108 266.1 1.6e-71
CCDS4742.1 PPT2 gene_id:9374|Hs108|chr6     ( 302)  257  68.0 1e-11
CCDS4740.1 PPT2 gene_id:9374|Hs108|chr6     ( 308)  257  68.0 1e-11

>>CCDS447.1 PPT1 gene_id:5538|Hs108|chr1 (306 aa)
 initn: 2100 init1: 2100 opt: 2100 Z-score: 2643.6 bits: 497.2 E(32554): 6.3e-141
Smith-Waterman score: 2100; 100.0% identity (100.0% similar) in 306 aa overlap (1-306:1-306)

               10        20        30        40        50        60
pF1KE4 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
              250       260       270       280       290       300

pF1KE4 IIPFLG
       ::::::
CCDS44 IIPFLG

>>CCDS44119.1 PPT1 gene_id:5538|Hs108|chr1 (203 aa)
 initn: 1098 init1: 1098 opt: 1108 Z-score: 1397.5 bits: 266.1 E(32554): 1.6e-71
Smith-Waterman score: 1190; 66.3% identity (66.3% similar) in 306 aa overlap (1-306:1-203)

               10        20        30        40        50        60
pF1KE4 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
       ::::::::::::::::::::::::::::::::::::::::::
CCDS44 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMG------------------
               10        20        30        40

               70        80        90       100       110       120
pF1KE4 KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF
CCDS44 ------------------------------------------------------------

              130       140       150       160       170       180
pF1KE4 LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
                                :::::::::::::::::::::::::::::::::::
CCDS44 -------------------------VFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
                                      50        60        70

              190       200       210       220       230       240
pF1KE4 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
        80        90       100       110       120       130

              250       260       270       280       290       300
pF1KE4 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
       140       150       160       170       180       190

pF1KE4 IIPFLG
       ::::::
CCDS44 IIPFLG
       200

>>CCDS4742.1 PPT2 gene_id:9374|Hs108|chr6 (302 aa)
 initn: 245 init1: 107 opt: 257 Z-score: 323.8 bits: 68.0 E(32554): 1e-11
Smith-Waterman score: 296; 28.5% identity (56.6% similar) in 288 aa overlap (8-281:13-274)

       10 20 30 40 50
pF1KE4 MASPGCLWLLAVALLPWTCASRALQHLDPPAPL-----PLVIWHGMGDSCCNPLS
       :.: :::. : : ::: :... ::. :: :
CCDS47 MLGLCGQRLPAAWVLL--LLPFL----PLLLLAAPAPHRASYKPVIVVHGLFDSS---YS
       10 20 30 40 50

       60 70 80 90 100
pF1KE4 MGAIKKMVEKKIPG--IYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQG
       . . ..... :: . ::.: :. .. .. .:.. .: .:: : ::
CCDS47 FRHLLEYINETHPGTVVTVLDLFDGRESLRP----LWEQVQGFREAVVPIMAKAP---QG
       60 70 80 90 100

       110 120 130 140 150 160
pF1KE4 YNAMGFSQGGQFLRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDF----IR
       . . .:::: ::. . . . ..::... ..: .: ..... . .:
CCDS47 VHLICYSQGGLVCRALLSVMDDHNVDSFISLSSPQMGQYG------DTDYLKWLFPTSMR
       110 120 130 140 150

       170 180 190 200 210 220
pF1KE4 KTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINES--YKKNLMAL
       ..: :: :: . .::::: ..:.: : : ::: :: :: .. ..::.. .
CCDS47 SNLYRICYSPWGQEFSI-CNYWHDPHHDDLYLNASSFLALINGERDHPNATVWRKNFLRV
       160 170 180 190 200 210

       230 240 250 260 270 280
pF1KE4 KKFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETI-PLQETSLYTQDRLGLKEMDNAGQLV
       ..:.. .:... : .: .:::: .:.::. ..: .: .: .::: . : .:
CCDS47 GHLVLIGGPDDGVITPWQSSFFGFY---DANETVLEMEEQLVYLRDSFGLKTLLARGAIV
       220 230 240 250 260 270

       290 300
pF1KE4 FLATEGDHLQLSEEWFYAHIIPFLG
CCDS47 RCPMAGISHTAWHSNRTLYETCIEPWLS
       280 290 300

>>CCDS4740.1 PPT2 gene_id:9374|Hs108|chr6 (308 aa)
 initn: 245 init1: 107 opt: 257 Z-score: 323.7 bits: 68.0 E(32554): 1e-11
Smith-Waterman score: 296; 28.5% identity (56.6% similar) in 288 aa overlap (8-281:19-280)

       10 20 30 40
pF1KE4 MASPGCLWLLAVALLPWTCASRALQHLDPPAPL-----PLVIWHGMGDS
       :.: :::. : : ::: :... ::. ::
CCDS47 MKSCGSMLGLCGQRLPAAWVLL--LLPFL----PLLLLAAPAPHRASYKPVIVVHGLFDS
       10 20 30 40 50

       50 60 70 80 90 100
pF1KE4 CCNPLSMGAIKKMVEKKIPG--IYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKD
       :. . ..... :: . ::.: :. .. .. .:.. .: .::
CCDS47 S---YSFRHLLEYINETHPGTVVTVLDLFDGRESLRP----LWEQVQGFREAVVPIMAKA
       60 70 80 90 100

       110 120 130 140 150 160
pF1KE4 PKLQQGYNAMGFSQGGQFLRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDF
       : :: . . .:::: ::. . . . ..::... ..: .: ..... .
CCDS47 P---QGVHLICYSQGGLVCRALLSVMDDHNVDSFISLSSPQMGQYG------DTDYLKWL
       110 120 130 140 150

       170 180 190 200 210
pF1KE4 ----IRKTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINES--YK
       .:..: :: :: . .::::: ..:.: : : ::: :: :: .. ..
CCDS47 FPTSMRSNLYRICYSPWGQEFSI-CNYWHDPHHDDLYLNASSFLALINGERDHPNATVWR
       160 170 180 190 200 210

       220 230 240 250 260 270
pF1KE4 KNLMALKKFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETI-PLQETSLYTQDRLGLKEMD
       ::.. . ..:.. .:... : .: .:::: .:.::. ..: .: .: .::: .
CCDS47 KNFLRVGHLVLIGGPDDGVITPWQSSFFGFY---DANETVLEMEEQLVYLRDSFGLKTLL
       220 230 240 250 260 270

       280 290 300
pF1KE4 NAGQLVFLATEGDHLQLSEEWFYAHIIPFLG
       : .:
CCDS47 ARGAIVRCPMAGISHTAWHSNRTLYETCIEPWLS
       280 290 300

306 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 23:28:03 2016 done: Sat Nov 5 23:28:04 2016
 Total Scan time: 2.380 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]