FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1119, 328 aa
  1>>>pF1KE1119 328 - 328 aa - 328 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.3700+/-0.000942; mu= 15.0528+/- 0.056
 mean_var=58.2856+/-11.509, 0's: 0 Z-trim(103.6): 20 B-trim: 8 in 1/50
 Lambda= 0.167994
 statistics sampled from 7482 (7487) to 7482 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.612), E-opt: 0.2 (0.23), width: 16
 Scan time: 2.120

The best scores are:                                 opt  bits E(32554)
CCDS10590.1 ERI2 gene_id:112479|Hs108|chr16  ( 328) 2240 551.4 3.7e-157
CCDS45436.1 ERI2 gene_id:112479|Hs108|chr16  ( 691) 1667 412.6 4.6e-115
CCDS30696.1 ERI3 gene_id:79033|Hs108|chr1    ( 337)  273  74.6 1.2e-13
CCDS5972.1 ERI1 gene_id:90459|Hs108|chr8     ( 349)  261  71.7 9.4e-13

>>CCDS10590.1 ERI2 gene_id:112479|Hs108|chr16 (328 aa)
 initn: 2240 init1: 2240 opt: 2240 Z-score: 2935.0 bits: 551.4 E(32554): 3.7e-157
Smith-Waterman score: 2240; 99.7% identity (100.0% similar) in 328 aa overlap (1-328:1-328)

               10        20        30        40        50        60
pF1KE1 MATKRLARQLGLIRRKSIAPANGNLGRSKSKQLFDYLIVIDFESTCWNDGKHHHSQEIIE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MATKRLARQLGLIRRKSIAPANGNLGRSKSKQLFDYLIVIDFESTCWNDGKHHHSQEIIE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 FPAVLLNTSTGQIDSEFQAYVQPQEHPILSEFCMELTGIKQAQVDEGVPLKICLSQFCKW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FPAVLLNTSTGQIDSEFQAYVQPQEHPILSEFCMELTGIKQAQVDEGVPLKICLSQFCKW
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 IHKIQQQKNIIFATGISEPSASEVKLCAFVTWSDWDLGVCLEYECKRKQLLKPVFLNSWI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 IHKIQQQKNIIFATGISEPSASEVKLCAFVTWSDWDLGVCLEYECKRKQLLKPVFLNSWI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 DLRATYKLFYRRKPKGLSGALQEVGIEFSGREHSGLDDSRNTALLAWKMIRDGCVMKITR
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 DLRATYKLFYRRKPKGLSGALQEVGIEFSGREHSGLDDSRNTALLAWKMIRDGCVMKITR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 SLNKGPFLLPSWTWNSDLASGDQHAFLKQEFGCGTYRTLLQKPNMSKQEKGNILWLTMVW :::::::::::::::::::::::::::.:::::::::::::::::::::::::::::::: CCDS10 SLNKGPFLLPSWTWNSDLASGDQHAFLQQEFGCGTYRTLLQKPNMSKQEKGNILWLTMVW 250 260 270 280 290 300 310 320 pF1KE1 LSLACLQRKNYNDCMLNTASQTVTTEKF :::::::::::::::::::::::::::: CCDS10 LSLACLQRKNYNDCMLNTASQTVTTEKF 310 320 >>CCDS45436.1 ERI2 gene_id:112479|Hs108|chr16 (691 aa) initn: 1661 init1: 1661 opt: 1667 Z-score: 2179.3 bits: 412.6 E(32554): 4.6e-115 Smith-Waterman score: 1667; 86.4% identity (91.5% similar) in 294 aa overlap (1-294:1-287) 10 20 30 40 50 60 pF1KE1 MATKRLARQLGLIRRKSIAPANGNLGRSKSKQLFDYLIVIDFESTCWNDGKHHHSQEIIE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MATKRLARQLGLIRRKSIAPANGNLGRSKSKQLFDYLIVIDFESTCWNDGKHHHSQEIIE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 FPAVLLNTSTGQIDSEFQAYVQPQEHPILSEFCMELTGIKQAQVDEGVPLKICLSQFCKW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 FPAVLLNTSTGQIDSEFQAYVQPQEHPILSEFCMELTGIKQAQVDEGVPLKICLSQFCKW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 IHKIQQQKNIIFATGISEPSASEVKLCAFVTWSDWDLGVCLEYECKRKQLLKPVFLNSWI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 IHKIQQQKNIIFATGISEPSASEVKLCAFVTWSDWDLGVCLEYECKRKQLLKPVFLNSWI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 DLRATYKLFYRRKPKGLSGALQEVGIEFSGREHSGLDDSRNTALLAWKMIRDGCVMKITR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 DLRATYKLFYRRKPKGLSGALQEVGIEFSGREHSGLDDSRNTALLAWKMIRDGCVMKITR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 SLNKGPFLLPSWTWNSDLASGDQHAFLKQEFGCGTYRTLLQKPNMSKQEKGNILWLTMVW :::: .:. : :: . . ... .:. .: :.. ..: ::. 
CCDS45 SLNK----VPTKKNFSILARNLNTIQVEEMSACNIS---IQGPSIYNKEPKNIINPHEKV 250 260 270 280 290 310 320 pF1KE1 LSLACLQRKNYNDCMLNTASQTVTTEKF CCDS45 QMKSICANSPIKAQQDQLQVKNNIKASLHNVKSSLPLFNTKSSTSVGQLQSPTLNSPIYM 300 310 320 330 340 350 >>CCDS30696.1 ERI3 gene_id:79033|Hs108|chr1 (337 aa) initn: 400 init1: 203 opt: 273 Z-score: 358.4 bits: 74.6 E(32554): 1.2e-13 Smith-Waterman score: 418; 38.1% identity (60.0% similar) in 210 aa overlap (32-239:141-333) 10 20 30 40 50 60 pF1KE1 ATKRLARQLGLIRRKSIAPANGNLGRSKSKQLFDYLIVIDFESTCWNDGKHHHSQEIIEF : . :..:.:::.:: : . : :::::: CCDS30 GVPEFCSISTRKLAAHGFGASMAAMVSFPPQRYHYFLVLDFEATC--DKPQIHPQEIIEF 120 130 140 150 160 70 80 90 100 110 120 pF1KE1 PAVLLNTSTGQIDSEFQAYVQPQEHPILSEFCMELTGIKQAQVDEGVP-LKICLSQFCKW : . :: : .:.: :. :::: :: :. :: ::::: ::.:: : : :. : . .: CCDS30 PILKLNGRTMEIESTFHMYVQPVVHPQLTPFCTELTGIIQAMVD-GQPSLQQVLERVDEW 170 180 190 200 210 220 130 140 150 160 170 180 pF1KE1 IHKIQQQKNIIFATGISEPSASEVKLCAFVTWSDWDLGVCLEYECKRKQLLKPVFLNSWI . : :. .:... . ::: .:::: : : .:. : ....:: CCDS30 MAK----------EGLLDPNVKSI----FVTCGDWDLKVMLPGQCQYLGLPVADYFKQWI 230 240 250 260 270 190 200 210 220 230 pF1KE1 DLRATYKLFYRRKPK-GLSGALQEVGIEFSGREHSGLDDSRNTALLAWKMIRDGCVMKIT .:. .:.. . :: :: . .... :: :::.:: .: : . . : ..: : CCDS30 NLKKAYSFAMGCWPKNGLLDMNKGLSLQHIGRPHSGIDDCKNIANIMKTLAYRGFIFKQT 280 290 300 310 320 330 240 250 260 270 280 290 pF1KE1 RSLNKGPFLLPSWTWNSDLASGDQHAFLKQEFGCGTYRTLLQKPNMSKQEKGNILWLTMV CCDS30 SKPF >>CCDS5972.1 ERI1 gene_id:90459|Hs108|chr8 (349 aa) initn: 496 init1: 261 opt: 261 Z-score: 342.4 bits: 71.7 E(32554): 9.4e-13 Smith-Waterman score: 479; 35.5% identity (67.7% similar) in 217 aa overlap (34-248:127-328) 10 20 30 40 50 60 pF1KE1 KRLARQLGLIRRKSIAPANGNLGRSKSKQLFDYLIVIDFESTCWNDGKHHHSQEIIEFPA .::. .::::.:: . . . .::::::. CCDS59 GVKDVLKKRLKNYYKKQKLMLKESNFADSYYDYICIIDFEATCEEGNPPEFVHEIIEFPV 100 110 120 130 140 150 70 80 90 100 110 120 pF1KE1 VLLNTSTGQIDSEFQAYVQPQEHPILSEFCMELTGIKQAQVDEGVPLKICLSQFCKWIHK ::::: : .:.. :: ::.:. . ::.::. 
:::: : :::.. . :.. :. : CCDS59 VLLNTHTLEIEDTFQQYVRPEINTQLSDFCISLTGITQDQVDRADTFPQVLKKVIDWM-K 160 170 180 190 200 210 130 140 150 160 170 180 pF1KE1 IQQQKNIIFATGISEPSASEVKLCAFVTWSDWDLGVCLEYECKRKQLLKPVFLNSWIDLR ... ... : ...: ..::.. :. .:. ..: : : ..::..: CCDS59 LKEL-------------GTKYKY-SLLTDGSWDMSKFLNIQCQLSRLKYPPFAKKWINIR 220 230 240 250 260 190 200 210 220 230 240 pF1KE1 ATYKLFYR--RKPKGLSGALQEVGIEFSGREHSGLDDSRNTALLAWKMIRDGCVMKITRS .: ::. :. :. :...:....:: : :::::.: : .: .:..::: ..:... CCDS59 KSYGNFYKVPRSQTKLTIMLEKLGMDYDGRPHCGLDDSKNIARIAVRMLQDGCELRINEK 270 280 290 300 310 320 250 260 270 280 290 300 pF1KE1 LNKGPFLLPSWTWNSDLASGDQHAFLKQEFGCGTYRTLLQKPNMSKQEKGNILWLTMVWL .. : .. CCDS59 MHAGQLMSVSSSLPIEGTPPPQMPHFRK 330 340 328 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:56:24 2016 done: Mon Nov 7 01:56:24 2016 Total Scan time: 2.120 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]