FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0284, 337 aa
  1>>>pF1KE0284 337 - 337 aa - 337 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.9564+/-0.000728; mu= 14.2207+/- 0.044
 mean_var=94.8082+/-19.035, 0's: 0 Z-trim(112.0): 7 B-trim: 3 in 1/50
 Lambda= 0.131720
 statistics sampled from 12861 (12867) to 12861 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.757), E-opt: 0.2 (0.395), width: 16
 Scan time: 2.840

The best scores are:                                      opt bits E(32554)
CCDS30696.1 ERI3 gene_id:79033|Hs108|chr1          ( 337) 2391 464.0 7.5e-131
CCDS5972.1 ERI1 gene_id:90459|Hs108|chr8           ( 349)  467 98.4   9e-21

>>CCDS30696.1 ERI3 gene_id:79033|Hs108|chr1               (337 aa)
 initn: 2391 init1: 2391 opt: 2391 Z-score: 2462.6 bits: 464.0 E(32554): 7.5e-131
Smith-Waterman score: 2391; 100.0% identity (100.0% similar) in 337 aa overlap (1-337:1-337)

               10        20        30        40        50        60
pF1KE0 MATASPAADGGRGRPWEGGLVSWPPAPPLTLPWTWMGPSWGQHPGHWGFPALTEPSASPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 MATASPAADGGRGRPWEGGLVSWPPAPPLTLPWTWMGPSWGQHPGHWGFPALTEPSASPA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 AGLGIFEVRRVLDASGCSMLAPLQTGAARFSSYLLSRARKVLGSHLFSPCGVPEFCSIST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 AGLGIFEVRRVLDASGCSMLAPLQTGAARFSSYLLSRARKVLGSHLFSPCGVPEFCSIST
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 RKLAAHGFGASMAAMVSFPPQRYHYFLVLDFEATCDKPQIHPQEIIEFPILKLNGRTMEI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 RKLAAHGFGASMAAMVSFPPQRYHYFLVLDFEATCDKPQIHPQEIIEFPILKLNGRTMEI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 ESTFHMYVQPVVHPQLTPFCTELTGIIQAMVDGQPSLQQVLERVDEWMAKEGLLDPNVKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 ESTFHMYVQPVVHPQLTPFCTELTGIIQAMVDGQPSLQQVLERVDEWMAKEGLLDPNVKS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 IFVTCGDWDLKVMLPGQCQYLGLPVADYFKQWINLKKAYSFAMGCWPKNGLLDMNKGLSL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 IFVTCGDWDLKVMLPGQCQYLGLPVADYFKQWINLKKAYSFAMGCWPKNGLLDMNKGLSL
              250       260       270       280       290       300

              310       320       330
pF1KE0 QHIGRPHSGIDDCKNIANIMKTLAYRGFIFKQTSKPF
       :::::::::::::::::::::::::::::::::::::
CCDS30 QHIGRPHSGIDDCKNIANIMKTLAYRGFIFKQTSKPF
              310       320       330

>>CCDS5972.1 ERI1 gene_id:90459|Hs108|chr8                (349 aa)
 initn: 365 init1: 246 opt: 467 Z-score: 486.4 bits: 98.4 E(32554): 9e-21
Smith-Waterman score: 467; 40.7% identity (64.9% similar) in 194 aa overlap (137-327:121-313)

        110       120       130       140       150         160
pF1KE0 FSPCGVPEFCSISTRKLAAHGFGASMAAMVSFPPQRYHYFLVLDFEATCDK--PQIHPQE
                                     .:  . : :. ..::::::..  :    .:
CCDS59 FKLETRGVKDVLKKRLKNYYKKQKLMLKESNFADSYYDYICIIDFEATCEEGNPPEFVHE
              100       110       120       130       140       150

          170       180       190       200       210       220
pF1KE0 IIEFPILKLNGRTMEIESTFHMYVQPVVHPQLTPFCTELTGIIQAMVDGQPSLQQVLERV
       :::::.. :: .:.:::.::..::.: .. ::. ::  :::: : .::   .. :::..:
CCDS59 IIEFPVVLLNTHTLEIEDTFQQYVRPEINTQLSDFCISLTGITQDQVDRADTFPQVLKKV
              160       170       180       190       200       210

          230       240       250       260       270        280
pF1KE0 DEWMAKEGLLDPNVKSIFVTCGDWDLKVMLPGQCQYLGLPVADYFKQWINLKKAY-SFAM
        .:: :   :  . :  ..: :.::.. .:  :::   :    . :.:::..:.: .:
CCDS59 IDWM-KLKELGTKYKYSLLTDGSWDMSKFLNIQCQLSRLKYPPFAKKWINIRKSYGNFYK
               220       230       240       250       260

           290       300       310       320       330
pF1KE0 GCWPKNGLLDMNKGLSLQHIGRPHSGIDDCKNIANIMKTLAYRGFIFKQTSKPF
           .. :  : . :.... :::: :.:: :::: :   .   :
CCDS59 VPRSQTKLTIMLEKLGMDYDGRPHCGLDDSKNIARIAVRMLQDGCELRINEKMHAGQLMS
     270       280       290       300       310       320

CCDS59 VSSSLPIEGTPPPQMPHFRK
     330       340

337 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 17:45:20 2016 done: Thu Nov 3 17:45:20 2016
 Total Scan time: 2.840 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]