FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0284, 337 aa
1>>>pF1KE0284 337 - 337 aa - 337 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9564+/-0.000728; mu= 14.2207+/- 0.044
mean_var=94.8082+/-19.035, 0's: 0 Z-trim(112.0): 7 B-trim: 3 in 1/50
Lambda= 0.131720
statistics sampled from 12861 (12867) to 12861 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.757), E-opt: 0.2 (0.395), width: 16
Scan time: 2.840
The best scores are:                                      opt bits  E(32554)
CCDS30696.1 ERI3 gene_id:79033|Hs108|chr1         ( 337) 2391 464.0 7.5e-131
CCDS5972.1 ERI1 gene_id:90459|Hs108|chr8          ( 349)  467  98.4    9e-21
>>CCDS30696.1 ERI3 gene_id:79033|Hs108|chr1 (337 aa)
initn: 2391 init1: 2391 opt: 2391 Z-score: 2462.6 bits: 464.0 E(32554): 7.5e-131
Smith-Waterman score: 2391; 100.0% identity (100.0% similar) in 337 aa overlap (1-337:1-337)
               10        20        30        40        50        60
pF1KE0 MATASPAADGGRGRPWEGGLVSWPPAPPLTLPWTWMGPSWGQHPGHWGFPALTEPSASPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 MATASPAADGGRGRPWEGGLVSWPPAPPLTLPWTWMGPSWGQHPGHWGFPALTEPSASPA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 AGLGIFEVRRVLDASGCSMLAPLQTGAARFSSYLLSRARKVLGSHLFSPCGVPEFCSIST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 AGLGIFEVRRVLDASGCSMLAPLQTGAARFSSYLLSRARKVLGSHLFSPCGVPEFCSIST
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 RKLAAHGFGASMAAMVSFPPQRYHYFLVLDFEATCDKPQIHPQEIIEFPILKLNGRTMEI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 RKLAAHGFGASMAAMVSFPPQRYHYFLVLDFEATCDKPQIHPQEIIEFPILKLNGRTMEI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 ESTFHMYVQPVVHPQLTPFCTELTGIIQAMVDGQPSLQQVLERVDEWMAKEGLLDPNVKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 ESTFHMYVQPVVHPQLTPFCTELTGIIQAMVDGQPSLQQVLERVDEWMAKEGLLDPNVKS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 IFVTCGDWDLKVMLPGQCQYLGLPVADYFKQWINLKKAYSFAMGCWPKNGLLDMNKGLSL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 IFVTCGDWDLKVMLPGQCQYLGLPVADYFKQWINLKKAYSFAMGCWPKNGLLDMNKGLSL
              250       260       270       280       290       300

              310       320       330
pF1KE0 QHIGRPHSGIDDCKNIANIMKTLAYRGFIFKQTSKPF
       :::::::::::::::::::::::::::::::::::::
CCDS30 QHIGRPHSGIDDCKNIANIMKTLAYRGFIFKQTSKPF
              310       320       330
>>CCDS5972.1 ERI1 gene_id:90459|Hs108|chr8 (349 aa)
initn: 365 init1: 246 opt: 467 Z-score: 486.4 bits: 98.4 E(32554): 9e-21
Smith-Waterman score: 467; 40.7% identity (64.9% similar) in 194 aa overlap (137-327:121-313)
        110       120       130       140       150         160
pF1KE0 FSPCGVPEFCSISTRKLAAHGFGASMAAMVSFPPQRYHYFLVLDFEATCDK--PQIHPQE
                                     .:  . : :. ..::::::..  :    .:
CCDS59 FKLETRGVKDVLKKRLKNYYKKQKLMLKESNFADSYYDYICIIDFEATCEEGNPPEFVHE
              100       110       120       130       140       150

          170       180       190       200       210       220
pF1KE0 IIEFPILKLNGRTMEIESTFHMYVQPVVHPQLTPFCTELTGIIQAMVDGQPSLQQVLERV
       :::::.. :: .:.:::.::..::.: .. ::. ::  :::: : .::   .. :::..:
CCDS59 IIEFPVVLLNTHTLEIEDTFQQYVRPEINTQLSDFCISLTGITQDQVDRADTFPQVLKKV
              160       170       180       190       200       210

          230       240       250       260       270        280
pF1KE0 DEWMAKEGLLDPNVKSIFVTCGDWDLKVMLPGQCQYLGLPVADYFKQWINLKKAY-SFAM
        .:: :   :  . :  ..: :.::.. .:  :::   :    . :.:::..:.: .:
CCDS59 IDWM-KLKELGTKYKYSLLTDGSWDMSKFLNIQCQLSRLKYPPFAKKWINIRKSYGNFYK
               220       230       240       250       260

           290       300       310       320       330
pF1KE0 GCWPKNGLLDMNKGLSLQHIGRPHSGIDDCKNIANIMKTLAYRGFIFKQTSKPF
           .. :  : . :.... :::: :.:: :::: :   .   :
CCDS59 VPRSQTKLTIMLEKLGMDYDGRPHCGLDDSKNIARIAVRMLQDGCELRINEKMHAGQLMS
     270       280       290       300       310       320

CCDS59 VSSSLPIEGTPPPQMPHFRK
     330       340
337 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 17:45:20 2016 done: Thu Nov 3 17:45:20 2016
Total Scan time: 2.840 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]
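
The per-hit header lines in the report above (the `>>` line, the `initn:` line and the `Smith-Waterman score:` line) follow a regular layout, so they can be pulled into records with a few regular expressions. The following is a minimal sketch in Python, not part of the FASTA package: the function name `parse_hits` and the dictionary keys are hypothetical, and the patterns assume the exact field order printed in this report.

```python
import re

# ">>ID description (N aa)"; the lookahead skips ">>>query" lines.
HIT_RE = re.compile(r"^>>(?!>)(?P<id>\S+)\s+(?P<desc>.*?)\s+\((?P<length>\d+) aa\)\s*$")

# "... opt: 467 Z-score: 486.4 bits: 98.4 E(32554): 9e-21"
SCORE_RE = re.compile(
    r"opt:\s*(?P<opt>\d+)\s+Z-score:\s*(?P<z>[\d.]+)\s+"
    r"bits:\s*(?P<bits>[\d.]+)\s+E\(\d+\):\s*(?P<evalue>\S+)"
)

# "Smith-Waterman score: 467; 40.7% identity (64.9% similar) in 194 aa overlap (137-327:121-313)"
SW_RE = re.compile(
    r"Smith-Waterman score:\s*(?P<sw>\d+);\s*(?P<ident>[\d.]+)% identity\s+"
    r"\((?P<sim>[\d.]+)% similar\) in (?P<overlap>\d+) aa overlap\s+"
    r"\((?P<qs>\d+)-(?P<qe>\d+):(?P<ss>\d+)-(?P<se>\d+)\)"
)


def parse_hits(report):
    """Return one dict per '>>' hit block found in a FASTA text report."""
    hits = []
    current = None
    for raw in report.splitlines():
        line = raw.strip()
        m = HIT_RE.match(line)
        if m:
            current = {
                "id": m["id"],
                "description": m["desc"],
                "length": int(m["length"]),
            }
            hits.append(current)
            continue
        if current is None:
            continue  # still in the report preamble
        m = SCORE_RE.search(line)
        if m:
            current.update(
                opt=int(m["opt"]),
                z_score=float(m["z"]),
                bits=float(m["bits"]),
                evalue=float(m["evalue"]),
            )
            continue
        m = SW_RE.search(line)
        if m:
            current.update(
                sw_score=int(m["sw"]),
                identity=float(m["ident"]),
                similarity=float(m["sim"]),
                overlap=int(m["overlap"]),
                query_range=(int(m["qs"]), int(m["qe"])),
                subject_range=(int(m["ss"]), int(m["se"])),
            )
    return hits


if __name__ == "__main__":
    sample = """\
>>CCDS5972.1 ERI1 gene_id:90459|Hs108|chr8 (349 aa)
initn: 365 init1: 246 opt: 467 Z-score: 486.4 bits: 98.4 E(32554): 9e-21
Smith-Waterman score: 467; 40.7% identity (64.9% similar) in 194 aa overlap (137-327:121-313)
"""
    for hit in parse_hits(sample):
        print(hit["id"], hit["identity"], hit["query_range"], hit["evalue"])
```

Matching on the field labels rather than on column positions keeps the sketch tolerant of differences in spacing between reports.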