FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4605, 472 aa
1>>>pF1KB4605 472 - 472 aa - 472 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1975+/-0.000895; mu= 18.4447+/- 0.054
mean_var=63.7694+/-12.608, 0's: 0 Z-trim(105.1): 24 B-trim: 0 in 0/51
Lambda= 0.160608
statistics sampled from 8222 (8236) to 8222 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.629), E-opt: 0.2 (0.253), width: 16
Scan time: 2.330
The best scores are:                                 opt  bits E(32554)
CCDS14210.1 EIF2S3 gene_id:1968|Hs108|chrX   ( 472) 3080 722.6 2.2e-208
CCDS33849.1 EEFSEC gene_id:60678|Hs108|chr3  ( 596)  278  73.4 7.5e-13
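
The best-scores rows above have a regular layout (description, library sequence length in parentheses, then opt, bits, and E-value), so they can be pulled apart with a regular expression. The following is a minimal Python sketch, not part of the FASTA package: the pattern `HIT_LINE` and the helper `parse_hit_line` are hypothetical names, and the pattern assumes the description itself contains no parenthesised number.

```python
import re

# Hypothetical pattern for one row of the "best scores" table:
# description, "( length)", opt score, bit score, E-value.
HIT_LINE = re.compile(
    r"^(?P<description>.+?)\s*\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)


def parse_hit_line(line):
    """Return a dict for one best-scores row, or None if the line is not a row."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    description = m.group("description")
    return {
        "accession": description.split()[0],  # first token of the description
        "description": description,
        "length": int(m.group("length")),
        "opt": int(m.group("opt")),
        "bits": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }


# The two rows reported in this search.
rows = [
    "CCDS14210.1 EIF2S3 gene_id:1968|Hs108|chrX ( 472) 3080 722.6 2.2e-208",
    "CCDS33849.1 EEFSEC gene_id:60678|Hs108|chr3 ( 596) 278 73.4 7.5e-13",
]
hits = [parse_hit_line(r) for r in rows]
for hit in hits:
    print(hit["accession"], hit["opt"], hit["bits"], hit["evalue"])
```

The table header line does not match the pattern, so `parse_hit_line` returns `None` for it and it can be skipped when scanning a whole report.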
>>CCDS14210.1 EIF2S3 gene_id:1968|Hs108|chrX (472 aa)
initn: 3080 init1: 3080 opt: 3080 Z-score: 3854.5 bits: 722.6 E(32554): 2.2e-208
Smith-Waterman score: 3080; 100.0% identity (100.0% similar) in 472 aa overlap (1-472:1-472)
               10        20        30        40        50        60
pF1KB4 MAGGEAGVTLGQPHLSRQDLTTLDVTKLTPLSHEVISRQATINIGTIGHVAHGKSTVVKA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MAGGEAGVTLGQPHLSRQDLTTLDVTKLTPLSHEVISRQATINIGTIGHVAHGKSTVVKA
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB4 ISGVHTVRFKNELERNITIKLGYANAKIYKLDDPSCPRPECYRSCGSSTPDEFPTDIPGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 ISGVHTVRFKNELERNITIKLGYANAKIYKLDDPSCPRPECYRSCGSSTPDEFPTDIPGT
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB4 KGNFKLVRHVSFVDCPGHDILMATMLNGAAVMDAALLLIAGNESCPQPQTSEHLAAIEIM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KGNFKLVRHVSFVDCPGHDILMATMLNGAAVMDAALLLIAGNESCPQPQTSEHLAAIEIM
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB4 KLKHILILQNKIDLVKESQAKEQYEQILAFVQGTVAEGAPIIPISAQLKYNIEVVCEYIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KLKHILILQNKIDLVKESQAKEQYEQILAFVQGTVAEGAPIIPISAQLKYNIEVVCEYIV
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB4 KKIPVPPRDFTSEPRLIVIRSFDVNKPGCEVDDLKGGVAGGSILKGVLKVGQEIEVRPGI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KKIPVPPRDFTSEPRLIVIRSFDVNKPGCEVDDLKGGVAGGSILKGVLKVGQEIEVRPGI
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB4 VSKDSEGKLMCKPIFSKIVSLFAEHNDLQYAAPGGLIGVGTKIDPTLCRADRMVGQVLGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 VSKDSEGKLMCKPIFSKIVSLFAEHNDLQYAAPGGLIGVGTKIDPTLCRADRMVGQVLGA
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KB4 VGALPEIFTELEISYFLLRRLLGVRTEGDKKAAKVQKLSKNEVLMVNIGSLSTGGRVSAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 VGALPEIFTELEISYFLLRRLLGVRTEGDKKAAKVQKLSKNEVLMVNIGSLSTGGRVSAV
              370       380       390       400       410       420
              430       440       450       460       470
pF1KB4 KADLGKIVLTNPVCTEVGEKIALSRRVEKHWRLIGWGQIRRGVTIKPTVDDD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KADLGKIVLTNPVCTEVGEKIALSRRVEKHWRLIGWGQIRRGVTIKPTVDDD
              430       440       450       460       470
>>CCDS33849.1 EEFSEC gene_id:60678|Hs108|chr3 (596 aa)
initn: 273 init1: 109 opt: 278 Z-score: 344.2 bits: 73.4 E(32554): 7.5e-13
Smith-Waterman score: 337; 28.3% identity (55.0% similar) in 329 aa overlap (38-347:4-297)
10 20 30 40 50 60
pF1KB4 VTLGQPHLSRQDLTTLDVTKLTPLSHEVISRQATINIGTIGHVAHGKSTVVKAISGV-HT
:....:.:..::. ::.....:.: . :
CCDS33 MAGRRVNVNVGVLGHIDSGKTALARALSTTAST
10 20 30
70 80 90 100 110 120
pF1KB4 VRFKNE---LERNITIKLGYANAKIYKLDDPSCPRPECYRSCGSSTPDEFPTDIPGTKGN
. : .. ::.::. ::.. : : : :: : : :: . .
CCDS33 AAFDKQPQSRERGITLDLGFSCF--------SVPLPARLRS---SLP-EFQAAPEAEPEP
40 50 60 70 80
130 140 150 160 170 180
pF1KB4 FKLVRHVSFVDCPGHDILMATMLNGAAVMDAALLLIAGNESCPQPQTSEHLAAIEIMKLK
. . .:..:::::: :. :...:: ..: .:.: ... : :..: :. .: :
CCDS33 GEPLLQVTLVDCPGHASLIRTIIGGAQIIDLMMLVIDVTKGM-QTQSAECLVIGQIACQK
90 100 110 120 130 140
190 200 210 220
pF1KB4 HILILQNKIDLVKESQAKEQYEQILAFVQGTVAE----GAPIIPISAQ----------LK
...: :::::. :.. . ... .: :. . ::::::..:.
CCDS33 LVVVL-NKIDLLPEGKRQAAIDKMTKKMQKTLENTKFRGAPIIPVAAKPGGPEAPETEAP
150 160 170 180 190
230 240 250 260 270 280
pF1KB4 YNIEVVCEYIVKKIPVPPRDFTSEPRLIVIRSFDVNKPGCEVDDLKGGVAGGSILKGVLK
.: . : ....: .: :: : : :. : : : .: : :.::.: ..
CCDS33 QGIPELIELLTSQISIPTRD-PSGPFLM---SVD----HCFSIKGQGTVMTGTILSGSIS
200 210 220 230 240 250
290 300 310 320 330 340
pF1KB4 VGQEIEVRPGIVSKDSEGKLMCKPIFSKIVSLFAEHNDLQYAAPGGLIGVG-TKIDPTLC
.:. .:. :.. . .:. :. : . : : .:. :..:: :
CCDS33 LGDSVEI-PAL------------KVVKKVKSMQMFHMPITSAMQGDRLGICVTQFDPKLL
260 270 280 290
350 360 370 380 390 400
pF1KB4 RADRMVGQVLGAVGALPEIFTELEISYFLLRRLLGVRTEGDKKAAKVQKLSKNEVLMVNI
CCDS33 ERGLVCAPESLHTVHAALISVEKIPYFRGPLQTKAKFHITVGHETVMGRLMFFSPAPDNF
300 310 320 330 340 350
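
Each alignment above is introduced by a "Smith-Waterman score" line giving the score, percent identity, percent similarity, overlap length, and the query:library coordinate ranges. The following is a minimal Python sketch for extracting those fields. The names `SW_LINE` and `parse_sw_line` are hypothetical and not part of FASTA, and the pattern assumes a protein search (the "aa overlap" wording seen in this report).

```python
import re

# Hypothetical pattern for the per-alignment summary line.
SW_LINE = re.compile(
    r"Smith-Waterman score:\s*(?P<score>\d+);\s*"
    r"(?P<identity>[\d.]+)% identity\s*\((?P<similar>[\d.]+)% similar\)\s*"
    r"in (?P<overlap>\d+) aa overlap\s*"
    r"\((?P<q_start>\d+)-(?P<q_end>\d+):(?P<s_start>\d+)-(?P<s_end>\d+)\)"
)


def parse_sw_line(line):
    """Return the fields of a Smith-Waterman summary line, or None if absent."""
    m = SW_LINE.search(line)
    if m is None:
        return None
    rec = {
        "score": int(m.group("score")),
        "identity": float(m.group("identity")),
        "similar": float(m.group("similar")),
        "overlap": int(m.group("overlap")),
        "query": (int(m.group("q_start")), int(m.group("q_end"))),
        "library": (int(m.group("s_start")), int(m.group("s_end"))),
    }
    # Number of residues covered on each sequence (inclusive coordinates).
    rec["query_span"] = rec["query"][1] - rec["query"][0] + 1
    rec["library_span"] = rec["library"][1] - rec["library"][0] + 1
    return rec


line = ("Smith-Waterman score: 337; 28.3% identity (55.0% similar) "
        "in 329 aa overlap (38-347:4-297)")
rec = parse_sw_line(line)
print(rec["query_span"], rec["library_span"])  # prints "310 294"
```

For the EEFSEC hit, both spans are shorter than the reported 329 aa overlap, which is consistent with the overlap counting gapped alignment columns; that reading is an interpretation of the numbers here, not something the report states.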
472 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 14:28:44 2016 done: Mon Nov 7 14:28:45 2016
Total Scan time: 2.330 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]