FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5716, 607 aa
1>>>pF1KE5716 607 - 607 aa - 607 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.2896+/-0.000825; mu= 10.8627+/- 0.050
mean_var=139.5336+/-27.761, 0's: 0 Z-trim(112.3): 13 B-trim: 0 in 0/52
Lambda= 0.108576
statistics sampled from 13083 (13094) to 13083 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.746), E-opt: 0.2 (0.402), width: 16
Scan time: 2.850
The best scores are: opt bits E(32554)
CCDS4680.1 GNL1 gene_id:2794|Hs108|chr6 ( 607) 4131 658.7 6.1e-189
CCDS33922.1 LSG1 gene_id:55341|Hs108|chr3 ( 658) 426 78.4 3.3e-14
>>CCDS4680.1 GNL1 gene_id:2794|Hs108|chr6 (607 aa)
initn: 4131 init1: 4131 opt: 4131 Z-score: 3505.5 bits: 658.7 E(32554): 6.1e-189
Smith-Waterman score: 4131; 100.0% identity (100.0% similar) in 607 aa overlap (1-607:1-607)
10 20 30 40 50 60
pF1KE5 MPRKKPFSVKQKKKQLQDKRERKRGLQDGLRSSSNSRSGSRERREEQTDTSDGESVTHHI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 MPRKKPFSVKQKKKQLQDKRERKRGLQDGLRSSSNSRSGSRERREEQTDTSDGESVTHHI
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 RRLNQQPSQGLGPRGYDPNRYRLHFERDSREEVERRKRAAREQVLQPVSAELLELDIREV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 RRLNQQPSQGLGPRGYDPNRYRLHFERDSREEVERRKRAAREQVLQPVSAELLELDIREV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 YQPGSVLDFPRRPPWSYEMSKEQLMSQEERSFQDYLGKIHGAYSSEKLSYFEHNLETWRQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 YQPGSVLDFPRRPPWSYEMSKEQLMSQEERSFQDYLGKIHGAYSSEKLSYFEHNLETWRQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 LWRVLEMSDIVLLITDIRHPVVNFPPALYEYVTGELGLALVLVLNKVDLAPPALVVAWKH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 LWRVLEMSDIVLLITDIRHPVVNFPPALYEYVTGELGLALVLVLNKVDLAPPALVVAWKH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 YFHQHYPQLHVVLFTSFPRDPRTPQDPSSVLKKSRRRGRGWTRALGPEQLLRACEAITVG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 YFHQHYPQLHVVLFTSFPRDPRTPQDPSSVLKKSRRRGRGWTRALGPEQLLRACEAITVG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE5 KVDLSSWREKIARDVAGATWGNGSGEEEEEEDGPAVLVEQQTDSAMEPTGPTQERYKDGV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 KVDLSSWREKIARDVAGATWGNGSGEEEEEEDGPAVLVEQQTDSAMEPTGPTQERYKDGV
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE5 VTIGCVGFPNVGKSSLINGLVGRKVVSVSRTPGHTRYFQTYFLTPSVKLCDCPGLIFPSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 VTIGCVGFPNVGKSSLINGLVGRKVVSVSRTPGHTRYFQTYFLTPSVKLCDCPGLIFPSL
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE5 LPRQLQVLAGIYPIAQIQEPYTAVGYLASRIPVQALLHLRHPEAEDPSAEHPWCAWDICE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 LPRQLQVLAGIYPIAQIQEPYTAVGYLASRIPVQALLHLRHPEAEDPSAEHPWCAWDICE
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE5 AWAEKRGYKTAKAARNDVYRAANSLLRLAVDGRLSLCFHPPGYSEQKGTWESHPETTELV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 AWAEKRGYKTAKAARNDVYRAANSLLRLAVDGRLSLCFHPPGYSEQKGTWESHPETTELV
490 500 510 520 530 540
550 560 570 580 590 600
pF1KE5 VLQGRVGPAGDEEEEEEEELSSSCEEEGEEDRDADEEGEGDEETPTSAPGSSLAGRNPYA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 VLQGRVGPAGDEEEEEEEELSSSCEEEGEEDRDADEEGEGDEETPTSAPGSSLAGRNPYA
550 560 570 580 590 600
pF1KE5 LLGEDEC
:::::::
CCDS46 LLGEDEC
>>CCDS33922.1 LSG1 gene_id:55341|Hs108|chr3 (658 aa)
initn: 559 init1: 302 opt: 426 Z-score: 368.5 bits: 78.4 E(32554): 3.3e-14
Smith-Waterman score: 602; 31.0% identity (55.2% similar) in 449 aa overlap (127-522:111-552)
100 110 120 130 140 150
pF1KE5 KRAAREQVLQPVSAELLELDIREVYQPGSVLDFPRRPPWSYEMSKEQLMSQEERSFQDYL
: .:::: :. . . :.: . :. .: ..
CCDS33 KFVPAEARTGLLSFEESQRIKKLHEENKQFLCIPRRPNWNQNTTPEELKQAEKDNFLEWR
90 100 110 120 130 140
160 170 180 190 200 210
pF1KE5 GKIHGAYSSEKL--SYFEHNLETWRQLWRVLEMSDIVLLITDIRHPVVNFPPALYEYVTG
.. .:: . ::.::. :::::::.: ::::. :.: :.:.. : ::
CCDS33 RQLVRLEEEQKLILTPFERNLDFWRQLWRVIERSDIVVQIVDARNPLLFRCEDLECYVKE
150 160 170 180 190 200
220 230 240 250 260 270
pF1KE5 -ELGLALVLVLNKVDLAPPALVVAWKHYFHQHYPQLHVVLFTSFPRDPRTPQDPSSVLKK
. . :...::.:: :: ::... ...:...... : . .: .
CCDS33 MDANKENVILINKADLLTAEQRSAWAMYFEKE--DVKVIFWSALA--GAIPLNGDSEEEA
210 220 230 240 250
280 290 300 310 320
pF1KE5 SRRRGRGWTRALGPEQLLRA----CEAITVGKVDLSSWREKIARDVAGATWGNGSGEEEE
.: .. : .: .. .: :. . : : :. . : . . . :::.
CCDS33 NRDDRQSNTTKFGHSSFDQAEISHSESEHLPARDSPSLSENPTTDEDDSEYEDCPEEEED
260 270 280 290 300 310
330 340 350
pF1KE5 ------EEDGP---------------------------------AVLVEQQTDSAMEPTG
::::: . :: .: .
CCDS33 DWQTCSEEDGPKEEDCSQDWKESSTADSEARSRKTPQKRQIHNFSHLVSKQELLELFKEL
320 330 340 350 360 370
360 370 380 390 400 410
pF1KE5 PTQERYKDGVVTIGCVGFPNVGKSSLINGLVGRKVVSVSRTPGHTRYFQTYFLTPSVKLC
: .. ::: .:.: ::.::::::: :: ..: : :::: :::::..::: .. :.. ::
CCDS33 HTGRKVKDGQLTVGLVGYPNVGKSSTINTIMGNKKVSVSATPGHTKHFQTLYVEPGLCLC
380 390 400 410 420 430
420 430 440 450 460
pF1KE5 DCPGLIFPSLLPRQLQVL-AGIYPIAQIQEPYTAVGYLASRIPVQAL-----LHLRHP-E
:::::..::.. . .. .:: :: :... :. . . :: ..: ... : :
CCDS33 DCPGLVMPSFVSTKAEMTCSGILPIDQMRDHVPPVSLVCQNIPRHVLEATYGINIITPRE
440 450 460 470 480 490
470 480 490 500 510 520
pF1KE5 AEDPSAEHPWCAWDICEAWAEKRGYKTAKAARNDVYRAANSLLRLAVDGRLSLCFHPPGY
::: ..: . .. :.. ::. ::.. . : :.: .:. :.:.: : :::
CCDS33 DEDP--HRPPTSEELLTAYGYMRGFMTAHG-QPDQPRSARYILKDYVSGKLLYCHPPPGR
500 510 520 530 540 550
530 540 550 560 570 580
pF1KE5 SEQKGTWESHPETTELVVLQGRVGPAGDEEEEEEEELSSSCEEEGEEDRDADEEGEGDEE
CCDS33 DPVTFQHQHQRLLENKMNSDEIKMQLGRNKKAKQIENIVDKTFFHQENVRALTKGVQAVM
560 570 580 590 600 610
607 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 06:00:59 2016 done: Tue Nov 8 06:01:00 2016
Total Scan time: 2.850 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]