FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5884, 496 aa
1>>>pF1KB5884 496 - 496 aa - 496 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.5479+/-0.000928; mu= 5.3656+/- 0.056
mean_var=263.1873+/-54.220, 0's: 0 Z-trim(114.4): 11 B-trim: 245 in 2/50
Lambda= 0.079057
statistics sampled from 14972 (14981) to 14972 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.778), E-opt: 0.2 (0.46), width: 16
Scan time: 3.440
The best scores are:                                       opt  bits E(32554)
CCDS2553.1 NEU4 gene_id:129807|Hs108|chr2          ( 496) 3527 415.5 6.8e-116
CCDS54441.1 NEU4 gene_id:129807|Hs108|chr2         ( 497) 3515 414.1 1.8e-115
CCDS54442.1 NEU4 gene_id:129807|Hs108|chr2         ( 484) 3443 405.9 5.1e-113
CCDS44682.1 NEU3 gene_id:10825|Hs108|chr11         ( 461)  957 122.3 1.1e-27
CCDS2501.1 NEU2 gene_id:4759|Hs108|chr2            ( 380)  809 105.3 1.2e-22
>>CCDS2553.1 NEU4 gene_id:129807|Hs108|chr2 (496 aa)
initn: 3527 init1: 3527 opt: 3527 Z-score: 2194.1 bits: 415.5 E(32554): 6.8e-116
Smith-Waterman score: 3527; 100.0% identity (100.0% similar) in 496 aa overlap (1-496:1-496)
               10        20        30        40        50        60
pF1KB5 MMSSAAFPRWLSMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPDD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 MMSSAAFPRWLSMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPDD
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB5 SHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLGH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 SHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLGH
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB5 TPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 TPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQLP
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB5 SGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 SGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAAV
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB5 DGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPAP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 DGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPAP
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB5 NRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 NRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQPG
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KB5 PRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWTE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 PRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWTE
              370       380       390       400       410       420
              430       440       450       460       470       480
pF1KB5 PWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPASP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS25 PWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPASP
              430       440       450       460       470       480
              490
pF1KB5 KPPNLGDKPRGCCWPS
       ::::::::::::::::
CCDS25 KPPNLGDKPRGCCWPS
              490
>>CCDS54441.1 NEU4 gene_id:129807|Hs108|chr2 (497 aa)
initn: 3513 init1: 3448 opt: 3515 Z-score: 2186.7 bits: 414.1 E(32554): 1.8e-115
Smith-Waterman score: 3515; 99.8% identity (99.8% similar) in 497 aa overlap (1-496:1-497)
               10         20        30        40        50
pF1KB5 MMSSAAFPRWL-SMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPD
       ::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MMSSAAFPRWLQSMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPD
               10        20        30        40        50        60
      60        70        80        90       100       110
pF1KB5 DSHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DSHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLG
               70        80        90       100       110       120
     120       130       140       150       160       170
pF1KB5 HTPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 HTPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQL
              130       140       150       160       170       180
     180       190       200       210       220       230
pF1KB5 PSGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PSGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAA
              190       200       210       220       230       240
     240       250       260       270       280       290
pF1KB5 VDGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 VDGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPA
              250       260       270       280       290       300
     300       310       320       330       340       350
pF1KB5 PNRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PNRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQP
              310       320       330       340       350       360
     360       370       380       390       400       410
pF1KB5 GPRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GPRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWT
              370       380       390       400       410       420
     420       430       440       450       460       470
pF1KB5 EPWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 EPWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPAS
              430       440       450       460       470       480
     480       490
pF1KB5 PKPPNLGDKPRGCCWPS
       :::::::::::::::::
CCDS54 PKPPNLGDKPRGCCWPS
              490
>>CCDS54442.1 NEU4 gene_id:129807|Hs108|chr2 (484 aa)
initn: 3443 init1: 3443 opt: 3443 Z-score: 2142.5 bits: 405.9 E(32554): 5.1e-113
Smith-Waterman score: 3443; 100.0% identity (100.0% similar) in 484 aa overlap (13-496:1-484)
               10        20        30        40        50        60
pF1KB5 MMSSAAFPRWLSMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPDD
                   ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54             MGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPDD
                           10        20        30        40
               70        80        90       100       110       120
pF1KB5 SHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLGH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLGH
       50        60        70        80        90       100
              130       140       150       160       170       180
pF1KB5 TPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 TPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQLP
      110       120       130       140       150       160
              190       200       210       220       230       240
pF1KB5 SGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SGRLLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAAV
      170       180       190       200       210       220
              250       260       270       280       290       300
pF1KB5 DGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPAP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPAP
      230       240       250       260       270       280
              310       320       330       340       350       360
pF1KB5 NRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 NRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQPG
      290       300       310       320       330       340
              370       380       390       400       410       420
pF1KB5 PRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWTE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWTE
      350       360       370       380       390       400
              430       440       450       460       470       480
pF1KB5 PWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPASP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPASP
      410       420       430       440       450       460
              490
pF1KB5 KPPNLGDKPRGCCWPS
       ::::::::::::::::
CCDS54 KPPNLGDKPRGCCWPS
      470       480
>>CCDS44682.1 NEU3 gene_id:10825|Hs108|chr11 (461 aa)
initn: 1160 init1: 657 opt: 957 Z-score: 610.3 bits: 122.3 E(32554): 1.1e-27
Smith-Waterman score: 1154; 42.6% identity (61.2% similar) in 469 aa overlap (24-489:46-455)
10 20 30 40 50
pF1KB5 MMSSAAFPRWLSMGVPRTPSRTVLFERERT-GLTYRVPSLLPVPPGPTLLAFV
::..: :.:::.:.:: .:: :.:::.
CCDS44 ASSSAPTETEEPGSSAEVMEEVTTCSFNSPLFRQEDDRGITYRIPALLYIPPTHTFLAFA
20 30 40 50 60 70
60 70 80 90 100 110
pF1KB5 EQRLSPDDSHAHRLVLRRGTLAGGSVRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFL
:.: . : : .:::::: : :.:: :. : :.: ::.:::::: . .: :::
CCDS44 EKRSTRRDEDALHLVLRRGLRIGQLVQWGPLKPLMEATLPGHRTMNPCPVWEQKSGCVFL
80 90 100 110 120 130
120 130 140 150 160 170
pF1KB5 FFIAVLGHTPEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVG
::: : ::. : ::..:::::::: . :.::: ::. .::::::.::. .. :::::::
CCDS44 FFICVRGHVTERQQIVSGRNAARLCFIYSQDAGCSWSEVRDLTEEVIGSELKHWATFAVG
140 150 160 170 180 190
180 190 200 210 220 230
pF1KB5 PGHGVQLPSGRLLVPAYTYRVDRRE-CFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLR
::::.:: ::::..::::: . :: :.: :::. .:::: : ::. : :. .
CCDS44 PGHGIQLQSGRLVIPAYTYYIPSWFFCFQLPCKTRPHSLMIYSDDLGVTWHHGRLIRPMV
200 210 220 230 240 250
240 250 260 270 280 290
pF1KB5 SGECQLAAVDGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETAWGCQGS
. ::..: : : . :::.::.: :..:::::.: .: .: : :::::
CCDS44 TVECEVAEVTGRAGHPVLYCSARTPNRCRAEALSTDHGEGFQRLALSRQLCEPPHGCQGS
260 270 280 290 300 310
300 310 320 330 340 350
pF1KB5 IVGF-PAPAPNRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQ
.:.: : :.: .:.: :.
CCDS44 VVSFRPLEIPHRCQDSS---------------------------------------SK--
320 330
360 370 380 390 400 410
pF1KB5 PRGDGPRQPGPRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQ
:.: :: : : . : : .:::::::..:. :. .:: :.:
CCDS44 ---DAPTIQQSSPGSS---------LRLEEEAGTPSESWLLYSHPTSRKQRVDLGIYLNQ
340 350 360 370 380
420 430 440 450 460 470
pF1KB5 SPLDPRSWTEPWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLR
.::. :..::... :: ::::::.. : :: :.::.: :.. ..:.: :. :
CCDS44 TPLEAACWSRPWILHCGPCGYSDLAAL---EEEGL-FGCLFECGTKQECEQIAFRLFTHR
390 400 410 420 430
480 490
pF1KB5 EVLENVPASPKPPNLGDKPRGCCWPS
:.: .. .. : : .:
CCDS44 EILSHLQGDCTSP--GRNPSQFKSN
440 450 460
>>CCDS2501.1 NEU2 gene_id:4759|Hs108|chr2 (380 aa)
initn: 966 init1: 363 opt: 809 Z-score: 520.1 bits: 105.3 E(32554): 1.2e-22
Smith-Waterman score: 880; 38.8% identity (56.6% similar) in 454 aa overlap (34-482:20-379)
10 20 30 40 50 60
pF1KB5 SAAFPRWLSMGVPRTPSRTVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPDDSHA
::.:.:: .: .::::.::: : : ::
CCDS25 MASLPVLQKESVFQSGAHAYRIPALLYLPGQQSLLAFAEQRASKKDEHA
10 20 30 40
70 80 90 100 110 120
pF1KB5 HRLVLRRGTLAGGS--VRWGALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLGHT
. .::::: . . :.: : .:.. : : ::::::::..:: :::.::::::. :..
CCDS25 ELIVLRRGDYDAPTHQVQWQAQEVVAQARLDGHRSMNPCPLYDAQTGTLFLFFIAIPGQV
50 60 70 80 90 100
130 140 150 160 170 180
pF1KB5 PEAVQIATGRNAARLCCVASRDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQLPS
: :. : :..::: :.: : : .:.: ::::. ::: : ..:.:::::::: .:: .
CCDS25 TEQQQLQTRANVTRLCQVTSTDHGRTWSSPRDLTDAAIGPAYREWSTFAVGPGHCLQLHD
110 120 130 140 150 160
190 200 210 220 230
pF1KB5 -GR-LLVPAYTYRVDRRECFGKICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAA
.: :.::::.:: . : : : .: : : :::::: : .: . . :::.:
CCDS25 RARSLVVPAYAYRK-----LHPIQRPIPSAFCFLSHDHGRTWARGHFVAQ-DTLECQVAE
170 180 190 200 210 220
240 250 260 270 280 290
pF1KB5 VDGGQAGSFLYCNARSPLGSRVQALSTDEGTSFLPAERVASLPETA-WGCQGSIVGFPAP
:. :. . :::: : .:::: ::..: .: .. : .: : :::::...::.:
CCDS25 VETGEQ-RVVTLNARSHLRARVQAQSTNDGLDFQESQLVKKLVEPPPQGCQGSVISFPSP
230 240 250 260 270 280
300 310 320 330 340 350
pF1KB5 APNRPRDDSWSVGPGSPLQPPLLGPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQ
:. ::::: :
CCDS25 -----RS-----GPGSPAQ-----------------------------------------
290
360 370 380 390 400 410
pF1KB5 PGPRPGVSGDVGSWTLALPMPFAAPPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSW
::::.::. : .: :. : :..:
CCDS25 ------------------------------WLLYTHPTHSWQRADLGAYLNPRPPAPEAW
300 310 320
420 430 440 450 460 470
pF1KB5 TEPWVIYEGPSGYSDLASIGPAPEGGLVFACLYESGARTSYDEISFCTFSLREVLENVPA
.:: .. .: .:::: :.: .:.:. .:.::::.. .:.:: : :.:.... ::
CCDS25 SEPVLLAKGSCAYSDLQSMGTGPDGSPLFGCLYEAN---DYEEIVFLMFTLKQAF---PA
330 340 350 360 370
480 490
pF1KB5 SPKPPNLGDKPRGCCWPS
:
CCDS25 EYLPQ
380
496 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 21:06:06 2016 done: Sat Nov 5 21:06:06 2016
Total Scan time: 3.440 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]