FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8390, 398 aa
1>>>pF1KB8390 398 - 398 aa - 398 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 13.5992+/-0.0011; mu= -17.7039+/- 0.065
mean_var=653.4952+/-141.233, 0's: 0 Z-trim(118.2): 554 B-trim: 1167 in 1/52
Lambda= 0.050171
statistics sampled from 18458 (19154) to 18458 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.841), E-opt: 0.2 (0.588), width: 16
Scan time: 3.340
The best scores are:                               opt  bits E(32554)
CCDS33322.1 SP5 gene_id:389058|Hs108|chr2  ( 398) 2847 220.3  2.4e-57
CCDS5373.1 SP4 gene_id:6671|Hs108|chr7     ( 784)  746  68.6  2.3e-11
>>CCDS33322.1 SP5 gene_id:389058|Hs108|chr2 (398 aa)
initn: 2847 init1: 2847 opt: 2847 Z-score: 1142.9 bits: 220.3 E(32554): 2.4e-57
Smith-Waterman score: 2847; 100.0% identity (100.0% similar) in 398 aa overlap (1-398:1-398)
               10        20        30        40        50        60
pF1KB8 MAAVAVLRNDSLQAFLQDRTPSASPDLGKHSPLALLAATCSRIGQPGAAAPPDFLQVPYD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MAAVAVLRNDSLQAFLQDRTPSASPDLGKHSPLALLAATCSRIGQPGAAAPPDFLQVPYD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 PALGSPSRLFHPWTADMPAHSPGALPPPHPSLGLTPQKTHLQPSFGAAHELPLTPPADPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 PALGSPSRLFHPWTADMPAHSPGALPPPHPSLGLTPQKTHLQPSFGAAHELPLTPPADPS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 YPYEFSPVKMLPSSMAALPASCAPAYVPYAAQAALPPGYSNLLPPPPPPPPPPTCRQLSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 YPYEFSPVKMLPSSMAALPASCAPAYVPYAAQAALPPGYSNLLPPPPPPPPPPTCRQLSP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 NPAPDDLPWWSIPQAGAGPGASGVPGSGLSGACAGAPHAPRFPASAAAAAAAAAALQRGL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 NPAPDDLPWWSIPQAGAGPGASGVPGSGLSGACAGAPHAPRFPASAAAAAAAAAALQRGL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 VLGPSDFAQYQSQIAALLQTKAPLAATARRCRRCRCPNCQAAGGAPEAEPGKKKQHVCHV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 VLGPSDFAQYQSQIAALLQTKAPLAATARRCRRCRCPNCQAAGGAPEAEPGKKKQHVCHV
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB8 PGCGKVYGKTSHLKAHLRWHTGERPFVCNWLFCGKSFTRSDELQRHLRTHTGEKRFACPE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 PGCGKVYGKTSHLKAHLRWHTGERPFVCNWLFCGKSFTRSDELQRHLRTHTGEKRFACPE
              310       320       330       340       350       360

              370       380       390
pF1KB8 CGKRFMRSDHLAKHVKTHQNKKLKVAEAGVKREDARDL
       ::::::::::::::::::::::::::::::::::::::
CCDS33 CGKRFMRSDHLAKHVKTHQNKKLKVAEAGVKREDARDL
              370       380       390
>>CCDS5373.1 SP4 gene_id:6671|Hs108|chr7 (784 aa)
initn: 812 init1: 705 opt: 746 Z-score: 317.4 bits: 68.6 E(32554): 2.3e-11
Smith-Waterman score: 758; 38.8% identity (59.9% similar) in 374 aa overlap (37-382:366-733)
10 20 30 40 50 60
pF1KB8 LRNDSLQAFLQDRTPSASPDLGKHSPLALLAATCSRIGQPGAAAPPDFLQVPYDPA--LG
::: :. .: .. :. .: : . :
CCDS53 DTLVSSADTGQYASTSASSSERTIEESQTPAATESE-AQSSSQLQPNGMQNAQDQSNSLQ
340 350 360 370 380 390
70 80 90 100 110
pF1KB8 SPSRLFHPWTADMPAHSPG-----ALPPPHPSL--GLTPQKTHLQP-------SFGAAHE
. . . .: .. ..: :.:: .: : : : . :: . . ..
CCDS53 QVQIVGQPILQQIQIQQPQQQIIQAIPPQSFQLQSGQTIQTIQQQPLQNVQLQAVNPTQV
400 410 420 430 440 450
120 130 140 150 160
pF1KB8 LPLTPPADPSYPYEFSPVKMLP-SSMAALPASCAPAYVPYAAQAALPPGYSNLLPPPP--
: .: :: .. :.. .:.. : .. : . . : ..: :
CCDS53 LIRAPTLTPSGQISWQTVQVQNIQSLSNLQVQNAGLSQQLTITPVSSSGGTTLAQIAPVA
460 470 480 490 500 510
170 180 190 200 210 220
pF1KB8 --PPPPPPTCRQLSPNPAPDDLPWWSIPQAGA-GPGASGVPGS--GLSGACAGAPHAPRF
: . ::. : .: :. . :: : ..::: . ...: : .
CCDS53 VAGAPITLNTAQLASVP---NLQTVSVANLGAAGVQVQGVPVTITSVAGQQQGQDGVKVQ
520 530 540 550 560 570
230 240 250 260 270
pF1KB8 PASAAAAAAAAAALQRGLV--LGPSDFAQYQSQIAALLQTKAPLAATARRCRR--CRCPN
:. : ...:.... . . ..:....: . : . ::. . ..: :: : :::
CCDS53 QATIAPVTVAVGGIANATIGAVSPDQLTQVHLQQGQ--QTSDQEVQPGKRLRRVACSCPN
580 590 600 610 620
280 290 300 310 320 330
pF1KB8 CQAAGGAPEAEPGKKKQHVCHVPGCGKVYGKTSHLKAHLRWHTGERPFVCNWLFCGKSFT
:. . : ::::::::.::. ::::::::::::.::::::::::::.:::.:::: ::
CCDS53 CREGEGRGSNEPGKKKQHICHIEGCGKVYGKTSHLRAHLRWHTGERPFICNWMFCGKRFT
630 640 650 660 670 680
340 350 360 370 380 390
pF1KB8 RSDELQRHLRTHTGEKRFACPECGKRFMRSDHLAKHVKTHQNKKLKVAEAGVKREDARDL
:::::::: ::::::::: ::::.:::::::::.::::::::::
CCDS53 RSDELQRHRRTHTGEKRFECPECSKRFMRSDHLSKHVKTHQNKKGGGTALAIVTSGELDS
690 700 710 720 730 740
CCDS53 SVTEVLGSPRIVTVAAISQDSNPATPNVSTNMEEF
750 760 770 780
398 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 12:29:54 2016 done: Fri Nov 4 12:29:54 2016
Total Scan time: 3.340 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]