FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8193, 119 aa
1>>>pF1KB8193 119 - 119 aa - 119 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.5633+/-0.000465; mu= 11.0117+/- 0.028
mean_var=76.8290+/-15.226, 0's: 0 Z-trim(116.9): 14 B-trim: 0 in 0/52
Lambda= 0.146323
statistics sampled from 17587 (17600) to 17587 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.858), E-opt: 0.2 (0.541), width: 16
Scan time: 1.870
The best scores are:                              opt  bits E(32554)
CCDS237.1 ID3 gene_id:3399|Hs108|chr1     ( 119)  807 177.9 1.3e-45
CCDS13186.1 ID1 gene_id:3397|Hs108|chr20  ( 149)  275  65.7 9.7e-12
CCDS13185.1 ID1 gene_id:3397|Hs108|chr20  ( 155)  275  65.7 1e-11
CCDS1659.1 ID2 gene_id:3398|Hs108|chr2    ( 134)  272  65.0 1.4e-11
CCDS4544.1 ID4 gene_id:3400|Hs108|chr6    ( 161)  273  65.3 1.4e-11
>>CCDS237.1 ID3 gene_id:3399|Hs108|chr1 (119 aa)
initn: 807 init1: 807 opt: 807 Z-score: 932.6 bits: 177.9 E(32554): 1.3e-45
Smith-Waterman score: 807; 99.2% identity (100.0% similar) in 119 aa overlap (1-119:1-119)
               10        20        30        40        50        60
pF1KB8 MKALSPVRGCYEAVCCLSERSLAIARGRGKGPAAEEPLSLLDDMNHCYSRLRELVPGVPR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 MKALSPVRGCYEAVCCLSERSLAIARGRGKGPAAEEPLSLLDDMNHCYSRLRELVPGVPR
               10        20        30        40        50        60
               70        80        90       100       110
pF1KB8 GTQLSQVEILQRVIDYILDLQVVLAEPAPGPPDGPHLPIQTAELAPELVISNDKRSFCH
       ::::::::::::::::::::::::::::::::::::::::::::.::::::::::::::
CCDS23 GTQLSQVEILQRVIDYILDLQVVLAEPAPGPPDGPHLPIQTAELTPELVISNDKRSFCH
               70        80        90       100       110
>>CCDS13186.1 ID1 gene_id:3397|Hs108|chr20 (149 aa)
initn: 248 init1: 192 opt: 275 Z-score: 324.2 bits: 65.7 E(32554): 9.7e-12
Smith-Waterman score: 276; 49.1% identity (69.1% similar) in 110 aa overlap (1-100:19-126)
10 20 30 40
pF1KB8 MKALSPVRGCYEAVCCLSERSLAIARGRGKGPAAEEPLSLLD
.:: . . : :.: ::::.:.::.: : : .:. : .:::
CCDS13 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAG-GAGARLP-ALLD
10 20 30 40 50
50 60 70 80 90
pF1KB8 ---------DMNHCYSRLRELVPGVPRGTQLSQVEILQRVIDYILDLQVVL-AEPAPGPP
::: :::::.:::: .:.. ..:.:::::.::::: :::. : .: : :
CCDS13 EQQVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTP
60 70 80 90 100 110
100 110
pF1KB8 DGPHLPIQTAELAPELVISNDKRSFCH
: ::..
CCDS13 GGRGLPVRAPLSTLNGEISALTAEVRSRSDH
120 130 140
>>CCDS13185.1 ID1 gene_id:3397|Hs108|chr20 (155 aa)
initn: 248 init1: 192 opt: 275 Z-score: 324.0 bits: 65.7 E(32554): 1e-11
Smith-Waterman score: 276; 49.1% identity (69.1% similar) in 110 aa overlap (1-100:19-126)
10 20 30 40
pF1KB8 MKALSPVRGCYEAVCCLSERSLAIARGRGKGPAAEEPLSLLD
.:: . . : :.: ::::.:.::.: : : .:. : .:::
CCDS13 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAG-GAGARLP-ALLD
10 20 30 40 50
50 60 70 80 90
pF1KB8 ---------DMNHCYSRLRELVPGVPRGTQLSQVEILQRVIDYILDLQVVL-AEPAPGPP
::: :::::.:::: .:.. ..:.:::::.::::: :::. : .: : :
CCDS13 EQQVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTP
60 70 80 90 100 110
100 110
pF1KB8 DGPHLPIQTAELAPELVISNDKRSFCH
: ::..
CCDS13 GGRGLPVRAPLSTLNGEISALTAEAACVPADDRILCR
120 130 140 150
>>CCDS1659.1 ID2 gene_id:3398|Hs108|chr2 (134 aa)
initn: 283 init1: 203 opt: 272 Z-score: 321.5 bits: 65.0 E(32554): 1.4e-11
Smith-Waterman score: 288; 41.0% identity (70.5% similar) in 139 aa overlap (1-118:1-133)
10 20 30 40 50 60
pF1KB8 MKALSPVRGCYEAVCCLSERSLAIARGRGKGPAAEEPLSLLDDMNHCYSRLRELVPGVPR
:::.::::. . ::..::.:.:. : :. ..:.::: .:: :::.:.::::..:.
CCDS16 MKAFSPVRSVRKN--SLSDHSLGISRS--KTPV-DDPMSLLYNMNDCYSKLKELVPSIPQ
10 20 30 40 50
70 80 90
pF1KB8 GTQLSQVEILQRVIDYILDLQVVL-AEPA--------PGPPDGPHLPI------------
. ..:..::::.:::::::::..: ..:. :: .. . :.
CCDS16 NKKVSKMEILQHVIDYILDLQIALDSHPTIVSLHHQRPGQNQASRTPLTTLNTDISILSL
60 70 80 90 100 110
100 110
pF1KB8 QTAELAPELVISNDKRSFCH
:..:. :: .:::....:
CCDS16 QASEFPSEL-MSNDSKALCG
120 130
>>CCDS4544.1 ID4 gene_id:3400|Hs108|chr6 (161 aa)
initn: 301 init1: 194 opt: 273 Z-score: 321.4 bits: 65.3 E(32554): 1.4e-11
Smith-Waterman score: 286; 47.4% identity (60.7% similar) in 135 aa overlap (1-106:1-135)
10 20 30
pF1KB8 MKALSPVR--------GC---YEAVCCLSER--SL----------AIARGRGKGPAAEEP
:::.:::: :: :. ::.:. :: : :: .. ::.::
CCDS45 MKAVSPVRPSGRKAPSGCGGGELALRCLAEHGHSLGGSAAAAAAAAAARCKAAEAAADEP
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB8 -LSLLDDMNHCYSRLRELVPGVPRGTQLSQVEILQRVIDYILDLQVVL-AEPA----PGP
: : ::: ::::::.::: .: . ..:.:::::.:::::::::..: ..:: : :
CCDS45 ALCLQCDMNDCYSRLRRLVPTIPPNKKVSKVEILQHVIDYILDLQLALETHPALLRQPPP
70 80 90 100 110 120
100 110
pF1KB8 PDGPHLPIQTAELAPELVISNDKRSFCH
: :: : : ::
CCDS45 PAPPHHPAGTCPAAPPRTPLTALNTDPAGAVNKQGDSILCR
130 140 150 160
119 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 19:50:35 2016 done: Fri Nov 4 19:50:36 2016
Total Scan time: 1.870 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]