FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB6845, 175 aa
1>>>pF1KB6845 175 - 175 aa - 175 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.4964+/-0.000646; mu= 17.4667+/- 0.039
mean_var=67.1507+/-13.375, 0's: 0 Z-trim(111.9): 94 B-trim: 401 in 2/51
Lambda= 0.156512
statistics sampled from 12666 (12775) to 12666 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.766), E-opt: 0.2 (0.392), width: 16
Scan time: 1.730
The best scores are:                                    opt  bits E(32554)
CCDS1965.1 REG3A gene_id:5068|Hs108|chr2        ( 175) 1236 287.0   4e-78
CCDS1962.1 REG3G gene_id:130120|Hs108|chr2      ( 175) 1094 254.9 1.8e-68
CCDS1964.1 REG1A gene_id:5967|Hs108|chr2        ( 166)  588 140.6 4.3e-34
CCDS1963.1 REG1B gene_id:5968|Hs108|chr2        ( 166)  564 135.2 1.9e-32
CCDS58714.1 REG3G gene_id:130120|Hs108|chr2     ( 129)  403  98.8 1.4e-21
CCDS906.1 REG4 gene_id:83998|Hs108|chr1         ( 158)  310  77.9 3.3e-15
CCDS81953.1 CLEC19A gene_id:728276|Hs108|chr16  ( 186)  257  66.0 1.5e-11
>>CCDS1965.1 REG3A gene_id:5068|Hs108|chr2 (175 aa)
initn: 1236 init1: 1236 opt: 1236 Z-score: 1516.0 bits: 287.0 E(32554): 4e-78
Smith-Waterman score: 1236; 100.0% identity (100.0% similar) in 175 aa overlap (1-175:1-175)
               10        20        30        40        50        60
pF1KB6 MLPPMALPSVSWMLLSCLMLLSQVQGEEPQRELPSARIRCPKGSKAYGSHCYALFLSPKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 MLPPMALPSVSWMLLSCLMLLSQVQGEEPQRELPSARIRCPKGSKAYGSHCYALFLSPKS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB6 WTDADLACQKRPSGNLVSVLSGAEGSFVSSLVKSIGNSYSYVWIGLHDPTQGTEPNGEGW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 WTDADLACQKRPSGNLVSVLSGAEGSFVSSLVKSIGNSYSYVWIGLHDPTQGTEPNGEGW
               70        80        90       100       110       120
              130       140       150       160       170
pF1KB6 EWSSSDVMNYFAWERNPSTISSPGHCASLSRSTAFLRWKDYNCNVRLPYVCKFTD
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 EWSSSDVMNYFAWERNPSTISSPGHCASLSRSTAFLRWKDYNCNVRLPYVCKFTD
              130       140       150       160       170
>>CCDS1962.1 REG3G gene_id:130120|Hs108|chr2 (175 aa)
initn: 1133 init1: 1094 opt: 1094 Z-score: 1342.7 bits: 254.9 E(32554): 1.8e-68
Smith-Waterman score: 1094; 85.1% identity (95.4% similar) in 175 aa overlap (1-175:1-175)
               10        20        30        40        50        60
pF1KB6 MLPPMALPSVSWMLLSCLMLLSQVQGEEPQRELPSARIRCPKGSKAYGSHCYALFLSPKS
       ::::::::::::::::::.:: :::::: :.:::: :: :::::::::: ::::::::::
CCDS19 MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB6 WTDADLACQKRPSGNLVSVLSGAEGSFVSSLVKSIGNSYSYVWIGLHDPTQGTEPNGEGW
       : ::::::::::::.:::::::::::::::::.::.:::::.::::::::::.::.:.::
CCDS19 WMDADLACQKRPSGKLVSVLSGAEGSFVSSLVRSISNSYSYIWIGLHDPTQGSEPDGDGW
               70        80        90       100       110       120
              130       140       150       160       170
pF1KB6 EWSSSDVMNYFAWERNPSTISSPGHCASLSRSTAFLRWKDYNCNVRLPYVCKFTD
       ::::.:::::::::.::::: .::::.::::::.::.::::::...::::::: :
CCDS19 EWSSTDVMNYFAWEKNPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD
              130       140       150       160       170
>>CCDS1964.1 REG1A gene_id:5967|Hs108|chr2 (166 aa)
initn: 578 init1: 417 opt: 588 Z-score: 725.5 bits: 140.6 E(32554): 4.3e-34
Smith-Waterman score: 588; 50.0% identity (76.2% similar) in 172 aa overlap (5-175:1-166)
10 20 30 40 50 60
pF1KB6 MLPPMALPSVSWMLLSCLMLLSQVQGEEPQRELPSARIRCPKGSKAYGSHCYALFLSPKS
:: : .::.::::.::: ::.: : :::.::: ::.:..:: :.:: . . ..
CCDS19 MAQTSSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCYYFNEDRET
10 20 30 40 50
70 80 90 100 110 120
pF1KB6 WTDADLACQKRPSGNLVSVLSGAEGSFVSSLVKSIGNSYSYVWIGLHDPTQGTEPNGEGW
:.:::: ::. ::::::::. :::.::.::.: :.. :::::::: .. . :
CCDS19 WVDADLYCQNMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIGLHDPKKNRR-----W
60 70 80 90 100 110
130 140 150 160 170
pF1KB6 EWSSSDVMNYFAWERN-PSTISSPGHCASLSRSTAFLRWKDYNCNVRLPYVCKFTD
.:::.....: .: . ::... ::.:.::. ::.: .::: :. .. .:::: .
CCDS19 HWSSGSLVSYKSWGIGAPSSVN-PGYCVSLTSSTGFQKWKDVPCEDKFSFVCKFKN
120 130 140 150 160
>>CCDS1963.1 REG1B gene_id:5968|Hs108|chr2 (166 aa)
initn: 556 init1: 397 opt: 564 Z-score: 696.2 bits: 135.2 E(32554): 1.9e-32
Smith-Waterman score: 564; 46.2% identity (74.9% similar) in 171 aa overlap (5-175:1-166)
10 20 30 40 50 60
pF1KB6 MLPPMALPSVSWMLLSCLMLLSQVQGEEPQRELPSARIRCPKGSKAYGSHCYALFLSPKS
:: . .::.: ::.:: ::.: : :::. :: ::.:..:: :.:: . .:..
CCDS19 MAQTNSFFMLISSLMFLSLSQGQESQTELPNPRISCPEGTNAYRSYCYYFNEDPET
10 20 30 40 50
70 80 90 100 110 120
pF1KB6 WTDADLACQKRPSGNLVSVLSGAEGSFVSSLVKSIGNSYSYVWIGLHDPTQGTEPNGEGW
:.:::: ::. ::::::::. :::.::.::.: ... : :::::::: .. . :
CCDS19 WVDADLYCQNMNSGNLVSVLTQAEGAFVASLIKESSTDDSNVWIGLHDPKKNRR-----W
60 70 80 90 100 110
130 140 150 160 170
pF1KB6 EWSSSDVMNYFAWERNPSTISSPGHCASLSRSTAFLRWKDYNCNVRLPYVCKFTD
.:::.....: .:. . . .. :.::::. ..: .::: .:. .. .:::: .
CCDS19 HWSSGSLVSYKSWDTGSPSSANAGYCASLTSCSGFKKWKDESCEKKFSFVCKFKN
120 130 140 150 160
>>CCDS58714.1 REG3G gene_id:130120|Hs108|chr2 (129 aa)
initn: 808 init1: 401 opt: 403 Z-score: 501.1 bits: 98.8 E(32554): 1.4e-21
Smith-Waterman score: 708; 61.1% identity (69.1% similar) in 175 aa overlap (1-175:1-129)
               10        20        30        40        50        60
pF1KB6 MLPPMALPSVSWMLLSCLMLLSQVQGEEPQRELPSARIRCPKGSKAYGSHCYALFLSPKS
       ::::::::::::::::::.:: :::::: :.:::: :: :::::::::: ::::::::::
CCDS58 MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB6 WTDADLACQKRPSGNLVSVLSGAEGSFVSSLVKSIGNSYSYVWIGLHDPTQGTEPNGEGW
       : :::                                              :.::.:.::
CCDS58 WMDAD----------------------------------------------GSEPDGDGW
                                                             70
              130       140       150       160       170
pF1KB6 EWSSSDVMNYFAWERNPSTISSPGHCASLSRSTAFLRWKDYNCNVRLPYVCKFTD
       ::::.:::::::::.::::: .::::.::::::.::.::::::...::::::: :
CCDS58 EWSSTDVMNYFAWEKNPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD
           80        90       100       110       120
>>CCDS906.1 REG4 gene_id:83998|Hs108|chr1 (158 aa)
initn: 254 init1: 125 opt: 310 Z-score: 386.5 bits: 77.9 E(32554): 3.3e-15
Smith-Waterman score: 310; 32.7% identity (61.7% similar) in 162 aa overlap (13-173:10-156)
10 20 30 40 50 60
pF1KB6 MLPPMALPSVSWMLLSCLMLLSQVQGEEPQRELPSARIRCPKGSKAYGSHCYALFLSPKS
.::::: . : :. .: :: : : . :.::. : . ..
CCDS90 MASRSMRLLLLLSCLAK-TGVLGDIIMR--PS----CAPGWFYHKSNCYGYFRKLRN
10 20 30 40 50
70 80 90 100 110
pF1KB6 WTDADLACQKRPSG-NLVSVLSGAEGSFVSSLVKSIGNSYSYVWIGLHDPTQGTEPNGEG
:.::.: ::. .: .:.:.:: :.: .. ... : .::::::: . .
CCDS90 WSDAELECQSYGNGAHLASILSLKEASTIAEYISGYQRSQP-IWIGLHDPQKRQQ-----
60 70 80 90 100
120 130 140 150 160 170
pF1KB6 WEWSSSDVMNYFAWERNPSTISSPGHCASLSRSTAFLRWKDYNCNVRLPYVCKFTD
:.: .. .. : .: . ..... ::: .: .. :: :.. .:: : ..::.
CCDS90 WQWIDGAMYLYRSW--SGKSMGGNKHCAEMSSNNNFLTWSSNECNKRQHFLCKYRP
110 120 130 140 150
>>CCDS81953.1 CLEC19A gene_id:728276|Hs108|chr16 (186 aa)
initn: 226 init1: 97 opt: 257 Z-score: 320.9 bits: 66.0 E(32554): 1.5e-11
Smith-Waterman score: 257; 34.0% identity (62.8% similar) in 156 aa overlap (29-172:32-179)
10 20 30 40 50
pF1KB6 MLPPMALPSVSWMLLSCLMLLSQVQGEEPQRELPSARIRCPKGSKAYGSHCYALFLSP
:. ::: :: . .::: .:
CCDS81 QRWTLWAAAFLTLHSAQAFPQTDISISPALPELPLPSL---CPLFWMEFKGHCYRFFPLN
10 20 30 40 50
60 70 80 90 100 110
pF1KB6 KSWTDADLACQK----RPSGNLVSVLSGAEGSFVSSLVKS-IGNSYSYVWIGLHDPTQGT
:.:..::: :.. : :..:.:. : :. :: .::.: . . . :: :::: :
CCDS81 KTWAEADLYCSEFSVGRKSAKLASIHSWEENVFVYDLVNSCVPGIPADVWTGLHDHRQ--
60 70 80 90 100 110
120 130 140 150 160
pF1KB6 EPNGEGWEWSSSDVMNYFAWE-RNPS--TISSPGH--CASL-SRSTAFLR-WKDYNCNVR
.:. .::.... ..: :. .:. . ..: . :... : :. :: :.: .:. .
CCDS81 --EGQ-FEWTDGSSYDYSYWDGSQPDDGVHADPEEEDCVQIWYRPTSALRSWNDNTCSRK
120 130 140 150 160 170
170
pF1KB6 LPYVCKFTD
.:.:::
CCDS81 FPFVCKIPSLTIH
180
175 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 00:28:31 2016 done: Sat Nov 5 00:28:32 2016
Total Scan time: 1.730 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]