FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB6829, 166 aa
1>>>pF1KB6829 166 - 166 aa - 166 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.3434+/-0.000719; mu= 11.6890+/- 0.043
mean_var=57.2458+/-11.482, 0's: 0 Z-trim(108.9): 120 B-trim: 304 in 1/50
Lambda= 0.169513
statistics sampled from 10378 (10503) to 10378 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.716), E-opt: 0.2 (0.323), width: 16
Scan time: 1.660
The best scores are:                                    opt  bits E(32554)
CCDS1964.1 REG1A gene_id:5967|Hs108|chr2        ( 166) 1165 292.6 7.5e-80
CCDS1963.1 REG1B gene_id:5968|Hs108|chr2        ( 166) 1006 253.7 3.8e-68
CCDS1965.1 REG3A gene_id:5068|Hs108|chr2        ( 175)  588 151.5 2.4e-37
CCDS1962.1 REG3G gene_id:130120|Hs108|chr2      ( 175)  582 150.0 6.6e-37
CCDS906.1 REG4 gene_id:83998|Hs108|chr1         ( 158)  317  85.2 1.9e-17
CCDS56087.1 CLEC17A gene_id:388512|Hs108|chr19  ( 378)  268  73.3 1.7e-13
CCDS32782.1 COLEC12 gene_id:81035|Hs108|chr18   ( 742)  264  72.4 6.2e-13
CCDS53971.1 ACAN gene_id:176|Hs108|chr15        (2431)  264  72.6 1.8e-12
CCDS53970.1 ACAN gene_id:176|Hs108|chr15        (2530)  264  72.6 1.9e-12
CCDS12184.1 FCER2 gene_id:2208|Hs108|chr19      ( 321)  252  69.4 2.2e-12
CCDS11634.1 MRC2 gene_id:9902|Hs108|chr17       (1479)  258  71.1 3.2e-12
CCDS12397.1 NCAN gene_id:1463|Hs108|chr19       (1321)  253  69.8 6.8e-12
CCDS31739.1 CLEC6A gene_id:93978|Hs108|chr12    ( 209)  230  64.0 6.3e-11
CCDS7123.2 MRC1 gene_id:4360|Hs108|chr10        (1456)  239  66.4   8e-11
CCDS56017.1 ASGR1 gene_id:432|Hs108|chr17       ( 252)  229  63.7 8.9e-11
CCDS47242.1 VCAN gene_id:1462|Hs108|chr5        ( 655)  234  65.1   9e-11
CCDS11088.1 ASGR2 gene_id:433|Hs108|chr17       ( 287)  229  63.8   1e-10
>>CCDS1964.1 REG1A gene_id:5967|Hs108|chr2 (166 aa)
initn: 1165 init1: 1165 opt: 1165 Z-score: 1547.0 bits: 292.6 E(32554): 7.5e-80
Smith-Waterman score: 1165; 100.0% identity (100.0% similar) in 166 aa overlap (1-166:1-166)
               10        20        30        40        50        60
pF1KB6 MAQTSSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCYYFNEDRETWVDA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 MAQTSSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCYYFNEDRETWVDA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 DLYCQNMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIGLHDPKKNRRWHWSSGSLVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 DLYCQNMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIGLHDPKKNRRWHWSSGSLVS
               70        80        90       100       110       120

              130       140       150       160
pF1KB6 YKSWGIGAPSSVNPGYCVSLTSSTGFQKWKDVPCEDKFSFVCKFKN
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS19 YKSWGIGAPSSVNPGYCVSLTSSTGFQKWKDVPCEDKFSFVCKFKN
              130       140       150       160
>>CCDS1963.1 REG1B gene_id:5968|Hs108|chr2 (166 aa)
initn: 1006 init1: 1006 opt: 1006 Z-score: 1336.9 bits: 253.7 E(32554): 3.8e-68
Smith-Waterman score: 1006; 86.7% identity (92.8% similar) in 166 aa overlap (1-166:1-166)
               10        20        30        40        50        60
pF1KB6 MAQTSSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCYYFNEDRETWVDA
       ::::.:.::::: ::::: :::::.:::::. ::::::::::::::::::::: ::::::
CCDS19 MAQTNSFFMLISSLMFLSLSQGQESQTELPNPRISCPEGTNAYRSYCYYFNEDPETWVDA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 DLYCQNMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIGLHDPKKNRRWHWSSGSLVS
       :::::::::::::::::::::::::::::::.::: ::::::::::::::::::::::::
CCDS19 DLYCQNMNSGNLVSVLTQAEGAFVASLIKESSTDDSNVWIGLHDPKKNRRWHWSSGSLVS
               70        80        90       100       110       120

              130       140       150       160
pF1KB6 YKSWGIGAPSSVNPGYCVSLTSSTGFQKWKDVPCEDKFSFVCKFKN
       ::::  :.:::.: :::.:::: .::.::::  :: ::::::::::
CCDS19 YKSWDTGSPSSANAGYCASLTSCSGFKKWKDESCEKKFSFVCKFKN
              130       140       150       160
>>CCDS1965.1 REG3A gene_id:5068|Hs108|chr2 (175 aa)
initn: 578 init1: 417 opt: 588 Z-score: 784.0 bits: 151.5 E(32554): 2.4e-37
Smith-Waterman score: 588; 50.0% identity (75.6% similar) in 172 aa overlap (1-166:5-175)
10 20 30 40 50
pF1KB6 MAQTSSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCYYFNEDRET
:: : .::.::::.::: ::.: : :::.::: ::.:..:: :.:: . . ..
CCDS19 MLPPMALPSVSWMLLSCLMLLSQVQGEEPQRELPSARIRCPKGSKAYGSHCYALFLSPKS
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB6 WVDADLYCQNMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIGLHDPKKNRR-----W
:.:::: ::. ::::::::. :::.::.::.: :.. :::::::: .. . :
CCDS19 WTDADLACQKRPSGNLVSVLSGAEGSFVSSLVKSIGNSYSYVWIGLHDPTQGTEPNGEGW
70 80 90 100 110 120
120 130 140 150 160
pF1KB6 HWSSGSLVSYKSWGIGAPSSVN-PGYCVSLTSSTGFQKWKDVPCEDKFSFVCKFKN
.:::.....: .: ::... ::.:.::. ::.: .::: :. .. .:::: .
CCDS19 EWSSSDVMNYFAWE-RNPSTISSPGHCASLSRSTAFLRWKDYNCNVRLPYVCKFTD
130 140 150 160 170
>>CCDS1962.1 REG3G gene_id:130120|Hs108|chr2 (175 aa)
initn: 574 init1: 384 opt: 582 Z-score: 776.1 bits: 150.0 E(32554): 6.6e-37
Smith-Waterman score: 582; 48.5% identity (74.3% similar) in 171 aa overlap (1-166:5-175)
10 20 30 40 50
pF1KB6 MAQTSSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCYYFNEDRET
:: : .::.:::..: : ::.:.: :::. :::::.:..:: : :: . . ..
CCDS19 MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKS
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB6 WVDADLYCQNMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIGLHDPKKNRR-----W
:.:::: ::. ::.:::::. :::.::.::.. ... .::::::: .. . :
CCDS19 WMDADLACQKRPSGKLVSVLSGAEGSFVSSLVRSISNSYSYIWIGLHDPTQGSEPDGDGW
70 80 90 100 110 120
120 130 140 150 160
pF1KB6 HWSSGSLVSYKSWGIGAPSSVNPGYCVSLTSSTGFQKWKDVPCEDKFSFVCKFKN
.::: ....: .: . . .:::.: ::. :::: :::: :. :. .:::::.
CCDS19 EWSSTDVMNYFAWEKNPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD
130 140 150 160 170
>>CCDS906.1 REG4 gene_id:83998|Hs108|chr1 (158 aa)
initn: 238 init1: 218 opt: 317 Z-score: 426.6 bits: 85.2 E(32554): 1.9e-17
Smith-Waterman score: 317; 36.0% identity (67.6% similar) in 136 aa overlap (33-165:27-157)
10 20 30 40 50 60
pF1KB6 QTSSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCY-YFNEDRETWVDAD
: :: : ..: :: :: . :. : ::.
CCDS90 MASRSMRLLLLLSCLAKTGVLGDIIMRPSCAPGWFYHKSNCYGYFRKLRN-WSDAE
10 20 30 40 50
70 80 90 100 110
pF1KB6 LYCQNMNSG-NLVSVLTQAEGAFVASLIKESGTDDFN-VWIGLHDPKKNRRWHWSSGSLV
: ::....: .:.:.:. :.. .: : :: . . .:::::::.: ..:.: .:..
CCDS90 LECQSYGNGAHLASILSLKEASTIAEYI--SGYQRSQPIWIGLHDPQKRQQWQWIDGAMY
60 70 80 90 100 110
120 130 140 150 160
pF1KB6 SYKSWGIGAPSSVNPGYCVSLTSSTGFQKWKDVPCEDKFSFVCKFKN
:.::. : . : .:. ..:...: :.. :. . :.::..
CCDS90 LYRSWS-GKSMGGNK-HCAEMSSNNNFLTWSSNECNKRQHFLCKYRP
120 130 140 150
>>CCDS56087.1 CLEC17A gene_id:388512|Hs108|chr19 (378 aa)
initn: 128 init1: 128 opt: 268 Z-score: 355.7 bits: 73.3 E(32554): 1.7e-13
Smith-Waterman score: 268; 31.6% identity (61.7% similar) in 133 aa overlap (33-165:251-375)
10 20 30 40 50 60
pF1KB6 QTSSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCYYFNEDRETWVDADL
::.:::: ... ::::. . ..: .: .
CCDS56 GLAGLKHDIARVRADTNQSLVELWGLLDCRRITCPEGWLPFEGKCYYFSPSTKSWDEARM
230 240 250 260 270 280
70 80 90 100 110 120
pF1KB6 YCQNMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIGLHDPKKNRRWHWSSGSLVSYK
.::. : ..:: . . :: ::: : :. :.::.: .. :.: .:: :. .
CCDS56 FCQE-NYSHLVIINSFAEHNFVA---KAHGSPRV-YWLGLNDRAQEGDWRWLDGSPVTLS
290 300 310 320 330
130 140 150 160
pF1KB6 SWGIGAPSSVNPGYCVSLTSSTGFQKWKDVPCEDKFSFVCKFKN
: :.... :...... : :.:. : ..:. :
CCDS56 FWEPEEPNNIHDEDCATMNKG-G--TWNDLSCYKTTYWICERKCSC
340 350 360 370
>>CCDS32782.1 COLEC12 gene_id:81035|Hs108|chr18 (742 aa)
initn: 241 init1: 117 opt: 264 Z-score: 345.6 bits: 72.4 E(32554): 6.2e-13
Smith-Waterman score: 264; 30.8% identity (61.0% similar) in 146 aa overlap (23-163:595-731)
10 20 30 40 50
pF1KB6 MAQTSSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCYYFNE
:. : :. .:: . . . ::::.
CCDS32 PGLPGVPGMPGPKGPPGPPGPSGAVVPLALQNEPTPAPEDN-GCPPHWKNFTDKCYYFSV
570 580 590 600 610 620
60 70 80 90 100 110
pF1KB6 DRETWVDADLYCQNMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIGLHDPKKNRRWH
..: . :: :.:.. .:..:: . :. : . ::.. . . :::: : ... .:.
CCDS32 EKEIFEDAKLFCED-KSSHLVFINTREEQQW----IKKQMVGRESHWIGLTDSERENEWK
630 640 650 660 670
120 130 140 150 160
pF1KB6 WSSGSLVSYKSWGIGAPSSVNPGY-----CVSLTSSTGFQKWKDVPCEDKFSFVCKFKN
: .:. .::.: : :.. . :. :..: . : .:.: ::: .:.:.
CCDS32 WLDGTSPDYKNWKAGQPDNWGHGHGPGEDCAGLIYA-G--QWNDFQCEDVNNFICEKDRE
680 690 700 710 720 730
CCDS32 TVLSSAL
740
>>CCDS53971.1 ACAN gene_id:176|Hs108|chr15 (2431 aa)
initn: 225 init1: 118 opt: 264 Z-score: 337.2 bits: 72.6 E(32554): 1.8e-12
Smith-Waterman score: 264; 35.1% identity (64.1% similar) in 131 aa overlap (36-163:2282-2403)
10 20 30 40 50 60
pF1KB6 SYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCYYFNEDRETWVDADLYCQ
: :: : :...:: ::::::::. :.
CCDS53 ETATSPTDASIPASPEWKRESESTAADQEVCEEGWNKYQGHCYRHFPDRETWVDAERRCR
2260 2270 2280 2290 2300 2310
70 80 90 100 110 120
pF1KB6 NMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIGLHDPKKNRRWHWSSGSLVSYKSWG
...: .: :..: : :: .....:.. ::::.: . ..::.: .....:
CCDS53 EQQS-HLSSIVTPEEQEFV-----NNNAQDYQ-WIGLNDRTIEGDFRWSDGHPMQFENWR
2320 2330 2340 2350 2360
130 140 150 160
pF1KB6 IGAPSSV-NPGY-CVSLT-SSTGFQKWKDVPCEDKFSFVCKFKN
. :.. : :: . : .:.::::. .. :.::
CCDS53 PNQPDNFFAAGEDCVVMIWHEKG--EWNDVPCNYHLPFTCKKGTATTYKRRLQKRSSRHP
2370 2380 2390 2400 2410 2420
CCDS53 RRSRPSTAH
2430
>>CCDS53970.1 ACAN gene_id:176|Hs108|chr15 (2530 aa)
initn: 225 init1: 118 opt: 264 Z-score: 337.0 bits: 72.6 E(32554): 1.9e-12
Smith-Waterman score: 264; 35.1% identity (64.1% similar) in 131 aa overlap (36-163:2320-2441)
10 20 30 40 50 60
pF1KB6 SYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCYYFNEDRETWVDADLYCQ
: :: : :...:: ::::::::. :.
CCDS53 AGTCKETEGHVICLCPPGYTGEHCNIDQEVCEEGWNKYQGHCYRHFPDRETWVDAERRCR
2290 2300 2310 2320 2330 2340
70 80 90 100 110 120
pF1KB6 NMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIGLHDPKKNRRWHWSSGSLVSYKSWG
...: .: :..: : :: .....:.. ::::.: . ..::.: .....:
CCDS53 EQQS-HLSSIVTPEEQEFV-----NNNAQDYQ-WIGLNDRTIEGDFRWSDGHPMQFENWR
2350 2360 2370 2380 2390 2400
130 140 150 160
pF1KB6 IGAPSSV-NPGY-CVSLT-SSTGFQKWKDVPCEDKFSFVCKFKN
. :.. : :: . : .:.::::. .. :.::
CCDS53 PNQPDNFFAAGEDCVVMIWHEKG--EWNDVPCNYHLPFTCKKGTVACGEPPVVEHARTFG
2410 2420 2430 2440 2450 2460
CCDS53 QKKDRYEINSLVRYQCTEGFVQRHMPTIRCQPSGHWEEPQITCTDPTTYKRRLQKRSSRH
2470 2480 2490 2500 2510 2520
>>CCDS12184.1 FCER2 gene_id:2208|Hs108|chr19 (321 aa)
initn: 191 init1: 99 opt: 252 Z-score: 335.7 bits: 69.4 E(32554): 2.2e-12
Smith-Waterman score: 252; 32.3% identity (63.1% similar) in 130 aa overlap (35-162:162-282)
10 20 30 40 50 60
pF1KB6 SSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYRSYCYYFNEDRETWVDADLYC
.::: .. ::::.. . :: : :
CCDS12 NEASDLLERLREEVTKLRMELQVSSGFVCNTCPEKWINFQRKCYYFGKGTKQWVHARYAC
140 150 160 170 180 190
70 80 90 100 110 120
pF1KB6 QNMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIGLHDPKKNRRWHWSSGSLVSYKSW
..:. :.:::. . : :... ...:. ::::.. . .. : .:: :.:..:
CCDS12 DDME-GQLVSIHSPEEQDFLTKHASHTGS-----WIGLRNLDLKGEFIWVDGSHVDYSNW
200 210 220 230 240
130 140 150 160
pF1KB6 GIGAPSSVNPGY-CVSLTSSTGFQKWKDVPCEDKF-SFVCKFKN
. : :.: . : :: . .: .:.:. :. :. ..::
CCDS12 APGEPTSRSQGEDCVMMRGS---GRWNDAFCDRKLGAWVCDRLATCTPPASEGSAESMGP
250 260 270 280 290 300
CCDS12 DSRPDPDGRLPTPSAPLHS
310 320
166 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 14:06:04 2016 done: Sat Nov 5 14:06:04 2016
Total Scan time: 1.660 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]