FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE9506, 332 aa
1>>>pF1KE9506 332 - 332 aa - 332 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4070+/-0.00116; mu= 15.0502+/- 0.068
mean_var=229.3755+/-103.535, 0's: 0 Z-trim(102.2): 263 B-trim: 829 in 2/44
Lambda= 0.084684
statistics sampled from 6395 (6836) to 6395 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.541), E-opt: 0.2 (0.21), width: 16
Scan time: 2.410
The best scores are: opt bits E(32554)
CCDS11976.1 MC4R gene_id:4160|Hs108|chr18 ( 332) 2182 280.7 1.1e-75
CCDS11868.1 MC5R gene_id:4161|Hs108|chr18 ( 325) 1318 175.2 6.5e-44
CCDS13449.2 MC3R gene_id:4159|Hs108|chr20 ( 323) 1273 169.7 2.9e-42
CCDS56011.1 MC1R gene_id:4157|Hs108|chr16 ( 317) 982 134.1 1.5e-31
CCDS11869.1 MC2R gene_id:4158|Hs108|chr18 ( 297) 934 128.2 8.3e-30
>>CCDS11976.1 MC4R gene_id:4160|Hs108|chr18 (332 aa)
initn: 2182 init1: 2182 opt: 2182 Z-score: 1472.2 bits: 280.7 E(32554): 1.1e-75
Smith-Waterman score: 2182; 100.0% identity (100.0% similar) in 332 aa overlap (1-332:1-332)
10 20 30 40 50 60
pF1KE9 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGGCYEQLFVSPEVFVTLGVISLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGGCYEQLFVSPEVFVTLGVISLL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE9 ENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTDTDAQSFTVN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 ENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTDTDAQSFTVN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE9 IDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWAACTVS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 IDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWAACTVS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE9 GILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTGAIRQGAN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTGAIRQGAN
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE9 MKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNSIIDPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNSIIDPL
250 260 270 280 290 300
310 320 330
pF1KE9 IYALRSQELRKTFKEIICCYPLGGLCDLSSRY
::::::::::::::::::::::::::::::::
CCDS11 IYALRSQELRKTFKEIICCYPLGGLCDLSSRY
310 320 330
>>CCDS11868.1 MC5R gene_id:4161|Hs108|chr18 (325 aa)
initn: 1297 init1: 1010 opt: 1318 Z-score: 901.8 bits: 175.2 E(32554): 6.5e-44
Smith-Waterman score: 1318; 62.2% identity (84.5% similar) in 328 aa overlap (9-331:1-324)
10 20 30 40 50
pF1KE9 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYS----DGGCYEQLFVSPEVFVTLGV
:..:.:: :. ::.:. .: . .. : :.. .. :::.::::
CCDS11 MNSSFHL---HFLDLNLNATEGNLSGPNVKNKSSPC-EDMGIAVEVFLTLGV
10 20 30 40
60 70 80 90 100 110
pF1KE9 ISLLENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTD-TDAQ
:::::::::: ::.::::::::::::.:::::::::::.:.. :::.: :::. . :.
CCDS11 ISLLENILVIGAIVKNKNLHSPMYFFVCSLAVADMLVSMSSAWETITIYLLNNKHLVIAD
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE9 SFTVNIDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWA
.:. .::::.::.:: :..::.::::.:::::: ::::::.::.:::..: : ::. :::
CCDS11 AFVRHIDNVFDSMICISVVASMCSLLAIAVDRYVTIFYALRYHHIMTARRSGAIIAGIWA
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE9 ACTVSGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTGAI
:: ::.::.::.:. ::.:::.:::.:: :..:::.::::.:: :.::::.:::...
CCDS11 FCTGCGIVFILYSESTYVILCLISMFFAMLFLLVSLYIHMFLLARTHVKRIAALPGASSA
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE9 RQGANMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNS
:: ..:.::.:.:.:.:::.:::::::::: ...::::: :: ::::::.:::::::::
CCDS11 RQRTSMQGAVTVTMLLGVFTVCWAPFFLHLTLMLSCPQNLYCSRFMSHFNMYLILIMCNS
230 240 250 260 270 280
300 310 320 330
pF1KE9 IIDPLIYALRSQELRKTFKEIICCYPLGGLCDLSSRY
..::::::.::::.:::::::::: . :.. :
CCDS11 VMDPLIYAFRSQEMRKTFKEIICCRGFRIACSFPRRD
290 300 310 320
>>CCDS13449.2 MC3R gene_id:4159|Hs108|chr20 (323 aa)
initn: 1267 init1: 514 opt: 1273 Z-score: 872.1 bits: 169.7 E(32554): 2.9e-42
Smith-Waterman score: 1273; 61.3% identity (85.8% similar) in 302 aa overlap (26-319:16-317)
10 20 30 40 50
pF1KE9 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGY----SDGGCYEQLFVSPEVFVTLGV
:.:: : . :... ::.:..::::..::.
CCDS13 MNASCCLPSVQPTLPNGSEHLQAPFFSNQSSSAFCEQVFIKPEVFLSLGI
10 20 30 40 50
60 70 80 90 100 110
pF1KE9 ISLLENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTD-TDAQ
.:::::::::.:...: :::::::::.::::::::::::::. :::.:....: : .
CCDS13 VSLLENILVILAVVRNGNLHSPMYFFLCSLAVADMLVSVSNALETIMIAIVHSDYLTFED
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE9 SFTVNIDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWA
.: ..::..::.:: ::.::::.::.:::::: ::::::.::.::::... .: ::.
CCDS13 QFIQHMDNIFDSMICISLVASICNLLAIAVDRYVTIFYALRYHSIMTVRKALTLIVAIWV
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE9 ACTVSGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTGAI
: : :..::.::.:. ::.:::::::.:. ::..:::::::.::::.::::.:: . ..
CCDS13 CCGVCGVVFIVYSESKMVIVCLITMFFAMMLLMGTLYVHMFLFARLHVKRIAALPPADGV
180 190 200 210 220 230
240 250 260 270 280 290
pF1KE9 --RQGANMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMC
.: . ::::.:.:::.:::. :::::::::.. :.:: ::::.:. .::: ::.::::
CCDS13 APQQHSCMKGAVTITILLGVFIFCWAPFFLHLVLIITCPTNPYCICYTAHFNTYLVLIMC
240 250 260 270 280 290
300 310 320 330
pF1KE9 NSIIDPLIYALRSQELRKTFKEIIC-CYPLGGLCDLSSRY
::.:::::::.:: :::.::.::.: :
CCDS13 NSVIDPLIYAFRSLELRNTFREILCGCNGMNLG
300 310 320
>>CCDS56011.1 MC1R gene_id:4157|Hs108|chr16 (317 aa)
initn: 982 init1: 398 opt: 982 Z-score: 680.1 bits: 134.1 E(32554): 1.5e-31
Smith-Waterman score: 982; 51.2% identity (78.1% similar) in 297 aa overlap (27-319:20-315)
10 20 30 40 50
pF1KE9 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGG--CYEQLFVSPEVFVTLGVIS
: .:: . .. : : : . .: .:..::..:
CCDS56 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTGARCLE-VSISDGLFLSLGLVS
10 20 30 40 50
60 70 80 90 100 110
pF1KE9 LLENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTDTDAQSFT
:.:: ::...::::.::::::: ::: ::..:.::: :: :: :: ::.. :.. .
CCDS56 LVENALVVATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAV
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE9 VN-IDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWAAC
.. .::::: . :::.:.:.: : .::::::..:::::.::.:.:. :. .. ::.:
CCDS56 LQQLDNVIDVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVAS
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE9 TVSGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTG-AIR
.: . ::: : : ::..::...:..::.::: :::::. : : . :: : ..
CCDS56 VVFSTLFIAYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVH
180 190 200 210 220 230
240 250 260 270 280 290
pF1KE9 QGANMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNSI
:: ..:::.:::::.:.: .::.:::::: . . ::..: : :....:::.: ::.::.:
CCDS56 QGFGLKGAVTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAI
240 250 260 270 280 290
300 310 320 330
pF1KE9 IDPLIYALRSQELRKTFKEIICCYPLGGLCDLSSRY
:::::::..:::::.:.::.. :
CCDS56 IDPLIYAFHSQELRRTLKEVLTCSW
300 310
>>CCDS11869.1 MC2R gene_id:4158|Hs108|chr18 (297 aa)
initn: 950 init1: 397 opt: 934 Z-score: 648.6 bits: 128.2 E(32554): 8.3e-30
Smith-Waterman score: 934; 47.5% identity (77.6% similar) in 295 aa overlap (26-319:6-293)
10 20 30 40 50 60
pF1KE9 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGGCYEQLFVSPEVFVTLGVISLL
:. :.... ... .. . :.: :......:
CCDS11 MKHIINSYENINNTARNNSDCPRVVLPEEIFFTISIVGVL
10 20 30 40
70 80 90 100 110
pF1KE9 ENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTDTDAQ-SFTV
::..:..:. :::::..::::::::::..::: :. . :.:.: : : . :: .
CCDS11 ENLIVLLAVFKNKNLQAPMYFFICSLAISDMLGSLYKILENILIILRNMGYLKPRGSFET
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE9 NIDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWAACTV
. :..:::.. :::.:: :: ::.:::.:::.::.::.:.:..:. .... ::. ::
CCDS11 TADDIIDSLFVLSLLGSIFSLSVIAADRYITIFHALRYHSIVTMRRTVVVLTVIWTFCTG
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE9 SGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTGAIRQGA
.:: ..:.: ..: . ..: ::... :::::::.:: : ..:..:: : :
CCDS11 TGITMVIFSHHVPTVITFTSLFPLMLVFILCLYVHMFLLARSHTRKISTLP-----R--A
170 180 190 200 210
240 250 260 270 280 290
pF1KE9 NMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNSIIDP
:::::::::::.:::. ::::: ::.... ::.::::.:.:: :.. .:::::..:::
CCDS11 NMKGAITLTILLGVFIFCWAPFVLHVLLMTFCPSNPYCACYMSLFQVNGMLIMCNAVIDP
220 230 240 250 260 270
300 310 320 330
pF1KE9 LIYALRSQELRKTFKEIICCYPLGGLCDLSSRY
.:::.:: ::: .::..: :
CCDS11 FIYAFRSPELRDAFKKMIFCSRYW
280 290
332 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 13:40:06 2016 done: Sun Nov 6 13:40:06 2016
Total Scan time: 2.410 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]