FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE9521, 317 aa
1>>>pF1KE9521 317 - 317 aa - 317 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.7584+/-0.00109; mu= 13.3728+/- 0.064
mean_var=217.3367+/-89.325, 0's: 0 Z-trim(105.0): 173 B-trim: 1073 in 2/46
Lambda= 0.086998
statistics sampled from 7816 (8174) to 7816 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.603), E-opt: 0.2 (0.251), width: 16
Scan time: 2.430
The best scores are:                                 opt  bits E(32554)
CCDS56011.1 MC1R gene_id:4157|Hs108|chr16    ( 317) 2064 272.7 2.7e-73
CCDS11976.1 MC4R gene_id:4160|Hs108|chr18    ( 332)  982 136.9 2.1e-32
CCDS13449.2 MC3R gene_id:4159|Hs108|chr20    ( 323)  958 133.9 1.7e-31
CCDS11868.1 MC5R gene_id:4161|Hs108|chr18    ( 325)  921 129.2 4.2e-30
CCDS11869.1 MC2R gene_id:4158|Hs108|chr18    ( 297)  750 107.7 1.2e-23
>>CCDS56011.1 MC1R gene_id:4157|Hs108|chr16 (317 aa)
initn: 2064 init1: 2064 opt: 2064 Z-score: 1429.3 bits: 272.7 E(32554): 2.7e-73
Smith-Waterman score: 2064; 100.0% identity (100.0% similar) in 317 aa overlap (1-317:1-317)
               10        20        30        40        50        60
pF1KE9 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTGARCLEVSISDGLFLSLGLVSLVENALVV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTGARCLEVSISDGLFLSLGLVSLVENALVV
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE9 ATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAVLQQLDNVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 ATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAVLQQLDNVI
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE9 DVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVASVVFSTLFI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 DVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVASVVFSTLFI
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE9 AYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVHQGFGLKGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVHQGFGLKGA
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE9 VTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAIIDPLIYAF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 VTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAIIDPLIYAF
              250       260       270       280       290       300
              310
pF1KE9 HSQELRRTLKEVLTCSW
       :::::::::::::::::
CCDS56 HSQELRRTLKEVLTCSW
              310
>>CCDS11976.1 MC4R gene_id:4160|Hs108|chr18 (332 aa)
initn: 982 init1: 398 opt: 982 Z-score: 695.1 bits: 136.9 E(32554): 2.1e-32
Smith-Waterman score: 982; 51.2% identity (78.1% similar) in 297 aa overlap (20-315:27-319)
10 20 30 40 50
pF1KE9 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTGARCLE-VSISDGLFLSLGLVS
: .:: . .. : : : . .: .:..::..:
CCDS11 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGG--CYEQLFVSPEVFVTLGVIS
10 20 30 40 50
60 70 80 90 100 110
pF1KE9 LVENALVVATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAV
:.:: ::...::::.::::::: ::: ::..:.::: :: :: :: ::.. :.. .
CCDS11 LLENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTDTDAQSFT
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE9 LQQLDNVIDVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVAS
.. .::::: . :::.:.:.: : .::::::..:::::.::.:.:. :. .. ::.:
CCDS11 VN-IDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWAAC
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE9 VVFSTLFIAYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVH
.: . ::: : : ::..::...:..::.::: :::::. : : . :: : ..
CCDS11 TVSGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTG-AIR
180 190 200 210 220 230
240 250 260 270 280 290
pF1KE9 QGFGLKGAVTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAI
:: ..:::.:::::.:.: .::.:::::: . . ::..: : :....:::.: ::.::.:
CCDS11 QGANMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNSI
240 250 260 270 280 290
300 310
pF1KE9 IDPLIYAFHSQELRRTLKEVLTCSW
:::::::..:::::.:.::.. :
CCDS11 IDPLIYALRSQELRKTFKEIICCYPLGGLCDLSSRY
300 310 320 330
>>CCDS13449.2 MC3R gene_id:4159|Hs108|chr20 (323 aa)
initn: 940 init1: 584 opt: 958 Z-score: 679.0 bits: 133.9 E(32554): 1.7e-31
Smith-Waterman score: 958; 49.5% identity (77.6% similar) in 295 aa overlap (23-315:22-315)
10 20 30 40 50
pF1KE9 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTG-ARCLEVSISDGLFLSLGLVSLVENALV
: . .::.. : : .: :. .:::::.:::.:: ::
CCDS13 MNASCCLPSVQPTLPNGSEHLQAPFFSNQSSSAFCEQVFIKPEVFLSLGIVSLLENILV
10 20 30 40 50
60 70 80 90 100 110
pF1KE9 VATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAVLQQLDNV
. ....: ::::::: :.: ::..:.::: ::.::: .: .... :. . .:..::.
CCDS13 ILAVVRNGNLHSPMYFFLCSLAVADMLVSVSNALETIMIAIVHSDYLTFEDQFIQHMDNI
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE9 IDVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVASVVFSTLF
.: . : :...:.: : :::::::..::::::::::.:. .: ..:::: : ...:
CCDS13 FDSMICISLVASICNLLAIAVDRYVTIFYALRYHSIMTVRKALTLIVAIWVCCGVCGVVF
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE9 IAYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVHQGFG-LK
:.: . :..::...:.::..::..:::::. : :.. :: : . . : . .:
CCDS13 IVYSESKMVIVCLITMFFAMMLLMGTLYVHMFLFARLHVKRIAALPPADGVAPQQHSCMK
180 190 200 210 220 230
240 250 260 270 280 290
pF1KE9 GAVTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAIIDPLIY
::::.:::::.:..::.::::::.::. :: .: : : .:: .:.::.::..::::::
CCDS13 GAVTITILLGVFIFCWAPFFLHLVLIITCPTNPYCICYTAHFNTYLVLIMCNSVIDPLIY
240 250 260 270 280 290
300 310
pF1KE9 AFHSQELRRTLKEVLTCSW
::.: ::: :..:.: :
CCDS13 AFRSLELRNTFREIL-CGCNGMNLG
300 310 320
>>CCDS11868.1 MC5R gene_id:4161|Hs108|chr18 (325 aa)
initn: 904 init1: 563 opt: 921 Z-score: 653.8 bits: 129.2 E(32554): 4.2e-30
Smith-Waterman score: 921; 45.9% identity (76.9% similar) in 303 aa overlap (13-315:13-312)
10 20 30 40 50 60
pF1KE9 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTGARCLEVSISDGLFLSLGLVSLVENALVV
.::.: . : ... .. : ...:. .::.::..::.:: ::.
CCDS11 MNSSFHLHFLDLNLNATEGNLS--GPNVKNKSSPCEDMGIAVEVFLTLGVISLLENILVI
10 20 30 40 50
70 80 90 100 110 120
pF1KE9 ATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAVLQQLDNVI
..:.::.::::::: :.: ::..:.::: :.. :: .: ::. :: : ....:::.
CCDS11 GAIVKNKNLHSPMYFFVCSLAVADMLVSMSSAWETITIYLLNNKHLVIADAFVRHIDNVF
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE9 DVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVASVVFSTLFI
: . : :...:.: : :::::::..:::::::: :.: :. .:.::. . . .::
CCDS11 DSMICISVVASMCSLLAIAVDRYVTIFYALRYHHIMTARRSGAIIAGIWAFCTGCGIVFI
120 130 140 150 160 170
190 200 210 220 230 240
pF1KE9 AYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVHQGFGLKGA
: . . :.:::. .:.::: :.. ::.::. : :.. :: : . ..: ...::
CCDS11 LYSESTYVILCLISMFFAMLFLLVSLYIHMFLLARTHVKRIAALPGASS-ARQRTSMQGA
180 190 200 210 220 230
250 260 270 280 290 300
pF1KE9 VTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAIIDPLIYAF
::.:.:::.: .::.::::::::.. ::.. :. ....::..: ::.::...:::::::
CCDS11 VTVTMLLGVFTVCWAPFFLHLTLMLSCPQNLYCSRFMSHFNMYLILIMCNSVMDPLIYAF
240 250 260 270 280 290
310
pF1KE9 HSQELRRTLKEVLTCSW
.:::.:.:.::.. :
CCDS11 RSQEMRKTFKEIICCRGFRIACSFPRRD
300 310 320
>>CCDS11869.1 MC2R gene_id:4158|Hs108|chr18 (297 aa)
initn: 773 init1: 390 opt: 750 Z-score: 538.2 bits: 107.7 E(32554): 1.2e-23
Smith-Waterman score: 750; 40.7% identity (73.0% similar) in 285 aa overlap (35-317:21-297)
10 20 30 40 50 60
pF1KE9 GSQRRLLGSLNSTPTAIPQLGLAANQTGARCLEVSISDGLFLSLGLVSLVENALVVATIA
: .: . . .:.....:...:: .:. ..
CCDS11 MKHIINSYENINNTARNNSDCPRVVLPEEIFFTISIVGVLENLIVLLAVF
10 20 30 40 50
70 80 90 100 110 120
pF1KE9 KNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAVLQQLDNVIDVIT
::.::..::: ::: ::.::.: : ..::. .:.: . : : :.. :..:: .
CCDS11 KNKNLQAPMYFFICSLAISDMLGSLYKILENILIILRNMGYLKPRGSFETTADDIIDSLF
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE9 CSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVASVVFSTLFIAYYD
:.:.:. :..::.::::.::.:::::::::. :. ....::. . . .. .
CCDS11 VLSLLGSIFSLSVIAADRYITIFHALRYHSIVTMRRTVVVLTVIWTFCTGTGITMVIFSH
120 130 140 150 160 170
190 200 210 220 230 240
pF1KE9 HVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVHQGFGLKGAVTLT
:: ... .. .: :::.. :::::. : .:.. :. : . . .:::.:::
CCDS11 HVPTVITFTSLFPLMLVFILCLYVHMFLLARSHTRKISTLPRAN--------MKGAITLT
180 190 200 210 220
250 260 270 280 290 300
pF1KE9 ILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAIIDPLIYAFHSQE
::::.:..::.:: ::. :...:: .: :.: .. :.. ::.:::.:::.::::.: :
CCDS11 ILLGVFIFCWAPFVLHVLLMTFCPSNPYCACYMSLFQVNGMLIMCNAVIDPFIYAFRSPE
230 240 250 260 270 280
310
pF1KE9 LRRTLKEVLTCS--W
:: ..:... :: :
CCDS11 LRDAFKKMIFCSRYW
290
317 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 02:17:49 2016 done: Tue Nov 8 02:17:50 2016
Total Scan time: 2.430 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]