FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE9521, 317 aa 1>>>pF1KE9521 317 - 317 aa - 317 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.7584+/-0.00109; mu= 13.3728+/- 0.064 mean_var=217.3367+/-89.325, 0's: 0 Z-trim(105.0): 173 B-trim: 1073 in 2/46 Lambda= 0.086998 statistics sampled from 7816 (8174) to 7816 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.603), E-opt: 0.2 (0.251), width: 16 Scan time: 2.430 The best scores are: opt bits E(32554) CCDS56011.1 MC1R gene_id:4157|Hs108|chr16 ( 317) 2064 272.7 2.7e-73 CCDS11976.1 MC4R gene_id:4160|Hs108|chr18 ( 332) 982 136.9 2.1e-32 CCDS13449.2 MC3R gene_id:4159|Hs108|chr20 ( 323) 958 133.9 1.7e-31 CCDS11868.1 MC5R gene_id:4161|Hs108|chr18 ( 325) 921 129.2 4.2e-30 CCDS11869.1 MC2R gene_id:4158|Hs108|chr18 ( 297) 750 107.7 1.2e-23 >>CCDS56011.1 MC1R gene_id:4157|Hs108|chr16 (317 aa) initn: 2064 init1: 2064 opt: 2064 Z-score: 1429.3 bits: 272.7 E(32554): 2.7e-73 Smith-Waterman score: 2064; 100.0% identity (100.0% similar) in 317 aa overlap (1-317:1-317) 10 20 30 40 50 60 pF1KE9 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTGARCLEVSISDGLFLSLGLVSLVENALVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTGARCLEVSISDGLFLSLGLVSLVENALVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE9 ATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAVLQQLDNVI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 ATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAVLQQLDNVI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE9 DVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVASVVFSTLFI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 DVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVASVVFSTLFI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE9 AYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVHQGFGLKGA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 AYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVHQGFGLKGA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE9 VTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAIIDPLIYAF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 VTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAIIDPLIYAF 250 260 270 280 290 300 310 pF1KE9 HSQELRRTLKEVLTCSW ::::::::::::::::: CCDS56 HSQELRRTLKEVLTCSW 310 >>CCDS11976.1 MC4R gene_id:4160|Hs108|chr18 (332 aa) initn: 982 init1: 398 opt: 982 Z-score: 695.1 bits: 136.9 E(32554): 2.1e-32 Smith-Waterman score: 982; 51.2% identity (78.1% similar) in 297 aa overlap (20-315:27-319) 10 20 30 40 50 pF1KE9 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTGARCLE-VSISDGLFLSLGLVS : .:: . .. : : : . .: .:..::..: CCDS11 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGG--CYEQLFVSPEVFVTLGVIS 10 20 30 40 50 60 70 80 90 100 110 pF1KE9 LVENALVVATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAV :.:: ::...::::.::::::: ::: ::..:.::: :: :: :: ::.. :.. . CCDS11 LLENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTDTDAQSFT 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE9 LQQLDNVIDVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVAS .. .::::: . :::.:.:.: : .::::::..:::::.::.:.:. :. .. ::.: CCDS11 VN-IDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWAAC 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE9 VVFSTLFIAYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVH .: . ::: : : ::..::...:..::.::: :::::. : : . :: : .. CCDS11 TVSGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTG-AIR 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE9 QGFGLKGAVTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAI :: ..:::.:::::.:.: .::.:::::: . . ::..: : :....:::.: ::.::.: CCDS11 QGANMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNSI 240 250 260 270 280 290 300 310 pF1KE9 IDPLIYAFHSQELRRTLKEVLTCSW :::::::..:::::.:.::.. : CCDS11 IDPLIYALRSQELRKTFKEIICCYPLGGLCDLSSRY 300 310 320 330 >>CCDS13449.2 MC3R gene_id:4159|Hs108|chr20 (323 aa) initn: 940 init1: 584 opt: 958 Z-score: 679.0 bits: 133.9 E(32554): 1.7e-31 Smith-Waterman score: 958; 49.5% identity (77.6% similar) in 295 aa overlap (23-315:22-315) 10 20 30 40 50 pF1KE9 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTG-ARCLEVSISDGLFLSLGLVSLVENALV : . .::.. : : .: :. .:::::.:::.:: :: CCDS13 MNASCCLPSVQPTLPNGSEHLQAPFFSNQSSSAFCEQVFIKPEVFLSLGIVSLLENILV 10 20 30 40 50 60 70 80 90 100 110 pF1KE9 VATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAVLQQLDNV . ....: ::::::: :.: ::..:.::: ::.::: .: .... :. . .:..::. CCDS13 ILAVVRNGNLHSPMYFFLCSLAVADMLVSVSNALETIMIAIVHSDYLTFEDQFIQHMDNI 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE9 IDVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVASVVFSTLF .: . : :...:.: : :::::::..::::::::::.:. .: ..:::: : ...: CCDS13 FDSMICISLVASICNLLAIAVDRYVTIFYALRYHSIMTVRKALTLIVAIWVCCGVCGVVF 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE9 IAYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVHQGFG-LK :.: . :..::...:.::..::..:::::. : :.. :: : . . : . .: CCDS13 IVYSESKMVIVCLITMFFAMMLLMGTLYVHMFLFARLHVKRIAALPPADGVAPQQHSCMK 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE9 GAVTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAIIDPLIY ::::.:::::.:..::.::::::.::. :: .: : : .:: .:.::.::..:::::: CCDS13 GAVTITILLGVFIFCWAPFFLHLVLIITCPTNPYCICYTAHFNTYLVLIMCNSVIDPLIY 240 250 260 270 280 290 300 310 pF1KE9 AFHSQELRRTLKEVLTCSW ::.: ::: :..:.: : CCDS13 AFRSLELRNTFREIL-CGCNGMNLG 300 310 320 >>CCDS11868.1 MC5R gene_id:4161|Hs108|chr18 (325 aa) initn: 904 init1: 563 opt: 921 Z-score: 653.8 bits: 129.2 E(32554): 4.2e-30 Smith-Waterman score: 921; 45.9% identity (76.9% similar) in 303 aa overlap (13-315:13-312) 10 20 30 40 50 60 pF1KE9 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTGARCLEVSISDGLFLSLGLVSLVENALVV .::.: . : ... .. : ...:. .::.::..::.:: ::. CCDS11 MNSSFHLHFLDLNLNATEGNLS--GPNVKNKSSPCEDMGIAVEVFLTLGVISLLENILVI 10 20 30 40 50 70 80 90 100 110 120 pF1KE9 ATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAVLQQLDNVI ..:.::.::::::: :.: ::..:.::: :.. :: .: ::. :: : ....:::. CCDS11 GAIVKNKNLHSPMYFFVCSLAVADMLVSMSSAWETITIYLLNNKHLVIADAFVRHIDNVF 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE9 DVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVASVVFSTLFI : . : :...:.: : :::::::..:::::::: :.: :. .:.::. . . .:: CCDS11 DSMICISVVASMCSLLAIAVDRYVTIFYALRYHHIMTARRSGAIIAGIWAFCTGCGIVFI 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE9 AYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVHQGFGLKGA : . . :.:::. .:.::: :.. ::.::. : :.. :: : . ..: ...:: CCDS11 LYSESTYVILCLISMFFAMLFLLVSLYIHMFLLARTHVKRIAALPGASS-ARQRTSMQGA 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE9 VTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAIIDPLIYAF ::.:.:::.: .::.::::::::.. ::.. :. ....::..: ::.::...::::::: CCDS11 VTVTMLLGVFTVCWAPFFLHLTLMLSCPQNLYCSRFMSHFNMYLILIMCNSVMDPLIYAF 240 250 260 270 280 290 310 pF1KE9 HSQELRRTLKEVLTCSW .:::.:.:.::.. : CCDS11 RSQEMRKTFKEIICCRGFRIACSFPRRD 300 310 320 >>CCDS11869.1 MC2R gene_id:4158|Hs108|chr18 (297 aa) initn: 773 init1: 390 opt: 750 Z-score: 538.2 bits: 107.7 E(32554): 1.2e-23 Smith-Waterman score: 750; 40.7% identity (73.0% similar) in 285 aa overlap (35-317:21-297) 10 20 30 40 50 60 pF1KE9 GSQRRLLGSLNSTPTAIPQLGLAANQTGARCLEVSISDGLFLSLGLVSLVENALVVATIA : .: . . .:.....:...:: .:. .. CCDS11 MKHIINSYENINNTARNNSDCPRVVLPEEIFFTISIVGVLENLIVLLAVF 10 20 30 40 50 70 80 90 100 110 120 pF1KE9 KNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAVLQQLDNVIDVIT ::.::..::: ::: ::.::.: : ..::. .:.: . : : :.. :..:: . CCDS11 KNKNLQAPMYFFICSLAISDMLGSLYKILENILIILRNMGYLKPRGSFETTADDIIDSLF 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE9 CSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVASVVFSTLFIAYYD :.:.:. :..::.::::.::.:::::::::. :. ....::. . . .. . CCDS11 VLSLLGSIFSLSVIAADRYITIFHALRYHSIVTMRRTVVVLTVIWTFCTGTGITMVIFSH 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE9 HVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVHQGFGLKGAVTLT :: ... .. .: :::.. :::::. : .:.. :. : . . .:::.::: CCDS11 HVPTVITFTSLFPLMLVFILCLYVHMFLLARSHTRKISTLPRAN--------MKGAITLT 180 190 200 210 220 250 260 270 280 290 300 pF1KE9 ILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAIIDPLIYAFHSQE ::::.:..::.:: ::. :...:: .: :.: .. :.. ::.:::.:::.::::.: : CCDS11 ILLGVFIFCWAPFVLHVLLMTFCPSNPYCACYMSLFQVNGMLIMCNAVIDPFIYAFRSPE 230 240 250 260 270 280 310 pF1KE9 LRRTLKEVLTCS--W :: ..:... :: : CCDS11 LRDAFKKMIFCSRYW 290 317 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 02:17:49 2016 done: Tue Nov 8 02:17:50 2016 Total Scan time: 2.430 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]