FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE9506, 332 aa 1>>>pF1KE9506 332 - 332 aa - 332 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4070+/-0.00116; mu= 15.0502+/- 0.068 mean_var=229.3755+/-103.535, 0's: 0 Z-trim(102.2): 263 B-trim: 829 in 2/44 Lambda= 0.084684 statistics sampled from 6395 (6836) to 6395 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.541), E-opt: 0.2 (0.21), width: 16 Scan time: 2.410 The best scores are: opt bits E(32554) CCDS11976.1 MC4R gene_id:4160|Hs108|chr18 ( 332) 2182 280.7 1.1e-75 CCDS11868.1 MC5R gene_id:4161|Hs108|chr18 ( 325) 1318 175.2 6.5e-44 CCDS13449.2 MC3R gene_id:4159|Hs108|chr20 ( 323) 1273 169.7 2.9e-42 CCDS56011.1 MC1R gene_id:4157|Hs108|chr16 ( 317) 982 134.1 1.5e-31 CCDS11869.1 MC2R gene_id:4158|Hs108|chr18 ( 297) 934 128.2 8.3e-30 >>CCDS11976.1 MC4R gene_id:4160|Hs108|chr18 (332 aa) initn: 2182 init1: 2182 opt: 2182 Z-score: 1472.2 bits: 280.7 E(32554): 1.1e-75 Smith-Waterman score: 2182; 100.0% identity (100.0% similar) in 332 aa overlap (1-332:1-332) 10 20 30 40 50 60 pF1KE9 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGGCYEQLFVSPEVFVTLGVISLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGGCYEQLFVSPEVFVTLGVISLL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE9 ENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTDTDAQSFTVN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 ENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTDTDAQSFTVN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE9 IDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWAACTVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 IDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWAACTVS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE9 GILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTGAIRQGAN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTGAIRQGAN 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE9 MKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNSIIDPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNSIIDPL 250 260 270 280 290 300 310 320 330 pF1KE9 IYALRSQELRKTFKEIICCYPLGGLCDLSSRY :::::::::::::::::::::::::::::::: CCDS11 IYALRSQELRKTFKEIICCYPLGGLCDLSSRY 310 320 330 >>CCDS11868.1 MC5R gene_id:4161|Hs108|chr18 (325 aa) initn: 1297 init1: 1010 opt: 1318 Z-score: 901.8 bits: 175.2 E(32554): 6.5e-44 Smith-Waterman score: 1318; 62.2% identity (84.5% similar) in 328 aa overlap (9-331:1-324) 10 20 30 40 50 pF1KE9 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYS----DGGCYEQLFVSPEVFVTLGV :..:.:: :. ::.:. .: . .. : :.. .. :::.:::: CCDS11 MNSSFHL---HFLDLNLNATEGNLSGPNVKNKSSPC-EDMGIAVEVFLTLGV 10 20 30 40 60 70 80 90 100 110 pF1KE9 ISLLENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTD-TDAQ :::::::::: ::.::::::::::::.:::::::::::.:.. :::.: :::. . :. CCDS11 ISLLENILVIGAIVKNKNLHSPMYFFVCSLAVADMLVSMSSAWETITIYLLNNKHLVIAD 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE9 SFTVNIDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWA .:. .::::.::.:: :..::.::::.:::::: ::::::.::.:::..: : ::. ::: CCDS11 AFVRHIDNVFDSMICISVVASMCSLLAIAVDRYVTIFYALRYHHIMTARRSGAIIAGIWA 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE9 ACTVSGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTGAI :: ::.::.::.:. ::.:::.:::.:: :..:::.::::.:: :.::::.:::... CCDS11 FCTGCGIVFILYSESTYVILCLISMFFAMLFLLVSLYIHMFLLARTHVKRIAALPGASSA 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE9 RQGANMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNS :: ..:.::.:.:.:.:::.:::::::::: ...::::: :: ::::::.::::::::: CCDS11 RQRTSMQGAVTVTMLLGVFTVCWAPFFLHLTLMLSCPQNLYCSRFMSHFNMYLILIMCNS 230 240 250 260 270 280 300 310 320 330 pF1KE9 IIDPLIYALRSQELRKTFKEIICCYPLGGLCDLSSRY ..::::::.::::.:::::::::: . :.. : CCDS11 VMDPLIYAFRSQEMRKTFKEIICCRGFRIACSFPRRD 290 300 310 320 >>CCDS13449.2 MC3R gene_id:4159|Hs108|chr20 (323 aa) initn: 1267 init1: 514 opt: 1273 Z-score: 872.1 bits: 169.7 E(32554): 2.9e-42 Smith-Waterman score: 1273; 61.3% identity (85.8% similar) in 302 aa overlap (26-319:16-317) 10 20 30 40 50 pF1KE9 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGY----SDGGCYEQLFVSPEVFVTLGV :.:: : . :... ::.:..::::..::. CCDS13 MNASCCLPSVQPTLPNGSEHLQAPFFSNQSSSAFCEQVFIKPEVFLSLGI 10 20 30 40 50 60 70 80 90 100 110 pF1KE9 ISLLENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTD-TDAQ .:::::::::.:...: :::::::::.::::::::::::::. :::.:....: : . CCDS13 VSLLENILVILAVVRNGNLHSPMYFFLCSLAVADMLVSVSNALETIMIAIVHSDYLTFED 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE9 SFTVNIDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWA .: ..::..::.:: ::.::::.::.:::::: ::::::.::.::::... .: ::. CCDS13 QFIQHMDNIFDSMICISLVASICNLLAIAVDRYVTIFYALRYHSIMTVRKALTLIVAIWV 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE9 ACTVSGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTGAI : : :..::.::.:. ::.:::::::.:. ::..:::::::.::::.::::.:: . .. CCDS13 CCGVCGVVFIVYSESKMVIVCLITMFFAMMLLMGTLYVHMFLFARLHVKRIAALPPADGV 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE9 --RQGANMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMC .: . ::::.:.:::.:::. :::::::::.. :.:: ::::.:. .::: ::.:::: CCDS13 APQQHSCMKGAVTITILLGVFIFCWAPFFLHLVLIITCPTNPYCICYTAHFNTYLVLIMC 240 250 260 270 280 290 300 310 320 330 pF1KE9 NSIIDPLIYALRSQELRKTFKEIIC-CYPLGGLCDLSSRY ::.:::::::.:: :::.::.::.: : CCDS13 NSVIDPLIYAFRSLELRNTFREILCGCNGMNLG 300 310 320 >>CCDS56011.1 MC1R gene_id:4157|Hs108|chr16 (317 aa) initn: 982 init1: 398 opt: 982 Z-score: 680.1 bits: 134.1 E(32554): 1.5e-31 Smith-Waterman score: 982; 51.2% identity (78.1% similar) in 297 aa overlap (27-319:20-315) 10 20 30 40 50 pF1KE9 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGG--CYEQLFVSPEVFVTLGVIS : .:: . .. : : : . .: .:..::..: CCDS56 MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTGARCLE-VSISDGLFLSLGLVS 10 20 30 40 50 60 70 80 90 100 110 pF1KE9 LLENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTDTDAQSFT :.:: ::...::::.::::::: ::: ::..:.::: :: :: :: ::.. :.. . CCDS56 LVENALVVATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLEAGALVARAAV 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE9 VN-IDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWAAC .. .::::: . :::.:.:.: : .::::::..:::::.::.:.:. :. .. ::.: CCDS56 LQQLDNVIDVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPRARRAVAAIWVAS 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE9 TVSGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTG-AIR .: . ::: : : ::..::...:..::.::: :::::. : : . :: : .. CCDS56 VVFSTLFIAYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHAQGIARLHKRQRPVH 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE9 QGANMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNSI :: ..:::.:::::.:.: .::.:::::: . . ::..: : :....:::.: ::.::.: CCDS56 QGFGLKGAVTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCIFKNFNLFLALIICNAI 240 250 260 270 280 290 300 310 320 330 pF1KE9 IDPLIYALRSQELRKTFKEIICCYPLGGLCDLSSRY :::::::..:::::.:.::.. : CCDS56 IDPLIYAFHSQELRRTLKEVLTCSW 300 310 >>CCDS11869.1 MC2R gene_id:4158|Hs108|chr18 (297 aa) initn: 950 init1: 397 opt: 934 Z-score: 648.6 bits: 128.2 E(32554): 8.3e-30 Smith-Waterman score: 934; 47.5% identity (77.6% similar) in 295 aa overlap (26-319:6-293) 10 20 30 40 50 60 pF1KE9 MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGGCYEQLFVSPEVFVTLGVISLL :. :.... ... .. . :.: :......: CCDS11 MKHIINSYENINNTARNNSDCPRVVLPEEIFFTISIVGVL 10 20 30 40 70 80 90 100 110 pF1KE9 ENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETIVITLLNSTDTDAQ-SFTV ::..:..:. :::::..::::::::::..::: :. . :.:.: : : . :: . CCDS11 ENLIVLLAVFKNKNLQAPMYFFICSLAISDMLGSLYKILENILIILRNMGYLKPRGSFET 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE9 NIDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNIMTVKRVGIIISCIWAACTV . :..:::.. :::.:: :: ::.:::.:::.::.::.:.:..:. .... ::. :: CCDS11 TADDIIDSLFVLSLLGSIFSLSVIAADRYITIFHALRYHSIVTMRRTVVVLTVIWTFCTG 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE9 SGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLMARLHIKRIAVLPGTGAIRQGA .:: ..:.: ..: . ..: ::... :::::::.:: : ..:..:: : : CCDS11 TGITMVIFSHHVPTVITFTSLFPLMLVFILCLYVHMFLLARSHTRKISTLP-----R--A 170 180 190 200 210 240 250 260 270 280 290 pF1KE9 NMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPYCVCFMSHFNLYLILIMCNSIIDP :::::::::::.:::. ::::: ::.... ::.::::.:.:: :.. .:::::..::: CCDS11 NMKGAITLTILLGVFIFCWAPFVLHVLLMTFCPSNPYCACYMSLFQVNGMLIMCNAVIDP 220 230 240 250 260 270 300 310 320 330 pF1KE9 LIYALRSQELRKTFKEIICCYPLGGLCDLSSRY .:::.:: ::: .::..: : CCDS11 FIYAFRSPELRDAFKKMIFCSRYW 280 290 332 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 13:40:06 2016 done: Sun Nov 6 13:40:06 2016 Total Scan time: 2.410 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]