FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE9435, 617 aa 1>>>pF1KE9435 617 - 617 aa - 617 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.8880+/-0.0013; mu= -1.4629+/- 0.078 mean_var=288.0468+/-64.396, 0's: 0 Z-trim(109.9): 257 B-trim: 1249 in 2/49 Lambda= 0.075569 statistics sampled from 10881 (11238) to 10881 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.688), E-opt: 0.2 (0.345), width: 16 Scan time: 3.740 The best scores are: opt bits E(32554) CCDS44012.1 GPR50 gene_id:9248|Hs108|chrX ( 617) 4186 470.5 2.9e-132 CCDS3848.1 MTNR1A gene_id:4543|Hs108|chr4 ( 350) 1153 139.6 6.5e-33 CCDS8290.1 MTNR1B gene_id:4544|Hs108|chr11 ( 362) 1106 134.5 2.3e-31 >>CCDS44012.1 GPR50 gene_id:9248|Hs108|chrX (617 aa) initn: 4186 init1: 4186 opt: 4186 Z-score: 2488.1 bits: 470.5 E(32554): 2.9e-132 Smith-Waterman score: 4186; 99.8% identity (100.0% similar) in 617 aa overlap (1-617:1-617) 10 20 30 40 50 60 pF1KE9 MGPTLAVPTPYGCIGCKLPQPEYPPALIIFMFCAMVITIVVDLIGNSMVILAVTKNKKLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MGPTLAVPTPYGCIGCKLPQPEYPPALIIFMFCAMVITIVVDLIGNSMVILAVTKNKKLR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE9 NSGNIFVVSLSVADMLVAIYPYPLMLHAMSIGGWDLSQLQCQMVGFITGLSVVGSIFNIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 NSGNIFVVSLSVADMLVAIYPYPLMLHAMSIGGWDLSQLQCQMVGFITGLSVVGSIFNIV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE9 AIAINRYCYICHSLQYERIFSVRNTCIYLVITWIMTVLAVLPNMYIGTIEYDPRTYTCIF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 AIAINRYCYICHSLQYERIFSVRNTCIYLVITWIMTVLAVLPNMYIGTIEYDPRTYTCIF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE9 NYLNNPVFTVTIVCIHFVLPLLIVGFCYVRIWTKVLAARDPAGQNPDNQLAEVRNFLTMF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 NYLNNPVFTVTIVCIHFVLPLLIVGFCYVRIWTKVLAARDPAGQNPDNQLAEVRNFLTMF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE9 VIFLLFAVCWCPINVLTVLVAVSPKEMAGKIPNWLYLAAYFIAYFNSCLNAVIYGLLNEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 VIFLLFAVCWCPINVLTVLVAVSPKEMAGKIPNWLYLAAYFIAYFNSCLNAVIYGLLNEN 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE9 FRREYWTIFHAMRHPIIFFSGLISDIREMQEARTLARARAHARDQAREQDRAHACPAVEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 FRREYWTIFHAMRHPIIFFSGLISDIREMQEARTLARARAHARDQAREQDRAHACPAVEE 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE9 TPMNVRNVPLPGDAAAGHPDRASGHPKPHSRSSSAYRKSASTHHKSVFSHSKAASGHLKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 TPMNVRNVPLPGDAAAGHPDRASGHPKPHSRSSSAYRKSASTHHKSVFSHSKAASGHLKP 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE9 VSGHSKPASGHPKSATVYPKPASVHFKADSVHFKGDSVHFKPDSVHFKPASSNPKPITGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 VSGHSKPASGHPKSATVYPKPASVHFKADSVHFKGDSVHFKPDSVHFKPASSNPKPITGH 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE9 HVSAGSHSKSAFSAATSHPKPTTGHIKPATSHAEPTTADYPKPATTSHPKPTAADNPELS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 HVSAGSHSKSAFSAATSHPKPTTGHIKPATSHAEPTTADYPKPATTSHPKPTAADNPELS 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE9 ASHCPEIPAIAHPVSDDSDLPESASSPAAGPTKPAASQLESDTIADLPDPTVVTTSTNDY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 ASHCPEIPAIAHPVSDDSDLPESASSPAAGPTKPAASQLESDTIADLPDPTVVTTSTNDY 550 560 570 580 590 600 610 pF1KE9 HDVVVVDVEDDPDEMAV :::::.::::::::::: CCDS44 HDVVVIDVEDDPDEMAV 610 >>CCDS3848.1 MTNR1A gene_id:4543|Hs108|chr4 (350 aa) initn: 1145 init1: 791 opt: 1153 Z-score: 704.4 bits: 139.6 E(32554): 6.5e-33 Smith-Waterman score: 1153; 51.5% identity (79.7% similar) in 305 aa overlap (24-326:23-327) 10 20 30 40 50 60 pF1KE9 MGPTLAVPTPYGCIGCKLPQPEYPPALIIFMFCAMVITIVVDLIGNSMVILAVTKNKKLR : : . :....:::::..:: .:::.: .::::: CCDS38 MQGNGSALPNASQPVLRGDGARPSWLASALACVLIFTIVVDILGNLLVILSVYRNKKLR 10 20 30 40 50 70 80 90 100 110 120 pF1KE9 NSGNIFVVSLSVADMLVAIYPYPLMLHAMSIGGWDLSQLQCQMVGFITGLSVVGSIFNIV :.::::::::.:::..::::::::.: .. .::.:. :.::. ::. ::::.::::::. CCDS38 NAGNIFVVSLAVADLVVAIYPYPLVLMSIFNNGWNLGYLHCQVSGFLMGLSVIGSIFNIT 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE9 AIAINRYCYICHSLQYERIFSVRNTCIYLVITWIMTVLAVLPNMYIGTIEYDPRTYTCIF .:::::::::::::.:....: .:. :... :..:. :::::. ::..:::: :.: : CCDS38 GIAINRYCYICHSLKYDKLYSSKNSLCYVLLIWLLTLAAVLPNLRAGTLQYDPRIYSCTF 120 130 140 150 160 170 190 200 210 220 230 pF1KE9 NYLNNPVFTVTIVCIHFVLPLLIVGFCYVRIWTKVLAARD--PAGQNPDNQLAEVRNFLT . ..:...: .::..:..:: :::.::: :: .:. ..: . . :::.: CCDS38 AQSVSSAYTIAVVVFHFLVPMIIVIFCYLRIWILVLQVRQRVKPDRKPKLKPQDFRNFVT 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE9 MFVIFLLFAVCWCPINVLTVLVAVSPKEMAGKIPNWLYLAAYFIAYFNSCLNAVIYGLLN :::.:.:::.:: :.: . . :: .: :. .::.::..:.:..:::::::::.:::::: CCDS38 MFVVFVLFAICWAPLNFIGLAVASDPASMVPRIPEWLFVASYYMAYFNSCLNAIIYGLLN 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE9 ENFRREYWTIFHAMRHPIIFFSGLISDIREMQEARTLARARAHARDQAREQDRAHACPAV .:::.:: :. .. .:: .:. CCDS38 QNFRKEYRRIIVSLCTARVFFVDSSNDVADRVKWKPSPLMTNNNVVKVDSV 300 310 320 330 340 350 >>CCDS8290.1 MTNR1B gene_id:4544|Hs108|chr11 (362 aa) initn: 1092 init1: 749 opt: 1106 Z-score: 676.5 bits: 134.5 E(32554): 2.3e-31 Smith-Waterman score: 1106; 48.3% identity (78.7% similar) in 315 aa overlap (6-315:17-329) 10 20 30 40 pF1KE9 MGPTLAVPTPYGCIGCKLP-QPEYPPALIIFMFCAMVITIVVDLIGNSM :: .. : : . :: . . ....: .::..:: . CCDS82 MSENGSFANCCEAGGWAVRPGWSGAGSARPSRTPRPPWVAPALSAVLIVTTAVDVVGNLL 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE9 VILAVTKNKKLRNSGNIFVVSLSVADMLVAIYPYPLMLHAMSIGGWDLSQLQCQMVGFIT :::.: .:.::::.::.:.:::..::..::.:::::.: :. :: :.. .:. .:. CCDS82 VILSVLRNRKLRNAGNLFLVSLALADLVVAFYPYPLILVAIFYDGWALGEEHCKASAFVM 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE9 GLSVVGSIFNIVAIAINRYCYICHSLQYERIFSVRNTCIYLVITWIMTVLAVLPNMYIGT ::::.::.:::.:::::::::::::. :.::. .: ... . :..::.:.:::...:. CCDS82 GLSVIGSVFNITAIAINRYCYICHSMAYHRIYRRWHTPLHICLIWLLTVVALLPNFFVGS 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE9 IEYDPRTYTCIFNYLNNPVFTVTIVCIHFVLPLLIVGFCYVRIWTKVLAARDPAGQNPDN .::::: :.: : . .:...: :::.::. .:.:::.:::. :: :: : .:.. CCDS82 LEYDPRIYSCTFIQTASTQYTAAVVVIHFLLPIAVVSFCYLRIWVLVLQARRKA--KPES 190 200 210 220 230 230 240 250 260 270 280 pF1KE9 QL----AEVRNFLTMFVIFLLFAVCWCPINVLTVLVAVSPKEMAGKIPNWLYLAAYFIAY .: ...:.::::::.:..::.:: :.: . . ::..:.::: .::. :....:..:: CCDS82 RLCLKPSDLRSFLTMFVVFVIFAICWAPLNCIGLAVAINPQEMAPQIPEGLFVTSYLLAY 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE9 FNSCLNAVIYGLLNENFRREYWTIFHAMRHPIIFFSGLISDIREMQEARTLARARAHARD :::::::..:::::.:::::: :. :. .: CCDS82 FNSCLNAIVYGLLNQNFRREYKRILLALWNPRHCIQDASKGSHAEGLQSPAPPIIGVQHQ 300 310 320 330 340 350 350 360 370 380 390 400 pF1KE9 QAREQDRAHACPAVEETPMNVRNVPLPGDAAAGHPDRASGHPKPHSRSSSAYRKSASTHH CCDS82 ADAL 360 617 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 10:49:10 2016 done: Sun Nov 6 10:49:10 2016 Total Scan time: 3.740 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]