FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE9531, 295 aa
  1>>>pF1KE9531 295 - 295 aa - 295 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2316+/-0.000916; mu= 15.2046+/- 0.054
 mean_var=80.6557+/-22.688, 0's: 0 Z-trim(104.7): 155 B-trim: 1008 in 2/45
 Lambda= 0.142809
 statistics sampled from 7843 (8050) to 7843 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.612), E-opt: 0.2 (0.247), width: 16
 Scan time: 2.070

The best scores are:                               opt  bits E(32554)
CCDS7374.1 RGR gene_id:5995|Hs108|chr10    ( 295) 1973 416.5 1.2e-116
CCDS41543.1 RGR gene_id:5995|Hs108|chr10   ( 253) 1397 297.8 5.6e-81
CCDS3687.1 RRH gene_id:10692|Hs108|chr4    ( 337)  358  83.8 1.9e-16
CCDS7376.1 OPN4 gene_id:94233|Hs108|chr10  ( 478)  325  77.1 2.8e-14
CCDS31072.1 OPN3 gene_id:23596|Hs108|chr1  ( 402)  301  72.1 7.5e-13
CCDS4923.1 OPN5 gene_id:221391|Hs108|chr6  ( 354)  295  70.8 1.6e-12
CCDS3063.1 RHO gene_id:6010|Hs108|chr3     ( 348)  269  65.5 6.5e-11

>>CCDS7374.1 RGR gene_id:5995|Hs108|chr10                (295 aa)
 initn: 1973 init1: 1973 opt: 1973 Z-score: 2207.7 bits: 416.5 E(32554): 1.2e-116
Smith-Waterman score: 1973; 100.0% identity (100.0% similar) in 295 aa overlap (1-295:1-295)

               10        20        30        40        50        60
pF1KE9 MAETSALPTGFGELEVLAVGMVLLVEALSGLSLNTLTIFSFCKTPELRTPCHLLVLSLAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 MAETSALPTGFGELEVLAVGMVLLVEALSGLSLNTLTIFSFCKTPELRTPCHLLVLSLAL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 ADSGISLNALVAATSSLLRVSHRRWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 ADSGISLNALVAATSSLLRVSHRRWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHH
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE9 YCTRSQLAWNSAVSLVLFVWLSSAFWAALPLLGWGHYDYEPLGTCCTLDYSKGDRNFTSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 YCTRSQLAWNSAVSLVLFVWLSSAFWAALPLLGWGHYDYEPLGTCCTLDYSKGDRNFTSF
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE9 LFTMSFFNFAMPLFITITSYSLMEQKLGKSGHLQVNTTLPARTLLLGWGPYAILYLYAVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 LFTMSFFNFAMPLFITITSYSLMEQKLGKSGHLQVNTTLPARTLLLGWGPYAILYLYAVI
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE9 ADVTSISPKLQMVPALIAKMVPTINAINYALGNEMVCRGIWQCLSPQKREKDRTK
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 ADVTSISPKLQMVPALIAKMVPTINAINYALGNEMVCRGIWQCLSPQKREKDRTK
              250       260       270       280       290

>>CCDS41543.1 RGR gene_id:5995|Hs108|chr10               (253 aa)
 initn: 946 init1: 946 opt: 1397 Z-score: 1567.3 bits: 297.8 E(32554): 5.6e-81
Smith-Waterman score: 1600; 85.8% identity (85.8% similar) in 295 aa overlap (1-295:1-253)

               10        20        30        40        50        60
pF1KE9 MAETSALPTGFGELEVLAVGMVLLVEALSGLSLNTLTIFSFCKTPELRTPCHLLVLSLAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MAETSALPTGFGELEVLAVGMVLLVEALSGLSLNTLTIFSFCKTPELRTPCHLLVLSLAL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 ADSGISLNALVAATSSLLRVSHRRWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHH
       ::::::::::::::::::    ::::::::::::::::::::::::::::::::::::::
CCDS41 ADSGISLNALVAATSSLL----RRWPYGSDGCQAHGFQGFVTALASICSSAAIAWGRYHH
               70            80        90       100       110

              130       140       150       160       170       180
pF1KE9 YCTRSQLAWNSAVSLVLFVWLSSAFWAALPLLGWGHYDYEPLGTCCTLDYSKGDRNFTSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 YCTRSQLAWNSAVSLVLFVWLSSAFWAALPLLGWGHYDYEPLGTCCTLDYSKGDRNFTSF
        120       130       140       150       160       170

              190       200       210       220       230       240
pF1KE9 LFTMSFFNFAMPLFITITSYSLMEQKLGKSGHLQVNTTLPARTLLLGWGPYAILYLYAVI
       :::::::::::::::::::::::::::::::::::
CCDS41 LFTMSFFNFAMPLFITITSYSLMEQKLGKSGHLQV-------------------------
        180       190       200       210

              250       260       270       280       290
pF1KE9 ADVTSISPKLQMVPALIAKMVPTINAINYALGNEMVCRGIWQCLSPQKREKDRTK
                    ::::::::::::::::::::::::::::::::::::::::::
CCDS41 -------------PALIAKMVPTINAINYALGNEMVCRGIWQCLSPQKREKDRTK
                          220       230       240       250

>>CCDS3687.1 RRH gene_id:10692|Hs108|chr4                (337 aa)
 initn: 307 init1: 183 opt: 358 Z-score:
408.7 bits: 83.8 E(32554): 1.9e-16 Smith-Waterman score: 427; 27.1% identity (59.9% similar) in 299 aa overlap (11-287:20-315) 10 20 30 40 50 pF1KE9 MAETSALPTGFGELEVLAVGMVLLVEALSGLSLNTLTIFSFCKTPELRTPC :.. : :. :.. .. .. : ... : : ::::: CCDS36 MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMAGMISIISNIIVLGIFIKYKELRTPT 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE9 HLLVLSLALADSGISLNALVAATSSLLRVSHRRWPYGSDGCQAHGFQGFVTALASICSSA . ....::..: :.: . ...: : : : .: :::... .. ..::: . CCDS36 NAIIINLAVTDIGVSSIGYPMSAASDLYGS---WKFGYAGCQVYAGLNIFFGMASIGLLT 70 80 90 100 110 120 130 140 150 160 pF1KE9 AIAWGRYHHYCTRS---QLAWNSAVSLVLFVWLSSAFWAALPLLGWGHYDYEPLGTCCTL ..: :: : . ... :. ..:.: .:... ::: .:..::. : .: :. ::. CCDS36 VVAVDRYLTICLPDVGRRMTTNTYIGLILGAWINGLFWALMPIIGWASYAPDPTGATCTI 120 130 140 150 160 170 170 180 190 200 210 pF1KE9 DYSKGDRNFTSFLFTMSFFNFAMPLFITITSY------------SLMEQKLGKSGHLQVN .. :.::.:.:. .:. .:: .:: . . : : ..:... :.. CCDS36 NWRKNDRSFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKHHTTSDCTESLNRDWSDQID 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE9 TTLPARTL----LLGWGPYAILYLYAVIADVTSISPKLQMVPALIAKMVPTINAINYALG .: . . :..:.::.:. :.: ..: .: : . .. :.:: : :... CCDS36 VTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMAIIAPLFAKSSTFYNPCIYVVA 240 250 260 270 280 290 280 290 pF1KE9 NEMVCRGI---WQCLSPQKREKDRTK :. :.. ..: . : CCDS36 NKKFRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLASGRI 300 310 320 330 >>CCDS7376.1 OPN4 gene_id:94233|Hs108|chr10 (478 aa) initn: 369 init1: 141 opt: 325 Z-score: 369.9 bits: 77.1 E(32554): 2.8e-14 Smith-Waterman score: 368; 29.1% identity (56.1% similar) in 285 aa overlap (19-271:73-352) 10 20 30 40 pF1KE9 MAETSALPTGFGELEVLAVGMVLLVEALSGLSLNTLTIFSFCKTPELR .: :.:. .:.:. : .:..::.. :: CCDS73 PSISPTAPGTWAAAWVPLPTVDVPDHAHYTLGTVILLVGLTGMLGNLTVIYTFCRSRSLR 50 60 70 80 90 100 50 60 70 80 90 100 pF1KE9 TPCHLLVLSLALADSGISLN-ALVAATSSLLRVSHRRWPYGSDGCQAHGFQGFVTALASI :: ......::..: .:.. : : :::: ...: .: ::. ..: : . ...:. 
CCDS73 TPANMFIINLAVSDFLMSFTQAPVFFTSSL----YKQWLFGETGCEFYAFCGALFGISSM 110 120 130 140 150 110 120 130 140 150 160 pF1KE9 CSSAAIAWGRYHHYCTRSQLAWNSA----VSLVLF-VWLSSAFWAALPLLGWGHYDYEPL . .::: :: :: ... : ...::. ::: . :. :..::. : : : CCDS73 ITLTAIALDRYL-VITRPLATFGVASKRRAAFVLLGVWLYALAWSLPPFFGWSAYVPEGL 160 170 180 190 200 210 170 180 190 200 210 pF1KE9 GTCCTLDYSKGDRNFTSFLFTMSFFNFAMPLFITITSYSLMEQKLGKSGH---------- : :. :: . .. . . : : .::.: : : .. . . ..:. CCDS73 LTSCSWDYMSFTPAVRAYTMLLCCFVFFLPLLIIIYCYIFIFRAIRETGRALQTFGACKG 220 230 240 250 260 270 220 230 240 250 pF1KE9 ----------LQVNTTLPARTLL------LGWGPYAILYLYAVIADVTSISPKLQMVPAL :: . . :: :.:.::. . : : . . ..: .. :::. CCDS73 NGESLWQRQRLQSECKMAKIMLLVILLFVLSWAPYSAVALVAFAGYAHVLTPYMSSVPAV 280 290 300 310 320 330 260 270 280 290 pF1KE9 IAKMVPTINAINYALGNEMVCRGIWQCLSPQKREKDRTK ::: : : ::. CCDS73 IAKASAIHNPIIYAITHPKYRVAIAQHLPCLGVLLGVSRRHSRPYPSYRSTHRSTLTSHT 340 350 360 370 380 390 >>CCDS31072.1 OPN3 gene_id:23596|Hs108|chr1 (402 aa) initn: 320 init1: 215 opt: 301 Z-score: 344.2 bits: 72.1 E(32554): 7.5e-13 Smith-Waterman score: 333; 26.1% identity (54.8% similar) in 303 aa overlap (12-288:39-336) 10 20 30 40 pF1KE9 MAETSALPTGFGELEVLAVGMVLLVEALSGLSLNTLTIFSF : : :: ..: .: :.. : :.. . CCDS31 GHGYWDGGGAAGAEGPAPAGTLSPAPLFSPGTYERLA--LLLGSIGLLGVGNNLLVLVLY 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE9 CKTPELRTPCHLLVLSLALADSGISLNALVAATSSLLRVSHRRWPYGSDGCQAHGFQGFV : .:::: :::.....:.: .:: ... . : :: . : . . :: ::.: . CCDS31 YKFQRLRTPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNG---WVWDTVGCVWDGFSGSL 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE9 TALASICSSAAIAWGRYHHYCTRSQLAWNSAVSLVLFVWLSSAFWAALPLLGWGHYDYEP ...:: . ...:. :: . . .. : . ..:: : ::. :::::..: . CCDS31 FGIVSIATLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWAGAPLLGWNRYILDV 130 140 150 160 170 180 170 180 190 200 210 pF1KE9 LGTCCTLDYSKGDRNFTSFLFTMSFFNFAMPLFITITSYS--LMEQKLGKSGH----LQV : ::.:... : : .::.. . . ...:: . :. :. .. . . 
.:: CCDS31 HGLGCTVDWKSKDANDSSFVLFLFLGCLVVPLGVIAHCYGHILYSIRMLRCVEDLQTIQV 190 200 210 220 230 240 220 230 240 250 260 pF1KE9 NTTLPAR------------TLLLGWGPYAILYLYAVIADVTSISPKLQMVPALIAKMVPT : . :.:. : :: .. . .: . ..: ...: :.:: . CCDS31 IKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLVTPTISIVSYLFAKSNTV 250 260 270 280 290 300 270 280 290 pF1KE9 INAINYALGN--------EMVCRGIWQCLSPQKREKDRTK : . :.. ...: . .: : : CCDS31 YNPVIYVFMIRKFRRSLLQLLCLRLLRCQRPAKDLPAAGSEMQIRPIVMSQKDGDRPKKK 310 320 330 340 350 360 CCDS31 VTFNSSSIIFIITSDESLSVDDSDKTNGSKVDVIQVRPL 370 380 390 400 >>CCDS4923.1 OPN5 gene_id:221391|Hs108|chr6 (354 aa) initn: 296 init1: 135 opt: 295 Z-score: 338.3 bits: 70.8 E(32554): 1.6e-12 Smith-Waterman score: 350; 26.6% identity (58.7% similar) in 286 aa overlap (17-277:34-315) 10 20 30 40 pF1KE9 MAETSALPTGFGELEVLAVGMVL-LVEALSGLSLNTLTIFSFCKTP :..:. : .. :: .. . . .: . CCDS49 NHTALPQDERLPHYLRDGDPFASKLSWEADLVAGFYLTIIGILSTFGNGYVLYMSSRRKK 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE9 ELRTPCHLLVLSLALADSGISLNALVAATSSLLRVSHRRWPYGSDGCQAHGFQGFVTALA .:: : ......::. : :::. :. ... .:: .: ::. .:. :: . . CCDS49 KLR-PAEIMTINLAVCDLGISV---VGKPFTIISCFCHRWVFGWIGCRWYGWAGFFFGCG 70 80 90 100 110 110 120 130 140 150 160 pF1KE9 SICSSAAIAWGRYHHYCTRSQLAW---NSAVSLVLFVWLSSAFWAALPLLGWGHYDYEPL :. . .:.. :: . : : .: . : . .: ..::...::.: : : ::. CCDS49 SLITMTAVSLDRYLKICYLSYGVWLKRKHAYICLAAIWAYASFWTTMPLVGLGDYVPEPF 120 130 140 150 160 170 170 180 190 200 210 pF1KE9 GTCCTLDY--SKGDRNFTSFLFTMSFFNFAMPLFITITSYSLMEQKLGKSGH-------- :: ::::. .... . :.... :: . .: . . :: . :. .:.. CCDS49 GTSCTLDWWLAQASVGGQVFILNILFFCLLLPTAVIVFSYVKIIAKVKSSSKEVAHFDSR 180 190 200 210 220 230 220 230 240 250 260 pF1KE9 ------LQVNTTLPARTL----LLGWGPYAILYLYAVIADVTSISPKLQMVPALIAKMVP :... : : . :..: :::.. ...... :: .:..::.:.:: . CCDS49 IHSSHVLEMKLTKVAMLICAGFLIAWIPYAVVSVWSAFGRPDSIPIQLSVVPTLLAKSAA 240 250 260 270 280 290 270 280 290 pF1KE9 TINAINY-ALGNEMVCRGIWQCLSPQKREKDRTK : : : .. 
...: CCDS49 MYNPIIYQVIDYKFACCQTGGLKATKKKSLEGFRLHTVTTVRKSSAVLEIHEEWE 300 310 320 330 340 350 >>CCDS3063.1 RHO gene_id:6010|Hs108|chr3 (348 aa) initn: 222 init1: 160 opt: 269 Z-score: 309.4 bits: 65.5 E(32554): 6.5e-11 Smith-Waterman score: 321; 27.4% identity (55.9% similar) in 281 aa overlap (13-274:36-311) 10 20 30 40 pF1KE9 MAETSALPTGFGELEVLAVGMVLLVEALSGLSLNTLTIFSFC .. .::. : ::. . :. .: ::.. CCDS30 GPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLI--VLGFPINFLTLYVTV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE9 KTPELRTPCHLLVLSLALADSGISLNALVAATSSLLRVSHRRWPYGSDGCQAHGFQGFVT . .:::: . ..:.::.:: . :... ::.: : . .: ::. .:: . . CCDS30 QHKKLRTPLNYILLNLAVADLFMVLGGF---TSTLYTSLHGYFVFGPTGCNLEGFFATLG 70 80 90 100 110 120 110 120 130 140 150 pF1KE9 ALASICSSAAIAWGRYHHYC---TRSQLAWNSAVSLVLFVWLSSAFWAALPLLGWGHYDY . .. : ...: :: : . ... : :. : :.:. . :: :: ::..: CCDS30 GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSRYIP 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE9 EPLGTCCTLDYS--KGDRNFTSFLFTMSFFNFAMPLFITITSYSLM--EQKLGKSGHLQV : : : .:: : . : ::.. : .:..:..: . :. . : . . . . CCDS30 EGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKEAAAQQQES 190 200 210 220 230 240 220 230 240 250 260 pF1KE9 NTTLPAR------------TLLLGWGPYAILYLYAVIADVTSISPKLQMVPALIAKMVPT :: :. ..:. : ::: . .: . ....: .. .::..:: . CCDS30 ATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAI 250 260 270 280 290 300 270 280 290 pF1KE9 INAINYALGNEMVCRGIWQCLSPQKREKDRTK : . : . :. CCDS30 YNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA 310 320 330 340 295 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:51:47 2016 done: Mon Nov 7 01:51:47 2016 Total Scan time: 2.070 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]