FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE9414, 361 aa
  1>>>pF1KE9414 361 - 361 aa - 361 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2547+/-0.0011; mu= 16.3856+/- 0.066
 mean_var=185.8110+/-86.600, 0's: 0 Z-trim(103.9): 194 B-trim: 1030 in 2/47
 Lambda= 0.094089
 statistics sampled from 7178 (7638) to 7178 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.593), E-opt: 0.2 (0.235), width: 16
 Scan time: 2.530

The best scores are:                                      opt bits E(32554)
CCDS30941.1 GPR52 gene_id:9293|Hs108|chr1          ( 361) 2484 350.7 1.2e-96
CCDS6849.1 GPR21 gene_id:2844|Hs108|chr9           ( 349) 1595 230.0 2.4e-60

>>CCDS30941.1 GPR52 gene_id:9293|Hs108|chr1 (361 aa)
 initn: 2484 init1: 2484 opt: 2484 Z-score: 1848.9 bits: 350.7 E(32554): 1.2e-96
Smith-Waterman score: 2484; 100.0% identity (100.0% similar) in 361 aa overlap (1-361:1-361)

               10        20        30        40        50        60
pF1KE9 MNESRWTEWRILNMSSGIVNVSERHSCPLGFGHYSVVDVCIFETVVIVLLTFLIIAGNLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 MNESRWTEWRILNMSSGIVNVSERHSCPLGFGHYSVVDVCIFETVVIVLLTFLIIAGNLT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 VIFVFHCAPLLHHYTTSYFIQTMAYADLFVGVSCLVPTLSLLHYSTGVHESLTCQVFGYI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 VIFVFHCAPLLHHYTTSYFIQTMAYADLFVGVSCLVPTLSLLHYSTGVHESLTCQVFGYI
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE9 ISVLKSVSMACLACISVDRYLAITKPLSYNQLVTPCRLRICIILIWIYSCLIFLPSFFGW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 ISVLKSVSMACLACISVDRYLAITKPLSYNQLVTPCRLRICIILIWIYSCLIFLPSFFGW
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE9 GKPGYHGDIFEWCATSWLTSAYFTGFIVCLLYAPAAFVVCFTYFHIFKICRQHTKEINDR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 GKPGYHGDIFEWCATSWLTSAYFTGFIVCLLYAPAAFVVCFTYFHIFKICRQHTKEINDR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE9 RARFPSHEVDSSRETGHSPDRRYAMVLFRITSVFYMLWLPYIIYFLLESSRVLDNPTLSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 RARFPSHEVDSSRETGHSPDRRYAMVLFRITSVFYMLWLPYIIYFLLESSRVLDNPTLSF
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE9 LTTWLAISNSFCNCVIYSLSNSVFRLGLRRLSETMCTSCMCVKDQEAQEPKPRKRANSCS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 LTTWLAISNSFCNCVIYSLSNSVFRLGLRRLSETMCTSCMCVKDQEAQEPKPRKRANSCS
              310       320       330       340       350       360

pF1KE9 I
       :
CCDS30 I

>>CCDS6849.1 GPR21 gene_id:2844|Hs108|chr9 (349 aa)
 initn: 1591 init1: 1140 opt: 1595 Z-score: 1196.8 bits: 230.0 E(32554): 2.4e-60
Smith-Waterman score: 1595; 67.5% identity (85.8% similar) in 345 aa overlap (20-361:8-349)

               10        20        30        40        50        60
pF1KE9 MNESRWTEWRILNMSSGIVNVSERHSCPLGFGHYSVVDVCIFETVVIVLLTFLIIAGNLT
                          : : .  : :.::.  .:. :..:...::.:: :::.::.
CCDS68             MNSTLDGNQSSHPFCLLAFGYLETVNFCLLEVLIIVFLTVLIISGNII
                           10        20        30        40

               70        80        90       100       110       120
pF1KE9 VIFVFHCAPLLHHYTTSYFIQTMAYADLFVGVSCLVPTLSLLHYSTGVHESLTCQVFGYI
       :::::::::::.:.::::::::::::::::::::.::.:::::.   :.::::::.::..
CCDS68 VIFVFHCAPLLNHHTTSYFIQTMAYADLFVGVSCVVPSLSLLHHPLPVEESLTCQIFGFV
       50        60        70        80        90       100

              130       140       150       160       170       180
pF1KE9 ISVLKSVSMACLACISVDRYLAITKPLSYNQLVTPCRLRICIILIWIYSCLIFLPSFFGW
       .::::::::: :::::.:::.::::::.:: :::: :::.::.:::.:: :.:::::: :
CCDS68 VSVLKSVSMASLACISIDRYIAITKPLTYNTLVTPWRLRLCIFLIWLYSTLVFLPSFFHW
      110       120       130       140       150       160

              190       200       210       220       230       240
pF1KE9 GKPGYHGDIFEWCATSWLTSAYFTGFIVCLLYAPAAFVVCFTYFHIFKICRQHTKEINDR
       ::::::::.:.::: :: :..::: ::: .::::::..::::::.::.::.::::.:..:
CCDS68 GKPGYHGDVFQWCAESWHTDSYFTLFIVMMLYAPAALIVCFTYFNIFRICQQHTKDISER
      170       180       190       200       210       220

              250       260       270       280       290       300
pF1KE9 RARFPSHEVDSSRETGHSPDRRYAMVLFRITSVFYMLWLPYIIYFLLESSRVLDNPTLSF
       .::: :.  ... :.   ::.::::::::::::::.::::::::::::::   .:   ::
CCDS68 QARFSSQSGETG-EVQACPDKRYAMVLFRITSVFYILWLPYIIYFLLESSTGHSNRFASF
      230       240        250       260       270       280

              310       320       330       340       350
pF1KE9 LTTWLAISNSFCNCVIYSLSNSVFRLGLRRLSETMCTSCMCVKDQEAQEP---KPRKRAN
       ::::::::::::::::::::::::. ::.::: .:::::  ...  :..:   . .   :
CCDS68 LTTWLAISNSFCNCVIYSLSNSVFQRGLKRLSGAMCTSC--ASQTTANDPYTVRSKGPLN
       290       300       310       320         330       340

       360
pF1KE9 SCSI
       .: :
CCDS68 GCHI

361 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 12:40:38 2016 done: Sun Nov 6 12:40:38 2016
 Total Scan time: 2.530 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]