FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE9440, 345 aa
  1>>>pF1KE9440 345 - 345 aa - 345 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2052+/-0.000888; mu= 17.4584+/- 0.053
 mean_var=77.0495+/-16.183, 0's: 0 Z-trim(106.7): 31 B-trim: 1124 in 2/47
 Lambda= 0.146113
 statistics sampled from 9096 (9116) to 9096 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.631), E-opt: 0.2 (0.28), width: 16
 Scan time: 2.610

The best scores are:                                      opt  bits E(32554)
CCDS8658.1 GPRC5D gene_id:55507|Hs108|chr12       ( 345) 2348 504.4 5.7e-143
CCDS8657.1 GPRC5A gene_id:9052|Hs108|chr12        ( 357)  957 211.2 1.1e-54
CCDS10581.1 GPRC5B gene_id:51704|Hs108|chr16      ( 403)  671 150.9 1.7e-36
CCDS42378.1 GPRC5C gene_id:55890|Hs108|chr17      ( 453)  369  87.3 2.6e-17
CCDS11699.1 GPRC5C gene_id:55890|Hs108|chr17      ( 486)  369  87.3 2.8e-17

>>CCDS8658.1 GPRC5D gene_id:55507|Hs108|chr12 (345 aa)
 initn: 2348 init1: 2348 opt: 2348 Z-score: 2680.3 bits: 504.4 E(32554): 5.7e-143
Smith-Waterman score: 2348; 100.0% identity (100.0% similar) in 345 aa overlap (1-345:1-345)

               10        20        30        40        50        60
pF1KE9 MYKDCIESTGDYFLLCDAEGPWGIILESLAILGIVVTILLLLAFLFLMRKIQDCSQWNVL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 MYKDCIESTGDYFLLCDAEGPWGIILESLAILGIVVTILLLLAFLFLMRKIQDCSQWNVL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 PTQLLFLLSVLGLFGLAFAFIIELNQQTAPVRYFLFGVLFALCFSCLLAHASNLVKLVRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 PTQLLFLLSVLGLFGLAFAFIIELNQQTAPVRYFLFGVLFALCFSCLLAHASNLVKLVRG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE9 CVSFSWTTILCIAIGCSLLQIIIATEYVTLIMTRGMMFVNMTPCQLNVDFVVLLVYVLFL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 CVSFSWTTILCIAIGCSLLQIIIATEYVTLIMTRGMMFVNMTPCQLNVDFVVLLVYVLFL
              130       140       150       160       170       180

              190       200       210       220
230 240 pF1KE9 MALTFFVSKATFCGPCENWKQHGRLIFITVLFSIIIWVVWISMLLRGNPQFQRQPQWDDP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 MALTFFVSKATFCGPCENWKQHGRLIFITVLFSIIIWVVWISMLLRGNPQFQRQPQWDDP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE9 VVCIALVTNAWVFLLLYIVPELCILYRSCRQECPLQGNACPVTAYQHSFQVENQELSRAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS86 VVCIALVTNAWVFLLLYIVPELCILYRSCRQECPLQGNACPVTAYQHSFQVENQELSRAR 250 260 270 280 290 300 310 320 330 340 pF1KE9 DSDGAEEDVALTSYGTPIQPQTVDPTQECFIPQAKLSPQQDAGGV ::::::::::::::::::::::::::::::::::::::::::::: CCDS86 DSDGAEEDVALTSYGTPIQPQTVDPTQECFIPQAKLSPQQDAGGV 310 320 330 340 >>CCDS8657.1 GPRC5A gene_id:9052|Hs108|chr12 (357 aa) initn: 865 init1: 507 opt: 957 Z-score: 1095.4 bits: 211.2 E(32554): 1.1e-54 Smith-Waterman score: 957; 45.7% identity (74.2% similar) in 337 aa overlap (12-341:17-349) 10 20 30 40 50 pF1KE9 MYKDCIESTGDYFLLCDAEGPWGIILESLAILGIVVTILLLLAFLFLMRKIQDCS :. ::: :::.::..: :.:... ..:.. .:. :.:: . CCDS86 MATTVPDGCRNGLKSKYYRLCDKAEAWGIVLETVATAGVVTSVAFMLTLPILVCKVQDSN 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE9 QWNVLPTQLLFLLSVLGLFGLAFAFIIELNQQTAPVRYFLFGVLFALCFSCLLAHASNLV . ..::::.::::.:::.:::.::::: :. .:.:.:.::::.::..::::::::: .:. CCDS86 RRKMLPTQFLFLLGVLGIFGLTFAFIIGLDGSTGPTRFFLFGILFSICFSCLLAHAVSLT 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE9 KLVRGCVSFSWTTILCIAIGCSLLQIIIATEYVTLIMTRGMM--FVNMTPCQLNVDFVVL ::::: .: .:: .:.: ::.: .:: ::..: :.: . : ... . : :::.: CCDS86 KLVRGRKPLSLLVILGLAVGFSLVQDVIAIEYIVLTMNRTNVNVFSELSAPRRNEDFVLL 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE9 LVYVLFLMALTFFVSKATFCGPCENWKQHGRLIFITVLFSIIIWVVWISMLLRGNPQFQR :.::::::::::..:. :::: .::.:: :..:.:.:: :::.::..:. :.:.: CCDS86 LTYVLFLMALTFLMSSFTFCGSFTGWKRHGAHIYLTMLLSIAIWVAWITLLML--PDFDR 190 200 210 220 230 240 250 260 270 280 290 pF1KE9 QPQWDDPVVCIALVTNAWVFLLLYIVPELCILYRSCR-QECPLQGNACPVTAYQHSFQVE . ::: .. ::..:.::::: :. ::. .: .. .. :.. : ..:. 
:: CCDS86 R--WDDTILSSALAANGWVFLLAYVSPEFWLLTKQRNPMDYPVEDAFCKPQLVKKSYGVE 240 250 260 270 280 290 300 310 320 330 340 pF1KE9 NQELSRARDSDGAEE--DVALTSYGTPIQPQTVDPTQECFIPQAKL--SPQQDAGGV :. :. . ..: :: :. . :.: .: :. : .: ::.:. :: .: CCDS86 NRAYSQEEITQGFEETGDTLYAPYSTHFQLQNQPPQKEFSIPRAHAWPSPYKDYEVKKEG 300 310 320 330 340 350 CCDS86 S >>CCDS10581.1 GPRC5B gene_id:51704|Hs108|chr16 (403 aa) initn: 659 init1: 360 opt: 671 Z-score: 768.9 bits: 150.9 E(32554): 1.7e-36 Smith-Waterman score: 671; 35.2% identity (67.3% similar) in 324 aa overlap (3-317:35-351) 10 20 30 pF1KE9 MYKDC-IESTGDYFLLCDAEGPWGIILESLAI . : .. .: ::: .. :::..:..: CCDS10 SERKMRAHQVLTFLLLFVITSVASENASTSRGCGLDLLPQYVSLCDLDAIWGIVVEAVAG 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE9 LGIVVTILLLLAFLFLMRKIQDCSQWNVLPTQLLFLLSVLGLFGLAFAFIIELNQQTAPV : ..:.::.: .: . :.. . . . ..::::..::::::.:::::. .. : CCDS10 AGALITLLLMLILLVRLPFIKEKEKKSPVGLHFLFLLGTLGLFGLTFAFIIQEDETICSV 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE9 RYFLFGVLFALCFSCLLAHASNLVKLVR-GCVSFSWTTILCIAIGCSLLQIIIATEYVTL : ::.::::::::::::..: . .::: : .: . .:. :.:.:::.:...: CCDS10 RRFLWGVLFALCFSCLLSQAWRVRRLVRHGTGPAGWQLV-GLALCLMLVQVIIAVEWLVL 130 140 150 160 170 180 160 170 180 190 200 pF1KE9 IMTRGMMFVNMTP-CQLN-VDFVVLLVYVLFLMALTFFVSKATFCGPCENWKQHGRLIFI . : . : : . .:::. :.: . :...:. .. :.:: . :: .: ...: CCDS10 TVLR-----DTRPACAYEPMDFVMALIYDMVLLVVTLGLALFTLCGKFKRWKLNGAFLLI 190 200 210 220 230 210 220 230 240 250 260 pF1KE9 TVLFSIIIWVVWISMLLRGNPQFQRQPQWDDPVVCIALVTNAWVFLLLYIVPEL-CILYR :...:..:::.:..: : :: ..:. :.::.. :.:....:::.... .::. : : CCDS10 TAFLSVLIWVAWMTMYLFGNVKLQQGDAWNDPTLAITLAASGWVFVIFHAIPEIHCTLLP 240 250 260 270 280 290 270 280 290 300 310 320 pF1KE9 SCRQECPLQGNACPVTAYQHSFQVENQELSRARDSDGA----EEDVALTSYGTPIQPQTV . ... : .. . .:. :. .: :: . : :...:: . 
: : CCDS10 ALQENTPNYFDTSQPRMRETAFE-EDVQLPRAYMENKAFSMDEHNAALRTAGFPNGSLGK 300 310 320 330 340 350 330 340 pF1KE9 DPTQECFIPQAKLSPQQDAGGV CCDS10 RPSGSLGKRPSAPFRSNVYQPTEMAVVLNGGTIPTAPPSHTGRHLW 360 370 380 390 400 >>CCDS42378.1 GPRC5C gene_id:55890|Hs108|chr17 (453 aa) initn: 663 init1: 369 opt: 369 Z-score: 424.1 bits: 87.3 E(32554): 2.6e-17 Smith-Waterman score: 686; 37.2% identity (67.2% similar) in 296 aa overlap (12-295:51-344) 10 20 30 40 pF1KE9 MYKDCIESTGDYFLLCDAEGPWGIILESLAILGIVVTILLL :. ::: : :::.::..: :::.:..: CCDS42 MCLGLPLFLFPGAWAQGHVPPGCSQGLNPLYYNLCDRSGAWGIVLEAVAGAGIVTTFVLT 30 40 50 60 70 80 50 60 70 80 90 100 pF1KE9 LAFLFLMRKIQDCSQWNVLPTQLLFLLSVLGLFGLAFAFIIELNQQTAPVRYFLFGVLFA . .. . .:: .. ..: ::..:::..:::: :.:: ... . .: : :::::::: CCDS42 IILVASLPFVQDTKKRSLLGTQVFFLLGTLGLFCLVFACVVKPDFSTCASRRFLFGVLFA 90 100 110 120 130 140 110 120 130 140 150 pF1KE9 LCFSCLLAHASNLVKLVRGCVSFSWTTILCIAIGCSLLQIIIATEYVTLIMTRG------ .::::: ::. : :.: . .:. .:. .:...:: ::.. . ..:: CCDS42 ICFSCLAAHVFALNFLARKNHGPRGWVIFTVALLLTLVEVIINTEWLIITLVRGSGEGGP 150 160 170 180 190 200 160 170 180 190 200 pF1KE9 -----MMFVNMTPCQL-NVDFVVLLVYVLFLMALTFFVSKATFCGPCENWKQHGRLIFIT .. .:: . :.:::. :.::..:. .:. . ..:: . :..:: ....: CCDS42 QGNSSAGWAVASPCAIANMDFVMALIYVMLLLLGAFLGAWPALCGRYKRWRKHGVFVLLT 210 220 230 240 250 260 210 220 230 240 250 260 pF1KE9 VLFSIIIWVVWISMLLRGNPQFQRQPQWDDPVVCIALVTNAWVFLLLYIVPELCILYRSC . :. :::::: : :: : . .: ::::.. :::..:::.:.:.:..::. . .: CCDS42 TATSVAIWVVWIVMYTYGNKQ-HNSPTWDDPTLAIALAANAWAFVLFYVIPEVSQVTKSS 270 280 290 300 310 270 280 290 300 310 320 pF1KE9 RQECPLQGNACPVTAYQHSFQVENQELSRARDSDGAEEDVALTSYGTPIQPQTVDPTQEC : ::. :. . . ...:. 
CCDS42 -PEQSYQGDMYPTRGVGYETILKEQKGQSMFVENKAFSMDEPVAAKRPVSPYSGYNGQLL 320 330 340 350 360 370 >>CCDS11699.1 GPRC5C gene_id:55890|Hs108|chr17 (486 aa) initn: 663 init1: 369 opt: 369 Z-score: 423.7 bits: 87.3 E(32554): 2.8e-17 Smith-Waterman score: 686; 37.2% identity (67.2% similar) in 296 aa overlap (12-295:84-377) 10 20 30 40 pF1KE9 MYKDCIESTGDYFLLCDAEGPWGIILESLAILGIVVTILLL :. ::: : :::.::..: :::.:..: CCDS11 MCLGLPLFLFPGAWAQGHVPPGCSQGLNPLYYNLCDRSGAWGIVLEAVAGAGIVTTFVLT 60 70 80 90 100 110 50 60 70 80 90 100 pF1KE9 LAFLFLMRKIQDCSQWNVLPTQLLFLLSVLGLFGLAFAFIIELNQQTAPVRYFLFGVLFA . .. . .:: .. ..: ::..:::..:::: :.:: ... . .: : :::::::: CCDS11 IILVASLPFVQDTKKRSLLGTQVFFLLGTLGLFCLVFACVVKPDFSTCASRRFLFGVLFA 120 130 140 150 160 170 110 120 130 140 150 pF1KE9 LCFSCLLAHASNLVKLVRGCVSFSWTTILCIAIGCSLLQIIIATEYVTLIMTRG------ .::::: ::. : :.: . .:. .:. .:...:: ::.. . ..:: CCDS11 ICFSCLAAHVFALNFLARKNHGPRGWVIFTVALLLTLVEVIINTEWLIITLVRGSGEGGP 180 190 200 210 220 230 160 170 180 190 200 pF1KE9 -----MMFVNMTPCQL-NVDFVVLLVYVLFLMALTFFVSKATFCGPCENWKQHGRLIFIT .. .:: . :.:::. :.::..:. .:. . ..:: . :..:: ....: CCDS11 QGNSSAGWAVASPCAIANMDFVMALIYVMLLLLGAFLGAWPALCGRYKRWRKHGVFVLLT 240 250 260 270 280 290 210 220 230 240 250 260 pF1KE9 VLFSIIIWVVWISMLLRGNPQFQRQPQWDDPVVCIALVTNAWVFLLLYIVPELCILYRSC . :. :::::: : :: : . .: ::::.. :::..:::.:.:.:..::. . .: CCDS11 TATSVAIWVVWIVMYTYGNKQ-HNSPTWDDPTLAIALAANAWAFVLFYVIPEVSQVTKSS 300 310 320 330 340 350 270 280 290 300 310 320 pF1KE9 RQECPLQGNACPVTAYQHSFQVENQELSRARDSDGAEEDVALTSYGTPIQPQTVDPTQEC : ::. :. . . ...:. CCDS11 -PEQSYQGDMYPTRGVGYETILKEQKGQSMFVENKAFSMDEPVAAKRPVSPYSGYNGQLL 360 370 380 390 400 410 345 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:55:25 2016 done: Sun Nov 6 14:55:25 2016 Total Scan time: 2.610 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]