FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE9462, 486 aa 1>>>pF1KE9462 486 - 486 aa - 486 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4239+/-0.000844; mu= 19.2341+/- 0.051 mean_var=94.5823+/-18.666, 0's: 0 Z-trim(110.0): 28 B-trim: 2 in 1/51 Lambda= 0.131877 statistics sampled from 11253 (11279) to 11253 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.693), E-opt: 0.2 (0.346), width: 16 Scan time: 3.280 The best scores are: opt bits E(32554) CCDS11699.1 GPRC5C gene_id:55890|Hs108|chr17 ( 486) 3301 638.2 5.8e-183 CCDS42378.1 GPRC5C gene_id:55890|Hs108|chr17 ( 453) 3061 592.5 3.1e-169 CCDS10581.1 GPRC5B gene_id:51704|Hs108|chr16 ( 403) 541 113.0 6.1e-25 CCDS8657.1 GPRC5A gene_id:9052|Hs108|chr12 ( 357) 478 101.0 2.3e-21 CCDS8658.1 GPRC5D gene_id:55507|Hs108|chr12 ( 345) 369 80.2 3.8e-15 >>CCDS11699.1 GPRC5C gene_id:55890|Hs108|chr17 (486 aa) initn: 3301 init1: 3301 opt: 3301 Z-score: 3398.2 bits: 638.2 E(32554): 5.8e-183 Smith-Waterman score: 3301; 100.0% identity (100.0% similar) in 486 aa overlap (1-486:1-486) 10 20 30 40 50 60 pF1KE9 MRGRGSQQQQPTRRQGQKLPSPSPAGKYESAQPGGTQPEPGLGARMAIHKALVMCLGLPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MRGRGSQQQQPTRRQGQKLPSPSPAGKYESAQPGGTQPEPGLGARMAIHKALVMCLGLPL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE9 FLFPGAWAQGHVPPGCSQGLNPLYYNLCDRSGAWGIVLEAVAGAGIVTTFVLTIILVASL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 FLFPGAWAQGHVPPGCSQGLNPLYYNLCDRSGAWGIVLEAVAGAGIVTTFVLTIILVASL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE9 PFVQDTKKRSLLGTQVFFLLGTLGLFCLVFACVVKPDFSTCASRRFLFGVLFAICFSCLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 PFVQDTKKRSLLGTQVFFLLGTLGLFCLVFACVVKPDFSTCASRRFLFGVLFAICFSCLA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE9 AHVFALNFLARKNHGPRGWVIFTVALLLTLVEVIINTEWLIITLVRGSGEGGPQGNSSAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 AHVFALNFLARKNHGPRGWVIFTVALLLTLVEVIINTEWLIITLVRGSGEGGPQGNSSAG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE9 WAVASPCAIANMDFVMALIYVMLLLLGAFLGAWPALCGRYKRWRKHGVFVLLTTATSVAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 WAVASPCAIANMDFVMALIYVMLLLLGAFLGAWPALCGRYKRWRKHGVFVLLTTATSVAI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE9 WVVWIVMYTYGNKQHNSPTWDDPTLAIALAANAWAFVLFYVIPEVSQVTKSSPEQSYQGD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 WVVWIVMYTYGNKQHNSPTWDDPTLAIALAANAWAFVLFYVIPEVSQVTKSSPEQSYQGD 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE9 MYPTRGVGYETILKEQKGQSMFVENKAFSMDEPVAAKRPVSPYSGYNGQLLTSVYQPTEM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MYPTRGVGYETILKEQKGQSMFVENKAFSMDEPVAAKRPVSPYSGYNGQLLTSVYQPTEM 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE9 ALMHKVPSEGAYDIILPRATANSQVMGSANSTLRAEDMYSAQSHQAATPPKDGKNSQVFR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 ALMHKVPSEGAYDIILPRATANSQVMGSANSTLRAEDMYSAQSHQAATPPKDGKNSQVFR 430 440 450 460 470 480 pF1KE9 NPYVWD :::::: CCDS11 NPYVWD >>CCDS42378.1 GPRC5C gene_id:55890|Hs108|chr17 (453 aa) initn: 3061 init1: 3061 opt: 3061 Z-score: 3151.8 bits: 592.5 E(32554): 3.1e-169 Smith-Waterman score: 3061; 100.0% identity (100.0% similar) in 452 aa overlap (35-486:2-453) 10 20 30 40 50 60 pF1KE9 GSQQQQPTRRQGQKLPSPSPAGKYESAQPGGTQPEPGLGARMAIHKALVMCLGLPLFLFP :::::::::::::::::::::::::::::: CCDS42 MGTQPEPGLGARMAIHKALVMCLGLPLFLFP 10 20 30 70 80 90 100 110 120 pF1KE9 GAWAQGHVPPGCSQGLNPLYYNLCDRSGAWGIVLEAVAGAGIVTTFVLTIILVASLPFVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GAWAQGHVPPGCSQGLNPLYYNLCDRSGAWGIVLEAVAGAGIVTTFVLTIILVASLPFVQ 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE9 DTKKRSLLGTQVFFLLGTLGLFCLVFACVVKPDFSTCASRRFLFGVLFAICFSCLAAHVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 DTKKRSLLGTQVFFLLGTLGLFCLVFACVVKPDFSTCASRRFLFGVLFAICFSCLAAHVF 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE9 ALNFLARKNHGPRGWVIFTVALLLTLVEVIINTEWLIITLVRGSGEGGPQGNSSAGWAVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 ALNFLARKNHGPRGWVIFTVALLLTLVEVIINTEWLIITLVRGSGEGGPQGNSSAGWAVA 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE9 SPCAIANMDFVMALIYVMLLLLGAFLGAWPALCGRYKRWRKHGVFVLLTTATSVAIWVVW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SPCAIANMDFVMALIYVMLLLLGAFLGAWPALCGRYKRWRKHGVFVLLTTATSVAIWVVW 220 230 240 250 260 270 310 320 330 340 350 360 pF1KE9 IVMYTYGNKQHNSPTWDDPTLAIALAANAWAFVLFYVIPEVSQVTKSSPEQSYQGDMYPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 IVMYTYGNKQHNSPTWDDPTLAIALAANAWAFVLFYVIPEVSQVTKSSPEQSYQGDMYPT 280 290 300 310 320 330 370 380 390 400 410 420 pF1KE9 RGVGYETILKEQKGQSMFVENKAFSMDEPVAAKRPVSPYSGYNGQLLTSVYQPTEMALMH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 RGVGYETILKEQKGQSMFVENKAFSMDEPVAAKRPVSPYSGYNGQLLTSVYQPTEMALMH 340 350 360 370 380 390 430 440 450 460 470 480 pF1KE9 KVPSEGAYDIILPRATANSQVMGSANSTLRAEDMYSAQSHQAATPPKDGKNSQVFRNPYV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 KVPSEGAYDIILPRATANSQVMGSANSTLRAEDMYSAQSHQAATPPKDGKNSQVFRNPYV 400 410 420 430 440 450 pF1KE9 WD :: CCDS42 WD >>CCDS10581.1 GPRC5B gene_id:51704|Hs108|chr16 (403 aa) initn: 944 init1: 534 opt: 541 Z-score: 561.3 bits: 113.0 E(32554): 6.1e-25 Smith-Waterman score: 945; 40.8% identity (68.9% similar) in 395 aa overlap (45-423:8-384) 20 30 40 50 60 70 pF1KE9 QGQKLPSPSPAGKYESAQPGGTQPEPGLGARMAIHKALVMCLGLPLFLFPGAWAQG-HVP .: :..:.. : ::.. .. ... . CCDS10 MFVASERKMRAHQVLTFLL---LFVITSVASENASTS 10 20 30 80 90 100 110 120 130 pF1KE9 PGCSQGLNPLYYNLCDRSGAWGIVLEAVAGAGIVTTFVLTIILVASLPFVQDTKKRSLLG ::. : : : .::: .. ::::.::::::: . :..: .::.. :::... .:.: .: CCDS10 RGCGLDLLPQYVSLCDLDAIWGIVVEAVAGAGALITLLLMLILLVRLPFIKEKEKKSPVG 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE9 TQVFFLLGTLGLFCLVFACVVKPDFSTCASRRFLFGVLFAICFSCLAAHVFALNFLARKN . .::::::::: :.:: ... : . :. ::::.:::::.::::: .... . :.:.. CCDS10 LHFLFLLGTLGLFGLTFAFIIQEDETICSVRRFLWGVLFALCFSCLLSQAWRVRRLVRHG 100 110 120 130 140 150 200 210 220 230 240 250 pF1KE9 HGPRGWVIFTVALLLTLVEVIINTEWLIITLVRGSGEGGPQGNSSAGWAVASPCAIANMD :: :: . .:: : ::.::: .:::..:..: . : :: :: CCDS10 TGPAGWQLVGLALCLMLVQVIIAVEWLVLTVLRDTR---PA------------CAYEPMD 160 170 180 190 260 270 280 290 300 310 pF1KE9 FVMALIYVMLLLLGAFLGAWPALCGRYKRWRKHGVFVLLTTATSVAIWVVWIVMYTYGN- ::::::: :.::. .. : .:::..:::. .:.:.:.:. :: :::.:..:: .:: CCDS10 FVMALIYDMVLLVVTLGLALFTLCGKFKRWKLNGAFLLITAFLSVLIWVAWMTMYLFGNV 200 210 220 230 240 250 320 330 340 350 360 370 pF1KE9 KQHNSPTWDDPTLAIALAANAWAFVLFYVIPEVSQVTKSSPEQSYQGDMYPTRGVGYETI : ... .:.::::::.:::..:.::.:..:::. . . ... . . .. :: CCDS10 KLQQGDAWNDPTLAITLAASGWVFVIFHAIPEIHCTLLPALQENTPNYFDTSQPRMRETA 260 270 280 290 300 310 380 390 400 410 pF1KE9 LKEQ-KGQSMFVENKAFSMDEPVAAKRPVSPYSGYNGQ-------------LLTSVYQPT ..:. . ..::::::::: :: : .. .: :. . ..::::: CCDS10 FEEDVQLPRAYMENKAFSMDEHNAALRTAGFPNGSLGKRPSGSLGKRPSAPFRSNVYQPT 320 330 340 350 360 370 420 430 440 450 460 470 pF1KE9 EMALMHKVPSEGAYDIILPRATANSQVMGSANSTLRAEDMYSAQSHQAATPPKDGKNSQV :::.. CCDS10 EMAVVLNGGTIPTAPPSHTGRHLW 380 390 400 >>CCDS8657.1 GPRC5A gene_id:9052|Hs108|chr12 (357 aa) initn: 726 init1: 456 opt: 478 Z-score: 497.2 bits: 101.0 E(32554): 2.3e-21 Smith-Waterman score: 718; 38.5% identity (64.7% similar) in 374 aa overlap (72-441:5-342) 50 60 70 80 90 100 pF1KE9 LGARMAIHKALVMCLGLPLFLFPGAWAQGHVPPGCSQGLNPLYYNLCDRSGAWGIVLEAV :: :: .::. :: :::.. :::::::.: CCDS86 MATTVPDGCRNGLKSKYYRLCDKAEAWGIVLETV 10 20 30 110 120 130 140 150 pF1KE9 AGAGIVTT--FVLTI-ILVASLPFVQDTKKRSLLGTQVFFLLGTLGLFCLVFACVVKPDF : ::.::. :.::. ::: . :::...:..: :: .::::.::.: :.:: .. : CCDS86 ATAGVVTSVAFMLTLPILVCK---VQDSNRRKMLPTQFLFLLGVLGIFGLTFAFIIGLDG 40 50 60 70 80 90 160 170 180 190 200 210 pF1KE9 STCASRRFLFGVLFAICFSCLAAHVFALNFLARKNHGPRGWVIFTVALLLTLVEVIINTE :: .: ::::.::.:::::: ::. .:. :.: . ::. .:. ..::. .: : CCDS86 STGPTRFFLFGILFSICFSCLLAHAVSLTKLVRGRKPLSLLVILGLAVGFSLVQDVIAIE 100 110 120 130 140 150 220 230 240 250 260 270 pF1KE9 WLIITLVRGSGEGGPQGNSSAGWAVASPCAIANMDFVMALIYVMLLLLGAFLGAWPALCG ....:. : : .. ...: : :::. : ::..:. .:: . ..:: CCDS86 YIVLTMNRT--------NVNVFSELSAPRR--NEDFVLLLTYVLFLMALTFLMSSFTFCG 160 170 180 190 200 280 290 300 310 320 330 pF1KE9 RYKRWRKHGVFVLLTTATSVAIWVVWIVMYTYGNKQHNSPTWDDPTLAIALAANAWAFVL . :..::. . :: :.::::.::.. . .. ::: :. :::::.:.:.: CCDS86 SFTGWKRHGAHIYLTMLLSIAIWVAWITLLMLPDFDRR---WDDTILSSALAANGWVFLL 210 220 230 240 250 340 350 360 370 380 390 pF1KE9 FYVIPEVSQVTKSSPEQSYQGDMYPTRGVGYETILKEQK-GQSMFVENKAFSMDEPVAAK :: :: .:: : : ::.. ... : : .:. :::.:.:..: CCDS86 AYVSPEFWLLTK----QRNPMD-YPVE----DAFCKPQLVKKSYGVENRAYSQEE----- 260 270 280 290 300 400 410 420 430 440 450 pF1KE9 RPVSPYSGYNGQLLTSVYQPTEMALMHKVPSEGAYDIILPRATANSQVMGSANSTLRAED .. .:. : . :. :.. :... :.. .. .::: : CCDS86 --ITQGFEETGDTLYAPYS-THFQLQNQPPQK---EFSIPRAHAWPSPYKDYEVKKEGS 310 320 330 340 350 460 470 480 pF1KE9 MYSAQSHQAATPPKDGKNSQVFRNPYVWD >>CCDS8658.1 GPRC5D gene_id:55507|Hs108|chr12 (345 aa) initn: 663 init1: 369 opt: 369 Z-score: 385.3 bits: 80.2 E(32554): 3.8e-15 Smith-Waterman score: 686; 37.2% identity (67.2% similar) in 296 aa overlap (84-377:12-295) 60 70 80 90 100 110 pF1KE9 MCLGLPLFLFPGAWAQGHVPPGCSQGLNPLYYNLCDRSGAWGIVLEAVAGAGIVTTFVLT :. ::: : :::.::..: :::.:..: CCDS86 MYKDCIESTGDYFLLCDAEGPWGIILESLAILGIVVTILLL 10 20 30 40 120 130 140 150 160 170 pF1KE9 IILVASLPFVQDTKKRSLLGTQVFFLLGTLGLFCLVFACVVKPDFSTCASRRFLFGVLFA . .. . .:: .. ..: ::..:::..:::: :.:: ... . .: : :::::::: CCDS86 LAFLFLMRKIQDCSQWNVLPTQLLFLLSVLGLFGLAFAFIIELNQQTAPVRYFLFGVLFA 50 60 70 80 90 100 180 190 200 210 220 230 pF1KE9 ICFSCLAAHVFALNFLARKNHGPRGWVIFTVALLLTLVEVIINTEWLIITLVRGSGEGGP .::::: ::. : :.: . .:. .:. .:...:: ::.. . ..:: CCDS86 LCFSCLLAHASNLVKLVRGCVSFSWTTILCIAIGCSLLQIIIATEYVTLIMTRG------ 110 120 130 140 150 240 250 260 270 280 290 pF1KE9 QGNSSAGWAVASPCAIANMDFVMALIYVMLLLLGAFLGAWPALCGRYKRWRKHGVFVLLT .. .:: . :.:::. :.::..:. .:. . ..:: . :..:: ....: CCDS86 -----MMFVNMTPCQL-NVDFVVLLVYVLFLMALTFFVSKATFCGPCENWKQHGRLIFIT 160 170 180 190 200 300 310 320 330 340 350 pF1KE9 TATSVAIWVVWIVMYTYGNKQ-HNSPTWDDPTLAIALAANAWAFVLFYVIPEVSQVTKSS . :. :::::: : :: : . .: ::::.. :::..:::.:.:.:..::. . .: CCDS86 VLFSIIIWVVWISMLLRGNPQFQRQPQWDDPVVCIALVTNAWVFLLLYIVPELCILYRSC 210 220 230 240 250 260 360 370 380 390 400 410 pF1KE9 -PEQSYQGDMYPTRGVGYETILKEQKGQSMFVENKAFSMDEPVAAKRPVSPYSGYNGQLL : ::. :. . . ...:. CCDS86 RQECPLQGNACPVTAYQHSFQVENQELSRARDSDGAEEDVALTSYGTPIQPQTVDPTQEC 270 280 290 300 310 320 486 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 03:50:20 2016 done: Mon Nov 7 03:50:20 2016 Total Scan time: 3.280 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]