FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4029, 236 aa
  1>>>pF1KE4029 236 - 236 aa - 236 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.3930+/-0.000665; mu= 11.4182+/- 0.041
 mean_var=115.9796+/-22.568, 0's: 0 Z-trim(115.5): 166 B-trim: 482 in 1/51
 Lambda= 0.119092
 statistics sampled from 15895 (16067) to 15895 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.817), E-opt: 0.2 (0.494), width: 16
 Scan time: 2.700

The best scores are:                                opt  bits E(32554)
CCDS10068.1 RHOV gene_id:171177|Hs108|chr15 ( 236) 1622 288.3 2.9e-78
CCDS1575.1 RHOU gene_id:58480|Hs108|chr1    ( 258)  889 162.4 2.5e-40
CCDS221.1 CDC42 gene_id:998|Hs108|chr1      ( 191)  697 129.3 1.7e-30
CCDS5348.1 RAC1 gene_id:5879|Hs108|chr7     ( 192)  696 129.2 1.9e-30
CCDS222.1 CDC42 gene_id:998|Hs108|chr1      ( 191)  695 129.0 2.2e-30
CCDS11798.1 RAC3 gene_id:5881|Hs108|chr17   ( 192)  695 129.0 2.2e-30
CCDS33191.1 RHOQ gene_id:23433|Hs108|chr2   ( 205)  695 129.0 2.3e-30
CCDS13945.1 RAC2 gene_id:5880|Hs108|chr22   ( 192)  690 128.1 4e-30
CCDS7748.1 RHOG gene_id:391|Hs108|chr11     ( 191)  627 117.3 7.1e-27
CCDS9757.1 RHOJ gene_id:57381|Hs108|chr14   ( 214)  618 115.8 2.3e-26
CCDS2795.1 RHOA gene_id:387|Hs108|chr3      ( 193)  552 104.4 5.4e-23
CCDS1699.1 RHOB gene_id:388|Hs108|chr2      ( 196)  552 104.4 5.5e-23
CCDS854.1 RHOC gene_id:389|Hs108|chr1       ( 193)  544 103.1 1.4e-22
CCDS8155.1 RHOD gene_id:29984|Hs108|chr11   ( 210)  534 101.4 5e-22
CCDS3458.1 RHOH gene_id:399|Hs108|chr4      ( 191)  480  92.1 2.9e-19
CCDS9222.1 RHOF gene_id:54509|Hs108|chr12   ( 211)  473  90.9 7.1e-19
CCDS82775.1 RHOA gene_id:387|Hs108|chr3     ( 187)  455  87.7 5.5e-18
CCDS2190.1 RND3 gene_id:390|Hs108|chr2      ( 244)  454  87.7 7.6e-18
CCDS8771.1 RND1 gene_id:27289|Hs108|chr12   ( 232)  414  80.8 8.6e-16
CCDS11452.1 RND2 gene_id:8153|Hs108|chr17   ( 227)  411  80.3 1.2e-15
CCDS5349.1 RAC1 gene_id:5879|Hs108|chr7     ( 211)  335  67.2 9.7e-12

>>CCDS10068.1 RHOV gene_id:171177|Hs108|chr15 (236 aa)
 initn: 1622 init1: 1622 opt: 1622 Z-score: 1518.6 bits: 288.3 E(32554): 2.9e-78
Smith-Waterman score: 1622; 100.0% identity (100.0% similar) in 236 aa overlap (1-236:1-236)

               10        20        30        40        50        60
pF1KE4 MPPRELSEAEPPPLRAPTPPPRRRSAPPELGIKCVLVGDGAVGKSSLIVSYTCNGYPARY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MPPRELSEAEPPPLRAPTPPPRRRSAPPELGIKCVLVGDGAVGKSSLIVSYTCNGYPARY
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 RPTALDTFSVQVLVDGAPVRIELWDTAGQEDFDRLRSLCYPDTDVFLACFSVVQPSSFQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 RPTALDTFSVQVLVDGAPVRIELWDTAGQEDFDRLRSLCYPDTDVFLACFSVVQPSSFQN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 ITEKWLPEIRTHNPQAPVLLVGTQADLRDDVNVLIQLDQGGREGPVPQPQAQGLAEKIRA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 ITEKWLPEIRTHNPQAPVLLVGTQADLRDDVNVLIQLDQGGREGPVPQPQAQGLAEKIRA
              130       140       150       160       170       180

              190       200       210       220       230
pF1KE4 CCYLECSALTQKNLKEVFDSAILSAIEHKARLEKKLNAKGVRTLSRCRWKKFFCFV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 CCYLECSALTQKNLKEVFDSAILSAIEHKARLEKKLNAKGVRTLSRCRWKKFFCFV
              190       200       210       220       230

>>CCDS1575.1 RHOU gene_id:58480|Hs108|chr1 (258 aa)
 initn: 935 init1: 655 opt: 889 Z-score: 837.4 bits: 162.4 E(32554): 2.5e-40
Smith-Waterman score: 905; 54.4% identity (76.1% similar) in 259 aa overlap (1-236:1-258)

               10        20                          30        40
pF1KE4 MPPRELSEAEPPPLRAPTPPPRRRSA------PPE------------LGIKCVLVGDGAV
       :::.. . : :   .::  ::::. .      : :             :.::::::::::
CCDS15 MPPQQGDPAFPDRCEAPPVPPRRERGGRGGRGPGEPGGRGRAGGAEGRGVKCVLVGDGAV
               10        20        30        40        50        60

               50        60        70        80        90       100
pF1KE4 GKSSLIVSYTCNGYPARYRPTALDTFSVQVLVDGAPVRIELWDTAGQEDFDRLRSLCYPD
       ::.::.:::: ::::..: :::.:.::. : ::: :::..: :::::..::.:: ::: .
CCDS15 GKTSLVVSYTTNGYPTEYIPTAFDNFSAVVSVDGRPVRLQLCDTAGQDEFDKLRPLCYTN 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE4 TDVFLACFSVVQPSSFQNITEKWLPEIRTHNPQAPVLLVGTQADLRDDVNVLIQLDQGGR ::.:: :::::.::::::..:::.:::: : :.::..:::::.:::.::.:::.::. . CCDS15 TDIFLLCFSVVSPSSFQNVSEKWVPEIRCHCPKAPIILVGTQSDLREDVKVLIELDKC-K 130 140 150 160 170 170 180 190 200 210 220 pF1KE4 EGPVPQPQAQGLAEKIRACCYLECSALTQKNLKEVFDSAILSAIEHKARLEKKLNAKG-- : :::. :. ::.:.: :.:::::::::::::::.::...:... .. ..:. CCDS15 EKPVPEEAAKLCAEEIKAASYIECSALTQKNLKEVFDAAIVAGIQYSDTQQQPKKSKSRT 180 190 200 210 220 230 230 pF1KE4 ---VRTLSRCRWKKFFCFV ...::. :::. ::: CCDS15 PDKMKNLSKSWWKKYCCFV 240 250 >>CCDS221.1 CDC42 gene_id:998|Hs108|chr1 (191 aa) initn: 695 init1: 555 opt: 697 Z-score: 660.9 bits: 129.3 E(32554): 1.7e-30 Smith-Waterman score: 697; 57.4% identity (81.8% similar) in 176 aa overlap (32-207:4-178) 10 20 30 40 50 60 pF1KE4 PPRELSEAEPPPLRAPTPPPRRRSAPPELGIKCVLVGDGAVGKSSLIVSYTCNGYPARYR ::::.::::::::. :..::: : .:..: CCDS22 MQTIKCVVVGDGAVGKTCLLISYTTNKFPSEYV 10 20 30 70 80 90 100 110 120 pF1KE4 PTALDTFSVQVLVDGAPVRIELWDTAGQEDFDRLRSLCYPDTDVFLACFSVVQPSSFQNI ::..:...: :.. : : . :.:::::::.:::: : ::.:::::.:::::.::::.:. CCDS22 PTVFDNYAVTVMIGGEPYTLGLFDTAGQEDYDRLRPLSYPQTDVFLVCFSVVSPSSFENV 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE4 TEKWLPEIRTHNPQAPVLLVGTQADLRDDVNVLIQLDQGGREGPVPQPQAQGLAEKIRAC :::.::: : :..: :::::: ::::: ... .: .. .. :. :. ::. ..: CCDS22 KEKWVPEITHHCPKTPFLLVGTQIDLRDDPSTIEKLAKN-KQKPITPETAEKLARDLKAV 100 110 120 130 140 150 190 200 210 220 230 pF1KE4 CYLECSALTQKNLKEVFDSAILSAIEHKARLEKKLNAKGVRTLSRCRWKKFFCFV :.::::::::.::.::: :::.:.: CCDS22 KYVECSALTQKGLKNVFDEAILAALEPPEPKKSRRCVLL 160 170 180 190 >>CCDS5348.1 RAC1 gene_id:5879|Hs108|chr7 (192 aa) initn: 698 init1: 570 opt: 696 Z-score: 660.0 bits: 129.2 E(32554): 1.9e-30 Smith-Waterman score: 696; 57.3% identity (82.5% similar) in 171 aa overlap (32-202:4-173) 10 20 30 40 50 60 pF1KE4 PPRELSEAEPPPLRAPTPPPRRRSAPPELGIKCVLVGDGAVGKSSLIVSYTCNGYPARYR ::::.::::::::. 
:..::: :..:..: CCDS53 MQAIKCVVVGDGAVGKTCLLISYTTNAFPGEYI 10 20 30 70 80 90 100 110 120 pF1KE4 PTALDTFSVQVLVDGAPVRIELWDTAGQEDFDRLRSLCYPDTDVFLACFSVVQPSSFQNI ::..:..:..:.::: :: . :::::::::.:::: : ::.::::: :::.:.:.::.:. CCDS53 PTVFDNYSANVMVDGKPVNLGLWDTAGQEDYDRLRPLSYPQTDVFLICFSLVSPASFENV 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE4 TEKWLPEIRTHNPQAPVLLVGTQADLRDDVNVLIQLDQGGREGPVPQPQAQGLAEKIRAC :: ::.: : :..:..::::. ::::: ... .: . . :. ::. ..:..: : CCDS53 RAKWYPEVRHHCPNTPIILVGTKLDLRDDKDTIEKLKEK-KLTPITYPQGLAMAKEIGAV 100 110 120 130 140 150 190 200 210 220 230 pF1KE4 CYLECSALTQKNLKEVFDSAILSAIEHKARLEKKLNAKGVRTLSRCRWKKFFCFV :::::::::..:: ::: :: CCDS53 KYLECSALTQRGLKTVFDEAIRAVLCPPPVKKRKRKCLLL 160 170 180 190 >>CCDS222.1 CDC42 gene_id:998|Hs108|chr1 (191 aa) initn: 692 init1: 555 opt: 695 Z-score: 659.1 bits: 129.0 E(32554): 2.2e-30 Smith-Waterman score: 695; 54.9% identity (79.3% similar) in 184 aa overlap (32-215:4-186) 10 20 30 40 50 60 pF1KE4 PPRELSEAEPPPLRAPTPPPRRRSAPPELGIKCVLVGDGAVGKSSLIVSYTCNGYPARYR ::::.::::::::. :..::: : .:..: CCDS22 MQTIKCVVVGDGAVGKTCLLISYTTNKFPSEYV 10 20 30 70 80 90 100 110 120 pF1KE4 PTALDTFSVQVLVDGAPVRIELWDTAGQEDFDRLRSLCYPDTDVFLACFSVVQPSSFQNI ::..:...: :.. : : . :.:::::::.:::: : ::.:::::.:::::.::::.:. CCDS22 PTVFDNYAVTVMIGGEPYTLGLFDTAGQEDYDRLRPLSYPQTDVFLVCFSVVSPSSFENV 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE4 TEKWLPEIRTHNPQAPVLLVGTQADLRDDVNVLIQLDQGGREGPVPQPQAQGLAEKIRAC :::.::: : :..: :::::: ::::: ... .: .. .. :. :. ::. ..: CCDS22 KEKWVPEITHHCPKTPFLLVGTQIDLRDDPSTIEKLAKN-KQKPITPETAEKLARDLKAV 100 110 120 130 140 150 190 200 210 220 230 pF1KE4 CYLECSALTQKNLKEVFDSAILSAIEHKARLEKKLNAKGVRTLSRCRWKKFFCFV :.:::::::..::.::: :::.:.: :. 
CCDS22 KYVECSALTQRGLKNVFDEAILAALEPPETQPKRKCCIF 160 170 180 190 >>CCDS11798.1 RAC3 gene_id:5881|Hs108|chr17 (192 aa) initn: 695 init1: 572 opt: 695 Z-score: 659.0 bits: 129.0 E(32554): 2.2e-30 Smith-Waterman score: 695; 57.3% identity (83.0% similar) in 171 aa overlap (32-202:4-173) 10 20 30 40 50 60 pF1KE4 PPRELSEAEPPPLRAPTPPPRRRSAPPELGIKCVLVGDGAVGKSSLIVSYTCNGYPARYR ::::.::::::::. :..::: :..:..: CCDS11 MQAIKCVVVGDGAVGKTCLLISYTTNAFPGEYI 10 20 30 70 80 90 100 110 120 pF1KE4 PTALDTFSVQVLVDGAPVRIELWDTAGQEDFDRLRSLCYPDTDVFLACFSVVQPSSFQNI ::..:..:..:.::: :: . :::::::::.:::: : ::.::::: :::.:.:.::.:. CCDS11 PTVFDNYSANVMVDGKPVNLGLWDTAGQEDYDRLRPLSYPQTDVFLICFSLVSPASFENV 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE4 TEKWLPEIRTHNPQAPVLLVGTQADLRDDVNVLIQLDQGGREGPVPQPQAQGLAEKIRAC :: ::.: : :..:.:::::. ::::: ... .: . . .:. ::. ..:..: . CCDS11 RAKWYPEVRHHCPHTPILLVGTKLDLRDDKDTIERL-RDKKLAPITYPQGLAMAREIGSV 100 110 120 130 140 150 190 200 210 220 230 pF1KE4 CYLECSALTQKNLKEVFDSAILSAIEHKARLEKKLNAKGVRTLSRCRWKKFFCFV :::::::::..:: ::: :: CCDS11 KYLECSALTQRGLKTVFDEAIRAVLCPPPVKKPGKKCTVF 160 170 180 190 >>CCDS33191.1 RHOQ gene_id:23433|Hs108|chr2 (205 aa) initn: 667 init1: 509 opt: 695 Z-score: 658.7 bits: 129.0 E(32554): 2.3e-30 Smith-Waterman score: 695; 52.7% identity (75.1% similar) in 201 aa overlap (27-227:5-198) 10 20 30 40 50 60 pF1KE4 MPPRELSEAEPPPLRAPTPPPRRRSAPPELGIKCVLVGDGAVGKSSLIVSYTCNGYPARY : : .:::.::::::::. :..::. ...: .: CCDS33 MAHGPGALMLKCVVVGDGAVGKTCLLMSYANDAFPEEY 10 20 30 70 80 90 100 110 120 pF1KE4 RPTALDTFSVQVLVDGAPVRIELWDTAGQEDFDRLRSLCYPDTDVFLACFSVVQPSSFQN ::..: ..:.: : : . :.:::::::.:::: : :: ::::: :::::.:.:::: CCDS33 VPTVFDHYAVSVTVGGKQYLLGLYDTAGQEDYDRLRPLSYPMTDVFLICFSVVNPASFQN 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE4 ITEKWLPEIRTHNPQAPVLLVGTQADLRDDVNVLIQLDQGGREGPVPQPQAQGLAEKIRA . :.:.::.. . :..: ::.::: ::::: ..: .:.. .: :. 
:.: ::..: : CCDS33 VKEEWVPELKEYAPNVPFLLIGTQIDLRDDPKTLARLNDM-KEKPICVEQGQKLAKEIGA 100 110 120 130 140 150 190 200 210 220 230 pF1KE4 CCYLECSALTQKNLKEVFDSAILSAIEHKARLEKKLNAKGVRTLSRCRWKKFFCFV :::.::::::::.:: ::: ::.. . : . :: : ::: CCDS33 CCYVECSALTQKGLKTVFDEAIIAILTPKKHTVKK------RIGSRCINCCLIT 160 170 180 190 200 >>CCDS13945.1 RAC2 gene_id:5880|Hs108|chr22 (192 aa) initn: 687 init1: 560 opt: 690 Z-score: 654.4 bits: 128.1 E(32554): 4e-30 Smith-Waterman score: 690; 53.0% identity (82.2% similar) in 185 aa overlap (32-215:4-187) 10 20 30 40 50 60 pF1KE4 PPRELSEAEPPPLRAPTPPPRRRSAPPELGIKCVLVGDGAVGKSSLIVSYTCNGYPARYR ::::.::::::::. :..::: :..:..: CCDS13 MQAIKCVVVGDGAVGKTCLLISYTTNAFPGEYI 10 20 30 70 80 90 100 110 120 pF1KE4 PTALDTFSVQVLVDGAPVRIELWDTAGQEDFDRLRSLCYPDTDVFLACFSVVQPSSFQNI ::..:..:..:.::. :: . :::::::::.:::: : ::.::::: :::.:.:.:..:. CCDS13 PTVFDNYSANVMVDSKPVNLGLWDTAGQEDYDRLRPLSYPQTDVFLICFSLVSPASYENV 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE4 TEKWLPEIRTHNPQAPVLLVGTQADLRDDVNVLIQLDQGGREGPVPQPQAQGLAEKIRAC ::.::.: : :..:..::::. ::::: ... .: . . .:. ::. .::..: . CCDS13 RAKWFPEVRHHCPSTPIILVGTKLDLRDDKDTIEKLKEK-KLAPITYPQGLALAKEIDSV 100 110 120 130 140 150 190 200 210 220 230 pF1KE4 CYLECSALTQKNLKEVFDSAILSAI-EHKARLEKKLNAKGVRTLSRCRWKKFFCFV :::::::::..:: ::: :: ... . .: .:. CCDS13 KYLECSALTQRGLKTVFDEAIRAVLCPQPTRQQKRACSLL 160 170 180 190 >>CCDS7748.1 RHOG gene_id:391|Hs108|chr11 (191 aa) initn: 610 init1: 522 opt: 627 Z-score: 595.9 bits: 117.3 E(32554): 7.1e-27 Smith-Waterman score: 627; 52.0% identity (80.1% similar) in 171 aa overlap (32-202:4-173) 10 20 30 40 50 60 pF1KE4 PPRELSEAEPPPLRAPTPPPRRRSAPPELGIKCVLVGDGAVGKSSLIVSYTCNGYPARYR ::::.::::::::. :.. :: :..: .: CCDS77 MQSIKCVVVGDGAVGKTCLLICYTTNAFPKEYI 10 20 30 70 80 90 100 110 120 pF1KE4 PTALDTFSVQVLVDGAPVRIELWDTAGQEDFDRLRSLCYPDTDVFLACFSVVQPSSFQNI ::..:..:.: ::: : ..::::::::..::::.: ::.:.::. :::...: :..:. 
CCDS77 PTVFDNYSAQSAVDGRTVNLNLWDTAGQEEYDRLRTLSYPQTNVFVICFSIASPPSYENV 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE4 TEKWLPEIRTHNPQAPVLLVGTQADLRDDVNVLIQLDQGGREGPVPQPQAQGLAEKIRAC .:: ::. : :..:.:::::. ::: . ..: .: . : ..:. :.:.::..:.: CCDS77 RHKWHPEVCHHCPDVPILLVGTKKDLRAQPDTLRRLKEQG-QAPITPQQGQALAKQIHAV 100 110 120 130 140 150 190 200 210 220 230 pF1KE4 CYLECSALTQKNLKEVFDSAILSAIEHKARLEKKLNAKGVRTLSRCRWKKFFCFV ::::::: : ..:::: :. CCDS77 RYLECSALQQDGVKEVFAEAVRAVLNPTPIKRGRSCILL 160 170 180 190 >>CCDS9757.1 RHOJ gene_id:57381|Hs108|chr14 (214 aa) initn: 609 init1: 478 opt: 618 Z-score: 586.9 bits: 115.8 E(32554): 2.3e-26 Smith-Waterman score: 618; 51.1% identity (76.1% similar) in 184 aa overlap (32-215:22-203) 10 20 30 40 50 60 pF1KE4 PPRELSEAEPPPLRAPTPPPRRRSAPPELGIKCVLVGDGAVGKSSLIVSYTCNGYPARYR .:::.::::::::. :..::. ...: .: CCDS97 MNCKEGTDSSCGCRGNDEKKMLKCVVVGDGAVGKTCLLMSYANDAFPEEYV 10 20 30 40 50 70 80 90 100 110 120 pF1KE4 PTALDTFSVQVLVDGAPVRIELWDTAGQEDFDRLRSLCYPDTDVFLACFSVVQPSSFQNI ::..: ..: : : : . :.:::::::...:: : ::.::::: :::::.:.:..:. CCDS97 PTVFDHYAVTVTVGGKQHLLGLYDTAGQEDYNQLRPLSYPNTDVFLICFSVVNPASYHNV 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE4 TEKWLPEIRTHNPQAPVLLVGTQADLRDDVNVLIQLDQGGREGPVPQPQAQGLAEKIRAC :.:.::.. :..: .:.::: ::::: ..: .: .: :. .. ::. : : CCDS97 QEEWVPELKDCMPHVPYVLIGTQIDLRDDPKTLARLLYM-KEKPLTYEHGVKLAKAIGAQ 120 130 140 150 160 170 190 200 210 220 230 pF1KE4 CYLECSALTQKNLKEVFDSAILSAIEHKARLEKKLNAKGVRTLSRCRWKKFFCFV :::::::::::.:: ::: :::. : : . .:. CCDS97 CYLECSALTQKGLKAVFDEAILT-IFHPKKKKKRCSEGHSCCSII 180 190 200 210 236 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:19:17 2016 done: Sun Nov 6 04:19:18 2016 Total Scan time: 2.700 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]