FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4209, 729 aa
  1>>>pF1KE4209 729 - 729 aa - 729 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.1963+/-0.00104; mu= 3.7874+/- 0.062
 mean_var=302.9908+/-63.686, 0's: 0 Z-trim(112.6): 52 B-trim: 4 in 1/52
 Lambda= 0.073682
 statistics sampled from 13271 (13309) to 13271 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.736), E-opt: 0.2 (0.409), width: 16
 Scan time: 3.990

The best scores are:                                  opt  bits E(32554)
CCDS4280.1 SH3RF2 gene_id:153769|Hs108|chr5   ( 729) 4853 530.2 4.4e-150
CCDS34099.1 SH3RF1 gene_id:57630|Hs108|chr4   ( 888)  733  92.3 3.4e-18

>>CCDS4280.1 SH3RF2 gene_id:153769|Hs108|chr5 (729 aa)
 initn: 4853 init1: 4853 opt: 4853 Z-score: 2808.0 bits: 530.2 E(32554): 4.4e-150
Smith-Waterman score: 4853; 99.6% identity (99.7% similar) in 729 aa overlap (1-729:1-729)

               10        20        30        40        50        60
pF1KE4 MDDLTLLDLLECPVCFEKLDVTAKVLPCQHTFCKPCLQRVFKAHKELRCPECRTPVFSNI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MDDLTLLDLLECPVCFEKLDVTAKVLPCQHTFCKPCLQRVFKAHKELRCPECRTPVFSNI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 EALPANLLLVRLLDGVRSGQSSGRGGSFRRPGTMTLQDGRKSRTNPRRLQASPFRLVPNV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 EALPANLLLVRLLDGVRSGQSSGRGGSFRRPGTMTLQDGRKSRTNPRRLQASPFRLVPNV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 RIHMDGVPRAKALCNYRGQNPGDLRFNKGDIILLRRQLDENWYQGEINGISGNFPASSVE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 RIHMDGVPRAKALCNYRGQNPGDLRFNKGDIILLRRQLDENWYQGEINGISGNFPASSVE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 VIKQLPQPPPLCRALYNFDLRGKDKSENQDCLTFLKDDIITVISRVDENWAEGKLGDKVG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 VIKQLPQPPPLCRALYNFDLRGKDKSENQDCLTFLKDDIITVISRVDENWAEGKLGDKVG
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 IFPILFVEPNLTARHLLEKNKGRQSSCTKNLSLVSSSSRGNTSTLRRGPGSRRKVPGQFS
       :::::::::::::::::::::::::: :::::::::::::::::::::::::::::::::
CCDS42 IFPILFVEPNLTARHLLEKNKGRQSSRTKNLSLVSSSSRGNTSTLRRGPGSRRKVPGQFS
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE4 ITTALNTLNRMVHSPSGRHMVEISTPVLISSSNPSVITQPMEKADVPSSCVGQVSTYHPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 ITTALNTLNRMVHSPSGRHMVEISTPVLISSSNPSVITQPMEKADVPSSCVGQVSTYHPA
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE4 PVSPGHSTAVVSLPGSQQHLSANMFVALHSYSAHGPDELDLQKGEGVRVLGKCQDGWLRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PVSPGHSTAVVSLPGSQQHLSANMFVALHSYSAHGPDELDLQKGEGVRVLGKCQDGWLRG
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE4 VSLVTGRVGIFPNNYVIPIFRKTSSFPDSRSPGLYTTWTLSTSSVSSQGSISEGDPRQSR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 VSLVTGRVGIFPNNYVIPIFRKTSSFPDSRSPGLYTTWTLSTSSVSSQGSISEGDPRQSR
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KE4 PFKSVFVPTAIVNPVRSTAGPGTLGQGSLRKGRSSMRKNGSLQRPLQSGIPTLVVGSLRR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PFKSVFVPTAIVNPVRSTAGPGTLGQGSLRKGRSSMRKNGSLQRPLQSGIPTLVVGSLRR
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KE4 SPTMVLRPQQFQFYQPQGIPSSPSAVVVEMGSKPALTGEPALTCISRGSEARIHSAASSL
       ::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::::
CCDS42 SPTMVLRPQQFQFYQPQGIPSSPSAVVVEMGSKPALTGEPALTCISRGSEAWIHSAASSL
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KE4 IMEDKEIPIKSEPLPKPPASAPPSILVKPENSRNGIEKQVKTVRFQNYSPPPTKHYTSHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 IMEDKEIPIKSEPLPKPPASAPPSILVKPENSRNGIEKQVKTVRFQNYSPPPTKHYTSHP
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KE4 TSGKPEQPATLKASQPEAASLGPEMTVLFAHRSGCHSGQQTDLRRKSALAKATTLVSTAS
       :::::::::::::::::::::::::::::::::::::::::::::::::.::::::::::
CCDS42 TSGKPEQPATLKASQPEAASLGPEMTVLFAHRSGCHSGQQTDLRRKSALGKATTLVSTAS
              670       680       690       700       710       720

pF1KE4 GTQTVFPSK
::::::::: CCDS42 GTQTVFPSK >>CCDS34099.1 SH3RF1 gene_id:57630|Hs108|chr4 (888 aa) initn: 1069 init1: 386 opt: 733 Z-score: 440.0 bits: 92.3 E(32554): 3.4e-18 Smith-Waterman score: 1318; 34.1% identity (61.0% similar) in 775 aa overlap (1-685:1-761) 10 20 30 40 50 60 pF1KE4 MDDLTLLDLLECPVCFEKLDVTAKVLPCQHTFCKPCLQRVFKAHKELRCPECRTPVFSNI ::. .::::::::::.:.::..:::::::::::: :: . ...::::::::: : :.. CCDS34 MDESALLDLLECPVCLERLDASAKVLPCQHTFCKRCLLGIVGSRNELRCPECRTLVGSGV 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 EALPANLLLVRLLDGVRS-GQSSGRGGSFRRPGTMTLQDGRKSRTN--PRRLQASPFRLV : ::.:.::::::::... . : ::. : .:.. .. .: . ::.: CCDS34 EELPSNILLVRLLDGIKQRPWKPGPGGGSGTNCTNALRSQSSTVANCSSKDLQSSQGGQQ 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 PNVRI---HMDGVPR---AKALCNYRGQNPGDLRFNKGDIILLRRQLDENWYQGEINGIS : :. . :.:. :::: ::.:..::::.:.:::::.::::.:::::.::.::: CCDS34 PRVQSWSPPVRGIPQLPCAKALYNYEGKEPGDLKFSKGDIIILRRQVDENWYHGEVNGIH 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 GNFPASSVEVIKQLPQPPPLCRALYNFDLRGKDKSENQDCLTFLKDDIITVISRVDENWA : ::.. :..:: :::::: :.:::.:.. ::: ..::: : :::..::: ::::::: CCDS34 GFFPTNFVQIIKPLPQPPPQCKALYDFEV--KDKEADKDCLPFAKDDVLTVIRRVDENWA 190 200 210 220 230 240 250 260 270 280 290 pF1KE4 EGKLGDKVGIFPILFVEPNLTARHLLEKNKGRQSSCTKNLSLVSSSSRGNTSTLRRGPGS :: :.::.::::: .:: : .:..:.: .: . . ::. ...:: . . CCDS34 EGMLADKIGIFPISYVEFNSAAKQLIEWDKPPVPGVDAGE---CSSAAAQSSTAPKHSDT 240 250 260 270 280 290 300 310 320 330 pF1KE4 RRKVPGQFSITTALNTLNRMVHSPSGRHMVEISTPVLISSSNPS---------------- .... . :.:. :. :. .. ..:: .::: :::::::::. CCDS34 KKNTKKRHSFTS-LTMANKSSQASQNRHSMEISPPVLISSSNPTAAARISELSGLSCSAP 300 310 320 330 340 350 340 350 360 pF1KE4 ----------VITQPMEK-----------ADVP-SSCVGQVSTYHPAP----------VS ..: : . .::: .. .: .. : : . CCDS34 SQVHISTTGLIVTPPPSSPVTTGPSFTFPSDVPYQAALGTLNPPLPPPPLLAATVLASTP 360 370 380 390 400 410 370 380 390 400 pF1KE4 PGHSTAVVS-------LPGSQQHLS-------ANMFVALHSYSAHGPDELDLQKGEGVRV :: ..:... . :: .... ...::.. :. . 
:::.:.::: : CCDS34 PGATAAAAAAGMGPRPMAGSTDQIAHLRPQTRPSVYVAIYPYTPRKEDELELRKGEMFLV 420 430 440 450 460 470 410 420 430 440 450 460 pF1KE4 LGKCQDGWLRGVSLVTGRVGIFPNNYVIPIFRKTSSFPDSRSPGLYTTWTLSTSSVSSQG . .:::::..:.:. :...:.::.::: :. : ... ... : ..: .. .:. . CCDS34 FERCQDGWFKGTSMHTSKIGVFPGNYVAPVTRAVTNASQAKVP--MSTAGQTSRGVTMVS 480 490 500 510 520 530 470 480 490 500 510 pF1KE4 SISEGDPRQSRPFKSV-----FVPTAIVNPVRSTAGPGT------LGQGSLRKGRSSMRK . : : :. ..: ::.:.:. .. ..: . :: .. ..:...: CCDS34 PSTAGGPAQKLQGNGVAGSPSVVPAAVVSAAHIQTSPQAKVLLHMTGQMTVNQARNAVRT 540 550 560 570 580 590 520 530 540 550 560 570 pF1KE4 NGS--LQRPLQSGIPTLVVGSLRRSPTMVLRPQQFQFYQPQGIPSSPSAVVVEMGSKPAL .. .:: . : : .. ::. : .. .. .:: : :.... . . . CCDS34 VAAHNQERPTAAVTPIQVQNAAGLSPASVGLSHH-SLASPQPAPLMPGSATHTAAISISR 600 610 620 630 640 650 580 590 600 610 620 630 pF1KE4 TGEPALTCISRGSEARIHSAASSLIMEDKEIPIKSEP-LPKPPASAPP----SILVKPEN .. : :.: . . . ...:: : . . : :: : :: : .::. CCDS34 ASAP-LACAAAAPLTSPSITSASLEAEPSGRIVTVLPGLPTSPDSASSACGNSSATKPD- 660 670 680 690 700 640 650 660 670 680 690 pF1KE4 SRNGIEKQVKTVRFQNYSPPPTKHYTSHPTSGKPEQPATLKASQ-PEAASLGPEMTVLFA ... ... ... . . : .: :.: : . : ... : ...:::. CCDS34 -KDSKKEKKGLLKLLSGASTKRKPRVSPPAS--PTLEVELGSAELPLQGAVGPELPPGGG 710 720 730 740 750 760 700 710 720 pF1KE4 HRSGCHSGQQTDLRRKSALAKATTLVSTASGTQTVFPSK CCDS34 HGRAGSCPVDGDGPVTTAVAGAALAQDAFHRKASSLDSAVPIAPPPRQACSSLGPVLNES 770 780 790 800 810 820 729 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 13:49:47 2016 done: Sat Nov 5 13:49:48 2016 Total Scan time: 3.990 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]