FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE9665, 672 aa 1>>>pF1KE9665 672 - 672 aa - 672 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.9292+/-0.00106; mu= 15.7012+/- 0.064 mean_var=67.1243+/-13.177, 0's: 0 Z-trim(103.0): 32 B-trim: 0 in 0/50 Lambda= 0.156543 statistics sampled from 7193 (7202) to 7193 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.584), E-opt: 0.2 (0.221), width: 16 Scan time: 3.110 The best scores are: opt bits E(32554) CCDS10720.1 SHCBP1 gene_id:79801|Hs108|chr16 ( 672) 4469 1018.8 0 CCDS30955.1 SHCBP1L gene_id:81626|Hs108|chr1 ( 653) 506 123.8 7.7e-28 >>CCDS10720.1 SHCBP1 gene_id:79801|Hs108|chr16 (672 aa) initn: 4469 init1: 4469 opt: 4469 Z-score: 5450.2 bits: 1018.8 E(32554): 0 Smith-Waterman score: 4469; 99.7% identity (99.9% similar) in 672 aa overlap (1-672:1-672) 10 20 30 40 50 60 pF1KE9 MADGSLTGGGLEAAAMAPERTGWAVEQELASLEKGLFQDEDSCSDCSYRDKPGSSLQSFM :::::::::::::::::::: ::::::::::::::::::::::::::::::::::::::: CCDS10 MADGSLTGGGLEAAAMAPERMGWAVEQELASLEKGLFQDEDSCSDCSYRDKPGSSLQSFM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE9 PEGKTFFPEIFQTNQLLFYERFRAYQDYILADCKASEVQEFTAEFLEKVLEPSGWRAVWH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 PEGKTFFPEIFQTNQLLFYERFRAYQDYILADCKASEVQEFTAEFLEKVLEPSGWRAVWH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE9 TNVFKVLVEITDVDFAALKAVVRLAEPYLCDSQVSTFTMECMKELLDLKEHRLPLQELWV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 TNVFKVLVEITDVDFAALKAVVRLAEPYLCDSQVSTFTMECMKELLDLKEHRLPLQELWV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE9 VFDDSGVFDQTALAIEHVRFFYQNIWRSWDEEEEDEYDYFVRCVEPRLRLHYDILEDRVP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VFDDSGVFDQTALAIEHVRFFYQNIWRSWDEEEEDEYDYFVRCVEPRLRLHYDILEDRVP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE9 SGLIVDYHNLLSQCEESYRKFLNLRSSLSNCNSDSEQENISMVEGLKLYSEMEQLKQKLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 SGLIVDYHNLLSQCEESYRKFLNLRSSLSNCNSDSEQENISMVEGLKLYSEMEQLKQKLK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE9 LIENPLLRYVFGYQKNSNIQAKGVRSSGQKITHVVSSTMMAGLLRSLLTDRLCQEPGEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LIENPLLRYVFGYQKNSNIQAKGVRSSGQKITHVVSSTMMAGLLRSLLTDRLCQEPGEEE 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE9 REIQFHSDPLSAINACFEGDTVIVCPGHYVVHGTFSIADSIELEGYGLPDDIVIEKRGKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 REIQFHSDPLSAINACFEGDTVIVCPGHYVVHGTFSIADSIELEGYGLPDDIVIEKRGKG 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE9 DTFVDCTGADIKISGIKFVQHDAVEGILIVHRGKTTLENCVLQCETTGVTVRTSAEFLMK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 DTFVDCTGADIKISGIKFVQHDAVEGILIVHRGKTTLENCVLQCETTGVTVRTSAEFLMK 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE9 NSDLYGAKGAGIEIYPGSQCTLSDNGIHHCKEGILIKDFLDEHYDIPKISMVNNIIHNNE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 NSDLYGAKGAGIEIYPGSQCTLSDNGIHHCKEGILIKDFLDEHYDIPKISMVNNIIHNNE 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE9 GYGVVLVKPTIFSDLQESAEDGTEENKALKIQTSGEPDVAERVDLEELIECATGKMELCA :::::::::::::::::.:::::::::::::::::::::::::::::::::::::::::: CCDS10 GYGVVLVKPTIFSDLQENAEDGTEENKALKIQTSGEPDVAERVDLEELIECATGKMELCA 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE9 RTDPSEQVEGNCEIVNELIAASTQKGQIKKKRLSELGITQADDNLMSQEMFVGIVGNQFK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 RTDPSEQVEGNCEIVNELIAASTQKGQIKKKRLSELGITQADDNLMSQEMFVGIVGNQFK 610 620 630 640 650 660 670 pF1KE9 WNGKGSFGTFLF :::::::::::: CCDS10 WNGKGSFGTFLF 670 >>CCDS30955.1 SHCBP1L gene_id:81626|Hs108|chr1 (653 aa) initn: 732 init1: 398 opt: 506 Z-score: 613.3 bits: 123.8 E(32554): 7.7e-28 Smith-Waterman score: 801; 32.3% identity (61.1% similar) in 496 aa overlap (80-549:121-597) 50 60 70 80 90 100 pF1KE9 DKPGSSLQSFMPEGKTFFPEIFQTNQLLFYERFRAYQDYILADCKASEVQEFTAEFL-EK :. : : .: :::: ...: ...: :: CCDS30 EPLLPVPEDEEEAQPLPPVCVSRMRGMWRDEKVSLYCDEVLQDCKAEDADEVMGKYLSEK 100 110 120 130 140 150 110 120 130 140 150 pF1KE9 VLEPSGWRAVWHTN--VF------------KVLVEITDVDF----AALKAVVRLAEPYLC . . : .::.:: :: .:::.: . . .:..: .:::. CCDS30 LKLKDKWLGVWKTNPSVFFVKYEEASIPFVGILVEVTCEPYQDSSSRFKVTVSVAEPF-- 160 170 180 190 200 160 170 180 190 200 210 pF1KE9 DSQVSTFTMECMKELLDLKEHRLPLQELWVVFDDSGVFDQTALAIEHVRFFYQNIWRSWD .:..... . . :.:. :: .:: :.. : .. . :::.: :::::. .::.:: CCDS30 SSNIANIPRDLVDEILEELEHSVPLLEVYPVEGQDTDIHVIALALEVVRFFYDFLWRDWD 210 220 230 240 250 260 220 230 240 250 260 270 pF1KE9 EEEEDEYDYFVRCVEPRLRLHYDILEDRVPSGLIVDYHNLLSQCEESYRKFLNLRSSLSN .:: : .. .: :. : :: . .:. . ... : . ... .... .:.... CCDS30 DEESCEN--YTALIEERINLWCDIQDGTIPGPIAQRFKKTLEKYKNKRVELIEYQSNIKE 270 280 290 300 310 320 280 290 300 310 320 330 pF1KE9 CNSDSEQENISMVEGLKLYSEMEQLKQKLKLIENPLLRYVFGYQKNSNIQAKGVRSSGQK : .: :: : : :. .: ::. :. :: . . :: : :. CCDS30 DPSAAEA-----VECWKKYYEIVMLCGLLKMWEDLRLRVHGPFFPRILRRRKGKREFGKT 330 340 350 360 370 380 340 350 360 370 380 390 pF1KE9 ITHVVSSTMMAGLLRSLLTDRLCQEPGEEEREIQFHSDPLSAINACFEGDTVIVCPGHYV :::.:.. : . ....: .: : :. :.: :.. :. :::::. ::.: CCDS30 ITHIVAKMMTTEMIKDLSSDTLLQQ----------HGDLDLALDNCYSGDTVIIFPGEYQ 390 400 410 420 430 400 410 420 430 440 450 pF1KE9 VHGTFSIADSIELEGYGLPDDIVIEKRGKGDTFVDCTGADIKISGIKFVQHDAVEGILIV . . ..:.: ..: : ..:.: .. . :.:: . ..:. ....:. .:.::..: CCDS30 AANLALLTDDIIIKGVGKREEIMITSEPSRDSFVVSKADNVKLMHLSLIQQGTVDGIVVV 440 450 460 470 480 490 460 470 480 490 500 510 pF1KE9 HRGKTTLENCVLQCETTGVTVRTSAEFLMKNSDLYGAKGAGIEIYPGSQCTLSDNGIHHC . :. :::::.:.:: ::: : :.: . . .:.. ::.:::.:.:::: : : :::: CCDS30 ESGHMTLENCILKCEGTGVCVLTGAALTITDSEITGAQGAGVELYPGSIAILERNEIHHC 500 510 520 530 540 550 520 530 540 550 560 pF1KE9 ---KEGILIKDFLD----EHYDIPKISMVNNIIHNNEGYGVVLVKPTIFSDLQESAEDGT . . :. : . ::..:.:: :..:.:::: ...: CCDS30 NNLRTSNSSKSTLGGVNMKVLPAPKLKMTNNHIYSNKGYGVSILQPMEQFFIVAEEALNK 560 570 580 590 600 610 570 580 590 600 610 620 pF1KE9 EENKALKIQTSGEPDVAERVDLEELIECATGKMELCARTDPSEQVEGNCEIVNELIAAST CCDS30 RASSGDKKDDKMLFKVMQNLNLEMNNNKIEANVKGDIRIVTS 620 630 640 650 672 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 00:12:49 2016 done: Mon Nov 7 00:12:49 2016 Total Scan time: 3.110 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]