FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE9665, 672 aa
1>>>pF1KE9665 672 - 672 aa - 672 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9292+/-0.00106; mu= 15.7012+/- 0.064
mean_var=67.1243+/-13.177, 0's: 0 Z-trim(103.0): 32 B-trim: 0 in 0/50
Lambda= 0.156543
statistics sampled from 7193 (7202) to 7193 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.584), E-opt: 0.2 (0.221), width: 16
Scan time: 3.110
The best scores are:                                      opt   bits E(32554)
CCDS10720.1 SHCBP1 gene_id:79801|Hs108|chr16       ( 672) 4469 1018.8       0
CCDS30955.1 SHCBP1L gene_id:81626|Hs108|chr1       ( 653)  506  123.8 7.7e-28
>>CCDS10720.1 SHCBP1 gene_id:79801|Hs108|chr16 (672 aa)
initn: 4469 init1: 4469 opt: 4469 Z-score: 5450.2 bits: 1018.8 E(32554): 0
Smith-Waterman score: 4469; 99.7% identity (99.9% similar) in 672 aa overlap (1-672:1-672)
10 20 30 40 50 60
pF1KE9 MADGSLTGGGLEAAAMAPERTGWAVEQELASLEKGLFQDEDSCSDCSYRDKPGSSLQSFM
:::::::::::::::::::: :::::::::::::::::::::::::::::::::::::::
CCDS10 MADGSLTGGGLEAAAMAPERMGWAVEQELASLEKGLFQDEDSCSDCSYRDKPGSSLQSFM
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE9 PEGKTFFPEIFQTNQLLFYERFRAYQDYILADCKASEVQEFTAEFLEKVLEPSGWRAVWH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 PEGKTFFPEIFQTNQLLFYERFRAYQDYILADCKASEVQEFTAEFLEKVLEPSGWRAVWH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE9 TNVFKVLVEITDVDFAALKAVVRLAEPYLCDSQVSTFTMECMKELLDLKEHRLPLQELWV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 TNVFKVLVEITDVDFAALKAVVRLAEPYLCDSQVSTFTMECMKELLDLKEHRLPLQELWV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE9 VFDDSGVFDQTALAIEHVRFFYQNIWRSWDEEEEDEYDYFVRCVEPRLRLHYDILEDRVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 VFDDSGVFDQTALAIEHVRFFYQNIWRSWDEEEEDEYDYFVRCVEPRLRLHYDILEDRVP
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE9 SGLIVDYHNLLSQCEESYRKFLNLRSSLSNCNSDSEQENISMVEGLKLYSEMEQLKQKLK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 SGLIVDYHNLLSQCEESYRKFLNLRSSLSNCNSDSEQENISMVEGLKLYSEMEQLKQKLK
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE9 LIENPLLRYVFGYQKNSNIQAKGVRSSGQKITHVVSSTMMAGLLRSLLTDRLCQEPGEEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LIENPLLRYVFGYQKNSNIQAKGVRSSGQKITHVVSSTMMAGLLRSLLTDRLCQEPGEEE
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE9 REIQFHSDPLSAINACFEGDTVIVCPGHYVVHGTFSIADSIELEGYGLPDDIVIEKRGKG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 REIQFHSDPLSAINACFEGDTVIVCPGHYVVHGTFSIADSIELEGYGLPDDIVIEKRGKG
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE9 DTFVDCTGADIKISGIKFVQHDAVEGILIVHRGKTTLENCVLQCETTGVTVRTSAEFLMK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 DTFVDCTGADIKISGIKFVQHDAVEGILIVHRGKTTLENCVLQCETTGVTVRTSAEFLMK
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE9 NSDLYGAKGAGIEIYPGSQCTLSDNGIHHCKEGILIKDFLDEHYDIPKISMVNNIIHNNE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NSDLYGAKGAGIEIYPGSQCTLSDNGIHHCKEGILIKDFLDEHYDIPKISMVNNIIHNNE
490 500 510 520 530 540
550 560 570 580 590 600
pF1KE9 GYGVVLVKPTIFSDLQESAEDGTEENKALKIQTSGEPDVAERVDLEELIECATGKMELCA
:::::::::::::::::.::::::::::::::::::::::::::::::::::::::::::
CCDS10 GYGVVLVKPTIFSDLQENAEDGTEENKALKIQTSGEPDVAERVDLEELIECATGKMELCA
550 560 570 580 590 600
610 620 630 640 650 660
pF1KE9 RTDPSEQVEGNCEIVNELIAASTQKGQIKKKRLSELGITQADDNLMSQEMFVGIVGNQFK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 RTDPSEQVEGNCEIVNELIAASTQKGQIKKKRLSELGITQADDNLMSQEMFVGIVGNQFK
610 620 630 640 650 660
670
pF1KE9 WNGKGSFGTFLF
::::::::::::
CCDS10 WNGKGSFGTFLF
670
>>CCDS30955.1 SHCBP1L gene_id:81626|Hs108|chr1 (653 aa)
initn: 732 init1: 398 opt: 506 Z-score: 613.3 bits: 123.8 E(32554): 7.7e-28
Smith-Waterman score: 801; 32.3% identity (61.1% similar) in 496 aa overlap (80-549:121-597)
50 60 70 80 90 100
pF1KE9 DKPGSSLQSFMPEGKTFFPEIFQTNQLLFYERFRAYQDYILADCKASEVQEFTAEFL-EK
:. : : .: :::: ...: ...: ::
CCDS30 EPLLPVPEDEEEAQPLPPVCVSRMRGMWRDEKVSLYCDEVLQDCKAEDADEVMGKYLSEK
100 110 120 130 140 150
110 120 130 140 150
pF1KE9 VLEPSGWRAVWHTN--VF------------KVLVEITDVDF----AALKAVVRLAEPYLC
. . : .::.:: :: .:::.: . . .:..: .:::.
CCDS30 LKLKDKWLGVWKTNPSVFFVKYEEASIPFVGILVEVTCEPYQDSSSRFKVTVSVAEPF--
160 170 180 190 200
160 170 180 190 200 210
pF1KE9 DSQVSTFTMECMKELLDLKEHRLPLQELWVVFDDSGVFDQTALAIEHVRFFYQNIWRSWD
.:..... . . :.:. :: .:: :.. : .. . :::.: :::::. .::.::
CCDS30 SSNIANIPRDLVDEILEELEHSVPLLEVYPVEGQDTDIHVIALALEVVRFFYDFLWRDWD
210 220 230 240 250 260
220 230 240 250 260 270
pF1KE9 EEEEDEYDYFVRCVEPRLRLHYDILEDRVPSGLIVDYHNLLSQCEESYRKFLNLRSSLSN
.:: : .. .: :. : :: . .:. . ... : . ... .... .:....
CCDS30 DEESCEN--YTALIEERINLWCDIQDGTIPGPIAQRFKKTLEKYKNKRVELIEYQSNIKE
270 280 290 300 310 320
280 290 300 310 320 330
pF1KE9 CNSDSEQENISMVEGLKLYSEMEQLKQKLKLIENPLLRYVFGYQKNSNIQAKGVRSSGQK
: .: :: : : :. .: ::. :. :: . . :: : :.
CCDS30 DPSAAEA-----VECWKKYYEIVMLCGLLKMWEDLRLRVHGPFFPRILRRRKGKREFGKT
330 340 350 360 370 380
340 350 360 370 380 390
pF1KE9 ITHVVSSTMMAGLLRSLLTDRLCQEPGEEEREIQFHSDPLSAINACFEGDTVIVCPGHYV
:::.:.. : . ....: .: : :. :.: :.. :. :::::. ::.:
CCDS30 ITHIVAKMMTTEMIKDLSSDTLLQQ----------HGDLDLALDNCYSGDTVIIFPGEYQ
390 400 410 420 430
400 410 420 430 440 450
pF1KE9 VHGTFSIADSIELEGYGLPDDIVIEKRGKGDTFVDCTGADIKISGIKFVQHDAVEGILIV
. . ..:.: ..: : ..:.: .. . :.:: . ..:. ....:. .:.::..:
CCDS30 AANLALLTDDIIIKGVGKREEIMITSEPSRDSFVVSKADNVKLMHLSLIQQGTVDGIVVV
440 450 460 470 480 490
460 470 480 490 500 510
pF1KE9 HRGKTTLENCVLQCETTGVTVRTSAEFLMKNSDLYGAKGAGIEIYPGSQCTLSDNGIHHC
. :. :::::.:.:: ::: : :.: . . .:.. ::.:::.:.:::: : : ::::
CCDS30 ESGHMTLENCILKCEGTGVCVLTGAALTITDSEITGAQGAGVELYPGSIAILERNEIHHC
500 510 520 530 540 550
520 530 540 550 560
pF1KE9 ---KEGILIKDFLD----EHYDIPKISMVNNIIHNNEGYGVVLVKPTIFSDLQESAEDGT
. . :. : . ::..:.:: :..:.:::: ...:
CCDS30 NNLRTSNSSKSTLGGVNMKVLPAPKLKMTNNHIYSNKGYGVSILQPMEQFFIVAEEALNK
560 570 580 590 600 610
570 580 590 600 610 620
pF1KE9 EENKALKIQTSGEPDVAERVDLEELIECATGKMELCARTDPSEQVEGNCEIVNELIAAST
CCDS30 RASSGDKKDDKMLFKVMQNLNLEMNNNKIEANVKGDIRIVTS
620 630 640 650
672 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 00:12:49 2016 done: Mon Nov 7 00:12:49 2016
Total Scan time: 3.110 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]