FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4107, 478 aa
  1>>>pF1KE4107 478 - 478 aa - 478 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.7713+/-0.00084; mu= 15.8892+/- 0.051
 mean_var=77.2784+/-15.328, 0's: 0 Z-trim(108.2): 47 B-trim: 0 in 0/49
 Lambda= 0.145897
 statistics sampled from 9993 (10040) to 9993 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.672), E-opt: 0.2 (0.308), width: 16
 Scan time: 3.460

The best scores are:                                  opt  bits E(32554)
CCDS32719.1 BTBD17 gene_id:388419|Hs108|chr17 ( 478) 3203 683.7 1.2e-196
CCDS11759.1 LGALS3BP gene_id:3959|Hs108|chr17 ( 585)  429  99.8    8e-21

>>CCDS32719.1 BTBD17 gene_id:388419|Hs108|chr17 (478 aa)
 initn: 3203 init1: 3203 opt: 3203 Z-score: 3644.2 bits: 683.7 E(32554): 1.2e-196
Smith-Waterman score: 3203; 100.0% identity (100.0% similar) in 478 aa overlap (1-478:1-478)

               10        20        30        40        50        60
pF1KE4 MPRRGYSKPGSWGSFWAMLTLVGLVTHAAQRADVGGEAAGTSINHSQAVLQRLQELLRQG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MPRRGYSKPGSWGSFWAMLTLVGLVTHAAQRADVGGEAAGTSINHSQAVLQRLQELLRQG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 NASDVVLRVQAAGTDEVRVFHAHRLLLGLHSELFLELLSNQSEAVLQEPQDCAAVFDKFI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 NASDVVLRVQAAGTDEVRVFHAHRLLLGLHSELFLELLSNQSEAVLQEPQDCAAVFDKFI
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 RYLYCGELTVLLTQAIPLHRLATKYGVSSLQRGVADYMRAHLAGGAGPAVGWYHYAVGTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 RYLYCGELTVLLTQAIPLHRLATKYGVSSLQRGVADYMRAHLAGGAGPAVGWYHYAVGTG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 DEALRESCLQFLAWNLSAVAASTEWGAVSPELLWQLLQRSDLVLQDELELFHALEAWLGR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 DEALRESCLQFLAWNLSAVAASTEWGAVSPELLWQLLQRSDLVLQDELELFHALEAWLGR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 ARPPPAVAERALRAIRYPMIPPAQLFQLQARSAALARHGPAVADLLLQAYQFHAASPLHY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 ARPPPAVAERALRAIRYPMIPPAQLFQLQARSAALARHGPAVADLLLQAYQFHAASPLHY
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE4 AKFFDVNGSAFLPRNYLAPAWGAPWVINNPARDDRSTSFQTQLGPSGHDAGRRVTWNVLF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 AKFFDVNGSAFLPRNYLAPAWGAPWVINNPARDDRSTSFQTQLGPSGHDAGRRVTWNVLF
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE4 SPRWLPVSLRPVYADAAGTALPAARPEDGRPRLVVTPASSGGDAAGVSFQKTVLVGARQQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 SPRWLPVSLRPVYADAAGTALPAARPEDGRPRLVVTPASSGGDAAGVSFQKTVLVGARQQ
              370       380       390       400       410       420

              430       440       450       460       470
pF1KE4 GRLLVRHAYSFHQSSEEAGDFLAHADLQRRNSEYLVENALHLHLIVKPVYHTLIRTPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 GRLLVRHAYSFHQSSEEAGDFLAHADLQRRNSEYLVENALHLHLIVKPVYHTLIRTPK
              430       440       450       460       470

>>CCDS11759.1 LGALS3BP gene_id:3959|Hs108|chr17 (585 aa)
 initn: 421 init1: 292 opt: 429 Z-score: 487.3 bits: 99.8 E(32554): 8e-21
Smith-Waterman score: 429; 29.3% identity (58.3% similar) in 314 aa overlap (42-345:132-438)

              20        30        40        50        60        70
pF1KE4 WGSFWAMLTLVGLVTHAAQRADVGGEAAGTSINHSQAVLQRLQELLRQGNASDVVLRVQA
                                     ... :. . . : ... .  . :. . :..
CCDS11 DCKSLGWLKSNCRHERDAGVVCTNETRSTHTLDLSRELSEALGQIFDSQRGCDLSISVNV
             110       120       130       140       150       160

              80        90          100       110       120
pF1KE4 AGTDEVRVFHAHRLLLGLHSE---LFLELLSNQSEAVLQEPQDCAAVFDKFIRYLYCGEL
        : : .  : .: ..:  . :   :. :  :: . .:  :   :. .   ..::.:  ..
CCDS11 QGEDALG-FCGHTVILTANLEAQALWKEPGSNVTMSVDAE---CVPMVRDLLRYFYSRRI
              170       180       190       200          210

      130       140       150       160          170       180
pF1KE4 TVLLTQAIPLHRLATKYGVSSLQRGVADYMRAHLAGGAG---PAVGWYHYAVGTGDEALR
        . :...  .:.::. ::. .::   :. .   :    .   : .  : :::.:::  :.
CCDS11 DITLSSVKCFHKLASAYGARQLQGYCASLFAILLPQDPSFQMP-LDLYAYAVATGDALLE
       220       230       240       250       260        270

         190       200       210       220       230       240
pF1KE4 ESCLQFLAWNLSAVAASTEWGAVSPELLWQLLQRSDLVLQDELELFHALEAWLGRARPPP
       . ::::::::. :.. .  : .:  .::  :: ::::.. .:: :..:...:    :
CCDS11 KLCLQFLAWNFEALTQAEAWPSVPTDLLQLLLPRSDLAVPSELALLKAVDTWSWGERASH
        280       290       300       310       320       330

         250       260       270       280       290       300
pF1KE4 AVAERALRAIRYPMIPPAQLFQLQARSAALARHGPAVADLLLQAYQFHAASPLHYAKFFD
         .:  .. ::.::. : .::.::   .    :        ::: .::..     :..
CCDS11 EEVEGLVEKIRFPMMLPEELFELQFNLSLYWSHEALFQKKTLQALEFHTVPFQLLARYKG
        340       350       360       370       380       390

           310       320       330         340       350       360
pF1KE4 VNGS--AFLPRNYLAPAWGAPWVINNPARDDRSTS--FQTQLGPSGHDAGRRVTWNVLFS
       .: .  .. :: : .:.:.:   ... . . :...  .:.. ::
CCDS11 LNLTEDTYKPRIYTSPTWSA--FVTDSSWSARKSQLVYQSRRGPLVKYSSDYFQAPSDYR
        400       410         420       430       440       450

             370       380       390       400       410       420
pF1KE4 PRWLPVSLRPVYADAAGTALPAARPEDGRPRLVVTPASSGGDAAGVSFQKTVLVGARQQG

CCDS11 YYPYQSFQTPQHPSFLFQDKRVSWSLVYLPTIQSCWNYGFSCSSDELPVLGLTKSGGSDR
          460       470       480       490       500       510


478 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 01:42:49 2016 done: Sun Nov 6 01:42:49 2016
 Total Scan time: 3.460 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]