FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6239, 285 aa
  1>>>pF1KE6239 285 - 285 aa - 285 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.8878+/-0.000854; mu= 12.4581+/- 0.051
 mean_var=73.1052+/-14.539, 0's: 0 Z-trim(106.9): 17 B-trim: 142 in 1/50
 Lambda= 0.150003
 statistics sampled from 9245 (9259) to 9245 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.67), E-opt: 0.2 (0.284), width: 16
 Scan time: 1.460

The best scores are:                                   opt bits E(32554)
CCDS6102.1 STAR gene_id:6770|Hs108|chr8        ( 285) 1889 417.9 4.3e-117
CCDS54118.1 STARD3 gene_id:10948|Hs108|chr17   ( 427)  492 115.6 6.3e-26
CCDS54117.1 STARD3 gene_id:10948|Hs108|chr17   ( 445)  492 115.6 6.5e-26
CCDS11341.1 STARD3 gene_id:10948|Hs108|chr17   ( 445)  492 115.6 6.5e-26
CCDS11955.1 STARD6 gene_id:147323|Hs108|chr18  ( 220)  275  68.5 4.8e-12
CCDS10318.1 STARD5 gene_id:80765|Hs108|chr15   ( 213)  259  65.1 5.1e-11

>>CCDS6102.1 STAR gene_id:6770|Hs108|chr8 (285 aa)
 initn: 1889 init1: 1889 opt: 1889 Z-score: 2215.6 bits: 417.9 E(32554): 4.3e-117
Smith-Waterman score: 1889; 100.0% identity (100.0% similar) in 285 aa overlap (1-285:1-285)

               10        20        30        40        50        60
pF1KE6 MLLATFKLCAGSSYRHMRNMKGLRQQAVMAISQELNRRALGGPTPSTWINQVRRRSSLLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 MLLATFKLCAGSSYRHMRNMKGLRQQAVMAISQELNRRALGGPTPSTWINQVRRRSSLLG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 SRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWKKESQQDNGDKVMSKVVPDVGKVF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 SRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWKKESQQDNGDKVMSKVVPDVGKVF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 RLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKIGKDTFITHELAAEAAGNLVG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 
       RLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKIGKDTFITHELAAEAAGNLVG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 PRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIRAEHGPTCMVLHPLAGSPSKTKLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 PRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIRAEHGPTCMVLHPLAGSPSKTKLT
              190       200       210       220       230       240

              250       260       270       280
pF1KE6 WLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLESHPASEARC
       :::::::::::::::::::::::::::::::::::::::::::::
CCDS61 WLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLESHPASEARC
              250       260       270       280

>>CCDS54118.1 STARD3 gene_id:10948|Hs108|chr17 (427 aa)
 initn: 514 init1: 485 opt: 492 Z-score: 579.0 bits: 115.6 E(32554): 6.3e-26
Smith-Waterman score: 492; 37.5% identity (70.2% similar) in 208 aa overlap (68-275:213-420)

40 50 60 70 80 90 pF1KE6 RALGGPTPSTWINQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWK .: :: :..::.:: . ::...:.:: CCDS54 ALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREYIRQGKEATAVVDQILAQEENWK 190 200 210 220 230 240 100 110 120 130 140 150 pF1KE6 KESQQDNGDKVMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVL :.... :: :.. :: ::.: :.. . : : .:.:.. . : : :: .: ..: CCDS54 FEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQEVILQPERMVLWNKTVTACQIL 250 260 270 280 290 300 160 170 180 190 200 210 pF1KE6 QKIGKDTFITHELAAEAAGNLVGPRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIR :.. .:.:.....: :::..:.:::::.:: .:: . . .:.::. . : . .: CCDS54 QRVEDNTLISYDVSAGAAGGVVSPRDFVNVRRIERRRDRYLSSGIATSHSAKPPTHKYVR 310 320 330 340 350 360 220 230 240 250 260 270 pF1KE6 AEHGPTCMVLHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLES .:.:: ... :..: ..:.:. :::: ::. .:.: :. :. .:: :::.:. 
CCDS54 GENGPGGFIVLKSASNPRVCTFVWILNTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISE 370 380 390 400 410 420 280 pF1KE6 HPASEARC CCDS54 LGARA

>>CCDS54117.1 STARD3 gene_id:10948|Hs108|chr17 (445 aa)
 initn: 485 init1: 485 opt: 492 Z-score: 578.7 bits: 115.6 E(32554): 6.5e-26
Smith-Waterman score: 492; 37.5% identity (70.2% similar) in 208 aa overlap (68-275:231-438)

40 50 60 70 80 90 pF1KE6 RALGGPTPSTWINQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWK .: :: :..::.:: . ::...:.:: CCDS54 ALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREYIRQGKEATAVVDQILAQEENWK 210 220 230 240 250 260 100 110 120 130 140 150 pF1KE6 KESQQDNGDKVMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVL :.... :: :.. :: ::.: :.. . : : .:.:.. . : : :: .: ..: CCDS54 FEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQEVILQPERMVLWNKTVTACQIL 270 280 290 300 310 320 160 170 180 190 200 210 pF1KE6 QKIGKDTFITHELAAEAAGNLVGPRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIR :.. .:.:.....: :::..:.:::::.:: .:: . . .:.::. . : . .: CCDS54 QRVEDNTLISYDVSAGAAGGVVSPRDFVNVRRIERRRDRYLSSGIATSHSAKPPTHKYVR 330 340 350 360 370 380 220 230 240 250 260 270 pF1KE6 AEHGPTCMVLHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLES .:.:: ... :..: ..:.:. :::: ::. .:.: :. :. .:: :::.:. CCDS54 GENGPGGFIVLKSASNPRVCTFVWILNTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISE 390 400 410 420 430 440 280 pF1KE6 HPASEARC CCDS54 LGARA

>>CCDS11341.1 STARD3 gene_id:10948|Hs108|chr17 (445 aa)
 initn: 514 init1: 485 opt: 492 Z-score: 578.7 bits: 115.6 E(32554): 6.5e-26
Smith-Waterman score: 492; 37.5% identity (70.2% similar) in 208 aa overlap (68-275:231-438)

40 50 60 70 80 90 pF1KE6 RALGGPTPSTWINQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWK .: :: :..::.:: . ::...:.:: CCDS11 ALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREYIRQGKEATAVVDQILAQEENWK 210 220 230 240 250 260 100 110 120 130 140 150 pF1KE6 KESQQDNGDKVMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVL :.... :: :.. :: ::.: :.. . : : .:.:.. . 
: : :: .: ..: CCDS11 FEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQEVILQPERMVLWNKTVTACQIL 270 280 290 300 310 320 160 170 180 190 200 210 pF1KE6 QKIGKDTFITHELAAEAAGNLVGPRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIR :.. .:.:.....: :::..:.:::::.:: .:: . . .:.::. . : . .: CCDS11 QRVEDNTLISYDVSAGAAGGVVSPRDFVNVRRIERRRDRYLSSGIATSHSAKPPTHKYVR 330 340 350 360 370 380 220 230 240 250 260 270 pF1KE6 AEHGPTCMVLHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLES .:.:: ... :..: ..:.:. :::: ::. .:.: :. :. .:: :::.:. CCDS11 GENGPGGFIVLKSASNPRVCTFVWILNTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISE 390 400 410 420 430 440 280 pF1KE6 HPASEARC CCDS11 LGARA

>>CCDS11955.1 STARD6 gene_id:147323|Hs108|chr18 (220 aa)
 initn: 241 init1: 119 opt: 275 Z-score: 329.7 bits: 68.5 E(32554): 4.8e-12
Smith-Waterman score: 275; 23.8% identity (63.9% similar) in 202 aa overlap (80-278:8-206)

50 60 70 80 90 100 pF1KE6 NQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWK--KESQQDNGDK ... :..:: . ::: : :.. . .. CCDS11 MDFKAIAQQTAQEVLGYNRDTSGWKVVKTSKKITVSS 10 20 30 110 120 130 140 150 160 pF1KE6 VMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKIGKDTFIT :. :...:.: .. . .: . : . . . :. ... .....: .:::: CCDS11 KASRKFH--GNLYRVEGIIPESPAKLSDFLYQTGDRI-TWDKSLQVYNMVHRIDSDTFIC 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE6 HELAAEAAGNLVGPRDFVSVRCAKR-RGSTCVLAGMATDFGNMPEQKGVIRAEHGPTCMV : .. : . ..::::... :: .:. .... ..:: ..: ... ::. . : .: CCDS11 HTITQSFAVGSISPRDFIDLVYIKRYEGNMNIISSKSVDFPEYPPSSNYIRGYNHPCGFV 100 110 120 130 140 150 230 240 250 260 270 280 pF1KE6 LHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLESHPASEARC :. .:. .::. ... ...: : :::.... .. :.: . . 
...: CCDS11 CSPMEENPAYSKLVMFVQTEMRGKLSPSIIEKTMPSNLVNFILNAKDGIKAHRTPSRRGF 160 170 180 190 200 210 CCDS11 HHNSHS 220

>>CCDS10318.1 STARD5 gene_id:80765|Hs108|chr15 (213 aa)
 initn: 201 init1: 115 opt: 259 Z-score: 311.2 bits: 65.1 E(32554): 5.1e-11
Smith-Waterman score: 259; 25.8% identity (60.3% similar) in 209 aa overlap (70-273:2-206)

40 50 60 70 80 90 pF1KE6 LGGPTPSTWINQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWKKE : :: :..: . .: : . ::: CCDS10 MDPALAA-QMSEAVAEKMLQYRRDTAGWKI- 10 20 100 110 120 130 140 150 pF1KE6 SQQDNGDKVMSKVVPDV---GKVFRLEVVVDQPMERLYEELVERMEAMG-EWNPNVKEIK .. :: .: . :.: :...: : .: .:.... . . .. .:. :: .. CCDS10 CREGNGVSVSWR--PSVEFPGNLYRGEGIVYGTLEEVWDCVKPAVGGLRVKWDENVTGFE 30 40 50 60 70 80 160 170 180 190 200 210 pF1KE6 VLQKIGKDTFITHELAAEAAGNLVGPRDFVSVRCAKRRGSTCVLAGMA-TDFGNMPEQKG ..:.: ... . :: .:..:::::.. .:: . . .. . .. : . : CCDS10 IIQSITDTLCVSRTSTPSAAMKLISPRDFVDLVLVKRYEDGTISSNATHVEHPLCPPKPG 90 100 110 120 130 140 220 230 240 250 260 270 pF1KE6 VIRAEHGPTCMVLHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKR .:. . : .:: : :.::.:. .. ::.:.::...... . .... : .:.: CCDS10 FVRGFNHPCGCFCEPLPGEPTKTNLVTFFHTDLSGYLPQNVVDSFFPRSMTRFYANLQKA 150 160 170 180 190 200 280 pF1KE6 LESHPASEARC CCDS10 VKQFHE 210

285 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 11:21:51 2016 done: Tue Nov 8 11:21:51 2016
 Total Scan time: 1.460 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]