FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6239, 285 aa
1>>>pF1KE6239 285 - 285 aa - 285 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.8878+/-0.000854; mu= 12.4581+/- 0.051
mean_var=73.1052+/-14.539, 0's: 0 Z-trim(106.9): 17 B-trim: 142 in 1/50
Lambda= 0.150003
statistics sampled from 9245 (9259) to 9245 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.67), E-opt: 0.2 (0.284), width: 16
Scan time: 1.460
The best scores are: opt bits E(32554)
CCDS6102.1 STAR gene_id:6770|Hs108|chr8 ( 285) 1889 417.9 4.3e-117
CCDS54118.1 STARD3 gene_id:10948|Hs108|chr17 ( 427) 492 115.6 6.3e-26
CCDS54117.1 STARD3 gene_id:10948|Hs108|chr17 ( 445) 492 115.6 6.5e-26
CCDS11341.1 STARD3 gene_id:10948|Hs108|chr17 ( 445) 492 115.6 6.5e-26
CCDS11955.1 STARD6 gene_id:147323|Hs108|chr18 ( 220) 275 68.5 4.8e-12
CCDS10318.1 STARD5 gene_id:80765|Hs108|chr15 ( 213) 259 65.1 5.1e-11
>>CCDS6102.1 STAR gene_id:6770|Hs108|chr8 (285 aa)
initn: 1889 init1: 1889 opt: 1889 Z-score: 2215.6 bits: 417.9 E(32554): 4.3e-117
Smith-Waterman score: 1889; 100.0% identity (100.0% similar) in 285 aa overlap (1-285:1-285)
10 20 30 40 50 60
pF1KE6 MLLATFKLCAGSSYRHMRNMKGLRQQAVMAISQELNRRALGGPTPSTWINQVRRRSSLLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 MLLATFKLCAGSSYRHMRNMKGLRQQAVMAISQELNRRALGGPTPSTWINQVRRRSSLLG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 SRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWKKESQQDNGDKVMSKVVPDVGKVF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 SRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWKKESQQDNGDKVMSKVVPDVGKVF
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 RLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKIGKDTFITHELAAEAAGNLVG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 RLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKIGKDTFITHELAAEAAGNLVG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 PRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIRAEHGPTCMVLHPLAGSPSKTKLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 PRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIRAEHGPTCMVLHPLAGSPSKTKLT
190 200 210 220 230 240
250 260 270 280
pF1KE6 WLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLESHPASEARC
:::::::::::::::::::::::::::::::::::::::::::::
CCDS61 WLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLESHPASEARC
250 260 270 280
>>CCDS54118.1 STARD3 gene_id:10948|Hs108|chr17 (427 aa)
initn: 514 init1: 485 opt: 492 Z-score: 579.0 bits: 115.6 E(32554): 6.3e-26
Smith-Waterman score: 492; 37.5% identity (70.2% similar) in 208 aa overlap (68-275:213-420)
40 50 60 70 80 90
pF1KE6 RALGGPTPSTWINQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWK
.: :: :..::.:: . ::...:.::
CCDS54 ALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREYIRQGKEATAVVDQILAQEENWK
190 200 210 220 230 240
100 110 120 130 140 150
pF1KE6 KESQQDNGDKVMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVL
:.... :: :.. :: ::.: :.. . : : .:.:.. . : : :: .: ..:
CCDS54 FEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQEVILQPERMVLWNKTVTACQIL
250 260 270 280 290 300
160 170 180 190 200 210
pF1KE6 QKIGKDTFITHELAAEAAGNLVGPRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIR
:.. .:.:.....: :::..:.:::::.:: .:: . . .:.::. . : . .:
CCDS54 QRVEDNTLISYDVSAGAAGGVVSPRDFVNVRRIERRRDRYLSSGIATSHSAKPPTHKYVR
310 320 330 340 350 360
220 230 240 250 260 270
pF1KE6 AEHGPTCMVLHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLES
.:.:: ... :..: ..:.:. :::: ::. .:.: :. :. .:: :::.:.
CCDS54 GENGPGGFIVLKSASNPRVCTFVWILNTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISE
370 380 390 400 410 420
280
pF1KE6 HPASEARC
CCDS54 LGARA
>>CCDS54117.1 STARD3 gene_id:10948|Hs108|chr17 (445 aa)
initn: 485 init1: 485 opt: 492 Z-score: 578.7 bits: 115.6 E(32554): 6.5e-26
Smith-Waterman score: 492; 37.5% identity (70.2% similar) in 208 aa overlap (68-275:231-438)
40 50 60 70 80 90
pF1KE6 RALGGPTPSTWINQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWK
.: :: :..::.:: . ::...:.::
CCDS54 ALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREYIRQGKEATAVVDQILAQEENWK
210 220 230 240 250 260
100 110 120 130 140 150
pF1KE6 KESQQDNGDKVMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVL
:.... :: :.. :: ::.: :.. . : : .:.:.. . : : :: .: ..:
CCDS54 FEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQEVILQPERMVLWNKTVTACQIL
270 280 290 300 310 320
160 170 180 190 200 210
pF1KE6 QKIGKDTFITHELAAEAAGNLVGPRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIR
:.. .:.:.....: :::..:.:::::.:: .:: . . .:.::. . : . .:
CCDS54 QRVEDNTLISYDVSAGAAGGVVSPRDFVNVRRIERRRDRYLSSGIATSHSAKPPTHKYVR
330 340 350 360 370 380
220 230 240 250 260 270
pF1KE6 AEHGPTCMVLHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLES
.:.:: ... :..: ..:.:. :::: ::. .:.: :. :. .:: :::.:.
CCDS54 GENGPGGFIVLKSASNPRVCTFVWILNTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISE
390 400 410 420 430 440
280
pF1KE6 HPASEARC
CCDS54 LGARA
>>CCDS11341.1 STARD3 gene_id:10948|Hs108|chr17 (445 aa)
initn: 514 init1: 485 opt: 492 Z-score: 578.7 bits: 115.6 E(32554): 6.5e-26
Smith-Waterman score: 492; 37.5% identity (70.2% similar) in 208 aa overlap (68-275:231-438)
40 50 60 70 80 90
pF1KE6 RALGGPTPSTWINQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWK
.: :: :..::.:: . ::...:.::
CCDS11 ALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREYIRQGKEATAVVDQILAQEENWK
210 220 230 240 250 260
100 110 120 130 140 150
pF1KE6 KESQQDNGDKVMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVL
:.... :: :.. :: ::.: :.. . : : .:.:.. . : : :: .: ..:
CCDS11 FEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQEVILQPERMVLWNKTVTACQIL
270 280 290 300 310 320
160 170 180 190 200 210
pF1KE6 QKIGKDTFITHELAAEAAGNLVGPRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIR
:.. .:.:.....: :::..:.:::::.:: .:: . . .:.::. . : . .:
CCDS11 QRVEDNTLISYDVSAGAAGGVVSPRDFVNVRRIERRRDRYLSSGIATSHSAKPPTHKYVR
330 340 350 360 370 380
220 230 240 250 260 270
pF1KE6 AEHGPTCMVLHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLES
.:.:: ... :..: ..:.:. :::: ::. .:.: :. :. .:: :::.:.
CCDS11 GENGPGGFIVLKSASNPRVCTFVWILNTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISE
390 400 410 420 430 440
280
pF1KE6 HPASEARC
CCDS11 LGARA
>>CCDS11955.1 STARD6 gene_id:147323|Hs108|chr18 (220 aa)
initn: 241 init1: 119 opt: 275 Z-score: 329.7 bits: 68.5 E(32554): 4.8e-12
Smith-Waterman score: 275; 23.8% identity (63.9% similar) in 202 aa overlap (80-278:8-206)
50 60 70 80 90 100
pF1KE6 NQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWK--KESQQDNGDK
... :..:: . ::: : :.. . ..
CCDS11 MDFKAIAQQTAQEVLGYNRDTSGWKVVKTSKKITVSS
10 20 30
110 120 130 140 150 160
pF1KE6 VMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKIGKDTFIT
:. :...:.: .. . .: . : . . . :. ... .....: .::::
CCDS11 KASRKFH--GNLYRVEGIIPESPAKLSDFLYQTGDRI-TWDKSLQVYNMVHRIDSDTFIC
40 50 60 70 80 90
170 180 190 200 210 220
pF1KE6 HELAAEAAGNLVGPRDFVSVRCAKR-RGSTCVLAGMATDFGNMPEQKGVIRAEHGPTCMV
: .. : . ..::::... :: .:. .... ..:: ..: ... ::. . : .:
CCDS11 HTITQSFAVGSISPRDFIDLVYIKRYEGNMNIISSKSVDFPEYPPSSNYIRGYNHPCGFV
100 110 120 130 140 150
230 240 250 260 270 280
pF1KE6 LHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLESHPASEARC
:. .:. .::. ... ...: : :::.... .. :.: . . ...:
CCDS11 CSPMEENPAYSKLVMFVQTEMRGKLSPSIIEKTMPSNLVNFILNAKDGIKAHRTPSRRGF
160 170 180 190 200 210
CCDS11 HHNSHS
220
>>CCDS10318.1 STARD5 gene_id:80765|Hs108|chr15 (213 aa)
initn: 201 init1: 115 opt: 259 Z-score: 311.2 bits: 65.1 E(32554): 5.1e-11
Smith-Waterman score: 259; 25.8% identity (60.3% similar) in 209 aa overlap (70-273:2-206)
40 50 60 70 80 90
pF1KE6 LGGPTPSTWINQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWKKE
: :: :..: . .: : . :::
CCDS10 MDPALAA-QMSEAVAEKMLQYRRDTAGWKI-
10 20
100 110 120 130 140 150
pF1KE6 SQQDNGDKVMSKVVPDV---GKVFRLEVVVDQPMERLYEELVERMEAMG-EWNPNVKEIK
.. :: .: . :.: :...: : .: .:.... . . .. .:. :: ..
CCDS10 CREGNGVSVSWR--PSVEFPGNLYRGEGIVYGTLEEVWDCVKPAVGGLRVKWDENVTGFE
30 40 50 60 70 80
160 170 180 190 200 210
pF1KE6 VLQKIGKDTFITHELAAEAAGNLVGPRDFVSVRCAKRRGSTCVLAGMA-TDFGNMPEQKG
..:.: ... . :: .:..:::::.. .:: . . .. . .. : . :
CCDS10 IIQSITDTLCVSRTSTPSAAMKLISPRDFVDLVLVKRYEDGTISSNATHVEHPLCPPKPG
90 100 110 120 130 140
220 230 240 250 260 270
pF1KE6 VIRAEHGPTCMVLHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKR
.:. . : .:: : :.::.:. .. ::.:.::...... . .... : .:.:
CCDS10 FVRGFNHPCGCFCEPLPGEPTKTNLVTFFHTDLSGYLPQNVVDSFFPRSMTRFYANLQKA
150 160 170 180 190 200
280
pF1KE6 LESHPASEARC
CCDS10 VKQFHE
210
285 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 11:21:51 2016 done: Tue Nov 8 11:21:51 2016
Total Scan time: 1.460 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]