FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4429, 445 aa 1>>>pF1KE4429 445 - 445 aa - 445 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3684+/-0.000778; mu= 16.8738+/- 0.047 mean_var=62.5789+/-12.571, 0's: 0 Z-trim(107.3): 18 B-trim: 2 in 1/49 Lambda= 0.162129 statistics sampled from 9472 (9488) to 9472 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.67), E-opt: 0.2 (0.291), width: 16 Scan time: 3.170 The best scores are: opt bits E(32554) CCDS11341.1 STARD3 gene_id:10948|Hs108|chr17 ( 445) 2904 687.9 5.5e-198 CCDS54117.1 STARD3 gene_id:10948|Hs108|chr17 ( 445) 2464 584.9 5.3e-167 CCDS54118.1 STARD3 gene_id:10948|Hs108|chr17 ( 427) 2001 476.6 2e-134 CCDS5455.1 STARD3NL gene_id:83930|Hs108|chr7 ( 234) 848 206.8 1.9e-53 CCDS6102.1 STAR gene_id:6770|Hs108|chr8 ( 285) 492 123.6 2.6e-28 CCDS11955.1 STARD6 gene_id:147323|Hs108|chr18 ( 220) 281 74.2 1.5e-13 CCDS10318.1 STARD5 gene_id:80765|Hs108|chr15 ( 213) 246 66.0 4.2e-11 >>CCDS11341.1 STARD3 gene_id:10948|Hs108|chr17 (445 aa) initn: 2904 init1: 2904 opt: 2904 Z-score: 3667.9 bits: 687.9 E(32554): 5.5e-198 Smith-Waterman score: 2904; 99.8% identity (100.0% similar) in 445 aa overlap (1-445:1-445) 10 20 30 40 50 60 pF1KE4 MSKLPRELTRDLERSLPAVASLGSSLSHSQSLSSHLLPPPEKRRAISDVRRTFCLFVTFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MSKLPRELTRDLERSLPAVASLGSSLSHSQSLSSHLLPPPEKRRAISDVRRTFCLFVTFD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LLFISLLWIIELNTNTGIRKNLEQEIIQYNFKTSFFDIFVLAFFRFSGLLLGYAVLQLRH ::::::::::::::::::::::::::::::::::::::::::::::::::::::::.::: CCDS11 LLFISLLWIIELNTNTGIRKNLEQEIIQYNFKTSFFDIFVLAFFRFSGLLLGYAVLRLRH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 WWVIAVTTLVSSAFLIVKVILSELLSKGAFGYLLPIVSFVLAWLETWFLDFKVLPQEAEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 WWVIAVTTLVSSAFLIVKVILSELLSKGAFGYLLPIVSFVLAWLETWFLDFKVLPQEAEE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 ERWYLAAQVAVARGPLLFSGALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREYIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 ERWYLAAQVAVARGPLLFSGALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREYIR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 QGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQEV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 ILQPERMVLWNKTVTACQILQRVEDNTLISYDVSAGAAGGVVSPRDFVNVRRIERRRDRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 ILQPERMVLWNKTVTACQILQRVEDNTLISYDVSAGAAGGVVSPRDFVNVRRIERRRDRY 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 LSSGIATSHSAKPPTHKYVRGENGPGGFIVLKSASNPRVCTFVWILNTDLKGRLPRYLIH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LSSGIATSHSAKPPTHKYVRGENGPGGFIVLKSASNPRVCTFVWILNTDLKGRLPRYLIH 370 380 390 400 410 420 430 440 pF1KE4 QSLAATMFEFAFHLRQRISELGARA ::::::::::::::::::::::::: CCDS11 QSLAATMFEFAFHLRQRISELGARA 430 440 >>CCDS54117.1 STARD3 gene_id:10948|Hs108|chr17 (445 aa) initn: 2398 init1: 2398 opt: 2464 Z-score: 3111.7 bits: 584.9 E(32554): 5.3e-167 Smith-Waterman score: 2464; 86.8% identity (89.2% similar) in 455 aa overlap (1-445:1-445) 10 20 30 40 50 60 pF1KE4 MSKLPRELTRDLERSLPAVASLGSSLSHSQSLSSHLLPPPEKRRAISDVRRTFCLFVTFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MSKLPRELTRDLERSLPAVASLGSSLSHSQSLSSHLLPPPEKRRAISDVRRTFCLFVTFD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LLFISLLWIIELNTNTGIRKNLEQEIIQYNFKTSFFDIFVLAFFRFSGLLLGYAVLQLRH ::::::::::::::::::::::::::::::::::::::::::::::::::::::::.::: CCDS54 LLFISLLWIIELNTNTGIRKNLEQEIIQYNFKTSFFDIFVLAFFRFSGLLLGYAVLRLRH 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 W---WVIAVTTLVSSAFLIVKVILSELLSKGAFGYLLPIVSFVLAWLETWF---LDFKVL : : . ..: :. : : .: :.: : : : : CCDS54 WSRRWCPVHSSLSRSSSL----------SCSAKGHLATCSPSSLLSSPGWRPGSLTSKSY 130 140 150 160 170 180 190 200 210 220 230 pF1KE4 PQEAEEER----WYLAAQVAVARGPLLFSGALSEGQFYSPPESFAGSDNESDEEVAGKKS :.. .. ::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PRKLKRSDSAPPGYLAAQVAVARGPLLFSGALSEGQFYSPPESFAGSDNESDEEVAGKKS 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE4 FSAQEREYIRQGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTFLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 FSAQEREYIRQGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTFLP 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE4 CPAELVYQEVILQPERMVLWNKTVTACQILQRVEDNTLISYDVSAGAAGGVVSPRDFVNV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 CPAELVYQEVILQPERMVLWNKTVTACQILQRVEDNTLISYDVSAGAAGGVVSPRDFVNV 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE4 RRIERRRDRYLSSGIATSHSAKPPTHKYVRGENGPGGFIVLKSASNPRVCTFVWILNTDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 RRIERRRDRYLSSGIATSHSAKPPTHKYVRGENGPGGFIVLKSASNPRVCTFVWILNTDL 360 370 380 390 400 410 420 430 440 pF1KE4 KGRLPRYLIHQSLAATMFEFAFHLRQRISELGARA ::::::::::::::::::::::::::::::::::: CCDS54 KGRLPRYLIHQSLAATMFEFAFHLRQRISELGARA 420 430 440 >>CCDS54118.1 STARD3 gene_id:10948|Hs108|chr17 (427 aa) initn: 2001 init1: 2001 opt: 2001 Z-score: 2526.7 bits: 476.6 E(32554): 2e-134 Smith-Waterman score: 2763; 95.7% identity (96.0% similar) in 445 aa overlap (1-445:1-427) 10 20 30 40 50 60 pF1KE4 MSKLPRELTRDLERSLPAVASLGSSLSHSQSLSSHLLPPPEKRRAISDVRRTFCLFVTFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MSKLPRELTRDLERSLPAVASLGSSLSHSQSLSSHLLPPPEKRRAISDVRRTFCLFVTFD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LLFISLLWIIELNTNTGIRKNLEQEIIQYNFKTSFFDIFVLAFFRFSGLLLGYAVLQLRH ::::::::::::::::::::::::::::::::::::::::::::::::::::::::.::: CCDS54 LLFISLLWIIELNTNTGIRKNLEQEIIQYNFKTSFFDIFVLAFFRFSGLLLGYAVLRLRH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 WWVIAVTTLVSSAFLIVKVILSELLSKGAFGYLLPIVSFVLAWLETWFLDFKVLPQEAEE ::::: ::::::::::::::::::::::::::::::::::::: CCDS54 WWVIA------------------LLSKGAFGYLLPIVSFVLAWLETWFLDFKVLPQEAEE 130 140 150 160 190 200 210 220 230 240 pF1KE4 ERWYLAAQVAVARGPLLFSGALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREYIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 ERWYLAAQVAVARGPLLFSGALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREYIR 170 180 190 200 210 220 250 260 270 280 290 300 pF1KE4 QGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 QGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQEV 230 240 250 260 270 280 310 320 330 340 350 360 pF1KE4 ILQPERMVLWNKTVTACQILQRVEDNTLISYDVSAGAAGGVVSPRDFVNVRRIERRRDRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 ILQPERMVLWNKTVTACQILQRVEDNTLISYDVSAGAAGGVVSPRDFVNVRRIERRRDRY 290 300 310 320 330 340 370 380 390 400 410 420 pF1KE4 LSSGIATSHSAKPPTHKYVRGENGPGGFIVLKSASNPRVCTFVWILNTDLKGRLPRYLIH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LSSGIATSHSAKPPTHKYVRGENGPGGFIVLKSASNPRVCTFVWILNTDLKGRLPRYLIH 350 360 370 380 390 400 430 440 pF1KE4 QSLAATMFEFAFHLRQRISELGARA ::::::::::::::::::::::::: CCDS54 QSLAATMFEFAFHLRQRISELGARA 410 420 >>CCDS5455.1 STARD3NL gene_id:83930|Hs108|chr7 (234 aa) initn: 809 init1: 712 opt: 848 Z-score: 1073.2 bits: 206.8 E(32554): 1.9e-53 Smith-Waterman score: 848; 57.6% identity (82.7% similar) in 231 aa overlap (1-229:1-229) 10 20 30 40 50 pF1KE4 MSKLPRELTRDLERSLPAVASLGS--SLSHSQSLSSHLLPPPEKRRAISDVRRTFCLFVT :..::... : : . ::: . :.. .: .. .....::::::::::::: CCDS54 MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVT 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 FDLLFISLLWIIELNTNTGIRKNLEQEIIQYNFKTSFFDIFVLAFFRFSGLLLGYAVLQL :::::..::::::::.: ::...::.:..::.. .:.::::.:: :::. :.:.::: .: CCDS54 FDLLFVTLLWIIELNVNGGIENTLEKEVMQYDYYSSYFDIFLLAVFRFKVLILAYAVCRL 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 RHWWVIAVTTLVSSAFLIVKVILSELLSKGAFGYLLPIVSFVLAWLETWFLDFKVLPQEA ::::.::.:: :.::::..:::::.:.:.:::::.:::.::.:::.:::::::::::::: CCDS54 RHWWAIALTTAVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEA 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 EEERWYLAAQVAVARGPLLFSGALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREY ::: : .: : :. :. :.::.::::::::: :::. :..:. ..: CCDS54 EEENRLLIVQDASERAALI-PGGLSDGQFYSPPESEAGSE-EAEEKQDSEKPLLEL 190 200 210 220 230 240 250 260 270 280 290 pF1KE4 IRQGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQ >>CCDS6102.1 STAR gene_id:6770|Hs108|chr8 (285 aa) initn: 514 init1: 485 opt: 492 Z-score: 621.9 bits: 123.6 E(32554): 2.6e-28 Smith-Waterman score: 492; 37.5% identity (70.2% similar) in 208 aa overlap (231-438:68-275) 210 220 230 240 250 260 pF1KE4 ALSEGQFYSPPESFAGSDNESDEEVAGKKSFSAQEREYIRQGKEATAVVDQILAQEENWK .: :: :..::.:: . ::...:.:: CCDS61 RALGGPTPSTWINQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWK 40 50 60 70 80 90 270 280 290 300 310 320 pF1KE4 FEKNNEYGDTVYTIEVPFHGKTFILKTFLPCPAELVYQEVILQPERMVLWNKTVTACQIL :.... :: :.. :: ::.: :.. . : : .:.:.. . : : :: .: ..: CCDS61 KESQQDNGDKVMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVL 100 110 120 130 140 150 330 340 350 360 370 380 pF1KE4 QRVEDNTLISYDVSAGAAGGVVSPRDFVNVRRIERRRDRYLSSGIATSHSAKPPTHKYVR :.. .:.:.....: :::..:.:::::.:: .:: . . .:.::. . : . .: CCDS61 QKIGKDTFITHELAAEAAGNLVGPRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIR 160 170 180 190 200 210 390 400 410 420 430 440 pF1KE4 GENGPGGFIVLKSASNPRVCTFVWILNTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISE .:.:: ... :..: ..:.:. :::: ::. .:.: :. :. .:: :::.:. CCDS61 AEHGPTCMVLHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLES 220 230 240 250 260 270 pF1KE4 LGARA CCDS61 HPASEARC 280 >>CCDS11955.1 STARD6 gene_id:147323|Hs108|chr18 (220 aa) initn: 183 init1: 86 opt: 281 Z-score: 356.9 bits: 74.2 E(32554): 1.5e-13 Smith-Waterman score: 281; 24.2% identity (62.1% similar) in 182 aa overlap (259-438:24-203) 230 240 250 260 270 280 pF1KE4 KSFSAQEREYIRQGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTF :: :... . :::. . .. . CCDS11 MDFKAIAQQTAQEVLGYNRDTSGWKVVKTSKKITVSSKASRKFHGNLYRVEGI 10 20 30 40 50 290 300 310 320 330 340 pF1KE4 LP-CPAELVYQEVILQPERMVLWNKTVTACQILQRVEDNTLISYDVSAGAAGGVVSPRDF .: ::.: .. . : . :.:.. . ....:....:.: . .. . : : .::::: CCDS11 IPESPAKL--SDFLYQTGDRITWDKSLQVYNMVHRIDSDTFICHTITQSFAVGSISPRDF 60 70 80 90 100 110 350 360 370 380 390 400 pF1KE4 VNVRRIERRRDRY-LSSGIATSHSAKPPTHKYVRGENGPGGFIVLKSASNPRVCTFVWIL ... :.: . . . :. ... ::. .:.:: : : ::. :: .: .. CCDS11 IDLVYIKRYEGNMNIISSKSVDFPEYPPSSNYIRGYNHPCGFVCSPMEENPAYSKLVMFV 120 130 140 150 160 170 410 420 430 440 pF1KE4 NTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISELGARA .:...:.: .:.... ... .: .. .. : CCDS11 QTEMRGKLSPSIIEKTMPSNLVNFILNAKDGIKAHRTPSRRGFHHNSHS 180 190 200 210 220 >>CCDS10318.1 STARD5 gene_id:80765|Hs108|chr15 (213 aa) initn: 218 init1: 133 opt: 246 Z-score: 312.9 bits: 66.0 E(32554): 4.2e-11 Smith-Waterman score: 246; 25.7% identity (59.9% similar) in 202 aa overlap (247-441:12-211) 220 230 240 250 260 270 pF1KE4 SDNESDEEVAGKKSFSAQEREYIRQGKEATAVVDQILAQEEN---WKFEKNNEYGDTVYT ::....: ... ::. .... .. . CCDS10 MDPALAAQMSEAVAEKMLQYRRDTAGWKICREGNGVSVSWR 10 20 30 40 280 290 300 310 320 330 pF1KE4 IEVPFHGKTFILKTFLPCPAELVYQEVILQPER---MVLWNKTVTACQILQRVEDNTLIS : : :. . . .. : :.. : .: : :...::. .:.: . :. .: CCDS10 PSVEFPGNLYRGEGIVYGTLEEVWDCV--KPAVGGLRVKWDENVTGFEIIQSITDTLCVS 50 60 70 80 90 340 350 360 370 380 pF1KE4 YDVSAGAAGGVVSPRDFVNVRRIERRRDRYLSSGIA-TSHSAKPPTHKYVRGENGPGGFI . .:: ..::::::.. ..: .: .::. . . : :: .::: : : : . CCDS10 RTSTPSAAMKLISPRDFVDLVLVKRYEDGTISSNATHVEHPLCPPKPGFVRGFNHPCGCF 100 110 120 130 140 150 390 400 410 420 430 440 pF1KE4 VLKSASNPRVCTFVWILNTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISELGARA ..: ..: ...:::.: ::. .. . . .: .: .:.. .... CCDS10 CEPLPGEPTKTNLVTFFHTDLSGYLPQNVVDSFFPRSMTRFYANLQKAVKQFHE 160 170 180 190 200 210 445 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 01:11:08 2016 done: Sun Nov 6 01:11:08 2016 Total Scan time: 3.170 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]