FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9578, 237 aa
1>>>pF1KB9578 237 - 237 aa - 237 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.1527+/-0.000316; mu= 3.3096+/- 0.020
mean_var=212.7831+/-43.550, 0's: 0 Z-trim(123.3): 69 B-trim: 547 in 1/55
Lambda= 0.087924
statistics sampled from 42655 (42750) to 42655 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.802), E-opt: 0.2 (0.501), width: 16
Scan time: 7.580
The best scores are: opt bits E(85289)
NP_004307 (OMIM: 100790,209880) achaete-scute homo ( 236) 1549 208.0 1.1e-53
NP_005161 (OMIM: 601886) achaete-scute homolog 2 [ ( 193) 423 65.1 9.8e-11
NP_982260 (OMIM: 609155) achaete-scute homolog 4 [ ( 173) 254 43.6 0.00026
NP_065697 (OMIM: 609154) achaete-scute homolog 3 [ ( 181) 222 39.6 0.0044
NP_068808 (OMIM: 602407) heart- and neural crest d ( 217) 215 38.8 0.0094
>>NP_004307 (OMIM: 100790,209880) achaete-scute homolog (236 aa)
initn: 1309 init1: 1309 opt: 1549 Z-score: 1084.5 bits: 208.0 E(85289): 1.1e-53
Smith-Waterman score: 1549; 99.6% identity (99.6% similar) in 237 aa overlap (1-237:1-236)
10 20 30 40 50 60
pF1KB9 MESSAKMESGGAGQQPQPQPQQPFLPPAACFFATAAAAAAAAAAAAAQSAQQQQQQQQQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 MESSAKMESGGAGQQPQPQPQQPFLPPAACFFATAAAAAAAAAAAAAQSAQQQQQQQQQQ
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 QQQAPQLRPAADGQPSGGGHKSAPKQVKRQRSSSPELMRCKRRLNFSGFGYSLPQQQPAA
:: :::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 QQ-APQLRPAADGQPSGGGHKSAPKQVKRQRSSSPELMRCKRRLNFSGFGYSLPQQQPAA
70 80 90 100 110
130 140 150 160 170 180
pF1KB9 VARRNERERNRVKLVNLGFATLREHVPNGAANKKMSKVETLRSAVEYIRALQQLLDEHDA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 VARRNERERNRVKLVNLGFATLREHVPNGAANKKMSKVETLRSAVEYIRALQQLLDEHDA
120 130 140 150 160 170
190 200 210 220 230
pF1KB9 VSAAFQAGVLSPTISPNYSNDLNSMAGSPVSSYSSDEGSYDPLSPEEQELLDFTNWF
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 VSAAFQAGVLSPTISPNYSNDLNSMAGSPVSSYSSDEGSYDPLSPEEQELLDFTNWF
180 190 200 210 220 230
>>NP_005161 (OMIM: 601886) achaete-scute homolog 2 [Homo (193 aa)
initn: 517 init1: 350 opt: 423 Z-score: 313.7 bits: 65.1 E(85289): 9.8e-11
Smith-Waterman score: 472; 52.0% identity (68.0% similar) in 175 aa overlap (89-237:23-190)
60 70 80 90 100 110
pF1KB9 QQQQQAPQLRPAADGQPSGGGHKSAPKQVKRQRSSSPELMRCKRR----LNFSGFGYSLP
:.: .::::.::.:: .: :
NP_005 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSRRRRPATAETGGG----
10 20 30 40
120 130 140 150 160 170
pF1KB9 QQQPAAVARRNERERNRVKLVNLGFATLREHVPNGAANKKMSKVETLRSAVEYIRALQQL
::::::::::::::::::::: .::.:::.:.:.::.:::::::::::::::::.:
NP_005 ---AAAVARRNERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSAVEYIRALQRL
50 60 70 80 90 100
180 190 200 210
pF1KB9 LDEHDAVSAAFQAGV----LSPTI------------SPNYSNDL-----NSMAGSPVSSY
: ::::: :. .:. . :. ::. ... .: ::: :.:
NP_005 LAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGGSSEPGSPRSAY
110 120 130 140 150 160
220 230
pF1KB9 SSDE-GSYDPLSPEEQELLDFTNWF
:::. : ::: :.:::::..:.
NP_005 SSDDSGCEGALSPAERELLDFSSWLGGY
170 180 190
>>NP_982260 (OMIM: 609155) achaete-scute homolog 4 [Homo (173 aa)
initn: 234 init1: 234 opt: 254 Z-score: 198.4 bits: 43.6 E(85289): 0.00026
Smith-Waterman score: 254; 38.9% identity (60.5% similar) in 162 aa overlap (67-218:17-171)
40 50 60 70 80 90
pF1KB9 AAAAAAAAAAAQSAQQQQQQQQQQQQQAPQLRPAADGQPSG--GGHKSAPKQVK-RQRSS
:: : : :. : . : .: : ..
NP_982 MMETRKPAERLALPYSLRTAPLGVPGTLPGLPRRDPLRVALRLDAA
10 20 30 40
100 110 120 130 140
pF1KB9 SPELMR--CKRRLNFSGFGY-SLPQQ---QPAAVARRNERERNRVKLVNLGFATLREHVP
: : : : :. : .: . .:: . .::::::.::. :: :.: ::.:.:
NP_982 CWEWARSGCAR-----GWQYLPVPLDSAFEPAFLRKRNERERQRVRCVNEGYARLRDHLP
50 60 70 80 90 100
150 160 170 180 190 200
pF1KB9 NGAANKKMSKVETLRSAVEYIRALQQLLDEHDAVSAAFQAGVLSPTISPNYSNDLNSMAG
:.:..:::::::.:..::. ::.:: :..: . ::.. : . ..: .: :.
NP_982 RELADKRLSKVETLRAAIDYIKHLQELL-ERQAWGLEGAAGAV-PQRRAECNSDGESKAS
110 120 130 140 150
210 220 230
pF1KB9 S-PVSSYSSDEGSYDPLSPEEQELLDFTNWF
: : : .::
NP_982 SAPSPSSEPEEGGS
160 170
>>NP_065697 (OMIM: 609154) achaete-scute homolog 3 [Homo (181 aa)
initn: 214 init1: 214 opt: 222 Z-score: 176.3 bits: 39.6 E(85289): 0.0044
Smith-Waterman score: 222; 43.3% identity (63.9% similar) in 97 aa overlap (105-201:82-175)
80 90 100 110 120 130
pF1KB9 PSGGGHKSAPKQVKRQRSSSPELMRCKRRLNFSGFGYSLPQQQPAAVARRNERERNRVKL
:. : :: :: . .::::::.:::
NP_065 ELPRLPFPSDSLILGNYSEPCPFSFPMPYPNYRGCEYSY---GPAFTRKRNERERQRVKC
60 70 80 90 100
140 150 160 170 180 190
pF1KB9 VNLGFATLREHVPNGAANKKMSKVETLRSAVEYIRALQQLLDEHDAVSAAFQAGVLSPTI
:: :.: ::.:.:. .:..:::::::.:..:: ::.:: : . . : :
NP_065 VNEGYAQLRHHLPEEYLEKRLSKVETLRAAIKYINYLQSLLYPDKAETKNNPGKVSSMIA
110 120 130 140 150 160
200 210 220 230
pF1KB9 SPNYSNDLNSMAGSPVSSYSSDEGSYDPLSPEEQELLDFTNWF
. .. :
NP_065 TTSHHADPMFRIV
170 180
>>NP_068808 (OMIM: 602407) heart- and neural crest deriv (217 aa)
initn: 219 init1: 158 opt: 215 Z-score: 170.4 bits: 38.8 E(85289): 0.0094
Smith-Waterman score: 244; 31.7% identity (56.1% similar) in 205 aa overlap (20-212:8-195)
10 20 30 40 50 60
pF1KB9 MESSAKMESGGAGQQPQPQPQQPFLPPAACFFATAAAAAAAAAAAAAQSAQQQQQQQQQQ
:..: . . ::.::::::::::. . .
NP_068 MSLVGGFPHHPVVHHEGYPFAAAAAAAAAAAAS------------RCS
10 20 30
70 80 90 100 110 120
pF1KB9 QQQAPQLRPAADGQPSGGGHKSAPKQVKRQRSSSPELMRCKRRLNFSGFGYSLPQQQPAA
... : .. :.: . .: . . : ::: :. : .: : : .
NP_068 HEENPYFHGWLIGHP-----EMSPPDYSMALSYSPEYASGAAGLDHSHYGGVPPGAGPPG
40 50 60 70 80 90
130 140 150 160 170
pF1KB9 ------VARR---NERERNRVKLVNLGFATLREHVPNGAANKKMSKVETLRSAVEYIRAL
: :: :..:: :.. .: .:: ::: .:: :. :.::..::: :. :: :
NP_068 LGGPRPVKRRGTANRKERRRTQSINSAFAELRECIPNVPADTKLSKIKTLRLATSYIAYL
100 110 120 130 140 150
180 190 200 210 220
pF1KB9 QQLL--DEHDAVSAAFQAGVLSPTISPN-YSNDLNSMAGSPVSSYSSDEGSYDPLSPEEQ
..:: :.... . ::.: . . .. . ...:: . : :::
NP_068 MDLLAKDDQNGEAEAFKAEIKKTDVKEEKRKKELNEILKSTVSSNDKKTKGRTGWPQHVW
160 170 180 190 200 210
230
pF1KB9 ELLDFTNWF
NP_068 ALELKQ
237 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 17:27:28 2016 done: Fri Nov 4 17:27:29 2016
Total Scan time: 7.580 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]