FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1679, 193 aa
1>>>pF1KE1679 193 - 193 aa - 193 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.6213+/-0.000257; mu= 6.3288+/- 0.016
mean_var=175.0388+/-35.069, 0's: 0 Z-trim(126.1): 48 B-trim: 906 in 1/61
Lambda= 0.096941
statistics sampled from 51226 (51286) to 51226 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.858), E-opt: 0.2 (0.601), width: 16
Scan time: 5.940
The best scores are:                                       opt  bits E(85289)
NP_005161 (OMIM: 601886) achaete-scute homolog 2 [ ( 193) 1297 191.7 6.4e-49
NP_004307 (OMIM: 100790,209880) achaete-scute homo ( 236)  423  69.5 4.7e-12
NP_982260 (OMIM: 609155) achaete-scute homolog 4 [ ( 173)  268  47.7 1.2e-05
XP_016871769 (OMIM: 604882,610370) PREDICTED: neur ( 214)  226  41.9 0.00086
NP_066279 (OMIM: 604882,610370) neurogenin-3 [Homo ( 214)  226  41.9 0.00086
NP_803238 (OMIM: 608606) class A basic helix-loop- ( 189)  215  40.3 0.0023
NP_065697 (OMIM: 609154) achaete-scute homolog 3 [ ( 181)  213  40.0 0.0027
NP_076924 (OMIM: 606624) neurogenin-2 [Homo sapien ( 272)  204  38.9 0.0086
>>NP_005161 (OMIM: 601886) achaete-scute homolog 2 [Homo (193 aa)
initn: 1297 init1: 1297 opt: 1297 Z-score: 999.2 bits: 191.7 E(85289): 6.4e-49
Smith-Waterman score: 1297; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193)
               10        20        30        40        50        60
pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSRRRRPATAETGGGAAAVARRNERER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSRRRRPATAETGGGAAAVARRNERER
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 NRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSAVEYIRALQRLLAEHDAVRNALAGGL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 NRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSAVEYIRALQRLLAEHDAVRNALAGGL
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 RPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGGSSEPGSPRSAYSSDDSGCEGALSPAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 RPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGGSSEPGSPRSAYSSDDSGCEGALSPAE
              130       140       150       160       170       180
              190
pF1KE1 RELLDFSSWLGGY
       :::::::::::::
NP_005 RELLDFSSWLGGY
              190
>>NP_004307 (OMIM: 100790,209880) achaete-scute homolog (236 aa)
initn: 517 init1: 350 opt: 423 Z-score: 337.4 bits: 69.5 E(85289): 4.7e-12
Smith-Waterman score: 472; 52.0% identity (68.0% similar) in 175 aa overlap (23-190:88-236)
10 20 30 40
pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSRRRRPATAETGGG----
:.: .::::.::.:: .: :
NP_004 QQQQQAPQLRPAADGQPSGGGHKSAPKQVKRQRSSSPELMRCKRR----LNFSGFGYSLP
60 70 80 90 100 110
50 60 70 80 90 100
pF1KE1 ---AAAVARRNERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSAVEYIRALQRL
::::::::::::::::::::: .::.:::.:.:.::.:::::::::::::::::.:
NP_004 QQQPAAVARRNERERNRVKLVNLGFATLREHVPNGAANKKMSKVETLRSAVEYIRALQQL
120 130 140 150 160 170
110 120 130 140 150 160
pF1KE1 LAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGGSSEPGSPRSAY
: ::::: :. .:. . :. ::. ... .: ::: :.:
NP_004 LDEHDAVSAAFQAGV----LSPTI------------SPNYSNDL-----NSMAGSPVSSY
180 190 200 210
170 180 190
pF1KE1 SSDDSGCEGALSPAERELLDFSSWLGGY
:::. : ::: :.:::::..:.
NP_004 SSDE-GSYDPLSPEEQELLDFTNWF
220 230
>>NP_982260 (OMIM: 609155) achaete-scute homolog 4 [Homo (173 aa)
initn: 237 init1: 207 opt: 268 Z-score: 222.1 bits: 47.7 E(85289): 1.2e-05
Smith-Waterman score: 268; 41.0% identity (62.2% similar) in 156 aa overlap (4-155:26-173)
10 20 30
pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLR--CSR
:::: . : :. : : : : : : :.:
NP_982 MMETRKPAERLALPYSLRTAPLGVPGTLP-GLPRRDPLRV--ALRLDAACWEWARSGCAR
10 20 30 40 50
40 50 60 70 80 90
pF1KE1 --RRRPATAETGGGAAAVARRNERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETLRS
. :. ... : . .::::::.::. :: :. ::.:.:. :.:.::::::::.
NP_982 GWQYLPVPLDSAFEPAFLRKRNERERQRVRCVNEGYARLRDHLPRELADKRLSKVETLRA
60 70 80 90 100 110
100 110 120 130 140 150
pF1KE1 AVEYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGG
:..::. ::.:: :..: :.: :: : . : . ....:: :: : .::
NP_982 AIDYIKHLQELL-ERQAWGLEGAAGAVPQR---RAECNSDGESKASSAPS-PSSEPEEGG
120 130 140 150 160 170
160 170 180 190
pF1KE1 SSEPGSPRSAYSSDDSGCEGALSPAERELLDFSSWLGGY
:
NP_982 S
>>XP_016871769 (OMIM: 604882,610370) PREDICTED: neurogen (214 aa)
initn: 186 init1: 127 opt: 226 Z-score: 189.1 bits: 41.9 E(85289): 0.00086
Smith-Waterman score: 226; 35.2% identity (54.6% similar) in 196 aa overlap (2-186:27-213)
10 20 30
pF1KE1 MDGGTLPRSAPPAPPVPVG-CAARRRPA---SPEL
: : : ::::.: : :: .. . .:.
XP_016 MTPQPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRK
10 20 30 40 50 60
40 50 60 70 80
pF1KE1 LRCSR--RRRPATAETGGGAAAVARR---NERERNRVKLVNLGFQALRQHVPHGGASKKL
:: : : :: . : . . .:: :.:::::.. .: ...::: .: . ::
XP_016 LRARRGGRSRPKS-ELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKL
70 80 90 100 110
90 100 110 120 130 140
pF1KE1 SKVETLRSAVEYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASP-SR
.:.:::: : .:: :: . : : :: :. . ..: : :: :: :.
XP_016 TKIETLRFAHNYIWALTQTLRIADHSLYALEPPA-PHCGELGSPGGSPGDWGSLYSPVSQ
120 130 140 150 160 170
150 160 170 180 190
pF1KE1 ASS-SPGRGGSSEPGSPRSAYSSDDSGCEGALSPAERELLDFSSWLGGY
:.: ::. . .:: ...:. : :::. . ::
XP_016 AGSLSPAASLEERPGLLGATFSA----C---LSPGSLAFSDFL
180 190 200 210
>>NP_066279 (OMIM: 604882,610370) neurogenin-3 [Homo sap (214 aa)
initn: 186 init1: 127 opt: 226 Z-score: 189.1 bits: 41.9 E(85289): 0.00086
Smith-Waterman score: 226; 35.2% identity (54.6% similar) in 196 aa overlap (2-186:27-213)
10 20 30
pF1KE1 MDGGTLPRSAPPAPPVPVG-CAARRRPA---SPEL
: : : ::::.: : :: .. . .:.
NP_066 MTPQPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRK
10 20 30 40 50 60
40 50 60 70 80
pF1KE1 LRCSR--RRRPATAETGGGAAAVARR---NERERNRVKLVNLGFQALRQHVPHGGASKKL
:: : : :: . : . . .:: :.:::::.. .: ...::: .: . ::
NP_066 LRARRGGRSRPKS-ELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKL
70 80 90 100 110
90 100 110 120 130 140
pF1KE1 SKVETLRSAVEYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASP-SR
.:.:::: : .:: :: . : : :: :. . ..: : :: :: :.
NP_066 TKIETLRFAHNYIWALTQTLRIADHSLYALEPPA-PHCGELGSPGGSPGDWGSLYSPVSQ
120 130 140 150 160 170
150 160 170 180 190
pF1KE1 ASS-SPGRGGSSEPGSPRSAYSSDDSGCEGALSPAERELLDFSSWLGGY
:.: ::. . .:: ...:. : :::. . ::
NP_066 AGSLSPAASLEERPGLLGATFSA----C---LSPGSLAFSDFL
180 190 200 210
>>NP_803238 (OMIM: 608606) class A basic helix-loop-heli (189 aa)
initn: 227 init1: 162 opt: 215 Z-score: 181.5 bits: 40.3 E(85289): 0.0023
Smith-Waterman score: 215; 42.7% identity (65.0% similar) in 103 aa overlap (4-102:28-127)
10 20 30
pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSR
:.:: :.: : .: :. . .:
NP_803 MKTKNRPPRRRAPVQDTEATPGEGTPDGSLPN---PGPEPAKGLRSRPARAAARAPGEGR
10 20 30 40 50
40 50 60 70 80 90
pF1KE1 RRRPATAETGGGA-AAVARR---NERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETL
::::. . :: ... :: :::::.:.. .: .:::::. .:: :.:::::.:::
NP_803 RRRPGPSGPGGRRDSSIQRRLESNERERQRMHKLNNAFQALREVIPHVRADKKLSKIETL
60 70 80 90 100 110
100 110 120 130 140 150
pF1KE1 RSAVEYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGR
: .::..:
NP_803 TLAKNYIKSLTATILTMSSSRLPGLEGPGPKLYQHYQQQQQVAGGALGATEAQPQGHLQR
120 130 140 150 160 170
>>NP_065697 (OMIM: 609154) achaete-scute homolog 3 [Homo (181 aa)
initn: 198 init1: 198 opt: 213 Z-score: 180.2 bits: 40.0 E(85289): 0.0027
Smith-Waterman score: 213; 42.6% identity (61.4% similar) in 101 aa overlap (6-106:53-149)
10 20 30
pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCS
::: :. . .: .. : : . .
NP_065 LPLTRSFYLEPMVTFHVHPEAPVSSPYSEELPRLPFPSDSLILGNYSEPCPFSFPMPYPN
30 40 50 60 70 80
40 50 60 70 80 90
pF1KE1 RRRRPATAETGGGAAAVARRNERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSA
: : . : : . .::::::.::: :: :. ::.:.:. :.::::::::.:
NP_065 YR----GCEYSYGPAFTRKRNERERQRVKCVNEGYAQLRHHLPEEYLEKRLSKVETLRAA
90 100 110 120 130
100 110 120 130 140 150
pF1KE1 VEYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGGS
..:: :: ::
NP_065 IKYINYLQSLLYPDKAETKNNPGKVSSMIATTSHHADPMFRIV
140 150 160 170 180
>>NP_076924 (OMIM: 606624) neurogenin-2 [Homo sapiens] (272 aa)
initn: 194 init1: 132 opt: 204 Z-score: 171.1 bits: 38.9 E(85289): 0.0086
Smith-Waterman score: 212; 34.2% identity (55.6% similar) in 187 aa overlap (19-189:73-251)
10 20 30 40
pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSRRRRPATAET---
:: ::: : . .:::. :..
NP_076 EEPGASGGARRQRGAEAGQGARGGVAAGAEGC----RPARLLGLVHDCKRRPSRARAVSR
50 60 70 80 90
50 60 70 80 90
pF1KE1 GGGAAAVARR---------NERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSAV
:. .: ...: :.:::::.. .: ...:::. .: . ::.:.:::: :
NP_076 GAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAH
100 110 120 130 140 150
100 110 120 130 140 150
pF1KE1 EYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGRG---
.:: :: . : : .. .::: : :. : :: . .: : : : ::.
NP_076 NYIWALTETLRLADHCGGG-GGGL-PGALFSEAVLLSPGGASAALSSSGDSPSPASTWSC
160 170 180 190 200 210
160 170 180 190
pF1KE1 -GSSEPGSPRSAYSSDDSGCEGALSPAERELLDFSSWLGGY
.: :.: :. :.. .: .:::: :.. :
NP_076 TNSPAPSSSVSSNSTSPYSC--TLSPASPAGSDMDYWQPPPPDKHRYAPHLPIARDCI
220 230 240 250 260 270
193 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 15:16:29 2016 done: Sun Nov 6 15:16:30 2016
Total Scan time: 5.940 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]