FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1679, 193 aa 1>>>pF1KE1679 193 - 193 aa - 193 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.6213+/-0.000257; mu= 6.3288+/- 0.016 mean_var=175.0388+/-35.069, 0's: 0 Z-trim(126.1): 48 B-trim: 906 in 1/61 Lambda= 0.096941 statistics sampled from 51226 (51286) to 51226 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.858), E-opt: 0.2 (0.601), width: 16 Scan time: 5.940 The best scores are: opt bits E(85289) NP_005161 (OMIM: 601886) achaete-scute homolog 2 [ ( 193) 1297 191.7 6.4e-49 NP_004307 (OMIM: 100790,209880) achaete-scute homo ( 236) 423 69.5 4.7e-12 NP_982260 (OMIM: 609155) achaete-scute homolog 4 [ ( 173) 268 47.7 1.2e-05 XP_016871769 (OMIM: 604882,610370) PREDICTED: neur ( 214) 226 41.9 0.00086 NP_066279 (OMIM: 604882,610370) neurogenin-3 [Homo ( 214) 226 41.9 0.00086 NP_803238 (OMIM: 608606) class A basic helix-loop- ( 189) 215 40.3 0.0023 NP_065697 (OMIM: 609154) achaete-scute homolog 3 [ ( 181) 213 40.0 0.0027 NP_076924 (OMIM: 606624) neurogenin-2 [Homo sapien ( 272) 204 38.9 0.0086 >>NP_005161 (OMIM: 601886) achaete-scute homolog 2 [Homo (193 aa) initn: 1297 init1: 1297 opt: 1297 Z-score: 999.2 bits: 191.7 E(85289): 6.4e-49 Smith-Waterman score: 1297; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193) 10 20 30 40 50 60 pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSRRRRPATAETGGGAAAVARRNERER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSRRRRPATAETGGGAAAVARRNERER 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 NRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSAVEYIRALQRLLAEHDAVRNALAGGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 NRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSAVEYIRALQRLLAEHDAVRNALAGGL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 RPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGGSSEPGSPRSAYSSDDSGCEGALSPAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 RPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGGSSEPGSPRSAYSSDDSGCEGALSPAE 130 140 150 160 170 180 190 pF1KE1 RELLDFSSWLGGY ::::::::::::: NP_005 RELLDFSSWLGGY 190 >>NP_004307 (OMIM: 100790,209880) achaete-scute homolog (236 aa) initn: 517 init1: 350 opt: 423 Z-score: 337.4 bits: 69.5 E(85289): 4.7e-12 Smith-Waterman score: 472; 52.0% identity (68.0% similar) in 175 aa overlap (23-190:88-236) 10 20 30 40 pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSRRRRPATAETGGG---- :.: .::::.::.:: .: : NP_004 QQQQQAPQLRPAADGQPSGGGHKSAPKQVKRQRSSSPELMRCKRR----LNFSGFGYSLP 60 70 80 90 100 110 50 60 70 80 90 100 pF1KE1 ---AAAVARRNERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSAVEYIRALQRL ::::::::::::::::::::: .::.:::.:.:.::.:::::::::::::::::.: NP_004 QQQPAAVARRNERERNRVKLVNLGFATLREHVPNGAANKKMSKVETLRSAVEYIRALQQL 120 130 140 150 160 170 110 120 130 140 150 160 pF1KE1 LAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGGSSEPGSPRSAY : ::::: :. .:. . :. ::. ... .: ::: :.: NP_004 LDEHDAVSAAFQAGV----LSPTI------------SPNYSNDL-----NSMAGSPVSSY 180 190 200 210 170 180 190 pF1KE1 SSDDSGCEGALSPAERELLDFSSWLGGY :::. : ::: :.:::::..:. NP_004 SSDE-GSYDPLSPEEQELLDFTNWF 220 230 >>NP_982260 (OMIM: 609155) achaete-scute homolog 4 [Homo (173 aa) initn: 237 init1: 207 opt: 268 Z-score: 222.1 bits: 47.7 E(85289): 1.2e-05 Smith-Waterman score: 268; 41.0% identity (62.2% similar) in 156 aa overlap (4-155:26-173) 10 20 30 pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLR--CSR :::: . : :. : : : : : : :.: NP_982 MMETRKPAERLALPYSLRTAPLGVPGTLP-GLPRRDPLRV--ALRLDAACWEWARSGCAR 10 20 30 40 50 40 50 60 70 80 90 pF1KE1 --RRRPATAETGGGAAAVARRNERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETLRS . :. ... : . .::::::.::. :: :. ::.:.:. :.:.::::::::. NP_982 GWQYLPVPLDSAFEPAFLRKRNERERQRVRCVNEGYARLRDHLPRELADKRLSKVETLRA 60 70 80 90 100 110 100 110 120 130 140 150 pF1KE1 AVEYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGG :..::. ::.:: :..: :.: :: : . : . ....:: :: : .:: NP_982 AIDYIKHLQELL-ERQAWGLEGAAGAVPQR---RAECNSDGESKASSAPS-PSSEPEEGG 120 130 140 150 160 170 160 170 180 190 pF1KE1 SSEPGSPRSAYSSDDSGCEGALSPAERELLDFSSWLGGY : NP_982 S >>XP_016871769 (OMIM: 604882,610370) PREDICTED: neurogen (214 aa) initn: 186 init1: 127 opt: 226 Z-score: 189.1 bits: 41.9 E(85289): 0.00086 Smith-Waterman score: 226; 35.2% identity (54.6% similar) in 196 aa overlap (2-186:27-213) 10 20 30 pF1KE1 MDGGTLPRSAPPAPPVPVG-CAARRRPA---SPEL : : : ::::.: : :: .. . .:. XP_016 MTPQPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRK 10 20 30 40 50 60 40 50 60 70 80 pF1KE1 LRCSR--RRRPATAETGGGAAAVARR---NERERNRVKLVNLGFQALRQHVPHGGASKKL :: : : :: . : . . .:: :.:::::.. .: ...::: .: . :: XP_016 LRARRGGRSRPKS-ELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKL 70 80 90 100 110 90 100 110 120 130 140 pF1KE1 SKVETLRSAVEYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASP-SR .:.:::: : .:: :: . : : :: :. . ..: : :: :: :. XP_016 TKIETLRFAHNYIWALTQTLRIADHSLYALEPPA-PHCGELGSPGGSPGDWGSLYSPVSQ 120 130 140 150 160 170 150 160 170 180 190 pF1KE1 ASS-SPGRGGSSEPGSPRSAYSSDDSGCEGALSPAERELLDFSSWLGGY :.: ::. . .:: ...:. : :::. . :: XP_016 AGSLSPAASLEERPGLLGATFSA----C---LSPGSLAFSDFL 180 190 200 210 >>NP_066279 (OMIM: 604882,610370) neurogenin-3 [Homo sap (214 aa) initn: 186 init1: 127 opt: 226 Z-score: 189.1 bits: 41.9 E(85289): 0.00086 Smith-Waterman score: 226; 35.2% identity (54.6% similar) in 196 aa overlap (2-186:27-213) 10 20 30 pF1KE1 MDGGTLPRSAPPAPPVPVG-CAARRRPA---SPEL : : : ::::.: : :: .. . .:. NP_066 MTPQPSGAPTVQVTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRK 10 20 30 40 50 60 40 50 60 70 80 pF1KE1 LRCSR--RRRPATAETGGGAAAVARR---NERERNRVKLVNLGFQALRQHVPHGGASKKL :: : : :: . : . . .:: :.:::::.. .: ...::: .: . :: NP_066 LRARRGGRSRPKS-ELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKL 70 80 90 100 110 90 100 110 120 130 140 pF1KE1 SKVETLRSAVEYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASP-SR .:.:::: : .:: :: . : : :: :. . ..: : :: :: :. NP_066 TKIETLRFAHNYIWALTQTLRIADHSLYALEPPA-PHCGELGSPGGSPGDWGSLYSPVSQ 120 130 140 150 160 170 150 160 170 180 190 pF1KE1 ASS-SPGRGGSSEPGSPRSAYSSDDSGCEGALSPAERELLDFSSWLGGY :.: ::. . .:: ...:. : :::. . :: NP_066 AGSLSPAASLEERPGLLGATFSA----C---LSPGSLAFSDFL 180 190 200 210 >>NP_803238 (OMIM: 608606) class A basic helix-loop-heli (189 aa) initn: 227 init1: 162 opt: 215 Z-score: 181.5 bits: 40.3 E(85289): 0.0023 Smith-Waterman score: 215; 42.7% identity (65.0% similar) in 103 aa overlap (4-102:28-127) 10 20 30 pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSR :.:: :.: : .: :. . .: NP_803 MKTKNRPPRRRAPVQDTEATPGEGTPDGSLPN---PGPEPAKGLRSRPARAAARAPGEGR 10 20 30 40 50 40 50 60 70 80 90 pF1KE1 RRRPATAETGGGA-AAVARR---NERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETL ::::. . :: ... :: :::::.:.. .: .:::::. .:: :.:::::.::: NP_803 RRRPGPSGPGGRRDSSIQRRLESNERERQRMHKLNNAFQALREVIPHVRADKKLSKIETL 60 70 80 90 100 110 100 110 120 130 140 150 pF1KE1 RSAVEYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGR : .::..: NP_803 TLAKNYIKSLTATILTMSSSRLPGLEGPGPKLYQHYQQQQQVAGGALGATEAQPQGHLQR 120 130 140 150 160 170 >>NP_065697 (OMIM: 609154) achaete-scute homolog 3 [Homo (181 aa) initn: 198 init1: 198 opt: 213 Z-score: 180.2 bits: 40.0 E(85289): 0.0027 Smith-Waterman score: 213; 42.6% identity (61.4% similar) in 101 aa overlap (6-106:53-149) 10 20 30 pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCS ::: :. . .: .. : : . . NP_065 LPLTRSFYLEPMVTFHVHPEAPVSSPYSEELPRLPFPSDSLILGNYSEPCPFSFPMPYPN 30 40 50 60 70 80 40 50 60 70 80 90 pF1KE1 RRRRPATAETGGGAAAVARRNERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSA : : . : : . .::::::.::: :: :. ::.:.:. :.::::::::.: NP_065 YR----GCEYSYGPAFTRKRNERERQRVKCVNEGYAQLRHHLPEEYLEKRLSKVETLRAA 90 100 110 120 130 100 110 120 130 140 150 pF1KE1 VEYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGGS ..:: :: :: NP_065 IKYINYLQSLLYPDKAETKNNPGKVSSMIATTSHHADPMFRIV 140 150 160 170 180 >>NP_076924 (OMIM: 606624) neurogenin-2 [Homo sapiens] (272 aa) initn: 194 init1: 132 opt: 204 Z-score: 171.1 bits: 38.9 E(85289): 0.0086 Smith-Waterman score: 212; 34.2% identity (55.6% similar) in 187 aa overlap (19-189:73-251) 10 20 30 40 pF1KE1 MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSRRRRPATAET--- :: ::: : . .:::. :.. NP_076 EEPGASGGARRQRGAEAGQGARGGVAAGAEGC----RPARLLGLVHDCKRRPSRARAVSR 50 60 70 80 90 50 60 70 80 90 pF1KE1 GGGAAAVARR---------NERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSAV :. .: ...: :.:::::.. .: ...:::. .: . ::.:.:::: : NP_076 GAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAH 100 110 120 130 140 150 100 110 120 130 140 150 pF1KE1 EYIRALQRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGRG--- .:: :: . : : .. .::: : :. : :: . .: : : : ::. NP_076 NYIWALTETLRLADHCGGG-GGGL-PGALFSEAVLLSPGGASAALSSSGDSPSPASTWSC 160 170 180 190 200 210 160 170 180 190 pF1KE1 -GSSEPGSPRSAYSSDDSGCEGALSPAERELLDFSSWLGGY .: :.: :. :.. .: .:::: :.. : NP_076 TNSPAPSSSVSSNSTSPYSC--TLSPASPAGSDMDYWQPPPPDKHRYAPHLPIARDCI 220 230 240 250 260 270 193 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 15:16:29 2016 done: Sun Nov 6 15:16:30 2016 Total Scan time: 5.940 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]