FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB0989, 135 aa
1>>>pF1KB0989 135 - 135 aa - 135 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6950+/-0.000234; mu= 12.6870+/- 0.015
mean_var=115.0654+/-23.013, 0's: 0 Z-trim(125.4): 22 B-trim: 425 in 1/54
Lambda= 0.119564
statistics sampled from 49138 (49165) to 49138 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.865), E-opt: 0.2 (0.576), width: 16
Scan time: 5.950
The best scores are:                                       opt  bits E(85289)
NP_689781 (OMIM: 610772) homeobox protein Nkx-6.3  ( 135)  944 171.6 3.4e-43
XP_016868632 (OMIM: 610772) PREDICTED: homeobox pr ( 265)  551 104.1 1.4e-22
NP_006159 (OMIM: 602563) homeobox protein Nkx-6.1  ( 367)  246  51.7 1.2e-06
XP_016872278 (OMIM: 605955) PREDICTED: homeobox pr ( 277)  244  51.2 1.2e-06
NP_796374 (OMIM: 605955) homeobox protein Nkx-6.2  ( 277)  241  50.7 1.8e-06
>>NP_689781 (OMIM: 610772) homeobox protein Nkx-6.3 [Hom (135 aa)
initn: 944 init1: 944 opt: 944 Z-score: 896.5 bits: 171.6 E(85289): 3.4e-43
Smith-Waterman score: 944; 100.0% identity (100.0% similar) in 135 aa overlap (1-135:1-135)
               10        20        30        40        50        60
pF1KB0 MQQGQLAPGSRLCSGPWGLPELQPAAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_689 MQQGQLAPGSRLCSGPWGLPELQPAAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB0 RTKWRKKSALEPSSSTPRAPGGAGAGAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_689 RTKWRKKSALEPSSSTPRAPGGAGAGAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRK
               70        80        90       100       110       120

              130
pF1KB0 HRAAFSVLSLGAHSV
       :::::::::::::::
NP_689 HRAAFSVLSLGAHSV
              130
>>XP_016868632 (OMIM: 610772) PREDICTED: homeobox protei (265 aa)
initn: 573 init1: 551 opt: 551 Z-score: 526.5 bits: 104.1 E(85289): 1.4e-22
Smith-Waterman score: 551; 100.0% identity (100.0% similar) in 81 aa overlap (55-135:185-265)
           30        40        50        60        70        80
pF1KB0 AAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNRRTKWRKKSALEPSSSTPRAPGGAG
                                     ::::::::::::::::::::::::::::::
XP_016 EKTFEQTKYLAGPERARLAYSLGMTESQVKVWFQNRRTKWRKKSALEPSSSTPRAPGGAG
          160       170       180       190       200       210

           90       100       110       120       130
pF1KB0 AGAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRKHRAAFSVLSLGAHSV
       :::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 AGAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRKHRAAFSVLSLGAHSV
          220       230       240       250       260
>>NP_006159 (OMIM: 602563) homeobox protein Nkx-6.1 [Hom (367 aa)
initn: 294 init1: 149 opt: 246 Z-score: 240.4 bits: 51.7 E(85289): 1.2e-06
Smith-Waterman score: 246; 52.9% identity (77.1% similar) in 70 aa overlap (55-124:282-349)
30 40 50 60 70 80
pF1KB0 AAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNRRTKWRKKSALEPSSSTPRAPGGAG
::::::::::::: : : ... . . .
NP_006 EKTFEQTKYLAGPERARLAYSLGMTESQVKVWFQNRRTKWRKKHAAEMATAKKKQDSETE
260 270 280 290 300 310
90 100 110 120 130
pF1KB0 AGAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRKHRAAFSVLSLGAHSV
:... .:.:::.:::::::.:::::: ::.::...
NP_006 RLKGASE--NEEEDDDYNKPLDPNSDDEKITQLLKKHKSSSGGGGGLLLHASEPESSS
320 330 340 350 360
>>XP_016872278 (OMIM: 605955) PREDICTED: homeobox protei (277 aa)
initn: 297 init1: 141 opt: 244 Z-score: 240.1 bits: 51.2 E(85289): 1.2e-06
Smith-Waterman score: 244; 55.1% identity (75.4% similar) in 69 aa overlap (55-122:194-259)
30 40 50 60 70 80
pF1KB0 AAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNRRTKWRKKSALEPSSSTPRAPGGAG
::::::::::::. :.: .:. . . :
XP_016 EKTFEQTKYLAGPERARLAYSLGMTESQVKVWFQNRRTKWRKRHAVEMASAKKKQDSDAE
170 180 190 200 210 220
90 100 110 120 130
pF1KB0 A-GAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRKHRAAFSVLSLGAHSV
.::. : ..:::::.::::.:::::: ::.::.
XP_016 KLKVGGSDA---EDDDEYNRPLDPNSDDEKITRLLKKHKPSNLALVSPCGGGAGDAL
230 240 250 260 270
>>NP_796374 (OMIM: 605955) homeobox protein Nkx-6.2 [Hom (277 aa)
initn: 294 init1: 141 opt: 241 Z-score: 237.3 bits: 50.7 E(85289): 1.8e-06
Smith-Waterman score: 241; 55.1% identity (73.9% similar) in 69 aa overlap (55-122:194-259)
30 40 50 60 70 80
pF1KB0 AAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNRRTKWRKKSALEPSSSTPRAPGGAG
::::::::::::. : : .:. . . :
NP_796 EKTFEQTKYLAGPERARLAYSLGMTESQVKVWFQNRRTKWRKRHAAEMASAKKKQDSDAE
170 180 190 200 210 220
90 100 110 120 130
pF1KB0 A-GAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRKHRAAFSVLSLGAHSV
.::. : ..:::::.::::.:::::: ::.::.
NP_796 KLKVGGSDA---EDDDEYNRPLDPNSDDEKITRLLKKHKPSNLALVSPCGGGAGDAL
230 240 250 260 270
135 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 21:27:46 2016 done: Sat Nov 5 21:27:47 2016
Total Scan time: 5.950 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]
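
The "best scores" summary lines in the report above follow a regular layout: accession, description, library sequence length in parentheses, then the opt score, bit score, and E() value. A minimal Python sketch for extracting those fields is given below. It is not part of the FASTA package; the names `Hit`, `HIT_RE`, and `parse_hit` are hypothetical, and the pattern assumes only the layout seen in this report, so other FASTA output options may need a different expression.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'best scores' summary (hypothetical container)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed layout: <accession> <description> ( <length>) <opt> <bits> <E-value>
# The description may itself contain parentheses such as "(OMIM: 610772)";
# those are skipped because the length field holds digits only.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*"
    r"\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_hit(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None for any other line."""
    m = HIT_RE.match(line.strip())
    if m is None:
        return None
    try:
        evalue = float(m["evalue"])
    except ValueError:
        # Last column was not a number, so this is not a summary row.
        return None
    return Hit(
        accession=m["acc"],
        description=m["desc"],
        length=int(m["len"]),
        opt=int(m["opt"]),
        bits=float(m["bits"]),
        evalue=evalue,
    )
```

Applied line by line to a report, `parse_hit` returns `None` for the column header and for the `>>` alignment headers, so the surviving `Hit` records can be filtered directly, for example by keeping only those with `evalue` below a chosen threshold.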