FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9648, 391 aa
1>>>pF1KB9648 391 - 391 aa - 391 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.7709+/-0.000406; mu= 5.1621+/- 0.026
mean_var=333.5096+/-70.170, 0's: 0 Z-trim(122.4): 76 B-trim: 46 in 1/50
Lambda= 0.070230
statistics sampled from 40330 (40439) to 40330 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.763), E-opt: 0.2 (0.474), width: 16
Scan time: 10.240
The best scores are: opt bits E(85289)
NP_005977 (OMIM: 602148) transcription factor SOX- ( 391) 2681 284.9 2.2e-76
NP_005625 (OMIM: 300123,312000,313430) transcripti ( 446) 833 97.8 5.5e-20
NP_003097 (OMIM: 184429,189960,206900) transcripti ( 317) 780 92.2 1.8e-18
NP_009015 (OMIM: 604974) transcription factor SOX- ( 276) 611 75.0 2.4e-13
NP_004180 (OMIM: 604747) transcription factor SOX- ( 240) 602 74.0 4.2e-13
NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapi ( 233) 485 62.2 1.5e-09
NP_003098 (OMIM: 184430) transcription factor SOX- ( 474) 486 62.7 2.2e-09
NP_071899 (OMIM: 610928,613674) transcription fact ( 414) 451 59.0 2.3e-08
NP_055402 (OMIM: 605923) transcription factor SOX- ( 446) 448 58.8 3e-08
NP_003099 (OMIM: 600898,615866) transcription fact ( 441) 433 57.2 8.6e-08
NP_003131 (OMIM: 400044,400045,480000) sex-determi ( 204) 425 56.0 9.4e-08
NP_113627 (OMIM: 612202) transcription factor SOX- ( 388) 430 56.9 9.8e-08
NP_008874 (OMIM: 601947) transcription factor SOX- ( 315) 417 55.4 2.2e-07
NP_060889 (OMIM: 137940,601618,607823) transcripti ( 384) 411 54.9 3.7e-07
NP_008872 (OMIM: 602229,609136,611584,613266) tran ( 466) 400 53.9 9.1e-07
NP_000337 (OMIM: 114290,608160,616425) transcripti ( 509) 381 52.1 3.6e-06
NP_821078 (OMIM: 604975,616803) transcription fact ( 377) 345 48.2 3.8e-05
XP_011519144 (OMIM: 604975,616803) PREDICTED: tran ( 415) 345 48.3 4e-05
NP_001248343 (OMIM: 604975,616803) transcription f ( 642) 345 48.5 5.3e-05
XP_016875389 (OMIM: 604975,616803) PREDICTED: tran ( 715) 345 48.6 5.6e-05
XP_016875387 (OMIM: 604975,616803) PREDICTED: tran ( 715) 345 48.6 5.6e-05
XP_016875388 (OMIM: 604975,616803) PREDICTED: tran ( 715) 345 48.6 5.6e-05
XP_016875390 (OMIM: 604975,616803) PREDICTED: tran ( 715) 345 48.6 5.6e-05
XP_016875386 (OMIM: 604975,616803) PREDICTED: tran ( 716) 345 48.6 5.6e-05
NP_001317714 (OMIM: 604975,616803) transcription f ( 728) 345 48.6 5.7e-05
XP_011519140 (OMIM: 604975,616803) PREDICTED: tran ( 729) 345 48.6 5.7e-05
NP_694534 (OMIM: 604975,616803) transcription fact ( 750) 345 48.6 5.8e-05
XP_016875385 (OMIM: 604975,616803) PREDICTED: tran ( 750) 345 48.6 5.8e-05
XP_011519137 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05
XP_016875383 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05
XP_016875382 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05
XP_011519139 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05
XP_016875384 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05
XP_016875380 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05
XP_016875379 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05
XP_016875381 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05
XP_011519136 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05
NP_001248344 (OMIM: 604975,616803) transcription f ( 753) 345 48.6 5.8e-05
XP_011519135 (OMIM: 604975,616803) PREDICTED: tran ( 754) 345 48.6 5.8e-05
NP_008871 (OMIM: 604975,616803) transcription fact ( 763) 345 48.6 5.8e-05
XP_011519134 (OMIM: 604975,616803) PREDICTED: tran ( 764) 345 48.6 5.8e-05
XP_016875378 (OMIM: 604975,616803) PREDICTED: tran ( 792) 345 48.7 6e-05
XP_016875377 (OMIM: 604975,616803) PREDICTED: tran ( 793) 345 48.7 6e-05
NP_001139283 (OMIM: 607257) transcription factor S ( 801) 330 47.1 0.00017
NP_059978 (OMIM: 607257) transcription factor SOX- ( 804) 330 47.1 0.00017
NP_201583 (OMIM: 607257) transcription factor SOX- ( 808) 330 47.1 0.00017
NP_001139291 (OMIM: 607257) transcription factor S ( 841) 330 47.2 0.00018
XP_005245680 (OMIM: 604748) PREDICTED: transcripti ( 621) 325 46.5 0.00021
NP_005677 (OMIM: 604748) transcription factor SOX- ( 622) 325 46.5 0.00021
XP_005265860 (OMIM: 606698) PREDICTED: transcripti ( 448) 279 41.6 0.0043
>>NP_005977 (OMIM: 602148) transcription factor SOX-1 [H (391 aa)
initn: 2681 init1: 2681 opt: 2681 Z-score: 1492.4 bits: 284.9 E(85289): 2.2e-76
Smith-Waterman score: 2681; 100.0% identity (100.0% similar) in 391 aa overlap (1-391:1-391)
               10        20        30        40        50        60
pF1KB9 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMV
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB9 WSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 WSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKY
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB9 RPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGAAGGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 RPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGAAGGG
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB9 YAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 YAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAHP
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB9 HNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 HNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAA
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB9 AAAAAASSGALGALGSLVKSEPSGSPPAPAHSRAPCPGDLREMISMYLPAGEGGDPAAAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_005 AAAAAASSGALGALGSLVKSEPSGSPPAPAHSRAPCPGDLREMISMYLPAGEGGDPAAAA
              310       320       330       340       350       360
              370       380       390
pF1KB9 AAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI
       :::::::::::::::::::::::::::::::
NP_005 AAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI
              370       380       390
>>NP_005625 (OMIM: 300123,312000,313430) transcription f (446 aa)
initn: 1125 init1: 724 opt: 833 Z-score: 479.8 bits: 97.8 E(85289): 5.5e-20
Smith-Waterman score: 1326; 58.9% identity (77.1% similar) in 389 aa overlap (12-391:102-446)
10 20 30 40
pF1KB9 MYSMMMETDLHSPGGA-QAPTNLSGPAGAGGGGGGGGGGGG
.:::: .. .: .: :..:::..::..:::
NP_005 PAPAMYSLLETELKNPVGTPTQAAGTGGPAAPGGAGKSSANAAGGANSGGGSSGGASGGG
80 90 100 110 120 130
50 60 70 80 90 100
pF1KB9 GGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPF
:: ..::::::::::::::::::::::: ::::::::::::::::.::....::::::
NP_005 GG---TDQDRVKRPMNAFMVWSRGQRRKMALENPKMHNSEISKRLGADWKLLTDAEKRPF
140 150 160 170 180
110 120 130 140 150 160
pF1KB9 IDEAKRLRALHMKEHPDYKYRPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVG
:::::::::.::::.:::::::::::::::::::::: .::: ::....::.: .....
NP_005 IDEAKRLRAVHMKEYPDYKYRPRRKTKTLLKKDKYSLPSGLLPPGAAAAAAAAAAAAAAA
190 200 210 220 230 240
170 180 190 200 210 220
pF1KB9 VGAAAVGQRLESPGGAAGGGYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAG
. ..:::::.. :.:::::::::: ...:: ::.:.: :. .
NP_005 SSPVGVGQRLDT--------YTHVNGWANGAY-----------SLVQE-QLGYAQPPSMS
250 260 270 280
230 240 250 260 270
pF1KB9 GAHPHAHPAHPHPHHPHAHPHNPQPMHRYDMGALQYSPI--SNSQGYMS-----ASPSGY
. : : : : : :::::::..:::::. ..:.::. :. :::
NP_005 S---------PPP--PPALP----PMHRYDMAGLQYSPMMPPGAQSYMNVAAAAAAASGY
290 300 310 320 330
280 290 300 310 320 330
pF1KB9 GGLPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLVKSEPSGSPPAPA-HS
::. .:.:::::: : :. :.:::::::::. .:: .::.::::::. ::: : ::
NP_005 GGMAPSATAAAAAAYG---QQPATAAAAAAAAAAM-SLGPMGSVVKSEPSSPPPAIASHS
340 350 360 370 380
340 350 360 370 380 390
pF1KB9 RAPCPGDLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI
. : ::::.::::::: ::: : ::. .:::.. :::::::..::::::::::
NP_005 QRACLGDLRDMISMYLPP--GGDAADAASPLPGGRLHGVHQHYQGAGTAVNGTVPLTHI
390 400 410 420 430 440
>>NP_003097 (OMIM: 184429,189960,206900) transcription f (317 aa)
initn: 1037 init1: 728 opt: 780 Z-score: 452.4 bits: 92.2 E(85289): 1.8e-18
Smith-Waterman score: 1167; 52.9% identity (69.4% similar) in 399 aa overlap (1-391:1-317)
10 20 30 40 50 60
pF1KB9 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMV
::.:: ::.:. :: :. .:::::.. ....::. : . ::::::::::::
NP_003 MYNMM-ETELKPPGPQQT---------SGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMV
10 20 30 40 50
70 80 90 100 110 120
pF1KB9 WSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKY
::::::::::::::::::::::::::::::..::.:::::::::::::::::::::::::
NP_003 WSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKY
60 70 80 90 100 110
130 140 150 160 170
pF1KB9 RPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVG-AAAVGQRLESPGGAAGG
:::::::::.:::::.: ::::: : : ..: :::::.: .:.:.::..:
NP_003 RPRRKTKTLMKKDKYTLPGGLLAPG----GNSMASGVGVGAGLGAGVNQRMDS-------
120 130 140 150
180 190 200 210 220 230
pF1KB9 GYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAH
:::.:::.::.: .:::. ::.: :::: .:: :
NP_003 -YAHMNGWSNGSY-----------SMMQD-QLGYPQHPGL-----NAHGAAQM-------
160 170 180 190
240 250 260 270 280 290
pF1KB9 PHNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAA
:::::::..::::. ...:: ::..::. :. . . .. : :
NP_003 ----QPMHRYDVSALQYNSMTSSQTYMNGSPT------YSMSYSQQGTPGMA--------
200 210 220 230
300 310 320 330 340 350
pF1KB9 AAAAAAASSGALGALGSLVKSEPSGSPP---APAHSRAPCP-GDLREMISMYLPAGEGGD
::..::.:::: :.::: . .:::::: ::::.:::::::..: .
NP_003 -----------LGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPE
240 250 260 270 280
360 370 380 390
pF1KB9 PAAAAAAAAQSRLHSLPQHYQGA---GAGVNGTVPLTHI
::: :::: . ::::.. :...:::.::.:.
NP_003 PAAP------SRLH-MSQHYQSGPVPGTAINGTLPLSHM
290 300 310
>>NP_009015 (OMIM: 604974) transcription factor SOX-21 [ (276 aa)
initn: 742 init1: 555 opt: 611 Z-score: 360.6 bits: 75.0 E(85289): 2.4e-13
Smith-Waterman score: 706; 46.4% identity (64.1% similar) in 323 aa overlap (49-364:6-275)
20 30 40 50 60 70
pF1KB9 PTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHN
:.:::::::::::::.::::::::::::::
NP_009 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHN
10 20 30
80 90 100 110 120 130
pF1KB9 SEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLLKKDKYSLA
::::::::::::...:.::::::::::::::.::::::::::::::: ::::::::...
NP_009 SEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFP
40 50 60 70 80 90
140 150 160 170 180 190
pF1KB9 GGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGAAGGGYAHVNGWANGAYPGSVAA
. : :: . : .. .:.: : .:::: . . :: : ..::
NP_009 ---VPYGLGGVADAEHPALKAGAGLHA----------GAGGGLVPESLLAN---PEKAAA
100 110 120 130
200 210 220 230 240 250
pF1KB9 AAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAHPHNPQPMHRYDMGALQYSP
:::::: :.. . : .:..: : : .:. :.:. ...
NP_009 AAAAAA----ARVFFPQSAAAAAAAAAAAAAG-------------SPYSLLDLGS-KMAE
140 150 160 170 180
260 270 280 290 300 310
pF1KB9 ISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLV
::.:.. ::::... . .:..:: ...:.::::::::: :. .
NP_009 ISSSSS----------GLPYASSLGYPTAGAGAFHGAAAAAAAAAAAA--------GGHT
190 200 210 220
320 330 340 350 360 370
pF1KB9 KSEPSGSPPA---PAHSRA-PCPGDLREMISMYLPAGEGG---DPAAAAAAAAQSRLHSL
.:.:: . :. : . : : :: . . :: : : :: :: :::
NP_009 HSHPSPGNPGYMIPCNCSAWPSPGLQPPLAYILLP-GMGKPQLDPYPAAYAAAL
230 240 250 260 270
380 390
pF1KB9 PQHYQGAGAGVNGTVPLTHI
>>NP_004180 (OMIM: 604747) transcription factor SOX-14 [ (240 aa)
initn: 590 init1: 563 opt: 602 Z-score: 356.3 bits: 74.0 E(85289): 4.2e-13
Smith-Waterman score: 602; 46.5% identity (64.7% similar) in 241 aa overlap (49-285:6-239)
20 30 40 50 60 70
pF1KB9 PTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHN
:..:::::::::::::::::::::::::::
NP_004 MSKPSDHIKRPMNAFMVWSRGQRRKMAQENPKMHN
10 20 30
80 90 100 110 120 130
pF1KB9 SEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLLKKDKYSLA
::::::::::::..:::::::.::::::::: ::::::::::::::: :.:::::.: .
NP_004 SEISKRLGAEWKLLSEAEKRPYIDEAKRLRAQHMKEHPDYKYRPRRKPKNLLKKDRYVFP
40 50 60 70 80 90
140 150 160 170 180 190
pF1KB9 GGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGAAGGGYAHVNGWANGAYP--GSV
:. .:.. .:.. :. .: : : ..: . ....: : :
NP_004 LPYLGDTDPLKAAGLPVGASDGLLSAPEKARAFLPPASAPYSLLDPAQFSSSAIQKMGEV
100 110 120 130 140 150
200 210 220 230 240 250
pF1KB9 AAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPH-HPHAHPHNPQPMHRYDMGALQ
. :..:. . :.: . :: :. . : : : : : :: . . : .
NP_004 PHTLATGALPYASTLGY--QNGAFGSL-----SCPSQHTHTHPSPTNPGYVVPCNCTAWS
160 170 180 190 200
260 270 280 290 300 310
pF1KB9 YSPISNSQGYMSASPSGYGGL-PYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGAL
: .. .:. :. ::..: :.:
NP_004 ASTLQPPVAYILFPGMTKTGIDPYSSAHATAM
210 220 230 240
320 330 340 350 360 370
pF1KB9 GSLVKSEPSGSPPAPAHSRAPCPGDLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLPQH
>>NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapiens] (233 aa)
initn: 476 init1: 446 opt: 485 Z-score: 292.4 bits: 62.2 E(85289): 1.5e-09
Smith-Waterman score: 495; 39.8% identity (59.8% similar) in 246 aa overlap (10-246:14-228)
10 20 30 40 50
pF1KB9 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMN
:. :... : .. ::: :.:. .. : ..::::::
NP_008 MALPGSSQDQAWSLEPPAATAAASSSSGPQEREGAGSPAAPG------TLPLEKVKRPMN
10 20 30 40 50
60 70 80 90 100 110
pF1KB9 AFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHP
:::::: .:::.:::.:::::::::::::::.::...: :::::..::::::: :....:
NP_008 AFMVWSSAQRRQMAQQNPKMHNSEISKRLGAQWKLLDEDEKRPFVEEAKRLRARHLRDYP
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB9 DYKYRPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGA
:::::::::.:. .::: . : : : : : : .:: :
NP_008 DYKYRPRRKAKS---------------SGAGPSR------CGQGRGNLASGGPLWGPGYA
120 130 140 150
180 190 200 210 220
pF1KB9 A-----GGGYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHP-HAH---
. : :: . ..... ::: ... . .: .. : : ..:
NP_008 TTQPSRGFGY-RPPSYSTAYLPGSYGSSHCKLEAPSPCSLPQSDPRLQGELLPTYTHYLP
160 170 180 190 200 210
230 240 250 260 270 280
pF1KB9 PAHPHPHHPHAHPHNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAAAAAAA
:. : :..: : ::
NP_008 PGSPTPYNP---PLAGAPMPLTHL
220 230
>>NP_003098 (OMIM: 184430) transcription factor SOX-4 [H (474 aa)
initn: 413 init1: 413 opt: 486 Z-score: 289.5 bits: 62.7 E(85289): 2.2e-09
Smith-Waterman score: 506; 33.9% identity (57.9% similar) in 363 aa overlap (12-332:20-378)
10 20 30 40 50
pF1KB9 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVK
: .:: .... :. .. :: . . :. . ..:
NP_003 MVQQTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIK
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB9 RPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHM
::::::::::. .:::. ...: :::.::::::: .::......: ::: ::.::: ::
NP_003 RPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHM
70 80 90 100 110 120
120 130 140 150
pF1KB9 KEHPDYKYRPRRKTKTL---------------LKKDKYSLAGGLLAAGAGGGGAAVAMGV
..::::::::.:.:. : :: . .:: .:.::::.. : :
NP_003 ADYPDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGG
130 140 150 160 170 180
160 170 180 190 200
pF1KB9 GVGV---GAAAVGQRLESPG----GAAGGGYA--HVNGWANGAYPGSVAAAAAAAAMMQE
: :. :: . . .: : :.:::: . :.. :. :. :::::::.. :
NP_003 GGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFAAE
190 200 210 220 230 240
210 220 230 240 250
pF1KB9 -----AQLAYG----QH-------PGAGGAHPHAHPAHPHPHHPHAHPHNPQPMHRYDMG
: : : .: :.:... : : : : . . . : .:
NP_003 QAGAAALLPLGAAADHHSLYKARTPSASASASSAASASAALAAPGKHLAEKKVKRVYLFG
250 260 270 280 290 300
260 270 280 290 300 310
pF1KB9 AL--QYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGA
.: . ::... . .:.:: :: : .:. . . . .. . ::.. ::. : .
NP_003 GLGTSSSPVGGVGA--GADPSDPLGL-YEEEGAGCSPDAPSLSGRSSAASSPAAGRSPAD
310 320 330 340 350
320 330 340 350 360 370
pF1KB9 LGALGSLVKSEPSGSPPAPAHSRAPCPGDLREMISMYLPAGEGGDPAAAAAAAAQSRLHS
. .:: . :. : ::.:.
NP_003 HRGYASLRAASPAPSS-APSHASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFES
360 370 380 390 400 410
>>NP_071899 (OMIM: 610928,613674) transcription factor S (414 aa)
initn: 479 init1: 405 opt: 451 Z-score: 271.0 bits: 59.0 E(85289): 2.3e-08
Smith-Waterman score: 471; 33.4% identity (59.5% similar) in 299 aa overlap (9-293:41-312)
10 20 30
pF1KB9 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGG
:.. : .::.: ..::::.: . :
NP_071 DDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKG--EAPANSGAPAGAAGRAKG----
20 30 40 50 60
40 50 60 70 80 90
pF1KB9 GGGGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKR
..:..:::::::::.. .:...::.:: .::.:.:: :: ::... ::::
NP_071 ---------ESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKR
70 80 90 100 110
100 110 120 130 140 150
pF1KB9 PFIDEAKRLRALHMKEHPDYKYRPRRKTKTL-LKKDKYSLAGGLL---AAGAGGGGAAVA
::..::.:::. ::..::.:::::::. .. ::. . .. :: ::. : :. ::
NP_071 PFVEEAERLRVQHMQDHPNYKYRPRRRKQVKRLKRVEGGFLHGLAEPQAAALGPEGGRVA
120 130 140 150 160 170
160 170 180 190 200
pF1KB9 M-GVGVGVGAAA--VGQRLESPGGAAGGGYAHVNGWAN---GAYPGSVAAAAAAAAMMQE
: :.:. . .: : : :: : .. . .:: . .. .. .
NP_071 MDGLGLQFPEQGFPAGPPLLPPH--MGGHYRDCQSLGAPPLDGYPLPTPDTSPLDGVDPD
180 190 200 210 220 230
210 220 230 240 250 260
pF1KB9 AQLAYGQHPG---AGGAHPHAHPA-HPHPHHPHAHPHNPQPMHRYDMGALQYSPISNSQG
. . :: :.:.. .:. . . : .: : : .:. .: : . :
NP_071 PAFFAAPMPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPR------LGP---EPAGPSIP
240 250 260 270 280
270 280 290 300 310 320
pF1KB9 YMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLVKSEPSG
. : ::. . ::: .. .:..: . :
NP_071 GLLAPPSALH-VYYGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCR
290 300 310 320 330 340
>>NP_055402 (OMIM: 605923) transcription factor SOX-8 [H (446 aa)
initn: 466 init1: 410 opt: 448 Z-score: 269.0 bits: 58.8 E(85289): 3e-08
Smith-Waterman score: 478; 32.9% identity (54.0% similar) in 350 aa overlap (39-357:90-428)
10 20 30 40 50 60
pF1KB9 DLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMVWSRGQRRK
:::::: . .:::::::::::... :::
NP_055 ADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGGGALKAKPHVKRPMNAFMVWAQAARRK
60 70 80 90 100 110
70 80 90 100 110 120
pF1KB9 MAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKT
.:.. :..::.:.:: :: :...::.:::::..::.:::. : :.::::::.:::. :.
NP_055 LADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPRRR-KS
120 130 140 150 160 170
130 140 150 160 170
pF1KB9 LLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVG---AAAVGQRLESPGGA---------
. : .:. :. ::: :. .:.: : . .:: :
NP_055 AKAGHSDSDSGAELGPHPGGG-AVYKAEAGLGDGHHHGDHTGQTHGPPTPPTTPKTELQQ
180 190 200 210 220 230
180 190 200 210 220
pF1KB9 AGG-------GYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQH------PGAGGAH
::. : :.. .. ..: . .. .: . :. : : .: :
NP_055 AGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMD-AFDVHEFDQYLPLGGPAP
240 250 260 270 280 290
230 240 250 260 270
pF1KB9 PHAHPAHPHPH-HPHAHP---HNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYG
:. :. . : : : :. : . . . .: . ::. :: : :
NP_055 PEPGQAYGGAYFHAGASPVWAHKSAP--SASASPTETGPPRPHIKTEQPSPGHYGDQPRG
300 310 320 330 340 350
280 290 300 310 320 330
pF1KB9 AAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLVKSEPSGSPP--APAHSRAPCP
. .. .. :.::. :: :. : :. : :.: : :. : ::. . ::
NP_055 SPDYGSCSG----QSSATPAAPAGPFA--GSQGDYGDLQASSYYGAYPGYAPGLYQYPCF
360 370 380 390 400
340 350 360 370 380 390
pF1KB9 GDLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI
. :. . : : . ::
NP_055 HSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP
410 420 430 440
>>NP_003099 (OMIM: 600898,615866) transcription factor S (441 aa)
initn: 393 init1: 393 opt: 433 Z-score: 260.8 bits: 57.2 E(85289): 8.6e-08
Smith-Waterman score: 466; 33.1% identity (58.8% similar) in 323 aa overlap (45-325:43-353)
20 30 40 50 60 70
pF1KB9 GAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENP
:. . ..:::::::::::. .:::. ...:
NP_003 NLPREALDTEEGEFMACSPVALDESDPDWCKTASGHIKRPMNAFMVWSKIERRKIMEQSP
20 30 40 50 60 70
80 90 100 110 120 130
pF1KB9 KMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLLKKDK
:::.::::::: .::.....:: ::: ::.::: :: ..::::::::.: : . . :
NP_003 DMHNAEISKRLGKRWKMLKDSEKIPFIREAERLRLKHMADYPDYKYRPRKKPK-MDPSAK
80 90 100 110 120 130
140 150 160 170 180
pF1KB9 YSLAGGLLAAGAGGGGAAVAMGVG---VGVGAAAVGQRLESP---GGAAGGGYAHVNGWA
: . . ..:::::.... :.: .. :.. .:..: :. ::.: : .:
NP_003 PSASQSPEKSAAGGGGGSAGGGAGGAKTSKGSSKKCGKLKAPAAAGAKAGAGKAAQSGDY
140 150 160 170 180 190
190 200 210 220
pF1KB9 NGAYP----GSV----AAAAAAAAMMQ---------------EAQLAYGQHPGAGGAHPH
.:: ::. .....:. .. : :: :.: .:
NP_003 GGAGDDYVLGSLRVSGSGGGGAGKTVKCVFLDEDDDDDDDDDELQLQIKQEPDEEDEEP-
200 210 220 230 240 250
230 240 250 260 270 280
pF1KB9 AHPAHPHPHHPHAHPHNPQP---MHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAA
::. .: . :: ..::... . :: .:. . :: : . :.
NP_003 -------PHQQLLQPPGQQPSQLLRRYNVAKVPASPTLSSS---AESPEGASLYDEVRAG
260 270 280 290 300
290 300 310 320 330
pF1KB9 AAAAAAGGAH----------QNSAVAAAAAAAAASSGALGALGSLVKSEPSGSPPAPAHS
:...:.::.. :. : : . ::: .... .: .. :::
NP_003 ATSGAGGGSRLYYSFKNITKQHPPPLAQPALSPASSRSVSTSSSSSSGSSSGSSGEDADD
310 320 330 340 350 360
340 350 360 370 380 390
pF1KB9 RAPCPGDLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI
NP_003 LMFDLSLNFSQSAHSASEQQLGGGAAAGNLSLSLVDKDLDSFSEGSLGSHFEFPDYCTPE
370 380 390 400 410 420
391 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 02:06:51 2016 done: Tue Nov 8 02:06:52 2016
Total Scan time: 10.240 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]