FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9648, 391 aa 1>>>pF1KB9648 391 - 391 aa - 391 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.7709+/-0.000406; mu= 5.1621+/- 0.026 mean_var=333.5096+/-70.170, 0's: 0 Z-trim(122.4): 76 B-trim: 46 in 1/50 Lambda= 0.070230 statistics sampled from 40330 (40439) to 40330 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.763), E-opt: 0.2 (0.474), width: 16 Scan time: 10.240 The best scores are: opt bits E(85289) NP_005977 (OMIM: 602148) transcription factor SOX- ( 391) 2681 284.9 2.2e-76 NP_005625 (OMIM: 300123,312000,313430) transcripti ( 446) 833 97.8 5.5e-20 NP_003097 (OMIM: 184429,189960,206900) transcripti ( 317) 780 92.2 1.8e-18 NP_009015 (OMIM: 604974) transcription factor SOX- ( 276) 611 75.0 2.4e-13 NP_004180 (OMIM: 604747) transcription factor SOX- ( 240) 602 74.0 4.2e-13 NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapi ( 233) 485 62.2 1.5e-09 NP_003098 (OMIM: 184430) transcription factor SOX- ( 474) 486 62.7 2.2e-09 NP_071899 (OMIM: 610928,613674) transcription fact ( 414) 451 59.0 2.3e-08 NP_055402 (OMIM: 605923) transcription factor SOX- ( 446) 448 58.8 3e-08 NP_003099 (OMIM: 600898,615866) transcription fact ( 441) 433 57.2 8.6e-08 NP_003131 (OMIM: 400044,400045,480000) sex-determi ( 204) 425 56.0 9.4e-08 NP_113627 (OMIM: 612202) transcription factor SOX- ( 388) 430 56.9 9.8e-08 NP_008874 (OMIM: 601947) transcription factor SOX- ( 315) 417 55.4 2.2e-07 NP_060889 (OMIM: 137940,601618,607823) transcripti ( 384) 411 54.9 3.7e-07 NP_008872 (OMIM: 602229,609136,611584,613266) tran ( 466) 400 53.9 9.1e-07 NP_000337 (OMIM: 114290,608160,616425) transcripti ( 509) 381 52.1 3.6e-06 NP_821078 (OMIM: 604975,616803) transcription fact ( 377) 345 48.2 3.8e-05 XP_011519144 (OMIM: 604975,616803) PREDICTED: tran ( 415) 345 48.3 4e-05 NP_001248343 (OMIM: 604975,616803) transcription f ( 642) 345 48.5 5.3e-05 XP_016875389 (OMIM: 604975,616803) PREDICTED: tran ( 715) 345 48.6 5.6e-05 XP_016875387 (OMIM: 604975,616803) PREDICTED: tran ( 715) 345 48.6 5.6e-05 XP_016875388 (OMIM: 604975,616803) PREDICTED: tran ( 715) 345 48.6 5.6e-05 XP_016875390 (OMIM: 604975,616803) PREDICTED: tran ( 715) 345 48.6 5.6e-05 XP_016875386 (OMIM: 604975,616803) PREDICTED: tran ( 716) 345 48.6 5.6e-05 NP_001317714 (OMIM: 604975,616803) transcription f ( 728) 345 48.6 5.7e-05 XP_011519140 (OMIM: 604975,616803) PREDICTED: tran ( 729) 345 48.6 5.7e-05 NP_694534 (OMIM: 604975,616803) transcription fact ( 750) 345 48.6 5.8e-05 XP_016875385 (OMIM: 604975,616803) PREDICTED: tran ( 750) 345 48.6 5.8e-05 XP_011519137 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05 XP_016875383 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05 XP_016875382 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05 XP_011519139 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05 XP_016875384 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05 XP_016875380 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05 XP_016875379 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05 XP_016875381 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05 XP_011519136 (OMIM: 604975,616803) PREDICTED: tran ( 751) 345 48.6 5.8e-05 NP_001248344 (OMIM: 604975,616803) transcription f ( 753) 345 48.6 5.8e-05 XP_011519135 (OMIM: 604975,616803) PREDICTED: tran ( 754) 345 48.6 5.8e-05 NP_008871 (OMIM: 604975,616803) transcription fact ( 763) 345 48.6 5.8e-05 XP_011519134 (OMIM: 604975,616803) PREDICTED: tran ( 764) 345 48.6 5.8e-05 XP_016875378 (OMIM: 604975,616803) PREDICTED: tran ( 792) 345 48.7 6e-05 XP_016875377 (OMIM: 604975,616803) PREDICTED: tran ( 793) 345 48.7 6e-05 NP_001139283 (OMIM: 607257) transcription factor S ( 801) 330 47.1 0.00017 NP_059978 (OMIM: 607257) transcription factor SOX- ( 804) 330 47.1 0.00017 NP_201583 (OMIM: 607257) transcription factor SOX- ( 808) 330 47.1 0.00017 NP_001139291 (OMIM: 607257) transcription factor S ( 841) 330 47.2 0.00018 XP_005245680 (OMIM: 604748) PREDICTED: transcripti ( 621) 325 46.5 0.00021 NP_005677 (OMIM: 604748) transcription factor SOX- ( 622) 325 46.5 0.00021 XP_005265860 (OMIM: 606698) PREDICTED: transcripti ( 448) 279 41.6 0.0043 >>NP_005977 (OMIM: 602148) transcription factor SOX-1 [H (391 aa) initn: 2681 init1: 2681 opt: 2681 Z-score: 1492.4 bits: 284.9 E(85289): 2.2e-76 Smith-Waterman score: 2681; 100.0% identity (100.0% similar) in 391 aa overlap (1-391:1-391) 10 20 30 40 50 60 pF1KB9 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 WSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 WSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 RPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGAAGGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 RPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGAAGGG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 YAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAHP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 YAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAHP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 HNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 HNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 AAAAAASSGALGALGSLVKSEPSGSPPAPAHSRAPCPGDLREMISMYLPAGEGGDPAAAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 AAAAAASSGALGALGSLVKSEPSGSPPAPAHSRAPCPGDLREMISMYLPAGEGGDPAAAA 310 320 330 340 350 360 370 380 390 pF1KB9 AAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI ::::::::::::::::::::::::::::::: NP_005 AAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI 370 380 390 >>NP_005625 (OMIM: 300123,312000,313430) transcription f (446 aa) initn: 1125 init1: 724 opt: 833 Z-score: 479.8 bits: 97.8 E(85289): 5.5e-20 Smith-Waterman score: 1326; 58.9% identity (77.1% similar) in 389 aa overlap (12-391:102-446) 10 20 30 40 pF1KB9 MYSMMMETDLHSPGGA-QAPTNLSGPAGAGGGGGGGGGGGG .:::: .. .: .: :..:::..::..::: NP_005 PAPAMYSLLETELKNPVGTPTQAAGTGGPAAPGGAGKSSANAAGGANSGGGSSGGASGGG 80 90 100 110 120 130 50 60 70 80 90 100 pF1KB9 GGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPF :: ..::::::::::::::::::::::: ::::::::::::::::.::....:::::: NP_005 GG---TDQDRVKRPMNAFMVWSRGQRRKMALENPKMHNSEISKRLGADWKLLTDAEKRPF 140 150 160 170 180 110 120 130 140 150 160 pF1KB9 IDEAKRLRALHMKEHPDYKYRPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVG :::::::::.::::.:::::::::::::::::::::: .::: ::....::.: ..... NP_005 IDEAKRLRAVHMKEYPDYKYRPRRKTKTLLKKDKYSLPSGLLPPGAAAAAAAAAAAAAAA 190 200 210 220 230 240 170 180 190 200 210 220 pF1KB9 VGAAAVGQRLESPGGAAGGGYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAG . ..:::::.. :.:::::::::: ...:: ::.:.: :. . NP_005 SSPVGVGQRLDT--------YTHVNGWANGAY-----------SLVQE-QLGYAQPPSMS 250 260 270 280 230 240 250 260 270 pF1KB9 GAHPHAHPAHPHPHHPHAHPHNPQPMHRYDMGALQYSPI--SNSQGYMS-----ASPSGY . : : : : : :::::::..:::::. ..:.::. :. ::: NP_005 S---------PPP--PPALP----PMHRYDMAGLQYSPMMPPGAQSYMNVAAAAAAASGY 290 300 310 320 330 280 290 300 310 320 330 pF1KB9 GGLPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLVKSEPSGSPPAPA-HS ::. .:.:::::: : :. :.:::::::::. .:: .::.::::::. ::: : :: NP_005 GGMAPSATAAAAAAYG---QQPATAAAAAAAAAAM-SLGPMGSVVKSEPSSPPPAIASHS 340 350 360 370 380 340 350 360 370 380 390 pF1KB9 RAPCPGDLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI . : ::::.::::::: ::: : ::. .:::.. :::::::..:::::::::: NP_005 QRACLGDLRDMISMYLPP--GGDAADAASPLPGGRLHGVHQHYQGAGTAVNGTVPLTHI 390 400 410 420 430 440 >>NP_003097 (OMIM: 184429,189960,206900) transcription f (317 aa) initn: 1037 init1: 728 opt: 780 Z-score: 452.4 bits: 92.2 E(85289): 1.8e-18 Smith-Waterman score: 1167; 52.9% identity (69.4% similar) in 399 aa overlap (1-391:1-317) 10 20 30 40 50 60 pF1KB9 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMV ::.:: ::.:. :: :. .:::::.. ....::. : . :::::::::::: NP_003 MYNMM-ETELKPPGPQQT---------SGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMV 10 20 30 40 50 70 80 90 100 110 120 pF1KB9 WSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKY ::::::::::::::::::::::::::::::..::.::::::::::::::::::::::::: NP_003 WSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKY 60 70 80 90 100 110 130 140 150 160 170 pF1KB9 RPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVG-AAAVGQRLESPGGAAGG :::::::::.:::::.: ::::: : : ..: :::::.: .:.:.::..: NP_003 RPRRKTKTLMKKDKYTLPGGLLAPG----GNSMASGVGVGAGLGAGVNQRMDS------- 120 130 140 150 180 190 200 210 220 230 pF1KB9 GYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAH :::.:::.::.: .:::. ::.: :::: .:: : NP_003 -YAHMNGWSNGSY-----------SMMQD-QLGYPQHPGL-----NAHGAAQM------- 160 170 180 190 240 250 260 270 280 290 pF1KB9 PHNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAA :::::::..::::. ...:: ::..::. :. . . .. : : NP_003 ----QPMHRYDVSALQYNSMTSSQTYMNGSPT------YSMSYSQQGTPGMA-------- 200 210 220 230 300 310 320 330 340 350 pF1KB9 AAAAAAASSGALGALGSLVKSEPSGSPP---APAHSRAPCP-GDLREMISMYLPAGEGGD ::..::.:::: :.::: . .:::::: ::::.:::::::..: . NP_003 -----------LGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPE 240 250 260 270 280 360 370 380 390 pF1KB9 PAAAAAAAAQSRLHSLPQHYQGA---GAGVNGTVPLTHI ::: :::: . ::::.. :...:::.::.:. NP_003 PAAP------SRLH-MSQHYQSGPVPGTAINGTLPLSHM 290 300 310 >>NP_009015 (OMIM: 604974) transcription factor SOX-21 [ (276 aa) initn: 742 init1: 555 opt: 611 Z-score: 360.6 bits: 75.0 E(85289): 2.4e-13 Smith-Waterman score: 706; 46.4% identity (64.1% similar) in 323 aa overlap (49-364:6-275) 20 30 40 50 60 70 pF1KB9 PTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHN :.:::::::::::::.:::::::::::::: NP_009 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHN 10 20 30 80 90 100 110 120 130 pF1KB9 SEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLLKKDKYSLA ::::::::::::...:.::::::::::::::.::::::::::::::: ::::::::... NP_009 SEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFP 40 50 60 70 80 90 140 150 160 170 180 190 pF1KB9 GGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGAAGGGYAHVNGWANGAYPGSVAA . : :: . : .. .:.: : .:::: . . :: : ..:: NP_009 ---VPYGLGGVADAEHPALKAGAGLHA----------GAGGGLVPESLLAN---PEKAAA 100 110 120 130 200 210 220 230 240 250 pF1KB9 AAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAHPHNPQPMHRYDMGALQYSP :::::: :.. . : .:..: : : .:. :.:. ... NP_009 AAAAAA----ARVFFPQSAAAAAAAAAAAAAG-------------SPYSLLDLGS-KMAE 140 150 160 170 180 260 270 280 290 300 310 pF1KB9 ISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLV ::.:.. ::::... . .:..:: ...:.::::::::: :. . NP_009 ISSSSS----------GLPYASSLGYPTAGAGAFHGAAAAAAAAAAAA--------GGHT 190 200 210 220 320 330 340 350 360 370 pF1KB9 KSEPSGSPPA---PAHSRA-PCPGDLREMISMYLPAGEGG---DPAAAAAAAAQSRLHSL .:.:: . :. : . : : :: . . :: : : :: :: ::: NP_009 HSHPSPGNPGYMIPCNCSAWPSPGLQPPLAYILLP-GMGKPQLDPYPAAYAAAL 230 240 250 260 270 380 390 pF1KB9 PQHYQGAGAGVNGTVPLTHI >>NP_004180 (OMIM: 604747) transcription factor SOX-14 [ (240 aa) initn: 590 init1: 563 opt: 602 Z-score: 356.3 bits: 74.0 E(85289): 4.2e-13 Smith-Waterman score: 602; 46.5% identity (64.7% similar) in 241 aa overlap (49-285:6-239) 20 30 40 50 60 70 pF1KB9 PTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHN :..::::::::::::::::::::::::::: NP_004 MSKPSDHIKRPMNAFMVWSRGQRRKMAQENPKMHN 10 20 30 80 90 100 110 120 130 pF1KB9 SEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLLKKDKYSLA ::::::::::::..:::::::.::::::::: ::::::::::::::: :.:::::.: . NP_004 SEISKRLGAEWKLLSEAEKRPYIDEAKRLRAQHMKEHPDYKYRPRRKPKNLLKKDRYVFP 40 50 60 70 80 90 140 150 160 170 180 190 pF1KB9 GGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGAAGGGYAHVNGWANGAYP--GSV :. .:.. .:.. :. .: : : ..: . ....: : : NP_004 LPYLGDTDPLKAAGLPVGASDGLLSAPEKARAFLPPASAPYSLLDPAQFSSSAIQKMGEV 100 110 120 130 140 150 200 210 220 230 240 250 pF1KB9 AAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPH-HPHAHPHNPQPMHRYDMGALQ . :..:. . :.: . :: :. . : : : : : :: . . : . NP_004 PHTLATGALPYASTLGY--QNGAFGSL-----SCPSQHTHTHPSPTNPGYVVPCNCTAWS 160 170 180 190 200 260 270 280 290 300 310 pF1KB9 YSPISNSQGYMSASPSGYGGL-PYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGAL : .. .:. :. ::..: :.: NP_004 ASTLQPPVAYILFPGMTKTGIDPYSSAHATAM 210 220 230 240 320 330 340 350 360 370 pF1KB9 GSLVKSEPSGSPPAPAHSRAPCPGDLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLPQH >>NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapiens] (233 aa) initn: 476 init1: 446 opt: 485 Z-score: 292.4 bits: 62.2 E(85289): 1.5e-09 Smith-Waterman score: 495; 39.8% identity (59.8% similar) in 246 aa overlap (10-246:14-228) 10 20 30 40 50 pF1KB9 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMN :. :... : .. ::: :.:. .. : ..:::::: NP_008 MALPGSSQDQAWSLEPPAATAAASSSSGPQEREGAGSPAAPG------TLPLEKVKRPMN 10 20 30 40 50 60 70 80 90 100 110 pF1KB9 AFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHP :::::: .:::.:::.:::::::::::::::.::...: :::::..::::::: :....: NP_008 AFMVWSSAQRRQMAQQNPKMHNSEISKRLGAQWKLLDEDEKRPFVEEAKRLRARHLRDYP 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB9 DYKYRPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVGAAAVGQRLESPGGA :::::::::.:. .::: . : : : : : : .:: : NP_008 DYKYRPRRKAKS---------------SGAGPSR------CGQGRGNLASGGPLWGPGYA 120 130 140 150 180 190 200 210 220 pF1KB9 A-----GGGYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHP-HAH--- . : :: . ..... ::: ... . .: .. : : ..: NP_008 TTQPSRGFGY-RPPSYSTAYLPGSYGSSHCKLEAPSPCSLPQSDPRLQGELLPTYTHYLP 160 170 180 190 200 210 230 240 250 260 270 280 pF1KB9 PAHPHPHHPHAHPHNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAAAAAAA :. : :..: : :: NP_008 PGSPTPYNP---PLAGAPMPLTHL 220 230 >>NP_003098 (OMIM: 184430) transcription factor SOX-4 [H (474 aa) initn: 413 init1: 413 opt: 486 Z-score: 289.5 bits: 62.7 E(85289): 2.2e-09 Smith-Waterman score: 506; 33.9% identity (57.9% similar) in 363 aa overlap (12-332:20-378) 10 20 30 40 50 pF1KB9 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVK : .:: .... :. .. :: . . :. . ..: NP_003 MVQQTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB9 RPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHM ::::::::::. .:::. ...: :::.::::::: .::......: ::: ::.::: :: NP_003 RPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHM 70 80 90 100 110 120 120 130 140 150 pF1KB9 KEHPDYKYRPRRKTKTL---------------LKKDKYSLAGGLLAAGAGGGGAAVAMGV ..::::::::.:.:. : :: . .:: .:.::::.. : : NP_003 ADYPDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGG 130 140 150 160 170 180 160 170 180 190 200 pF1KB9 GVGV---GAAAVGQRLESPG----GAAGGGYA--HVNGWANGAYPGSVAAAAAAAAMMQE : :. :: . . .: : :.:::: . :.. :. :. :::::::.. : NP_003 GGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFAAE 190 200 210 220 230 240 210 220 230 240 250 pF1KB9 -----AQLAYG----QH-------PGAGGAHPHAHPAHPHPHHPHAHPHNPQPMHRYDMG : : : .: :.:... : : : : . . . : .: NP_003 QAGAAALLPLGAAADHHSLYKARTPSASASASSAASASAALAAPGKHLAEKKVKRVYLFG 250 260 270 280 290 300 260 270 280 290 300 310 pF1KB9 AL--QYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGA .: . ::... . .:.:: :: : .:. . . . .. . ::.. ::. : . NP_003 GLGTSSSPVGGVGA--GADPSDPLGL-YEEEGAGCSPDAPSLSGRSSAASSPAAGRSPAD 310 320 330 340 350 320 330 340 350 360 370 pF1KB9 LGALGSLVKSEPSGSPPAPAHSRAPCPGDLREMISMYLPAGEGGDPAAAAAAAAQSRLHS . .:: . :. : ::.:. NP_003 HRGYASLRAASPAPSS-APSHASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFES 360 370 380 390 400 410 >>NP_071899 (OMIM: 610928,613674) transcription factor S (414 aa) initn: 479 init1: 405 opt: 451 Z-score: 271.0 bits: 59.0 E(85289): 2.3e-08 Smith-Waterman score: 471; 33.4% identity (59.5% similar) in 299 aa overlap (9-293:41-312) 10 20 30 pF1KB9 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGG :.. : .::.: ..::::.: . : NP_071 DDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKG--EAPANSGAPAGAAGRAKG---- 20 30 40 50 60 40 50 60 70 80 90 pF1KB9 GGGGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKR ..:..:::::::::.. .:...::.:: .::.:.:: :: ::... :::: NP_071 ---------ESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKR 70 80 90 100 110 100 110 120 130 140 150 pF1KB9 PFIDEAKRLRALHMKEHPDYKYRPRRKTKTL-LKKDKYSLAGGLL---AAGAGGGGAAVA ::..::.:::. ::..::.:::::::. .. ::. . .. :: ::. : :. :: NP_071 PFVEEAERLRVQHMQDHPNYKYRPRRRKQVKRLKRVEGGFLHGLAEPQAAALGPEGGRVA 120 130 140 150 160 170 160 170 180 190 200 pF1KB9 M-GVGVGVGAAA--VGQRLESPGGAAGGGYAHVNGWAN---GAYPGSVAAAAAAAAMMQE : :.:. . .: : : :: : .. . .:: . .. .. . NP_071 MDGLGLQFPEQGFPAGPPLLPPH--MGGHYRDCQSLGAPPLDGYPLPTPDTSPLDGVDPD 180 190 200 210 220 230 210 220 230 240 250 260 pF1KB9 AQLAYGQHPG---AGGAHPHAHPA-HPHPHHPHAHPHNPQPMHRYDMGALQYSPISNSQG . . :: :.:.. .:. . . : .: : : .:. .: : . : NP_071 PAFFAAPMPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPR------LGP---EPAGPSIP 240 250 260 270 280 270 280 290 300 310 320 pF1KB9 YMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLVKSEPSG . : ::. . ::: .. .:..: . : NP_071 GLLAPPSALH-VYYGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCR 290 300 310 320 330 340 >>NP_055402 (OMIM: 605923) transcription factor SOX-8 [H (446 aa) initn: 466 init1: 410 opt: 448 Z-score: 269.0 bits: 58.8 E(85289): 3e-08 Smith-Waterman score: 478; 32.9% identity (54.0% similar) in 350 aa overlap (39-357:90-428) 10 20 30 40 50 60 pF1KB9 DLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMVWSRGQRRK :::::: . .:::::::::::... ::: NP_055 ADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGGGALKAKPHVKRPMNAFMVWAQAARRK 60 70 80 90 100 110 70 80 90 100 110 120 pF1KB9 MAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKT .:.. :..::.:.:: :: :...::.:::::..::.:::. : :.::::::.:::. :. NP_055 LADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPRRR-KS 120 130 140 150 160 170 130 140 150 160 170 pF1KB9 LLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVG---AAAVGQRLESPGGA--------- . : .:. :. ::: :. .:.: : . .:: : NP_055 AKAGHSDSDSGAELGPHPGGG-AVYKAEAGLGDGHHHGDHTGQTHGPPTPPTTPKTELQQ 180 190 200 210 220 230 180 190 200 210 220 pF1KB9 AGG-------GYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQH------PGAGGAH ::. : :.. .. ..: . .. .: . :. : : .: : NP_055 AGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMD-AFDVHEFDQYLPLGGPAP 240 250 260 270 280 290 230 240 250 260 270 pF1KB9 PHAHPAHPHPH-HPHAHP---HNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYG :. :. . : : : :. : . . . .: . ::. :: : : NP_055 PEPGQAYGGAYFHAGASPVWAHKSAP--SASASPTETGPPRPHIKTEQPSPGHYGDQPRG 300 310 320 330 340 350 280 290 300 310 320 330 pF1KB9 AAAAAAAAAGGAHQNSAVAAAAAAAAASSGALGALGSLVKSEPSGSPP--APAHSRAPCP . .. .. :.::. :: :. : :. : :.: : :. : ::. . :: NP_055 SPDYGSCSG----QSSATPAAPAGPFA--GSQGDYGDLQASSYYGAYPGYAPGLYQYPCF 360 370 380 390 400 340 350 360 370 380 390 pF1KB9 GDLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI . :. . : : . :: NP_055 HSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP 410 420 430 440 >>NP_003099 (OMIM: 600898,615866) transcription factor S (441 aa) initn: 393 init1: 393 opt: 433 Z-score: 260.8 bits: 57.2 E(85289): 8.6e-08 Smith-Waterman score: 466; 33.1% identity (58.8% similar) in 323 aa overlap (45-325:43-353) 20 30 40 50 60 70 pF1KB9 GAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMVWSRGQRRKMAQENP :. . ..:::::::::::. .:::. ...: NP_003 NLPREALDTEEGEFMACSPVALDESDPDWCKTASGHIKRPMNAFMVWSKIERRKIMEQSP 20 30 40 50 60 70 80 90 100 110 120 130 pF1KB9 KMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLLKKDK :::.::::::: .::.....:: ::: ::.::: :: ..::::::::.: : . . : NP_003 DMHNAEISKRLGKRWKMLKDSEKIPFIREAERLRLKHMADYPDYKYRPRKKPK-MDPSAK 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 YSLAGGLLAAGAGGGGAAVAMGVG---VGVGAAAVGQRLESP---GGAAGGGYAHVNGWA : . . ..:::::.... :.: .. :.. .:..: :. ::.: : .: NP_003 PSASQSPEKSAAGGGGGSAGGGAGGAKTSKGSSKKCGKLKAPAAAGAKAGAGKAAQSGDY 140 150 160 170 180 190 190 200 210 220 pF1KB9 NGAYP----GSV----AAAAAAAAMMQ---------------EAQLAYGQHPGAGGAHPH .:: ::. .....:. .. : :: :.: .: NP_003 GGAGDDYVLGSLRVSGSGGGGAGKTVKCVFLDEDDDDDDDDDELQLQIKQEPDEEDEEP- 200 210 220 230 240 250 230 240 250 260 270 280 pF1KB9 AHPAHPHPHHPHAHPHNPQP---MHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAA ::. .: . :: ..::... . :: .:. . :: : . :. NP_003 -------PHQQLLQPPGQQPSQLLRRYNVAKVPASPTLSSS---AESPEGASLYDEVRAG 260 270 280 290 300 290 300 310 320 330 pF1KB9 AAAAAAGGAH----------QNSAVAAAAAAAAASSGALGALGSLVKSEPSGSPPAPAHS :...:.::.. :. : : . ::: .... .: .. ::: NP_003 ATSGAGGGSRLYYSFKNITKQHPPPLAQPALSPASSRSVSTSSSSSSGSSSGSSGEDADD 310 320 330 340 350 360 340 350 360 370 380 390 pF1KB9 RAPCPGDLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI NP_003 LMFDLSLNFSQSAHSASEQQLGGGAAAGNLSLSLVDKDLDSFSEGSLGSHFEFPDYCTPE 370 380 390 400 410 420 391 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 02:06:51 2016 done: Tue Nov 8 02:06:52 2016 Total Scan time: 10.240 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]