FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9650, 315 aa 1>>>pF1KB9650 315 - 315 aa - 315 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.1090+/-0.000278; mu= 6.7210+/- 0.018 mean_var=223.5851+/-44.163, 0's: 0 Z-trim(126.4): 155 B-trim: 0 in 0/58 Lambda= 0.085773 statistics sampled from 52082 (52260) to 52082 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.858), E-opt: 0.2 (0.613), width: 16 Scan time: 9.830 The best scores are: opt bits E(85289) NP_008874 (OMIM: 601947) transcription factor SOX- ( 315) 2222 286.5 4.8e-77 NP_003098 (OMIM: 184430) transcription factor SOX- ( 474) 647 91.8 3e-18 NP_003099 (OMIM: 600898,615866) transcription fact ( 441) 586 84.2 5.3e-16 NP_009015 (OMIM: 604974) transcription factor SOX- ( 276) 439 65.8 1.2e-10 NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapi ( 233) 424 63.9 3.7e-10 NP_008872 (OMIM: 602229,609136,611584,613266) tran ( 466) 422 63.9 7.1e-10 NP_005977 (OMIM: 602148) transcription factor SOX- ( 391) 417 63.2 9.7e-10 NP_003097 (OMIM: 184429,189960,206900) transcripti ( 317) 415 62.9 1e-09 NP_060889 (OMIM: 137940,601618,607823) transcripti ( 384) 404 61.6 2.9e-09 NP_055402 (OMIM: 605923) transcription factor SOX- ( 446) 400 61.2 4.6e-09 NP_004180 (OMIM: 604747) transcription factor SOX- ( 240) 394 60.2 5e-09 NP_113627 (OMIM: 612202) transcription factor SOX- ( 388) 397 60.8 5.4e-09 NP_071899 (OMIM: 610928,613674) transcription fact ( 414) 397 60.8 5.6e-09 NP_005625 (OMIM: 300123,312000,313430) transcripti ( 446) 394 60.4 7.7e-09 NP_000337 (OMIM: 114290,608160,616425) transcripti ( 509) 385 59.4 1.8e-08 XP_005245680 (OMIM: 604748) PREDICTED: transcripti ( 621) 344 54.4 7e-07 NP_005677 (OMIM: 604748) transcription factor SOX- ( 622) 344 54.4 7e-07 NP_821078 (OMIM: 604975,616803) transcription fact ( 377) 338 53.4 8.3e-07 XP_011519144 (OMIM: 604975,616803) PREDICTED: tran ( 415) 338 53.5 8.9e-07 NP_001248343 (OMIM: 604975,616803) transcription f ( 642) 338 53.7 1.2e-06 XP_016875388 (OMIM: 604975,616803) PREDICTED: tran ( 715) 338 53.7 1.3e-06 XP_016875389 (OMIM: 604975,616803) PREDICTED: tran ( 715) 338 53.7 1.3e-06 XP_016875390 (OMIM: 604975,616803) PREDICTED: tran ( 715) 338 53.7 1.3e-06 XP_016875387 (OMIM: 604975,616803) PREDICTED: tran ( 715) 338 53.7 1.3e-06 XP_016875386 (OMIM: 604975,616803) PREDICTED: tran ( 716) 338 53.7 1.3e-06 NP_001317714 (OMIM: 604975,616803) transcription f ( 728) 338 53.7 1.3e-06 XP_011519140 (OMIM: 604975,616803) PREDICTED: tran ( 729) 338 53.7 1.3e-06 XP_016875385 (OMIM: 604975,616803) PREDICTED: tran ( 750) 338 53.7 1.3e-06 NP_694534 (OMIM: 604975,616803) transcription fact ( 750) 338 53.7 1.3e-06 XP_016875384 (OMIM: 604975,616803) PREDICTED: tran ( 751) 338 53.7 1.3e-06 XP_016875382 (OMIM: 604975,616803) PREDICTED: tran ( 751) 338 53.7 1.3e-06 XP_011519136 (OMIM: 604975,616803) PREDICTED: tran ( 751) 338 53.7 1.3e-06 XP_016875379 (OMIM: 604975,616803) PREDICTED: tran ( 751) 338 53.7 1.3e-06 XP_016875380 (OMIM: 604975,616803) PREDICTED: tran ( 751) 338 53.7 1.3e-06 XP_016875383 (OMIM: 604975,616803) PREDICTED: tran ( 751) 338 53.7 1.3e-06 XP_016875381 (OMIM: 604975,616803) PREDICTED: tran ( 751) 338 53.7 1.3e-06 XP_011519139 (OMIM: 604975,616803) PREDICTED: tran ( 751) 338 53.7 1.3e-06 XP_011519137 (OMIM: 604975,616803) PREDICTED: tran ( 751) 338 53.7 1.3e-06 NP_001248344 (OMIM: 604975,616803) transcription f ( 753) 338 53.7 1.3e-06 XP_011519135 (OMIM: 604975,616803) PREDICTED: tran ( 754) 338 53.7 1.3e-06 NP_008871 (OMIM: 604975,616803) transcription fact ( 763) 338 53.7 1.4e-06 XP_011519134 (OMIM: 604975,616803) PREDICTED: tran ( 764) 338 53.7 1.4e-06 XP_016875378 (OMIM: 604975,616803) PREDICTED: tran ( 792) 338 53.8 1.4e-06 XP_016875377 (OMIM: 604975,616803) PREDICTED: tran ( 793) 338 53.8 1.4e-06 NP_003131 (OMIM: 400044,400045,480000) sex-determi ( 204) 322 51.2 2.1e-06 NP_001139283 (OMIM: 607257) transcription factor S ( 801) 331 52.9 2.6e-06 NP_059978 (OMIM: 607257) transcription factor SOX- ( 804) 331 52.9 2.6e-06 NP_001139291 (OMIM: 607257) transcription factor S ( 841) 331 52.9 2.6e-06 NP_201583 (OMIM: 607257) transcription factor SOX- ( 808) 330 52.8 2.8e-06 NP_008948 (OMIM: 606698) transcription factor SOX- ( 501) 265 44.5 0.00053 >>NP_008874 (OMIM: 601947) transcription factor SOX-12 [ (315 aa) initn: 2222 init1: 2222 opt: 2222 Z-score: 1504.1 bits: 286.5 E(85289): 4.8e-77 Smith-Waterman score: 2222; 100.0% identity (100.0% similar) in 315 aa overlap (1-315:1-315) 10 20 30 40 50 60 pF1KB9 MVQQRGARAKRDGGPPPPGPGPAEEGAREPGWCKTPSGHIKRPMNAFMVWSQHERRKIMD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_008 MVQQRGARAKRDGGPPPPGPGPAEEGAREPGWCKTPSGHIKRPMNAFMVWSQHERRKIMD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 QWPDMHNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADYPDYKYRPRKKSKGAPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_008 QWPDMHNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADYPDYKYRPRKKSKGAPA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 KARPRPPGGSGGGSRLKPGPQLPGRGGRRAAGGPLGGGAAAPEDDDEDDDEELLEVRLVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_008 KARPRPPGGSGGGSRLKPGPQLPGRGGRRAAGGPLGGGAAAPEDDDEDDDEELLEVRLVE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 TPGRELWRMVPAGRAARGQAERAQGPSGEGAAAAAAASPTPSEDEEPEEEEEEAAAAEEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_008 TPGRELWRMVPAGRAARGQAERAQGPSGEGAAAAAAASPTPSEDEEPEEEEEEAAAAEEG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 EEETVASGEESLGFLSRLPPGPAGLDCSALDRDPDLQPPSGTSHFEFPDYCTPEVTEMIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_008 EEETVASGEESLGFLSRLPPGPAGLDCSALDRDPDLQPPSGTSHFEFPDYCTPEVTEMIA 250 260 270 280 290 300 310 pF1KB9 GDWRPSSIADLVFTY ::::::::::::::: NP_008 GDWRPSSIADLVFTY 310 >>NP_003098 (OMIM: 184430) transcription factor SOX-4 [H (474 aa) initn: 926 init1: 580 opt: 647 Z-score: 448.6 bits: 91.8 E(85289): 3e-18 Smith-Waterman score: 652; 51.6% identity (67.6% similar) in 219 aa overlap (16-217:34-246) 10 20 30 40 pF1KB9 MVQQRGARAKRDGGPPPPGPGPAEEG-AREPGWCKTPSGHIKRPM : :: . : : .:.::::::::::::: NP_003 QTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIKRPM 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB9 NAFMVWSQHERRKIMDQWPDMHNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADY :::::::: ::::::.: :::::::::::::.::.::.::.::::.:::::::::::::: NP_003 NAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHMADY 70 80 90 100 110 120 110 120 130 140 150 pF1KB9 PDYKYRPRKK--------SKGAPAKARPRPPG----GSGGGSRLKPGPQLPGRGGRRAAG :::::::::: :..: :...: : ::::: : : :: :: NP_003 PDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGG-----GHGGGGGGGSSNAG 130 140 150 160 170 160 170 180 190 200 pF1KB9 GPLGGGAAAPEDDDEDDDEELLEVRLVETPG----RELWRMVPAGRAARGQAERAQGPSG : ::::.. ... ... ... : . ... :: .. :.: : . : NP_003 GG-GGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAASF 180 190 200 210 220 230 210 220 230 240 250 260 pF1KB9 EGAAAAAAASPTPSEDEEPEEEEEEAAAAEEGEEETVASGEESLGFLSRLPPGPAGLDCS . :.::: NP_003 AAEQAGAAALLPLGAAADHHSLYKARTPSASASASSAASASAALAAPGKHLAEKKVKRVY 240 250 260 270 280 290 >-- initn: 308 init1: 207 opt: 281 Z-score: 203.8 bits: 46.5 E(85289): 0.00013 Smith-Waterman score: 281; 34.3% identity (56.5% similar) in 239 aa overlap (96-315:248-474) 70 80 90 100 110 120 pF1KB9 HNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADYPDYKYRPRKKSKGAPAKARPR : : ::. . :. : : .: :.. NP_003 LILAGGGGGGKAAAAAAASFAAEQAGAAALLPLGAAADHHSL-YKARTPSASASASS--- 220 230 240 250 260 270 130 140 150 160 170 pF1KB9 PPGGSGGGSRLKPGPQLPGR--------GGRRAAGGPLGG-GAAAPEDDDEDDDEELLEV ..:.... :: .: . :: ....:.:: ::.: .: :: NP_003 --AASASAALAAPGKHLAEKKVKRVYLFGGLGTSSSPVGGVGAGADPSDPLGLYEEEGAG 280 290 300 310 320 330 180 190 200 210 220 pF1KB9 RLVETP---GRELWRMVPA-GRAA---RGQAE-RAQGPSGEGAAAAAAASPTPSEDEEPE ..: :: :: ::. :: : :: .:. .: . :..: . .. NP_003 CSPDAPSLSGRSSAASSPAAGRSPADHRGYASLRAASPAPSSAPSHASSSAS---SHSSS 340 350 360 370 380 230 240 250 260 270 280 pF1KB9 EEEEEAAAAEEGEEETVASGEESLGFLSRLPPGPAGLDCSALDRDPDL--QPPSGTSHFE ...... :. . . . : .: : . : . . :::::: :. .: :: :::: NP_003 SSSSGSSSSDDEFEDDLLDLNPSSNFES-MSLGSFS-SSSALDRDLDFNFEPGSG-SHFE 390 400 410 420 430 440 290 300 310 pF1KB9 FPDYCTPEVTEMIAGDWRPSSIADLVFTY :::::::::.:::.::: :::..::::: NP_003 FPDYCTPEVSEMISGDWLESSISNLVFTY 450 460 470 >>NP_003099 (OMIM: 600898,615866) transcription factor S (441 aa) initn: 820 init1: 562 opt: 586 Z-score: 408.2 bits: 84.2 E(85289): 5.3e-16 Smith-Waterman score: 604; 38.4% identity (56.9% similar) in 318 aa overlap (22-283:31-338) 10 20 30 40 50 pF1KB9 MVQQRGARAKRDGGPPPPGPGPAEEGAREPGWCKTPSGHIKRPMNAFMVWS :. .: :::: ::::::::::::::: NP_003 MVQQAESLEAESNLPREALDTEEGEFMACSPVALDESDPDWCKTASGHIKRPMNAFMVWS 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB9 QHERRKIMDQWPDMHNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADYPDYKYRP . ::::::.: :::::::::::::.::..:.:::::::.::::::::::::::::::::: NP_003 KIERRKIMEQSPDMHNAEISKRLGKRWKMLKDSEKIPFIREAERLRLKHMADYPDYKYRP 70 80 90 100 110 120 120 130 140 pF1KB9 RKKSKGAPAKARP---RPP------------GGSGGGSRLKPGP-------QLPGRGGRR ::: : :. :.: . : ::..::.. . : . :. .: . NP_003 RKKPKMDPS-AKPSASQSPEKSAAGGGGGSAGGGAGGAKTSKGSSKKCGKLKAPAAAGAK 130 140 150 160 170 150 160 170 pF1KB9 AAGGPL-----------------------GGGAAAP--------EDDDEDDDEELLEVRL :..: :::.:. ::::.:::.. :.... NP_003 AGAGKAAQSGDYGGAGDDYVLGSLRVSGSGGGGAGKTVKCVFLDEDDDDDDDDDELQLQI 180 190 200 210 220 230 180 190 200 210 220 230 pF1KB9 VETPGRELWRMVPAGRAARGQAERAQGPSGEGAAAAAAASPTPSEDEEPEEEEEEAAAAE . : .: . : . . ... . . .: . :::: : . : : .. NP_003 KQEPDEED-EEPPHQQLLQPPGQQPSQLLRRYNVAKVPASPTLSSSAESPEGASLYDEVR 240 250 260 270 280 290 240 250 260 270 280 290 pF1KB9 EGEEETVASGEE---SLGFLSRLPPGPAGLDCSALDRDPDLQPPSGTSHFEFPDYCTPEV : ...: . :. ... : : . .: :.: :. : NP_003 AGATSGAGGGSRLYYSFKNITKQHPPPLA--------QPALSPASSRSVSTSSSSSSGSS 300 310 320 330 340 350 300 310 pF1KB9 TEMIAGDWRPSSIADLVFTY NP_003 SGSSGEDADDLMFDLSLNFSQSAHSASEQQLGGGAAAGNLSLSLVDKDLDSFSEGSLGSH 360 370 380 390 400 410 >>NP_009015 (OMIM: 604974) transcription factor SOX-21 [ (276 aa) initn: 487 init1: 385 opt: 439 Z-score: 312.4 bits: 65.8 E(85289): 1.2e-10 Smith-Waterman score: 439; 34.4% identity (57.5% similar) in 273 aa overlap (36-295:4-265) 10 20 30 40 50 60 pF1KB9 GARAKRDGGPPPPGPGPAEEGAREPGWCKTPSGHIKRPMNAFMVWSQHERRKIMDQWPDM : :.:::::::::::. .:::. .. : : NP_009 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKM 10 20 30 70 80 90 100 110 120 pF1KB9 HNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADYPDYKYRPRKKSKGAPAK---A ::.::::::: .:.:: .::: ::. ::.::: :: ..::::::::.: : : : NP_009 HNSEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFA 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB9 RPRPPGGSGGGSRLKPGPQLPGRGGRRAAGGPLGGGAAAPEDDDEDDDEELLEVRLVETP : : : :: : : . .: .:.. ::. .::. . .. . . . NP_009 FPVPYG--LGGVADAEHPALKAGAGLHAGA----GGGLVPESLLANPEKAAAAA--AAAA 100 110 120 130 140 190 200 210 220 230 pF1KB9 GRELWRMVPAGRAARGQAERAQGPSGEGAAAAAAASPTPSEDEEPEEEE----EEAAAAE .: .. . :. :: . : : .: . .. : . : . : .:.: NP_009 ARVFFPQSAAAAAAAAAAAAAGSPYSLLDLGSKMAEISSSSSGLPYASSLGYPTAGAGAF 150 160 170 180 190 200 240 250 260 270 280 290 pF1KB9 EGEEETVASGEESLGFLSRLPPGPAG------LDCSALDRDPDLQPPSGTSHFEFPDYCT .: ..:.. . : .. :.:.. .::: .: :::: ... .: . NP_009 HGAAAAAAAAAAAAGGHTHSHPSPGNPGYMIPCNCSAWP-SPGLQPP--LAYILLPGMGK 210 220 230 240 250 260 300 310 pF1KB9 PEVTEMIAGDWRPSSIADLVFTY :.. NP_009 PQLDPYPAAYAAAL 270 >>NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapiens] (233 aa) initn: 450 init1: 392 opt: 424 Z-score: 303.3 bits: 63.9 E(85289): 3.7e-10 Smith-Waterman score: 425; 50.7% identity (68.8% similar) in 144 aa overlap (21-161:28-153) 10 20 30 40 50 pF1KB9 MVQQRGARAKRDGGPPPPGPGPAE-EGAREPGWCKT-PSGHIKRPMNAFMVWS :: : ::: :. : : ..::::::::::: NP_008 MALPGSSQDQAWSLEPPAATAAASSSSGPQEREGAGSPAAPGTLPLEKVKRPMNAFMVWS 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB9 QHERRKIMDQWPDMHNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADYPDYKYRP . .::.. .: : :::.::::::: .:.::...:: :::.::.::: .:. ::::::::: NP_008 SAQRRQMAQQNPKMHNSEISKRLGAQWKLLDEDEKRPFVEEAKRLRARHLRDYPDYKYRP 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB9 RKKSKGAPAKARPRPPGGSGGGSRLKPGPQLPGRG-GRRAAGGPLGGGAAAPEDDDEDDD :.:.:.. : ::. :.: : :.:::: : . : NP_008 RRKAKSSGA------------------GPSRCGQGRGNLASGGPLWGPGYATTQPSRGFG 130 140 150 160 180 190 200 210 220 230 pF1KB9 EELLEVRLVETPGRELWRMVPAGRAARGQAERAQGPSGEGAAAAAAASPTPSEDEEPEEE NP_008 YRPPSYSTAYLPGSYGSSHCKLEAPSPCSLPQSDPRLQGELLPTYTHYLPPGSPTPYNPP 170 180 190 200 210 220 >>NP_008872 (OMIM: 602229,609136,611584,613266) transcri (466 aa) initn: 441 init1: 414 opt: 422 Z-score: 298.2 bits: 63.9 E(85289): 7.1e-10 Smith-Waterman score: 446; 33.2% identity (57.2% similar) in 271 aa overlap (39-285:103-369) 10 20 30 40 50 60 pF1KB9 AKRDGGPPPPGPGPAEEGAREPGWCKTPSGHIKRPMNAFMVWSQHERRKIMDQWPDMHNA :.::::::::::.: :::. ::.: .::: NP_008 REAVSQVLSGYDWTLVPMPVRVNGASKSKPHVKRPMNAFMVWAQAARRKLADQYPHLHNA 80 90 100 110 120 130 70 80 90 100 110 120 pF1KB9 EISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADYPDYKYRPRKKSKGAPAKARPRPPG :.:: ::. :.::..:.: ::..::::::..: :.:::::.::....: :... . :: NP_008 ELSKTLGKLWRLLNESDKRPFIEEAERLRMQHKKDHPDYKYQPRRRKNGKAAQGEAECPG 140 150 160 170 180 190 130 140 150 160 170 pF1KB9 G---SGGGSRLKP---GPQL----PGRGGRRAAGGPL--GGGAAAPEDDDEDDDEELLEV : .:: . .. . .: ::.:. . :.: .: . .: :: NP_008 GEAEQGGTAAIQAHYKSAHLDHRHPGEGSPMSDGNPEHPSGQSHGPPTPPTTPKTELQSG 200 210 220 230 240 250 180 190 200 210 220 230 pF1KB9 RL-VETPGRELWR----MVPAGRAARGQAERAQGPSGEGAAAAAAASPTPSEDEEPEEEE . . :: . . . : . :. . . : .: . : . . . NP_008 KADPKRDGRSMGEGGKPHIDFGNVDIGEISHEVMSNMETFDVAELDQYLPPNGHPGHVSS 260 270 280 290 300 310 240 250 260 270 280 pF1KB9 EEAAAAEEGEEETVASGEESLGFLSRLPPG-------PAGLDCSALDRDPDLQPPSGTSH ::. : .::::. . ..:. ::: : :.: .: . . :.: : NP_008 YSAAGYGLGSALAVASGHSA--WISK-PPGVALPTVSPPGVDAKAQVKT-ETAGPQGPPH 320 330 340 350 360 290 300 310 pF1KB9 FEFPDYCTPEVTEMIAGDWRPSSIADLVFTY . NP_008 YTDQPSTSQIAYTSLSLPHYGSAFPSISRPQFDYSDHQPSGPYYGHSGQASGLYSAFSYM 370 380 390 400 410 420 >>NP_005977 (OMIM: 602148) transcription factor SOX-1 [H (391 aa) initn: 413 init1: 371 opt: 417 Z-score: 295.8 bits: 63.2 E(85289): 9.7e-10 Smith-Waterman score: 428; 34.1% identity (54.5% similar) in 279 aa overlap (6-269:15-280) 10 20 30 40 pF1KB9 MVQQRGARAKRD-GGPPPPGPGPAEEGAREPGW-CKTPSGHIKRPMNAFMV ::.: . .:: : : . :. : :. . ..::::::::: NP_005 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB9 WSQHERRKIMDQWPDMHNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADYPDYKY ::. .:::. .. : :::.::::::: .:......:: ::. ::.::: :: ..::::: NP_005 WSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKY 70 80 90 100 110 120 110 120 130 140 150 pF1KB9 RPRKKSKGAPAKARPRPPGG------SGGGSRLKPGPQLPGRG----GRR--AAGGPLGG :::.:.: : . :: .:::. . : . : : :.: . :: :: NP_005 RPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGV-GVGAAAVGQRLESPGGAAGG 130 140 150 160 170 160 170 180 190 200 210 pF1KB9 GAAAPEDDDEDDDEELLEVRLVETPGRELWRMVPAGRAARGQAERAQGPSGEGAAAAA-A : : . . :: . :. ..: .: :.. :: : NP_005 GYAHVNGWANG-----------AYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHP 180 190 200 210 220 220 230 240 250 260 270 pF1KB9 ASPTPSEDEEPEEEEEEAAAAEEGEEETVASGEESLGFLSRLPPGPAGLDCSALDRDPDL : : : . . .. . . : . . .: :..: : : .:: .: NP_005 AHPHPHHPHAHPHNPQPMHRYDMGALQ-YSPISNSQGYMSASPSGYGGLPYGAAAAAAAA 230 240 250 260 270 280 280 290 300 310 pF1KB9 QPPSGTSHFEFPDYCTPEVTEMIAGDWRPSSIADLVFTY NP_005 AGGAHQNSAVAAAAAAAAASSGALGALGSLVKSEPSGSPPAPAHSRAPCPGDLREMISMY 290 300 310 320 330 340 >>NP_003097 (OMIM: 184429,189960,206900) transcription f (317 aa) initn: 432 init1: 386 opt: 415 Z-score: 295.6 bits: 62.9 E(85289): 1e-09 Smith-Waterman score: 415; 46.9% identity (65.5% similar) in 145 aa overlap (17-152:11-152) 10 20 30 40 50 pF1KB9 MVQQRGARAKRDGGPPPPGP-------GPAEEGAREPGWCKTPSGHIKRPMNAFMVWSQH :::: : .: : :. ..:::::::::::. NP_003 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRG 10 20 30 40 50 60 70 80 90 100 110 pF1KB9 ERRKIMDQWPDMHNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADYPDYKYRPRK .:::. .. : :::.::::::: .:.::...:: ::. ::.::: :: ..::::::::. NP_003 QRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB9 KSKGAPAKARPRPPGG--SGGGSRLKPGPQLPGRGGRRAAGGPLGGGAAAPEDDDEDDDE :.: : . ::: . ::. . : : :. .:: NP_003 KTKTLMKKDKYTLPGGLLAPGGNSMASGV---GVGAGLGAGVNQRMDSYAHMNGWSNGSY 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB9 ELLEVRLVETPGRELWRMVPAGRAARGQAERAQGPSGEGAAAAAAASPTPSEDEEPEEEE NP_003 SMMQDQLGYPQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQG 180 190 200 210 220 230 >>NP_060889 (OMIM: 137940,601618,607823) transcription f (384 aa) initn: 377 init1: 352 opt: 404 Z-score: 287.2 bits: 61.6 E(85289): 2.9e-09 Smith-Waterman score: 405; 35.5% identity (56.2% similar) in 256 aa overlap (16-241:58-310) 10 20 30 40 pF1KB9 MVQQRGARAKRDGGPPPPGPGPAEEGAREPGWCKTPSG---HIKR :: .: :.. : : . .. .:.: NP_060 AAADTRGLAAGPAALAAPAAPASPPSPQRSPPRSPEPGRYGLSPAGRGERQAADESRIRR 30 40 50 60 70 80 50 60 70 80 90 100 pF1KB9 PMNAFMVWSQHERRKIMDQWPDMHNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMA ::::::::.. ::... .: ::.::: .:: ::. :. :. .:: :::.::::::..:. NP_060 PMNAFMVWAKDERKRLAQQNPDLHNAVLSKMLGKAWKELNAAEKRPFVEEAERLRVQHLR 90 100 110 120 130 140 110 120 130 140 pF1KB9 DYPDYKYRPRKKSKG---------------APAKARPRP-PGGSGGGS---RLKP-GPQL :.:.::::::.:... :: . :.: :..::.. .: : : .. NP_060 DHPNYKYRPRRKKQARKARRLEPGLLLPGLAPPQPPPEPFPAASGSARAFRELPPLGAEF 150 160 170 180 190 200 150 160 170 180 190 pF1KB9 PGRGGRRAAGGPLGG---GAAA--PEDDDEDDDEELLEVRLVETPGRELWRMVPAG--RA : : .:: : : :: : .: : : .: :: : :.: : NP_060 DGLGLPTPERSPLDGLEPGEAAFFPPPA-APEDCALRPFRAPYAP-TELSRD-PGGCYGA 210 220 230 240 250 260 200 210 220 230 240 250 pF1KB9 ARGQAERAQGPSGEGAAAAAAASPTPSEDEEPEEEEEEAAAAEEGEEETVASGEESLGFL ..: :. :.. :. .. ::. : :: : .: NP_060 PLAEALRTAPPAAPLAGLYYGTLGTPGPYPGPLSPPPEAPPLESAEPLGPAADLWADVDL 270 280 290 300 310 320 260 270 280 290 300 310 pF1KB9 SRLPPGPAGLDCSALDRDPDLQPPSGTSHFEFPDYCTPEVTEMIAGDWRPSSIADLVFTY NP_060 TEFDQYLNCSRTRPDAPGLPYHVALAKLGPRAMSCPEESSLISALSDASSAVYYSACISG 330 340 350 360 370 380 >>NP_055402 (OMIM: 605923) transcription factor SOX-8 [H (446 aa) initn: 383 init1: 383 opt: 400 Z-score: 283.8 bits: 61.2 E(85289): 4.6e-09 Smith-Waterman score: 417; 49.6% identity (69.8% similar) in 139 aa overlap (20-158:85-211) 10 20 30 40 pF1KB9 MVQQRGARAKRDGGPPPPGPGPAEEGAREPGWCKTPSGHIKRPMNAFMV : :.. :. : . :.::::::::: NP_055 DPAEAADERFPACIRDAVSQVLKGYDWSLVPMPVRGGG---GGALKAKPHVKRPMNAFMV 60 70 80 90 100 110 50 60 70 80 90 100 pF1KB9 WSQHERRKIMDQWPDMHNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADYPDYKY :.: :::. ::.: .::::.:: ::. :.::..::: :::.::::::..: :.::::: NP_055 WAQAARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKY 120 130 140 150 160 170 110 120 130 140 150 160 pF1KB9 RPRKKSKGAPAKARPRPPGGSGGGSRLKPGPQLPGRGGRRAAGGPLGGGAAAPEDDDEDD .::.... ::: : : . : . ::. :: :. : . :: : NP_055 QPRRRKS---AKA-----GHSDSDSGAELGPH-PGGGAVYKAEAGLGDGHHHGDHTGQTH 180 190 200 210 220 170 180 190 200 210 220 pF1KB9 DEELLEVRLVETPGRELWRMVPAGRAARGQAERAQGPSGEGAAAAAAASPTPSEDEEPEE NP_055 GPPTPPTTPKTELQQAGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDAFDV 230 240 250 260 270 280 315 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 22:54:03 2016 done: Sat Nov 5 22:54:05 2016 Total Scan time: 9.830 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]