FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2470, 474 aa
  1>>>pF1KE2470 474 - 474 aa - 474 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.6311+/-0.000413; mu= 2.6342+/- 0.026
 mean_var=407.7304+/-83.553, 0's: 0 Z-trim(123.3): 99 B-trim: 83 in 1/58
 Lambda= 0.063517
 statistics sampled from 42783 (42895) to 42783 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.779), E-opt: 0.2 (0.503), width: 16
 Scan time: 12.280

The best scores are:                                       opt  bits E(85289)
NP_003098 (OMIM: 184430) transcription factor SOX- ( 474) 3123 300.1 8.8e-81
NP_003099 (OMIM: 600898,615866) transcription fact ( 441)  718  79.7 1.8e-14
NP_008874 (OMIM: 601947) transcription factor SOX- ( 315)  647  73.0 1.4e-12
NP_005977 (OMIM: 602148) transcription factor SOX- ( 391)  486  58.3 4.3e-08
NP_009015 (OMIM: 604974) transcription factor SOX- ( 276)  456  55.4 2.3e-07
NP_055402 (OMIM: 605923) transcription factor SOX- ( 446)  442  54.4 7.6e-07
NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapi ( 233)  426  52.6 1.4e-06
NP_005625 (OMIM: 300123,312000,313430) transcripti ( 446)  420  52.4 3.1e-06
NP_003097 (OMIM: 184429,189960,206900) transcripti ( 317)  402  50.5 7.9e-06
NP_008872 (OMIM: 602229,609136,611584,613266) tran ( 466)  405  51.0 8.2e-06
NP_004180 (OMIM: 604747) transcription factor SOX- ( 240)  394  49.6 1.1e-05
NP_000337 (OMIM: 114290,608160,616425) transcripti ( 509)  387  49.4 2.7e-05
NP_071899 (OMIM: 610928,613674) transcription fact ( 414)  376  48.3 4.8e-05
NP_060889 (OMIM: 137940,601618,607823) transcripti ( 384)  369  47.6 7.2e-05
NP_113627 (OMIM: 612202) transcription factor SOX- ( 388)  369  47.6 7.2e-05
NP_821078 (OMIM: 604975,616803) transcription fact ( 377)  343  45.2 0.00037
XP_011519144 (OMIM: 604975,616803) PREDICTED: tran ( 415)  343  45.3 0.00039
NP_001248343 (OMIM: 604975,616803) transcription f ( 642)  343  45.5 0.00051
XP_016875389 (OMIM: 604975,616803) PREDICTED: tran ( 715)  343  45.6 0.00055
XP_016875388 (OMIM: 604975,616803) PREDICTED: tran ( 715)  343  45.6 0.00055
XP_016875390 (OMIM: 604975,616803) PREDICTED: tran ( 715)  343  45.6 0.00055
XP_016875387 (OMIM: 604975,616803) PREDICTED: tran ( 715)  343  45.6 0.00055
XP_016875386 (OMIM: 604975,616803) PREDICTED: tran ( 716)  343  45.6 0.00055
NP_001317714 (OMIM: 604975,616803) transcription f ( 728)  343  45.6 0.00055
XP_011519140 (OMIM: 604975,616803) PREDICTED: tran ( 729)  343  45.6 0.00055
XP_016875385 (OMIM: 604975,616803) PREDICTED: tran ( 750)  343  45.6 0.00056
NP_694534 (OMIM: 604975,616803) transcription fact ( 750)  343  45.6 0.00056
XP_016875384 (OMIM: 604975,616803) PREDICTED: tran ( 751)  343  45.6 0.00056
XP_016875380 (OMIM: 604975,616803) PREDICTED: tran ( 751)  343  45.6 0.00056
XP_016875379 (OMIM: 604975,616803) PREDICTED: tran ( 751)  343  45.6 0.00056
XP_011519137 (OMIM: 604975,616803) PREDICTED: tran ( 751)  343  45.6 0.00056
XP_016875382 (OMIM: 604975,616803) PREDICTED: tran ( 751)  343  45.6 0.00056
XP_016875381 (OMIM: 604975,616803) PREDICTED: tran ( 751)  343  45.6 0.00056
XP_011519139 (OMIM: 604975,616803) PREDICTED: tran ( 751)  343  45.6 0.00056
XP_016875383 (OMIM: 604975,616803) PREDICTED: tran ( 751)  343  45.6 0.00056
XP_011519136 (OMIM: 604975,616803) PREDICTED: tran ( 751)  343  45.6 0.00056
NP_001248344 (OMIM: 604975,616803) transcription f ( 753)  343  45.6 0.00057
XP_011519135 (OMIM: 604975,616803) PREDICTED: tran ( 754)  343  45.6 0.00057
NP_008871 (OMIM: 604975,616803) transcription fact ( 763)  343  45.6 0.00057
XP_011519134 (OMIM: 604975,616803) PREDICTED: tran ( 764)  343  45.6 0.00057
XP_016875378 (OMIM: 604975,616803) PREDICTED: tran ( 792)  343  45.6 0.00058
XP_016875377 (OMIM: 604975,616803) PREDICTED: tran ( 793)  343  45.6 0.00058
NP_201583 (OMIM: 607257) transcription factor SOX- ( 808)  343  45.7 0.00059
XP_005245680 (OMIM: 604748) PREDICTED: transcripti ( 621)  340  45.2
0.00061 NP_005677 (OMIM: 604748) transcription factor SOX- ( 622) 340 45.2 0.00061 NP_001139283 (OMIM: 607257) transcription factor S ( 801) 339 45.3 0.00076 NP_059978 (OMIM: 607257) transcription factor SOX- ( 804) 339 45.3 0.00076 NP_001139291 (OMIM: 607257) transcription factor S ( 841) 339 45.3 0.00078 NP_003131 (OMIM: 400044,400045,480000) sex-determi ( 204) 321 42.9 0.001 >>NP_003098 (OMIM: 184430) transcription factor SOX-4 [H (474 aa) initn: 3123 init1: 3123 opt: 3123 Z-score: 1571.3 bits: 300.1 E(85289): 8.8e-81 Smith-Waterman score: 3123; 100.0% identity (100.0% similar) in 474 aa overlap (1-474:1-474) 10 20 30 40 50 60 pF1KE2 MVQQTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 MVQQTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 RPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 RPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHM 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 ADYPDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 ADYPDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 GGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFAAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 GGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFAAE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 QAGAAALLPLGAAADHHSLYKARTPSASASASSAASASAALAAPGKHLAEKKVKRVYLFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 QAGAAALLPLGAAADHHSLYKARTPSASASASSAASASAALAAPGKHLAEKKVKRVYLFG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 GLGTSSSPVGGVGAGADPSDPLGLYEEEGAGCSPDAPSLSGRSSAASSPAAGRSPADHRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 
GLGTSSSPVGGVGAGADPSDPLGLYEEEGAGCSPDAPSLSGRSSAASSPAAGRSPADHRG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 YASLRAASPAPSSAPSHASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFESMSLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 YASLRAASPAPSSAPSHASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFESMSLG 370 380 390 400 410 420 430 440 450 460 470 pF1KE2 SFSSSSALDRDLDFNFEPGSGSHFEFPDYCTPEVSEMISGDWLESSISNLVFTY :::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_003 SFSSSSALDRDLDFNFEPGSGSHFEFPDYCTPEVSEMISGDWLESSISNLVFTY 430 440 450 460 470 >>NP_003099 (OMIM: 600898,615866) transcription factor S (441 aa) initn: 1098 init1: 628 opt: 718 Z-score: 380.6 bits: 79.7 E(85289): 1.8e-14 Smith-Waterman score: 1010; 43.3% identity (64.9% similar) in 490 aa overlap (1-474:1-441) 10 20 30 40 50 60 pF1KE2 MVQQTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIK ::::... : .:. : :. :. : :. .: ::. . . ::.:::: ::::: NP_003 MVQQAESLE-AESNLPREALDTEEG-EF-MACSPVALDES-------DPDWCKTASGHIK 10 20 30 40 50 70 80 90 100 110 120 pF1KE2 RPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHM ::::::::::.:::::::::::::::::::::::::::.::::.:::::::::::::::: NP_003 RPMNAFMVWSKIERRKIMEQSPDMHNAEISKRLGKRWKMLKDSEKIPFIREAERLRLKHM 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE2 ADYPDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGG ::::::::::::: : . :.. .::..: .: ...:::: .:::.::.... :. NP_003 ADYPDYKYRPRKKPK---MDPSAKPSASQSP----EKSAAGGGGGSAGGGAGGAKTSKGS 120 130 140 150 160 190 200 210 220 230 pF1KE2 GGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHA--KLILAGGGGGGKAAAAAAASFA . . : . . : . :. . .: ::.. .. .: ..:.:::: :. .. : NP_003 SKKCGKLKAPAAAGAKAGAGKAAQSGDYGGAGDDYVLGSLRVSGSGGGG-AGKTVKCVFL 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE2 AEQAGAAALLPLGAAADHHSLYKARTPSASASASSAASASAALAAPGKHLAEKKVKRVYL :. :. .: . :. . . : ::.. ... 
: : NP_003 DEDDDD------DDDDDELQLQIKQEPD---EEDEEPPHQQLLQPPGQQ--PSQLLRRYN 230 240 250 260 270 300 310 320 330 340 350 pF1KE2 FGGLGTSSSPVGGVGAGADPSDPLGLYEEEGAGCSPDAPSLSGRSSAASSPAAGRSPADH . . .::. ....:. . .::.: :: :.: :.: : . NP_003 VAKV--PASPT--LSSSAESPEGASLYDEVRAG--------------ATSGAGGGSRL-Y 280 290 300 310 360 370 380 390 400 410 pF1KE2 RGYASLRAASPAPSSAP--SHASSSASSHSSSSSSSGSSSSDDEFEDDL---LDLNPSSN .. .. : : . : : ::: . : ::::::..::.:. : ::: :.:: :.. NP_003 YSFKNITKQHPPPLAQPALSPASSRSVSTSSSSSSGSSSGSSGEDADDLMFDLSLNFSQS 320 330 340 350 360 370 420 430 440 450 460 pF1KE2 FESMS---LGSFSSSSAL-----DRDLDFNFEPGS-GSHFEFPDYCTPEVSEMISGDWLE .: : ::. .... : :.::: .: :: :::::::::::::.::::.::::: NP_003 AHSASEQQLGGGAAAGNLSLSLVDKDLD-SFSEGSLGSHFEFPDYCTPELSEMIAGDWLE 380 390 400 410 420 430 470 pF1KE2 SSISNLVFTY ...:.::::: NP_003 ANFSDLVFTY 440 >>NP_008874 (OMIM: 601947) transcription factor SOX-12 [ (315 aa) initn: 862 init1: 580 opt: 647 Z-score: 347.0 bits: 73.0 E(85289): 1.4e-12 Smith-Waterman score: 679; 38.5% identity (50.2% similar) in 442 aa overlap (34-474:16-315) 10 20 30 40 50 60 pF1KE2 QTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIKRPM : :: . : : .:.::::::::::::: NP_008 MVQQRGARAKRDGGPPPPGPGPAEEG-AREPGWCKTPSGHIKRPM 10 20 30 40 70 80 90 100 110 120 pF1KE2 NAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHMADY :::::::: ::::::.: :::::::::::::.::.::.::.::::.:::::::::::::: NP_008 NAFMVWSQHERRKIMDQWPDMHNAEISKRLGRRWQLLQDSEKIPFVREAERLRLKHMADY 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE2 PDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGGGGG :::::::::: :..: :...: : NP_008 PDYKYRPRKK--------SKGAPAKARPRPPG---------------------------- 110 120 190 200 210 220 230 240 pF1KE2 ASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFAAEQAG .::::. :: : .. : :: . ::: :: ::: . NP_008 GSGGGSRLKP------GPQLPG-RGGRRA--------AGGPLGGGAAAPEDDD------- 130 140 150 160 250 260 270 280 290 300 pF1KE2 AAALLPLGAAADHHSLYKARTPSASASASSAASASAALAAPGKHLAEKKVKRVYLFGGLG : . : ..: . .::..: :. 
NP_008 ---------EDDDEELLEVRL----------------VETPGREL-----WRMV------ 170 180 190 310 320 330 340 350 360 pF1KE2 TSSSPVGGVGAGADPSDPLGLYEEEGAGCSPDAPSLSGRSSAASSPAAGRSPADHRGYAS :.: .. : :.. : :: : ..:: NP_008 ----PAGRAARGQ---------AERAQG-----PSGEGAAAAA----------------- 200 210 370 380 390 400 410 420 pF1KE2 LRAASPAPSSAPSHASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFESMSLGSFS ::::.:: . .. . . .: .: : : :. .: NP_008 --AASPTPSEDEEPEEEEEEAAAAEEGEEETVASGEESLGFLSRLPPGPA----GL---- 220 230 240 250 260 430 440 450 460 470 pF1KE2 SSSALDRDLDFNFEPGSG-SHFEFPDYCTPEVSEMISGDWLESSISNLVFTY . :::::: : ..: :: :::::::::::::.:::.::: :::..::::: NP_008 DCSALDRDPD--LQPPSGTSHFEFPDYCTPEVTEMIAGDWRPSSIADLVFTY 270 280 290 300 310 >>NP_005977 (OMIM: 602148) transcription factor SOX-1 [H (391 aa) initn: 413 init1: 413 opt: 486 Z-score: 266.3 bits: 58.3 E(85289): 4.3e-08 Smith-Waterman score: 506; 33.9% identity (57.9% similar) in 363 aa overlap (20-378:12-332) 10 20 30 40 50 60 pF1KE2 MVQQTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIK : .:: .... :. .. :: . . :. . ..: NP_005 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVK 10 20 30 40 50 70 80 90 100 110 120 pF1KE2 RPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHM ::::::::::. .:::. ...: :::.::::::: .::......: ::: ::.::: :: NP_005 RPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHM 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE2 ADYPDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGG ..::::::::.:.:. : :: . .:: .:.::::.. : : NP_005 KEHPDYKYRPRRKTKTL---------------LKKDKYSLAGGLLAAGAGGGGAAVAMGV 120 130 140 150 190 200 210 220 230 240 pF1KE2 GGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFAAE : :. :: . . .: : :.:::: . :.. :. :. :::::::.. : NP_005 GVGV---GAAAVGQRLESPG----GAAGGGYA--HVNGWANGAYPGSVAAAAAAAAMMQE 160 170 180 190 200 250 260 270 280 290 300 pF1KE2 QAGAAALLPLGAAADHHSLYKARTPSASASASSAASASAALAAPGKHLAEKKVKRVYLFG : : : .: :.:... : : : : . . . 
: .: NP_005 -----AQLAYG----QH-------PGAGGAHPHAHPAHPHPHHPHAHPHNPQPMHRYDMG 210 220 230 240 250 310 320 330 340 350 pF1KE2 GLGTSSSPVGGVGA--GADPSDPLGL-YEEEGAGCSPDAPSLSGRSSAASSPAAGRSPAD .: . ::... . .:.:: :: : .:. . . . .. . ::.. ::. : . NP_005 AL--QYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAAAAAAAAASSGA 260 270 280 290 300 310 360 370 380 390 400 410 pF1KE2 HRGYASLRAASPAPSS-APSHASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFES . .:: . :. : ::.:. NP_005 LGALGSLVKSEPSGSPPAPAHSRAPCPGDLREMISMYLPAGEGGDPAAAAAAAAQSRLHS 320 330 340 350 360 370 >>NP_009015 (OMIM: 604974) transcription factor SOX-21 [ (276 aa) initn: 505 init1: 393 opt: 456 Z-score: 253.1 bits: 55.4 E(85289): 2.3e-07 Smith-Waterman score: 464; 40.2% identity (65.2% similar) in 244 aa overlap (55-287:4-222) 30 40 50 60 70 80 pF1KE2 GLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIKRPMNAFMVWSQIERRKIMEQSPDM : :.:::::::::::. .:::. ...: : NP_009 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKM 10 20 30 90 100 110 120 130 140 pF1KE2 HNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHMADYPDYKYRPRKKVKSGNANSSSS ::.::::::: .:::: .:.: ::: ::.::: :: ..::::::::.: :. ... . NP_009 HNSEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFA 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE2 AAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGGGGGASGGGANSKPAQKKSCGSKVA : : .:: . . : . .:.. .::.::: . : . . :.: NP_009 F-----PVPYG--LGGVADAEHPALKAGAGLHAGAGGGLV--------PESLLANPEKAA 100 110 120 130 210 220 230 240 250 pF1KE2 GGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFAAEQAGAA-ALLPLGA-----AADHHS ..:.... :.... ..::::::. :: ::. .:: ::. ... . NP_009 AAAAAAA----ARVFFP------QSAAAAAAAAAAAAAGSPYSLLDLGSKMAEISSSSSG 140 150 160 170 180 260 270 280 290 300 310 pF1KE2 LYKART---PSASASA--SSAASASAALAAPGKHLAEKKVKRVYLFGGLGTSSSPVGGVG : : . 
:.:.:.: ..::.:.:: :: : : NP_009 LPYASSLGYPTAGAGAFHGAAAAAAAAAAAAGGHTHSHPSPGNPGYMIPCNCSAWPSPGL 190 200 210 220 230 240 320 330 340 350 360 370 pF1KE2 AGADPSDPLGLYEEEGAGCSPDAPSLSGRSSAASSPAAGRSPADHRGYASLRAASPAPSS NP_009 QPPLAYILLPGMGKPQLDPYPAAYAAAL 250 260 270 >>NP_055402 (OMIM: 605923) transcription factor SOX-8 [H (446 aa) initn: 500 init1: 403 opt: 442 Z-score: 243.9 bits: 54.4 E(85289): 7.6e-07 Smith-Waterman score: 448; 34.7% identity (57.4% similar) in 329 aa overlap (58-375:101-400) 30 40 50 60 70 80 pF1KE2 LGIASSPTPGSTASTGGKADDPSWCKTPSGHIKRPMNAFMVWSQIERRKIMEQSPDMHNA :.::::::::::.: :::. .: : .::: NP_055 AVSQVLKGYDWSLVPMPVRGGGGGALKAKPHVKRPMNAFMVWAQAARRKLADQYPHLHNA 80 90 100 110 120 130 90 100 110 120 130 140 pF1KE2 EISKRLGKRWKLLKDSDKIPFIREAERLRLKHMADYPDYKYRPRKKVKSGNANSSSSAAA :.:: ::: :.::..:.: ::..::::::..: :.:::::.::.. ::..:. : NP_055 ELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKKDHPDYKYQPRRR-KSAKAGHS----- 140 150 160 170 180 150 160 170 180 190 200 pF1KE2 SSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGGGGGASGG--GANSKPAQKKSCGSKVAG :. .:. : : :::. ...:: : : : : . : . . NP_055 --------DSDSGAELGPHPGGGAVYKAEAGLGDGHHHGDHTGQTHGPPTPPTTPKTELQ 190 200 210 220 230 210 220 230 240 250 260 pF1KE2 GAGGGVSKPHAKL----ILAGGGGGGKAAAAAAASFAAEQAGAAALLPLGAAADHHSLYK :: .::. :: . .: . . . . ...: :. . . :.. NP_055 QAG---AKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDAFDVHEF-DQYLPLG 240 250 260 270 280 290 270 280 290 300 310 320 pF1KE2 ARTPSASASASSAASASAALAAPGKHLAEKKVKRVYLFGGLGTSSSPVGGVGAGADPSDP . .: ..: ..: :. :.: :.:.. . .. : ..: .:: : NP_055 GPAPPEPGQAYGGAYFHAG-ASPV--WAHKSAPSA---SASPTETGPPRPHIKTEQPS-P 300 310 320 330 340 330 340 350 360 370 pF1KE2 LGLYEEEGAGCSPDAPSLSGRSSAASSPAAGRSP-ADHRG-YASLRAAS---PAPSSAPS : : .. : ::: : ::.::: .::: .: : .: :..:.:.: :. 
:: NP_055 -GHYGDQPRG-SPDYGSCSGQSSA--TPAAPAGPFAGSQGDYGDLQASSYYGAYPGYAPG 350 360 370 380 390 400 380 390 400 410 420 430 pF1KE2 HASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFESMSLGSFSSSSALDRDLDFNF NP_055 LYQYPCFHSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP 410 420 430 440 >>NP_008873 (OMIM: 601297) protein SOX-15 [Homo sapiens] (233 aa) initn: 446 init1: 396 opt: 426 Z-score: 239.0 bits: 52.6 E(85289): 1.4e-06 Smith-Waterman score: 433; 43.4% identity (69.9% similar) in 173 aa overlap (15-182:3-162) 10 20 30 40 50 pF1KE2 MVQQTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGK----ADDPSWCKT-P : : :.:.. .:: : : ....:.: . : .:. : : NP_008 MALPGSSQDQAWSLEPPAA---TAAASSSSGPQEREGAGSPAAPGTLP 10 20 30 40 60 70 80 90 100 110 pF1KE2 SGHIKRPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERL ..:::::::::::. .::.. .:.: :::.::::::: .:::: ...: ::..::.:: NP_008 LEKVKRPMNAFMVWSSAQRRQMAQQNPKMHNSEISKRLGAQWKLLDEDEKRPFVEEAKRL 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE2 RLKHMADYPDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGGHGGGGGGGSS : .:. ::::::::::.:.::..:. :.. :. :. ..:: : : ... NP_008 RARHLRDYPDYKYRPRRKAKSSGAG----------PSRCGQGRGNLASGGPLWGPGYATT 110 120 130 140 150 180 190 200 210 220 230 pF1KE2 NAGGGGGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAA . . : : NP_008 QPSRGFGYRPPSYSTAYLPGSYGSSHCKLEAPSPCSLPQSDPRLQGELLPTYTHYLPPGS 160 170 180 190 200 210 >>NP_005625 (OMIM: 300123,312000,313430) transcription f (446 aa) initn: 379 init1: 379 opt: 420 Z-score: 233.0 bits: 52.4 E(85289): 3.1e-06 Smith-Waterman score: 445; 31.3% identity (55.8% similar) in 380 aa overlap (16-375:106-442) 10 20 30 40 pF1KE2 MVQQTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGK ::.:: ..:: : :. .. :: :: NP_005 MYSLLETELKNPVGTPTQAAGTGGPAAPGGAGKSSANAAG---GANSGGGSSGGASGGGG 80 90 100 110 120 130 50 60 70 80 90 100 pF1KE2 ADDPSWCKTPSGHIKRPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDK . : . ..:::::::::::. .:::. 
..: :::.::::::: :::: :..: NP_005 GTDQD-------RVKRPMNAFMVWSRGQRRKMALENPKMHNSEISKRLGADWKLLTDAEK 140 150 160 170 180 110 120 130 140 150 160 pF1KE2 IPFIREAERLRLKHMADYPDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGG ::: ::.::: :: .:::::::::.:.:. : :: . .: NP_005 RPFIDEAKRLRAVHMKEYPDYKYRPRRKTKTL---------------LKKDKYSLPSGLL 190 200 210 220 230 170 180 190 200 210 220 pF1KE2 HGGGGGGGSSNAGGGGGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGG :....... :.......: :.. :. . ..: : :.:. : . .: : . NP_005 PPGAAAAAAAAAAAAAAASSPVGVG----QRLDTYTHVNGWANGAYSLVQEQLGYAQPPS 240 250 260 270 280 230 240 250 260 270 pF1KE2 GGKAAAAAAA------SFAAEQ------AGAAALLPLGAAADHHSLYKARTPSASASASS .. : ..:. : :: . . ..::: : : . .:::.:.:.. NP_005 MSSPPPPPALPPMHRYDMAGLQYSPMMPPGAQSYMNVAAAAAAASGYGGMAPSATAAAAA 290 300 310 320 330 340 280 290 300 310 320 330 pF1KE2 AASASAALAAPGKHLAEKKVKRVYLFGGLGTSSSPVGGVGAGADPSDPLGLYEEEGAGCS : . . : :: . : . : .:.:.: . : : .. . .: NP_005 AYGQQPATAAAAAAAAA------------AMSLGPMGSVVKSEPSSPPPAIASHSQRACL 350 360 370 380 390 340 350 360 370 380 pF1KE2 PDAPSL-------SGRSSAASSP-AAGRSPADHRGYASLRAASPAPSSAPSHASSSASSH : .. .: .. :.:: .:: . :. : . :.. . ...: NP_005 GDLRDMISMYLPPGGDAADAASPLPGGRLHGVHQHYQG--AGTAVNGTVPLTHI 400 410 420 430 440 390 400 410 420 430 440 pF1KE2 SSSSSSSGSSSSDDEFEDDLLDLNPSSNFESMSLGSFSSSSALDRDLDFNFEPGSGSHFE >>NP_003097 (OMIM: 184429,189960,206900) transcription f (317 aa) initn: 369 init1: 369 opt: 402 Z-score: 225.7 bits: 50.5 E(85289): 7.9e-06 Smith-Waterman score: 428; 32.3% identity (56.0% similar) in 325 aa overlap (34-352:12-305) 10 20 30 40 50 pF1KE2 QTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGKADDPSWC----KTPSGHI : : .:.. :: . . :. .. NP_003 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRV 10 20 30 40 60 70 80 90 100 110 pF1KE2 KRPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKH :::::::::::. .:::. 
...: :::.::::::: .::::....: ::: ::.::: : NP_003 KRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALH 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE2 MADYPDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGG : ..::::::::.:.:. : :: : : . ::.: :.: NP_003 MKEHPDYKYRPRRKTKT---------------LMKKDKYTLPG----GLLAPGGNSMASG 110 120 130 140 180 190 200 210 220 230 pF1KE2 GGGGAS-GGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFA : ::. :.:.: :. . ... : ..:. : . .: : : .::.. NP_003 VGVGAGLGAGVN----QRMDSYAHMNGWSNGSYSMMQDQLGYPQHPG---LNAHGAAQMQ 150 160 170 180 190 240 250 260 270 280 290 pF1KE2 AEQAGAAALLPLGAAADHHSLYKARTPSASASASSAASASAALAAPGKHLAEKKVKRVYL . .. : .. .. .. : .:. : : :. .. . ::.. :. . . . . NP_003 PMHRYDVSALQYNSMTSSQT-YMNGSPTYSMSYSQQGTPGMALGSMGSVVKSEASSSPPV 200 210 220 230 240 250 300 310 320 330 340 350 pF1KE2 FGGLGTSSSPVGGVGAGADPSDPLGLYEEEGAGCSPDAPS-LSGRSSAASSPAAGRSPAD . . : .: :: : : ...: . : ::: : . :.:. : NP_003 VTSSSHSRAPC---QAG-DLRDMISMYLPGAEVPEPAAPSRLHMSQHYQSGPVPGTAING 260 270 280 290 300 310 360 370 380 390 400 410 pF1KE2 HRGYASLRAASPAPSSAPSHASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFESM NP_003 TLPLSHM >>NP_008872 (OMIM: 602229,609136,611584,613266) transcri (466 aa) initn: 369 init1: 369 opt: 405 Z-score: 225.3 bits: 51.0 E(85289): 8.2e-06 Smith-Waterman score: 435; 32.1% identity (54.8% similar) in 361 aa overlap (58-392:103-451) 30 40 50 60 70 80 pF1KE2 LGIASSPTPGSTASTGGKADDPSWCKTPSGHIKRPMNAFMVWSQIERRKIMEQSPDMHNA :.::::::::::.: :::. .: : .::: NP_008 REAVSQVLSGYDWTLVPMPVRVNGASKSKPHVKRPMNAFMVWAQAARRKLADQYPHLHNA 80 90 100 110 120 130 90 100 110 120 130 140 pF1KE2 EISKRLGKRWKLLKDSDKIPFIREAERLRLKHMADYPDYKYRPRKKVKSGNANSSSSAAA :.:: ::: :.::..::: :::.::::::..: :.:::::.::.. :.:.: .. . NP_008 ELSKTLGKLWRLLNESDKRPFIEEAERLRMQHKKDHPDYKYQPRRR-KNGKAAQGEAEC- 140 150 160 170 180 190 150 160 170 180 190 pF1KE2 SSKPGEKGDKVGGSGGGGHGGGG-------GGGSSNAGGGGGGASGG--GANSKPAQKKS :: .... : .. .: .. : :: . :. :: : . :. :. 
NP_008 ---PGGEAEQGGTAAIQAHYKSAHLDHRHPGEGSPMSDGNPEHPSGQSHGPPTPPTTPKT 200 210 220 230 240 200 210 220 230 240 250 pF1KE2 ---CGS---KVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFAAEQAGAAALLPLGA :. : : . : .::: . . : .. . . .: . : :: .. NP_008 ELQSGKADPKRDGRSMGEGGKPHIDFGNVDIGEISHEVMSNMETF--DVAELDQYLPPNG 250 260 270 280 290 300 260 270 280 290 300 310 pF1KE2 AADHHSLYKARTPSASASASSAASASAALAAP-GKHLAEKKVKRVYLFGGLGT-SSSPVG : : :.: . ... . :.. :: .. : : : . : . . : ...: : NP_008 HPGHVSSYSAAGYGLGSALAVASGHSAWISKPPGVALPTVSPPGVDAKAQVKTETAGPQG 310 320 330 340 350 360 320 330 340 350 360 pF1KE2 GVGAGADPSDP------LGLYEEEGAGCSPDAPSLSGRSSAASSPAAGRSPADHRGYAS- .:: :.: . .: : . :... . :.: : : : :: NP_008 PPHYTDQPSTSQIAYTSLSLPHYGSAFPSISRPQFDYSDHQPSGPYYG-----HSGQASG 370 380 390 400 410 420 370 380 390 400 410 420 pF1KE2 LRAASP--APSSAPSHASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFESMSLGS : .: .::. : ... : : :. .: : NP_008 LYSAFSYMGPSQRPLYTAISDPSPSGPQSHSPTHWEQPVYTTLSRP 430 440 450 460 430 440 450 460 470 pF1KE2 FSSSSALDRDLDFNFEPGSGSHFEFPDYCTPEVSEMISGDWLESSISNLVFTY 474 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 20:29:48 2016 done: Mon Nov 7 20:29:49 2016 Total Scan time: 12.280 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]