FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9652, 317 aa
1>>>pF1KB9652 317 - 317 aa - 317 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.0291+/-0.00069; mu= 9.2110+/- 0.042
mean_var=119.0964+/-24.141, 0's: 0 Z-trim(114.1): 76 B-trim: 150 in 2/51
Lambda= 0.117524
statistics sampled from 14635 (14715) to 14635 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.778), E-opt: 0.2 (0.452), width: 16
Scan time: 2.940
The best scores are:                                opt  bits E(32554)
CCDS3239.1 SOX2 gene_id:6657|Hs108|chr3     ( 317) 2167 377.6 7.1e-105
CCDS14669.1 SOX3 gene_id:6658|Hs108|chrX    ( 446)  923 166.8 2.9e-41
CCDS9523.1 SOX1 gene_id:6656|Hs108|chr13    ( 391)  780 142.5 5.2e-34
CCDS9473.1 SOX21 gene_id:11166|Hs108|chr13  ( 276)  620 115.3 5.7e-26
CCDS3094.1 SOX14 gene_id:8403|Hs108|chr3    ( 240)  601 112.0 4.8e-25
CCDS32549.1 SOX15 gene_id:6665|Hs108|chr17  ( 233)  499  94.7 7.5e-20
CCDS6159.1 SOX17 gene_id:64321|Hs108|chr8   ( 414)  466  89.2 5.8e-18
CCDS10428.1 SOX8 gene_id:30812|Hs108|chr16  ( 446)  433  83.7 3e-16
CCDS14772.1 SRY gene_id:6736|Hs108|chrY     ( 204)  424  81.9 4.5e-16
CCDS13552.1 SOX18 gene_id:54345|Hs108|chr20 ( 384)  422  81.8 9.6e-16
CCDS12995.1 SOX12 gene_id:6666|Hs108|chr20  ( 315)  415  80.5 1.9e-15
CCDS13964.1 SOX10 gene_id:6663|Hs108|chr22  ( 466)  412  80.1 3.7e-15
CCDS1654.1 SOX11 gene_id:6664|Hs108|chr2    ( 441)  410  79.8 4.4e-15
CCDS11689.1 SOX9 gene_id:6662|Hs108|chr17   ( 509)  407  79.3 7.1e-15
CCDS4547.1 SOX4 gene_id:6659|Hs108|chr6     ( 474)  402  78.4 1.2e-14
CCDS5977.1 SOX7 gene_id:83595|Hs108|chr8    ( 388)  386  75.7 6.7e-14
CCDS41761.1 SOX5 gene_id:6660|Hs108|chr12   ( 377)  338  67.5 1.8e-11
CCDS58216.1 SOX5 gene_id:6660|Hs108|chr12   ( 642)  338  67.7 2.9e-11
CCDS81672.1 SOX5 gene_id:6660|Hs108|chr12   ( 728)  338  67.7 3.2e-11
CCDS44844.1 SOX5 gene_id:6660|Hs108|chr12   ( 750)  338  67.7 3.2e-11
CCDS58217.1 SOX5 gene_id:6660|Hs108|chr12   ( 753)  338  67.7 3.3e-11
CCDS8699.1 SOX5 gene_id:6660|Hs108|chr12    ( 763)  338  67.7 3.3e-11
CCDS53604.1 SOX6 gene_id:55553|Hs108|chr11  ( 801)  329  66.2 9.9e-11
CCDS53605.1 SOX6 gene_id:55553|Hs108|chr11  ( 804)  329  66.2 9.9e-11
CCDS7821.1 SOX6 gene_id:55553|Hs108|chr11   ( 808)  329  66.2 9.9e-11
>>CCDS3239.1 SOX2 gene_id:6657|Hs108|chr3 (317 aa)
initn: 2167 init1: 2167 opt: 2167 Z-score: 1996.3 bits: 377.6 E(32554): 7.1e-105
Smith-Waterman score: 2167; 100.0% identity (100.0% similar) in 317 aa overlap (1-317:1-317)
               10        20        30        40        50        60
pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMA
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB9 QENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 QENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLM
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB9 KKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 KKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGY
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB9 PQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 PQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB9 GSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 GSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQS
              250       260       270       280       290       300
              310
pF1KB9 GPVPGTAINGTLPLSHM
       :::::::::::::::::
CCDS32 GPVPGTAINGTLPLSHM
              310
>>CCDS14669.1 SOX3 gene_id:6658|Hs108|chrX (446 aa)
initn: 1183 init1: 656 opt: 923 Z-score: 854.2 bits: 166.8 E(32554): 2.9e-41
Smith-Waterman score: 1153; 52.8% identity (74.0% similar) in 377 aa overlap (1-317:76-446)
10 20
pF1KB9 MYNMMETELKPP-G-PQQTSG-------GG
::...::::: : : : :..: ::
CCDS14 ESQGLFTVAAPAPGAPSPPATLAHLLPAPAMYSLLETELKNPVGTPTQAAGTGGPAAPGG
50 60 70 80 90 100
30 40 50 60
pF1KB9 GGNSTAAAAGGNQKNS--------------PDRVKRPMNAFMVWSRGQRRKMAQENPKMH
.:.:.: :::: .... :::::::::::::::::::::: ::::::
CCDS14 AGKSSANAAGGANSGGGSSGGASGGGGGTDQDRVKRPMNAFMVWSRGQRRKMALENPKMH
110 120 130 140 150 160
70 80 90 100 110 120
pF1KB9 NSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTL
::::::::::.::::...::::::::::::::.::::.::::::::::::::.:::::.:
CCDS14 NSEISKRLGADWKLLTDAEKRPFIDEAKRLRAVHMKEYPDYKYRPRRKTKTLLKKDKYSL
170 180 190 200 210 220
130 140 150 160 170 180
pF1KB9 PGGLLAPGGNSMASGVGVGAGLGA---GVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQHP
:.::: ::. . :......:. .. ::.::.:.:.:.:::.::.::..:.:::: : :
CCDS14 PSGLLPPGAAAAAAAAAAAAAAASSPVGVGQRLDTYTHVNGWANGAYSLVQEQLGYAQPP
230 240 250 260 270 280
190 200 210 220
pF1KB9 GLNAHGAAQ-MQPMHRYDVSALQYNSMT--SSQTYMN---------G----SPTYSMS--
.... . ::::::...:::. : ..:.::: : .:. . .
CCDS14 SMSSPPPPPALPPMHRYDMAGLQYSPMMPPGAQSYMNVAAAAAAASGYGGMAPSATAAAA
290 300 310 320 330 340
230 240 250 260 270
pF1KB9 --YSQQ---------GTPGMALGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMIS
:.:: .. .:.:: :::::::: :: ::.. .:::. : ::::::::
CCDS14 AAYGQQPATAAAAAAAAAAMSLGPMGSVVKSEPSSPPPAI--ASHSQRACL-GDLRDMIS
350 360 370 380 390 400
280 290 300 310
pF1KB9 MYLP-GAEVPEPAAP---SRLH-MSQHYQSGPVPGTAINGTLPLSHM
:::: :... . :.: .::: . ::::.. :::.:::.::.:.
CCDS14 MYLPPGGDAADAASPLPGGRLHGVHQHYQGA---GTAVNGTVPLTHI
410 420 430 440
>>CCDS9523.1 SOX1 gene_id:6656|Hs108|chr13 (391 aa)
initn: 1037 init1: 728 opt: 780 Z-score: 724.0 bits: 142.5 E(32554): 5.2e-34
Smith-Waterman score: 1095; 54.0% identity (69.7% similar) in 363 aa overlap (1-288:1-358)
10 20 30 40 50
pF1KB9 MYNMM-ETELKPPGPQQT---------SGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMV
::.:: ::.:. :: :. .:::::.. ....::. : . ::::::::::::
CCDS95 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMV
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB9 WSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKY
::::::::::::::::::::::::::::::..::.:::::::::::::::::::::::::
CCDS95 WSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKY
70 80 90 100 110 120
120 130 140 150
pF1KB9 RPRRKTKTLMKKDKYTLPGGLLAPG----GNSMASGVGVGAGLGAGVNQRMDS-------
:::::::::.:::::.: ::::: : : ..: :::::.: .:.:.::..:
CCDS95 RPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVG-AAAVGQRLESPGGAAGG
130 140 150 160 170
160 170 180 190
pF1KB9 -YAHMNGWSNGSY-----------SMMQD-QLGYPQHPGL-----NAHGAAQM-------
:::.:::.::.: .:::. ::.: :::: .:: :
CCDS95 GYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAH
180 190 200 210 220 230
200 210 220 230
pF1KB9 ----QPMHRYDVSALQYNSMTSSQTYMNGSPT------YSMSYSQQGTPGMA--------
:::::::..::::. ...:: ::..::. :. . . .. : :
CCDS95 PHNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAA
240 250 260 270 280 290
240 250 260 270 280
pF1KB9 -----------LGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPE
::..::.:::: :.::: . .:::::: ::::.:::::::..: .
CCDS95 AAAAAAASSGALGALGSLVKSEPSGSPP---APAHSRAPCP-GDLREMISMYLPAGEGGD
300 310 320 330 340 350
290 300 310
pF1KB9 PAAPSRLHMSQHYQSGPVPGTAINGTLPLSHM
:::
CCDS95 PAAAAAAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI
360 370 380 390
>>CCDS9473.1 SOX21 gene_id:11166|Hs108|chr13 (276 aa)
initn: 624 init1: 569 opt: 620 Z-score: 579.7 bits: 115.3 E(32554): 5.7e-26
Smith-Waterman score: 620; 43.0% identity (62.9% similar) in 272 aa overlap (39-304:6-269)
10 20 30 40 50 60
pF1KB9 LKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMAQENPKMHN
:.:::::::::::::.::::::::::::::
CCDS94 MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHN
10 20 30
70 80 90 100 110 120
pF1KB9 SEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTLP
::::::::::::::.:.::::::::::::::.::::::::::::::: :::.::::...:
CCDS94 SEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFP
40 50 60 70 80 90
130 140 150 160 170 180
pF1KB9 -----GGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQH
::. .. .:.:. :: :.:. .: . .. . .. .::
CCDS94 VPYGLGGVADAEHPALKAGAGLHAGAGGGLVP--ESLLANPEKAAAAAAAAAARVFFPQS
100 110 120 130 140 150
190 200 210 220 230 240
pF1KB9 PGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM-GS
. : .:: :.. : ..:. .. .: : :. :. : : . :.. :.
CCDS94 AAAAAAAAAAAAAGSPYSLLDLG-SKMAEISSSSSGLP-YA---SSLGYPTAGAGAFHGA
160 170 180 190 200
250 260 270 280 290 300
pF1KB9 VVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQSGP
.. . :... . :: .: . : . :. . : : : . : :
CCDS94 AAAAAAAAAAAGGHTHSHP-SPGNPGYMIPCNCSAWPSPGLQPPLAYILLPGMGKPQLDP
210 220 230 240 250 260
310
pF1KB9 VPGTAINGTLPLSHM
:
CCDS94 YPAAYAAAL
270
>>CCDS3094.1 SOX14 gene_id:8403|Hs108|chr3 (240 aa)
initn: 626 init1: 574 opt: 601 Z-score: 563.2 bits: 112.0 E(32554): 4.8e-25
Smith-Waterman score: 601; 48.6% identity (69.5% similar) in 220 aa overlap (39-254:6-216)
10 20 30 40 50 60
pF1KB9 LKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMAQENPKMHN
:..:::::::::::::::::::::::::::
CCDS30 MSKPSDHIKRPMNAFMVWSRGQRRKMAQENPKMHN
10 20 30
70 80 90 100 110 120
pF1KB9 SEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTLP
::::::::::::::::.::::.::::::::: ::::::::::::::: :.:.:::.:..:
CCDS30 SEISKRLGAEWKLLSEAEKRPYIDEAKRLRAQHMKEHPDYKYRPRRKPKNLLKKDRYVFP
40 50 60 70 80 90
130 140 150 160 170 180
pF1KB9 GGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQHPGLNA
:. :.:. :::. : .. . : . ... ::... : . . .:
CCDS30 LPYLGDTDPLKAAGLPVGASDGL-LSAPEKARAFLPP-ASAPYSLLD-----PAQFSSSA
100 110 120 130 140
190 200 210 220 230 240
pF1KB9 HGAAQMQPMHRYDVSALQYNSMTSSQTYMNGS---PT-YSMSYSQQGTPGMALGSMGSVV
: : ..:: : : . :. :: :. .. .. . .::... . ..
CCDS30 IQKMGEVP-HTLATGALPYASTLGYQNGAFGSLSCPSQHTHTHPSPTNPGYVV-PCNCTA
150 160 170 180 190 200
250 260 270 280 290 300
pF1KB9 KSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQSGPVP
: .. .:::
CCDS30 WSASTLQPPVAYILFPGMTKTGIDPYSSAHATAM
210 220 230 240
>>CCDS32549.1 SOX15 gene_id:6665|Hs108|chr17 (233 aa)
initn: 477 init1: 458 opt: 499 Z-score: 469.9 bits: 94.7 E(32554): 7.5e-20
Smith-Waterman score: 499; 43.9% identity (64.5% similar) in 214 aa overlap (13-221:28-227)
10 20 30 40
pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSP-DRVKRP
:::. :.: . :: : . : ..::::
CCDS32 MALPGSSQDQAWSLEPPAATAAASSSSGPQEREGAG----SPAAPG----TLPLEKVKRP
10 20 30 40 50
50 60 70 80 90 100
pF1KB9 MNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKE
:::::::: .:::.:::.:::::::::::::::.::::.: :::::..::::::: :...
CCDS32 MNAFMVWSSAQRRQMAQQNPKMHNSEISKRLGAQWKLLDEDEKRPFVEEAKRLRARHLRD
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB9 HPDYKYRPRRKTKTLMKKDKYTLPG-GLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHM
.::::::::::.:. . : : :: :: . : .. . : . : ::.
CCDS32 YPDYKYRPRRKAKSSGAGPSRCGQGRGNLASGGPLWGPGYATTQP-SRGFGYRPPSYS--
120 130 140 150 160
170 180 190 200 210 220
pF1KB9 NGWSNGSYSMMQDQLGYPQH---PGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGS
... :::. . .: :. : . . ... : . . : .: : . . :.
CCDS32 TAYLPGSYGSSHCKLEAPSPCSLPQSDPRLQGELLPTYTH---YLPPGSPTPYNPPLAGA
170 180 190 200 210 220
230 240 250 260 270 280
pF1KB9 PTYSMSYSQQGTPGMALGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPG
:
CCDS32 PMPLTHL
230
>>CCDS6159.1 SOX17 gene_id:64321|Hs108|chr8 (414 aa)
initn: 467 init1: 398 opt: 466 Z-score: 435.9 bits: 89.2 E(32554): 5.8e-18
Smith-Waterman score: 468; 31.7% identity (56.7% similar) in 312 aa overlap (9-305:36-331)
10 20 30
pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSP
:.: : ....: . .:: : :..... ..
CCDS61 AGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKGEAPANSGAPAGAAGRAKGE
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB9 DRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLR
.:..:::::::::.. .:...::.:: .::.:.:: :: :: :. .:::::..::.:::
CCDS61 SRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEEAERLR
70 80 90 100 110 120
100 110 120 130 140 150
pF1KB9 ALHMKEHPDYKYRPRRKTKTLMKKDKYTLPG---GLLAPGGNSMASGVGVGAGLGAGVNQ
. ::..::.:::::::. . .:. : . : :: : . ... : : : :..
CCDS61 VQHMQDHPNYKYRPRRRKQ--VKRLKRVEGGFLHGLAEPQAAALGPEGGRVAMDGLGLQF
130 140 150 160 170 180
160 170 180 190 200
pF1KB9 RMDSYA--------HMNGWSNGSYSMMQDQL-GYPQHPGLNAHGAAQMQPMHRYDVSALQ
... ::.: :. : ::: : . . .:. : .
CCDS61 PEQGFPAGPPLLPPHMGGHYRDCQSLGAPPLDGYPL-P------TPDTSPLDGVDPDPAF
190 200 210 220 230
210 220 230 240 250 260
pF1KB9 YNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSMGSVVKSE-ASSSPPVVTSSSHSRAPC
. . .. :. .:.. . : : : : . : :. : : . ::
CCDS61 FAAPMPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPRLGPEPAGPSIPGLL------APP
240 250 260 270 280 290
270 280 290 300 310
pF1KB9 QAGDLRDMISMYLPGAEVPE--PAAPSRLHMSQHYQSGPVPGTAINGTLPLSHM
.: . . .: ::: . :.. :. :: . : ::
CCDS61 SALHVY-YGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCRDGTDPS
300 310 320 330 340
CCDS61 QPAELLGEVDRTEFEQYLHFVCKPEMGLPYQGHDSGVNLPDSHGAISSVVSDASSAVYYC
350 360 370 380 390 400
>>CCDS10428.1 SOX8 gene_id:30812|Hs108|chr16 (446 aa)
initn: 387 init1: 387 opt: 433 Z-score: 405.2 bits: 83.7 E(32554): 3e-16
Smith-Waterman score: 441; 31.5% identity (56.0% similar) in 327 aa overlap (14-313:85-396)
10 20 30 40
pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKR
:. . :::: : : .: .:::
CCDS10 DPAEAADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGG---------GALKAKP-HVKR
60 70 80 90 100
50 60 70 80 90 100
pF1KB9 PMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMK
::::::::... :::.:.. :..::.:.:: :: :.::::.:::::..::.:::. : :
CCDS10 PMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKK
110 120 130 140 150 160
110 120 130 140 150 160
pF1KB9 EHPDYKYRPRRKTKTLMKKDKYTLPGGLLAP--GGNSMASGVGVGAGLGAGVNQRMDSYA
.::::::.:::. :. . . :. :.: ::... . . :::: : ... : .
CCDS10 DHPDYKYQPRRR-KSAKAGHSDSDSGAELGPHPGGGAVYK---AEAGLGDG-HHHGDHTG
170 180 190 200 210
170 180 190 200 210
pF1KB9 HMNGWSNGSYSMMQDQLGYPQHPGLNAHG------AAQMQPMHRYDVSALQYNSMTSSQT
. .: . . . .: :. .: . : . :.: :. . : . ..
CCDS10 QTHGPPTPPTTPKTELQQAGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDA
220 230 240 250 260 270
220 230 240 250
pF1KB9 --------YMN-GSPT-------YSMSYSQQG-TPGMALGSMGSVVKSEASSSPPVVTSS
:. :.:. :. .: . : .: : : :. : . ..:: .
CCDS10 FDVHEFDQYLPLGGPAPPEPGQAYGGAYFHAGASPVWAHKSAPSASASPTETGPPRPHIK
280 290 300 310 320 330
260 270 280 290 300 310
pF1KB9 SHSRAPCQAGDLRDMISMY--LPGAEVPEPAAPSRLHMSQHYQSGPVPGTAINGTLPLSH
... .: . :: : : ::::. ... . : . ... :. :
CCDS10 TEQPSPGHYGDQPRGSPDYGSCSGQSSATPAAPAGPFAGSQGDYGDLQASSYYGAYPGYA
340 350 360 370 380 390
pF1KB9 M
CCDS10 PGLYQYPCFHSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP
400 410 420 430 440
>>CCDS14772.1 SRY gene_id:6736|Hs108|chrY (204 aa)
initn: 434 init1: 416 opt: 424 Z-score: 402.0 bits: 81.9 E(32554): 4.5e-16
Smith-Waterman score: 439; 45.6% identity (69.6% similar) in 171 aa overlap (31-200:49-198)
10 20 30 40 50
pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQK-NSPDRVKRPMNAFMVWSRGQRRKM
: :.: : ::::::::::.:::: :::::
CCDS14 PAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKM
20 30 40 50 60 70
60 70 80 90 100 110
pF1KB9 AQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTL
: :::.:.::::::.:: .::.:.:.:: ::..::..:.:.: ...:.::::::::.: .
CCDS14 ALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAK-M
80 90 100 110 120 130
120 130 140 150 160 170
pF1KB9 MKKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLG
. :. :: : .. . : : ..:. . . .....: :. :::
CCDS14 LPKNCSLLP----ADPASVLCSEV------------QLDNRLYRDDCTKATHSRMEHQLG
140 150 160 170 180
180 190 200 210 220 230
pF1KB9 YPQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGS
. : .:: :.. : ::
CCDS14 H--LPPINA--ASSPQQRDRYSHWTKL
190 200
>>CCDS13552.1 SOX18 gene_id:54345|Hs108|chr20 (384 aa)
initn: 439 init1: 383 opt: 422 Z-score: 396.1 bits: 81.8 E(32554): 9.6e-16
Smith-Waterman score: 422; 46.5% identity (74.4% similar) in 129 aa overlap (11-135:51-176)
10 20 30
pF1KB9 MYNMMETELKPPGPQQTSGGGG--GNSTAAAAGGNQKNSP
::.::.. . : . :: .....
CCDS13 AWAPGHGAAADTRGLAAGPAALAAPAAPASPPSPQRSPPRSPEPGRYGLSPAGRGERQAA
30 40 50 60 70 80
40 50 60 70 80 90
pF1KB9 D--RVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKR
: :..:::::::::.. .:...::.:: .::. .:: :: :: :. .:::::..::.:
CCDS13 DESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAVLSKMLGKAWKELNAAEKRPFVEEAER
90 100 110 120 130 140
100 110 120 130 140 150
pF1KB9 LRALHMKEHPDYKYRPRRKTKTLMKKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQR
::. :...::.:::::::: .. .: . :: :: ::
CCDS13 LRVQHLRDHPNYKYRPRRKKQA--RKARRLEPG-LLLPGLAPPQPPPEPFPAASGSARAF
150 160 170 180 190
160 170 180 190 200 210
pF1KB9 MDSYAHMNGWSNGSYSMMQDQLGYPQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTY
CCDS13 RELPPLGAEFDGLGLPTPERSPLDGLEPGEAAFFPPPAAPEDCALRPFRAPYAPTELSRD
200 210 220 230 240 250
317 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 17:58:40 2016 done: Fri Nov 4 17:58:41 2016
Total Scan time: 2.940 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]