Result of FASTA (ccds) for pFN21AB9652
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9652, 317 aa
  1>>>pF1KB9652 317 - 317 aa - 317 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 7.0291+/-0.00069; mu= 9.2110+/- 0.042
 mean_var=119.0964+/-24.141, 0's: 0 Z-trim(114.1): 76  B-trim: 150 in 2/51
 Lambda= 0.117524
 statistics sampled from 14635 (14715) to 14635 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.778), E-opt: 0.2 (0.452), width:  16
 Scan time:  2.940

The best scores are:                                      opt bits E(32554)
CCDS3239.1 SOX2 gene_id:6657|Hs108|chr3            ( 317) 2167 377.6 7.1e-105
CCDS14669.1 SOX3 gene_id:6658|Hs108|chrX           ( 446)  923 166.8 2.9e-41
CCDS9523.1 SOX1 gene_id:6656|Hs108|chr13           ( 391)  780 142.5 5.2e-34
CCDS9473.1 SOX21 gene_id:11166|Hs108|chr13         ( 276)  620 115.3 5.7e-26
CCDS3094.1 SOX14 gene_id:8403|Hs108|chr3           ( 240)  601 112.0 4.8e-25
CCDS32549.1 SOX15 gene_id:6665|Hs108|chr17         ( 233)  499 94.7 7.5e-20
CCDS6159.1 SOX17 gene_id:64321|Hs108|chr8          ( 414)  466 89.2 5.8e-18
CCDS10428.1 SOX8 gene_id:30812|Hs108|chr16         ( 446)  433 83.7   3e-16
CCDS14772.1 SRY gene_id:6736|Hs108|chrY            ( 204)  424 81.9 4.5e-16
CCDS13552.1 SOX18 gene_id:54345|Hs108|chr20        ( 384)  422 81.8 9.6e-16
CCDS12995.1 SOX12 gene_id:6666|Hs108|chr20         ( 315)  415 80.5 1.9e-15
CCDS13964.1 SOX10 gene_id:6663|Hs108|chr22         ( 466)  412 80.1 3.7e-15
CCDS1654.1 SOX11 gene_id:6664|Hs108|chr2           ( 441)  410 79.8 4.4e-15
CCDS11689.1 SOX9 gene_id:6662|Hs108|chr17          ( 509)  407 79.3 7.1e-15
CCDS4547.1 SOX4 gene_id:6659|Hs108|chr6            ( 474)  402 78.4 1.2e-14
CCDS5977.1 SOX7 gene_id:83595|Hs108|chr8           ( 388)  386 75.7 6.7e-14
CCDS41761.1 SOX5 gene_id:6660|Hs108|chr12          ( 377)  338 67.5 1.8e-11
CCDS58216.1 SOX5 gene_id:6660|Hs108|chr12          ( 642)  338 67.7 2.9e-11
CCDS81672.1 SOX5 gene_id:6660|Hs108|chr12          ( 728)  338 67.7 3.2e-11
CCDS44844.1 SOX5 gene_id:6660|Hs108|chr12          ( 750)  338 67.7 3.2e-11
CCDS58217.1 SOX5 gene_id:6660|Hs108|chr12          ( 753)  338 67.7 3.3e-11
CCDS8699.1 SOX5 gene_id:6660|Hs108|chr12           ( 763)  338 67.7 3.3e-11
CCDS53604.1 SOX6 gene_id:55553|Hs108|chr11         ( 801)  329 66.2 9.9e-11
CCDS53605.1 SOX6 gene_id:55553|Hs108|chr11         ( 804)  329 66.2 9.9e-11
CCDS7821.1 SOX6 gene_id:55553|Hs108|chr11          ( 808)  329 66.2 9.9e-11
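
The "best scores" rows above share a fixed layout: accession, gene symbol, a gene_id|build|chromosome field, library sequence length in parentheses, then opt, bits, and E(). A minimal Python sketch for reading them into records follows, assuming that layout holds for every row. The names `Hit`, `HIT_RE`, and `parse_hit` are illustrative and are not part of the FASTA package.

```python
import re
from typing import NamedTuple, Optional

# Pattern for one row of the "The best scores are:" table.
# Assumes the layout seen in this report; other FASTA output modes differ.
HIT_RE = re.compile(
    r"^(\S+)\s+(\S+)\s+gene_id:(\d+)\|\S+\|(\S+)"  # accession, gene, gene_id, chromosome
    r"\s+\(\s*(\d+)\)"                              # library sequence length, e.g. "( 317)"
    r"\s+(\d+)\s+([\d.]+)\s+(\S+)\s*$"              # opt, bits, E()
)


class Hit(NamedTuple):
    accession: str
    gene: str
    gene_id: int
    chrom: str
    length: int
    opt: int
    bits: float
    evalue: float


def parse_hit(line: str) -> Optional[Hit]:
    """Return a Hit for a table row, or None for any other line."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    acc, gene, gid, chrom, length, opt, bits, ev = m.groups()
    return Hit(acc, gene, int(gid), chrom, int(length), int(opt), float(bits), float(ev))


# Two rows copied from the table above, plus the header line, which is skipped.
sample = [
    "The best scores are:                                      opt bits E(32554)",
    "CCDS3239.1 SOX2 gene_id:6657|Hs108|chr3            ( 317) 2167 377.6 7.1e-105",
    "CCDS10428.1 SOX8 gene_id:30812|Hs108|chr16         ( 446)  433 83.7   3e-16",
]
hits = [h for h in map(parse_hit, sample) if h is not None]
for h in hits:
    print(h.gene, h.opt, h.bits, h.evalue)
```

Lines that do not match the row pattern (headers, alignment blocks, blank lines) return None, so the same function can be mapped over the whole report.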


>>CCDS3239.1 SOX2 gene_id:6657|Hs108|chr3                 (317 aa)
 initn: 2167 init1: 2167 opt: 2167  Z-score: 1996.3  bits: 377.6 E(32554): 7.1e-105
Smith-Waterman score: 2167; 100.0% identity (100.0% similar) in 317 aa overlap (1-317:1-317)

               10        20        30        40        50        60
pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 QENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 QENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLM
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 KKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 KKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGY
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 PQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 PQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 GSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 GSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQS
              250       260       270       280       290       300

              310       
pF1KB9 GPVPGTAINGTLPLSHM
       :::::::::::::::::
CCDS32 GPVPGTAINGTLPLSHM
              310       

>>CCDS14669.1 SOX3 gene_id:6658|Hs108|chrX                (446 aa)
 initn: 1183 init1: 656 opt: 923  Z-score: 854.2  bits: 166.8 E(32554): 2.9e-41
Smith-Waterman score: 1153; 52.8% identity (74.0% similar) in 377 aa overlap (1-317:76-446)

                                             10                 20 
pF1KB9                               MYNMMETELKPP-G-PQQTSG-------GG
                                     ::...::::: : : : :..:       ::
CCDS14 ESQGLFTVAAPAPGAPSPPATLAHLLPAPAMYSLLETELKNPVGTPTQAAGTGGPAAPGG
          50        60        70        80        90       100     

              30                      40        50        60       
pF1KB9 GGNSTAAAAGGNQKNS--------------PDRVKRPMNAFMVWSRGQRRKMAQENPKMH
       .:.:.: :::: ....               :::::::::::::::::::::: ::::::
CCDS14 AGKSSANAAGGANSGGGSSGGASGGGGGTDQDRVKRPMNAFMVWSRGQRRKMALENPKMH
         110       120       130       140       150       160     

        70        80        90       100       110       120       
pF1KB9 NSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTL
       ::::::::::.::::...::::::::::::::.::::.::::::::::::::.:::::.:
CCDS14 NSEISKRLGADWKLLTDAEKRPFIDEAKRLRAVHMKEYPDYKYRPRRKTKTLLKKDKYSL
         170       180       190       200       210       220     

       130       140       150          160       170       180    
pF1KB9 PGGLLAPGGNSMASGVGVGAGLGA---GVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQHP
       :.::: ::. . :......:. ..   ::.::.:.:.:.:::.::.::..:.:::: : :
CCDS14 PSGLLPPGAAAAAAAAAAAAAAASSPVGVGQRLDTYTHVNGWANGAYSLVQEQLGYAQPP
         230       240       250       260       270       280     

          190        200       210                      220        
pF1KB9 GLNAHGAAQ-MQPMHRYDVSALQYNSMT--SSQTYMN---------G----SPTYSMS--
       ....      . ::::::...:::. :   ..:.:::         :    .:. . .  
CCDS14 SMSSPPPPPALPPMHRYDMAGLQYSPMMPPGAQSYMNVAAAAAAASGYGGMAPSATAAAA
         290       300       310       320       330       340     

          230                240       250       260       270     
pF1KB9 --YSQQ---------GTPGMALGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMIS
         :.::         .. .:.:: :::::::: :: ::..  .:::.  :  ::::::::
CCDS14 AAYGQQPATAAAAAAAAAAMSLGPMGSVVKSEPSSPPPAI--ASHSQRACL-GDLRDMIS
         350       360       370       380         390        400  

          280          290        300       310       
pF1KB9 MYLP-GAEVPEPAAP---SRLH-MSQHYQSGPVPGTAINGTLPLSHM
       :::: :... . :.:   .::: . ::::..   :::.:::.::.:.
CCDS14 MYLPPGGDAADAASPLPGGRLHGVHQHYQGA---GTAVNGTVPLTHI
            410       420       430          440      

>>CCDS9523.1 SOX1 gene_id:6656|Hs108|chr13                (391 aa)
 initn: 1037 init1: 728 opt: 780  Z-score: 724.0  bits: 142.5 E(32554): 5.2e-34
Smith-Waterman score: 1095; 54.0% identity (69.7% similar) in 363 aa overlap (1-288:1-358)

                10                 20        30        40        50
pF1KB9 MYNMM-ETELKPPGPQQT---------SGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMV
       ::.:: ::.:. ::  :.         .:::::.. ....::. : . ::::::::::::
CCDS95 MYSMMMETDLHSPGGAQAPTNLSGPAGAGGGGGGGGGGGGGGGAKANQDRVKRPMNAFMV
               10        20        30        40        50        60

               60        70        80        90       100       110
pF1KB9 WSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKY
       ::::::::::::::::::::::::::::::..::.:::::::::::::::::::::::::
CCDS95 WSRGQRRKMAQENPKMHNSEISKRLGAEWKVMSEAEKRPFIDEAKRLRALHMKEHPDYKY
               70        80        90       100       110       120

              120       130           140       150                
pF1KB9 RPRRKTKTLMKKDKYTLPGGLLAPG----GNSMASGVGVGAGLGAGVNQRMDS-------
       :::::::::.:::::.: ::::: :    : ..: :::::.: .:.:.::..:       
CCDS95 RPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVGVG-AAAVGQRLESPGGAAGG
              130       140       150       160        170         

      160       170                   180            190           
pF1KB9 -YAHMNGWSNGSY-----------SMMQD-QLGYPQHPGL-----NAHGAAQM-------
        :::.:::.::.:           .:::. ::.: ::::      .:: :          
CCDS95 GYAHVNGWANGAYPGSVAAAAAAAAMMQEAQLAYGQHPGAGGAHPHAHPAHPHPHHPHAH
     180       190       200       210       220       230         

              200       210       220             230              
pF1KB9 ----QPMHRYDVSALQYNSMTSSQTYMNGSPT------YSMSYSQQGTPGMA--------
           :::::::..::::. ...:: ::..::.      :. . .  .. : :        
CCDS95 PHNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGLPYGAAAAAAAAAGGAHQNSAVAA
     240       250       260       270       280       290         

                   240       250       260       270       280     
pF1KB9 -----------LGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPE
                  ::..::.:::: :.:::   . .::::::  ::::.:::::::..:  .
CCDS95 AAAAAAASSGALGALGSLVKSEPSGSPP---APAHSRAPCP-GDLREMISMYLPAGEGGD
     300       310       320          330        340       350     

         290       300       310           
pF1KB9 PAAPSRLHMSQHYQSGPVPGTAINGTLPLSHM    
       :::                                 
CCDS95 PAAAAAAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI
         360       370       380       390 

>>CCDS9473.1 SOX21 gene_id:11166|Hs108|chr13              (276 aa)
 initn: 624 init1: 569 opt: 620  Z-score: 579.7  bits: 115.3 E(32554): 5.7e-26
Smith-Waterman score: 620; 43.0% identity (62.9% similar) in 272 aa overlap (39-304:6-269)

       10        20        30        40        50        60        
pF1KB9 LKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMAQENPKMHN
                                     :.:::::::::::::.::::::::::::::
CCDS94                          MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHN
                                        10        20        30     

       70        80        90       100       110       120        
pF1KB9 SEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTLP
       ::::::::::::::.:.::::::::::::::.::::::::::::::: :::.::::...:
CCDS94 SEISKRLGAEWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFP
          40        50        60        70        80        90     

           130       140       150       160       170       180   
pF1KB9 -----GGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQH
            ::.      .. .:.:. :: :.:.    .:       . .. .    .. .:: 
CCDS94 VPYGLGGVADAEHPALKAGAGLHAGAGGGLVP--ESLLANPEKAAAAAAAAAARVFFPQS
         100       110       120         130       140       150   

           190       200       210       220       230       240   
pF1KB9 PGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM-GS
        .  : .::       :..  :  ..:.  ..  .: : :.   :. : :  . :.. :.
CCDS94 AAAAAAAAAAAAAGSPYSLLDLG-SKMAEISSSSSGLP-YA---SSLGYPTAGAGAFHGA
           160       170        180       190           200        

            250       260       270       280       290       300  
pF1KB9 VVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQSGP
       .. . :...     . ::  .: . : .        :.  .  : :   :    . :  :
CCDS94 AAAAAAAAAAAGGHTHSHP-SPGNPGYMIPCNCSAWPSPGLQPPLAYILLPGMGKPQLDP
      210       220        230       240       250       260       

            310       
pF1KB9 VPGTAINGTLPLSHM
        :             
CCDS94 YPAAYAAAL      
       270            

>>CCDS3094.1 SOX14 gene_id:8403|Hs108|chr3                (240 aa)
 initn: 626 init1: 574 opt: 601  Z-score: 563.2  bits: 112.0 E(32554): 4.8e-25
Smith-Waterman score: 601; 48.6% identity (69.5% similar) in 220 aa overlap (39-254:6-216)

       10        20        30        40        50        60        
pF1KB9 LKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMAQENPKMHN
                                     :..:::::::::::::::::::::::::::
CCDS30                          MSKPSDHIKRPMNAFMVWSRGQRRKMAQENPKMHN
                                        10        20        30     

       70        80        90       100       110       120        
pF1KB9 SEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTLP
       ::::::::::::::::.::::.::::::::: ::::::::::::::: :.:.:::.:..:
CCDS30 SEISKRLGAEWKLLSEAEKRPYIDEAKRLRAQHMKEHPDYKYRPRRKPKNLLKKDRYVFP
          40        50        60        70        80        90     

      130       140       150       160       170       180        
pF1KB9 GGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQHPGLNA
          :.      :.:. :::. :  ..    . : .   ... ::...     : . . .:
CCDS30 LPYLGDTDPLKAAGLPVGASDGL-LSAPEKARAFLPP-ASAPYSLLD-----PAQFSSSA
         100       110        120       130        140             

      190       200       210       220           230       240    
pF1KB9 HGAAQMQPMHRYDVSALQYNSMTSSQTYMNGS---PT-YSMSYSQQGTPGMALGSMGSVV
              : :   ..:: : :  . :.   ::   :. .. .. .  .::...   . ..
CCDS30 IQKMGEVP-HTLATGALPYASTLGYQNGAFGSLSCPSQHTHTHPSPTNPGYVV-PCNCTA
      150        160       170       180       190       200       

          250       260       270       280       290       300    
pF1KB9 KSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQSGPVP
        : .. .:::                                                  
CCDS30 WSASTLQPPVAYILFPGMTKTGIDPYSSAHATAM                          
        210       220       230       240                          

>>CCDS32549.1 SOX15 gene_id:6665|Hs108|chr17              (233 aa)
 initn: 477 init1: 458 opt: 499  Z-score: 469.9  bits: 94.7 E(32554): 7.5e-20
Smith-Waterman score: 499; 43.9% identity (64.5% similar) in 214 aa overlap (13-221:28-227)

                              10        20        30         40    
pF1KB9                MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSP-DRVKRP
                                  :::.  :.:    . :: :    . : ..::::
CCDS32 MALPGSSQDQAWSLEPPAATAAASSSSGPQEREGAG----SPAAPG----TLPLEKVKRP
               10        20        30            40            50  

           50        60        70        80        90       100    
pF1KB9 MNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKE
       :::::::: .:::.:::.:::::::::::::::.::::.: :::::..::::::: :...
CCDS32 MNAFMVWSSAQRRQMAQQNPKMHNSEISKRLGAQWKLLDEDEKRPFVEEAKRLRARHLRD
             60        70        80        90       100       110  

          110       120        130       140       150       160   
pF1KB9 HPDYKYRPRRKTKTLMKKDKYTLPG-GLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHM
       .::::::::::.:.     .    : : :: ::   . : ..    . : . :  ::.  
CCDS32 YPDYKYRPRRKAKSSGAGPSRCGQGRGNLASGGPLWGPGYATTQP-SRGFGYRPPSYS--
            120       130       140       150        160           

           170       180          190       200       210       220
pF1KB9 NGWSNGSYSMMQDQLGYPQH---PGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGS
       ...  :::.  . .:  :.    :  . .  ... : . .    :  .: :  .  . :.
CCDS32 TAYLPGSYGSSHCKLEAPSPCSLPQSDPRLQGELLPTYTH---YLPPGSPTPYNPPLAGA
     170       180       190       200          210       220      

              230       240       250       260       270       280
pF1KB9 PTYSMSYSQQGTPGMALGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPG
       :                                                           
CCDS32 PMPLTHL                                                     
        230                                                        

>>CCDS6159.1 SOX17 gene_id:64321|Hs108|chr8               (414 aa)
 initn: 467 init1: 398 opt: 466  Z-score: 435.9  bits: 89.2 E(32554): 5.8e-18
Smith-Waterman score: 468; 31.7% identity (56.7% similar) in 312 aa overlap (9-305:36-331)

                                     10        20        30        
pF1KB9                       MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSP
                                     :.: : ....: . .:: : :..... .. 
CCDS61 AGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKGEAPANSGAPAGAAGRAKGE
          10        20        30        40        50        60     

       40        50        60        70        80        90        
pF1KB9 DRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLR
       .:..:::::::::.. .:...::.:: .::.:.:: ::  :: :. .:::::..::.:::
CCDS61 SRIRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEEAERLR
          70        80        90       100       110       120     

      100       110       120          130       140       150     
pF1KB9 ALHMKEHPDYKYRPRRKTKTLMKKDKYTLPG---GLLAPGGNSMASGVGVGAGLGAGVNQ
       . ::..::.:::::::. .  .:. : .  :   ::  : . ...   :  :  : :.. 
CCDS61 VQHMQDHPNYKYRPRRRKQ--VKRLKRVEGGFLHGLAEPQAAALGPEGGRVAMDGLGLQF
         130       140         150       160       170       180   

         160               170        180       190       200      
pF1KB9 RMDSYA--------HMNGWSNGSYSMMQDQL-GYPQHPGLNAHGAAQMQPMHRYDVSALQ
         ...         ::.:      :.    : :::  :      . . .:.   : .   
CCDS61 PEQGFPAGPPLLPPHMGGHYRDCQSLGAPPLDGYPL-P------TPDTSPLDGVDPDPAF
           190       200       210        220             230      

        210       220       230       240        250       260     
pF1KB9 YNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSMGSVVKSE-ASSSPPVVTSSSHSRAPC
       . .   ..    :. .:..  .  : :    : :   .  : :. : : .       :: 
CCDS61 FAAPMPGDCPAAGTYSYAQVSDYAGPPEPPAGPMHPRLGPEPAGPSIPGLL------APP
        240       250       260       270       280             290

         270       280         290       300       310             
pF1KB9 QAGDLRDMISMYLPGAEVPE--PAAPSRLHMSQHYQSGPVPGTAINGTLPLSHM      
       .:  .  . .:  :::   .     :.. :. :: .  : ::                  
CCDS61 SALHVY-YGAMGSPGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPSPPPEALPCRDGTDPS
               300       310       320       330       340         

CCDS61 QPAELLGEVDRTEFEQYLHFVCKPEMGLPYQGHDSGVNLPDSHGAISSVVSDASSAVYYC
     350       360       370       380       390       400         

>>CCDS10428.1 SOX8 gene_id:30812|Hs108|chr16              (446 aa)
 initn: 387 init1: 387 opt: 433  Z-score: 405.2  bits: 83.7 E(32554): 3e-16
Smith-Waterman score: 441; 31.5% identity (56.0% similar) in 327 aa overlap (14-313:85-396)

                                10        20        30        40   
pF1KB9                  MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKR
                                     :. . ::::         :  : .: .:::
CCDS10 DPAEAADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGG---------GALKAKP-HVKR
           60        70        80        90                100     

            50        60        70        80        90       100   
pF1KB9 PMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMK
       ::::::::... :::.:.. :..::.:.:: ::  :.::::.:::::..::.:::. : :
CCDS10 PMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRVQHKK
          110       120       130       140       150       160    

           110       120       130         140       150       160 
pF1KB9 EHPDYKYRPRRKTKTLMKKDKYTLPGGLLAP--GGNSMASGVGVGAGLGAGVNQRMDSYA
       .::::::.:::. :.     . .  :. :.:  ::... .   . :::: : ... :  .
CCDS10 DHPDYKYQPRRR-KSAKAGHSDSDSGAELGPHPGGGAVYK---AEAGLGDG-HHHGDHTG
          170        180       190       200          210          

             170       180       190             200       210     
pF1KB9 HMNGWSNGSYSMMQDQLGYPQHPGLNAHG------AAQMQPMHRYDVSALQYNSMTSSQT
       . .:  .   .   .      .: :. .:      . :   .   :.: :. . : . ..
CCDS10 QTHGPPTPPTTPKTELQQAGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGTMDA
     220       230       240       250       260       270         

                  220              230        240       250        
pF1KB9 --------YMN-GSPT-------YSMSYSQQG-TPGMALGSMGSVVKSEASSSPPVVTSS
               :.  :.:.       :. .: . : .:  :  :  :.  : . ..::    .
CCDS10 FDVHEFDQYLPLGGPAPPEPGQAYGGAYFHAGASPVWAHKSAPSASASPTETGPPRPHIK
     280       290       300       310       320       330         

      260       270         280       290       300       310      
pF1KB9 SHSRAPCQAGDLRDMISMY--LPGAEVPEPAAPSRLHMSQHYQSGPVPGTAINGTLPLSH
       ... .: . ::       :    :     ::::.    ... . : . ...  :. :   
CCDS10 TEQPSPGHYGDQPRGSPDYGSCSGQSSATPAAPAGPFAGSQGDYGDLQASSYYGAYPGYA
     340       350       360       370       380       390         

                                                      
pF1KB9 M                                              
                                                      
CCDS10 PGLYQYPCFHSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP
     400       410       420       430       440      

>>CCDS14772.1 SRY gene_id:6736|Hs108|chrY                 (204 aa)
 initn: 434 init1: 416 opt: 424  Z-score: 402.0  bits: 81.9 E(32554): 4.5e-16
Smith-Waterman score: 439; 45.6% identity (69.6% similar) in 171 aa overlap (31-200:49-198)

               10        20        30         40        50         
pF1KB9 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQK-NSPDRVKRPMNAFMVWSRGQRRKM
                                     : :.: :  ::::::::::.:::: :::::
CCDS14 PAVQENIPALRRSSSFLCTESCNSKYQCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKM
       20        30        40        50        60        70        

      60        70        80        90       100       110         
pF1KB9 AQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTL
       : :::.:.::::::.:: .::.:.:.:: ::..::..:.:.: ...:.::::::::.: .
CCDS14 ALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAK-M
       80        90       100       110       120       130        

     120       130       140       150       160       170         
pF1KB9 MKKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLG
       . :.   ::    :  .. . : :            ..:.  . .  .....: :. :::
CCDS14 LPKNCSLLP----ADPASVLCSEV------------QLDNRLYRDDCTKATHSRMEHQLG
       140           150                   160       170       180 

     180       190       200       210       220       230         
pF1KB9 YPQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGS
       .   : .::  :.. :   ::                                       
CCDS14 H--LPPINA--ASSPQQRDRYSHWTKL                                 
                 190       200                                     

>>CCDS13552.1 SOX18 gene_id:54345|Hs108|chr20             (384 aa)
 initn: 439 init1: 383 opt: 422  Z-score: 396.1  bits: 81.8 E(32554): 9.6e-16
Smith-Waterman score: 422; 46.5% identity (74.4% similar) in 129 aa overlap (11-135:51-176)

                                   10        20          30        
pF1KB9                     MYNMMETELKPPGPQQTSGGGG--GNSTAAAAGGNQKNSP
                                     ::.::..   .   :    . :: ..... 
CCDS13 AWAPGHGAAADTRGLAAGPAALAAPAAPASPPSPQRSPPRSPEPGRYGLSPAGRGERQAA
               30        40        50        60        70        80

         40        50        60        70        80        90      
pF1KB9 D--RVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKR
       :  :..:::::::::.. .:...::.:: .::. .:: ::  :: :. .:::::..::.:
CCDS13 DESRIRRPMNAFMVWAKDERKRLAQQNPDLHNAVLSKMLGKAWKELNAAEKRPFVEEAER
               90       100       110       120       130       140

        100       110       120       130       140       150      
pF1KB9 LRALHMKEHPDYKYRPRRKTKTLMKKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQR
       ::. :...::.:::::::: ..  .: .   :: :: ::                     
CCDS13 LRVQHLRDHPNYKYRPRRKKQA--RKARRLEPG-LLLPGLAPPQPPPEPFPAASGSARAF
              150       160         170        180       190       

        160       170       180       190       200       210      
pF1KB9 MDSYAHMNGWSNGSYSMMQDQLGYPQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTY
                                                                   
CCDS13 RELPPLGAEFDGLGLPTPERSPLDGLEPGEAAFFPPPAAPEDCALRPFRAPYAPTELSRD
       200       210       220       230       240       250       




317 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 17:58:40 2016 done: Fri Nov  4 17:58:41 2016
 Total Scan time:  2.940 Total Display time:  0.000

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com