Result of FASTA (ccds) for pFN21AB8917
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8917, 260 aa
  1>>>pF1KB8917 260 - 260 aa - 260 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 6.9206+/-0.000834; mu= 9.9561+/- 0.051
 mean_var=178.1111+/-36.621, 0's: 0 Z-trim(113.7): 144  B-trim: 79 in 1/51
 Lambda= 0.096101
 statistics sampled from 14100 (14285) to 14100 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.786), E-opt: 0.2 (0.439), width:  16
 Scan time:  2.500

The best scores are:                                      opt bits E(32554)
CCDS8869.1 HOXC9 gene_id:3225|Hs108|chr12          ( 260) 1815 263.0 1.5e-70
CCDS11534.1 HOXB9 gene_id:3219|Hs108|chr17         ( 250)  714 110.3 1.3e-24
CCDS5409.1 HOXA9 gene_id:3205|Hs108|chr7           ( 272)  693 107.5   1e-23
CCDS2267.2 HOXD9 gene_id:3235|Hs108|chr2           ( 352)  531 85.1 7.1e-17
CCDS5410.2 HOXA10 gene_id:3206|Hs108|chr7          ( 410)  464 75.9 4.9e-14
CCDS8868.1 HOXC10 gene_id:3226|Hs108|chr12         ( 342)  449 73.7 1.8e-13
CCDS2266.1 HOXD10 gene_id:3236|Hs108|chr2          ( 340)  431 71.2   1e-12


>>CCDS8869.1 HOXC9 gene_id:3225|Hs108|chr12               (260 aa)
 initn: 1815 init1: 1815 opt: 1815  Z-score: 1380.1  bits: 263.0 E(32554): 1.5e-70
Smith-Waterman score: 1815; 100.0% identity (100.0% similar) in 260 aa overlap (1-260:1-260)

               10        20        30        40        50        60
pF1KB8 MSATGPISNYYVDSLISHDNEDLLASRFPATGAHPAAARPSGLVPDCSDFPSCSFAPKPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 MSATGPISNYYVDSLISHDNEDLLASRFPATGAHPAAARPSGLVPDCSDFPSCSFAPKPA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 VFSTSWAPVPSQSSVVYHPYGPQPHLGADTRYMRTWLEPLSGAVSFPSFPAGGRHYALKP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 VFSTSWAPVPSQSSVVYHPYGPQPHLGADTRYMRTWLEPLSGAVSFPSFPAGGRHYALKP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 DAYPGRRADCGPGEGRSYPDYMYGSPGELRDRAPQTLPSPEADALAGSKHKEEKADLDPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 DAYPGRRADCGPGEGRSYPDYMYGSPGELRDRAPQTLPSPEADALAGSKHKEEKADLDPS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 NPVANWIHARSTRKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARVLNLTERQVKIWF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 NPVANWIHARSTRKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARVLNLTERQVKIWF
              190       200       210       220       230       240

              250       260
pF1KB8 QNRRMKMKKMNKEKTDKEQS
       ::::::::::::::::::::
CCDS88 QNRRMKMKKMNKEKTDKEQS
              250       260

>>CCDS11534.1 HOXB9 gene_id:3219|Hs108|chr17              (250 aa)
 initn: 769 init1: 485 opt: 714  Z-score: 555.3  bits: 110.3 E(32554): 1.3e-24
Smith-Waterman score: 778; 51.9% identity (68.7% similar) in 262 aa overlap (1-254:1-247)

               10        20        30        40        50        60
pF1KB8 MSATGPISNYYVDSLISHDNEDLLASRFPATGAHPAAARPSGLVPDCSDFPSCSFAPKPA
       :: .: .:.:::::.:::..::   ..:: .: . :..:  : .    .:::::: ::  
CCDS11 MSISGTLSSYYVDSIISHESEDAPPAKFP-SGQY-ASSRQPGHAEHL-EFPSCSFQPKAP
               10        20         30         40         50       

               70            80         90       100       110     
pF1KB8 VFSTSWAPVPSQSS----VVYHPY-GPQPHLGADTRYMRTWLEPLSGAVSFPSFPAGGRH
       ::..::::.  ..:     :::::  ::    :..::.::::::   . . :    :  .
CCDS11 VFGASWAPLSPHASGSLPSVYHPYIQPQGVPPAESRYLRTWLEPAPRGEAAP----GQGQ
        60        70        80        90       100           110   

         120       130         140        150       160       170  
pF1KB8 YALKPDAYPGRRADCGPGE--GRSYPDY-MYGSPGELRDRAPQTLPSPEADALAGSKHKE
        :.: .   :     .:::   .. :.: .  : :.    . :     .     ::. ::
CCDS11 AAVKAEPLLG-----APGELLKQGTPEYSLETSAGREAVLSNQRPGYGDNKICEGSEDKE
           120            130       140       150       160        

            180       190       200       210       220       230  
pF1KB8 EKADLDPSNPVANWIHARSTRKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARVLNLT
       .    : .:: :::.::::.::::::::::::::::::::::::::::::.::::.:::.
CCDS11 RP---DQTNPSANWLHARSSRKKRCPYTKYQTLELEKEFLFNMYLTRDRRHEVARLLNLS
      170          180       190       200       210       220     

            240       250       260
pF1KB8 ERQVKIWFQNRRMKMKKMNKEKTDKEQS
       :::::::::::::::::::::.      
CCDS11 ERQVKIWFQNRRMKMKKMNKEQGKE   
         230       240       250   

>>CCDS5409.1 HOXA9 gene_id:3205|Hs108|chr7                (272 aa)
 initn: 800 init1: 537 opt: 693  Z-score: 539.2  bits: 107.5 E(32554): 1e-23
Smith-Waterman score: 824; 51.3% identity (70.4% similar) in 277 aa overlap (1-258:1-272)

               10         20          30        40        50       
pF1KB8 MSATGPISNYYVDS-LISHDNEDLLA-SRF-PATGAHPAAARPSGLVPDCSDFPSCSFAP
       :..:: ..:::::: :.. :  : :. .:. :.: ..:   : .. . .  ::  :::  
CCDS54 MATTGALGNYYVDSFLLGADAADELSVGRYAPGTLGQPP--RQAATLAEHPDFSPCSFQS
               10        20        30          40        50        

        60        70                  80            90       100   
pF1KB8 KPAVFSTSWAPVPSQSS-----VVYH-----PY-GPQPHLGA---DTRYMRTWLEPLSGA
       : .::..:: :: . ..     .:::     ::  ::  ..:   : ::::.::::  ::
CCDS54 KATVFGASWNPVHAAGANAVPAAVYHHHHHHPYVHPQAPVAAAAPDGRYMRSWLEPTPGA
       60        70        80        90       100       110        

           110       120       130         140       150       160 
pF1KB8 VSFPSFPAGGRHYALKPDAYPGRRADCGPGEGR--SYPDYMYGSPGELRDRAPQTLPSPE
       .:: ..:.. : :..::.   .::.::   . .  :  ::  :::   :.. :.     :
CCDS54 LSFAGLPSS-RPYGIKPEPLSARRGDCPTLDTHTLSLTDYACGSPPVDREKQPSEGAFSE
      120        130       140       150       160       170       

             170       180       190       200       210       220 
pF1KB8 ADALAGSKHKEEKADLDPSNPVANWIHARSTRKKRCPYTKYQTLELEKEFLFNMYLTRDR
        .:   :    .:  .::.::.:::.::::::::::::::.:::::::::::::::::::
CCDS54 NNAENESGG--DKPPIDPNNPAANWLHARSTRKKRCPYTKHQTLELEKEFLFNMYLTRDR
       180         190       200       210       220       230     

             230       240       250       260
pF1KB8 RYEVARVLNLTERQVKIWFQNRRMKMKKMNKEKTDKEQS
       ::::::.:::::::::::::::::::::.::...  :  
CCDS54 RYEVARLLNLTERQVKIWFQNRRMKMKKINKDRAKDE  
         240       250       260       270    

>>CCDS2267.2 HOXD9 gene_id:3235|Hs108|chr2                (352 aa)
 initn: 827 init1: 515 opt: 531  Z-score: 416.4  bits: 85.1 E(32554): 7.1e-17
Smith-Waterman score: 703; 47.8% identity (59.5% similar) in 289 aa overlap (49-257:62-350)

       20        30        40        50        60        70        
pF1KB8 DNEDLLASRFPATGAHPAAARPSGLVPDCSDFPSCSFAPKPAVFSTSWAPVPSQS-----
                                     .: ::::::. ::::.::. ::::      
CCDS22 EVFAARFGPPGPGAQGRPAGVADGPAATAAEFASCSFAPRSAVFSASWSAVPSQPPAAAA
              40        50        60        70        80        90 

             80        90           100                            
pF1KB8 -SVVYHPYGPQPHLGADT----RYMRTWLEPLSG-------------------AVSFPSF
        : .:::: : : :.:..    ::.:.:.::: :                       :: 
CCDS22 MSGLYHPYVPPPPLAASASEPGRYVRSWMEPLPGFPGGAGGGGGGGGGGPGRGPSPGPSG
             100       110       120       130       140       150 

     110       120                            130          140     
pF1KB8 PAGGRHYALKPD--AYPG-------------------RRADCG---PGEGRSYPDYMYGS
       ::.::::..::.  : :.                   .:..:.    ..: : :..  .:
CCDS22 PANGRHYGIKPETRAAPAPATAASTTSSSSTSLSSSSKRTECSVARESQGSSGPEFSCNS
             160       170       180       190       200       210 

                       150       160            170                
pF1KB8 --------------PGELRDRAPQTLPSPEADA-----LAGSKHKEE--------KADLD
                     ::     :  :  : : .:     . : . :::        . .::
CCDS22 FLQEKAAAATGGTGPGAGIGAATGTGGSSEPSACSDHPIPGCSLKEEEKQHSQPQQQQLD
             220       230       240       250       260       270 

      180       190       200       210       220       230        
pF1KB8 PSNPVANWIHARSTRKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARVLNLTERQVKI
       :.::.::::::::::::::::::::::::::::::::::::::::::::.::::::::::
CCDS22 PNNPAANWIHARSTRKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARILNLTERQVKI
             280       290       300       310       320       330 

      240       250       260
pF1KB8 WFQNRRMKMKKMNKEKTDKEQS
       ::::::::::::.:::  :   
CCDS22 WFQNRRMKMKKMSKEKCPKGD 
             340       350   

>>CCDS5410.2 HOXA10 gene_id:3206|Hs108|chr7               (410 aa)
 initn: 470 init1: 392 opt: 464  Z-score: 365.4  bits: 75.9 E(32554): 4.9e-14
Smith-Waterman score: 472; 40.3% identity (58.8% similar) in 243 aa overlap (25-258:175-402)

                     10        20        30          40        50  
pF1KB8       MSATGPISNYYVDSLISHDNEDLLASRFPATGAHPA--AARPSGLVPDCSDFPS
                                     :.. : ..:  :  :  : :  ::   . .
CCDS54 PQPPQPAPQATSCSFAQNIKEESSYCLYDSADKCPKVSATAAELAPFPRGPPPDGCALGT
          150       160       170       180       190       200    

             60        70        80        90       100       110  
pF1KB8 CSFAPKPAVFSTSWAPVPSQSSVVYHPYGPQPHLGADTRYMRTWLEPLSGAVSFPSFPAG
        : .: :. :  : :   ...       :   .:::     .    :  :    :.. .:
CCDS54 SSGVPVPGYFRLSQAYGTAKG--YGSGGGGAQQLGAGPFPAQP---PGRGFDLPPALASG
          210       220         230       240          250         

            120          130       140       150           160     
pF1KB8 GRHYALKP---DAYPGRRADCGPGEGRSYPDYMYGSPGELRDRAP----QTLPSPEADAL
       .   : :    :. :     :: : : .  .  ..: .  .. .:    ..  ::: :.:
CCDS54 SADAARKERALDSPPPPTLACGSGGGSQGDEEAHASSSAAEELSPAPSESSKASPEKDSL
     260       270       280       290       300       310         

         170       180       190       200       210       220     
pF1KB8 AGSKHKEEKADLDPSNPVANWIHARSTRKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEV
       ..::   :.:        :::. :.: :::::::::.:::::::::::::::::.:: :.
CCDS54 GNSKG--ENA--------ANWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEI
     320                 330       340       350       360         

         230       240       250       260      
pF1KB8 ARVLNLTERQVKIWFQNRRMKMKKMNKEKTDKEQS      
       .: ..::.:::::::::::::.::::.:.  .:        
CCDS54 SRSVHLTDRQVKIWFQNRRMKLKKMNRENRIRELTANFNFS
     370       380       390       400       410

>>CCDS8868.1 HOXC10 gene_id:3226|Hs108|chr12              (342 aa)
 initn: 439 init1: 418 opt: 449  Z-score: 355.1  bits: 73.7 E(32554): 1.8e-13
Smith-Waterman score: 470; 35.9% identity (60.2% similar) in 259 aa overlap (6-258:95-334)

                                        10        20        30     
pF1KB8                          MSATGPISNYYVDSLISHDNEDLLASRFPATGAHP
                                     :.:.      ....:   . :    . . :
CCDS88 ALNTYPSYLSQLDSWGDPKAAYRLEQPVGRPLSSCSYPPSVKEENVCCMYSAEKRAKSGP
           70        80        90       100       110       120    

          40            50         60        70        80        90
pF1KB8 AAARPSGLVPD-C---SDFPSCSFA-PKPAVFSTSWAPVPSQSSVVYHPYGPQPHLGADT
        ::  :  .:. :    . :  :.   .:.  . . .:  : ..    :.  .  :.  .
CCDS88 EAALYSHPLPESCLGEHEVPVPSYYRASPSYSALDKTPHCSGANDFEAPFEQRASLNPRA
          130       140       150       160       170       180    

              100       110       120       130       140          
pF1KB8 RYMRTWLEPLSGAVSFPSFPAGGRHYALKPDAYPGRRADCGPGEGRSYPDYMYGSPGEL-
       .....    :.: ::::  : .  .   .:.    ...  ::           :::.:  
CCDS88 EHLES--PQLGGKVSFPETPKSDSQTP-SPNEIKTEQSLAGPK----------GSPSESE
            190       200        210       220                 230 

     150       160       170       180       190       200         
pF1KB8 RDRAPQTLPSPEADALAGSKHKEEKADLDPSNPVANWIHARSTRKKRCPYTKYQTLELEK
       ..::  .  ::.      .. .: : ..   : ..::. :.: :::::::::.:::::::
CCDS88 KERAKAADSSPD------TSDNEAKEEIKAENTTGNWLTAKSGRKKRCPYTKHQTLELEK
             240             250       260       270       280     

     210       220       230       240       250       260      
pF1KB8 EFLFNMYLTRDRRYEVARVLNLTERQVKIWFQNRRMKMKKMNKEKTDKEQS      
       ::::::::::.:: :.....:::.:::::::::::::.::::.:.  .:        
CCDS88 EFLFNMYLTRERRLEISKTINLTDRQVKIWFQNRRMKLKKMNRENRIRELTSNFNFT
         290       300       310       320       330       340  

>>CCDS2266.1 HOXD10 gene_id:3236|Hs108|chr2               (340 aa)
 initn: 485 init1: 411 opt: 431  Z-score: 341.7  bits: 71.2 E(32554): 1e-12
Smith-Waterman score: 435; 39.4% identity (66.2% similar) in 216 aa overlap (43-258:134-332)

             20        30        40        50        60        70  
pF1KB8 DSLISHDNEDLLASRFPATGAHPAAARPSGLVPDCSDFPSCSFAPKPAVFSTSWAPVPSQ
                                     :::.     .   .: :. :  : . . ..
CCDS22 TTNIKEESNCCMYSDKRNKLISAEVPSYQRLVPESCPVEN-PEVPVPGYFRLSQTYATGK
           110       120       130       140        150       160  

             80        90       100       110       120       130  
pF1KB8 SSVVYHPYGPQPHLGADTRYMRTWLEPLSGAVSFPSFPAGGRHYALKPDAYPGRRADCGP
       .    . :. .:. :..: ...  :.:  ::.. :.. :.  ..  : .   . .     
CCDS22 T----QEYNNSPE-GSSTVMLQ--LNP-RGAAK-PQLSAAQLQMEKKMNEPVSGQEPTKV
                170          180         190       200       210   

            140       150       160       170       180       190  
pF1KB8 GEGRSYPDYMYGSPGELRDRAPQTLPSPEADALAGSKHKEEKADLDPSNPVANWIHARST
       .. .: :.   : : :    :  .. :::.      ..:: : ..  ..:..::. :.: 
CCDS22 SQVES-PEAKGGLPEERSCLAEVSVSSPEV------QEKESKEEIKSDTPTSNWLTAKSG
            220       230       240             250       260      

            200       210       220       230       240       250  
pF1KB8 RKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARVLNLTERQVKIWFQNRRMKMKKMNK
       :::::::::.:::::::::::::::::.:: :... .:::.:::::::::::::.:::..
CCDS22 RKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISKSVNLTDRQVKIWFQNRRMKLKKMSR
        270       280       290       300       310       320      

            260      
pF1KB8 EKTDKEQS      
       :.  .:        
CCDS22 ENRIRELTANLTFS
        330       340




260 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 16:29:14 2016 done: Fri Nov  4 16:29:14 2016
 Total Scan time:  2.500 Total Display time:  0.010

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com