Result of FASTA (ccds) for pF1KB5973
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5973, 519 aa
  1>>>pF1KB5973 519 - 519 aa - 519 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 7.8041+/-0.000833; mu= 9.2423+/- 0.051
 mean_var=182.4933+/-37.567, 0's: 0 Z-trim(113.9): 55  B-trim: 696 in 1/51
 Lambda= 0.094940
 statistics sampled from 14454 (14508) to 14454 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.767), E-opt: 0.2 (0.446), width:  16
 Scan time:  3.640

The best scores are:                                      opt bits E(32554)
CCDS3867.1 IRX4 gene_id:50805|Hs108|chr5           ( 519) 3501 491.6 8.9e-139
CCDS75225.1 IRX4 gene_id:50805|Hs108|chr5          ( 545) 2589 366.7 3.7e-101
CCDS32449.1 IRX6 gene_id:79190|Hs108|chr16         ( 446)  876 132.0 1.4e-30
CCDS10750.1 IRX3 gene_id:79191|Hs108|chr16         ( 501)  703 108.4   2e-23
CCDS10751.1 IRX5 gene_id:10265|Hs108|chr16         ( 483)  702 108.2 2.2e-23
CCDS58462.1 IRX5 gene_id:10265|Hs108|chr16         ( 482)  694 107.1 4.6e-23
CCDS3868.1 IRX2 gene_id:153572|Hs108|chr5          ( 471)  568 89.9 7.1e-18
CCDS34132.1 IRX1 gene_id:79192|Hs108|chr5          ( 480)  564 89.3 1.1e-17


>>CCDS3867.1 IRX4 gene_id:50805|Hs108|chr5                (519 aa)
 initn: 3501 init1: 3501 opt: 3501  Z-score: 2604.9  bits: 491.6 E(32554): 8.9e-139
Smith-Waterman score: 3501; 100.0% identity (100.0% similar) in 519 aa overlap (1-519:1-519)

               10        20        30        40        50        60
pF1KB5 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 RHELNSAAALGVYGGPYGGSQGYGNYVTYGSEASAFYSLNSFDSKDGSGSAHGGLAPAAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 RHELNSAAALGVYGGPYGGSQGYGNYVTYGSEASAFYSLNSFDSKDGSGSAHGGLAPAAA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREEPLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREEPLK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 SSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSLDGGLERVPAAPDGPVKEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 SSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSLDGGLERVPAAPDGPVKEA
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 SGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEGGPQVCEAKLGFVPAGASA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 SGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEGGPQVCEAKLGFVPAGASA
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB5 GLEAKPRIWSLAHTATAAAAAATSLSQTEFPSCMLKRQGPAAPAAVSSAPATSPSVALPH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 GLEAKPRIWSLAHTATAAAAAATSLSQTEFPSCMLKRQGPAAPAAVSSAPATSPSVALPH
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB5 SGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLDPGPLGRSLGAGANV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 SGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLDPGPLGRSLGAGANV
              430       440       450       460       470       480

              490       500       510         
pF1KB5 LTAPLARAFPPAVPQDAPAAGAARELLALPKAGGKPFCA
       :::::::::::::::::::::::::::::::::::::::
CCDS38 LTAPLARAFPPAVPQDAPAAGAARELLALPKAGGKPFCA
              490       500       510         

>>CCDS75225.1 IRX4 gene_id:50805|Hs108|chr5               (545 aa)
 initn: 2589 init1: 2589 opt: 2589  Z-score: 1929.6  bits: 366.7 E(32554): 3.7e-101
Smith-Waterman score: 3439; 95.2% identity (95.2% similar) in 545 aa overlap (1-519:1-545)

               10        20        30        40        50        60
pF1KB5 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 RHELNSAAALGVYGGPYGGSQGYGNYVTYGSEASAFYSLNSFDSKDGSGSAHGGLAPAAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 RHELNSAAALGVYGGPYGGSQGYGNYVTYGSEASAFYSLNSFDSKDGSGSAHGGLAPAAA
               70        80        90       100       110       120

              130                                 140       150    
pF1KB5 AYYPYEPALGQYPYDR--------------------------YGTMDSGTRRKNATRETT
       ::::::::::::::::                          ::::::::::::::::::
CCDS75 AYYPYEPALGQYPYDRIKRLGGHPHKGIGLDLSGLGRSPGSLYGTMDSGTRRKNATRETT
              130       140       150       160       170       180

          160       170       180       190       200       210    
pF1KB5 STLKAWLQEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWPPRNKC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 STLKAWLQEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWPPRNKC
              190       200       210       220       230       240

          220       230       240       250       260       270    
pF1KB5 ADEKRPYAEGEEEEGGEEEAREEPLKSSKNAEPVGKEEKELELSDLDDFDPLEAEPPACE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 ADEKRPYAEGEEEEGGEEEAREEPLKSSKNAEPVGKEEKELELSDLDDFDPLEAEPPACE
              250       260       270       280       290       300

          280       290       300       310       320       330    
pF1KB5 LKPPFHSLDGGLERVPAAPDGPVKEASGALRMSLAAGGGAALDEDLERARSCLRSAAAGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 LKPPFHSLDGGLERVPAAPDGPVKEASGALRMSLAAGGGAALDEDLERARSCLRSAAAGP
              310       320       330       340       350       360

          340       350       360       370       380       390    
pF1KB5 EPLPGAEGGPQVCEAKLGFVPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTEFPSCM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 EPLPGAEGGPQVCEAKLGFVPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTEFPSCM
              370       380       390       400       410       420

          400       410       420       430       440       450    
pF1KB5 LKRQGPAAPAAVSSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 LKRQGPAAPAAVSSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLN
              430       440       450       460       470       480

          460       470       480       490       500       510    
pF1KB5 QAWATAKGALLDPGPLGRSLGAGANVLTAPLARAFPPAVPQDAPAAGAARELLALPKAGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 QAWATAKGALLDPGPLGRSLGAGANVLTAPLARAFPPAVPQDAPAAGAARELLALPKAGG
              490       500       510       520       530       540

            
pF1KB5 KPFCA
       :::::
CCDS75 KPFCA
            

>>CCDS32449.1 IRX6 gene_id:79190|Hs108|chr16              (446 aa)
 initn: 765 init1: 452 opt: 876  Z-score: 662.7  bits: 132.0 E(32554): 1.4e-30
Smith-Waterman score: 973; 44.1% identity (66.4% similar) in 440 aa overlap (1-431:1-407)

               10        20        30        40        50        60
pF1KB5 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
       ::.:.::.:: .: ::: ...: .:::::  :...: . ...    . :  :.::::..:
CCDS32 MSFPHFGHPYRGASQFLASASSSTTCCESTQRSVSDVASGSTPAPALCCAPYDSRLLGSA
               10        20        30        40        50        60

               70        80           90        100        110     
pF1KB5 RHELNSAAALGVYGGPYGGS---QGYGNYVTYGSEASAFY-SLN-SFDSKDGSGSAHGGL
       : ::  .::::.::.::...   :.: .:. :. :  ..: .:: ... :...::  ..:
CCDS32 RPEL--GAALGIYGAPYAAAAAAQSYPGYLPYSPEPPSLYGALNPQYEFKEAAGSFTSSL
                 70        80        90       100       110        

         120       130       140         150       160       170   
pF1KB5 APAAAAYYPYEPALGQYPYDRYGTMD-SGT-RRKNATRETTSTLKAWLQEHRKNPYPTKG
       :  .: ::::: .:::: :.:::... ::. :::::::::::::::::.:::::::::::
CCDS32 AQPGA-YYPYERTLGQYQYERYGAVELSGAGRRKNATRETTSTLKAWLNEHRKNPYPTKG
      120        130       140       150       160       170       

           180       190       200       210       220       230   
pF1KB5 EKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEE
       ::::::::::::::::::::::::::::::::::: :.:: ..:..        :::::.
CCDS32 EKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWAPKNKGGEERKA-------EGGEED
       180       190       200       210       220              230

           240       250       260       270       280       290   
pF1KB5 AREEPLKSSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSLDGGLERVPAAP
       .      ..:..  ...: . :.::::.:..  : :    : .          : : .: 
CCDS32 SLGCLTADTKEVT-ASQEARGLRLSDLEDLEEEEEEEEEAEDE----------EVVATAG
              240        250       260       270                   

           300       310       320       330       340       350   
pF1KB5 DGPVKEASGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEGGPQVCEAKLGF
       :  ..  .::  .:: .  .:: .  ::: : :  .:       :..    .   :. : 
CCDS32 DRLTEFRKGA--QSLPGPCAAAREGRLER-RECGLAAPRFSFNDPSGSEEADFLSAETGS
     280         290       300        310       320       330      

           360       370       380         390       400       410 
pF1KB5 VPAGASAGLEAKPRIWSLAHTATAAAA--AATSLSQTEFPSCMLKRQGPAAPAAVSSAPA
                  :::::::::::::.:.  :  .  . . : :   :. :. :      ::
CCDS32 PRLTMHYPCLEKPRIWSLAHTATASAVEGAPPARPRPRSPEC---RMIPGQP------PA
        340       350       360       370          380             

             420       430       440       450       460       470 
pF1KB5 TSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLDPGPLG
       ..  ...:...: :. .  :                                        
CCDS32 SARRLSVPRDSACDESSCIPKAFGNPKFALQGLPLNCAPCPRRSEPVVQCQYPSGAEAG 
       390       400       410       420       430       440       

>>CCDS10750.1 IRX3 gene_id:79191|Hs108|chr16              (501 aa)
 initn: 654 init1: 473 opt: 703  Z-score: 533.9  bits: 108.4 E(32554): 2e-23
Smith-Waterman score: 739; 36.7% identity (56.9% similar) in 534 aa overlap (1-519:1-465)

               10        20        30        40        50        60
pF1KB5 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
       ::.::.:: :   :  :. ..  ..   ::: . : .: .:.:          :.: :..
CCDS10 MSFPQLGYQYIR-P--LYPSERPGAAGGSGGSAGARGGLGAGA----------SELNASG
               10           20        30        40                 

               70        80              90       100        110   
pF1KB5 RHELNSAAALGVYGGPYGGS------QGYGNYVTYGSEASAFYSLNS-FDSKDGSGSAHG
          :... . .:::.::...      :::: .. :..:   : .:.. .. ::. :  : 
CCDS10 --SLSNVLS-SVYGAPYAAAAAAAAAQGYGAFLPYAAELPIFPQLGAQYELKDSPGVQH-
          50         60        70        80        90       100    

           120        130       140       150       160       170  
pF1KB5 GLAPAAAAYYPY-EPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTK
          ::::: .:. .::.  ::: .:   :  .: ::::::.::::::::.::::::::::
CCDS10 ---PAAAAAFPHPHPAF--YPYGQYQFGDP-SRPKNATRESTSTLKAWLNEHRKNPYPTK
              110         120        130       140       150       

            180       190       200       210       220       230  
pF1KB5 GEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEE
       :::::::::::::::::::::::::::::::::::: ::..  .:   :.  .:::  ::
CCDS10 GEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWAPRSRTDEEGNAYGSEREEEDEEE
       160       170       180       190       200       210       

            240       250          260       270       280         
pF1KB5 EAREEPLKSSKNAEPVGKEEKELE---LSDLDDFDPLEAEPPACELKPPFHSLDGGLERV
       . ..   .   . : .: ::..     :.: :. . .. :        :  :: :. .: 
CCDS10 DEEDGKRELELEEEELGGEEEDTGGEGLADDDEDEEIDLENLDGAATEPELSLAGAARRD
       220       230       240       250       260       270       

     290       300       310       320       330       340         
pF1KB5 PAAPDGPVKEASGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEGGPQVCEA
            ::....    . : .  .. .: ::  :    :  :   : : : : ..:..   
CCDS10 GDLGLGPISDS----KNSDSEDSSEGL-ED--RPLPVLSLA---PAPPPVAVASPSLPSP
       280           290       300          310          320       

     350          360       370       380       390       400      
pF1KB5 KLGF---VPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTEFPSCMLKRQGPA-APAA
        ...   .:: : :.   ::.:::::.:::.      :      :.   .  : : ::.:
CCDS10 PVSLDPCAPAPAPASALQKPKIWSLAETATSPDNPRRS-----PPGAGGSPPGAAVAPSA
       330       340       350       360            370       380  

         410       420       430       440       450       460     
pF1KB5 VSSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALL
       .. .::.. ..:  :     :  ..:. ..  :..  :  :               :  :
CCDS10 LQLSPAAAAAAA--H-----RLVSAPLGKFPAWTNRPFPGP-------------PPGPRL
            390              400       410                    420  

         470       480       490       500       510               
pF1KB5 DPGPLGRSLGAGANVLTAPLARAFPPAVPQDAPAAGAARELLALPKAGGKPFCA      
        :  :   ::..      :   ..: :. . : ::. ::   : :. ::   :.      
CCDS10 HPLSL---LGSAP-----PHLLGLPGAAGHPAAAAAFARP--AEPE-GGTDRCSALEVEK
               430            440       450          460       470 

CCDS10 KLLKTAFQPVPRRPQNHLDAALVLSALSSS
             480       490       500 

>>CCDS10751.1 IRX5 gene_id:10265|Hs108|chr16              (483 aa)
 initn: 562 init1: 451 opt: 702  Z-score: 533.4  bits: 108.2 E(32554): 2.2e-23
Smith-Waterman score: 702; 38.9% identity (59.4% similar) in 406 aa overlap (39-422:11-394)

       10        20        30        40        50        60        
pF1KB5 PYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATARHELNSAA
                                     :.::  :   ::.: . ...  : .  . .
CCDS10                     MSYPQGYLYQPSASL-ALYSCPAYSTSVISGPRTDELGRS
                                   10         20        30         

       70        80               90       100       110        120
pF1KB5 ALGVYGGPYGGSQ-------GYGNYVTYGSEASAFYSLNSFDSKDGSGSAHG-GLAPAAA
       . :   .::.::        ::.... ::.. .:  .  .:.:  ::   :  :.: .. 
CCDS10 SSGSAFSPYAGSTAFTAPSPGYNSHLQYGADPAA-AAAAAFSSYVGSPYDHTPGMA-GSL
      40        50        60        70         80        90        

              130       140       150       160       170       180
pF1KB5 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI
       .:.::   ::.:::       . . ::::::..:.::::::.::::::::::::::::::
CCDS10 GYHPYAAPLGSYPY------GDPAYRKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAI
       100       110             120       130       140       150 

              190       200       210       220       230       240
pF1KB5 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREEPLK
       :::::::::::::::::::::::::::: :::.  ::     : ::.   :.. ..:: :
CCDS10 ITKMTLTQVSTWFANARRRLKKENKMTWTPRNRSEDE-----EEEENIDLEKNDEDEPQK
             160       170       180            190       200      

              250       260       270       280        290         
pF1KB5 SSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSL-DGGLERVPAAPDGPVKE
          ...: : :    : .  .  . :.. ::.   :    :: :. ... :.  .: .  
CCDS10 PEDKGDPEGPEAGGAEQKAASGCERLQG-PPTPAGKETEGSLSDSDFKEPPS--EGRLDA
        210       220       230        240       250         260   

     300         310        320       330           340       350  
pF1KB5 ASGALRMS--LAAGGGAA-LDEDLERARSCLRSAAAGPEP----LPGAEGGPQVCEAKLG
        .:  : .    :: .:: : ::    .    . : ::.:    .: . :::.: ..   
CCDS10 LQGPPRTGGPSPAGPAAARLAED-PAPHYPAGAPAPGPHPAAGEVPPGPGGPSVIHSP--
           270       280        290       300       310       320  

            360       370       380          390       400         
pF1KB5 FVPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTE---FPSCMLKRQGPA---APAAV
         :     .. :::..::::. ::..  .  . . .:    : :     : :   . :. 
CCDS10 --PPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCPPCPGPIAGQALGGSRASP
                330       340       350       360       370        

        410       420       430       440       450       460      
pF1KB5 SSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLD
       . ::. :::.  :  :                                            
CCDS10 APAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGHLHGHPGPGPGPTTGPGSH
      380       390       400       410       420       430        

>>CCDS58462.1 IRX5 gene_id:10265|Hs108|chr16              (482 aa)
 initn: 547 init1: 436 opt: 694  Z-score: 527.5  bits: 107.1 E(32554): 4.6e-23
Smith-Waterman score: 694; 38.9% identity (59.4% similar) in 406 aa overlap (39-422:11-393)

       10        20        30        40        50        60        
pF1KB5 PYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATARHELNSAA
                                     :.::  :   ::.: . ...  : .  . .
CCDS58                     MSYPQGYLYQPSASL-ALYSCPAYSTSVISGPRTDELGRS
                                   10         20        30         

       70        80               90       100       110        120
pF1KB5 ALGVYGGPYGGSQ-------GYGNYVTYGSEASAFYSLNSFDSKDGSGSAHG-GLAPAAA
       . :   .::.::        ::.... ::.. .:  .  .:.:  ::   :  :.: .. 
CCDS58 SSGSAFSPYAGSTAFTAPSPGYNSHLQYGADPAA-AAAAAFSSYVGSPYDHTPGMA-GSL
      40        50        60        70         80        90        

              130       140       150       160       170       180
pF1KB5 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI
       .:.::   ::.:::       . . ::::::..:.::::::.::::::::::::::::::
CCDS58 GYHPYAAPLGSYPY------GDPAYRKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAI
       100       110             120       130       140       150 

              190       200       210       220       230       240
pF1KB5 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREEPLK
       :::::::::::::::::::::::::::: :::.  ::     : ::.   :.. ..:: :
CCDS58 ITKMTLTQVSTWFANARRRLKKENKMTWTPRNRSEDE-----EEEENIDLEKNDEDEPQK
             160       170       180            190       200      

              250       260       270       280        290         
pF1KB5 SSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSL-DGGLERVPAAPDGPVKE
          ...: : :    : .  .  . :.. ::.   :    :: :. ... :.  .: .  
CCDS58 PEDKGDPEGPEAGA-EQKAASGCERLQG-PPTPAGKETEGSLSDSDFKEPPS--EGRLDA
        210       220        230        240       250         260  

     300         310        320       330           340       350  
pF1KB5 ASGALRMS--LAAGGGAA-LDEDLERARSCLRSAAAGPEP----LPGAEGGPQVCEAKLG
        .:  : .    :: .:: : ::    .    . : ::.:    .: . :::.: ..   
CCDS58 LQGPPRTGGPSPAGPAAARLAED-PAPHYPAGAPAPGPHPAAGEVPPGPGGPSVIHSP--
            270       280        290       300       310           

            360       370       380          390       400         
pF1KB5 FVPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTE---FPSCMLKRQGPA---APAAV
         :     .. :::..::::. ::..  .  . . .:    : :     : :   . :. 
CCDS58 --PPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCPPCPGPIAGQALGGSRASP
       320       330       340       350       360       370       

        410       420       430       440       450       460      
pF1KB5 SSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLD
       . ::. :::.  :  :                                            
CCDS58 APAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGHLHGHPGPGPGPTTGPGSH
       380       390       400       410       420       430       

>>CCDS3868.1 IRX2 gene_id:153572|Hs108|chr5               (471 aa)
 initn: 554 init1: 443 opt: 568  Z-score: 434.4  bits: 89.9 E(32554): 7.1e-18
Smith-Waterman score: 638; 36.8% identity (55.7% similar) in 476 aa overlap (40-481:11-439)

      10        20        30        40        50        60         
pF1KB5 YSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATARHELNSAAA
                                     : .. :   ::.: .  ::. : :  . .:
CCDS38                     MSYPQGYLYQAPGSLALYSCPAYGASALAAPRSEELARSA
                                   10        20        30        40

      70        80                90       100        110       120
pF1KB5 LGVYGGPYGGSQ--------GYGNYVTYGSEASAFYSLNSFDSKDGSG-SAHGGLAPAAA
        :   .:: ::         :.:. . :...:.:  .  .: :  :.  .::     .: 
CCDS38 SGSAFSPYPGSAAFTAQAATGFGSPLQYSADAAA--AAAGFPSYMGAPYDAHTTGMTGAI
               50        60        70          80        90        

              130       140       150       160       170       180
pF1KB5 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI
       .:.::  :   :::.    ... . ::::::..:.::::::.::::::::::::::::::
CCDS38 SYHPYGSA--AYPYQ----LNDPAYRKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAI
      100         110           120       130       140       150  

              190       200       210       220       230          
pF1KB5 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREE-PL
       :::::::::::::::::::::::::::: ::::  ::       .:.::   ....: : 
CCDS38 ITKMTLTQVSTWFANARRRLKKENKMTWAPRNKSEDE-------DEDEGDATRSKDESPD
            160       170       180              190       200     

     240       250        260       270       280       290        
pF1KB5 KSSKNAEPVGKEEK-ELELSDLDDFDPLEAEPPACELKPPFHSLDGGLERVPAAPDGPVK
       :.....:  ...:   :....: :         .:  .      ::  :..:     :. 
CCDS38 KAQEGTETSAEDEGISLHVDSLTDH--------SCSAES-----DG--EKLPCRAGDPLC
         210       220       230                      240       250

      300       310       320       330       340          350     
pF1KB5 EASGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEG---GPQVCEAKLGF--
       : ::.   .         :.: :  :.      .   :: : :.   .:    :  :   
CCDS38 E-SGSECKDKYDDLEDDEDDDEEGERGLAPPKPVTSSPLTGLEAPLLSPPPEAAPRGGRK
               260       270       280       290       300         

             360          370       380       390        400       
pF1KB5 VPAGA--SAGLE---AKPRIWSLAHTATAAAAAATSLSQTEF-PSCMLKRQGP------A
       .: :.  : :     .::..::::. ::      ..:.:  . :.:     ::      :
CCDS38 TPQGSRTSPGAPPPASKPKLWSLAEIAT------SDLKQPSLGPGC-----GPPGLPAAA
     310       320       330             340            350        

             410       420       430           440       450       
pF1KB5 APAAVSSAPATSPSVALPHSGALDRHQDSPV----TSLRNWVDGVFHDPILRHSTLNQAW
       :::.... :. ::  : :  :    .  ::     :.  :   ..  . .::.   :.: 
CCDS38 APASTGAPPGGSPYPASPLLGR-PLYYTSPFYGNYTNYGNLNAALQGQGLLRY---NSA-
      360       370       380        390       400       410       

       460       470         480       490       500       510     
pF1KB5 ATAKGALLDPGPLGRSLG--AGANVLTAPLARAFPPAVPQDAPAAGAARELLALPKAGGK
       :.: :  :  .: . : .  :::. :                                  
CCDS38 AAAPGEALHTAPKAASDAGKAGAHPLESHYRSPGGGYEPKKDASEGCTVVGGGVQPYL  
           420       430       440       450       460       470   

>>CCDS34132.1 IRX1 gene_id:79192|Hs108|chr5               (480 aa)
 initn: 552 init1: 416 opt: 564  Z-score: 431.3  bits: 89.3 E(32554): 1.1e-17
Smith-Waterman score: 687; 36.3% identity (56.8% similar) in 512 aa overlap (1-502:1-422)

               10        20        30        40        50        60
pF1KB5 MSYPQFGYPYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATA
       ::.::.:::     :.: :..  .   :  : .:: .. ::.: .     . :   :. .
CCDS34 MSFPQLGYP-----QYLSAAGPGAYGGERPG-VLAAAAAAAAAASSGRPGAAE---LGGG
                    10        20         30        40           50 

               70          80        90       100        110       
pF1KB5 RHELNSAAALGVYG--GPYGGSQGYGNYVTYGSEASAFYSLNS-FDSKDGSGSAHGGLAP
             ...::.:.  :::.:. .:. .. :... : : ...: .. ::. :   . .: 
CCDS34 AGAAAVTSVLGMYAAAGPYAGAPNYSAFLPYAADLSLFSQMGSQYELKDNPGVHPATFAA
              60        70        80        90       100       110 

        120       130       140       150       160       170      
pF1KB5 -AAAAYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKI
        .: :::::    ::.   .::  : : : ::::::.::::::::.::::::::::::::
CCDS34 HTAPAYYPY----GQF---QYG--DPG-RPKNATRESTSTLKAWLNEHRKNPYPTKGEKI
             120                 130       140       150       160 

        180       190       200       210       220       230      
pF1KB5 MLAIITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEARE
       :::::::::::::::::::::::::::::.::  :.:  :..     : . ::  :.:..
CCDS34 MLAIITKMTLTQVSTWFANARRRLKKENKVTWGARSK--DQEDGALFGSDTEGDPEKAED
             170       180       190         200       210         

        240       250       260       270       280       290      
pF1KB5 EPLKSSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSLDGGLERVPAAPDGP
       .   . .. .    .:.. . :. :: :  .:: :        :.        ::::.. 
CCDS34 DEEIDLESIDIDKIDEHDGDQSNEDDED--KAEAP--------HA--------PAAPSAL
     220       230       240         250                       260 

        300       310       320       330       340       350      
pF1KB5 VKEASGALRMSLAAGGGAALDEDLERARSCLRSAAAGPEPLPGAEGGPQVCEAKLGFVPA
       ... .. :         :: :  :.   : :  :  .:::     :. ..      . :.
CCDS34 ARDQGSPL---------AAADV-LKPQDSPLGLAKEAPEP-----GSTRL------LSPG
                      270        280       290                  300

        360           370       380       390       400       410  
pF1KB5 GASAGLEA----KPRIWSLAHTATAAAAAATSLSQTEFPSCMLKRQGPAAPAAVSSAPAT
       .:..::..    ::.:::::.:::.  .:    ..   :.     .::.:     .::  
CCDS34 AAAGGLQGAPHGKPKIWSLAETATSPDGAPK--ASPPPPAGHPGAHGPSA-----GAPLQ
              310       320       330         340            350   

            420       430       440       450       460       470  
pF1KB5 SPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLDPGPLGR
        :.  ::  :    :    . .. ::....:                :.:.::.   .  
CCDS34 HPAF-LPSHGLYTCH----IGKFSNWTNSAF---------------LAQGSLLN---MRS
            360           370                      380          390

              480       490       500       510                    
pF1KB5 SLGAGA--NVLTAPLARAFPPAVPQDAPAAGAARELLALPKAGGKPFCA           
        ::.::   .  .:   : ::  :  : : ::                            
CCDS34 FLGVGAPHAAPHGPHLPAPPPPQPPVAIAPGALNGDKASVRSSPTLPERDLVPRPDSPAQ
              400       410       420       430       440       450

CCDS34 QLKSPFQPVRDNSLAPQEGTPRILAALPSA
              460       470       480




519 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov  5 10:49:15 2016 done: Sat Nov  5 10:49:16 2016
 Total Scan time:  3.640 Total Display time:  0.090

Function used was FASTA [36.3.4 Apr, 2011]