Result of FASTA (ccds) for pFN21AB8941
FASTA searches a protein or DNA sequence data bank, version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8941, 301 aa
  1>>>pF1KB8941 301 - 301 aa - 301 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 8.7084+/-0.000849; mu= 3.6585+/- 0.052
 mean_var=303.6330+/-62.267, 0's: 0 Z-trim(117.1): 131  B-trim: 0 in 0/53
 Lambda= 0.073604
 statistics sampled from 17702 (17845) to 17702 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.548), width:  16
 Scan time:  2.710

The best scores are:                                      opt bits E(32554)
CCDS32675.1 HOXB1 gene_id:3211|Hs108|chr17         ( 301) 2090 234.4 7.9e-62
CCDS5401.1 HOXA1 gene_id:3198|Hs108|chr7           ( 335)  627 79.1 4.9e-15
CCDS2271.1 HOXD1 gene_id:3231|Hs108|chr2           ( 328)  538 69.7 3.4e-12


>>CCDS32675.1 HOXB1 gene_id:3211|Hs108|chr17              (301 aa)
 initn: 2090 init1: 2090 opt: 2090  Z-score: 1223.5  bits: 234.4 E(32554): 7.9e-62
Smith-Waterman score: 2090; 99.7% identity (99.7% similar) in 301 aa overlap (1-301:1-301)

               10        20        30        40        50        60
pF1KB8 MDYNRMNSFLEYPLCNRGPSAYSAHSAPTSFPPSSAQAVDSYASEGRYGGGLSSPAFQQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MDYNRMNSFLEYPLCNRGPSAYSAHSAPTSFPPSSAQAVDSYASEGRYGGGLSSPAFQQN
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 SGYPAQQPPSTLGVPFPSSAPSGYAPAACSPSYGPSQYYPLGQSEGDGGYFHPSSYGAQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 SGYPAQQPPSTLGVPFPSSAPSGYAPAACSPSYGPSQYYPLGQSEGDGGYFHPSSYGAQL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 GGLSDGYGAGGAGPGPYPPQHPPYGNEQTASFAPAYADLLSEDKETPCPSEPNTPTARTF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 GGLSDGYGAGGAGPGPYPPQHPPYGNEQTASFAPAYADLLSEDKETPCPSEPNTPTARTF
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 DWMKVKRNPPKTAKVSEPGLGSPSGLRTNFTTRQLTELEKEFHFNKYLSRARRVEIAATL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 DWMKVKRNPPKTAKVSEPGLGSPSGLRTNFTTRQLTELEKEFHFNKYLSRARRVEIAATL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 ELNETQVKIWFQNRRMKQKKREREGGRVPPAPPGCPKEAAGDASDQSTCTSPEASPSSVT
       :::::::::::::::::::::::: :::::::::::::::::::::::::::::::::::
CCDS32 ELNETQVKIWFQNRRMKQKKREREEGRVPPAPPGCPKEAAGDASDQSTCTSPEASPSSVT
              250       260       270       280       290       300

        
pF1KB8 S
       :
CCDS32 S
        

>>CCDS5401.1 HOXA1 gene_id:3198|Hs108|chr7                (335 aa)
 initn: 623 init1: 386 opt: 627  Z-score: 383.4  bits: 79.1 E(32554): 4.9e-15
Smith-Waterman score: 732; 44.5% identity (66.2% similar) in 337 aa overlap (1-301:1-328)

               10        20              30        40          50  
pF1KB8 MDYNRMNSFLEYPLCNRGPSAY-SAHSAP-----TSFPPSSAQAVDSYASEGRY--GGG-
       ::  :::::::::. . : :.  ::.. :     :.:  : : ...: ... :.  : : 
CCDS54 MDNARMNSFLEYPILSSGDSGTCSARAYPSDHRITTFQ-SCAVSANSCGGDDRFLVGRGV
               10        20        30         40        50         

               60        70        80          90             100  
pF1KB8 -LSSPAFQQNSGYPAQQPPSTLGVPFPSSAPSG--YAPAACSPSYG------PSQYYPLG
        ..::  ...  .   :: .     . .:.  :  :. ..:.::::      : . : :.
CCDS54 QIGSPHHHHHHHHRHPQPAT-----YQTSGNLGVSYSHSSCGPSYGSQNFSAPYSPYALN
      60        70             80        90       100       110    

              110         120              130       140       150 
pF1KB8 QSEGD--GGYFH--PSSYGAQLGG-------LSDGYGAGGAGPGPYPPQHPPYGNEQTAS
       : :.:  ::: .  :. :...:..         .::..:..:   :   :  ::.:. . 
CCDS54 Q-EADVSGGYPQCAPAVYSGNLSSPMVQHHHHHQGYAGGAVGSPQYI--HHSYGQEHQSL
           120       130       140       150       160         170 

             160            170       180       190        200     
pF1KB8 FAPAYADLLSE---DKETPC--PSEPNTPTARTFDWMKVKRNPPKTAKVSEPG-LGSPSG
          .: . ::    ...  :  :.  ..  :.::::::::::::::.::.: : ::.:..
CCDS54 ALATYNNSLSPLHASHQEACRSPASETSSPAQTFDWMKVKRNPPKTGKVGEYGYLGQPNA
             180       190       200       210       220       230 

         210       220       230       240       250       260     
pF1KB8 LRTNFTTRQLTELEKEFHFNKYLSRARRVEIAATLELNETQVKIWFQNRRMKQKKREREG
       .::::::.:::::::::::::::.:::::::::.:.:::::::::::::::::::::.::
CCDS54 VRTNFTTKQLTELEKEFHFNKYLTRARRVEIAASLQLNETQVKIWFQNRRMKQKKREKEG
             240       250       260       270       280       290 

          270       280       290       300        
pF1KB8 GR-VPPAPPGCPKEAAGDASDQSTCTSPEASPSSVTS       
          . :: :    : : ..:..:. .    ::.: ::       
CCDS54 LLPISPATPPGNDEKAEESSEKSSSSPCVPSPGSSTSDTLTTSH
             300       310       320       330     

>>CCDS2271.1 HOXD1 gene_id:3231|Hs108|chr2                (328 aa)
 initn: 427 init1: 352 opt: 538  Z-score: 332.4  bits: 69.7 E(32554): 3.4e-12
Smith-Waterman score: 560; 41.9% identity (58.7% similar) in 315 aa overlap (6-276:1-305)

               10         20        30        40         50        
pF1KB8 MDYNRMNSFLEYPLCNR-GPSAYSAHSAPTSFPPSSAQAVD-SYASEGRYGGGLSSPAFQ
            :.:.:::  :.  :  . .. :   .:  :.:. :  . :     : :     . 
CCDS22      MSSYLEYVSCSSSGGVGGDVLSLAPKFCRSDARPVALQPAFPLGNGDGAFVSCLP
                    10        20        30        40        50     

       60        70          80                    90              
pF1KB8 QNSGYPAQQPPSTLGVPF--PSSAPS------------GYAPAACS--PSYG-----PSQ
         .. :. .::.. . :   : .::.            : :::: .   .::     :. 
CCDS22 LAAARPSPSPPAAPARPSVPPPAAPQYAQCTLEGAYEPGAAPAAAAGGADYGFLGSGPAY
          60        70        80        90       100       110     

       100          110       120             130              140 
pF1KB8 YYP--LGQSEGDGG-YFHPSSYGAQLGG---LSDG---YGAGGAGPGPYPP-------QH
        .:  ::..  ::: . : .. ..  ::   : .:   :.: :  :::.:         :
CCDS22 DFPGVLGRAADDGGSHVHYATSAVFSGGGSFLLSGQVDYAAFGE-PGPFPACLKASADGH
         120       130       140       150        160       170    

             150        160       170       180       190       200
pF1KB8 PPYGNEQTASFAPA-YADLLSEDKETPCPSEPNTPTARTFDWMKVKRNPPKTAKVSEPGL
       :  :  :::: ::. :   .:     :  . : . .  ::.:::::::  : .:..: : 
CCDS22 P--GAFQTASPAPGTYPKSVS-----PASGLPAAFS--TFEWMKVKRNASKKGKLAEYGA
            180       190            200         210       220     

               210       220       230       240       250         
pF1KB8 GSPSG-LRTNFTTRQLTELEKEFHFNKYLSRARRVEIAATLELNETQVKIWFQNRRMKQK
       .:::. .::::.:.:::::::::::::::.::::.:::  :.::.:::::::::::::::
CCDS22 ASPSSAIRTNFSTKQLTELEKEFHFNKYLTRARRIEIANCLHLNDTQVKIWFQNRRMKQK
         230       240       250       260       270       280     

     260          270       280       290       300 
pF1KB8 KREREG---GRVPPAPPGCPKEAAGDASDQSTCTSPEASPSSVTS
       ::::::     .: ::   :                         
CCDS22 KREREGLLATAIPVAPLQLPLSGTTPTKFIKNPGSPSQSQEPS  
         290       300       310       320          




301 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov  8 04:22:10 2016 done: Tue Nov  8 04:22:10 2016
 Total Scan time:  2.710 Total Display time:  0.000

Function used was FASTA [36.3.4 Apr, 2011]