Result of FASTA (ccds) for pF1KB5204
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5204, 269 aa
  1>>>pF1KB5204 269 - 269 aa - 269 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 7.7779+/-0.000749; mu= 5.3681+/- 0.046
 mean_var=159.0825+/-32.252, 0's: 0 Z-trim(115.0): 17  B-trim: 0 in 0/51
 Lambda= 0.101686
 statistics sampled from 15544 (15558) to 15544 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.802), E-opt: 0.2 (0.478), width:  16
 Scan time:  2.620

The best scores are:                                      opt bits E(32554)
CCDS5664.1 CPSF4 gene_id:10898|Hs108|chr7          ( 269) 1972 300.2   1e-81
CCDS47652.1 CPSF4 gene_id:10898|Hs108|chr7         ( 244) 1447 223.1 1.5e-58
CCDS83205.1 CPSF4 gene_id:10898|Hs108|chr7         ( 191) 1087 170.2 9.5e-43
CCDS45768.1 CPSF4L gene_id:642843|Hs108|chr17      ( 179)  901 142.9 1.5e-34


>>CCDS5664.1 CPSF4 gene_id:10898|Hs108|chr7               (269 aa)
 initn: 1972 init1: 1972 opt: 1972  Z-score: 1580.5  bits: 300.2 E(32554): 1e-81
Smith-Waterman score: 1972; 100.0% identity (100.0% similar) in 269 aa overlap (1-269:1-269)

               10        20        30        40        50        60
pF1KB5 MQEIIASVDHIKFDLEIAVEQQLGAQPLPFPGMDKSGAAVCEFFLKAACGKGGMCPFRHI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MQEIIASVDHIKFDLEIAVEQQLGAQPLPFPGMDKSGAAVCEFFLKAACGKGGMCPFRHI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 SGEKTVVCKHWLRGLCKKGDQCEFLHEYDMTKMPECYFYSKFGECSNKECPFLHIDPESK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 SGEKTVVCKHWLRGLCKKGDQCEFLHEYDMTKMPECYFYSKFGECSNKECPFLHIDPESK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 IKDCPWYDRGFCKHGPLCRHRHTRRVICVNYLVGFCPEGPSCKFMHPRFELPMGTTEQPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 IKDCPWYDRGFCKHGPLCRHRHTRRVICVNYLVGFCPEGPSCKFMHPRFELPMGTTEQPP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 LPQQTQPPAKQSNNPPLQRSSSLIQLTSQNSSPNQQRTPQVIGVMQSQNSSAGNRGPRPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LPQQTQPPAKQSNNPPLQRSSSLIQLTSQNSSPNQQRTPQVIGVMQSQNSSAGNRGPRPL
              190       200       210       220       230       240

              250       260         
pF1KB5 EQVTCYKCGEKGHYANRCTKGHLAFLSGQ
       :::::::::::::::::::::::::::::
CCDS56 EQVTCYKCGEKGHYANRCTKGHLAFLSGQ
              250       260         

>>CCDS47652.1 CPSF4 gene_id:10898|Hs108|chr7              (244 aa)
 initn: 1477 init1: 1437 opt: 1447  Z-score: 1164.8  bits: 223.1 E(32554): 1.5e-58
Smith-Waterman score: 1752; 90.7% identity (90.7% similar) in 269 aa overlap (1-269:1-244)

               10        20        30        40        50        60
pF1KB5 MQEIIASVDHIKFDLEIAVEQQLGAQPLPFPGMDKSGAAVCEFFLKAACGKGGMCPFRHI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MQEIIASVDHIKFDLEIAVEQQLGAQPLPFPGMDKSGAAVCEFFLKAACGKGGMCPFRHI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 SGEKTVVCKHWLRGLCKKGDQCEFLHEYDMTKMPECYFYSKFGECSNKECPFLHIDPESK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SGEKTVVCKHWLRGLCKKGDQCEFLHEYDMTKMPECYFYSKFGECSNKECPFLHIDPESK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 IKDCPWYDRGFCKHGPLCRHRHTRRVICVNYLVGFCPEGPSCKFMHPRFELPMGTTEQPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 IKDCPWYDRGFCKHGPLCRHRHTRRVICVNYLVGFCPEGPSCKFMHPRFELPMGTTEQPP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 LPQQTQPPAKQSNNPPLQRSSSLIQLTSQNSSPNQQRTPQVIGVMQSQNSSAGNRGPRPL
       :::::::::::                         ::::::::::::::::::::::::
CCDS47 LPQQTQPPAKQ-------------------------RTPQVIGVMQSQNSSAGNRGPRPL
              190                                200       210     

              250       260         
pF1KB5 EQVTCYKCGEKGHYANRCTKGHLAFLSGQ
       :::::::::::::::::::::::::::::
CCDS47 EQVTCYKCGEKGHYANRCTKGHLAFLSGQ
         220       230       240    

>>CCDS83205.1 CPSF4 gene_id:10898|Hs108|chr7              (191 aa)
 initn: 1117 init1: 1077 opt: 1087  Z-score: 880.9  bits: 170.2 E(32554): 9.5e-43
Smith-Waterman score: 1392; 88.4% identity (88.4% similar) in 216 aa overlap (54-269:1-191)

            30        40        50        60        70        80   
pF1KB5 GAQPLPFPGMDKSGAAVCEFFLKAACGKGGMCPFRHISGEKTVVCKHWLRGLCKKGDQCE
                                     ::::::::::::::::::::::::::::::
CCDS83                               MCPFRHISGEKTVVCKHWLRGLCKKGDQCE
                                             10        20        30

            90       100       110       120       130       140   
pF1KB5 FLHEYDMTKMPECYFYSKFGECSNKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 FLHEYDMTKMPECYFYSKFGECSNKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHT
               40        50        60        70        80        90

           150       160       170       180       190       200   
pF1KB5 RRVICVNYLVGFCPEGPSCKFMHPRFELPMGTTEQPPLPQQTQPPAKQSNNPPLQRSSSL
       :::::::::::::::::::::::::::::::::::::::::::::::             
CCDS83 RRVICVNYLVGFCPEGPSCKFMHPRFELPMGTTEQPPLPQQTQPPAK-------------
              100       110       120       130                    

           210       220       230       240       250       260   
pF1KB5 IQLTSQNSSPNQQRTPQVIGVMQSQNSSAGNRGPRPLEQVTCYKCGEKGHYANRCTKGHL
                   ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 ------------QRTPQVIGVMQSQNSSAGNRGPRPLEQVTCYKCGEKGHYANRCTKGHL
                   140       150       160       170       180     

             
pF1KB5 AFLSGQ
       ::::::
CCDS83 AFLSGQ
         190 

>>CCDS45768.1 CPSF4L gene_id:642843|Hs108|chr17           (179 aa)
 initn: 1098 init1: 899 opt: 901  Z-score: 733.9  bits: 142.9 E(32554): 1.5e-34
Smith-Waterman score: 901; 64.4% identity (84.2% similar) in 177 aa overlap (1-175:1-177)

               10        20        30        40        50        60
pF1KB5 MQEIIASVDHIKFDLEIAVEQQLGAQPLPFPGMDKSGAAVCEFFLKAACGKGGMCPFRHI
       :::.::..... : .:  ::.: :.  ::: :::::..:::.:: :. : :: .::::: 
CCDS45 MQEVIAGLERFTFAFEKDVEMQKGTGLLPFQGMDKSASAVCNFFTKGLCEKGKLCPFRHD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 SGEKTVVCKHWLRGLCKKGDQCEFLHEYDMTKMPECYFYSKFGECSNKECPFLHIDPESK
        ::: :::::::::::::::.:.:::.::.:.:::::::::::.:::::: :::. :  :
CCDS45 RGEKMVVCKHWLRGLCKKGDHCKFLHQYDLTRMPECYFYSKFGDCSNKECSFLHVKPAFK
               70        80        90       100       110       120

              130       140       150       160         170        
pF1KB5 IKDCPWYDRGFCKHGPLCRHRHTRRVICVNYLVGFCPEGPSCKFMHP--RFELPMGTTEQ
        .::::::.:::: ::::..::. :..:.:::::::::::.:.: .   .:.:  :.   
CCDS45 SQDCPWYDQGFCKDGPLCKYRHVPRIMCLNYLVGFCPEGPKCQFAQKIREFKLLPGSKI 
              130       140       150       160       170          

      180       190       200       210       220       230        
pF1KB5 PPLPQQTQPPAKQSNNPPLQRSSSLIQLTSQNSSPNQQRTPQVIGVMQSQNSSAGNRGPR




269 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov  5 06:27:38 2016 done: Sat Nov  5 06:27:39 2016
 Total Scan time:  2.620 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions?
Send a message to flexiclone AT kazusagt.com