Result of FASTA (ccds) for pFN21AE9541
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE9541, 448 aa
  1>>>pF1KE9541 448 - 448 aa - 448 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 6.0613+/-0.000664; mu= 15.7171+/- 0.040
 mean_var=103.0599+/-20.839, 0's: 0 Z-trim(114.1): 6  B-trim: 878 in 1/52
 Lambda= 0.126337
 statistics sampled from 14720 (14723) to 14720 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.783), E-opt: 0.2 (0.452), width:  16
 Scan time:  3.020

The best scores are:                                      opt bits E(32554)
CCDS11066.1 SLC52A1 gene_id:55065|Hs108|chr17      ( 448) 2978 552.8 2.6e-157
CCDS6423.1 SLC52A2 gene_id:79581|Hs108|chr8        ( 445) 2542 473.3 2.2e-133
CCDS13007.1 SLC52A3 gene_id:113278|Hs108|chr20     ( 469)  528 106.2 7.2e-23
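
The summary table above can be read programmatically. The following is a minimal sketch, not part of the FASTA package: the regular expression and the `parse_hit` helper are assumptions made for illustration, written only against the three hit lines shown in this report (accession, gene symbol, gene_id, build tag, chromosome, length in parentheses, opt, bits, E-value).

```python
import re

# Hypothetical pattern for one line of "The best scores are:" as it appears
# in this report; other FASTA libraries may use a different description field.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<gene>\S+)\s+"
    r"gene_id:(?P<gene_id>\d+)\|(?P<build>[^|]+)\|(?P<chrom>\S+)\s+"
    r"\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)


def parse_hit(line):
    """Return a dict for one summary line, or None if it does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    d = m.groupdict()
    return {
        "accession": d["acc"],
        "gene": d["gene"],
        "gene_id": int(d["gene_id"]),
        "chrom": d["chrom"],
        "length": int(d["length"]),   # library sequence length in aa
        "opt": int(d["opt"]),         # optimized score
        "bits": float(d["bits"]),     # bit score
        "evalue": float(d["evalue"]), # E(32554)
    }


# The three hit lines copied from the table above.
lines = [
    "CCDS11066.1 SLC52A1 gene_id:55065|Hs108|chr17      ( 448) 2978 552.8 2.6e-157",
    "CCDS6423.1 SLC52A2 gene_id:79581|Hs108|chr8        ( 445) 2542 473.3 2.2e-133",
    "CCDS13007.1 SLC52A3 gene_id:113278|Hs108|chr20     ( 469)  528 106.2 7.2e-23",
]
hits = [parse_hit(line) for line in lines]
```

Each entry of `hits` is a plain dictionary, so the hits can be filtered by `evalue` or sorted by `bits` without further parsing.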


>>CCDS11066.1 SLC52A1 gene_id:55065|Hs108|chr17           (448 aa)
 initn: 2978 init1: 2978 opt: 2978  Z-score: 2937.7  bits: 552.8 E(32554): 2.6e-157
Smith-Waterman score: 2978; 99.6% identity (100.0% similar) in 448 aa overlap (1-448:1-448)

               10        20        30        40        50        60
pF1KE9 MAAPTLGRLVLTHLLVALFGMGSWAAVNGIWVELPVVVKDLPEGWSLPSYLSVVVALGNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MAAPTLGRLVLTHLLVALFGMGSWAAVNGIWVELPVVVKDLPEGWSLPSYLSVVVALGNL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 GLLVVTLWRRLAPGKGEQVPIQVVQVLSVVGTALLAPLWHHVAPVAGQLHSVAFLTLALV
       :::::::::.::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GLLVVTLWRQLAPGKGEQVPIQVVQVLSVVGTALLAPLWHHVAPVAGQLHSVAFLTLALV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE9 LAMACCTSNVTFLPFLSHLPPPFLRSFFLGQGLSALLPCVLALVQGVGRLECPPAPTNGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 LAMACCTSNVTFLPFLSHLPPPFLRSFFLGQGLSALLPCVLALVQGVGRLECPPAPTNGT
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE9 SGPPLDFPERFPASTFFWALTALLVTSAAAFRGLLLLLPSLPSVTTGGSGPELQLGSPGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 SGPPLDFPERFPASTFFWALTALLVTSAAAFRGLLLLLPSLPSVTTGGSGPELQLGSPGA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE9 EEEEKEEEEALPLQEPPSQAAGTIPGPDPEVHQLFSAHGAFLLGLMAFTSAVTNGVLPSV
       ::::::::::::::::::::::::::::::.:::::::::::::::::::::::::::::
CCDS11 EEEEKEEEEALPLQEPPSQAAGTIPGPDPEAHQLFSAHGAFLLGLMAFTSAVTNGVLPSV
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE9 QSFSCLPYGRLAYHLAVVLGSAANPLACFLAMGVLCRSLAGLVGLSLLGMLFGAYLMALA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 QSFSCLPYGRLAYHLAVVLGSAANPLACFLAMGVLCRSLAGLVGLSLLGMLFGAYLMALA
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE9 ILSPCPPLVGTTAGVVLVVLSWVLCLCVFSYVKVAASSLLHGGGRPALLAAGVAIQVGSL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 ILSPCPPLVGTTAGVVLVVLSWVLCLCVFSYVKVAASSLLHGGGRPALLAAGVAIQVGSL
              370       380       390       400       410       420

              430       440        
pF1KE9 LGAGAMFPPTSIYHVFQSRKDCVDPCGP
       ::::::::::::::::::::::::::::
CCDS11 LGAGAMFPPTSIYHVFQSRKDCVDPCGP
              430       440        

>>CCDS6423.1 SLC52A2 gene_id:79581|Hs108|chr8             (445 aa)
 initn: 1412 init1: 1412 opt: 2542  Z-score: 2508.3  bits: 473.3 E(32554): 2.2e-133
Smith-Waterman score: 2542; 86.5% identity (93.9% similar) in 446 aa overlap (1-446:1-443)

               10        20        30        40        50        60
pF1KE9 MAAPTLGRLVLTHLLVALFGMGSWAAVNGIWVELPVVVKDLPEGWSLPSYLSVVVALGNL
       ::::: .: ::::::::::::::::::::::::::::::.::::::::::.::.::::::
CCDS64 MAAPTPARPVLTHLLVALFGMGSWAAVNGIWVELPVVVKELPEGWSLPSYVSVLVALGNL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 GLLVVTLWRRLAPGKGEQVPIQVVQVLSVVGTALLAPLWHHVAPVAGQLHSVAFLTLALV
       ::::::::::::::: :::::.:::::..::::::: ::::::::::::::::::.::.:
CCDS64 GLLVVTLWRRLAPGKDEQVPIRVVQVLGMVGTALLASLWHHVAPVAGQLHSVAFLALAFV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE9 LAMACCTSNVTFLPFLSHLPPPFLRSFFLGQGLSALLPCVLALVQGVGRLECPPAPTNGT
       ::.:::.:::::::::::::: :::::::::::::::::::::::::::::::::: :::
CCDS64 LALACCASNVTFLPFLSHLPPRFLRSFFLGQGLSALLPCVLALVQGVGRLECPPAPINGT
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE9 SGPPLDFPERFPASTFFWALTALLVTSAAAFRGLLLLLPSLPSVTTGGSGPELQLGSPGA
        :::::: :::::::::::::::::.:::::.:::::::  ::: ::  :  ::.:.:::
CCDS64 PGPPLDFLERFPASTFFWALTALLVASAAAFQGLLLLLPPPPSVPTGELGSGLQVGAPGA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE9 EEEEKEEEEALPLQEPPSQAAGTIPGPDPEVHQLFSAHGAFLLGLMAFTSAVTNGVLPSV
       :::    ::. :::::::::::: :::::...::.::..: ::::.: :.:.::::::.:
CCDS64 EEE---VEESSPLQEPPSQAAGTTPGPDPKAYQLLSARSACLLGLLAATNALTNGVLPAV
                 250       260       270       280       290       

              310       320       330       340       350       360
pF1KE9 QSFSCLPYGRLAYHLAVVLGSAANPLACFLAMGVLCRSLAGLVGLSLLGMLFGAYLMALA
       :::::::::::::::::::::::::::::::::::::::::: ::::::.. :.::::::
CCDS64 QSFSCLPYGRLAYHLAVVLGSAANPLACFLAMGVLCRSLAGLGGLSLLGVFCGGYLMALA
       300       310       320       330       340       350       

              370       380       390       400       410       420
pF1KE9 ILSPCPPLVGTTAGVVLVVLSWVLCLCVFSYVKVAASSLLHGGGRPALLAAGVAIQVGSL
       .::::::::::.:::::::::::::: :::::::::::::::::::::::::::::::::
CCDS64 VLSPCPPLVGTSAGVVLVVLSWVLCLGVFSYVKVAASSLLHGGGRPALLAAGVAIQVGSL
       360       370       380       390       400       410       

              430       440        
pF1KE9 LGAGAMFPPTSIYHVFQSRKDCVDPCGP
       ::: ::::::::::::.:::::.:::  
CCDS64 LGAVAMFPPTSIYHVFHSRKDCADPCDS
       420       430       440     
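
In the alignment blocks, ':' on the middle line marks an identical residue pair and '.' marks a conservative replacement. As a rough cross-check of that convention, the sketch below counts identical positions in the last block of the SLC52A2 alignment. The function name `count_identities` is an assumption made for illustration; it is not part of FASTA, and it treats '-' as a gap character.

```python
def count_identities(query, subject):
    """Count aligned positions where both rows carry the same residue.

    Both strings must be rows of the same alignment block (equal length).
    Positions where the shared character is a gap ('-') are not counted.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# Final block of the pF1KE9541 vs CCDS6423.1 alignment, copied from above.
query_row = "LGAGAMFPPTSIYHVFQSRKDCVDPCGP"
subject_row = "LGAVAMFPPTSIYHVFHSRKDCADPCDS"
marker_row = "::: ::::::::::::.:::::.:::  "

identical = count_identities(query_row, subject_row)
colons = marker_row.count(":")
```

For this block, `identical` and `colons` agree, which is consistent with ':' marking identities only.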

>>CCDS13007.1 SLC52A3 gene_id:113278|Hs108|chr20          (469 aa)
 initn: 1060 init1: 506 opt: 528  Z-score: 524.1  bits: 106.2 E(32554): 7.2e-23
Smith-Waterman score: 1022; 41.7% identity (64.2% similar) in 472 aa overlap (11-446:4-467)

               10        20        30        40        50        60
pF1KE9 MAAPTLGRLVLTHLLVALFGMGSWAAVNGIWVELPVVVKDLPEGWSLPSYLSVVVALGNL
                 : :::: .::::::...::.:::::..: .::::: :::::.::. :.:.
CCDS13        MAFLMHLLVCVFGMGSWVTINGLWVELPLLVMELPEGWYLPSYLTVVIQLANI
                      10        20        30        40        50   

               70        80          90       100       110        
pF1KE9 GLLVVTLWRRLAPGKGEQVPIQVVQVLSV--VGTALLAPLWHHVAPVAGQLHSVAFLTLA
       : :.::: ... :.   .::: .  .:.:  :   ..: ::. .. :    ::.:::.:.
CCDS13 GPLLVTLLHHFRPSCLSEVPI-IFTLLGVGTVTCIIFAFLWNMTSWVLDGHHSIAFLVLT
            60        70         80        90       100       110  

      120       130       140       150       160       170        
pF1KE9 LVLAMACCTSNVTFLPFLSHLPPPFLRSFFLGQGLSALLPCVLALVQGVGRLEC------
       . ::.. :::.::::::.:.::  .: .::.:.:::.::: ..::.:: :   :      
CCDS13 FFLALVDCTSSVTFLPFMSRLPTYYLTTFFVGEGLSGLLPALVALAQGSGLTTCVNVTEI
            120       130       140       150       160       170  

                 180                             190       200     
pF1KE9 ----P-PAPTNGTS---G-P------------PLD------FPERFPASTFFWALTALLV
           : :.::  :.   : :            ::.      .: .:   .::  :. ...
CCDS13 SDSVPSPVPTRETDIAQGVPRALVSALPGMEAPLSHLESRYLPAHFSPLVFFLLLSIMMA
            180       190       200       210       220       230  

         210       220       230       240       250        260    
pF1KE9 TSAAAFRGLLLLLPSLPSVTTGGSGPELQLGSPGAEEEEKEEEEALPLQE-PPSQAAGTI
          .::    ..:   :    ..    :.        . .::..  :      ::. : .
CCDS13 CCLVAF----FVLQRQPRCWEASVEDLLNDQVTLHSIRPREENDLGPAGTVDSSQGQGYL
                240       250       260       270       280        

          270       280       290       300       310       320    
pF1KE9 PGPDPEVHQLFSAHGAFLLGLMAFTSAVTNGVLPSVQSFSCLPYGRLAYHLAVVLGSAAN
          . ..     :: ::.  :.::..:.:::.:::::..::: :: .:::::..:. .::
CCDS13 ---EEKAAPCCPAHLAFIYTLVAFVNALTNGMLPSVQTYSCLSYGPVAYHLAATLSIVAN
         290       300       310       320       330       340     

          330       340       350       360       370       380    
pF1KE9 PLACFLAMGVLCRSLAGLVGLSLLGMLFGAYLMALAILSPCPPLVGTTAGVVLVVLSWVL
       ::: ...: .  :::  :  ::.::  ::.: ::.:..:::: : :  .: ::.: ::::
CCDS13 PLASLVSMFLPNRSLLFLGVLSVLGTCFGGYNMAMAVMSPCPLLQGHWGGEVLIVASWVL
         350       360       370       380       390       400     

          390       400       410       420       430       440    
pF1KE9 CLCVFSYVKVAASSLLHGGGRPALLAAGVAIQVGSLLGAGAMFPPTSIYHVFQSRKDCVD
           .:::::  . .:.  .: :::  :.:.:.::::::  ::: ... ..:.:   :  
CCDS13 FSGCLSYVKVMLGVVLRDLSRSALLWCGAAVQLGSLLGALLMFPLVNVLRLFSSADFCNL
         410       420       430       440       450       460     

           
pF1KE9 PCGP
        :  
CCDS13 HCPA
           




448 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov  6 07:03:23 2016 done: Sun Nov  6 07:03:24 2016
 Total Scan time:  3.020 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com.