Result of FASTA (ccds) for pFN21AE6637
FASTA searches a protein or DNA sequence data bank, version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6637, 215 aa
  1>>>pF1KE6637 215 - 215 aa - 215 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.4261+/-0.000718; mu= 12.5587+/- 0.043
 mean_var=59.8364+/-11.994, 0's: 0 Z-trim(108.1): 14  B-trim: 0 in 0/52
 Lambda= 0.165803
 statistics sampled from 9987 (10001) to 9987 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.694), E-opt: 0.2 (0.307), width:  16
 Scan time:  1.540

The best scores are:                                      opt bits E(32554)
CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17        ( 215) 1537 375.7 1.2e-104
CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22        ( 196) 1002 247.7 3.7e-66
CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2          ( 197)  788 196.5 9.5e-51
CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22        ( 252)  659 165.7 2.3e-41
CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22        ( 205)  579 146.5 1.1e-35
CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22        ( 211)  578 146.3 1.3e-35
CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2           ( 174)  371 96.8   9e-21
CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3           ( 178)  370 96.5 1.1e-20
CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2           ( 174)  363 94.9 3.4e-20
CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7         ( 182)  353 92.5 1.9e-19
CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2           ( 175)  348 91.3 4.1e-19
CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2          ( 174)  322 85.0   3e-17
CCDS34506.1 AIM1 gene_id:202|Hs108|chr6            (1723)  302 80.6 6.6e-15
CCDS78289.1 CRYGN gene_id:155051|Hs108|chr7        ( 125)  251 68.0 2.9e-12
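
Note on reading the table above: the bits and E(32554) columns in this report are consistent with the relation E = m * n * D * 2^(-bits), where m is the query length (215), n is the hit length in parentheses, and D is the number of library sequences (32554). The Python sketch below parses one row and recomputes the expectation. The names `HIT_RE`, `parse_hit`, and `expected_evalue` are hypothetical, and the regular expression assumes the row layout seen in this report only; it is not part of FASTA.

```python
import math
import re

# Assumed row layout (taken from this report, not from a FASTA specification):
#   <accession> <gene> gene_id:<id>|<build>|<chrom>   ( <len>) <opt> <bits> <E>
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<gene>\S+)\s+gene_id:(?P<gene_id>\d+)\|\S+"
    r"\s+\(\s*(?P<length>\d+)\)\s+(?P<opt>\d+)\s+(?P<bits>[\d.]+)"
    r"\s+(?P<evalue>\S+)\s*$"
)


def parse_hit(line):
    """Parse one 'best scores' row into a dict, or return None if it does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "acc": m["acc"],
        "gene": m["gene"],
        "gene_id": int(m["gene_id"]),
        "length": int(m["length"]),
        "opt": int(m["opt"]),
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
    }


def expected_evalue(bits, query_len, subject_len, db_seqs):
    """Recompute E from the bit score: E = m * n * D * 2**(-bits)."""
    return query_len * subject_len * db_seqs * 2.0 ** (-bits)


if __name__ == "__main__":
    row = parse_hit(
        "CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22        ( 196) 1002 247.7 3.7e-66"
    )
    recomputed = expected_evalue(row["bits"], 215, row["length"], 32554)
    # Bit scores are printed to one decimal, so expect agreement within a few percent.
    print(row["gene"], row["evalue"], f"{recomputed:.2g}")
```

Because the bit scores are rounded to one decimal place, the recomputed value should be compared with the printed E value using a relative tolerance (for example `math.isclose(..., rel_tol=0.1)`) rather than exact equality.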


>>CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17             (215 aa)
 initn: 1537 init1: 1537 opt: 1537  Z-score: 1992.3  bits: 375.7 E(32554): 1.2e-104
Smith-Waterman score: 1537; 100.0% identity (100.0% similar) in 215 aa overlap (1-215:1-215)

               10        20        30        40        50        60
pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ
              130       140       150       160       170       180

              190       200       210     
pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
       :::::::::::::::::::::::::::::::::::
CCDS11 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
              190       200       210     

>>CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22             (196 aa)
 initn: 1002 init1: 1002 opt: 1002  Z-score: 1301.3  bits: 247.7 E(32554): 3.7e-66
Smith-Waterman score: 1002; 68.3% identity (89.9% similar) in 189 aa overlap (27-215:8-196)

               10        20        30        40        50        60
pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF
                                 : ::::....:...:::.: :::. ::.: : .:
CCDS13                    MTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGF
                                  10        20        30        40 

               70        80        90       100       110       120
pF1KE6 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN
       ..:::::: ::::.:.::..: :::.:::::::: ::::.:..::  ::: ::::   ::
CCDS13 ETVRSLKVLSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACAN
              50        60        70        80        90       100 

              130       140       150       160       170       180
pF1KE6 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ
       :..:..::::.:::.:.. :.:::::::::::: .:::::....:::::: :.:::::.:
CCDS13 HRDSRLTIFEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQ
             110       120       130       140       150       160 

              190       200       210     
pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
       :.::::::.:::::.::::::: : :.::::::::
CCDS13 YVLECDHHSGDYKHFREWGSHAPTFQVQSIRRIQQ
             170       180       190      

>>CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2               (197 aa)
 initn: 907 init1: 677 opt: 788  Z-score: 1024.6  bits: 196.5 E(32554): 9.5e-51
Smith-Waterman score: 788; 53.1% identity (85.7% similar) in 196 aa overlap (21-215:3-197)

               10        20        30        40        50          
pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERS-
                           . :.::  .: ..:..:.:.:::.: .. :.: :: ::. 
CCDS24                   MSSAPAPGP-APASLTLWDEEDFQGRRCRLLSDCANVCERGG
                                  10        20        30        40 

      60        70        80        90       100       110         
pF1KE6 FDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSA
       .  :::.:::.:.:...:. .: :::::::.:.::::.:::::.... ..:.::::.  :
CCDS24 LPRVRSVKVENGVWVAFEYPDFQGQQFILEKGDYPRWSAWSGSSSHNSNQLLSFRPVLCA
              50        60        70        80        90       100 

     120       130       140       150       160       170         
pF1KE6 NHKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGY
       ::..:..:.:: .:: : .... :::::: .::: ...:::.:..::::: :::::::::
CCDS24 NHNDSRVTLFEGDNFQGCKFDLVDDYPSLPSMGWASKDVGSLKVSSGAWVAYQYPGYRGY
             110       120       130       140       150       160 

     180       190       200       210     
pF1KE6 QYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
       ::.:: :.:.:..  . : :..:.:.:.:::::.:.
CCDS24 QYVLERDRHSGEFCTYGELGTQAHTGQLQSIRRVQH
             170       180       190       

>>CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22             (252 aa)
 initn: 573 init1: 309 opt: 659  Z-score: 856.1  bits: 165.7 E(32554): 2.3e-41
Smith-Waterman score: 676; 48.5% identity (77.0% similar) in 204 aa overlap (12-214:43-233)

                                  10        20        30        40 
pF1KE6                    METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENF
                                     :.: :.   ..  ::.   ....... :::
CCDS13 VAVNPGPDTKGKGAPPAGTSPSPGTTLAPTTVPITSAKAAELPPGN---YRLVVFELENF
             20        30        40        50           60         

              50        60        70        80        90       100 
pF1KE6 QGKRMEFTSSCPNVSERSFDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSG
       ::.: ::.. : :...:.:: :::. : .: :...:...: :..::::.::::::..::.
CCDS13 QGRRAEFSGECSNLADRGFDRVRSIIVSAGPWVAFEQSNFRGEMFILEKGEYPRWNTWSS
      70        80        90       100       110       120         

             110       120       130       140        150       160
pF1KE6 SNAYHIERLMSFRPICSANHKESKMTIFEKENFIGRQWEIS-DDYPSLQAMGWFNNEVGS
       :  :. .:::::::: . . .: :...::  :: :   ::. :: ::: ..: :...:::
CCDS13 S--YRSDRLMSFRPI-KMDAQEHKISLFEGANFKGNTIEIQGDDAPSLWVYG-FSDRVGS
     130         140        150       160       170        180     

              170       180       190       200       210          
pF1KE6 MKIQSGAWVCYQYPGYRGYQYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ     
       .:..::.:: :::::::::::.::     ::..:: :::  :   :.::.::..      
CCDS13 VKVSSGTWVGYQYPGYRGYQYLLE----PGDFRHWNEWG--AFQPQMQSLRRLRDKQWHL
         190       200           210       220         230         

CCDS13 EGSFPVLATEPPK
     240       250  

>>CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22             (205 aa)
 initn: 511 init1: 282 opt: 579  Z-score: 754.1  bits: 146.5 E(32554): 1.1e-35
Smith-Waterman score: 585; 46.1% identity (76.4% similar) in 191 aa overlap (25-214:12-191)

               10        20        30        40        50        60
pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF
                               : ::.: :: :..::::::.  :... :::..: . 
CCDS13              MASDHQTQAGKPQSLNP-KIIIFEQENFQGHSHELNGPCPNLKETGV
                            10         20        30        40      

               70        80        90       100       110       120
pF1KE6 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN
       ... :. :..: :.:::...  :.::..:.:::::::.:..:   . . : :.::: ...
CCDS13 EKAGSVLVQAGPWVGYEQANCKGEQFVFEKGEYPRWDSWTSS--RRTDSLSSLRPI-KVD
         50        60        70        80          90       100    

              130       140        150       160       170         
pF1KE6 HKESKMTIFEKENFIGRQWEI-SDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGY
        .: :. ..:. :: :.. :: .:: ::..: : ....:.:...:::.:: :::::::: 
CCDS13 SQEHKIILYENPNFTGKKMEIIDDDVPSFHAHG-YQEKVSSVRVQSGTWVGYQYPGYRGL
           110       120       130        140       150       160  

     180       190       200       210                  
pF1KE6 QYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ             
       ::.::     ::::   ..:  :   :.::.:::.              
CCDS13 QYLLE----KGDYKDSSDFG--APHPQVQSVRRIRDMQWHQRGAFHPSN
                170         180       190       200     

>>CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22             (211 aa)
 initn: 573 init1: 307 opt: 578  Z-score: 752.6  bits: 146.3 E(32554): 1.3e-35
Smith-Waterman score: 596; 43.5% identity (74.0% similar) in 200 aa overlap (17-214:9-198)

               10        20         30        40        50         
pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLG-PWKITIYDQENFQGKRMEFTSSCPNVSERS
                       ..: .. . :.::  .:. .:. ::::::: :... ::....  
CCDS13         MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSL
                       10        20        30        40        50  

      60        70        80        90       100       110         
pF1KE6 FDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSA
       ...: :..:::: :...:  .: :.::.::.:.::::::::  :.   . :.:.::. . 
CCDS13 LEKVGSIQVESGPWLAFESRAFRGEQFVLEKGDYPRWDAWS--NSRDSDSLLSLRPL-NI
             60        70        80        90         100          

     120       130       140        150       160       170        
pF1KE6 NHKESKMTIFEKENFIGRQWEI-SDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRG
       .  . :. .::.  : ::. :: .:: ::: : : :...:.:..  .:.:: :..:::::
CCDS13 DSPHHKLHLFENPAFSGRKMEIVDDDVPSLWAHG-FQDRVASVRAINGTWVGYEFPGYRG
     110       120       130       140        150       160        

      180       190       200       210                 
pF1KE6 YQYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ            
        ::..:     :.:.:: ::   :.  :.::.:::.             
CCDS13 RQYVFE----RGEYRHWNEWD--ASQPQLQSVRRIRDQKWHKRGRFPSS
      170           180         190       200       210 

>>CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2                (174 aa)
 initn: 298 init1: 143 opt: 371  Z-score: 486.4  bits: 96.8 E(32554): 9e-21
Smith-Waterman score: 398; 35.5% identity (66.1% similar) in 183 aa overlap (32-213:3-170)

              10        20        30        40        50        60 
pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD
                                     :::.:... :::. .: :..:::..   :.
CCDS23                             MGKITFYEDRAFQGRSYETTTDCPNLQPY-FS
                                           10        20         30 

              70        80        90       100       110       120 
pF1KE6 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSANH
          :..:::: :. ::. .. :::..:.::::: .. : : .        :.:  :   .
CCDS23 RCNSIRVESGCWMLYERPNYQGQQYLLRRGEYPDYQQWMGLSD-------SIRSCCLIPQ
              40        50        60        70               80    

              130       140       150       160       170       180
pF1KE6 KES-KMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ
         : .. ..:.:.  : . :.:.: ::.:   .  .:. :...  : :: :. :.::: :
CCDS23 TVSHRLRLYEREDHKGLMMELSEDCPSIQDR-FHLSEIRSLHVLEGCWVLYELPNYRGRQ
           90       100       110        120       130       140   

              190       200       210       
pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ  
       :.:. .    .:.. ..::  :. ..  :.::.    
CCDS23 YLLRPQ----EYRRCQDWG--AMDAKAGSLRRVVDLY
               150         160       170    

>>CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3                (178 aa)
 initn: 357 init1: 134 opt: 370  Z-score: 484.9  bits: 96.5 E(32554): 1.1e-20
Smith-Waterman score: 403; 37.2% identity (64.5% similar) in 183 aa overlap (32-213:7-176)

              10        20        30        40        50        60 
pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD
                                     :::.:...::::.:..   .: .     ..
CCDS32                         MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTY-LS
                                       10        20        30      

              70        80        90       100       110       120 
pF1KE6 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSANH
          :.:::.:.:  ::. .: : ..:: .::::... : : :    .:: : : .   . 
CCDS32 RCNSIKVEGGTWAVYERPNFAGYMYILPQGEYPEYQRWMGLN----DRLSSCRAVHLPSG
          40        50        60        70            80        90 

             130       140       150        160       170       180
pF1KE6 KESKMTIFEKENFIGRQWEISDDYPSLQAMGWFN-NEVGSMKIQSGAWVCYQYPGYRGYQ
        . :. :::: .: :...: ..: ::.  :  :.  :. : :.  :.:. :. :.::: :
CCDS32 GQYKIQIFEKGDFSGQMYETTEDCPSI--MEQFHMREIHSCKVLEGVWIFYELPNYRGRQ
             100       110         120       130       140         

              190       200       210     
pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
       :.:.      .:..  .::  : .  .::.:::  
CCDS32 YLLDKK----EYRKPIDWG--AASPAVQSFRRIVE
     150           160         170        

>>CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2                (174 aa)
 initn: 300 init1: 103 opt: 363  Z-score: 476.0  bits: 94.9 E(32554): 3.4e-20
Smith-Waterman score: 363; 33.3% identity (67.2% similar) in 183 aa overlap (32-213:3-170)

              10        20        30        40        50        60 
pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD
                                     :::.:....:::...: .:. ::..   ..
CCDS23                             MGKITLYEDRGFQGRHYECSSDHPNLQPY-LS
                                           10        20         30 

              70        80        90       100        110       120
pF1KE6 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSG-SNAYHIERLMSFRPICSAN
          : .:.:: :. ::. .. : :..:.::.:   . : : :.. .  ::.      :..
CCDS23 RCNSARVDSGCWMLYEQPNYSGLQYFLRRGDYADHQQWMGLSDSVRSCRLIPH----SGS
              40        50        60        70        80           

              130       140       150       160       170       180
pF1KE6 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ
       :.   . ..:.:.. :.. :...:   ::    :: :. :...  :.:: :.  .::: :
CCDS23 HR---IRLYEREDYRGQMIEFTEDCSCLQDRFRFN-EIHSLNVLEGSWVLYELSNYRGRQ
           90       100       110        120       130       140   

              190       200       210       
pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ  
       :.:      :::.....::  : .... :.::.    
CCDS23 YLL----MPGDYRRYQDWG--ATNARVGSLRRVIDFS
               150         160       170    

>>CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7              (182 aa)
 initn: 259 init1: 148 opt: 353  Z-score: 462.8  bits: 92.5 E(32554): 1.9e-19
Smith-Waterman score: 353; 38.6% identity (68.6% similar) in 153 aa overlap (32-180:7-152)

              10        20        30        40        50        60 
pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD
                                     :::.:. ..: :...:  ..: : ..:.: 
CCDS59                         MAQRSGKITLYEGKHFTGQKLEVFGDCDNFQDRGFM
                                       10        20        30      

               70        80        90       100       110       120
pF1KE6 N-VRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN
       : : :..::::::. ..: .: :::::::.:.:: .  :..    : ... : ::.  . 
CCDS59 NRVNSIHVESGAWVCFNHPDFRGQQFILEHGDYPDFFRWNS----HSDHMGSCRPV--GM
         40        50        60        70            80          90

               130       140       150       160         170       
pF1KE6 HKES-KMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKI--QSGAWVCYQYPGYR
       : :  .. :::  :: :.  :. .: : ::. :: .: :...:.  ...::   .. : .
CCDS59 HGEHFRLEIFEGCNFTGQCLEFLEDSPFLQSRGWVKNCVNTIKVYGDGAAWSPRSF-GAE
              100       110       120       130       140          

       180       190       200       210     
pF1KE6 GYQYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
        .:                                   
CCDS59 DFQLSSSLQSDQGPEEATTKPATTQPPFLTANL     
     150       160       170       180       




215 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov  8 14:59:45 2016 done: Tue Nov  8 14:59:46 2016
 Total Scan time:  1.540 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or suggestions?
Send a message to flexiclone AT kazusagt.com