FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6637, 215 aa
1>>>pF1KE6637 215 - 215 aa - 215 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4261+/-0.000718; mu= 12.5587+/- 0.043
mean_var=59.8364+/-11.994, 0's: 0 Z-trim(108.1): 14 B-trim: 0 in 0/52
Lambda= 0.165803
statistics sampled from 9987 (10001) to 9987 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.694), E-opt: 0.2 (0.307), width: 16
Scan time: 1.540
The best scores are: opt bits E(32554)
CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 ( 215) 1537 375.7 1.2e-104
CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 ( 196) 1002 247.7 3.7e-66
CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2 ( 197) 788 196.5 9.5e-51
CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 ( 252) 659 165.7 2.3e-41
CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 ( 205) 579 146.5 1.1e-35
CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 ( 211) 578 146.3 1.3e-35
CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 ( 174) 371 96.8 9e-21
CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 ( 178) 370 96.5 1.1e-20
CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 ( 174) 363 94.9 3.4e-20
CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7 ( 182) 353 92.5 1.9e-19
CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2 ( 175) 348 91.3 4.1e-19
CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2 ( 174) 322 85.0 3e-17
CCDS34506.1 AIM1 gene_id:202|Hs108|chr6 (1723) 302 80.6 6.6e-15
CCDS78289.1 CRYGN gene_id:155051|Hs108|chr7 ( 125) 251 68.0 2.9e-12
>>CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 (215 aa)
initn: 1537 init1: 1537 opt: 1537 Z-score: 1992.3 bits: 375.7 E(32554): 1.2e-104
Smith-Waterman score: 1537; 100.0% identity (100.0% similar) in 215 aa overlap (1-215:1-215)
10 20 30 40 50 60
pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ
130 140 150 160 170 180
190 200 210
pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
:::::::::::::::::::::::::::::::::::
CCDS11 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
190 200 210
>>CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 (196 aa)
initn: 1002 init1: 1002 opt: 1002 Z-score: 1301.3 bits: 247.7 E(32554): 3.7e-66
Smith-Waterman score: 1002; 68.3% identity (89.9% similar) in 189 aa overlap (27-215:8-196)
10 20 30 40 50 60
pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF
: ::::....:...:::.: :::. ::.: : .:
CCDS13 MTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGF
10 20 30 40
70 80 90 100 110 120
pF1KE6 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN
..:::::: ::::.:.::..: :::.:::::::: ::::.:..:: ::: :::: ::
CCDS13 ETVRSLKVLSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACAN
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE6 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ
:..:..::::.:::.:.. :.:::::::::::: .:::::....:::::: :.:::::.:
CCDS13 HRDSRLTIFEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQ
110 120 130 140 150 160
190 200 210
pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
:.::::::.:::::.::::::: : :.::::::::
CCDS13 YVLECDHHSGDYKHFREWGSHAPTFQVQSIRRIQQ
170 180 190
>>CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2 (197 aa)
initn: 907 init1: 677 opt: 788 Z-score: 1024.6 bits: 196.5 E(32554): 9.5e-51
Smith-Waterman score: 788; 53.1% identity (85.7% similar) in 196 aa overlap (21-215:3-197)
10 20 30 40 50
pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERS-
. :.:: .: ..:..:.:.:::.: .. :.: :: ::.
CCDS24 MSSAPAPGP-APASLTLWDEEDFQGRRCRLLSDCANVCERGG
10 20 30 40
60 70 80 90 100 110
pF1KE6 FDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSA
. :::.:::.:.:...:. .: :::::::.:.::::.:::::.... ..:.::::. :
CCDS24 LPRVRSVKVENGVWVAFEYPDFQGQQFILEKGDYPRWSAWSGSSSHNSNQLLSFRPVLCA
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE6 NHKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGY
::..:..:.:: .:: : .... :::::: .::: ...:::.:..::::: :::::::::
CCDS24 NHNDSRVTLFEGDNFQGCKFDLVDDYPSLPSMGWASKDVGSLKVSSGAWVAYQYPGYRGY
110 120 130 140 150 160
180 190 200 210
pF1KE6 QYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
::.:: :.:.:.. . : :..:.:.:.:::::.:.
CCDS24 QYVLERDRHSGEFCTYGELGTQAHTGQLQSIRRVQH
170 180 190
>>CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 (252 aa)
initn: 573 init1: 309 opt: 659 Z-score: 856.1 bits: 165.7 E(32554): 2.3e-41
Smith-Waterman score: 676; 48.5% identity (77.0% similar) in 204 aa overlap (12-214:43-233)
10 20 30 40
pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENF
:.: :. .. ::. ....... :::
CCDS13 VAVNPGPDTKGKGAPPAGTSPSPGTTLAPTTVPITSAKAAELPPGN---YRLVVFELENF
20 30 40 50 60
50 60 70 80 90 100
pF1KE6 QGKRMEFTSSCPNVSERSFDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSG
::.: ::.. : :...:.:: :::. : .: :...:...: :..::::.::::::..::.
CCDS13 QGRRAEFSGECSNLADRGFDRVRSIIVSAGPWVAFEQSNFRGEMFILEKGEYPRWNTWSS
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE6 SNAYHIERLMSFRPICSANHKESKMTIFEKENFIGRQWEIS-DDYPSLQAMGWFNNEVGS
: :. .:::::::: . . .: :...:: :: : ::. :: ::: ..: :...:::
CCDS13 S--YRSDRLMSFRPI-KMDAQEHKISLFEGANFKGNTIEIQGDDAPSLWVYG-FSDRVGS
130 140 150 160 170 180
170 180 190 200 210
pF1KE6 MKIQSGAWVCYQYPGYRGYQYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
.:..::.:: :::::::::::.:: ::..:: ::: : :.::.::..
CCDS13 VKVSSGTWVGYQYPGYRGYQYLLE----PGDFRHWNEWG--AFQPQMQSLRRLRDKQWHL
190 200 210 220 230
CCDS13 EGSFPVLATEPPK
240 250
>>CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 (205 aa)
initn: 511 init1: 282 opt: 579 Z-score: 754.1 bits: 146.5 E(32554): 1.1e-35
Smith-Waterman score: 585; 46.1% identity (76.4% similar) in 191 aa overlap (25-214:12-191)
10 20 30 40 50 60
pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF
: ::.: :: :..::::::. :... :::..: .
CCDS13 MASDHQTQAGKPQSLNP-KIIIFEQENFQGHSHELNGPCPNLKETGV
10 20 30 40
70 80 90 100 110 120
pF1KE6 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN
... :. :..: :.:::... :.::..:.:::::::.:..: . . : :.::: ...
CCDS13 EKAGSVLVQAGPWVGYEQANCKGEQFVFEKGEYPRWDSWTSS--RRTDSLSSLRPI-KVD
50 60 70 80 90 100
130 140 150 160 170
pF1KE6 HKESKMTIFEKENFIGRQWEI-SDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGY
.: :. ..:. :: :.. :: .:: ::..: : ....:.:...:::.:: ::::::::
CCDS13 SQEHKIILYENPNFTGKKMEIIDDDVPSFHAHG-YQEKVSSVRVQSGTWVGYQYPGYRGL
110 120 130 140 150 160
180 190 200 210
pF1KE6 QYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
::.:: :::: ..: : :.::.:::.
CCDS13 QYLLE----KGDYKDSSDFG--APHPQVQSVRRIRDMQWHQRGAFHPSN
170 180 190 200
>>CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 (211 aa)
initn: 573 init1: 307 opt: 578 Z-score: 752.6 bits: 146.3 E(32554): 1.3e-35
Smith-Waterman score: 596; 43.5% identity (74.0% similar) in 200 aa overlap (17-214:9-198)
10 20 30 40 50
pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLG-PWKITIYDQENFQGKRMEFTSSCPNVSERS
..: .. . :.:: .:. .:. ::::::: :... ::....
CCDS13 MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSL
10 20 30 40 50
60 70 80 90 100 110
pF1KE6 FDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSA
...: :..:::: :...: .: :.::.::.:.:::::::: :. . :.:.::. .
CCDS13 LEKVGSIQVESGPWLAFESRAFRGEQFVLEKGDYPRWDAWS--NSRDSDSLLSLRPL-NI
60 70 80 90 100
120 130 140 150 160 170
pF1KE6 NHKESKMTIFEKENFIGRQWEI-SDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRG
. . :. .::. : ::. :: .:: ::: : : :...:.:.. .:.:: :..:::::
CCDS13 DSPHHKLHLFENPAFSGRKMEIVDDDVPSLWAHG-FQDRVASVRAINGTWVGYEFPGYRG
110 120 130 140 150 160
180 190 200 210
pF1KE6 YQYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
::..: :.:.:: :: :. :.::.:::.
CCDS13 RQYVFE----RGEYRHWNEWD--ASQPQLQSVRRIRDQKWHKRGRFPSS
170 180 190 200 210
>>CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 (174 aa)
initn: 298 init1: 143 opt: 371 Z-score: 486.4 bits: 96.8 E(32554): 9e-21
Smith-Waterman score: 398; 35.5% identity (66.1% similar) in 183 aa overlap (32-213:3-170)
10 20 30 40 50 60
pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD
:::.:... :::. .: :..:::.. :.
CCDS23 MGKITFYEDRAFQGRSYETTTDCPNLQPY-FS
10 20 30
70 80 90 100 110 120
pF1KE6 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSANH
:..:::: :. ::. .. :::..:.::::: .. : : . :.: : .
CCDS23 RCNSIRVESGCWMLYERPNYQGQQYLLRRGEYPDYQQWMGLSD-------SIRSCCLIPQ
40 50 60 70 80
130 140 150 160 170 180
pF1KE6 KES-KMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ
: .. ..:.:. : . :.:.: ::.: . .:. :... : :: :. :.::: :
CCDS23 TVSHRLRLYEREDHKGLMMELSEDCPSIQDR-FHLSEIRSLHVLEGCWVLYELPNYRGRQ
90 100 110 120 130 140
190 200 210
pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
:.:. . .:.. ..:: :. .. :.::.
CCDS23 YLLRPQ----EYRRCQDWG--AMDAKAGSLRRVVDLY
150 160 170
>>CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 (178 aa)
initn: 357 init1: 134 opt: 370 Z-score: 484.9 bits: 96.5 E(32554): 1.1e-20
Smith-Waterman score: 403; 37.2% identity (64.5% similar) in 183 aa overlap (32-213:7-176)
10 20 30 40 50 60
pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD
:::.:...::::.:.. .: . ..
CCDS32 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTY-LS
10 20 30
70 80 90 100 110 120
pF1KE6 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSANH
:.:::.:.: ::. .: : ..:: .::::... : : : .:: : : . .
CCDS32 RCNSIKVEGGTWAVYERPNFAGYMYILPQGEYPEYQRWMGLN----DRLSSCRAVHLPSG
40 50 60 70 80 90
130 140 150 160 170 180
pF1KE6 KESKMTIFEKENFIGRQWEISDDYPSLQAMGWFN-NEVGSMKIQSGAWVCYQYPGYRGYQ
. :. :::: .: :...: ..: ::. : :. :. : :. :.:. :. :.::: :
CCDS32 GQYKIQIFEKGDFSGQMYETTEDCPSI--MEQFHMREIHSCKVLEGVWIFYELPNYRGRQ
100 110 120 130 140
190 200 210
pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
:.:. .:.. .:: : . .::.:::
CCDS32 YLLDKK----EYRKPIDWG--AASPAVQSFRRIVE
150 160 170
>>CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 (174 aa)
initn: 300 init1: 103 opt: 363 Z-score: 476.0 bits: 94.9 E(32554): 3.4e-20
Smith-Waterman score: 363; 33.3% identity (67.2% similar) in 183 aa overlap (32-213:3-170)
10 20 30 40 50 60
pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD
:::.:....:::...: .:. ::.. ..
CCDS23 MGKITLYEDRGFQGRHYECSSDHPNLQPY-LS
10 20 30
70 80 90 100 110 120
pF1KE6 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSG-SNAYHIERLMSFRPICSAN
: .:.:: :. ::. .. : :..:.::.: . : : :.. . ::. :..
CCDS23 RCNSARVDSGCWMLYEQPNYSGLQYFLRRGDYADHQQWMGLSDSVRSCRLIPH----SGS
40 50 60 70 80
130 140 150 160 170 180
pF1KE6 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ
:. . ..:.:.. :.. :...: :: :: :. :... :.:: :. .::: :
CCDS23 HR---IRLYEREDYRGQMIEFTEDCSCLQDRFRFN-EIHSLNVLEGSWVLYELSNYRGRQ
90 100 110 120 130 140
190 200 210
pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
:.: :::.....:: : .... :.::.
CCDS23 YLL----MPGDYRRYQDWG--ATNARVGSLRRVIDFS
150 160 170
>>CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7 (182 aa)
initn: 259 init1: 148 opt: 353 Z-score: 462.8 bits: 92.5 E(32554): 1.9e-19
Smith-Waterman score: 353; 38.6% identity (68.6% similar) in 153 aa overlap (32-180:7-152)
10 20 30 40 50 60
pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD
:::.:. ..: :...: ..: : ..:.:
CCDS59 MAQRSGKITLYEGKHFTGQKLEVFGDCDNFQDRGFM
10 20 30
70 80 90 100 110 120
pF1KE6 N-VRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN
: : :..::::::. ..: .: :::::::.:.:: . :.. : ... : ::. .
CCDS59 NRVNSIHVESGAWVCFNHPDFRGQQFILEHGDYPDFFRWNS----HSDHMGSCRPV--GM
40 50 60 70 80 90
130 140 150 160 170
pF1KE6 HKES-KMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKI--QSGAWVCYQYPGYR
: : .. ::: :: :. :. .: : ::. :: .: :...:. ...:: .. : .
CCDS59 HGEHFRLEIFEGCNFTGQCLEFLEDSPFLQSRGWVKNCVNTIKVYGDGAAWSPRSF-GAE
100 110 120 130 140
180 190 200 210
pF1KE6 GYQYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
.:
CCDS59 DFQLSSSLQSDQGPEEATTKPATTQPPFLTANL
150 160 170 180
215 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 14:59:45 2016 done: Tue Nov 8 14:59:46 2016
Total Scan time: 1.540 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]