FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5273, 178 aa
1>>>pF1KE5273 178 - 178 aa - 178 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.8800+/-0.00076; mu= 14.7202+/- 0.046
mean_var=59.3324+/-12.079, 0's: 0 Z-trim(107.4): 22 B-trim: 501 in 1/49
Lambda= 0.166505
statistics sampled from 9512 (9526) to 9512 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.684), E-opt: 0.2 (0.293), width: 16
Scan time: 1.610
The best scores are:                                opt  bits E(32554)
CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3    ( 178) 1277 314.7 1.9e-86
CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2    ( 175)  734 184.3 3.5e-47
CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2    ( 174)  698 175.6 1.4e-44
CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2   ( 174)  682 171.8   2e-43
CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2    ( 174)  660 166.5 7.7e-42
CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 ( 211)  417 108.2 3.4e-24
CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 ( 252)  415 107.7 5.4e-24
CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 ( 205)  406 105.5 2.1e-23
CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 ( 215)  370  96.9 8.6e-21
CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2   ( 197)  348  91.6 3.1e-19
CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7  ( 182)  334  88.2   3e-18
CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 ( 196)  303  80.8 5.6e-16
CCDS78289.1 CRYGN gene_id:155051|Hs108|chr7 ( 125)  291  77.8 2.8e-15
CCDS34506.1 AIM1 gene_id:202|Hs108|chr6     (1723)  255  69.8 9.6e-12
>>CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 (178 aa)
initn: 1277 init1: 1277 opt: 1277 Z-score: 1665.5 bits: 314.7 E(32554): 1.9e-86
Smith-Waterman score: 1277; 100.0% identity (100.0% similar) in 178 aa overlap (1-178:1-178)
               10        20        30        40        50        60
pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE5 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME
               70        80        90       100       110       120
              130       140       150       160       170
pF1KE5 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE
              130       140       150       160       170
>>CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2 (175 aa)
initn: 693 init1: 388 opt: 734 Z-score: 960.7 bits: 184.3 E(32554): 3.5e-47
Smith-Waterman score: 734; 54.7% identity (80.2% similar) in 172 aa overlap (7-178:3-173)
10 20 30 40 50 60
pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY
:::::::. :::: :.: :: ... :.::::::.::.: : .:::::. :..:
CCDS23 MGKITFYEDRAFQGRSYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQY
10 20 30 40 50
70 80 90 100 110 120
pF1KE5 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME
.: .::::.::.::::.: . :: . : .: :...:... .. ::: : :.:: :...
CCDS23 FLRRGEYPDYQQWMGLSDSIRSCCLIP-PHSGAYRMKIYDRDELRGQMSELTDDCISVQD
60 70 80 90 100 110
130 140 150 160 170
pF1KE5 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE
.::. :::: .:::: ::.::.:::::::::: :::. .:::: . : :.::...
CCDS23 RFHLTEIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY
120 130 140 150 160 170
>>CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 (174 aa)
initn: 649 init1: 367 opt: 698 Z-score: 914.0 bits: 175.6 E(32554): 1.4e-44
Smith-Waterman score: 698; 53.5% identity (79.7% similar) in 172 aa overlap (7-178:3-172)
10 20 30 40 50 60
pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY
:::::::. :::: :. :: ... :.::::::.::.: : .:::::. : .:
CCDS23 MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQY
10 20 30 40 50
70 80 90 100 110 120
pF1KE5 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME
.: .::::.::.::::.: . :: . :. .......:. : .: :.: .::::::..
CCDS23 LLRRGEYPDYQQWMGLSDSIRSCCLI--PQTVSHRLRLYEREDHKGLMMELSEDCPSIQD
60 70 80 90 100 110
130 140 150 160 170
pF1KE5 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE
.::. ::.: .:::: :..::::::::::::: .:::. :::: . . :.::.:.
CCDS23 RFHLSEIRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY
120 130 140 150 160 170
>>CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2 (174 aa)
initn: 791 init1: 379 opt: 682 Z-score: 893.2 bits: 171.8 E(32554): 2e-43
Smith-Waterman score: 682; 51.2% identity (80.2% similar) in 172 aa overlap (7-178:3-172)
10 20 30 40 50 60
pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY
:::::::..:::: :.: :: ....:.::::::.:..: : .:::::. :..:
CCDS33 MGKITFYEDRDFQGRCYNCISDCPNLRVYFSRCNSIRVDSGCWMLYERPNYQGHQY
10 20 30 40 50
70 80 90 100 110 120
pF1KE5 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME
.: .:.::.::.::::.: ..::: . : ...:....:. :. : : : :.:: . :
CCDS33 FLRRGKYPDYQHWMGLSDSVQSCRII--PHTSSHKLRLYERDDYRGLMSELTDDCACVPE
60 70 80 90 100 110
130 140 150 160 170
pF1KE5 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE
:.. ::.: .:::: :..::.:::::::::: .::. :::.:. : :.::...
CCDS33 LFRLPEIYSLHVLEGCWVLYEMPNYRGRQYLLRPGDYRRYHDWGGADAKVGSLRRVTDLY
120 130 140 150 160 170
>>CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 (174 aa)
initn: 761 init1: 336 opt: 660 Z-score: 864.6 bits: 166.5 E(32554): 7.7e-42
Smith-Waterman score: 660; 50.0% identity (80.2% similar) in 172 aa overlap (7-178:3-172)
10 20 30 40 50 60
pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY
:::.:::..::::.:.:. : ... ::::::: .:..: : .::.::..: .:
CCDS23 MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQY
10 20 30 40 50
70 80 90 100 110 120
pF1KE5 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME
.: .:.: ..:.::::.: . ::: . : .:...:...:. :. ::: : :::: ...
CCDS23 FLRRGDYADHQQWMGLSDSVRSCRLI--PHSGSHRIRLYEREDYRGQMIEFTEDCSCLQD
60 70 80 90 100 110
130 140 150 160 170
pF1KE5 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE
.:.. :::: .:::: :..::: ::::::::: .::. ::::.. : :.::...
CCDS23 RFRFNEIHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS
120 130 140 150 160 170
>>CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 (211 aa)
initn: 354 init1: 203 opt: 417 Z-score: 547.9 bits: 108.2 E(32554): 3.4e-24
Smith-Waterman score: 417; 35.6% identity (70.7% similar) in 174 aa overlap (7-176:25-197)
10 20 30 40
pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADF-HTYLSRCNSIK
:. .:: .::::.: . . .: .. . : . .::.
CCDS13 MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSLLEKVGSIQ
10 20 30 40 50 60
50 60 70 80 90
pF1KE5 VEGGTWAVYERPNFAGYMYILPQGEYPEYQRWMGL--NDRLSSCRAVHLPSGGQYKIQIF
::.: : ..: : : ...: .:.::... : . .: : : : ... : ..:...:
CCDS13 VESGPWLAFESRAFRGEQFVLEKGDYPRWDAWSNSRDSDSLLSLRPLNIDSP-HHKLHLF
70 80 90 100 110
100 110 120 130 140 150
pF1KE5 EKGDFSGQMYETTED-CPSIMEQFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYR
:. :::. .: ..: ::. . . .. : ....:.:. ::.:.::::::.... :::
CCDS13 ENPAFSGRKMEIVDDDVPSLWAHGFQDRVASVRAINGTWVGYEFPGYRGRQYVFERGEYR
120 130 140 150 160 170
160 170
pF1KE5 KPIDWGAASPAVQSFRRIVE
. .: :..: .:: :::
CCDS13 HWNEWDASQPQLQSVRRIRDQKWHKRGRFPSS
180 190 200 210
>>CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 (252 aa)
initn: 474 init1: 185 opt: 415 Z-score: 544.2 bits: 107.7 E(32554): 5.4e-24
Smith-Waterman score: 415; 36.8% identity (67.8% similar) in 174 aa overlap (7-176:60-232)
10 20 30
pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTY-LS
... .: .:::::: . . .:... ..
CCDS13 GTSPSPGTTLAPTTVPITSAKAAELPPGNYRLVVFELENFQGRRAEFSGECSNLADRGFD
30 40 50 60 70 80
40 50 60 70 80 90
pF1KE5 RCNSIKVEGGTWAVYERPNFAGYMYILPQGEYPEYQRWMGL--NDRLSSCRAVHLPSGGQ
: :: : .: :...:. :: : :.:: .::::... : . .::: : : ... . .
CCDS13 RVRSIIVSAGPWVAFEQSNFRGEMFILEKGEYPRWNTWSSSYRSDRLMSFRPIKMDAQ-E
90 100 110 120 130 140
100 110 120 130 140 150
pF1KE5 YKIQIFEKGDFSGQMYETT-EDCPSIMEQFHMREIHSCKVLEGVWIFYELPNYRGRQYLL
.::..:: ..:.:. : .: ::. .. : :: :.:. :. :.::: ::::
CCDS13 HKISLFEGANFKGNTIEIQGDDAPSLWVYGFSDRVGSVKVSSGTWVGYQYPGYRGYQYLL
150 160 170 180 190 200
160 170
pF1KE5 DKKEYRKPIDWGAASPAVQSFRRIVE
. ..:. .::: .: .::.::.
CCDS13 EPGDFRHWNEWGAFQPQMQSLRRLRDKQWHLEGSFPVLATEPPK
210 220 230 240 250
>>CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 (205 aa)
initn: 318 init1: 199 opt: 406 Z-score: 533.8 bits: 105.5 E(32554): 2.1e-23
Smith-Waterman score: 406; 35.6% identity (69.0% similar) in 174 aa overlap (7-176:18-190)
10 20 30 40
pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFH-TYLSRCNSIKVEGGTWA
:: ..:..::::. .. . : ... : . . .:. :..: :.
CCDS13 MASDHQTQAGKPQSLNPKIIIFEQENFQGHSHELNGPCPNLKETGVEKAGSVLVQAGPWV
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE5 VYERPNFAGYMYILPQGEYPEYQRWMGL--NDRLSSCRAVHLPSGGQYKIQIFEKGDFSG
::. : : .... .::::... : . .: ::: : ... : ..:: ..:. .:.:
CCDS13 GYEQANCKGEQFVFEKGEYPRWDSWTSSRRTDSLSSLRPIKVDSQ-EHKIILYENPNFTG
70 80 90 100 110
110 120 130 140 150 160
pF1KE5 QMYETTED-CPSIMEQFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGA
. .: .: ::. . ..... : .: :.:. :. :.::: ::::.: .:. :.::
CCDS13 KKMEIIDDDVPSFHAHGYQEKVSSVRVQSGTWVGYQYPGYRGLQYLLEKGDYKDSSDFGA
120 130 140 150 160 170
170
pF1KE5 ASPAVQSFRRIVE
: ::: :::
CCDS13 PHPQVQSVRRIRDMQWHQRGAFHPSN
180 190 200
>>CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 (215 aa)
initn: 357 init1: 134 opt: 370 Z-score: 486.8 bits: 96.9 E(32554): 8.6e-21
Smith-Waterman score: 403; 37.2% identity (64.5% similar) in 183 aa overlap (7-176:32-213)
10 20 30
pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTY-LS
:::.:...::::.:.. .: . ..
CCDS11 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD
10 20 30 40 50 60
40 50 60 70 80 90
pF1KE5 RCNSIKVEGGTWAVYERPNFAGYMYILPQGEYPEYQRWMGLN----DRLSSCRAVHLPSG
:.:::.:.: ::. .: : ..:: .::::... : : : .:: : : . .
CCDS11 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSANH
70 80 90 100 110 120
100 110 120 130 140
pF1KE5 GQYKIQIFEKGDFSGQMYETTEDCPSI--MEQFHMREIHSCKVLEGVWIFYELPNYRGRQ
. :. :::: .: :...: ..: ::. : :. :. : :. :.:. :. :.::: :
CCDS11 KESKMTIFEKENFIGRQWEISDDYPSLQAMGWFN-NEVGSMKIQSGAWVCYQYPGYRGYQ
130 140 150 160 170 180
150 160 170
pF1KE5 YLLDKK----EYRKPIDWG--AASPAVQSFRRIVE
:.:. .:.. .:: : . .::.:::
CCDS11 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ
190 200 210
>>CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2 (197 aa)
initn: 326 init1: 163 opt: 348 Z-score: 458.8 bits: 91.6 E(32554): 3.1e-19
Smith-Waterman score: 354; 31.9% identity (63.7% similar) in 182 aa overlap (8-176:14-195)
10 20 30 40 50
pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTY--LSRCNSIKVEGGTWAVYER
.:......::::: :::. : : :.:::.:.:...:
CCDS24 MSSAPAPGPAPASLTLWDEEDFQGRRCRLLSDCANVCERGGLPRVRSVKVENGVWVAFEY
10 20 30 40 50 60
60 70 80 90 100
pF1KE5 PNFAGYMYILPQGEYPEYQRWMGLN----DRLSSCRAVHLPSGGQYKIQIFEKGDFSGQM
:.: : ..:: .:.::... : : . ..: : : : . .. .. .:: .:.:
CCDS24 PDFQGQQFILEKGDYPRWSAWSGSSSHNSNQLLSFRPVLCANHNDSRVTLFEGDNFQGCK
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE5 YETTEDCPSIMEQ-FHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPI-DWG--
.. ..: ::. . . ... : :: :.:. :. :.::: ::.:.. .. . .:
CCDS24 FDLVDDYPSLPSMGWASKDVGSLKVSSGAWVAYQYPGYRGYQYVLERDRHSGEFCTYGEL
130 140 150 160 170 180
170
pF1KE5 ---AASPAVQSFRRIVE
: . .::.::.
CCDS24 GTQAHTGQLQSIRRVQH
190
178 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 23:19:46 2016 done: Mon Nov 7 23:19:46 2016
Total Scan time: 1.610 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]