FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6637, 215 aa 1>>>pF1KE6637 215 - 215 aa - 215 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4261+/-0.000718; mu= 12.5587+/- 0.043 mean_var=59.8364+/-11.994, 0's: 0 Z-trim(108.1): 14 B-trim: 0 in 0/52 Lambda= 0.165803 statistics sampled from 9987 (10001) to 9987 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.694), E-opt: 0.2 (0.307), width: 16 Scan time: 1.540 The best scores are: opt bits E(32554) CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 ( 215) 1537 375.7 1.2e-104 CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 ( 196) 1002 247.7 3.7e-66 CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2 ( 197) 788 196.5 9.5e-51 CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 ( 252) 659 165.7 2.3e-41 CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 ( 205) 579 146.5 1.1e-35 CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 ( 211) 578 146.3 1.3e-35 CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 ( 174) 371 96.8 9e-21 CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 ( 178) 370 96.5 1.1e-20 CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 ( 174) 363 94.9 3.4e-20 CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7 ( 182) 353 92.5 1.9e-19 CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2 ( 175) 348 91.3 4.1e-19 CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2 ( 174) 322 85.0 3e-17 CCDS34506.1 AIM1 gene_id:202|Hs108|chr6 (1723) 302 80.6 6.6e-15 CCDS78289.1 CRYGN gene_id:155051|Hs108|chr7 ( 125) 251 68.0 2.9e-12 >>CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 (215 aa) initn: 1537 init1: 1537 opt: 1537 Z-score: 1992.3 bits: 375.7 E(32554): 1.2e-104 Smith-Waterman score: 1537; 100.0% identity (100.0% similar) in 215 aa overlap (1-215:1-215) 10 20 30 40 50 60 pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ 130 140 150 160 170 180 190 200 210 pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ ::::::::::::::::::::::::::::::::::: CCDS11 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ 190 200 210 >>CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 (196 aa) initn: 1002 init1: 1002 opt: 1002 Z-score: 1301.3 bits: 247.7 E(32554): 3.7e-66 Smith-Waterman score: 1002; 68.3% identity (89.9% similar) in 189 aa overlap (27-215:8-196) 10 20 30 40 50 60 pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF : ::::....:...:::.: :::. ::.: : .: CCDS13 MTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGF 10 20 30 40 70 80 90 100 110 120 pF1KE6 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN ..:::::: ::::.:.::..: :::.:::::::: ::::.:..:: ::: :::: :: CCDS13 ETVRSLKVLSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACAN 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE6 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ :..:..::::.:::.:.. :.:::::::::::: .:::::....:::::: :.:::::.: CCDS13 HRDSRLTIFEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQ 110 120 130 140 150 160 190 200 210 pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ :.::::::.:::::.::::::: : :.:::::::: CCDS13 YVLECDHHSGDYKHFREWGSHAPTFQVQSIRRIQQ 170 180 190 >>CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2 (197 aa) initn: 907 init1: 677 opt: 788 Z-score: 1024.6 bits: 196.5 E(32554): 9.5e-51 Smith-Waterman score: 788; 53.1% identity (85.7% similar) in 196 aa overlap (21-215:3-197) 10 20 30 40 50 pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERS- . :.:: .: ..:..:.:.:::.: .. :.: :: ::. CCDS24 MSSAPAPGP-APASLTLWDEEDFQGRRCRLLSDCANVCERGG 10 20 30 40 60 70 80 90 100 110 pF1KE6 FDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSA . :::.:::.:.:...:. .: :::::::.:.::::.:::::.... ..:.::::. : CCDS24 LPRVRSVKVENGVWVAFEYPDFQGQQFILEKGDYPRWSAWSGSSSHNSNQLLSFRPVLCA 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE6 NHKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGY ::..:..:.:: .:: : .... :::::: .::: ...:::.:..::::: ::::::::: CCDS24 NHNDSRVTLFEGDNFQGCKFDLVDDYPSLPSMGWASKDVGSLKVSSGAWVAYQYPGYRGY 110 120 130 140 150 160 180 190 200 210 pF1KE6 QYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ ::.:: :.:.:.. . : :..:.:.:.:::::.:. CCDS24 QYVLERDRHSGEFCTYGELGTQAHTGQLQSIRRVQH 170 180 190 >>CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 (252 aa) initn: 573 init1: 309 opt: 659 Z-score: 856.1 bits: 165.7 E(32554): 2.3e-41 Smith-Waterman score: 676; 48.5% identity (77.0% similar) in 204 aa overlap (12-214:43-233) 10 20 30 40 pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENF :.: :. .. ::. ....... ::: CCDS13 VAVNPGPDTKGKGAPPAGTSPSPGTTLAPTTVPITSAKAAELPPGN---YRLVVFELENF 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 QGKRMEFTSSCPNVSERSFDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSG ::.: ::.. : :...:.:: :::. : .: :...:...: :..::::.::::::..::. CCDS13 QGRRAEFSGECSNLADRGFDRVRSIIVSAGPWVAFEQSNFRGEMFILEKGEYPRWNTWSS 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 SNAYHIERLMSFRPICSANHKESKMTIFEKENFIGRQWEIS-DDYPSLQAMGWFNNEVGS : :. .:::::::: . . .: :...:: :: : ::. :: ::: ..: :...::: CCDS13 S--YRSDRLMSFRPI-KMDAQEHKISLFEGANFKGNTIEIQGDDAPSLWVYG-FSDRVGS 130 140 150 160 170 180 170 180 190 200 210 pF1KE6 MKIQSGAWVCYQYPGYRGYQYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ .:..::.:: :::::::::::.:: ::..:: ::: : :.::.::.. CCDS13 VKVSSGTWVGYQYPGYRGYQYLLE----PGDFRHWNEWG--AFQPQMQSLRRLRDKQWHL 190 200 210 220 230 CCDS13 EGSFPVLATEPPK 240 250 >>CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 (205 aa) initn: 511 init1: 282 opt: 579 Z-score: 754.1 bits: 146.5 E(32554): 1.1e-35 Smith-Waterman score: 585; 46.1% identity (76.4% similar) in 191 aa overlap (25-214:12-191) 10 20 30 40 50 60 pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSF : ::.: :: :..::::::. :... :::..: . CCDS13 MASDHQTQAGKPQSLNP-KIIIFEQENFQGHSHELNGPCPNLKETGV 10 20 30 40 70 80 90 100 110 120 pF1KE6 DNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN ... :. :..: :.:::... :.::..:.:::::::.:..: . . : :.::: ... CCDS13 EKAGSVLVQAGPWVGYEQANCKGEQFVFEKGEYPRWDSWTSS--RRTDSLSSLRPI-KVD 50 60 70 80 90 100 130 140 150 160 170 pF1KE6 HKESKMTIFEKENFIGRQWEI-SDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGY .: :. ..:. :: :.. :: .:: ::..: : ....:.:...:::.:: :::::::: CCDS13 SQEHKIILYENPNFTGKKMEIIDDDVPSFHAHG-YQEKVSSVRVQSGTWVGYQYPGYRGL 110 120 130 140 150 160 180 190 200 210 pF1KE6 QYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ ::.:: :::: ..: : :.::.:::. CCDS13 QYLLE----KGDYKDSSDFG--APHPQVQSVRRIRDMQWHQRGAFHPSN 170 180 190 200 >>CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 (211 aa) initn: 573 init1: 307 opt: 578 Z-score: 752.6 bits: 146.3 E(32554): 1.3e-35 Smith-Waterman score: 596; 43.5% identity (74.0% similar) in 200 aa overlap (17-214:9-198) 10 20 30 40 50 pF1KE6 METQAEQQELETLPTTKMAQTNPTPGSLG-PWKITIYDQENFQGKRMEFTSSCPNVSERS ..: .. . :.:: .:. .:. ::::::: :... ::.... CCDS13 MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSL 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 FDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSA ...: :..:::: :...: .: :.::.::.:.:::::::: :. . :.:.::. . CCDS13 LEKVGSIQVESGPWLAFESRAFRGEQFVLEKGDYPRWDAWS--NSRDSDSLLSLRPL-NI 60 70 80 90 100 120 130 140 150 160 170 pF1KE6 NHKESKMTIFEKENFIGRQWEI-SDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRG . . :. .::. : ::. :: .:: ::: : : :...:.:.. .:.:: :..::::: CCDS13 DSPHHKLHLFENPAFSGRKMEIVDDDVPSLWAHG-FQDRVASVRAINGTWVGYEFPGYRG 110 120 130 140 150 160 180 190 200 210 pF1KE6 YQYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ ::..: :.:.:: :: :. :.::.:::. CCDS13 RQYVFE----RGEYRHWNEWD--ASQPQLQSVRRIRDQKWHKRGRFPSS 170 180 190 200 210 >>CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 (174 aa) initn: 298 init1: 143 opt: 371 Z-score: 486.4 bits: 96.8 E(32554): 9e-21 Smith-Waterman score: 398; 35.5% identity (66.1% similar) in 183 aa overlap (32-213:3-170) 10 20 30 40 50 60 pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD :::.:... :::. .: :..:::.. :. CCDS23 MGKITFYEDRAFQGRSYETTTDCPNLQPY-FS 10 20 30 70 80 90 100 110 120 pF1KE6 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSANH :..:::: :. ::. .. :::..:.::::: .. : : . :.: : . CCDS23 RCNSIRVESGCWMLYERPNYQGQQYLLRRGEYPDYQQWMGLSD-------SIRSCCLIPQ 40 50 60 70 80 130 140 150 160 170 180 pF1KE6 KES-KMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ : .. ..:.:. : . :.:.: ::.: . .:. :... : :: :. :.::: : CCDS23 TVSHRLRLYEREDHKGLMMELSEDCPSIQDR-FHLSEIRSLHVLEGCWVLYELPNYRGRQ 90 100 110 120 130 140 190 200 210 pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ :.:. . .:.. ..:: :. .. :.::. CCDS23 YLLRPQ----EYRRCQDWG--AMDAKAGSLRRVVDLY 150 160 170 >>CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 (178 aa) initn: 357 init1: 134 opt: 370 Z-score: 484.9 bits: 96.5 E(32554): 1.1e-20 Smith-Waterman score: 403; 37.2% identity (64.5% similar) in 183 aa overlap (32-213:7-176) 10 20 30 40 50 60 pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD :::.:...::::.:.. .: . .. CCDS32 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTY-LS 10 20 30 70 80 90 100 110 120 pF1KE6 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSANH :.:::.:.: ::. .: : ..:: .::::... : : : .:: : : . . CCDS32 RCNSIKVEGGTWAVYERPNFAGYMYILPQGEYPEYQRWMGLN----DRLSSCRAVHLPSG 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE6 KESKMTIFEKENFIGRQWEISDDYPSLQAMGWFN-NEVGSMKIQSGAWVCYQYPGYRGYQ . :. :::: .: :...: ..: ::. : :. :. : :. :.:. :. :.::: : CCDS32 GQYKIQIFEKGDFSGQMYETTEDCPSI--MEQFHMREIHSCKVLEGVWIFYELPNYRGRQ 100 110 120 130 140 190 200 210 pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ :.:. .:.. .:: : . .::.::: CCDS32 YLLDKK----EYRKPIDWG--AASPAVQSFRRIVE 150 160 170 >>CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 (174 aa) initn: 300 init1: 103 opt: 363 Z-score: 476.0 bits: 94.9 E(32554): 3.4e-20 Smith-Waterman score: 363; 33.3% identity (67.2% similar) in 183 aa overlap (32-213:3-170) 10 20 30 40 50 60 pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD :::.:....:::...: .:. ::.. .. CCDS23 MGKITLYEDRGFQGRHYECSSDHPNLQPY-LS 10 20 30 70 80 90 100 110 120 pF1KE6 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSG-SNAYHIERLMSFRPICSAN : .:.:: :. ::. .. : :..:.::.: . : : :.. . ::. :.. CCDS23 RCNSARVDSGCWMLYEQPNYSGLQYFLRRGDYADHQQWMGLSDSVRSCRLIPH----SGS 40 50 60 70 80 130 140 150 160 170 180 pF1KE6 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ :. . ..:.:.. :.. :...: :: :: :. :... :.:: :. .::: : CCDS23 HR---IRLYEREDYRGQMIEFTEDCSCLQDRFRFN-EIHSLNVLEGSWVLYELSNYRGRQ 90 100 110 120 130 140 190 200 210 pF1KE6 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ :.: :::.....:: : .... :.::. CCDS23 YLL----MPGDYRRYQDWG--ATNARVGSLRRVIDFS 150 160 170 >>CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7 (182 aa) initn: 259 init1: 148 opt: 353 Z-score: 462.8 bits: 92.5 E(32554): 1.9e-19 Smith-Waterman score: 353; 38.6% identity (68.6% similar) in 153 aa overlap (32-180:7-152) 10 20 30 40 50 60 pF1KE6 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD :::.:. ..: :...: ..: : ..:.: CCDS59 MAQRSGKITLYEGKHFTGQKLEVFGDCDNFQDRGFM 10 20 30 70 80 90 100 110 120 pF1KE6 N-VRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSAN : : :..::::::. ..: .: :::::::.:.:: . :.. : ... : ::. . CCDS59 NRVNSIHVESGAWVCFNHPDFRGQQFILEHGDYPDFFRWNS----HSDHMGSCRPV--GM 40 50 60 70 80 90 130 140 150 160 170 pF1KE6 HKES-KMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKI--QSGAWVCYQYPGYR : : .. ::: :: :. :. .: : ::. :: .: :...:. ...:: .. : . CCDS59 HGEHFRLEIFEGCNFTGQCLEFLEDSPFLQSRGWVKNCVNTIKVYGDGAAWSPRSF-GAE 100 110 120 130 140 180 190 200 210 pF1KE6 GYQYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ .: CCDS59 DFQLSSSLQSDQGPEEATTKPATTQPPFLTANL 150 160 170 180 215 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:59:45 2016 done: Tue Nov 8 14:59:46 2016 Total Scan time: 1.540 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]