FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5273, 178 aa 1>>>pF1KE5273 178 - 178 aa - 178 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.8800+/-0.00076; mu= 14.7202+/- 0.046 mean_var=59.3324+/-12.079, 0's: 0 Z-trim(107.4): 22 B-trim: 501 in 1/49 Lambda= 0.166505 statistics sampled from 9512 (9526) to 9512 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.684), E-opt: 0.2 (0.293), width: 16 Scan time: 1.610 The best scores are: opt bits E(32554) CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 ( 178) 1277 314.7 1.9e-86 CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2 ( 175) 734 184.3 3.5e-47 CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 ( 174) 698 175.6 1.4e-44 CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2 ( 174) 682 171.8 2e-43 CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 ( 174) 660 166.5 7.7e-42 CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 ( 211) 417 108.2 3.4e-24 CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 ( 252) 415 107.7 5.4e-24 CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 ( 205) 406 105.5 2.1e-23 CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 ( 215) 370 96.9 8.6e-21 CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2 ( 197) 348 91.6 3.1e-19 CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7 ( 182) 334 88.2 3e-18 CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 ( 196) 303 80.8 5.6e-16 CCDS78289.1 CRYGN gene_id:155051|Hs108|chr7 ( 125) 291 77.8 2.8e-15 CCDS34506.1 AIM1 gene_id:202|Hs108|chr6 (1723) 255 69.8 9.6e-12 >>CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 (178 aa) initn: 1277 init1: 1277 opt: 1277 Z-score: 1665.5 bits: 314.7 E(32554): 1.9e-86 Smith-Waterman score: 1277; 100.0% identity (100.0% similar) in 178 aa overlap (1-178:1-178) 10 20 30 40 50 60 pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE 130 140 150 160 170 >>CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2 (175 aa) initn: 693 init1: 388 opt: 734 Z-score: 960.7 bits: 184.3 E(32554): 3.5e-47 Smith-Waterman score: 734; 54.7% identity (80.2% similar) in 172 aa overlap (7-178:3-173) 10 20 30 40 50 60 pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY :::::::. :::: :.: :: ... :.::::::.::.: : .:::::. :..: CCDS23 MGKITFYEDRAFQGRSYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQY 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME .: .::::.::.::::.: . :: . : .: :...:... .. ::: : :.:: :... CCDS23 FLRRGEYPDYQQWMGLSDSIRSCCLIP-PHSGAYRMKIYDRDELRGQMSELTDDCISVQD 60 70 80 90 100 110 130 140 150 160 170 pF1KE5 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE .::. :::: .:::: ::.::.:::::::::: :::. .:::: . : :.::... CCDS23 RFHLTEIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY 120 130 140 150 160 170 >>CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 (174 aa) initn: 649 init1: 367 opt: 698 Z-score: 914.0 bits: 175.6 E(32554): 1.4e-44 Smith-Waterman score: 698; 53.5% identity (79.7% similar) in 172 aa overlap (7-178:3-172) 10 20 30 40 50 60 pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY :::::::. :::: :. :: ... :.::::::.::.: : .:::::. : .: CCDS23 MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQY 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME .: .::::.::.::::.: . :: . :. .......:. : .: :.: .::::::.. CCDS23 LLRRGEYPDYQQWMGLSDSIRSCCLI--PQTVSHRLRLYEREDHKGLMMELSEDCPSIQD 60 70 80 90 100 110 130 140 150 160 170 pF1KE5 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE .::. ::.: .:::: :..::::::::::::: .:::. :::: . . :.::.:. CCDS23 RFHLSEIRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY 120 130 140 150 160 170 >>CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2 (174 aa) initn: 791 init1: 379 opt: 682 Z-score: 893.2 bits: 171.8 E(32554): 2e-43 Smith-Waterman score: 682; 51.2% identity (80.2% similar) in 172 aa overlap (7-178:3-172) 10 20 30 40 50 60 pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY :::::::..:::: :.: :: ....:.::::::.:..: : .:::::. :..: CCDS33 MGKITFYEDRDFQGRCYNCISDCPNLRVYFSRCNSIRVDSGCWMLYERPNYQGHQY 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME .: .:.::.::.::::.: ..::: . : ...:....:. :. : : : :.:: . : CCDS33 FLRRGKYPDYQHWMGLSDSVQSCRII--PHTSSHKLRLYERDDYRGLMSELTDDCACVPE 60 70 80 90 100 110 130 140 150 160 170 pF1KE5 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE :.. ::.: .:::: :..::.:::::::::: .::. :::.:. : :.::... CCDS33 LFRLPEIYSLHVLEGCWVLYEMPNYRGRQYLLRPGDYRRYHDWGGADAKVGSLRRVTDLY 120 130 140 150 160 170 >>CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 (174 aa) initn: 761 init1: 336 opt: 660 Z-score: 864.6 bits: 166.5 E(32554): 7.7e-42 Smith-Waterman score: 660; 50.0% identity (80.2% similar) in 172 aa overlap (7-178:3-172) 10 20 30 40 50 60 pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY :::.:::..::::.:.:. : ... ::::::: .:..: : .::.::..: .: CCDS23 MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQY 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME .: .:.: ..:.::::.: . ::: . : .:...:...:. :. ::: : :::: ... CCDS23 FLRRGDYADHQQWMGLSDSVRSCRLI--PHSGSHRIRLYEREDYRGQMIEFTEDCSCLQD 60 70 80 90 100 110 130 140 150 160 170 pF1KE5 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE .:.. :::: .:::: :..::: ::::::::: .::. ::::.. : :.::... CCDS23 RFRFNEIHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS 120 130 140 150 160 170 >>CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 (211 aa) initn: 354 init1: 203 opt: 417 Z-score: 547.9 bits: 108.2 E(32554): 3.4e-24 Smith-Waterman score: 417; 35.6% identity (70.7% similar) in 174 aa overlap (7-176:25-197) 10 20 30 40 pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADF-HTYLSRCNSIK :. .:: .::::.: . . .: .. . : . .::. CCDS13 MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSLLEKVGSIQ 10 20 30 40 50 60 50 60 70 80 90 pF1KE5 VEGGTWAVYERPNFAGYMYILPQGEYPEYQRWMGL--NDRLSSCRAVHLPSGGQYKIQIF ::.: : ..: : : ...: .:.::... : . .: : : : ... : ..:...: CCDS13 VESGPWLAFESRAFRGEQFVLEKGDYPRWDAWSNSRDSDSLLSLRPLNIDSP-HHKLHLF 70 80 90 100 110 100 110 120 130 140 150 pF1KE5 EKGDFSGQMYETTED-CPSIMEQFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYR :. :::. .: ..: ::. . . .. : ....:.:. ::.:.::::::.... ::: CCDS13 ENPAFSGRKMEIVDDDVPSLWAHGFQDRVASVRAINGTWVGYEFPGYRGRQYVFERGEYR 120 130 140 150 160 170 160 170 pF1KE5 KPIDWGAASPAVQSFRRIVE . .: :..: .:: ::: CCDS13 HWNEWDASQPQLQSVRRIRDQKWHKRGRFPSS 180 190 200 210 >>CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 (252 aa) initn: 474 init1: 185 opt: 415 Z-score: 544.2 bits: 107.7 E(32554): 5.4e-24 Smith-Waterman score: 415; 36.8% identity (67.8% similar) in 174 aa overlap (7-176:60-232) 10 20 30 pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTY-LS ... .: .:::::: . . .:... .. CCDS13 GTSPSPGTTLAPTTVPITSAKAAELPPGNYRLVVFELENFQGRRAEFSGECSNLADRGFD 30 40 50 60 70 80 40 50 60 70 80 90 pF1KE5 RCNSIKVEGGTWAVYERPNFAGYMYILPQGEYPEYQRWMGL--NDRLSSCRAVHLPSGGQ : :: : .: :...:. :: : :.:: .::::... : . .::: : : ... . . CCDS13 RVRSIIVSAGPWVAFEQSNFRGEMFILEKGEYPRWNTWSSSYRSDRLMSFRPIKMDAQ-E 90 100 110 120 130 140 100 110 120 130 140 150 pF1KE5 YKIQIFEKGDFSGQMYETT-EDCPSIMEQFHMREIHSCKVLEGVWIFYELPNYRGRQYLL .::..:: ..:.:. : .: ::. .. : :: :.:. :. :.::: :::: CCDS13 HKISLFEGANFKGNTIEIQGDDAPSLWVYGFSDRVGSVKVSSGTWVGYQYPGYRGYQYLL 150 160 170 180 190 200 160 170 pF1KE5 DKKEYRKPIDWGAASPAVQSFRRIVE . ..:. .::: .: .::.::. CCDS13 EPGDFRHWNEWGAFQPQMQSLRRLRDKQWHLEGSFPVLATEPPK 210 220 230 240 250 >>CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 (205 aa) initn: 318 init1: 199 opt: 406 Z-score: 533.8 bits: 105.5 E(32554): 2.1e-23 Smith-Waterman score: 406; 35.6% identity (69.0% similar) in 174 aa overlap (7-176:18-190) 10 20 30 40 pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFH-TYLSRCNSIKVEGGTWA :: ..:..::::. .. . : ... : . . .:. :..: :. CCDS13 MASDHQTQAGKPQSLNPKIIIFEQENFQGHSHELNGPCPNLKETGVEKAGSVLVQAGPWV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE5 VYERPNFAGYMYILPQGEYPEYQRWMGL--NDRLSSCRAVHLPSGGQYKIQIFEKGDFSG ::. : : .... .::::... : . .: ::: : ... : ..:: ..:. .:.: CCDS13 GYEQANCKGEQFVFEKGEYPRWDSWTSSRRTDSLSSLRPIKVDSQ-EHKIILYENPNFTG 70 80 90 100 110 110 120 130 140 150 160 pF1KE5 QMYETTED-CPSIMEQFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGA . .: .: ::. . ..... : .: :.:. :. :.::: ::::.: .:. :.:: CCDS13 KKMEIIDDDVPSFHAHGYQEKVSSVRVQSGTWVGYQYPGYRGLQYLLEKGDYKDSSDFGA 120 130 140 150 160 170 170 pF1KE5 ASPAVQSFRRIVE : ::: ::: CCDS13 PHPQVQSVRRIRDMQWHQRGAFHPSN 180 190 200 >>CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 (215 aa) initn: 357 init1: 134 opt: 370 Z-score: 486.8 bits: 96.9 E(32554): 8.6e-21 Smith-Waterman score: 403; 37.2% identity (64.5% similar) in 183 aa overlap (7-176:32-213) 10 20 30 pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTY-LS :::.:...::::.:.. .: . .. CCDS11 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE5 RCNSIKVEGGTWAVYERPNFAGYMYILPQGEYPEYQRWMGLN----DRLSSCRAVHLPSG :.:::.:.: ::. .: : ..:: .::::... : : : .:: : : . . CCDS11 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSANH 70 80 90 100 110 120 100 110 120 130 140 pF1KE5 GQYKIQIFEKGDFSGQMYETTEDCPSI--MEQFHMREIHSCKVLEGVWIFYELPNYRGRQ . :. :::: .: :...: ..: ::. : :. :. : :. :.:. :. :.::: : CCDS11 KESKMTIFEKENFIGRQWEISDDYPSLQAMGWFN-NEVGSMKIQSGAWVCYQYPGYRGYQ 130 140 150 160 170 180 150 160 170 pF1KE5 YLLDKK----EYRKPIDWG--AASPAVQSFRRIVE :.:. .:.. .:: : . .::.::: CCDS11 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ 190 200 210 >>CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2 (197 aa) initn: 326 init1: 163 opt: 348 Z-score: 458.8 bits: 91.6 E(32554): 3.1e-19 Smith-Waterman score: 354; 31.9% identity (63.7% similar) in 182 aa overlap (8-176:14-195) 10 20 30 40 50 pF1KE5 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTY--LSRCNSIKVEGGTWAVYER .:......::::: :::. : : :.:::.:.:...: CCDS24 MSSAPAPGPAPASLTLWDEEDFQGRRCRLLSDCANVCERGGLPRVRSVKVENGVWVAFEY 10 20 30 40 50 60 60 70 80 90 100 pF1KE5 PNFAGYMYILPQGEYPEYQRWMGLN----DRLSSCRAVHLPSGGQYKIQIFEKGDFSGQM :.: : ..:: .:.::... : : . ..: : : : . .. .. .:: .:.: CCDS24 PDFQGQQFILEKGDYPRWSAWSGSSSHNSNQLLSFRPVLCANHNDSRVTLFEGDNFQGCK 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE5 YETTEDCPSIMEQ-FHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPI-DWG-- .. ..: ::. . . ... : :: :.:. :. :.::: ::.:.. .. . .: CCDS24 FDLVDDYPSLPSMGWASKDVGSLKVSSGAWVAYQYPGYRGYQYVLERDRHSGEFCTYGEL 130 140 150 160 170 180 170 pF1KE5 ---AASPAVQSFRRIVE : . .::.::. CCDS24 GTQAHTGQLQSIRRVQH 190 178 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 23:19:46 2016 done: Mon Nov 7 23:19:46 2016 Total Scan time: 1.610 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]