FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6528, 175 aa 1>>>pF1KE6528 175 - 175 aa - 175 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9996+/-0.000654; mu= 14.0270+/- 0.039 mean_var=60.3044+/-11.877, 0's: 0 Z-trim(109.8): 18 B-trim: 0 in 0/52 Lambda= 0.165158 statistics sampled from 11116 (11130) to 11116 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.727), E-opt: 0.2 (0.342), width: 16 Scan time: 1.850 The best scores are: opt bits E(32554) CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2 ( 175) 1264 309.0 9.8e-85 CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 ( 174) 1019 250.6 3.6e-67 CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2 ( 174) 974 239.9 6.2e-64 CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 ( 174) 951 234.4 2.8e-62 CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 ( 178) 733 182.4 1.2e-46 CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 ( 205) 395 101.9 2.4e-22 CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 ( 211) 384 99.3 1.5e-21 CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 ( 252) 369 95.8 2.1e-20 CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 ( 215) 347 90.5 6.9e-19 CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 ( 196) 322 84.5 4e-17 CCDS34506.1 AIM1 gene_id:202|Hs108|chr6 (1723) 322 85.1 2.4e-16 CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7 ( 182) 310 81.7 2.7e-16 CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2 ( 197) 310 81.7 2.9e-16 >>CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2 (175 aa) initn: 1264 init1: 1264 opt: 1264 Z-score: 1634.7 bits: 309.0 E(32554): 9.8e-85 Smith-Waterman score: 1264; 99.4% identity (100.0% similar) in 175 aa overlap (1-175:1-175) 10 20 30 40 50 60 pF1KE6 MGKITFYEDRAFQGRSYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQYFLRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS23 MGKITFYEDRAFQGRSYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQYFLRR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 GEYPDYQQWMGLSDSIRSCCLIPPHSGAYRMKIYDRDELRGQMSELTDDCLSVQDRFHLT ::::::::::::::::::::::::::::::::::::::::::::::::::.::::::::: CCDS23 GEYPDYQQWMGLSDSIRSCCLIPPHSGAYRMKIYDRDELRGQMSELTDDCISVQDRFHLT 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 EIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS23 EIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY 130 140 150 160 170 >>CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 (174 aa) initn: 592 init1: 592 opt: 1019 Z-score: 1319.3 bits: 250.6 E(32554): 3.6e-67 Smith-Waterman score: 1019; 78.9% identity (93.1% similar) in 175 aa overlap (1-175:1-174) 10 20 30 40 50 60 pF1KE6 MGKITFYEDRAFQGRSYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQYFLRR :::::::::::::::::: :::::::::::::::::::::::::.::::::::.::.::: CCDS23 MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQYLLRR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 GEYPDYQQWMGLSDSIRSCCLIPPHSGAYRMKIYDRDELRGQMSELTDDCLSVQDRFHLT ::::::::::::::::::::::: .. ..:...:.:.. .: : ::..:: :.::::::. CCDS23 GEYPDYQQWMGLSDSIRSCCLIP-QTVSHRLRLYEREDHKGLMMELSEDCPSIQDRFHLS 70 80 90 100 110 130 140 150 160 170 pF1KE6 EIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY ::.::.:::: :.:::.:::::::::::: :::: :::: .::.::::::.::: CCDS23 EIRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY 120 130 140 150 160 170 >>CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2 (174 aa) initn: 561 init1: 534 opt: 974 Z-score: 1261.3 bits: 239.9 E(32554): 6.2e-64 Smith-Waterman score: 974; 73.7% identity (90.3% similar) in 175 aa overlap (1-175:1-174) 10 20 30 40 50 60 pF1KE6 MGKITFYEDRAFQGRSYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQYFLRR :::::::::: :::: :.: .:::::. ::::::::::.:::::.::::::::::::::: CCDS33 MGKITFYEDRDFQGRCYNCISDCPNLRVYFSRCNSIRVDSGCWMLYERPNYQGHQYFLRR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 GEYPDYQQWMGLSDSIRSCCLIPPHSGAYRMKIYDRDELRGQMSELTDDCLSVQDRFHLT :.:::::.:::::::..:: .:: :........:.::. :: :::::::: : . :.: CCDS33 GKYPDYQHWMGLSDSVQSCRIIP-HTSSHKLRLYERDDYRGLMSELTDDCACVPELFRLP 70 80 90 100 110 130 140 150 160 170 pF1KE6 EIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY ::.::.:::: :.:::::::::::::::::.:::. :::. .::::::::: ::: CCDS33 EIYSLHVLEGCWVLYEMPNYRGRQYLLRPGDYRRYHDWGGADAKVGSLRRVTDLY 120 130 140 150 160 170 >>CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 (174 aa) initn: 987 init1: 500 opt: 951 Z-score: 1231.7 bits: 234.4 E(32554): 2.8e-62 Smith-Waterman score: 951; 72.4% identity (91.4% similar) in 174 aa overlap (1-174:1-173) 10 20 30 40 50 60 pF1KE6 MGKITFYEDRAFQGRSYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQYFLRR :::::.::::.:::: :::..: ::::::.::::: ::.:::::.::.:::.: :::::: CCDS23 MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQYFLRR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 GEYPDYQQWMGLSDSIRSCCLIPPHSGAYRMKIYDRDELRGQMSELTDDCLSVQDRFHLT :.: :.:::::::::.::: ::: :::..:...:.:.. :::: :.:.:: .::::... CCDS23 GDYADHQQWMGLSDSVRSCRLIP-HSGSHRIRLYEREDYRGQMIEFTEDCSCLQDRFRFN 70 80 90 100 110 130 140 150 160 170 pF1KE6 EIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY ::::::::::::.:::. ::::::::: ::.:::. :::: ::.:::::::.:. CCDS23 EIHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS 120 130 140 150 160 170 >>CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 (178 aa) initn: 692 init1: 388 opt: 733 Z-score: 950.9 bits: 182.4 E(32554): 1.2e-46 Smith-Waterman score: 733; 54.7% identity (80.2% similar) in 172 aa overlap (3-173:7-178) 10 20 30 40 50 pF1KE6 MGKITFYEDRAFQGRSYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQY :::::::. :::: :.: :: ... :.::::::.::.: : .:::::. :..: CCDS32 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 FLRRGEYPDYQQWMGLSDSIRSCCLIP-PHSGAYRMKIYDRDELRGQMSELTDDCLSVQD .: .::::.::.::::.: . :: . : .: :...:... .. ::: : :.:: :... CCDS32 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE6 RFHLTEIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY .::. :::: .:::: ::.::.:::::::::: :::. .:::: . : :.::... CCDS32 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE 130 140 150 160 170 >>CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 (205 aa) initn: 417 init1: 192 opt: 395 Z-score: 514.7 bits: 101.9 E(32554): 2.4e-22 Smith-Waterman score: 395; 34.7% identity (67.0% similar) in 176 aa overlap (3-174:18-193) 10 20 30 40 pF1KE6 MGKITFYEDRAFQGRSYECTTDCPNLQPY-FSRCNSIRVESGCWM :: ..:.. :::.:.: . ::::. . .:. :..: :. CCDS13 MASDHQTQAGKPQSLNPKIIIFEQENFQGHSHELNGPCPNLKETGVEKAGSVLVQAGPWV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 IYERPNYQGHQYFLRRGEYPDYQQWMGL--SDSIRSCCLIPPHSGAYRMKIYDRDELRGQ ::. : .:.:. ...:::: ...: . .::. : : : ... .:. .. :. CCDS13 GYEQANCKGEQFVFEKGEYPRWDSWTSSRRTDSLSSLRPIKVDSQEHKIILYENPNFTGK 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 MSELTDDCL-SVQDRFHLTEIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGAP :. :: . : . . . .. :. : :.:. :..:.::: ::::. :.:. :.::: CCDS13 KMEIIDDDVPSFHAHGYQEKVSSVRVQSGTWVGYQYPGYRGLQYLLEKGDYKDSSDFGAP 130 140 150 160 170 180 170 pF1KE6 NAKVGSLRRVMDLY . .: :.::. :. CCDS13 HPQVQSVRRIRDMQWHQRGAFHPSN 190 200 >>CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 (211 aa) initn: 359 init1: 184 opt: 384 Z-score: 500.3 bits: 99.3 E(32554): 1.5e-21 Smith-Waterman score: 384; 32.0% identity (66.9% similar) in 178 aa overlap (3-173:25-199) 10 20 30 pF1KE6 MGKITFYEDRAFQGRSYECTTDCPNL-QPYFSRCNSIR :. .:: . :::. : ...::.: . . . .::. CCDS13 MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSLLEKVGSIQ 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE6 VESGCWMIYERPNYQGHQYFLRRGEYPDYQQWMGLSDS-----IRSCCLIPPHSGAYRMK :::: :. .: ..:.:. :..:.:: .. : . :: .: . :: .... CCDS13 VESGPWLAFESRAFRGEQFVLEKGDYPRWDAWSNSRDSDSLLSLRPLNIDSPH---HKLH 70 80 90 100 110 100 110 120 130 140 150 pF1KE6 IYDRDELRGQMSELTDDCL-SVQDRFHLTEIHSLNVLEGSWILYEMPNYRGRQYLLRPGE ... . :. :..:: . :. . .. :. ...:.:. ::.:.::::::... :: CCDS13 LFENPAFSGRKMEIVDDDVPSLWAHGFQDRVASVRAINGTWVGYEFPGYRGRQYVFERGE 120 130 140 150 160 170 160 170 pF1KE6 YRRFLDWGAPNAKVGSLRRVMDLY ::.. .: : . .. :.::. : CCDS13 YRHWNEWDASQPQLQSVRRIRDQKWHKRGRFPSS 180 190 200 210 >>CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 (252 aa) initn: 356 init1: 188 opt: 369 Z-score: 479.9 bits: 95.8 E(32554): 2.1e-20 Smith-Waterman score: 369; 33.1% identity (65.7% similar) in 175 aa overlap (3-173:60-234) 10 20 30 pF1KE6 MGKITFYEDRAFQGRSYECTTDCPNLQPY-FS ... .: . :::: : . .: :: :. CCDS13 GTSPSPGTTLAPTTVPITSAKAAELPPGNYRLVVFELENFQGRRAEFSGECSNLADRGFD 30 40 50 60 70 80 40 50 60 70 80 pF1KE6 RCNSIRVESGCWMIYERPNYQGHQYFLRRGEYPDYQQWMGL--SDSIRSCCLIPPHSGAY : :: : .: :. .:. :..:....:..:::: .. : . :: . : : . . CCDS13 RVRSIIVSAGPWVAFEQSNFRGEMFILEKGEYPRWNTWSSSYRSDRLMSFRPIKMDAQEH 90 100 110 120 130 140 90 100 110 120 130 140 pF1KE6 RMKIYDRDELRGQMSELT-DDCLSVQDRFHLTEIHSLNVLEGSWILYEMPNYRGRQYLLR ...... ...:. :. :: :. .. :..: :.:. :..:.::: ::::. CCDS13 KISLFEGANFKGNTIEIQGDDAPSLWVYGFSDRVGSVKVSSGTWVGYQYPGYRGYQYLLE 150 160 170 180 190 200 150 160 170 pF1KE6 PGEYRRFLDWGAPNAKVGSLRRVMDLY ::..:.. .::: . .. ::::. : CCDS13 PGDFRHWNEWGAFQPQMQSLRRLRDKQWHLEGSFPVLATEPPK 210 220 230 240 250 >>CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 (215 aa) initn: 243 init1: 135 opt: 347 Z-score: 452.6 bits: 90.5 E(32554): 6.9e-19 Smith-Waterman score: 391; 33.7% identity (67.4% similar) in 184 aa overlap (3-171:32-213) 10 20 30 pF1KE6 MGKITFYEDRAFQGRSYECTTDCPNLQPY-FS :::.:... :::. .: :..:::.. :. CCDS11 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD 10 20 30 40 50 60 40 50 60 70 80 pF1KE6 RCNSIRVESGCWMIYERPNYQGHQYFLRRGEYPDYQQWMGLSD-------SIRSCCLIPP :..:::: :. ::. .. :.:..:.::::: .. : : . :.: : CCDS11 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPIC--SA 70 80 90 100 110 90 100 110 120 130 140 pF1KE6 HSGAYRMKIYDRDELRGQMSELTDDCLSVQDR-FHLTEIHSLNVLEGSWILYEMPNYRGR . .: :...... :.. :..:: :.: . .:. :... :.:. :..:.::: CCDS11 NHKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGY 120 130 140 150 160 170 150 160 170 pF1KE6 QYLLR----PGEYRRFLDWG--APNAKVGSLRRVMDLY ::.:. :.:... .:: : .... :.::. CCDS11 QYILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ 180 190 200 210 >>CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 (196 aa) initn: 339 init1: 149 opt: 322 Z-score: 421.0 bits: 84.5 E(32554): 4e-17 Smith-Waterman score: 382; 34.6% identity (68.1% similar) in 182 aa overlap (3-171:13-194) 10 20 30 40 pF1KE6 MGKITFYEDRAFQGRSYECTTDCPN-LQPYFSRCNSIRVESGCWMIYERP :.. ... .:::: .: :..::. :. : :..: :: :. .:. CCDS13 MTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLKVLSGAWVGFEHA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 NYQGHQYFLRRGEYPDYQQWMGLS--DSIRSCCLIPPHSGAYR---MKIYDRDELRGQMS ..::.::.:.:::::... : : . . : . : . .: . :...... :. . CCDS13 GFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLTIFEQENFLGKKG 70 80 90 100 110 120 110 120 130 140 150 pF1KE6 ELTDDCLSVQDR-FHLTEIHSLNVLEGSWILYEMPNYRGRQYLLR----PGEYRRFLDWG ::.:: :.: .. .:. :..: :.:. ..:.::: ::.:. :.:..: .:: CCDS13 ELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDHHSGDYKHFREWG 130 140 150 160 170 180 160 170 pF1KE6 --APNAKVGSLRRVMDLY ::. .: :.::. CCDS13 SHAPTFQVQSIRRIQQ 190 175 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:07:47 2016 done: Tue Nov 8 14:07:47 2016 Total Scan time: 1.850 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]