FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2220, 174 aa 1>>>pF1KE2220 174 - 174 aa - 174 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4148+/-0.000689; mu= 11.7144+/- 0.041 mean_var=62.9792+/-12.601, 0's: 0 Z-trim(109.6): 17 B-trim: 54 in 1/50 Lambda= 0.161613 statistics sampled from 10985 (11000) to 10985 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.716), E-opt: 0.2 (0.338), width: 16 Scan time: 1.840 The best scores are: opt bits E(32554) CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 ( 174) 1251 299.7 5.9e-82 CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2 ( 175) 1020 245.9 9.6e-66 CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2 ( 174) 972 234.7 2.2e-62 CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 ( 174) 920 222.5 1e-58 CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 ( 178) 698 170.8 3.9e-43 CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 ( 211) 400 101.3 3.7e-22 CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 ( 205) 388 98.5 2.5e-21 CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 ( 252) 382 97.2 8e-21 CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 ( 215) 371 94.6 4.1e-20 CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 ( 196) 351 89.9 9.6e-19 CCDS2429.1 CRYBA2 gene_id:1412|Hs108|chr2 ( 197) 330 85.0 2.9e-17 CCDS5926.1 CRYGN gene_id:155051|Hs108|chr7 ( 182) 321 82.9 1.1e-16 CCDS34506.1 AIM1 gene_id:202|Hs108|chr6 (1723) 292 76.5 8.9e-14 CCDS78289.1 CRYGN gene_id:155051|Hs108|chr7 ( 125) 275 72.1 1.4e-13 >>CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 (174 aa) initn: 1251 init1: 1251 opt: 1251 Z-score: 1584.9 bits: 299.7 E(32554): 5.9e-82 Smith-Waterman score: 1251; 100.0% identity (100.0% similar) in 174 aa overlap (1-174:1-174) 10 20 30 40 50 60 pF1KE2 MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQYLLRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS23 MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQYLLRR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 GEYPDYQQWMGLSDSIRSCCLIPQTVSHRLRLYEREDHKGLMMELSEDCPSIQDRFHLSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS23 GEYPDYQQWMGLSDSIRSCCLIPQTVSHRLRLYEREDHKGLMMELSEDCPSIQDRFHLSE 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 IRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS23 IRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY 130 140 150 160 170 >>CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2 (175 aa) initn: 592 init1: 592 opt: 1020 Z-score: 1293.7 bits: 245.9 E(32554): 9.6e-66 Smith-Waterman score: 1020; 78.9% identity (93.1% similar) in 175 aa overlap (1-174:1-175) 10 20 30 40 50 60 pF1KE2 MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQYLLRR :::::::::::::::::: :::::::::::::::::::::::::.::::::::.::.::: CCDS23 MGKITFYEDRAFQGRSYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQYFLRR 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 GEYPDYQQWMGLSDSIRSCCLIP-QTVSHRLRLYEREDHKGLMMELSEDCPSIQDRFHLS ::::::::::::::::::::::: .. ..:...:.:.. .: : ::..:: :.::::::. CCDS23 GEYPDYQQWMGLSDSIRSCCLIPPHSGAYRMKIYDRDELRGQMSELTDDCISVQDRFHLT 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE2 EIRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY ::.::.:::: :.:::.:::::::::::: :::: :::: .::.::::::.::: CCDS23 EIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY 130 140 150 160 170 >>CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2 (174 aa) initn: 972 init1: 972 opt: 972 Z-score: 1233.3 bits: 234.7 E(32554): 2.2e-62 Smith-Waterman score: 972; 74.7% identity (90.2% similar) in 174 aa overlap (1-174:1-174) 10 20 30 40 50 60 pF1KE2 MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQYLLRR :::::::::: :::: :. .:::::. ::::::::::.::::::::::::::.::.::: CCDS33 MGKITFYEDRDFQGRCYNCISDCPNLRVYFSRCNSIRVDSGCWMLYERPNYQGHQYFLRR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 GEYPDYQQWMGLSDSIRSCCLIPQTVSHRLRLYEREDHKGLMMELSEDCPSIQDRFHLSE :.:::::.:::::::..:: .::.: ::.::::::.:..::: ::..:: . . :.: : CCDS33 GKYPDYQHWMGLSDSVQSCRIIPHTSSHKLRLYERDDYRGLMSELTDDCACVPELFRLPE 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 IRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY : :::::::::::::.:::::::::::: .::: .:::. :::.::::::.::: CCDS33 IYSLHVLEGCWVLYEMPNYRGRQYLLRPGDYRRYHDWGGADAKVGSLRRVTDLY 130 140 150 160 170 >>CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 (174 aa) initn: 1060 init1: 920 opt: 920 Z-score: 1167.8 bits: 222.5 E(32554): 1e-58 Smith-Waterman score: 920; 71.7% identity (90.2% similar) in 173 aa overlap (1-173:1-173) 10 20 30 40 50 60 pF1KE2 MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQYLLRR :::::.::::.:::: :: ..: ::::::.::::: ::.::::::::.:::.: ::.::: CCDS23 MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQYFLRR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 GEYPDYQQWMGLSDSIRSCCLIPQTVSHRLRLYEREDHKGLMMELSEDCPSIQDRFHLSE :.: :.:::::::::.::: :::.. :::.:::::::..: :.:..::: .::::...: CCDS23 GDYADHQQWMGLSDSVRSCRLIPHSGSHRIRLYEREDYRGQMIEFTEDCSCLQDRFRFNE 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 IRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY :.::.:::: :::::: ::::::::: : .::: ::::: .:..::::::.:. CCDS23 IHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS 130 140 150 160 170 >>CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 (178 aa) initn: 649 init1: 367 opt: 698 Z-score: 887.9 bits: 170.8 E(32554): 3.9e-43 Smith-Waterman score: 698; 53.5% identity (79.7% similar) in 172 aa overlap (3-172:7-178) 10 20 30 40 50 pF1KE2 MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQY :::::::. :::: :. :: ... :.::::::.::.: : .:::::. : .: CCDS32 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE2 LLRRGEYPDYQQWMGLSDSIRSCCLI--PQTVSHRLRLYEREDHKGLMMELSEDCPSIQD .: .::::.::.::::.: . :: . :. .......:. : .: :.: .::::::.. CCDS32 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE2 RFHLSEIRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY .::. ::.: .:::: :..::::::::::::: .:::. :::: . . :.::.:. CCDS32 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE 130 140 150 160 170 >>CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 (211 aa) initn: 347 init1: 168 opt: 400 Z-score: 511.2 bits: 101.3 E(32554): 3.7e-22 Smith-Waterman score: 400; 34.9% identity (68.0% similar) in 175 aa overlap (3-172:25-199) 10 20 30 pF1KE2 MGKITFYEDRAFQGRSYETTTDCPNL-QPYFSRCNSIR :. .:: . :::. : ...::.: . . . .::. CCDS13 MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSLLEKVGSIQ 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE2 VESGCWMLYERPNYQGQQYLLRRGEYPDYQQWMGLSDSIRSCCLIPQTVS---HRLRLYE :::: :. .: ..:.:..:..:.:: .. : . :: : : ... :.:.:.: CCDS13 VESGPWLAFESRAFRGEQFVLEKGDYPRWDAWSNSRDSDSLLSLRPLNIDSPHHKLHLFE 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE2 REDHKGLMMEL-SEDCPSIQDRFHLSEIRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRR .: ::. ..: ::. . ... :.....: :: ::.:.::::::... :::. CCDS13 NPAFSGRKMEIVDDDVPSLWAHGFQDRVASVRAINGTWVGYEFPGYRGRQYVFERGEYRH 130 140 150 160 170 180 160 170 pF1KE2 CQDWGAMDAKAGSLRRVVDLY ..: : . . :.::. : CCDS13 WNEWDASQPQLQSVRRIRDQKWHKRGRFPSS 190 200 210 >>CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 (205 aa) initn: 399 init1: 164 opt: 388 Z-score: 496.3 bits: 98.5 E(32554): 2.5e-21 Smith-Waterman score: 388; 36.0% identity (66.9% similar) in 178 aa overlap (3-173:18-193) 10 20 30 40 pF1KE2 MGKITFYEDRAFQGRSYETTTDCPNLQPY-FSRCNSIRVESGCWM :: ..:.. :::.:.: . ::::. . .:. :..: :. CCDS13 MASDHQTQAGKPQSLNPKIIIFEQENFQGHSHELNGPCPNLKETGVEKAGSVLVQAGPWV 10 20 30 40 50 60 50 60 70 80 90 pF1KE2 LYERPNYQGQQYLLRRGEYPDYQQWMGL--SDSIRSCCLIPQTVS---HRLRLYEREDHK ::. : .:.:.....:::: ...: . .::. : : : :. :.. ::: . CCDS13 GYEQANCKGEQFVFEKGEYPRWDSWTSSRRTDSLSS--LRPIKVDSQEHKIILYENPNFT 70 80 90 100 110 100 110 120 130 140 150 pF1KE2 GLMMEL-SEDCPSIQDRFHLSEIRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWG : ::. ..: ::.. . . .. :..: : :: :. :.::: ::::. .:. .:.: CCDS13 GKKMEIIDDDVPSFHAHGYQEKVSSVRVQSGTWVGYQYPGYRGLQYLLEKGDYKDSSDFG 120 130 140 150 160 170 160 170 pF1KE2 AMDAKAGSLRRVVDLY : .. :.::. :. CCDS13 APHPQVQSVRRIRDMQWHQRGAFHPSN 180 190 200 >>CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 (252 aa) initn: 350 init1: 174 opt: 382 Z-score: 487.3 bits: 97.2 E(32554): 8e-21 Smith-Waterman score: 382; 34.9% identity (64.6% similar) in 175 aa overlap (3-172:60-234) 10 20 30 pF1KE2 MGKITFYEDRAFQGRSYETTTDCPNLQPY-FS ... .: . :::: : . .: :: :. CCDS13 GTSPSPGTTLAPTTVPITSAKAAELPPGNYRLVVFELENFQGRRAEFSGECSNLADRGFD 30 40 50 60 70 80 40 50 60 70 80 pF1KE2 RCNSIRVESGCWMLYERPNYQGQQYLLRRGEYPDYQQWMGLSDSIRSCCLIPQTVS---H : :: : .: :. .:. :..:....:..:::: .. : . : : . : .. : CCDS13 RVRSIIVSAGPWVAFEQSNFRGEMFILEKGEYPRWNTWSSSYRSDRLMSFRPIKMDAQEH 90 100 110 120 130 140 90 100 110 120 130 140 pF1KE2 RLRLYEREDHKGLMMELS-EDCPSIQDRFHLSEIRSLHVLEGCWVLYELPNYRGRQYLLR .. :.: . :: .:.. .: ::. ... :..: : :: :. :.::: ::::. CCDS13 KISLFEGANFKGNTIEIQGDDAPSLWVYGFSDRVGSVKVSSGTWVGYQYPGYRGYQYLLE 150 160 170 180 190 200 150 160 170 pF1KE2 PQEYRRCQDWGAMDAKAGSLRRVVDLY : ..:. ..:::.. . ::::. : CCDS13 PGDFRHWNEWGAFQPQMQSLRRLRDKQWHLEGSFPVLATEPPK 210 220 230 240 250 >>CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 (215 aa) initn: 288 init1: 143 opt: 371 Z-score: 474.5 bits: 94.6 E(32554): 4.1e-20 Smith-Waterman score: 398; 35.5% identity (66.1% similar) in 183 aa overlap (3-170:32-213) 10 20 30 pF1KE2 MGKITFYEDRAFQGRSYETTTDCPNLQPY-FS :::.:... :::. .: :..:::.. :. CCDS11 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD 10 20 30 40 50 60 40 50 60 70 80 pF1KE2 RCNSIRVESGCWMLYERPNYQGQQYLLRRGEYPDYQQWMGLSD-------SIRSCCLIPQ :..:::: :. ::. .. :::..:.::::: .. : : . :.: : . CCDS11 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFRPICSANH 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE2 TVSHRLRLYEREDHKGLMMELSEDCPSIQDR-FHLSEIRSLHVLEGCWVLYELPNYRGRQ : .. ..:.:. : . :.:.: ::.: . .:. :... : :: :. :.::: : CCDS11 KES-KMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ 130 140 150 160 170 180 150 160 170 pF1KE2 YLLRPQ----EYRRCQDWG--AMDAKAGSLRRVVDLY :.:. . .:.. ..:: :. .. :.::. CCDS11 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ 190 200 210 >>CCDS13841.1 CRYBA4 gene_id:1413|Hs108|chr22 (196 aa) initn: 329 init1: 141 opt: 351 Z-score: 450.0 bits: 89.9 E(32554): 9.6e-19 Smith-Waterman score: 376; 35.7% identity (65.4% similar) in 182 aa overlap (3-170:13-194) 10 20 30 40 pF1KE2 MGKITFYEDRAFQGRSYETTTDCPN-LQPYFSRCNSIRVESGCWMLYERP :.. ... .:::: .: :..::. :. : :..: :: :. .:. CCDS13 MTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGFETVRSLKVLSGAWVGFEHA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE2 NYQGQQYLLRRGEYPDYQQWMGLS--DSIRSCCLIPQTVSH----RLRLYEREDHKGLMM ..:::::.:.:::::... : : . . : . : . .. :: ..:.:. : CCDS13 GFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANHRDSRLTIFEQENFLGKKG 70 80 90 100 110 120 110 120 130 140 150 pF1KE2 ELSEDCPSIQDR-FHLSEIRSLHVLEGCWVLYELPNYRGRQYLLR----PQEYRRCQDWG :::.: ::.: .. .:. :.:: : :: ..:.::: ::.:. .:.. ..:: CCDS13 ELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGFQYVLECDHHSGDYKHFREWG 130 140 150 160 170 180 160 170 pF1KE2 --AMDAKAGSLRRVVDLY : .. :.::. CCDS13 SHAPTFQVQSIRRIQQ 190 174 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 18:05:05 2016 done: Sun Nov 6 18:05:05 2016 Total Scan time: 1.840 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]