FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6527, 174 aa
  1>>>pF1KE6527 174 - 174 aa - 174 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.0881+/-0.000711; mu= 13.4934+/- 0.043
 mean_var=57.1407+/-11.543, 0's: 0 Z-trim(108.1): 22 B-trim: 50 in 1/50
 Lambda= 0.169669
 statistics sampled from 9950 (9971) to 9950 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.697), E-opt: 0.2 (0.306), width: 16
 Scan time: 1.160

The best scores are:                                opt  bits E(32554)
CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2    ( 174) 1248 313.2 5.2e-86
CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2    ( 175)  951 240.5 4e-64
CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2   ( 174)  931 235.6 1.2e-62
CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2    ( 174)  920 232.9 7.7e-62
CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3    ( 178)  660 169.3 1.1e-42
CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 ( 215)  363  96.6 1e-20
CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 ( 211)  360  95.9 1.7e-20
CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 ( 252)  357  95.2 3.2e-20
CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 ( 205)  348  92.9 1.2e-19
CCDS34506.1 AIM1 gene_id:202|Hs108|chr6     (1723)  276  75.7 1.6e-13

>>CCDS2378.1 CRYGD gene_id:1421|Hs108|chr2 (174 aa)
 initn: 1248 init1: 1248 opt: 1248 Z-score: 1657.6 bits: 313.2 E(32554): 5.2e-86
Smith-Waterman score: 1248; 100.0% identity (100.0% similar) in 174 aa overlap (1-174:1-174)

               10        20        30        40        50        60
pF1KE6 MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQYFLRR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQYFLRR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 GDYADHQQWMGLSDSVRSCRLIPHSGSHRIRLYEREDYRGQMIEFTEDCSCLQDRFRFNE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS23 
GDYADHQQWMGLSDSVRSCRLIPHSGSHRIRLYEREDYRGQMIEFTEDCSCLQDRFRFNE 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 IHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS23 IHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS 130 140 150 160 170 >>CCDS2380.1 CRYGB gene_id:1419|Hs108|chr2 (175 aa) initn: 987 init1: 500 opt: 951 Z-score: 1264.7 bits: 240.5 E(32554): 4e-64 Smith-Waterman score: 951; 72.4% identity (91.4% similar) in 174 aa overlap (1-173:1-174) 10 20 30 40 50 60 pF1KE6 MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQYFLRR :::::.::::.:::: :::..: ::::::.::::: ::.:::::.::.:::.: :::::: CCDS23 MGKITFYEDRAFQGRSYECTTDCPNLQPYFSRCNSIRVESGCWMIYERPNYQGHQYFLRR 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 GDYADHQQWMGLSDSVRSCRLIP-HSGSHRIRLYEREDYRGQMIEFTEDCSCLQDRFRFN :.: :.:::::::::.::: ::: :::..:...:.:.. :::: :.:.:: .::::... CCDS23 GEYPDYQQWMGLSDSIRSCCLIPPHSGAYRMKIYDRDELRGQMSELTDDCISVQDRFHLT 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE6 EIHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS ::::::::::::.:::. ::::::::: ::.:::. :::: ::.:::::::.:. CCDS23 EIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGAPNAKVGSLRRVMDLY 130 140 150 160 170 >>CCDS33367.1 CRYGA gene_id:1418|Hs108|chr2 (174 aa) initn: 1079 init1: 931 opt: 931 Z-score: 1238.3 bits: 235.6 E(32554): 1.2e-62 Smith-Waterman score: 931; 72.3% identity (89.6% similar) in 173 aa overlap (1-173:1-173) 10 20 30 40 50 60 pF1KE6 MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQYFLRR :::::.:::: :::: :.: :: :::. :.::::: :::::::::::.:::.: :::::: CCDS33 MGKITFYEDRDFQGRCYNCISDCPNLRVYFSRCNSIRVDSGCWMLYERPNYQGHQYFLRR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 GDYADHQQWMGLSDSVRSCRLIPHSGSHRIRLYEREDYRGQMIEFTEDCSCLQDRFRFNE : : :.:.::::::::.:::.:::..::..:::::.:::: : :.:.::.:. . ::. 
: CCDS33 GKYPDYQHWMGLSDSVQSCRIIPHTSSHKLRLYERDDYRGLMSELTDDCACVPELFRLPE 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 IHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS :.::.:::: :::::. ::::::::: :::::::.:::...:.::::::: :. CCDS33 IYSLHVLEGCWVLYEMPNYRGRQYLLRPGDYRRYHDWGGADAKVGSLRRVTDLY 130 140 150 160 170 >>CCDS2379.1 CRYGC gene_id:1420|Hs108|chr2 (174 aa) initn: 1060 init1: 920 opt: 920 Z-score: 1223.7 bits: 232.9 E(32554): 7.7e-62 Smith-Waterman score: 920; 71.7% identity (90.2% similar) in 173 aa overlap (1-173:1-173) 10 20 30 40 50 60 pF1KE6 MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQYFLRR :::::.::::.:::: :: ..: ::::::.::::: ::.::::::::.:::.: ::.::: CCDS23 MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWMLYERPNYQGQQYLLRR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 GDYADHQQWMGLSDSVRSCRLIPHSGSHRIRLYEREDYRGQMIEFTEDCSCLQDRFRFNE :.: :.:::::::::.::: :::.. :::.:::::::..: :.:..::: .::::...: CCDS23 GEYPDYQQWMGLSDSIRSCCLIPQTVSHRLRLYEREDHKGLMMELSEDCPSIQDRFHLSE 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 IHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS :.::.:::: :::::: ::::::::: : .::: ::::: .:..::::::.:. CCDS23 IRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAMDAKAGSLRRVVDLY 130 140 150 160 170 >>CCDS3275.1 CRYGS gene_id:1427|Hs108|chr3 (178 aa) initn: 761 init1: 336 opt: 660 Z-score: 879.6 bits: 169.3 E(32554): 1.1e-42 Smith-Waterman score: 660; 50.0% identity (80.2% similar) in 172 aa overlap (3-172:7-178) 10 20 30 40 50 pF1KE6 MGKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQY :::.:::..::::.:.:. : ... ::::::: .:..: : .::.::..: .: CCDS32 MSKTGTKITFYEDKNFQGRRYDCDCDCADFHTYLSRCNSIKVEGGTWAVYERPNFAGYMY 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 FLRRGDYADHQQWMGLSDSVRSCRLI--PHSGSHRIRLYEREDYRGQMIEFTEDCSCLQD .: .:.: ..:.::::.: . ::: . : .:...:...:. :. ::: : :::: ... CCDS32 ILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQIFEKGDFSGQMYETTEDCPSIME 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE6 RFRFNEIHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS .:.. 
:::: .:::: :..::: ::::::::: .::. ::::.. : :.::... CCDS32 QFHMREIHSCKVLEGVWIFYELPNYRGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE 130 140 150 160 170 >>CCDS11249.1 CRYBA1 gene_id:1411|Hs108|chr17 (215 aa) initn: 300 init1: 103 opt: 363 Z-score: 485.4 bits: 96.6 E(32554): 1e-20 Smith-Waterman score: 363; 33.3% identity (67.2% similar) in 183 aa overlap (3-170:32-213) 10 20 30 pF1KE6 MGKITLYEDRGFQGRHYECSSDHPNLQPY-LS :::.:....:::...: .:. ::.. .. CCDS11 ETQAEQQELETLPTTKMAQTNPTPGSLGPWKITIYDQENFQGKRMEFTSSCPNVSERSFD 10 20 30 40 50 60 40 50 60 70 80 pF1KE6 RCNSARVDSGCWMLYEQPNYSGLQYFLRRGDYADHQQWMGLSDSVRSCRLIPH----SGS : .:.:: :. ::. .. : :..:.::.: . : : :.. . ::. :.. CCDS11 NVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSG-SNAYHIERLMSFRPICSAN 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE6 HR---IRLYEREDYRGQMIEFTEDCSCLQDRFRFN-EIHSLNVLEGSWVLYELSNYRGRQ :. . ..:.:.. :.. :...: :: :: :. :... :.:: :. .::: : CCDS11 HKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQSGAWVCYQYPGYRGYQ 130 140 150 160 170 180 150 160 170 pF1KE6 YLL----MPGDYRRYQDWG--ATNARVGSLRRVIDFS :.: :::.....:: : .... :.::. CCDS11 YILECDHHGGDYKHWREWGSHAQTSQIQSIRRIQQ 190 200 210 >>CCDS13830.1 CRYBB3 gene_id:1417|Hs108|chr22 (211 aa) initn: 280 init1: 149 opt: 360 Z-score: 481.6 bits: 95.9 E(32554): 1.7e-20 Smith-Waterman score: 360; 34.3% identity (66.9% similar) in 175 aa overlap (3-172:25-199) 10 20 30 pF1KE6 MGKITLYEDRGFQGRHYECSSDHPNL-QPYLSRCNSAR :. ::: ..:::.. : :.. :.: . : . .: . CCDS13 MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSLLEKVGSIQ 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE6 VDSGCWMLYEQPNYSGLQYFLRRGDYADHQQWMGL--SDSVRSCR-LIPHSGSHRIRLYE :.:: :. .:. . : :. :..::: . : . :::. : : : : :...:.: CCDS13 VESGPWLAFESRAFRGEQFVLEKGDYPRWDAWSNSRDSDSLLSLRPLNIDSPHHKLHLFE 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE6 REDYRGQMIEFTED-CSCLQDRFRFNEIHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRR . :. .:...: : . ... :. ...:.:: ::. .::::::.. :.::. 
CCDS13 NPAFSGRKMEIVDDDVPSLWAHGFQDRVASVRAINGTWVGYEFPGYRGRQYVFERGEYRH 130 140 150 160 170 180 160 170 pF1KE6 YQDWGATNARVGSLRRVIDFS ...: :.. .. :.::. : CCDS13 WNEWDASQPQLQSVRRIRDQKWHKRGRFPSS 190 200 210 >>CCDS13840.1 CRYBB1 gene_id:1414|Hs108|chr22 (252 aa) initn: 351 init1: 164 opt: 357 Z-score: 476.4 bits: 95.2 E(32554): 3.2e-20 Smith-Waterman score: 357; 35.4% identity (64.6% similar) in 178 aa overlap (3-172:60-234) 10 20 30 pF1KE6 MGKITLYEDRGFQGRHYECSSDHPNLQPY-LS .....: ..::::. : :.. :: .. CCDS13 GTSPSPGTTLAPTTVPITSAKAAELPPGNYRLVVFELENFQGRRAEFSGECSNLADRGFD 30 40 50 60 70 80 40 50 60 70 80 pF1KE6 RCNSARVDSGCWMLYEQPNYSGLQYFLRRGDYADHQQWMGLSDSVRSCRLI---P---HS : : :..: :. .:: :. : ...:..:.: .: :.: :: ::. : . CCDS13 RVRSIIVSAGPWVAFEQSNFRGEMFILEKGEYP---RWNTWSSSYRSDRLMSFRPIKMDA 90 100 110 120 130 140 90 100 110 120 130 140 pF1KE6 GSHRIRLYEREDYRGQMIEFT-EDCSCLQDRFRFNEIHSLNVLEGSWVLYELSNYRGRQY :.: :.: ...:. ::. .: : ... :..: :.:: :. .::: :: CCDS13 QEHKISLFEGANFKGNTIEIQGDDAPSLWVYGFSDRVGSVKVSSGTWVGYQYPGYRGYQY 150 160 170 180 190 200 150 160 170 pF1KE6 LLMPGDYRRYQDWGATNARVGSLRRVIDFS :: :::.:....::: . .. ::::. : CCDS13 LLEPGDFRHWNEWGAFQPQMQSLRRLRDKQWHLEGSFPVLATEPPK 210 220 230 240 250 >>CCDS13831.1 CRYBB2 gene_id:1415|Hs108|chr22 (205 aa) initn: 365 init1: 142 opt: 348 Z-score: 465.9 bits: 92.9 E(32554): 1.2e-19 Smith-Waterman score: 348; 34.9% identity (64.6% similar) in 175 aa overlap (3-172:18-192) 10 20 30 40 pF1KE6 MGKITLYEDRGFQGRHYECSSDHPNLQPY-LSRCNSARVDSGCWM :: ..:...:::. .: .. :::. . . .:. :..: :. CCDS13 MASDHQTQAGKPQSLNPKIIIFEQENFQGHSHELNGPCPNLKETGVEKAGSVLVQAGPWV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 LYEQPNYSGLQYFLRRGDYADHQQWMGL--SDSVRSCRLIP-HSGSHRIRLYEREDYRGQ ::: : .: :. ...:.: ..: . .::. : : : : :.: ::: .. :. CCDS13 GYEQANCKGEQFVFEKGEYPRWDSWTSSRRTDSLSSLRPIKVDSQEHKIILYENPNFTGK 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 MIEFTED-CSCLQDRFRFNEIHSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGAT .:. .: .. . ... :. : :.:: :. .::: :::: :::. 
.:.:: CCDS13 KMEIIDDDVPSFHAHGYQEKVSSVRVQSGTWVGYQYPGYRGLQYLLEKGDYKDSSDFGAP 130 140 150 160 170 180 170 pF1KE6 NARVGSLRRVIDFS . .: :.::. : CCDS13 HPQVQSVRRIRDMQWHQRGAFHPSN 190 200 >>CCDS34506.1 AIM1 gene_id:202|Hs108|chr6 (1723 aa) initn: 360 init1: 158 opt: 276 Z-score: 356.3 bits: 75.7 E(32554): 1.6e-13 Smith-Waterman score: 276; 33.5% identity (62.9% similar) in 170 aa overlap (3-168:1416-1581) 10 20 30 pF1KE6 MGKITLYEDRGFQGRHYECSSDHPNLQPYLSR .: :. . :::. ... .: CCDS34 SFEDWGGKNCKISSVQPICLDSFTGPRRRNQIHLFSEPQFQGHSQSFEETTSQIDDSFST 1390 1400 1410 1420 1430 1440 40 50 60 70 80 pF1KE6 CNSARVDSGCWMLYEQPNYSGLQYFLRRGDYADHQQWMGLSD--SVRSCRLIPHSGSH-R .: ::..: :..:. :..: :: :..: : . :: . .: :.: :. CCDS34 -KSCRVSGGSWVVYDGENFTGNQYVLEEGHYP-CLSAMGCPPGATFKSLRFIDVEFSEPT 1450 1460 1470 1480 1490 1500 90 100 110 120 130 140 pF1KE6 IRLYEREDYRGQMIEFTEDCSCLQDRFRFN-EIHSLNVLEGSWVLYELSNYRGRQYLLMP : :.::::..:. ::.. . :.. . :: .:.:..:. : :: :: ..:::::.:: : CCDS34 IILFEREDFKGKKIELNAETVNLRS-LGFNTQIRSVQVIGGIWVTYEYGSYRGRQFLLSP 1510 1520 1530 1540 1550 1560 150 160 170 pF1KE6 GDYRRYQDWGATNARVGSLRRVIDFS .. . .... ..:::: CCDS34 AEVPNWYEFSGCR-QIGSLRPFVQKRIYFRLRNKATGLFMSTNGNLEDLKLLRIQVMEDV 1570 1580 1590 1600 1610 1620 174 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:07:11 2016 done: Tue Nov 8 14:07:11 2016 Total Scan time: 1.160 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]