FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0438, 328 aa
1>>>pF1KE0438 328 - 328 aa - 328 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1389+/-0.000815; mu= 16.5991+/- 0.049
mean_var=62.0671+/-12.497, 0's: 0 Z-trim(106.8): 20 B-trim: 85 in 1/50
Lambda= 0.162796
statistics sampled from 9210 (9221) to 9210 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.662), E-opt: 0.2 (0.283), width: 16
Scan time: 2.360
The best scores are:                              opt  bits E(32554)
CCDS6609.1 GRHPR gene_id:9380|Hs108|chr9  ( 328) 2155 514.6 4.3e-146
CCDS904.1 PHGDH gene_id:26227|Hs108|chr1  ( 533)  467 118.3 1.4e-26
CCDS43203.1 CTBP1 gene_id:1487|Hs108|chr4 ( 429)  318  83.2 4e-16
CCDS3348.1 CTBP1 gene_id:1487|Hs108|chr4  ( 440)  318  83.2 4.1e-16
CCDS7643.1 CTBP2 gene_id:1488|Hs108|chr10 ( 445)  314  82.3 8e-16
CCDS7644.1 CTBP2 gene_id:1488|Hs108|chr10 ( 985)  314  82.5 1.6e-15
>>CCDS6609.1 GRHPR gene_id:9380|Hs108|chr9 (328 aa)
initn: 2155 init1: 2155 opt: 2155 Z-score: 2736.4 bits: 514.6 E(32554): 4.3e-146
Smith-Waterman score: 2155; 100.0% identity (100.0% similar) in 328 aa overlap (1-328:1-328)
               10        20        30        40        50        60
pF1KE0 MRPVRLMKVFVTRRIPAEGRVALARAADCEVEQWDSDEPIPAKELERGVAGAHGLLCLLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 MRPVRLMKVFVTRRIPAEGRVALARAADCEVEQWDSDEPIPAKELERGVAGAHGLLCLLS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE0 DHVDKRILDAAGANLKVISTMSVGIDHLALDEIKKRGIRVGYTPDVLTDTTAELAVSLLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 DHVDKRILDAAGANLKVISTMSVGIDHLALDEIKKRGIRVGYTPDVLTDTTAELAVSLLL
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE0 TTCRRLPEAIEEVKNGGWTSWKPLWLCGYGLTQSTVGIIGLGRIGQAIARRLKPFGVQRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 TTCRRLPEAIEEVKNGGWTSWKPLWLCGYGLTQSTVGIIGLGRIGQAIARRLKPFGVQRF
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE0 LYTGRQPRPEEAAEFQAEFVSTPELAAQSDFIVVACSLTPATEGLCNKDFFQKMKETAVF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 LYTGRQPRPEEAAEFQAEFVSTPELAAQSDFIVVACSLTPATEGLCNKDFFQKMKETAVF
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE0 INISRGDVVNQDDLYQALASGKIAAAGLDVTSPEPLPTNHPLLTLKNCVILPHIGSATHR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 INISRGDVVNQDDLYQALASGKIAAAGLDVTSPEPLPTNHPLLTLKNCVILPHIGSATHR
              250       260       270       280       290       300
              310       320
pF1KE0 TRNTMSLLAANNLLAGLRGEPMPSELKL
       ::::::::::::::::::::::::::::
CCDS66 TRNTMSLLAANNLLAGLRGEPMPSELKL
              310       320
>>CCDS904.1 PHGDH gene_id:26227|Hs108|chr1 (533 aa)
initn: 333 init1: 191 opt: 467 Z-score: 590.6 bits: 118.3 E(32554): 1.4e-26
Smith-Waterman score: 470; 29.2% identity (60.2% similar) in 319 aa overlap (6-322:6-312)
10 20 30 40 50 60
pF1KE0 MRPVRLMKVFVTRRIPAEGRVALARAADCEVEQWDSDEPIPAKELERGVAGAHGLLCLLS
: ::... . : : .. ::. . . .:: . .::. .
CCDS90 MAFANLRKVLISDSLDPCCRKILQDGGLQVVEKQN----LSKEELIAELQDCEGLIVRSA
10 20 30 40 50
70 80 90 100 110 120
pF1KE0 DHVDKRILDAAGANLKVISTMSVGIDHLALDEIKKRGIRVGYTPDVLTDTTAELAVSLLL
.: ...:: .:.:.. ..:.:.. :. ..:: : ::. . ..:::. ....
CCDS90 TKVTADVINAA-EKLQVVGRAGTGVDNVDLEAATRKGILVMNTPNGNSLSAAELTCGMIM
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE0 TTCRRLPEAIEEVKNGGWTSWKPLWLCGYGLTQSTVGIIGLGRIGQAIARRLKPFGVQRF
:..:.: .:.: : : . : :. .:.::.::::::. .: :.. ::.. .
CCDS90 CLARQIPQATASMKDGKWERKKFM---GTELNGKTLGILGLGRIGREVATRMQSFGMKTI
120 130 140 150 160 170
190 200 210 220 230
pF1KE0 LYTGRQP--RPEEAAEFQAEFVSTPELAAQSDFIVVACSLTPATEGLCNKDFFQKMKETA
: .: :: .: : .. . :. :::.: : :.: :: : . : . :. .
CCDS90 ---GYDPIISPEVSASFGVQQLPLEEIWPLCDFITVHTPLLPSTTGLLNDNTFAQCKKGV
180 190 200 210 220
240 250 260 270 280 290
pF1KE0 VFINISRGDVVNQDDLYQALASGKIAAAGLDVTSPEPLPTNHPLLTLKNCVILPHIGSAT
.: .:: .:.. : .:: ::. :.:.::: . :: : .. :. .: . ::.:..:
CCDS90 RVVNCARGGIVDEGALLRALQSGQCAGAALDVFTEEP-PRDRALVDHENVISCPHLGAST
230 240 250 260 270 280
300 310 320
pF1KE0 HRTRNTMSLLAANNLLAGLRGEPMPSELKL
..... . : ... ..:. .
CCDS90 KEAQSRCGEEIAVQFVDMVKGKSLTGVVNAQALTSAFSPHTKPWIGLAEALGTLMRAWAG
290 300 310 320 330 340
>>CCDS43203.1 CTBP1 gene_id:1487|Hs108|chr4 (429 aa)
initn: 346 init1: 246 opt: 318 Z-score: 402.9 bits: 83.2 E(32554): 4e-16
Smith-Waterman score: 318; 27.7% identity (55.6% similar) in 311 aa overlap (23-327:37-337)
10 20 30 40 50
pF1KE0 MRPVRLMKVFVTRRIPAEGRVALARAADCEVEQWDSDEPIPAKELERGVAGA
.: .: :... : . : : :...:
CCDS43 PIMNGPLHPRPLVALLDGRDCTVEMPILKDVATVAFCDAQ---STQEIHEKVLNEAV---
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 HGLLCLLSDHVDKRILDAAGANLKVISTMSVGIDHLALDEIKKRGIRVGYTPDVLTDTTA
: : . . .. :. : :..: .. :.:.. . :: : .: . .. ::
CCDS43 -GALMYHTITLTREDLEKFKA-LRIIVRIGSGFDNIDIKSAGDLGIAVCNVPAASVEETA
70 80 90 100 110
120 130 140 150 160
pF1KE0 ELAVSLLLTTCRRLPEAIEEVKNGGWT-SWKPLWLCGYGLTQ---STVGIIGLGRIGQAI
. .. .:. :: . ...: . : . . . : .. :.:::::::.:::.
CCDS43 DSTLCHILNLYRRATWLHQALREGTRVQSVEQIREVASGAARIRGETLGIIGLGRVGQAV
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE0 ARRLKPFGVQRFLYTGRQPRPEEAAEFQAEFVSTPELAAQSDFIVVACSLTPATEGLCNK
: : : :: . ..: : : . . .: .:: ... :.:. .. : :
CCDS43 ALRAKAFGFNVLFYDPYLSDGVERALGLQRVSTLQDLLFHSDCVTLHCGLNEHNHHLIN-
180 190 200 210 220 230
230 240 250 260 270 280
pF1KE0 DF-FQKMKETAVFINISRGDVVNQDDLYQALASGKIAAAGLDVTSPEPLPTNH-PLLTLK
:: ..:.. : ..: .:: .:.. : ::: :.: .:.::: ::. .. ::
CCDS43 DFTVKQMRQGAFLVNTARGGLVDEKALAQALKEGRIRGAALDVHESEPFSFSQGPLKDAP
240 250 260 270 280 290
290 300 310 320
pF1KE0 NCVILPHIGSATHRTRNTMSLLAANNLLAGLRGEPMPSELKL
: . :: . .... : :: .. .. :. .:. ::
CCDS43 NLICTPHAAWYSEQASIEMREEAAREIRRAITGR-IPDSLKNCVNKDHLTAATHWASMDP
300 310 320 330 340 350
CCDS43 AVVHPELNGAAYRYPPGVVGVAPTGIPAAVEGIVPSAMSLSHGLPPVAHPPHAPSPGQTV
360 370 380 390 400 410
>>CCDS3348.1 CTBP1 gene_id:1487|Hs108|chr4 (440 aa)
initn: 346 init1: 246 opt: 318 Z-score: 402.7 bits: 83.2 E(32554): 4.1e-16
Smith-Waterman score: 318; 27.7% identity (55.6% similar) in 311 aa overlap (23-327:48-348)
10 20 30 40 50
pF1KE0 MRPVRLMKVFVTRRIPAEGRVALARAADCEVEQWDSDEPIPAKELERGVAGA
.: .: :... : . : : :...:
CCDS33 PIMNGPLHPRPLVALLDGRDCTVEMPILKDVATVAFCDAQ---STQEIHEKVLNEAV---
20 30 40 50 60 70
60 70 80 90 100 110
pF1KE0 HGLLCLLSDHVDKRILDAAGANLKVISTMSVGIDHLALDEIKKRGIRVGYTPDVLTDTTA
: : . . .. :. : :..: .. :.:.. . :: : .: . .. ::
CCDS33 -GALMYHTITLTREDLEKFKA-LRIIVRIGSGFDNIDIKSAGDLGIAVCNVPAASVEETA
80 90 100 110 120
120 130 140 150 160
pF1KE0 ELAVSLLLTTCRRLPEAIEEVKNGGWT-SWKPLWLCGYGLTQ---STVGIIGLGRIGQAI
. .. .:. :: . ...: . : . . . : .. :.:::::::.:::.
CCDS33 DSTLCHILNLYRRATWLHQALREGTRVQSVEQIREVASGAARIRGETLGIIGLGRVGQAV
130 140 150 160 170 180
170 180 190 200 210 220
pF1KE0 ARRLKPFGVQRFLYTGRQPRPEEAAEFQAEFVSTPELAAQSDFIVVACSLTPATEGLCNK
: : : :: . ..: : : . . .: .:: ... :.:. .. : :
CCDS33 ALRAKAFGFNVLFYDPYLSDGVERALGLQRVSTLQDLLFHSDCVTLHCGLNEHNHHLIN-
190 200 210 220 230 240
230 240 250 260 270 280
pF1KE0 DF-FQKMKETAVFINISRGDVVNQDDLYQALASGKIAAAGLDVTSPEPLPTNH-PLLTLK
:: ..:.. : ..: .:: .:.. : ::: :.: .:.::: ::. .. ::
CCDS33 DFTVKQMRQGAFLVNTARGGLVDEKALAQALKEGRIRGAALDVHESEPFSFSQGPLKDAP
250 260 270 280 290 300
290 300 310 320
pF1KE0 NCVILPHIGSATHRTRNTMSLLAANNLLAGLRGEPMPSELKL
: . :: . .... : :: .. .. :. .:. ::
CCDS33 NLICTPHAAWYSEQASIEMREEAAREIRRAITGR-IPDSLKNCVNKDHLTAATHWASMDP
310 320 330 340 350 360
CCDS33 AVVHPELNGAAYRYPPGVVGVAPTGIPAAVEGIVPSAMSLSHGLPPVAHPPHAPSPGQTV
370 380 390 400 410 420
>>CCDS7643.1 CTBP2 gene_id:1488|Hs108|chr10 (445 aa)
initn: 284 init1: 235 opt: 314 Z-score: 397.6 bits: 82.3 E(32554): 8e-16
Smith-Waterman score: 314; 27.9% identity (55.4% similar) in 312 aa overlap (23-327:54-354)
10 20 30 40 50
pF1KE0 MRPVRLMKVFVTRRIPAEGRVALARAADCEVEQWDSDEPIPAKELERGVAGA
:: .: :... : . : : :...: ::
CCDS76 QIMNGPLHPRPLVALLDGRDCTVEMPILKDLATVAFCDAQ---STQEIHEKVLNEAV-GA
30 40 50 60 70
60 70 80 90 100 110
pF1KE0 HGLLCLLSDHVDKRILDAAGANLKVISTMSVGIDHLALDEIKKRGIRVGYTPDVLTDTTA
. . : . . : :.:: .. : :.. . . :: : :.. .. ::
CCDS76 MMYHTITLTREDLEKFKA----LRVIVRIGSGYDNVDIKAAGELGIAVCNIPSAAVEETA
80 90 100 110 120 130
120 130 140 150 160
pF1KE0 ELAVSLLLTTCRRLPEAIEEVKNGGWT-SWKPLWLCGYGLTQ---STVGIIGLGRIGQAI
. .. .:. :: . ...: . : . . . : .. :.:.::.:: :::.
CCDS76 DSTICHILNLYRRNTWLYQALREGTRVQSVEQIREVASGAARIRGETLGLIGFGRTGQAV
140 150 160 170 180 190
170 180 190 200 210 220
pF1KE0 ARRLKPFGVQRFLYTGR-QPRPEEAAEFQAEFVSTPELAAQSDFIVVACSLTPATEGLCN
: : : :: . ..: : :.. : . . .: ::: . . :.:. .. : :
CCDS76 AVRAKAFGFSVIFYDPYLQDGIERSLGVQRVY-TLQDLLYQSDCVSLHCNLNEHNHHLIN
200 210 220 230 240 250
230 240 250 260 270 280
pF1KE0 KDF-FQKMKETAVFINISRGDVVNQDDLYQALASGKIAAAGLDVTSPEPLPTNH-PLLTL
:: ...:.. : ..: .:: .:.. : ::: :.: .:.::: ::. . ::
CCDS76 -DFTIKQMRQGAFLVNAARGGLVDEKALAQALKEGRIRGAALDVHESEPFSFAQGPLKDA
260 270 280 290 300 310
290 300 310 320
pF1KE0 KNCVILPHIGSATHRTRNTMSLLAANNLLAGLRGEPMPSELKL
: . :: . .... : ::... .. :. .: :.
CCDS76 PNLICTPHTAWYSEQASLEMREAAATEIRRAITGR-IPESLRNCVNKEFFVTSAPWSVID
320 330 340 350 360 370
CCDS76 QQAIHPELNGATYRYPPGIVGVAPGGLPAAMEGIIPGGIPVTHNLPTVAHPSQAPSPNQP
380 390 400 410 420 430
>>CCDS7644.1 CTBP2 gene_id:1488|Hs108|chr10 (985 aa)
initn: 235 init1: 235 opt: 314 Z-score: 392.4 bits: 82.5 E(32554): 1.6e-15
Smith-Waterman score: 314; 27.9% identity (55.4% similar) in 312 aa overlap (23-327:594-894)
10 20 30 40 50
pF1KE0 MRPVRLMKVFVTRRIPAEGRVALARAADCEVEQWDSDEPIPAKELERGVAGA
:: .: :... : . : : :...: ::
CCDS76 QIMNGPLHPRPLVALLDGRDCTVEMPILKDLATVAFCDAQ---STQEIHEKVLNEAV-GA
570 580 590 600 610
60 70 80 90 100 110
pF1KE0 HGLLCLLSDHVDKRILDAAGANLKVISTMSVGIDHLALDEIKKRGIRVGYTPDVLTDTTA
. . : . . : :.:: .. : :.. . . :: : :.. .. ::
CCDS76 MMYHTITLTREDLEKFKA----LRVIVRIGSGYDNVDIKAAGELGIAVCNIPSAAVEETA
620 630 640 650 660 670
120 130 140 150 160
pF1KE0 ELAVSLLLTTCRRLPEAIEEVKNGGWT-SWKPLWLCGYGLTQ---STVGIIGLGRIGQAI
. .. .:. :: . ...: . : . . . : .. :.:.::.:: :::.
CCDS76 DSTICHILNLYRRNTWLYQALREGTRVQSVEQIREVASGAARIRGETLGLIGFGRTGQAV
680 690 700 710 720 730
170 180 190 200 210 220
pF1KE0 ARRLKPFGVQRFLYTGR-QPRPEEAAEFQAEFVSTPELAAQSDFIVVACSLTPATEGLCN
: : : :: . ..: : :.. : . . .: ::: . . :.:. .. : :
CCDS76 AVRAKAFGFSVIFYDPYLQDGIERSLGVQRVY-TLQDLLYQSDCVSLHCNLNEHNHHLIN
740 750 760 770 780 790
230 240 250 260 270 280
pF1KE0 KDF-FQKMKETAVFINISRGDVVNQDDLYQALASGKIAAAGLDVTSPEPLPTNH-PLLTL
:: ...:.. : ..: .:: .:.. : ::: :.: .:.::: ::. . ::
CCDS76 -DFTIKQMRQGAFLVNAARGGLVDEKALAQALKEGRIRGAALDVHESEPFSFAQGPLKDA
800 810 820 830 840 850
290 300 310 320
pF1KE0 KNCVILPHIGSATHRTRNTMSLLAANNLLAGLRGEPMPSELKL
: . :: . .... : ::... .. :. .: :.
CCDS76 PNLICTPHTAWYSEQASLEMREAAATEIRRAITGR-IPESLRNCVNKEFFVTSAPWSVID
860 870 880 890 900 910
CCDS76 QQAIHPELNGATYRYPPGIVGVAPGGLPAAMEGIIPGGIPVTHNLPTVAHPSQAPSPNQP
920 930 940 950 960 970
328 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 08:58:52 2016 done: Thu Nov 3 08:58:52 2016
Total Scan time: 2.360 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]