FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4417, 430 aa
1>>>pF1KE4417 430 - 430 aa - 430 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6429+/-0.00087; mu= 14.9925+/- 0.052
mean_var=63.9214+/-12.838, 0's: 0 Z-trim(105.5): 16 B-trim: 0 in 0/51
Lambda= 0.160417
statistics sampled from 8471 (8484) to 8471 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.64), E-opt: 0.2 (0.261), width: 16
Scan time: 2.810
The best scores are: opt bits E(32554)
CCDS10801.1 GOT2 gene_id:2806|Hs108|chr16 ( 430) 2889 677.5 6.8e-195
CCDS67045.1 GOT2 gene_id:2806|Hs108|chr16 ( 387) 2070 487.9 7.1e-138
CCDS7479.1 GOT1 gene_id:2805|Hs108|chr10 ( 413) 1324 315.3 7.1e-86
CCDS47839.1 GOT1L1 gene_id:137362|Hs108|chr8 ( 421) 686 167.6 2e-41
>>CCDS10801.1 GOT2 gene_id:2806|Hs108|chr16 (430 aa)
initn: 2889 init1: 2889 opt: 2889 Z-score: 3612.4 bits: 677.5 E(32554): 6.8e-195
Smith-Waterman score: 2889; 99.8% identity (100.0% similar) in 430 aa overlap (1-430:1-430)
10 20 30 40 50 60
pF1KE4 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 NLGVGAYRDDNGKPYVLPSVRKAEAQIAAKNLDKEYLPIGGLAEFCKASAELALGENSEV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NLGVGAYRDDNGKPYVLPSVRKAEAQIAAKNLDKEYLPIGGLAEFCKASAELALGENSEV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 LKSGRFVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LKSGRFVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYR
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 YYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVKKRNLFA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVKKRNLFA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 FFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADE
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 AKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 AKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVS
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 NLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGY
370 380 390 400 410 420
430
pF1KE4 LAHAIHQATK
:::::::.::
CCDS10 LAHAIHQVTK
430
>>CCDS67045.1 GOT2 gene_id:2806|Hs108|chr16 (387 aa)
initn: 2068 init1: 2068 opt: 2070 Z-score: 2588.8 bits: 487.9 E(32554): 7.1e-138
Smith-Waterman score: 2526; 89.8% identity (90.0% similar) in 430 aa overlap (1-430:1-387)
10 20 30 40 50 60
pF1KE4 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 NLGVGAYRDDNGKPYVLPSVRKAEAQIAAKNLDKEYLPIGGLAEFCKASAELALGENSEV
::::::::::::::::::::::
CCDS67 NLGVGAYRDDNGKPYVLPSVRK--------------------------------------
70 80
130 140 150 160 170 180
pF1KE4 LKSGRFVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYR
:::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 -----FVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYR
90 100 110 120 130
190 200 210 220 230 240
pF1KE4 YYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVKKRNLFA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 YYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVKKRNLFA
140 150 160 170 180 190
250 260 270 280 290 300
pF1KE4 FFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 FFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADE
200 210 220 230 240 250
310 320 330 340 350 360
pF1KE4 AKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 AKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVS
260 270 280 290 300 310
370 380 390 400 410 420
pF1KE4 NLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 NLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGY
320 330 340 350 360 370
430
pF1KE4 LAHAIHQATK
:::::::.::
CCDS67 LAHAIHQVTK
380
>>CCDS7479.1 GOT1 gene_id:2805|Hs108|chr10 (413 aa)
initn: 1298 init1: 919 opt: 1324 Z-score: 1655.2 bits: 315.3 E(32554): 7.1e-86
Smith-Waterman score: 1324; 48.8% identity (78.2% similar) in 404 aa overlap (31-428:5-408)
10 20 30 40 50 60
pF1KE4 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
: ...: .. : .. .: :..: . .:.
CCDS74 MAPPSVFAEVPQAQPVLVFKLTADFREDPDPRKV
10 20 30
70 80 90 100 110
pF1KE4 NLGVGAYRDDNGKPYVLPSVRKAEAQIAAKN-LDKEYLPIGGLAEFCKASAELALGENSE
:::::::: :. .:.::: :.:.: .:: : :..::::: ::::: . ...::::..:
CCDS74 NLGVGAYRTDDCHPWVLPVVKKVEQKIANDNSLNHEYLPILGLAEFRSCASRLALGDDSP
40 50 60 70 80 90
120 130 140 150 160 170
pF1KE4 VLKSGRFVTVQTISGTGALRIGASFLQRFFKFSRD----VFLPKPTWGNHTPIFRDAGMQ
.:: : ::...:::::::::.:: :... . . :.. .::: ::. .: ::..
CCDS74 ALKEKRVGGVQSLGGTGALRIGADFLARWYNGTNNKNTPVYVSSPTWENHNAVFSAAGFK
100 110 120 130 140 150
180 190 200 210 220 230
pF1KE4 -LQGYRYYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVK
...:::.: . :.:. : ..:. . :: :...::::::::::.:: :::::.::.:.:
CCDS74 DIRSYRYWDAEKRGLDLQGFLNDLENAPEFSIVVLHACAHNPTGIDPTPEQWKQIASVMK
160 170 180 190 200 210
240 250 260 270 280 290
pF1KE4 KRNLFAFFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMV
.: :: ::: ::::::::. ..::::.:.:. .:.. ::..::.:::.:::: .:.:
CCDS74 HRFLFPFFDSAYQGFASGNLERDAWAIRYFVSEGFEFFCAQSFSKNFGLYNERVGNLTVV
220 230 240 250 260 270
300 310 320 330 340 350
pF1KE4 CKDADEAKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGM
:. . .: ::.. ..: .:::: .::::.:. :..:.: ..: .::.:::::. :
CCDS74 GKEPESILQVLSQMEKIVRITWSNPPAQGARIVASTLSNPELFEEWTGNVKTMADRILTM
280 290 300 310 320 330
360 370 380 390 400 410
pF1KE4 RTQLVSNLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVT
:..: . :. . .:.::::::::: ::::.:.::: :..: ::. .:::.:.:.:
CCDS74 RSELRARLEALKTPGTWNHITDQIGMFSFTGLNPKQVEYLVNEKHIYLLPSGRINVSGLT
340 350 360 370 380 390
420 430
pF1KE4 SSNVGYLAHAIHQATK
..:. :.: .::.:
CCDS74 TKNLDYVATSIHEAVTKIQ
400 410
>>CCDS47839.1 GOT1L1 gene_id:137362|Hs108|chr8 (421 aa)
initn: 597 init1: 408 opt: 686 Z-score: 857.1 bits: 167.6 E(32554): 2e-41
Smith-Waterman score: 686; 30.2% identity (64.8% similar) in 384 aa overlap (49-428:22-398)
20 30 40 50 60 70
pF1KE4 PGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKMNLGVGAYRDDNGKPYVLP
...:.: .:. :. . ..:.:.:
CCDS47 MPTLSVFMDVPLAHKLEGSLLKTYKQDDYPNKIFLAYRVCMTNEGHPWVSL
10 20 30 40 50
80 90 100 110 120 130
pF1KE4 SVRKAEAQIAAK-NLDKEYLPIGGLAEFCKASAELALGENSEVLKSGRFVTVQTISGTGA
:.:.. ::. .:. :::: :: : .:: : .:..:... .: :.:.. .::
CCDS47 VVQKTRLQISQDPSLNYEYLPTMGLKSFIQASLALLFGKHSQAIVENRVGGVHTVGDSGA
60 70 80 90 100 110
140 150 160 170 180 190
pF1KE4 LRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYRYYDPKTCGFDFTGAVED
...:..::. . : .: :.. . : .:.: :. . : .::: .: ..
CCDS47 FQLGVQFLRAWHKDARIVYIISSQKELHGLVFQDMGFTVYEYSVWDPKKLCMDPDILLNV
120 130 140 150 160 170
200 210 220 230 240 250
pF1KE4 ISKIPEQSVLLLHA---CAHNPTGVDPRPEQWKEIATVVKKRNLFAFFDMAYQGFASGDG
. .::. ::.. : .:.: : .. ...:....: :::. ::. ..:
CCDS47 VEQIPHGCVLVMGNIIDCKLTPSG-------WAKLMSMIKSKQIFPFFDIPCQGLYTSDL
180 190 200 210 220
260 270 280 290 300 310
pF1KE4 DKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADEAKRVESQLKILIRP
..:. ...:. ::.. :: .::.:.: : :: ...: . .. : :::. : .
CCDS47 EEDTRILQYFVSQGFEFFCSQSLSKNFGIYDEGVGMLVVVAVNNQQLLCVLSQLEGLAQA
230 240 250 260 270 280
320 330 340 350 360 370
pF1KE4 MYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVSNLKKEGSTHNWQHI
.. ::: .:::. ..:: .: : .: : .: ... :. . .. .:. :. .: ::
CCDS47 LWLNPPNTGARVITSILCNPALLGEWKQSLKEVVENIMLTKEKVKEKLQLLGTPGSWGHI
290 300 310 320 330 340
380 390 400 410 420 430
pF1KE4 TDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGYLAHAIHQATK
:.: : . ::. .::: :... ::. :.:.:. . ....:..:....:..:
CCDS47 TEQSGTHGYLGLNSQQVEYLVRKKHIYIPKNGQINFSCINANNINYITEGINEAVLLTES
350 360 370 380 390 400
CCDS47 SEMCLPKEKKTLIGIKL
410 420
430 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 00:43:24 2016 done: Sun Nov 6 00:43:25 2016
Total Scan time: 2.810 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]