FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4417, 430 aa
  1>>>pF1KE4417 430 - 430 aa - 430 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6429+/-0.00087; mu= 14.9925+/- 0.052
 mean_var=63.9214+/-12.838, 0's: 0 Z-trim(105.5): 16 B-trim: 0 in 0/51
 Lambda= 0.160417
 statistics sampled from 8471 (8484) to 8471 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.64), E-opt: 0.2 (0.261), width: 16
 Scan time: 2.810

The best scores are:                                  opt  bits E(32554)
CCDS10801.1 GOT2 gene_id:2806|Hs108|chr16     ( 430) 2889 677.5 6.8e-195
CCDS67045.1 GOT2 gene_id:2806|Hs108|chr16     ( 387) 2070 487.9 7.1e-138
CCDS7479.1 GOT1 gene_id:2805|Hs108|chr10      ( 413) 1324 315.3 7.1e-86
CCDS47839.1 GOT1L1 gene_id:137362|Hs108|chr8  ( 421)  686 167.6 2e-41

>>CCDS10801.1 GOT2 gene_id:2806|Hs108|chr16 (430 aa)
 initn: 2889 init1: 2889 opt: 2889 Z-score: 3612.4 bits: 677.5 E(32554): 6.8e-195
Smith-Waterman score: 2889; 99.8% identity (100.0% similar) in 430 aa overlap (1-430:1-430)

               10        20        30        40        50        60
pF1KE4 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 NLGVGAYRDDNGKPYVLPSVRKAEAQIAAKNLDKEYLPIGGLAEFCKASAELALGENSEV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NLGVGAYRDDNGKPYVLPSVRKAEAQIAAKNLDKEYLPIGGLAEFCKASAELALGENSEV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 LKSGRFVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LKSGRFVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYR
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 YYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVKKRNLFA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVKKRNLFA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 FFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADE
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE4 AKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 AKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVS
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE4 NLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGY
              370       380       390       400       410       420

              430
pF1KE4 LAHAIHQATK
       :::::::.::
CCDS10 LAHAIHQVTK
              430

>>CCDS67045.1 GOT2 gene_id:2806|Hs108|chr16 (387 aa)
 initn: 2068 init1: 2068 opt: 2070 Z-score: 2588.8 bits: 487.9 E(32554): 7.1e-138
Smith-Waterman score: 2526; 89.8% identity (90.0% similar) in 430 aa overlap (1-430:1-387)

               10        20        30        40        50        60
pF1KE4 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 NLGVGAYRDDNGKPYVLPSVRKAEAQIAAKNLDKEYLPIGGLAEFCKASAELALGENSEV
       ::::::::::::::::::::::
CCDS67 NLGVGAYRDDNGKPYVLPSVRK--------------------------------------
               70        80

              130       140       150       160       170       180
pF1KE4 LKSGRFVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYR
            :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 -----FVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYR
                  90       100       110       120       130

              190       200       210       220       230       240
pF1KE4 YYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVKKRNLFA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67
YYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVKKRNLFA 140 150 160 170 180 190 250 260 270 280 290 300 pF1KE4 FFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 FFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADE 200 210 220 230 240 250 310 320 330 340 350 360 pF1KE4 AKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 AKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVS 260 270 280 290 300 310 370 380 390 400 410 420 pF1KE4 NLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 NLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGY 320 330 340 350 360 370 430 pF1KE4 LAHAIHQATK :::::::.:: CCDS67 LAHAIHQVTK 380 >>CCDS7479.1 GOT1 gene_id:2805|Hs108|chr10 (413 aa) initn: 1298 init1: 919 opt: 1324 Z-score: 1655.2 bits: 315.3 E(32554): 7.1e-86 Smith-Waterman score: 1324; 48.8% identity (78.2% similar) in 404 aa overlap (31-428:5-408) 10 20 30 40 50 60 pF1KE4 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM : ...: .. : .. .: :..: . .:. CCDS74 MAPPSVFAEVPQAQPVLVFKLTADFREDPDPRKV 10 20 30 70 80 90 100 110 pF1KE4 NLGVGAYRDDNGKPYVLPSVRKAEAQIAAKN-LDKEYLPIGGLAEFCKASAELALGENSE :::::::: :. .:.::: :.:.: .:: : :..::::: ::::: . ...::::..: CCDS74 NLGVGAYRTDDCHPWVLPVVKKVEQKIANDNSLNHEYLPILGLAEFRSCASRLALGDDSP 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE4 VLKSGRFVTVQTISGTGALRIGASFLQRFFKFSRD----VFLPKPTWGNHTPIFRDAGMQ .:: : ::...:::::::::.:: :... . . :.. .::: ::. .: ::.. CCDS74 ALKEKRVGGVQSLGGTGALRIGADFLARWYNGTNNKNTPVYVSSPTWENHNAVFSAAGFK 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE4 -LQGYRYYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVK ...:::.: . :.:. : ..:. . 
:: :...::::::::::.:: :::::.::.:.: CCDS74 DIRSYRYWDAEKRGLDLQGFLNDLENAPEFSIVVLHACAHNPTGIDPTPEQWKQIASVMK 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE4 KRNLFAFFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMV .: :: ::: ::::::::. ..::::.:.:. .:.. ::..::.:::.:::: .:.: CCDS74 HRFLFPFFDSAYQGFASGNLERDAWAIRYFVSEGFEFFCAQSFSKNFGLYNERVGNLTVV 220 230 240 250 260 270 300 310 320 330 340 350 pF1KE4 CKDADEAKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGM :. . .: ::.. ..: .:::: .::::.:. :..:.: ..: .::.:::::. : CCDS74 GKEPESILQVLSQMEKIVRITWSNPPAQGARIVASTLSNPELFEEWTGNVKTMADRILTM 280 290 300 310 320 330 360 370 380 390 400 410 pF1KE4 RTQLVSNLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVT :..: . :. . .:.::::::::: ::::.:.::: :..: ::. .:::.:.:.: CCDS74 RSELRARLEALKTPGTWNHITDQIGMFSFTGLNPKQVEYLVNEKHIYLLPSGRINVSGLT 340 350 360 370 380 390 420 430 pF1KE4 SSNVGYLAHAIHQATK ..:. :.: .::.: CCDS74 TKNLDYVATSIHEAVTKIQ 400 410 >>CCDS47839.1 GOT1L1 gene_id:137362|Hs108|chr8 (421 aa) initn: 597 init1: 408 opt: 686 Z-score: 857.1 bits: 167.6 E(32554): 2e-41 Smith-Waterman score: 686; 30.2% identity (64.8% similar) in 384 aa overlap (49-428:22-398) 20 30 40 50 60 70 pF1KE4 PGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKMNLGVGAYRDDNGKPYVLP ...:.: .:. :. . ..:.:.: CCDS47 MPTLSVFMDVPLAHKLEGSLLKTYKQDDYPNKIFLAYRVCMTNEGHPWVSL 10 20 30 40 50 80 90 100 110 120 130 pF1KE4 SVRKAEAQIAAK-NLDKEYLPIGGLAEFCKASAELALGENSEVLKSGRFVTVQTISGTGA :.:.. ::. .:. :::: :: : .:: : .:..:... .: :.:.. .:: CCDS47 VVQKTRLQISQDPSLNYEYLPTMGLKSFIQASLALLFGKHSQAIVENRVGGVHTVGDSGA 60 70 80 90 100 110 140 150 160 170 180 190 pF1KE4 LRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYRYYDPKTCGFDFTGAVED ...:..::. . : .: :.. . : .:.: :. . : .::: .: .. CCDS47 FQLGVQFLRAWHKDARIVYIISSQKELHGLVFQDMGFTVYEYSVWDPKKLCMDPDILLNV 120 130 140 150 160 170 200 210 220 230 240 250 pF1KE4 ISKIPEQSVLLLHA---CAHNPTGVDPRPEQWKEIATVVKKRNLFAFFDMAYQGFASGDG . .::. ::.. : .:.: : .. ...:....: :::. ::. 
..: CCDS47 VEQIPHGCVLVMGNIIDCKLTPSG-------WAKLMSMIKSKQIFPFFDIPCQGLYTSDL 180 190 200 210 220 260 270 280 290 300 310 pF1KE4 DKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADEAKRVESQLKILIRP ..:. ...:. ::.. :: .::.:.: : :: ...: . .. : :::. : . CCDS47 EEDTRILQYFVSQGFEFFCSQSLSKNFGIYDEGVGMLVVVAVNNQQLLCVLSQLEGLAQA 230 240 250 260 270 280 320 330 340 350 360 370 pF1KE4 MYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVSNLKKEGSTHNWQHI .. ::: .:::. ..:: .: : .: : .: ... :. . .. .:. :. .: :: CCDS47 LWLNPPNTGARVITSILCNPALLGEWKQSLKEVVENIMLTKEKVKEKLQLLGTPGSWGHI 290 300 310 320 330 340 380 390 400 410 420 430 pF1KE4 TDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGYLAHAIHQATK :.: : . ::. .::: :... ::. :.:.:. . ....:..:....:..: CCDS47 TEQSGTHGYLGLNSQQVEYLVRKKHIYIPKNGQINFSCINANNINYITEGINEAVLLTES 350 360 370 380 390 400 CCDS47 SEMCLPKEKKTLIGIKL 410 420 430 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:43:24 2016 done: Sun Nov 6 00:43:25 2016 Total Scan time: 2.810 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]