FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1505, 496 aa 1>>>pF1KE1505 496 - 496 aa - 496 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.1499+/-0.000792; mu= 13.7760+/- 0.048 mean_var=71.1260+/-14.486, 0's: 0 Z-trim(108.4): 14 B-trim: 184 in 1/49 Lambda= 0.152076 statistics sampled from 10154 (10166) to 10154 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.682), E-opt: 0.2 (0.312), width: 16 Scan time: 3.300 The best scores are: opt bits E(32554) CCDS6430.1 GPT gene_id:2875|Hs108|chr8 ( 496) 3294 731.7 4.3e-211 CCDS10725.1 GPT2 gene_id:84706|Hs108|chr16 ( 523) 2363 527.5 1.4e-149 CCDS45478.1 GPT2 gene_id:84706|Hs108|chr16 ( 423) 2091 467.8 1e-131 >>CCDS6430.1 GPT gene_id:2875|Hs108|chr8 (496 aa) initn: 3294 init1: 3294 opt: 3294 Z-score: 3903.3 bits: 731.7 E(32554): 4.3e-211 Smith-Waterman score: 3294; 100.0% identity (100.0% similar) in 496 aa overlap (1-496:1-496) 10 20 30 40 50 60 pF1KE1 MASSTGDRSQAVRHGLRAKVLTLDGMNPRVRRVEYAVRGPIVQRALELEQELRQGVKKPF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 MASSTGDRSQAVRHGLRAKVLTLDGMNPRVRRVEYAVRGPIVQRALELEQELRQGVKKPF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 TEVIRANIGDAQAMGQRPITFLRQVLALCVNPDLLSSPNFPDDAKKRAERILQACGGHSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 TEVIRANIGDAQAMGQRPITFLRQVLALCVNPDLLSSPNFPDDAKKRAERILQACGGHSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GAYSVSSGIQLIREDVARYIERRDGGIPADPNNVFLSTGASDAIVTVLKLLVAGEGHTRT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 GAYSVSSGIQLIREDVARYIERRDGGIPADPNNVFLSTGASDAIVTVLKLLVAGEGHTRT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 GVLIPIPQYPLYSATLAELGAVQVDYYLDEERAWALDVAELHRALGQARDHCRPRALCVI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 GVLIPIPQYPLYSATLAELGAVQVDYYLDEERAWALDVAELHRALGQARDHCRPRALCVI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 NPGNPTGQVQTRECIEAVIRFAFEERLFLLADEVYQDNVYAAGSQFHSFKKVLMEMGPPY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 NPGNPTGQVQTRECIEAVIRFAFEERLFLLADEVYQDNVYAAGSQFHSFKKVLMEMGPPY 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 AGQQELASFHSTSKGYMGECGFRGGYVEVVNMDAAVQQQMLKLMSVRLCPPVPGQALLDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 AGQQELASFHSTSKGYMGECGFRGGYVEVVNMDAAVQQQMLKLMSVRLCPPVPGQALLDL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 VVSPPAPTDPSFAQFQAEKQAVLAELAAKAKLTEQVFNEAPGISCNPVQGAMYSFPRVQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 VVSPPAPTDPSFAQFQAEKQAVLAELAAKAKLTEQVFNEAPGISCNPVQGAMYSFPRVQL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 PPRAVERAQELGLAPDMFFCLRLLEETGICVVPGSGFGQREGTYHFRMTILPPLEKLRLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 PPRAVERAQELGLAPDMFFCLRLLEETGICVVPGSGFGQREGTYHFRMTILPPLEKLRLL 430 440 450 460 470 480 490 pF1KE1 LEKLSRFHAKFTLEYS :::::::::::::::: CCDS64 LEKLSRFHAKFTLEYS 490 >>CCDS10725.1 GPT2 gene_id:84706|Hs108|chr16 (523 aa) initn: 2387 init1: 2350 opt: 2363 Z-score: 2799.0 bits: 527.5 E(32554): 1.4e-149 Smith-Waterman score: 2363; 69.0% identity (90.2% similar) in 480 aa overlap (17-496:44-523) 10 20 30 40 pF1KE1 MASSTGDRSQAVRHGLRAKVLTLDGMNPRVRRVEYAVRGPIVQRAL : ..:::..:::.:. :::::::::: .: CCDS10 PRTPSSWGRSQSSAAAEASAVLKVRPERSRRERILTLESMNPQVKAVEYAVRGPIVLKAG 20 30 40 50 60 70 50 60 70 80 90 100 pF1KE1 ELEQELRQGVKKPFTEVIRANIGDAQAMGQRPITFLRQVLALCVNPDLLSSPNFPDDAKK :.: ::..:.::::::::::::::::::::.::::::::.:::. :.::.::.::.:::: CCDS10 EIELELQRGIKKPFTEVIRANIGDAQAMGQQPITFLRQVMALCTYPNLLDSPSFPEDAKK 80 90 100 110 120 130 110 120 130 140 150 160 pF1KE1 RAERILQACGGHSLGAYSVSSGIQLIREDVARYIERRDGGIPADPNNVFLSTGASDAIVT ::.::::::::.:::.::.:.:.. :::::: :: :::::.::::.:..:.:::::.: : CCDS10 RARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDNIYLTTGASDGIST 140 150 160 170 180 190 170 180 190 200 210 220 pF1KE1 VLKLLVAGEGHTRTGVLIPIPQYPLYSATLAELGAVQVDYYLDEERAWALDVAELHRALG .::.::.: :..::::.:::::::::::...:: :.::.:::::: :::.: ::.::. CCDS10 ILKILVSGGGKSRTGVMIPIPQYPLYSAVISELDAIQVNYYLDEENCWALNVNELRRAVQ 200 210 220 230 240 250 230 240 250 260 270 280 pF1KE1 QARDHCRPRALCVINPGNPTGQVQTRECIEAVIRFAFEERLFLLADEVYQDNVYAAGSQF .:.::: :..::.:::::::::::.:.::: ::.::.::.::::::::::::::. .: CCDS10 EAKDHCDPKVLCIINPGNPTGQVQSRKCIEDVIHFAWEEKLFLLADEVYQDNVYSPDCRF 260 270 280 290 300 310 290 300 310 320 330 340 pF1KE1 HSFKKVLMEMGPPYAGQQELASFHSTSKGYMGECGFRGGYVEVVNMDAAVQQQMLKLMSV :::::::.:::: :... :::::::::::::::::.::::.::.:. .. :..::.:: CCDS10 HSFKKVLYEMGPEYSSNVELASFHSTSKGYMGECGYRGGYMEVINLHPEIKGQLVKLLSV 320 330 340 350 360 370 350 360 370 380 390 400 pF1KE1 RLCPPVPGQALLDLVVSPPAPTDPSFAQFQAEKQAVLAELAAKAKLTEQVFNEAPGISCN :::::: ::: .:.::.::. . :: ::. ::..::..:: ::::::..::..::: :: CCDS10 RLCPPVSGQAAMDIVVNPPVAGEESFEQFSREKESVLGNLAKKAKLTEDLFNQVPGIHCN 380 390 400 410 420 430 410 420 430 440 450 460 pF1KE1 PVQGAMYSFPRVQLPPRAVERAQELGLAPDMFFCLRLLEETGICVVPGSGFGQREGTYHF :.:::::.:::. .: .::: :: .:::::.:..:::::::::::::::::::::::: CCDS10 PLQGAMYAFPRIFIPAKAVEAAQAHQMAPDMFYCMKLLEETGICVVPGSGFGQREGTYHF 440 450 460 470 480 490 470 480 490 pF1KE1 RMTILPPLEKLRLLLEKLSRFHAKFTLEYS :::::::.:::. .:.:.. :: .: .:. CCDS10 RMTILPPVEKLKTVLQKVKDFHINFLEKYA 500 510 520 >>CCDS45478.1 GPT2 gene_id:84706|Hs108|chr16 (423 aa) initn: 2128 init1: 2091 opt: 2091 Z-score: 2478.0 bits: 467.8 E(32554): 1e-131 Smith-Waterman score: 2091; 68.6% identity (90.1% similar) in 423 aa overlap (74-496:1-423) 50 60 70 80 90 100 pF1KE1 RALELEQELRQGVKKPFTEVIRANIGDAQAMGQRPITFLRQVLALCVNPDLLSSPNFPDD :::.::::::::.:::. :.::.::.::.: CCDS45 MGQQPITFLRQVMALCTYPNLLDSPSFPED 10 20 30 110 120 130 140 150 160 pF1KE1 AKKRAERILQACGGHSLGAYSVSSGIQLIREDVARYIERRDGGIPADPNNVFLSTGASDA :::::.::::::::.:::.::.:.:.. :::::: :: :::::.::::.:..:.:::::. CCDS45 AKKRARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDNIYLTTGASDG 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE1 IVTVLKLLVAGEGHTRTGVLIPIPQYPLYSATLAELGAVQVDYYLDEERAWALDVAELHR : :.::.::.: :..::::.:::::::::::...:: :.::.:::::: :::.: ::.: CCDS45 ISTILKILVSGGGKSRTGVMIPIPQYPLYSAVISELDAIQVNYYLDEENCWALNVNELRR 100 110 120 130 140 150 230 240 250 260 270 280 pF1KE1 ALGQARDHCRPRALCVINPGNPTGQVQTRECIEAVIRFAFEERLFLLADEVYQDNVYAAG :. .:.::: :..::.:::::::::::.:.::: ::.::.::.::::::::::::::. CCDS45 AVQEAKDHCDPKVLCIINPGNPTGQVQSRKCIEDVIHFAWEEKLFLLADEVYQDNVYSPD 160 170 180 190 200 210 290 300 310 320 330 340 pF1KE1 SQFHSFKKVLMEMGPPYAGQQELASFHSTSKGYMGECGFRGGYVEVVNMDAAVQQQMLKL .::::::::.:::: :... :::::::::::::::::.::::.::.:. .. :..:: CCDS45 CRFHSFKKVLYEMGPEYSSNVELASFHSTSKGYMGECGYRGGYMEVINLHPEIKGQLVKL 220 230 240 250 260 270 350 360 370 380 390 400 pF1KE1 MSVRLCPPVPGQALLDLVVSPPAPTDPSFAQFQAEKQAVLAELAAKAKLTEQVFNEAPGI .:::::::: ::: .:.::.::. . :: ::. ::..::..:: ::::::..::..::: CCDS45 LSVRLCPPVSGQAAMDIVVNPPVAGEESFEQFSREKESVLGNLAKKAKLTEDLFNQVPGI 280 290 300 310 320 330 410 420 430 440 450 460 pF1KE1 SCNPVQGAMYSFPRVQLPPRAVERAQELGLAPDMFFCLRLLEETGICVVPGSGFGQREGT :::.:::::.:::. .: .::: :: .:::::.:..::::::::::::::::::::: CCDS45 HCNPLQGAMYAFPRIFIPAKAVEAAQAHQMAPDMFYCMKLLEETGICVVPGSGFGQREGT 340 350 360 370 380 390 470 480 490 pF1KE1 YHFRMTILPPLEKLRLLLEKLSRFHAKFTLEYS ::::::::::.:::. .:.:.. :: .: .:. CCDS45 YHFRMTILPPVEKLKTVLQKVKDFHINFLEKYA 400 410 420 496 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 22:47:10 2016 done: Sun Nov 6 22:47:10 2016 Total Scan time: 3.300 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]