FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1505, 496 aa
1>>>pF1KE1505 496 - 496 aa - 496 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.1499+/-0.000792; mu= 13.7760+/- 0.048
mean_var=71.1260+/-14.486, 0's: 0 Z-trim(108.4): 14 B-trim: 184 in 1/49
Lambda= 0.152076
statistics sampled from 10154 (10166) to 10154 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.682), E-opt: 0.2 (0.312), width: 16
Scan time: 3.300
The best scores are:                                opt  bits E(32554)
CCDS6430.1 GPT gene_id:2875|Hs108|chr8      ( 496) 3294 731.7 4.3e-211
CCDS10725.1 GPT2 gene_id:84706|Hs108|chr16  ( 523) 2363 527.5 1.4e-149
CCDS45478.1 GPT2 gene_id:84706|Hs108|chr16  ( 423) 2091 467.8 1e-131
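The rows of the best-scores table above follow a regular pattern: description, library sequence length in parentheses, then the opt score, the bit score, and the E-value. A minimal sketch of reading one such row in Python follows. The function name `parse_hit_row` and the regular expression are illustrative assumptions, not part of the FASTA package, and the pattern is only checked against the three rows shown here.

```python
import re

# Assumed row layout, inferred from the three rows in this report:
#   <description> ( <length>) <opt> <bits> <E-value>
HIT_ROW = re.compile(
    r"^(?P<desc>.+?)\s*\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)


def parse_hit_row(line):
    """Return a dict for one best-scores row, or None if the line does not match."""
    m = HIT_ROW.match(line)
    if m is None:
        return None
    desc = m.group("desc")
    return {
        "id": desc.split()[0],          # first token, e.g. CCDS6430.1
        "description": desc,
        "length": int(m.group("length")),
        "opt": int(m.group("opt")),
        "bits": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }


# The three rows from this report, whitespace-insensitive.
rows = [
    "CCDS6430.1 GPT gene_id:2875|Hs108|chr8 ( 496) 3294 731.7 4.3e-211",
    "CCDS10725.1 GPT2 gene_id:84706|Hs108|chr16 ( 523) 2363 527.5 1.4e-149",
    "CCDS45478.1 GPT2 gene_id:84706|Hs108|chr16 ( 423) 2091 467.8 1e-131",
]
hits = [parse_hit_row(r) for r in rows]

for h in hits:
    print(h["id"], h["length"], h["opt"], h["bits"], h["evalue"])
```

The table header line does not match the pattern, so `parse_hit_row` returns `None` for it and it can be skipped without special handling.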
>>CCDS6430.1 GPT gene_id:2875|Hs108|chr8 (496 aa)
initn: 3294 init1: 3294 opt: 3294 Z-score: 3903.3 bits: 731.7 E(32554): 4.3e-211
Smith-Waterman score: 3294; 100.0% identity (100.0% similar) in 496 aa overlap (1-496:1-496)
               10        20        30        40        50        60
pF1KE1 MASSTGDRSQAVRHGLRAKVLTLDGMNPRVRRVEYAVRGPIVQRALELEQELRQGVKKPF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 MASSTGDRSQAVRHGLRAKVLTLDGMNPRVRRVEYAVRGPIVQRALELEQELRQGVKKPF
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 TEVIRANIGDAQAMGQRPITFLRQVLALCVNPDLLSSPNFPDDAKKRAERILQACGGHSL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 TEVIRANIGDAQAMGQRPITFLRQVLALCVNPDLLSSPNFPDDAKKRAERILQACGGHSL
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 GAYSVSSGIQLIREDVARYIERRDGGIPADPNNVFLSTGASDAIVTVLKLLVAGEGHTRT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 GAYSVSSGIQLIREDVARYIERRDGGIPADPNNVFLSTGASDAIVTVLKLLVAGEGHTRT
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE1 GVLIPIPQYPLYSATLAELGAVQVDYYLDEERAWALDVAELHRALGQARDHCRPRALCVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 GVLIPIPQYPLYSATLAELGAVQVDYYLDEERAWALDVAELHRALGQARDHCRPRALCVI
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE1 NPGNPTGQVQTRECIEAVIRFAFEERLFLLADEVYQDNVYAAGSQFHSFKKVLMEMGPPY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 NPGNPTGQVQTRECIEAVIRFAFEERLFLLADEVYQDNVYAAGSQFHSFKKVLMEMGPPY
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KE1 AGQQELASFHSTSKGYMGECGFRGGYVEVVNMDAAVQQQMLKLMSVRLCPPVPGQALLDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 AGQQELASFHSTSKGYMGECGFRGGYVEVVNMDAAVQQQMLKLMSVRLCPPVPGQALLDL
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KE1 VVSPPAPTDPSFAQFQAEKQAVLAELAAKAKLTEQVFNEAPGISCNPVQGAMYSFPRVQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 VVSPPAPTDPSFAQFQAEKQAVLAELAAKAKLTEQVFNEAPGISCNPVQGAMYSFPRVQL
              370       380       390       400       410       420
              430       440       450       460       470       480
pF1KE1 PPRAVERAQELGLAPDMFFCLRLLEETGICVVPGSGFGQREGTYHFRMTILPPLEKLRLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 PPRAVERAQELGLAPDMFFCLRLLEETGICVVPGSGFGQREGTYHFRMTILPPLEKLRLL
              430       440       450       460       470       480
              490
pF1KE1 LEKLSRFHAKFTLEYS
       ::::::::::::::::
CCDS64 LEKLSRFHAKFTLEYS
              490
>>CCDS10725.1 GPT2 gene_id:84706|Hs108|chr16 (523 aa)
initn: 2387 init1: 2350 opt: 2363 Z-score: 2799.0 bits: 527.5 E(32554): 1.4e-149
Smith-Waterman score: 2363; 69.0% identity (90.2% similar) in 480 aa overlap (17-496:44-523)
10 20 30 40
pF1KE1 MASSTGDRSQAVRHGLRAKVLTLDGMNPRVRRVEYAVRGPIVQRAL
: ..:::..:::.:. :::::::::: .:
CCDS10 PRTPSSWGRSQSSAAAEASAVLKVRPERSRRERILTLESMNPQVKAVEYAVRGPIVLKAG
20 30 40 50 60 70
50 60 70 80 90 100
pF1KE1 ELEQELRQGVKKPFTEVIRANIGDAQAMGQRPITFLRQVLALCVNPDLLSSPNFPDDAKK
:.: ::..:.::::::::::::::::::::.::::::::.:::. :.::.::.::.::::
CCDS10 EIELELQRGIKKPFTEVIRANIGDAQAMGQQPITFLRQVMALCTYPNLLDSPSFPEDAKK
80 90 100 110 120 130
110 120 130 140 150 160
pF1KE1 RAERILQACGGHSLGAYSVSSGIQLIREDVARYIERRDGGIPADPNNVFLSTGASDAIVT
::.::::::::.:::.::.:.:.. :::::: :: :::::.::::.:..:.:::::.: :
CCDS10 RARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDNIYLTTGASDGIST
140 150 160 170 180 190
170 180 190 200 210 220
pF1KE1 VLKLLVAGEGHTRTGVLIPIPQYPLYSATLAELGAVQVDYYLDEERAWALDVAELHRALG
.::.::.: :..::::.:::::::::::...:: :.::.:::::: :::.: ::.::.
CCDS10 ILKILVSGGGKSRTGVMIPIPQYPLYSAVISELDAIQVNYYLDEENCWALNVNELRRAVQ
200 210 220 230 240 250
230 240 250 260 270 280
pF1KE1 QARDHCRPRALCVINPGNPTGQVQTRECIEAVIRFAFEERLFLLADEVYQDNVYAAGSQF
.:.::: :..::.:::::::::::.:.::: ::.::.::.::::::::::::::. .:
CCDS10 EAKDHCDPKVLCIINPGNPTGQVQSRKCIEDVIHFAWEEKLFLLADEVYQDNVYSPDCRF
260 270 280 290 300 310
290 300 310 320 330 340
pF1KE1 HSFKKVLMEMGPPYAGQQELASFHSTSKGYMGECGFRGGYVEVVNMDAAVQQQMLKLMSV
:::::::.:::: :... :::::::::::::::::.::::.::.:. .. :..::.::
CCDS10 HSFKKVLYEMGPEYSSNVELASFHSTSKGYMGECGYRGGYMEVINLHPEIKGQLVKLLSV
320 330 340 350 360 370
350 360 370 380 390 400
pF1KE1 RLCPPVPGQALLDLVVSPPAPTDPSFAQFQAEKQAVLAELAAKAKLTEQVFNEAPGISCN
:::::: ::: .:.::.::. . :: ::. ::..::..:: ::::::..::..::: ::
CCDS10 RLCPPVSGQAAMDIVVNPPVAGEESFEQFSREKESVLGNLAKKAKLTEDLFNQVPGIHCN
380 390 400 410 420 430
410 420 430 440 450 460
pF1KE1 PVQGAMYSFPRVQLPPRAVERAQELGLAPDMFFCLRLLEETGICVVPGSGFGQREGTYHF
:.:::::.:::. .: .::: :: .:::::.:..::::::::::::::::::::::::
CCDS10 PLQGAMYAFPRIFIPAKAVEAAQAHQMAPDMFYCMKLLEETGICVVPGSGFGQREGTYHF
440 450 460 470 480 490
470 480 490
pF1KE1 RMTILPPLEKLRLLLEKLSRFHAKFTLEYS
:::::::.:::. .:.:.. :: .: .:.
CCDS10 RMTILPPVEKLKTVLQKVKDFHINFLEKYA
500 510 520
>>CCDS45478.1 GPT2 gene_id:84706|Hs108|chr16 (423 aa)
initn: 2128 init1: 2091 opt: 2091 Z-score: 2478.0 bits: 467.8 E(32554): 1e-131
Smith-Waterman score: 2091; 68.6% identity (90.1% similar) in 423 aa overlap (74-496:1-423)
50 60 70 80 90 100
pF1KE1 RALELEQELRQGVKKPFTEVIRANIGDAQAMGQRPITFLRQVLALCVNPDLLSSPNFPDD
:::.::::::::.:::. :.::.::.::.:
CCDS45 MGQQPITFLRQVMALCTYPNLLDSPSFPED
10 20 30
110 120 130 140 150 160
pF1KE1 AKKRAERILQACGGHSLGAYSVSSGIQLIREDVARYIERRDGGIPADPNNVFLSTGASDA
:::::.::::::::.:::.::.:.:.. :::::: :: :::::.::::.:..:.:::::.
CCDS45 AKKRARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDNIYLTTGASDG
40 50 60 70 80 90
170 180 190 200 210 220
pF1KE1 IVTVLKLLVAGEGHTRTGVLIPIPQYPLYSATLAELGAVQVDYYLDEERAWALDVAELHR
: :.::.::.: :..::::.:::::::::::...:: :.::.:::::: :::.: ::.:
CCDS45 ISTILKILVSGGGKSRTGVMIPIPQYPLYSAVISELDAIQVNYYLDEENCWALNVNELRR
100 110 120 130 140 150
230 240 250 260 270 280
pF1KE1 ALGQARDHCRPRALCVINPGNPTGQVQTRECIEAVIRFAFEERLFLLADEVYQDNVYAAG
:. .:.::: :..::.:::::::::::.:.::: ::.::.::.::::::::::::::.
CCDS45 AVQEAKDHCDPKVLCIINPGNPTGQVQSRKCIEDVIHFAWEEKLFLLADEVYQDNVYSPD
160 170 180 190 200 210
290 300 310 320 330 340
pF1KE1 SQFHSFKKVLMEMGPPYAGQQELASFHSTSKGYMGECGFRGGYVEVVNMDAAVQQQMLKL
.::::::::.:::: :... :::::::::::::::::.::::.::.:. .. :..::
CCDS45 CRFHSFKKVLYEMGPEYSSNVELASFHSTSKGYMGECGYRGGYMEVINLHPEIKGQLVKL
220 230 240 250 260 270
350 360 370 380 390 400
pF1KE1 MSVRLCPPVPGQALLDLVVSPPAPTDPSFAQFQAEKQAVLAELAAKAKLTEQVFNEAPGI
.:::::::: ::: .:.::.::. . :: ::. ::..::..:: ::::::..::..:::
CCDS45 LSVRLCPPVSGQAAMDIVVNPPVAGEESFEQFSREKESVLGNLAKKAKLTEDLFNQVPGI
280 290 300 310 320 330
410 420 430 440 450 460
pF1KE1 SCNPVQGAMYSFPRVQLPPRAVERAQELGLAPDMFFCLRLLEETGICVVPGSGFGQREGT
:::.:::::.:::. .: .::: :: .:::::.:..:::::::::::::::::::::
CCDS45 HCNPLQGAMYAFPRIFIPAKAVEAAQAHQMAPDMFYCMKLLEETGICVVPGSGFGQREGT
340 350 360 370 380 390
470 480 490
pF1KE1 YHFRMTILPPLEKLRLLLEKLSRFHAKFTLEYS
::::::::::.:::. .:.:.. :: .: .:.
CCDS45 YHFRMTILPPVEKLKTVLQKVKDFHINFLEKYA
400 410 420
496 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 22:47:10 2016 done: Sun Nov 6 22:47:10 2016
Total Scan time: 3.300 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]
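Each hit in the report carries a "Smith-Waterman score" line giving the score, percent identity, percent similarity, overlap length, and the query:library coordinate ranges. The sketch below reads those fields and checks whether both coordinate spans equal the reported overlap length, which is what one would expect when the alignment contains no gaps. The names `SW_LINE` and `parse_sw_line` are hypothetical, and the pattern is an assumption inferred only from the three lines in this report.

```python
import re

# Assumed line layout, inferred from this report:
#   Smith-Waterman score: S; I% identity (P% similar) in N aa overlap (a-b:c-d)
SW_LINE = re.compile(
    r"Smith-Waterman score:\s*(?P<score>\d+);\s*"
    r"(?P<ident>[\d.]+)% identity \((?P<sim>[\d.]+)% similar\) in "
    r"(?P<overlap>\d+) aa overlap "
    r"\((?P<qs>\d+)-(?P<qe>\d+):(?P<ss>\d+)-(?P<se>\d+)\)"
)


def parse_sw_line(line):
    """Return alignment summary fields, or None if the line does not match."""
    m = SW_LINE.search(line)
    if m is None:
        return None
    qs, qe, ss, se = (int(m.group(k)) for k in ("qs", "qe", "ss", "se"))
    overlap = int(m.group("overlap"))
    query_span = qe - qs + 1      # residues of the query covered
    library_span = se - ss + 1    # residues of the library sequence covered
    return {
        "score": int(m.group("score")),
        "identity_pct": float(m.group("ident")),
        "similar_pct": float(m.group("sim")),
        "overlap": overlap,
        "query_range": (qs, qe),
        "library_range": (ss, se),
        # True when both spans equal the overlap length (no gap columns).
        "ungapped": query_span == overlap and library_span == overlap,
    }


# The three summary lines from this report.
lines = [
    "Smith-Waterman score: 3294; 100.0% identity (100.0% similar) in 496 aa overlap (1-496:1-496)",
    "Smith-Waterman score: 2363; 69.0% identity (90.2% similar) in 480 aa overlap (17-496:44-523)",
    "Smith-Waterman score: 2091; 68.6% identity (90.1% similar) in 423 aa overlap (74-496:1-423)",
]
summaries = [parse_sw_line(s) for s in lines]

for s in summaries:
    print(s["score"], s["identity_pct"], s["overlap"], s["ungapped"])
```

For all three hits in this report the query and library spans equal the overlap length, which is consistent with the alignments shown containing no gap characters.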