FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0236, 276 aa
1>>>pF1KE0236 276 - 276 aa - 276 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.9838+/-0.000891; mu= 15.9898+/- 0.053
mean_var=52.4531+/-10.561, 0's: 0 Z-trim(104.5): 27 B-trim: 0 in 0/50
Lambda= 0.177088
statistics sampled from 7905 (7912) to 7905 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.621), E-opt: 0.2 (0.243), width: 16
Scan time: 2.010
The best scores are:                                  opt  bits E(32554)
CCDS43900.1 GLT6D1 gene_id:360203|Hs108|chr9  ( 276) 1908 495.3   2e-140
CCDS60080.1 A3GALT2 gene_id:127550|Hs108|chr1 ( 340)  582 156.6  2.2e-38
CCDS6960.1 GBGT1 gene_id:26301|Hs108|chr9     ( 347)  575 154.8  7.9e-38
CCDS65175.1 GBGT1 gene_id:26301|Hs108|chr9    ( 330)  571 153.8  1.5e-37
>>CCDS43900.1 GLT6D1 gene_id:360203|Hs108|chr9 (276 aa)
initn: 1908 init1: 1908 opt: 1908 Z-score: 2634.7 bits: 495.3 E(32554): 2e-140
Smith-Waterman score: 1908; 100.0% identity (100.0% similar) in 276 aa overlap (1-276:1-276)
               10        20        30        40        50        60
pF1KE0 MNSKRMLLLVLFAFSLMLVERYFRNHQVEELRLSDWFHPRKRPDVITKTDWLAPVLWEGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MNSKRMLLLVLFAFSLMLVERYFRNHQVEELRLSDWFHPRKRPDVITKTDWLAPVLWEGT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE0 FDRRVLEKHYRRRNITVGLAVFATGRFAEEYLRPFLHSANKHFMTGYRVIFYIMVDAFFK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 FDRRVLEKHYRRRNITVGLAVFATGRFAEEYLRPFLHSANKHFMTGYRVIFYIMVDAFFK
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE0 LPDIEPSPLRTFKAFKVGTERWWLDGPLVHVKSLGEHIASHIQDEVDFLFSMAANQVFQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LPDIEPSPLRTFKAFKVGTERWWLDGPLVHVKSLGEHIASHIQDEVDFLFSMAANQVFQN
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE0 EFGVETLGPLVAQLHAWWYFRNTKNFPYERRPTSAACIPFGQGDFYYGNLMVGGTPHNIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 EFGVETLGPLVAQLHAWWYFRNTKNFPYERRPTSAACIPFGQGDFYYGNLMVGGTPHNIL
              190       200       210       220       230       240
              250       260       270
pF1KE0 DFIKEYLNGVIHDIKNGLNSTYEKHLNKYFYLNKPT
       ::::::::::::::::::::::::::::::::::::
CCDS43 DFIKEYLNGVIHDIKNGLNSTYEKHLNKYFYLNKPT
              250       260       270
>>CCDS60080.1 A3GALT2 gene_id:127550|Hs108|chr1 (340 aa)
initn: 575 init1: 519 opt: 582 Z-score: 802.4 bits: 156.6 E(32554): 2.2e-38
Smith-Waterman score: 582; 36.5% identity (68.0% similar) in 241 aa overlap (37-275:63-303)
10 20 30 40 50 60
pF1KE0 LLLVLFAFSLMLVERYFRNHQVEELRLSDWFHPRKRPDVITKTDWLAPVLWEGTFDRRVL
..: ::.:.: : : ::..:.:.:: :
CCDS60 PKFRHLEALIPMGVCPSATMSQLRDNFTGALRPWARPEVLTCTPWGAPIIWDGSFDPDVA
40 50 60 70 80 90
70 80 90 100 110 120
pF1KE0 EKHYRRRNITVGLAVFATGRFAEEYLRPFLHSANKHFMTGYRVIFYIMVDAFFKLPDIEP
... :..:.:.::..::.::. :.::. ::..:..:::.: :..:.... .: .
CCDS60 KQEARQQNLTIGLTIFAVGRYLEKYLERFLETAEQHFMAGQSVMYYVFTELPGAVPRVAL
100 110 120 130 140 150
130 140 150 160 170 180
pF1KE0 SPLRTFKAFKVGTERWWLDGPLVHVKSLGEHIASHIQDEVDFLFSMAANQVFQNEFGVET
.: : . . .:. :: : : ......: ... :. :.: : ..: :.. :: :.
CCDS60 GPGRRLPVERVARERRWQDVSMARMRTLHAALGGLPGREAHFMFCMDVDQHFSGTFGPEA
160 170 180 190 200 210
190 200 210 220 230 240
pF1KE0 LGPLVAQLHAWWYFRNTKNFPYERRPTSAACIPFGQGDFYYGNLMVGGTPHNILDFIKEY
:. :::::.: : . .:.:: ::: . .:::::: . ::. . . .
CCDS60 LAESVAQLHSWHYHWPSWLLPFERDAHSAAAMAWGQGDFYNHAAVFGGSVAALRGLTAHC
220 230 240 250 260 270
250 260 270
pF1KE0 LNGVIHDIKNGLNSTY--EKHLNKYFYLNKPT
.:. : ::.. . :.::::.:.:.::
CCDS60 AGGLDWDRARGLEARWHDESHLNKFFWLHKPAKVLSPEFCWSPDIGPRAEIRRPRLLWAP
280 290 300 310 320 330
CCDS60 KGYRLLRN
340
>>CCDS6960.1 GBGT1 gene_id:26301|Hs108|chr9 (347 aa)
initn: 538 init1: 381 opt: 575 Z-score: 792.6 bits: 154.8 E(32554): 7.9e-38
Smith-Waterman score: 575; 34.6% identity (69.9% similar) in 269 aa overlap (14-276:46-311)
10 20 30 40
pF1KE0 MNSKRMLLLVLFAFSLMLVERYFRNHQVEELRLSDWFHPR---
:.. : .: :.. .. . :.. .:.
CCDS69 AGTSLSVLWVYLENWLPVSYVPYYLPCPEIFNMKL--HYKREKPLQPVVWSQYPQPKLLE
20 30 40 50 60 70
50 60 70 80 90
pF1KE0 KRP-DVITKTDWLAPVLWEGTFDRRVLEKHYRRRNITVGLAVFATGRFAEEYLRPFLHSA
.:: ...: : ::::.. ::::. ..:.. :. :.:.:..:::.:.... ... ::.::
CCDS69 HRPTQLLTLTPWLAPIVSEGTFNPELLQHIYQPLNLTIGVTVFAVGKYTH-FIQSFLESA
80 90 100 110 120 130
100 110 120 130 140 150
pF1KE0 NKHFMTGYRVIFYIMVDAFFKLPDIEPSPLRTFKAFKVGTERWWLDGPLVHVKSLGEHIA
.. :: :::: .::..: .: . .: : .... . . : . . .......:::
CCDS69 EEFFMRGYRVHYYIFTDNPAAVPGVPLGPHRLLSSIPIQGHSHWEETSMRRMETISQHIA
140 150 160 170 180 190
160 170 180 190 200 210
pF1KE0 SHIQDEVDFLFSMAANQVFQNEFGVETLGPLVAQLHAWWYFRNTKNFPYERRPTSAACIP
.. . :::.:: . ...::.: .: :::: ::: .: .: ..:::::: .:.: .
CCDS69 KRAHREVDYLFCLDVDMVFRNPWGPETLGDLVAAIHPSYYAVPRQQFPYERRRVSTAFVA
200 210 220 230 240 250
220 230 240 250 260 270
pF1KE0 FGQGDFYYGNLMVGGTPHNILDFIKEYLNGVIHDIKNGLNSTY--EKHLNKYFYLNKPT
..::::::. . :: . .: . ... : ::. ... :.:::..: :::.
CCDS69 DSEGDFYYGGAVFGGQVARVYEFTRGCHMAILADKANGIMAAWREESHLNRHFISNKPSK
260 270 280 290 300 310
CCDS69 VLSPEYLWDDRKPQPPSLKLIRFSTLDKDISCLRS
320 330 340
>>CCDS65175.1 GBGT1 gene_id:26301|Hs108|chr9 (330 aa)
initn: 538 init1: 381 opt: 571 Z-score: 787.5 bits: 153.8 E(32554): 1.5e-37
Smith-Waterman score: 571; 36.0% identity (70.8% similar) in 250 aa overlap (33-276:46-294)
10 20 30 40 50
pF1KE0 SKRMLLLVLFAFSLMLVERYFRNHQVEELRLSDWFHPR---KRP-DVITKTDWLAPVLWE
::.. .:. .:: ...: : ::::.. :
CCDS65 AGTSLSVLWVYLENWLPVSYVPYYLPCPEILSQYPQPKLLEHRPTQLLTLTPWLAPIVSE
20 30 40 50 60 70
60 70 80 90 100 110
pF1KE0 GTFDRRVLEKHYRRRNITVGLAVFATGRFAEEYLRPFLHSANKHFMTGYRVIFYIMVDAF
:::. ..:.. :. :.:.:..:::.:.... ... ::.::.. :: :::: .::..:
CCDS65 GTFNPELLQHIYQPLNLTIGVTVFAVGKYTH-FIQSFLESAEEFFMRGYRVHYYIFTDNP
80 90 100 110 120 130
120 130 140 150 160 170
pF1KE0 FKLPDIEPSPLRTFKAFKVGTERWWLDGPLVHVKSLGEHIASHIQDEVDFLFSMAANQVF
.: . .: : .... . . : . . .......:::.. . :::.:: . ...::
CCDS65 AAVPGVPLGPHRLLSSIPIQGHSHWEETSMRRMETISQHIAKRAHREVDYLFCLDVDMVF
140 150 160 170 180 190
180 190 200 210 220 230
pF1KE0 QNEFGVETLGPLVAQLHAWWYFRNTKNFPYERRPTSAACIPFGQGDFYYGNLMVGGTPHN
.: .: :::: ::: .: .: ..:::::: .:.: . ..::::::. . ::
CCDS65 RNPWGPETLGDLVAAIHPSYYAVPRQQFPYERRRVSTAFVADSEGDFYYGGAVFGGQVAR
200 210 220 230 240 250
240 250 260 270
pF1KE0 ILDFIKEYLNGVIHDIKNGLNSTY--EKHLNKYFYLNKPT
. .: . ... : ::. ... :.:::..: :::.
CCDS65 VYEFTRGCHMAILADKANGIMAAWREESHLNRHFISNKPSKVLSPEYLWDDRKPQPPSLK
260 270 280 290 300 310
CCDS65 LIRFSTLDKDISCLRS
320 330
276 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 20:02:22 2016 done: Thu Nov 3 20:02:22 2016
Total Scan time: 2.010 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]