FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6616, 335 aa
1>>>pF1KE6616 335 - 335 aa - 335 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.0542+/-0.00061; mu= 9.2350+/- 0.037
mean_var=109.9348+/-21.989, 0's: 0 Z-trim(115.6): 11 B-trim: 0 in 0/53
Lambda= 0.122323
statistics sampled from 16171 (16178) to 16171 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.803), E-opt: 0.2 (0.497), width: 16
Scan time: 2.430
The best scores are:                                  opt bits  E(32554)
CCDS8025.1 B3GAT3 gene_id:26229|Hs108|chr11   ( 335) 2288 413.5 1.2e-115
CCDS76417.1 B3GAT3 gene_id:26229|Hs108|chr11  ( 319) 2081 377.0 1.1e-104
CCDS76418.1 B3GAT3 gene_id:26229|Hs108|chr11  ( 315) 2071 375.2 3.8e-104
CCDS8500.1 B3GAT1 gene_id:27087|Hs108|chr11   ( 334)  870 163.3 2.6e-40
CCDS4974.1 B3GAT2 gene_id:135152|Hs108|chr6   ( 323)  578 111.7 8.1e-25
>>CCDS8025.1 B3GAT3 gene_id:26229|Hs108|chr11 (335 aa)
initn: 2288 init1: 2288 opt: 2288 Z-score: 2189.7 bits: 413.5 E(32554): 1.2e-115
Smith-Waterman score: 2288; 100.0% identity (100.0% similar) in 335 aa overlap (1-335:1-335)
10 20 30 40 50 60
pF1KE6 MKLKLKNVFLAYFLVSIAGLLYALVQLGQPCDCLPPLRAAAEQLRQKDLRISQLQAELRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS80 MKLKLKNVFLAYFLVSIAGLLYALVQLGQPCDCLPPLRAAAEQLRQKDLRISQLQAELRR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 PPPAPAQPPEPEALPTIYVVTPTYARLVQKAELVRLSQTLSLVPRLHWLLVEDAEGPTPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS80 PPPAPAQPPEPEALPTIYVVTPTYARLVQKAELVRLSQTLSLVPRLHWLLVEDAEGPTPL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 VSGLLAASGLLFTHLVVLTPKAQRLREGEPGWVHPRGVEQRNKALDWLRGRGGAVGGEKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS80 VSGLLAASGLLFTHLVVLTPKAQRLREGEPGWVHPRGVEQRNKALDWLRGRGGAVGGEKD
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 PPPPGTQGVVYFADDDNTYSRELFEEMRWTRGVSVWPVGLVGGLRFEGPQVQDGRVVGFH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS80 PPPPGTQGVVYFADDDNTYSRELFEEMRWTRGVSVWPVGLVGGLRFEGPQVQDGRVVGFH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE6 TAWEPSRPFPVDMAGFAVALPLLLDKPNAQFDSTAPRGHLESSLLSHLVDPKDLEPRAAN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS80 TAWEPSRPFPVDMAGFAVALPLLLDKPNAQFDSTAPRGHLESSLLSHLVDPKDLEPRAAN
250 260 270 280 290 300
310 320 330
pF1KE6 CTRVLVWHTRTEKPKMKQEEQLQRQGRGSDPAIEV
:::::::::::::::::::::::::::::::::::
CCDS80 CTRVLVWHTRTEKPKMKQEEQLQRQGRGSDPAIEV
310 320 330
>>CCDS76417.1 B3GAT3 gene_id:26229|Hs108|chr11 (319 aa)
initn: 2120 init1: 2074 opt: 2081 Z-score: 1992.6 bits: 377.0 E(32554): 1.1e-104
Smith-Waterman score: 2081; 98.1% identity (98.4% similar) in 312 aa overlap (1-312:1-312)
10 20 30 40 50 60
pF1KE6 MKLKLKNVFLAYFLVSIAGLLYALVQLGQPCDCLPPLRAAAEQLRQKDLRISQLQAELRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 MKLKLKNVFLAYFLVSIAGLLYALVQLGQPCDCLPPLRAAAEQLRQKDLRISQLQAELRR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 PPPAPAQPPEPEALPTIYVVTPTYARLVQKAELVRLSQTLSLVPRLHWLLVEDAEGPTPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 PPPAPAQPPEPEALPTIYVVTPTYARLVQKAELVRLSQTLSLVPRLHWLLVEDAEGPTPL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 VSGLLAASGLLFTHLVVLTPKAQRLREGEPGWVHPRGVEQRNKALDWLRGRGGAVGGEKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 VSGLLAASGLLFTHLVVLTPKAQRLREGEPGWVHPRGVEQRNKALDWLRGRGGAVGGEKD
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 PPPPGTQGVVYFADDDNTYSRELFEEMRWTRGVSVWPVGLVGGLRFEGPQVQDGRVVGFH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 PPPPGTQGVVYFADDDNTYSRELFEEMRWTRGVSVWPVGLVGGLRFEGPQVQDGRVVGFH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE6 TAWEPSRPFPVDMAGFAVALPLLLDKPNAQFDSTAPRGHLESSLLSHLVDPKDLEPRAAN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 TAWEPSRPFPVDMAGFAVALPLLLDKPNAQFDSTAPRGHLESSLLSHLVDPKDLEPRAAN
250 260 270 280 290 300
310 320 330
pF1KE6 CTRVLVWHTRTEKPKMKQEEQLQRQGRGSDPAIEV
::: :. : :
CCDS76 CTRSLAVSPRLECSSAILA
310
>>CCDS76418.1 B3GAT3 gene_id:26229|Hs108|chr11 (315 aa)
initn: 2071 init1: 2071 opt: 2071 Z-score: 1983.1 bits: 375.2 E(32554): 3.8e-104
Smith-Waterman score: 2071; 100.0% identity (100.0% similar) in 303 aa overlap (1-303:1-303)
10 20 30 40 50 60
pF1KE6 MKLKLKNVFLAYFLVSIAGLLYALVQLGQPCDCLPPLRAAAEQLRQKDLRISQLQAELRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 MKLKLKNVFLAYFLVSIAGLLYALVQLGQPCDCLPPLRAAAEQLRQKDLRISQLQAELRR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 PPPAPAQPPEPEALPTIYVVTPTYARLVQKAELVRLSQTLSLVPRLHWLLVEDAEGPTPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 PPPAPAQPPEPEALPTIYVVTPTYARLVQKAELVRLSQTLSLVPRLHWLLVEDAEGPTPL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 VSGLLAASGLLFTHLVVLTPKAQRLREGEPGWVHPRGVEQRNKALDWLRGRGGAVGGEKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 VSGLLAASGLLFTHLVVLTPKAQRLREGEPGWVHPRGVEQRNKALDWLRGRGGAVGGEKD
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 PPPPGTQGVVYFADDDNTYSRELFEEMRWTRGVSVWPVGLVGGLRFEGPQVQDGRVVGFH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 PPPPGTQGVVYFADDDNTYSRELFEEMRWTRGVSVWPVGLVGGLRFEGPQVQDGRVVGFH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE6 TAWEPSRPFPVDMAGFAVALPLLLDKPNAQFDSTAPRGHLESSLLSHLVDPKDLEPRAAN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 TAWEPSRPFPVDMAGFAVALPLLLDKPNAQFDSTAPRGHLESSLLSHLVDPKDLEPRAAN
250 260 270 280 290 300
310 320 330
pF1KE6 CTRVLVWHTRTEKPKMKQEEQLQRQGRGSDPAIEV
:::
CCDS76 CTRTESRCVTQAGVQ
310
>>CCDS8500.1 B3GAT1 gene_id:27087|Hs108|chr11 (334 aa)
initn: 863 init1: 347 opt: 870 Z-score: 837.3 bits: 163.3 E(32554): 2.6e-40
Smith-Waterman score: 885; 55.2% identity (73.7% similar) in 270 aa overlap (68-335:77-334)
40 50 60 70 80 90
pF1KE6 RAAAEQLRQKDLRISQLQAELRRPPPAPAQPPEPEALPTIYVVTPTYARLVQKAELVRLS
:: ..::::.::::::.: ::::::.:..
CCDS85 TPPGADPREYCTSDRDIVEVVRTEYVYTRPPPWSDTLPTIHVVTPTYSRPVQKAELTRMA
50 60 70 80 90 100
100 110 120 130 140 150
pF1KE6 QTLSLVPRLHWLLVEDAEGPTPLVSGLLAASGLLFTHLVVLTPKAQRLREGEPGWVHPRG
.:: :: ::::.:::: :::.. :: .:: .::: : ::. .:: :::
CCDS85 NTLLHVPNLHWLVVEDAPRRTPLTARLLRDTGLNYTHLHVETPRNYKLRGDARDPRIPRG
110 120 130 140 150 160
160 170 180 190 200 210
pF1KE6 VEQRNKALDWLRGRGGAVGGEKDPPPPGTQGVVYFADDDNTYSRELFEEMRWTRGVSVWP
. ::: :: ::: : : . ::::::::::::: ::::::: :: :::::
CCDS85 TMQRNLALRWLR--------ETFPRNSSQPGVVYFADDDNTYSLELFEEMRSTRRVSVWP
170 180 190 200 210
220 230 240 250 260 270
pF1KE6 VGLVGGLRFEGPQVQD-GRVVGFHTAWEPSRPFPVDMAGFAVALPLLLDKPNAQFDSTAP
:..:::::.:.:.:. :.:::..:...: ::: .::::::: : :.:.. .: : .
CCDS85 VAFVGGLRYEAPRVNGAGKVVGWKTVFDPHRPFAIDMAGFAVNLRLILQRSQAYFKLRGV
220 230 240 250 260 270
280 290 300 310 320 330
pF1KE6 RG-HLESSLLSHLVDPKDLEPRAANCTRVLVWHTRTEKPKMKQEEQLQRQGRGSDPAIEV
.: . ::::: .:: .::::.:::::..:::::::::: . .: ..: .::..:.
CCDS85 KGGYQESSLLRELVTLNDLEPKAANCTKILVWHTRTEKPVLVNE---GKKGF-TDPSVEI
280 290 300 310 320 330
>>CCDS4974.1 B3GAT2 gene_id:135152|Hs108|chr6 (323 aa)
initn: 835 init1: 392 opt: 578 Z-score: 559.0 bits: 111.7 E(32554): 8.1e-25
Smith-Waterman score: 837; 52.3% identity (73.9% similar) in 264 aa overlap (59-321:65-313)
30 40 50 60 70 80
pF1KE6 QPCDCLPPLRAAAEQLRQKDLRISQLQAELRRPPPAPAQPPEPEALPTIYVVTPTYARLV
.: : :::. :::::..::::.: :
CCDS49 TPRPYFSPYAVGRGGARLPLRRGGPAHGTQKRNQSRPQPQPEPQ-LPTIYAITPTYSRPV
40 50 60 70 80 90
90 100 110 120 130 140
pF1KE6 QKAELVRLSQTLSLVPRLHWLLVEDAEGPTPLVSGLLAASGLLFTHLVVLTPKAQRLREG
:::::.::..:. : .:::.::::: . . ::: .:: .:: ::: : ::. :
CCDS49 QKAELTRLANTFRQVAQLHWILVEDAAARSELVSRFLARAGLPSTHLHVPTPR----RYK
100 110 120 130 140
150 160 170 180 190 200
pF1KE6 EPGWVHPRGVEQRNKALDWLRGRGGAVGGEKDPPPPGTQGVVYFADDDNTYSRELFEEMR
.:: ::..:::: .: ::: : .. : ::..::::::::: :::.:::
CCDS49 RPGL--PRATEQRNAGLAWLRQRHQ---HQRAQP-----GVLFFADDDNTYSLELFQEMR
150 160 170 180 190
210 220 230 240 250 260
pF1KE6 WTRGVSVWPVGLVGGLRFEGPQVQDGRVVGFHTAWEPSRPFPVDMAGFAVALPLLLDKPN
:: ::::::::::: :.: : :..:.:::..:.:. .::: .:::::::.: ..:..:.
CCDS49 TTRKVSVWPVGLVGGRRYERPLVENGKVVGWYTGWRADRPFAIDMAGFAVSLQVILSNPK
200 210 220 230 240 250
270 280 290 300 310 320
pF1KE6 AQFDSTAPR-GHLESSLLSHLVDPKDLEPRAANCTRVLVWHTRTEKPKMKQEEQLQRQGR
: : . . : ::..:.... ..:::.: :::.:::::::::: .. .: .
CCDS49 AVFKRRGSQPGMQESDFLKQITTVEELEPKANNCTKVLVWHTRTEKVNLANEPKYHLDTV
260 270 280 290 300 310
330
pF1KE6 GSDPAIEV
CCDS49 KIEV
320
335 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 14:48:19 2016 done: Tue Nov 8 14:48:19 2016
Total Scan time: 2.430 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]
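
Note on reading the bits and E() columns: the sketch below is an illustrative sanity check, not FASTA's own procedure. FASTA derives E() from the statistical fit reported in the "Statistics:" lines. The sketch assumes the common approximation E ~ N * m * n * 2^(-bits), where m is the query length, n is the library sequence length, and N is the number of library sequences. The function name and the HITS list are hypothetical helpers; the numbers are copied from the best-scores table in this report. Because bit scores are printed to one decimal place, only approximate agreement in log10(E) should be expected.

```python
import math

# Values copied from the best-scores table of this report:
# (library id, library sequence length, bit score, reported E-value).
HITS = [
    ("CCDS8025.1", 335, 413.5, 1.2e-115),
    ("CCDS76417.1", 319, 377.0, 1.1e-104),
    ("CCDS76418.1", 315, 375.2, 3.8e-104),
    ("CCDS8500.1", 334, 163.3, 2.6e-40),
    ("CCDS4974.1", 323, 111.7, 8.1e-25),
]
QUERY_LEN = 335        # "Query: pF1KE6616, 335 aa"
LIBRARY_SEQS = 32554   # "18511270 residues in 32554 sequences"


def approx_log10_evalue(bits, query_len, subject_len, n_seqs):
    """Assumed approximation: log10 of N * m * n * 2**(-bits).

    Working in log10 keeps very small E-values readable and
    avoids relying on how tiny floats are printed.
    """
    return (
        math.log10(n_seqs)
        + math.log10(query_len)
        + math.log10(subject_len)
        - bits * math.log10(2)
    )


# Compare the reported and estimated log10(E) for each hit.
for name, subject_len, bits, reported in HITS:
    estimate = approx_log10_evalue(bits, QUERY_LEN, subject_len, LIBRARY_SEQS)
    print(f"{name:12s} reported log10(E) = {math.log10(reported):8.2f}"
          f"  estimated = {estimate:8.2f}")
```

If the two columns printed by this loop differ by much more than the rounding of the bit score allows, the hit line was probably transcribed or parsed incorrectly.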