FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KA0219, 2671 aa
1>>>pF1KA0219 2671 - 2671 aa - 2671 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.5981+/-0.00126; mu= 19.3624+/- 0.076
mean_var=81.5354+/-16.075, 0's: 0 Z-trim(100.8): 39 B-trim: 0 in 0/52
Lambda= 0.142037
statistics sampled from 6240 (6253) to 6240 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.527), E-opt: 0.2 (0.192), width: 16
Scan time: 8.750
The best scores are:                                opt   bits E(32554)
CCDS41847.1 GCN1 gene_id:10985|Hs108|chr12 (2671) 17080 3511.3        0
>>CCDS41847.1 GCN1 gene_id:10985|Hs108|chr12 (2671 aa)
initn: 17080 init1: 17080 opt: 17080 Z-score: 18899.1 bits: 3511.3 E(32554): 0
Smith-Waterman score: 17080; 100.0% identity (100.0% similar) in 2671 aa overlap (1-2671:1-2671)
10 20 30 40 50 60
pF1KA0 MAADTQVSETLKRFAGKVTTASVKERREILSELGKCVAGKDLPEGAVKGLCKLFCLTLHR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MAADTQVSETLKRFAGKVTTASVKERREILSELGKCVAGKDLPEGAVKGLCKLFCLTLHR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA0 YRDAASRRALQAAIQQLAEAQPEATAKNLLHSLQSSGIGSKAGVPSKSSGSAALLALTWT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 YRDAASRRALQAAIQQLAEAQPEATAKNLLHSLQSSGIGSKAGVPSKSSGSAALLALTWT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA0 CLLVRIVFPSRAKRQGDIWNKLVEVQCLLLLEVLGGSHKHAVDGAVKKLTKLWKENPGLV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 CLLVRIVFPSRAKRQGDIWNKLVEVQCLLLLEVLGGSHKHAVDGAVKKLTKLWKENPGLV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA0 EQYLSAILSLEPNQNYAGMLGLLVQFCTSHKEMDVVSQHKSALLDFYMKNILMSKVKPPK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EQYLSAILSLEPNQNYAGMLGLLVQFCTSHKEMDVVSQHKSALLDFYMKNILMSKVKPPK
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA0 YLLDSCAPLLRYLSHSEFKDLILPTIQKSLLRSPENVIETISSLLASVTLDFSQYAMDIV
:::::::::::::::::::::::::::::::::::::::::::::::::::.::::::::
CCDS41 YLLDSCAPLLRYLSHSEFKDLILPTIQKSLLRSPENVIETISSLLASVTLDLSQYAMDIV
250 260 270 280 290 300
310 320 330 340 350 360
pF1KA0 KGLAGHLKSNSPRLMDEAVLALRNLARQCSDSSAMESLTKHLFAILGGSEGKLTVVAQKM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 KGLAGHLKSNSPRLMDEAVLALRNLARQCSDSSAMESLTKHLFAILGGSEGKLTVVAQKM
310 320 330 340 350 360
370 380 390 400 410 420
pF1KA0 SVLSGIGSVSHHVVSGPSSQVLNGIVAELFIPFLQQEVHEGTLVHAVSVLALWCNRFTME
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SVLSGIGSVSHHVVSGPSSQVLNGIVAELFIPFLQQEVHEGTLVHAVSVLALWCNRFTME
370 380 390 400 410 420
430 440 450 460 470 480
pF1KA0 VPKKLTEWFKKAFSLKTSTSAVRHAYLQCMLASYRGDTLLQALDLLPLLIQTVEKAASQS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 VPKKLTEWFKKAFSLKTSTSAVRHAYLQCMLASYRGDTLLQALDLLPLLIQTVEKAASQS
430 440 450 460 470 480
490 500 510 520 530 540
pF1KA0 TQVPTITEGVAAALLLLKLSVADSQAEAKLSSFWQLIVDEKKQVFTSEKFLVMASEDALC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 TQVPTITEGVAAALLLLKLSVADSQAEAKLSSFWQLIVDEKKQVFTSEKFLVMASEDALC
490 500 510 520 530 540
550 560 570 580 590 600
pF1KA0 TVLHLTERLFLDHPHRLTGNKVQQYHRALVAVLLSRTWHVRRQAQQTVRKLLSSLGGFKL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 TVLHLTERLFLDHPHRLTGNKVQQYHRALVAVLLSRTWHVRRQAQQTVRKLLSSLGGFKL
550 560 570 580 590 600
610 620 630 640 650 660
pF1KA0 AHGLLEELKTVLSSHKVLPLEALVTDAGEVTEAGKAYVPPRVLQEALCVISGVPGLKGDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 AHGLLEELKTVLSSHKVLPLEALVTDAGEVTEAGKAYVPPRVLQEALCVISGVPGLKGDV
610 620 630 640 650 660
670 680 690 700 710 720
pF1KA0 TDTEQLAQEMLIISHHPSLVAVQSGLWPALLARMKIDPEAFITRHLDQIIPRMTTQSPLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 TDTEQLAQEMLIISHHPSLVAVQSGLWPALLARMKIDPEAFITRHLDQIIPRMTTQSPLN
670 680 690 700 710 720
730 740 750 760 770 780
pF1KA0 QSSMNAMGSLSVLSPDRVLPQLISTITASVQNPALRLVTREEFAIMQTPAGELYDKSIIQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 QSSMNAMGSLSVLSPDRVLPQLISTITASVQNPALRLVTREEFAIMQTPAGELYDKSIIQ
730 740 750 760 770 780
790 800 810 820 830 840
pF1KA0 SAQQDSIKKANMKRENKAYSFKEQIIELELKEEIKKKKGIKEEVQLTSKQKEMLQAQLDR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SAQQDSIKKANMKRENKAYSFKEQIIELELKEEIKKKKGIKEEVQLTSKQKEMLQAQLDR
790 800 810 820 830 840
850 860 870 880 890 900
pF1KA0 EAQVRRRLQELDGELEAALGLLDIILAKNPSGLTQYIPVLVDSFLPLLKSPLAAPRIKNP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EAQVRRRLQELDGELEAALGLLDIILAKNPSGLTQYIPVLVDSFLPLLKSPLAAPRIKNP
850 860 870 880 890 900
910 920 930 940 950 960
pF1KA0 FLSLAACVMPSRLKALGTLVSHVTLRLLKPECVLDKSWCQEELSVAVKRAVMLLHTHTIT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 FLSLAACVMPSRLKALGTLVSHVTLRLLKPECVLDKSWCQEELSVAVKRAVMLLHTHTIT
910 920 930 940 950 960
970 980 990 1000 1010 1020
pF1KA0 SRVGKGEPGAAPLSAPAFSLVFPFLKMVLTEMPHHSEEEEEWMAQILQILTVQAQLRASP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SRVGKGEPGAAPLSAPAFSLVFPFLKMVLTEMPHHSEEEEEWMAQILQILTVQAQLRASP
970 980 990 1000 1010 1020
1030 1040 1050 1060 1070 1080
pF1KA0 NTPPGRVDENGPELLPRVAMLRLLTWVIGTGSPRLQVLASDTLTTLCASSSGDDGCAFAE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 NTPPGRVDENGPELLPRVAMLRLLTWVIGTGSPRLQVLASDTLTTLCASSSGDDGCAFAE
1030 1040 1050 1060 1070 1080
1090 1100 1110 1120 1130 1140
pF1KA0 QEEVDVLLCALQSPCASVRETVLRGLMELHMVLPAPDTDEKNGLNLLRRLWVVKFDKEEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 QEEVDVLLCALQSPCASVRETVLRGLMELHMVLPAPDTDEKNGLNLLRRLWVVKFDKEEE
1090 1100 1110 1120 1130 1140
1150 1160 1170 1180 1190 1200
pF1KA0 IRKLAERLWSMMGLDLQPDLCSLLIDDVIYHEAAVRQAGAEALSQAVARYQRQAAEVMGR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 IRKLAERLWSMMGLDLQPDLCSLLIDDVIYHEAAVRQAGAEALSQAVARYQRQAAEVMGR
1150 1160 1170 1180 1190 1200
1210 1220 1230 1240 1250 1260
pF1KA0 LMEIYQEKLYRPPPVLDALGRVISESPPDQWEARCGLALALNKLSQYLDSSQVKPLFQFF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LMEIYQEKLYRPPPVLDALGRVISESPPDQWEARCGLALALNKLSQYLDSSQVKPLFQFF
1210 1220 1230 1240 1250 1260
1270 1280 1290 1300 1310 1320
pF1KA0 VPDALNDRHPDVRKCMLDAALATLNTHGKENVNSLLPVFEEFLKNAPNDASYDAVRQSVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 VPDALNDRHPDVRKCMLDAALATLNTHGKENVNSLLPVFEEFLKNAPNDASYDAVRQSVV
1270 1280 1290 1300 1310 1320
1330 1340 1350 1360 1370 1380
pF1KA0 VLMGSLAKHLDKSDPKVKPIVAKLIAALSTPSQQVQESVASCLPPLVPAIKEDAGGMIQR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 VLMGSLAKHLDKSDPKVKPIVAKLIAALSTPSQQVQESVASCLPPLVPAIKEDAGGMIQR
1330 1340 1350 1360 1370 1380
1390 1400 1410 1420 1430 1440
pF1KA0 LMQQLLESDKYAERKGAAYGLAGLVKGLGILSLKQQEMMAALTDAIQDKKNFRRREGALF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LMQQLLESDKYAERKGAAYGLAGLVKGLGILSLKQQEMMAALTDAIQDKKNFRRREGALF
1390 1400 1410 1420 1430 1440
1450 1460 1470 1480 1490 1500
pF1KA0 AFEMLCTMLGKLFEPYVVHVLPHLLLCFGDGNQYVREAADDCAKAVMSNLSAHGVKLVLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 AFEMLCTMLGKLFEPYVVHVLPHLLLCFGDGNQYVREAADDCAKAVMSNLSAHGVKLVLP
1450 1460 1470 1480 1490 1500
1510 1520 1530 1540 1550 1560
pF1KA0 SLLAALEEESWRTKAGSVELLGAMAYCAPKQLSSCLPNIVPKLTEVLTDSHVKVQKAGQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SLLAALEEESWRTKAGSVELLGAMAYCAPKQLSSCLPNIVPKLTEVLTDSHVKVQKAGQQ
1510 1520 1530 1540 1550 1560
1570 1580 1590 1600 1610 1620
pF1KA0 ALRQIGSVIRNPEILAIAPVLLDALTDPSRKTQKCLQTLLDTKFVHFIDAPSLALIMPIV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 ALRQIGSVIRNPEILAIAPVLLDALTDPSRKTQKCLQTLLDTKFVHFIDAPSLALIMPIV
1570 1580 1590 1600 1610 1620
1630 1640 1650 1660 1670 1680
pF1KA0 QRAFQDRSTDTRKMAAQIIGNMYSLTDQKDLAPYLPSVTPGLKASLLDPVPEVRTVSAKA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 QRAFQDRSTDTRKMAAQIIGNMYSLTDQKDLAPYLPSVTPGLKASLLDPVPEVRTVSAKA
1630 1640 1650 1660 1670 1680
1690 1700 1710 1720 1730 1740
pF1KA0 LGAMVKGMGESCFEDLLPWLMETLTYEQSSVDRSGAAQGLAEVMAGLGVEKLEKLMPEIV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LGAMVKGMGESCFEDLLPWLMETLTYEQSSVDRSGAAQGLAEVMAGLGVEKLEKLMPEIV
1690 1700 1710 1720 1730 1740
1750 1760 1770 1780 1790 1800
pF1KA0 ATASKVDIAPHVRDGYIMMFNYLPITFGDKFTPYVGPIIPCILKALADENEFVRDTALRA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 ATASKVDIAPHVRDGYIMMFNYLPITFGDKFTPYVGPIIPCILKALADENEFVRDTALRA
1750 1760 1770 1780 1790 1800
1810 1820 1830 1840 1850 1860
pF1KA0 GQRVISMYAETAIALLLPQLEQGLFDDLWRIRFSSVQLLGDLLFHISGVTGKMTTETASE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 GQRVISMYAETAIALLLPQLEQGLFDDLWRIRFSSVQLLGDLLFHISGVTGKMTTETASE
1810 1820 1830 1840 1850 1860
1870 1880 1890 1900 1910 1920
pF1KA0 DDNFGTAQSNKAIITALGVERRNRVLAGLYMGRSDTQLVVRQASLHVWKIVVSNTPRTLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 DDNFGTAQSNKAIITALGVERRNRVLAGLYMGRSDTQLVVRQASLHVWKIVVSNTPRTLR
1870 1880 1890 1900 1910 1920
1930 1940 1950 1960 1970 1980
pF1KA0 EILPTLFGLLLGFLASTCADKRTIAARTLGDLVRKLGEKILPEIIPILEEGLRSQKSDER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EILPTLFGLLLGFLASTCADKRTIAARTLGDLVRKLGEKILPEIIPILEEGLRSQKSDER
1930 1940 1950 1960 1970 1980
1990 2000 2010 2020 2030 2040
pF1KA0 QGVCIGLSEIMKSTSRDAVLYFSESLVPTARKALCDPLEEVREAAAKTFEQLHSTIGHQA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 QGVCIGLSEIMKSTSRDAVLYFSESLVPTARKALCDPLEEVREAAAKTFEQLHSTIGHQA
1990 2000 2010 2020 2030 2040
2050 2060 2070 2080 2090 2100
pF1KA0 LEDILPFLLKQLDDEEVSEFALDGLKQVMAIKSRVVLPYLVPKLTTPPVNTRVLAFLSSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LEDILPFLLKQLDDEEVSEFALDGLKQVMAIKSRVVLPYLVPKLTTPPVNTRVLAFLSSV
2050 2060 2070 2080 2090 2100
2110 2120 2130 2140 2150 2160
pF1KA0 AGDALTRHLGVILPAVMLALKEKLGTPDEQLEMANCQAVILSVEDDTGHRIIIEDLLEAT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 AGDALTRHLGVILPAVMLALKEKLGTPDEQLEMANCQAVILSVEDDTGHRIIIEDLLEAT
2110 2120 2130 2140 2150 2160
2170 2180 2190 2200 2210 2220
pF1KA0 RSPEVGMRQAAAIILNIYCSRSKADYTSHLRSLVSGLIRLFNDSSPVVLEESWDALNAIT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 RSPEVGMRQAAAIILNIYCSRSKADYTSHLRSLVSGLIRLFNDSSPVVLEESWDALNAIT
2170 2180 2190 2200 2210 2220
2230 2240 2250 2260 2270 2280
pF1KA0 KKLDAGNQLALIEELHKEIRLIGNESKGEHVPGFCLPKKGVTSILPVLREGVLTGSPEQK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 KKLDAGNQLALIEELHKEIRLIGNESKGEHVPGFCLPKKGVTSILPVLREGVLTGSPEQK
2230 2240 2250 2260 2270 2280
2290 2300 2310 2320 2330 2340
pF1KA0 EEAAKALGLVIRLTSADALRPSVVSITGPLIRILGDRFSWNVKAALLETLSLLLAKVGIA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EEAAKALGLVIRLTSADALRPSVVSITGPLIRILGDRFSWNVKAALLETLSLLLAKVGIA
2290 2300 2310 2320 2330 2340
2350 2360 2370 2380 2390 2400
pF1KA0 LKPFLPQLQTTFTKALQDSNRGVRLKAADALGKLISIHIKVDPLFTELLNGIRAMEDPGV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LKPFLPQLQTTFTKALQDSNRGVRLKAADALGKLISIHIKVDPLFTELLNGIRAMEDPGV
2350 2360 2370 2380 2390 2400
2410 2420 2430 2440 2450 2460
pF1KA0 RDTMLQALRFVIQGAGAKVDAVIRKNIVSLLLSMLGHDEDNTRISSAGCLGELCAFLTEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 RDTMLQALRFVIQGAGAKVDAVIRKNIVSLLLSMLGHDEDNTRISSAGCLGELCAFLTEE
2410 2420 2430 2440 2450 2460
2470 2480 2490 2500 2510 2520
pF1KA0 ELSAVLQQCLLADVSGIDWMVRHGRSLALSVAVNVAPGRLCAGRYSSDVQEMILSSATAD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 ELSAVLQQCLLADVSGIDWMVRHGRSLALSVAVNVAPGRLCAGRYSSDVQEMILSSATAD
2470 2480 2490 2500 2510 2520
2530 2540 2550 2560 2570 2580
pF1KA0 RIPIAVSGVRGMGFLMRHHIETGGGQLPAKLSSLFVKCLQNPSSDIRLVAEKMIWWANKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 RIPIAVSGVRGMGFLMRHHIETGGGQLPAKLSSLFVKCLQNPSSDIRLVAEKMIWWANKD
2530 2540 2550 2560 2570 2580
2590 2600 2610 2620 2630 2640
pF1KA0 PLPPLDPQAIKPILKALLDNTKDKNTVVRAYSDQAIVNLLKMRQGEEVFQSLSKILDVAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 PLPPLDPQAIKPILKALLDNTKDKNTVVRAYSDQAIVNLLKMRQGEEVFQSLSKILDVAS
2590 2600 2610 2620 2630 2640
2650 2660 2670
pF1KA0 LEVLNEVNRRSLKKLASQADSTEQVDDTILT
:::::::::::::::::::::::::::::::
CCDS41 LEVLNEVNRRSLKKLASQADSTEQVDDTILT
2650 2660 2670
2671 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 20:46:46 2016 done: Sat Nov 5 20:46:48 2016
Total Scan time: 8.750 Total Display time: 0.360
Function used was FASTA [36.3.4 Apr, 2011]