FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8403, 715 aa
1>>>pF1KB8403 715 - 715 aa - 715 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.5560+/-0.00103; mu= 14.1076+/- 0.062
mean_var=90.9061+/-17.568, 0's: 0 Z-trim(105.2): 13 B-trim: 0 in 0/51
Lambda= 0.134517
statistics sampled from 8297 (8304) to 8297 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.627), E-opt: 0.2 (0.255), width: 16
Scan time: 3.000
The best scores are: opt bits E(32554)
CCDS11954.2 POLI gene_id:11201|Hs108|chr18 ( 740) 4692 921.2 0
CCDS4030.1 POLK gene_id:51426|Hs108|chr5 ( 870) 317 72.2 3.7e-12
>>CCDS11954.2 POLI gene_id:11201|Hs108|chr18 (740 aa)
initn: 4692 init1: 4692 opt: 4692 Z-score: 4921.5 bits: 921.2 E(32554): 0
Smith-Waterman score: 4692; 99.6% identity (99.9% similar) in 715 aa overlap (1-715:26-740)
10 20 30
pF1KB8 MELADVGAAASSQGVHDQVLPTPNASSRVIVHVDL
:::::::::::::::::::::::::::::::::::
CCDS11 MEKLGVEPEEEGGGDDDEEDAEAWAMELADVGAAASSQGVHDQVLPTPNASSRVIVHVDL
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB8 DCFYAQVEMISNPELKDKPLGVQQKYLVVTCNYEARKLGVKKLMNVRDAKEKCPQLVLVN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DCFYAQVEMISNPELKDKPLGVQQKYLVVTCNYEARKLGVKKLMNVRDAKEKCPQLVLVN
70 80 90 100 110 120
100 110 120 130 140 150
pF1KB8 GEDLTRYREMSYKVTELLEEFSPVVERLGFDENFVDLTEMVEKRLQQLQSDELSAVTVSG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GEDLTRYREMSYKVTELLEEFSPVVERLGFDENFVDLTEMVEKRLQQLQSDELSAVTVSG
130 140 150 160 170 180
160 170 180 190 200 210
pF1KB8 HVYNNQSINLLDVLHIRLLVGSQIAAEMREAMYNQLGLTGCAGVASNKLLAKLVSGVFKP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 HVYNNQSINLLDVLHIRLLVGSQIAAEMREAMYNQLGLTGCAGVASNKLLAKLVSGVFKP
190 200 210 220 230 240
220 230 240 250 260 270
pF1KB8 NQQTVLLPESCQHLIHSLNHIKEIPGIGYKTAKCLEALGINSVRDLQTFSPKILEKELGI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 NQQTVLLPESCQHLIHSLNHIKEIPGIGYKTAKCLEALGINSVRDLQTFSPKILEKELGI
250 260 270 280 290 300
280 290 300 310 320 330
pF1KB8 SVAQRIQKLSFGEDNSPVILSGPPQSFSEEDSFKKCTSEVEAKNKIEELLASLLNRVCQD
::::::::::::::::::::::::::::::::::::.:::::::::::::::::::::::
CCDS11 SVAQRIQKLSFGEDNSPVILSGPPQSFSEEDSFKKCSSEVEAKNKIEELLASLLNRVCQD
310 320 330 340 350 360
340 350 360 370 380 390
pF1KB8 GRKPHTVRLIIRRYSSEKHYGRESRQCPIPSHVIQKLGTGNYDVMTPMVDILMKLFRNMV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GRKPHTVRLIIRRYSSEKHYGRESRQCPIPSHVIQKLGTGNYDVMTPMVDILMKLFRNMV
370 380 390 400 410 420
400 410 420 430 440 450
pF1KB8 NVKMPFHLTLLSVCFCNLKALNTAKKGLIDYYLMPSLSTTSRSGKHSFKMKDTHMEDFPK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 NVKMPFHLTLLSVCFCNLKALNTAKKGLIDYYLMPSLSTTSRSGKHSFKMKDTHMEDFPK
430 440 450 460 470 480
460 470 480 490 500 510
pF1KB8 DKETNRDFLPSGRIESTRTRESPLDTTNFSKEKDINEFPLCSLPEGVDQEVSKQLPVDIQ
::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::::
CCDS11 DKETNRDFLPSGRIESTRTRESPLDTTNFSKEKDINEFPLCSLPEGVDQEVFKQLPVDIQ
490 500 510 520 530 540
520 530 540 550 560 570
pF1KB8 EEILSGKSREKFQGKGSVSCPLHASRGVLSFFSKKQMQDIPINPRDHLSSSKQVSSVSPC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 EEILSGKSREKFQGKGSVSCPLHASRGVLSFFSKKQMQDIPINPRDHLSSSKQVSSVSPC
550 560 570 580 590 600
580 590 600 610 620 630
pF1KB8 EPGTSGFNSSSSSYMSSQKDYSYYLDNRLKDERISQGPKEPQGFHFTNSNPAVSAFHSFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 EPGTSGFNSSSSSYMSSQKDYSYYLDNRLKDERISQGPKEPQGFHFTNSNPAVSAFHSFP
610 620 630 640 650 660
640 650 660 670 680 690
pF1KB8 NLQSEQLFSRNHTTDSHKQTVATDSHEGLTENREPDSVDEKITFPSDIDPQVFYELPEAV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 NLQSEQLFSRNHTTDSHKQTVATDSHEGLTENREPDSVDEKITFPSDIDPQVFYELPEAV
670 680 690 700 710 720
700 710
pF1KB8 QKELLAEWKRTGSDFHIGHK
::::::::::.:::::::::
CCDS11 QKELLAEWKRAGSDFHIGHK
730 740
>>CCDS4030.1 POLK gene_id:51426|Hs108|chr5 (870 aa)
initn: 358 init1: 237 opt: 317 Z-score: 331.8 bits: 72.2 E(32554): 3.7e-12
Smith-Waterman score: 436; 23.8% identity (53.0% similar) in 677 aa overlap (27-636:100-753)
10 20 30 40 50
pF1KB8 MELADVGAAASSQGVHDQVLPTPNASSRVIVHVDLDCFYAQVEMISNPELKDKPLG
: .:::.:.: ::: ::: .::::::::..
CCDS40 QKAQITSQQLRKAQLQVDRFAMELEQSRNLSNTIVHIDMDAFYAAVEMRDNPELKDKPIA
70 80 90 100 110 120
60 70 80 90 100 110
pF1KB8 VQQKYLVVTCNYEARKLGVKKLMNVRDAKEKCPQLVLVNGEDLTRYREMSYKVTELLEEF
: . .. : ::.::..::. : ::. ::::..: .. .:: .: .: :.: ..
CCDS40 VGSMSMLSTSNYHARRFGVRAAMPGFIAKRLCPQLIIVP-PNFDKYRAVSKEVKEILADY
130 140 150 160 170 180
120 130 140 150 160
pF1KB8 SPVVERLGFDENFVDLTEMVEKRLQQLQSDELSAVTVSGHVYNN---QSINLLD------
.: ...:: ....:. .:.: . .. . . ... : :. . .: :.
CCDS40 DPNFMAMSLDEAYLNITKHLEERQNWPEDKRRYFIKMGSSVENDNPGKEVNKLSEHERSI
190 200 210 220 230 240
170 180 190
pF1KB8 ------------------------------VLHIRLLVGS---QIAAEMREAMYNQLGLT
.:. .. :. ... :.: . .. ::
CCDS40 SPLLFEESPSDVQPPGDPFQVNFEEQNNPQILQNSVVFGTSAQEVVKEIRFRIEQKTTLT
250 260 270 280 290 300
200 210 220 230 240 250
pF1KB8 GCAGVASNKLLAKLVSGVFKPNQQTVLLP--ESCQHLIHSLNHIKEIPGIGYKTAKCLEA
. ::.: : .:::. : ::: : .:: .. . .:..: :... ::: : : :.:
CCDS40 ASAGIAPNTMLAKVCSDKNKPNGQYQILPNRQAVMDFIKDLP-IRKVSGIGKVTEKMLKA
310 320 330 340 350 360
260 270 280 290 300 310
pF1KB8 LGINSVRDLQTFSPKILEKELGISVA-QRIQKLSFGEDNSPVILSGPPQSFSEEDSFKKC
::: . .: .. . : . : .. . . ..:.: .. . .: .:.: : .:..
CCDS40 LGIITCTEL--YQQRALLSLLFSETSWHYFLHISLGLGSTHLTRDGERKSMSVERTFSEI
370 380 390 400 410 420
320 330 340 350 360
pF1KB8 TSEVEAKNKIEELLASLLNRVCQDGRKPHTVRLIIRRYSSEKHYGRESRQCPIPSH----
.. : . .:: . : . . .. : .:: . .. . : . : : . :
CCDS40 NKAEEQYSLCQELCSELAQDLQKERLKGRTVTIKLKNVNFEVKT-RASTVSSVVSTAEEI
430 440 450 460 470 480
370 380 390 400 410
pF1KB8 --VIQKLGTGNYDVMTP------MVDILMKLFRNMVNVKMPFHLTLLSVCFCNLKALN--
. ..: . :. : .. . .. : : . : . .... . .::.
CCDS40 FAIAKELLKTEIDADFPHPLRLRLMGVRISSFPNEEDRKHQ-QRSIIGFLQAGNQALSAT
490 500 510 520 530 540
420 430 440 450 460 470
pF1KB8 --TAKKGLIDYYLMPSLSTTSRS--GKHSFKMKDTHMEDFPKDKETNRDFLPSGRIESTR
: .: : .. : . ..: :. . : .:.. : . ....: : .. .
CCDS40 ECTLEKTDKDKFVKPLEMSHKKSFFDKKRSERKWSHQDTFKCEAVNKQSFQTSQPFQVLK
550 560 570 580 590 600
480 490 500 510 520
pF1KB8 TRESP-LDTTNFSKEKDINEFPLCSLPEGVDQEVSKQLPVDIQEEILSGKS-REKFQ--G
. . :. .. : . .: :.: .: . . . :: : :.: : :.:. .
CCDS40 KKMNENLEISENSDDCQILTCPVCFRAQGCISLEALNKHVD---ECLDGPSISENFKMFS
610 620 630 640 650 660
530 540 550 560 570 580
pF1KB8 KGSVSCPLHASRGVLSFFSKKQMQDIPINPRDHLSSSKQVSSVSPCEPGTSGFNSSSSSY
. :: .. . : . :: .:. :..:::. : .. ...::..
CCDS40 CSHVSATKVNKKENVPASSLCEKQDYEAHPK-----IKEISSVD-CIALVDTIDNSSKA-
670 680 690 700 710
590 600 610 620 630 640
pF1KB8 MSSQKDYSYYLDNRLKDERISQGPKEPQGFHFTNSNPAVSAFHSFPNLQSEQLFSRNHTT
. :.:. . :. :. :. ..:.. . . :. :. :
CCDS40 -----ESIDALSNKHSKEECSSLPS--KSFNIEHCHQNSSSTVSLENEDVGSFRQEYRQP
720 730 740 750 760
650 660 670 680 690 700
pF1KB8 DSHKQTVATDSHEGLTENREPDSVDEKITFPSDIDPQVFYELPEAVQKELLAEWKRTGSD
CCDS40 YLCEVKTGQALVCPVCNVEQKTSDLTLFNVHVDVCLNKSFIQELRKDKFNPVNQPKESSR
770 780 790 800 810 820
715 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 12:38:04 2016 done: Fri Nov 4 12:38:04 2016
Total Scan time: 3.000 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]