FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4557, 679 aa
1>>>pF1KE4557 679 - 679 aa - 679 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4210+/-0.00113; mu= 19.4954+/- 0.068
mean_var=75.2533+/-14.991, 0's: 0 Z-trim(102.5): 25 B-trim: 0 in 0/47
Lambda= 0.147847
statistics sampled from 6984 (6987) to 6984 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.561), E-opt: 0.2 (0.215), width: 16
Scan time: 3.480
The best scores are: opt bits E(32554)
CCDS2099.1 SLC20A1 gene_id:6574|Hs108|chr2 ( 679) 4456 960.6 0
CCDS6132.1 SLC20A2 gene_id:6575|Hs108|chr8 ( 652) 1238 274.2 4.1e-73
>>CCDS2099.1 SLC20A1 gene_id:6574|Hs108|chr2 (679 aa)
initn: 4456 init1: 4456 opt: 4456 Z-score: 5135.5 bits: 960.6 E(32554): 0
Smith-Waterman score: 4456; 100.0% identity (100.0% similar) in 679 aa overlap (1-679:1-679)
10 20 30 40 50 60
pF1KE4 MATLITSTTAATAASGPLVDYLWMLILGFIIAFVLAFSVGANDVANSFGTAVGSGVVTLK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 MATLITSTTAATAASGPLVDYLWMLILGFIIAFVLAFSVGANDVANSFGTAVGSGVVTLK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 QACILASIFETVGSVLLGAKVSETIRKGLIDVEMYNSTQGLLMAGSVSAMFGSAVWQLVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 QACILASIFETVGSVLLGAKVSETIRKGLIDVEMYNSTQGLLMAGSVSAMFGSAVWQLVA
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 SFLKLPISGTHCIVGATIGFSLVAKGQEGVKWSELIKIVMSWFVSPLLSGIMSGILFFLV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 SFLKLPISGTHCIVGATIGFSLVAKGQEGVKWSELIKIVMSWFVSPLLSGIMSGILFFLV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 RAFILHKADPVPNGLRALPVFYACTVGINLFSIMYTGAPLLGFDKLPLWGTILISVGCAV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 RAFILHKADPVPNGLRALPVFYACTVGINLFSIMYTGAPLLGFDKLPLWGTILISVGCAV
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 FCALIVWFFVCPRMKRKIEREIKCSPSESPLMEKKNSLKEDHEETKLSVGDIENKHPVSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 FCALIVWFFVCPRMKRKIEREIKCSPSESPLMEKKNSLKEDHEETKLSVGDIENKHPVSE
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 VGPATVPLQAVVEERTVSFKLGDLEEAPERERLPSVDLKEETSIDSTVNGAVQLPNGNLV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 VGPATVPLQAVVEERTVSFKLGDLEEAPERERLPSVDLKEETSIDSTVNGAVQLPNGNLV
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 QFSQAVSNQINSSGHYQYHTVHKDSGLYKELLHKLHLAKVGDCMGDSGDKPLRRNNSYTS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 QFSQAVSNQINSSGHYQYHTVHKDSGLYKELLHKLHLAKVGDCMGDSGDKPLRRNNSYTS
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 YTMAICGMPLDSFRAKEGEQKGEEMEKLTWPNADSKKRIRMDSYTSYCNAVSDLHSASEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 YTMAICGMPLDSFRAKEGEQKGEEMEKLTWPNADSKKRIRMDSYTSYCNAVSDLHSASEI
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE4 DMSVKAEMGLGDRKGSNGSLEEWYDQDKPEVSLLFQFLQILTACFGSFAHGGNDVSNAIG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 DMSVKAEMGLGDRKGSNGSLEEWYDQDKPEVSLLFQFLQILTACFGSFAHGGNDVSNAIG
490 500 510 520 530 540
550 560 570 580 590 600
pF1KE4 PLVALYLVYDTGDVSSKVATPIWLLLYGGVGICVGLWVWGRRVIQTMGKDLTPITPSSGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 PLVALYLVYDTGDVSSKVATPIWLLLYGGVGICVGLWVWGRRVIQTMGKDLTPITPSSGF
550 560 570 580 590 600
610 620 630 640 650 660
pF1KE4 SIELASALTVVIASNIGLPISTTHCKVGSVVSVGWLRSKKAVDWRLFRNIFMAWFVTVPI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 SIELASALTVVIASNIGLPISTTHCKVGSVVSVGWLRSKKAVDWRLFRNIFMAWFVTVPI
610 620 630 640 650 660
670
pF1KE4 SGVISAAIMAIFRYVILRM
:::::::::::::::::::
CCDS20 SGVISAAIMAIFRYVILRM
670
>>CCDS6132.1 SLC20A2 gene_id:6575|Hs108|chr8 (652 aa)
initn: 2430 init1: 1096 opt: 1238 Z-score: 1426.1 bits: 274.2 E(32554): 4.1e-73
Smith-Waterman score: 2521; 60.7% identity (80.1% similar) in 672 aa overlap (20-677:5-649)
10 20 30 40 50 60
pF1KE4 MATLITSTTAATAASGPLVDYLWMLILGFIIAFVLAFSVGANDVANSFGTAVGSGVVTLK
.::::.::::::::.:::::::::::::::::::::::::.
CCDS61 MAMDEYLWMVILGFIIAFILAFSVGANDVANSFGTAVGSGVVTLR
10 20 30 40
70 80 90 100 110 120
pF1KE4 QACILASIFETVGSVLLGAKVSETIRKGLIDVEMYNSTQGLLMAGSVSAMFGSAVWQLVA
:::::::::::.:::::::::.::::::.:::..:: : :::: :::: :::::::.:
CCDS61 QACILASIFETTGSVLLGAKVGETIRKGIIDVNLYNETVETLMAGEVSAMVGSAVWQLIA
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE4 SFLKLPISGTHCIVGATIGFSLVAKGQEGVKWSELIKIVMSWFVSPLLSGIMSGILFFLV
:::.:::::::::::.:::::::: : .::.: ::.::: :::.::::::.:::.:: :.
CCDS61 SFLRLPISGTHCIVGSTIGFSLVAIGTKGVQWMELVKIVASWFISPLLSGFMSGLLFVLI
110 120 130 140 150 160
190 200 210 220 230 240
pF1KE4 RAFILHKADPVPNGLRALPVFYACTVGINLFSIMYTGAPLLGFDKLPLWGTILISVGCAV
: :::.: ::::::::::::::: :..::.:::::::::.::. ::.:. ::: : :.
CCDS61 RIFILKKEDPVPNGLRALPVFYAATIAINVFSIMYTGAPVLGL-VLPMWAIALISFGVAL
170 180 190 200 210 220
250 260 270 280 290 300
pF1KE4 FCALIVWFFVCPRMKRKIEREIKCSPSESPLMEKKNSLKEDHEETKLSVGDIENKHPVSE
. :..::.:::: :.::: . ..:...:.. .:. .: . :. :: .
CCDS61 LFAFFVWLFVCPWMRRKITGK----------LQKEGALSRVSDESLSKVQEAES--PVFK
230 240 250 260 270
310 320 330 340 350
pF1KE4 VGP-------ATVPLQAVVEERTVSFKLGDLEEAPERERLPSVDLKEETSIDSTVNGAVQ
: .:.:: ... : :: :. : . . :. ..:.:.
CCDS61 ELPGAKANDDSTIPLTGAAGET-----LGT-SEGTSAGSHPRAAYGRALSM---THGSVK
280 290 300 310 320
360 370 380 390 400 410
pF1KE4 LPNGNLVQFSQAVSNQINSSGHYQYHTVHKDSGLYKELLHKLHLAKVGDC--MGDSGDKP
: .: . :. ... :.:: ::::::::::::.::::.:. . . .:. .
CCDS61 SPISNGT-FG--FDGHTRSDGHV-YHTVHKDSGLYKDLLHKIHIDRGPEEKPAQESNYRL
330 340 350 360 370
420 430 440 450 460
pF1KE4 LRRNNSYTSYTMAICGMPLD-SFRAKEGEQKGEEMEKLTWPNAD-SKKRIRMDSYTSYCN
:::::::: :: ::::.:. .::: .. . :. :::. ... ::::.:.:::.::::
CCDS61 LRRNNSYTCYTAAICGLPVHATFRAADS-SAPEDSEKLVGDTVSYSKKRLRYDSYSSYCN
380 390 400 410 420 430
470 480 490 500 510 520
pF1KE4 AVSDLHSASE---IDMSVKAEMGLGDRKGSNGSLEEWYDQDKPEVSLLFQFLQILTACFG
::.. . .: ..:.. .:.. :. . . :: ..: ::: :::.:::.::::::
CCDS61 AVAEAEIEAEEGGVEMKLASELADPDQPREDPAEEEKEEKDAPEVHLLFHFLQVLTACFG
440 450 460 470 480 490
530 540 550 560 570 580
pF1KE4 SFAHGGNDVSNAIGPLVALYLVYDTGDVSSKVATPIWLLLYGGVGICVGLWVWGRRVIQT
:::::::::::::::::::.:.: : :....:::.:::.:::::::.::::::::::::
CCDS61 SFAHGGNDVSNAIGPLVALWLIYKQGGVTQEAATPVWLLFYGGVGICTGLWVWGRRVIQT
500 510 520 530 540 550
590 600 610 620 630 640
pF1KE4 MGKDLTPITPSSGFSIELASALTVVIASNIGLPISTTHCKVGSVVSVGWLRSKKAVDWRL
::::::::::::::.::::::.:::::::::::.:::::::::::.:::.::.:::::::
CCDS61 MGKDLTPITPSSGFTIELASAFTVVIASNIGLPVSTTHCKVGSVVAVGWIRSRKAVDWRL
560 570 580 590 600 610
650 660 670
pF1KE4 FRNIFMAWFVTVPISGVISAAIMAIFRYVILRM
:::::.:::::::..:..:::.::.. : ::
CCDS61 FRNIFVAWFVTVPVAGLFSAAVMALLMYGILPYV
620 630 640 650
679 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 23:57:34 2016 done: Sat Nov 5 23:57:34 2016
Total Scan time: 3.480 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]