FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7651, 330 aa
1>>>pF1KB7651 330 - 330 aa - 330 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.0489+/-0.000959; mu= 8.4218+/- 0.058
mean_var=358.3709+/-75.318, 0's: 0 Z-trim(115.7): 147 B-trim: 141 in 1/53
Lambda= 0.067750
statistics sampled from 16139 (16300) to 16139 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.795), E-opt: 0.2 (0.501), width: 16
Scan time: 2.770
The best scores are:                                       opt  bits E(32554)
CCDS7510.1 TLX1 gene_id:3195|Hs108|chr10           ( 330) 2285 236.5 2.2e-62
CCDS55725.1 TLX1 gene_id:3195|Hs108|chr10          ( 257) 1812 190.1 1.6e-48
CCDS1947.1 TLX2 gene_id:3196|Hs108|chr2            ( 284)  866  97.7 1.2e-20
CCDS34288.1 TLX3 gene_id:30012|Hs108|chr5          ( 291)  852  96.4   3e-20
>>CCDS7510.1 TLX1 gene_id:3195|Hs108|chr10 (330 aa)
initn: 2285 init1: 2285 opt: 2285 Z-score: 1233.3 bits: 236.5 E(32554): 2.2e-62
Smith-Waterman score: 2285; 100.0% identity (100.0% similar) in 330 aa overlap (1-330:1-330)
               10        20        30        40        50        60
pF1KB7 MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYGGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYGGG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 GSAAATGAGGAGAYGTGGPGGPGGPAGGGGACSMGPLTGSYNVNMALAGGPGPGGGGGSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 GSAAATGAGGAGAYGTGGPGGPGGPAGGGGACSMGPLTGSYNVNMALAGGPGPGGGGGSS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 GGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWMESN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 GGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWMESN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 RRYTKDRFTGHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQKYLASAERAALAKALKM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 RRYTKDRFTGHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQKYLASAERAALAKALKM
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 TDAQVKTWFQNRRTKWRRQTAEEREAERQQANRILLQLQQEAFQKSLAQPLPADPLCVHN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 TDAQVKTWFQNRRTKWRRQTAEEREAERQQANRILLQLQQEAFQKSLAQPLPADPLCVHN
              250       260       270       280       290       300

              310       320       330
pF1KB7 SSLFALQNLQPWSDDSTKITSVTSVASACE
       ::::::::::::::::::::::::::::::
CCDS75 SSLFALQNLQPWSDDSTKITSVTSVASACE
              310       320       330

>>CCDS55725.1 TLX1 gene_id:3195|Hs108|chr10 (257 aa)
initn: 1812 init1: 1812 opt: 1812 Z-score: 984.5 bits: 190.1 E(32554): 1.6e-48
Smith-Waterman score: 1812; 100.0% identity (100.0% similar) in 257 aa overlap (1-257:1-257)
               10        20        30        40        50        60
pF1KB7 MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYGGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYGGG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 GSAAATGAGGAGAYGTGGPGGPGGPAGGGGACSMGPLTGSYNVNMALAGGPGPGGGGGSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 GSAAATGAGGAGAYGTGGPGGPGGPAGGGGACSMGPLTGSYNVNMALAGGPGPGGGGGSS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 GGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWMESN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 GGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWMESN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 RRYTKDRFTGHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQKYLASAERAALAKALKM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 RRYTKDRFTGHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQKYLASAERAALAKALKM
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 TDAQVKTWFQNRRTKWRRQTAEEREAERQQANRILLQLQQEAFQKSLAQPLPADPLCVHN
       :::::::::::::::::
CCDS55 TDAQVKTWFQNRRTKWR
              250

>>CCDS1947.1 TLX2 gene_id:3196|Hs108|chr2 (284 aa)
initn: 1021 init1: 715 opt: 866 Z-score: 484.4 bits: 97.7 E(32554): 1.2e-20
Smith-Waterman score: 1043; 55.5% identity (68.8% similar) in 337 aa overlap (4-327:6-282)
10 20 30 40 50
pF1KB7 MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYG
::::.: : : :::::::::::..:. : : ::: :
CCDS19 MEPGMLGPHNL-PHH-EPISFGIDQILSGPETPG----------GGLGLG--------RG
10 20 30 40
60 70 80 90 100 110
pF1KB7 GGGSAAATGAGGAGAYGTGGPGGPG-GPAGGGGACSMGPLTGSYNVNMALAGGPGPGGGG
: : : ::.. : :. : :::: :..:: :: .: ::::
CCDS19 GQGH------GENGAFSGGYHGASGYGPAG-----SLAPLPGS--------SGVGPGG--
50 60 70
120 130 140 150 160 170
pF1KB7 GSSGGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWM
:::::::::: : : : : :.::. .. :...:.:::::::
CCDS19 -------------VIRVPAHRPLP----VPPP-AGGAPAVPGPSGLGGAGGLAGLTFPWM
80 90 100 110 120
180 190 200 210 220
pF1KB7 ESNRRYTKDRFT------------GHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQKY
.:.::..:::.: :::::::::::.:::::::.: :. :::.:: ::::
CCDS19 DSGRRFAKDRLTAALSPFSGTRRIGHPYQNRTPPKRKKPRTSFSRSQVLELERRFLRQKY
130 140 150 160 170 180
230 240 250 260 270 280
pF1KB7 LASAERAALAKALKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQANRILLQLQQEAFQK
:::::::::::::.:::::::::::::::::::::::::::::..:.:.::.:::.:. .
CCDS19 LASAERAALAKALRMTDAQVKTWFQNRRTKWRRQTAEEREAERHRAGRLLLHLQQDALPR
190 200 210 220 230 240
290 300 310 320 330
pF1KB7 SLAQPLPADPLCVHNSSLFALQNLQPWSDDSTKITSVTSVASACE
: ::: ::::.::::::::::::::..:. :..::...::
CCDS19 PLRPPLPPDPLCLHNSSLFALQNLQPWAEDN-KVASVSGLASVV
250 260 270 280
>>CCDS34288.1 TLX3 gene_id:30012|Hs108|chr5 (291 aa)
initn: 1111 init1: 792 opt: 852 Z-score: 476.9 bits: 96.4 E(32554): 3e-20
Smith-Waterman score: 1127; 59.6% identity (71.7% similar) in 332 aa overlap (10-325:11-290)
10 20 30 40 50
pF1KB7 MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYGG
:: : :::::::::::::::: . .:: : :: ::
CCDS34 MEAPASAQTPHP-H-EPISFGIDQILNSPDQDS--APAPRGPDGASYLG-----------
10 20 30 40
60 70 80 90 100 110
pF1KB7 GGSAAATGAGGAGAYGTGGPGGPGGPAGGGGACSMGPL--TGSYNVNMALAGGPGPGGGG
: : : :. :. :.. :: :. :. .:::.::..:: :
CCDS34 -------GPPG-GRPGATYPSLPASFAGLGA-----PFEDAGSYSVNLSLA--P------
50 60 70 80
120 130 140 150 160 170
pF1KB7 GSSGGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWM
:::::::::::: ::: : :: ..::..::::. :..: ::.::::
CCDS34 -----------AGVIRVPAHRPLPGAV--PPPLPSALPAMPSVPT---VSSLGGLNFPWM
90 100 110 120
180 190 200 210 220
pF1KB7 ESNRRYTKDRFT--------------GHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQ
::.::..::::: :::::::::::.:::::::.:.::::::::::::
CCDS34 ESSRRFVKDRFTAAAALTPFTVTRRIGHPYQNRTPPKRKKPRTSFSRVQICELEKRFHRQ
130 140 150 160 170 180
230 240 250 260 270 280
pF1KB7 KYLASAERAALAKALKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQANRILLQLQQEAF
:::::::::::::.::::::::::::::::::::::::::::::::::.:..::::..::
CCDS34 KYLASAERAALAKSLKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQASRLMLQLQHDAF
190 200 210 220 230 240
290 300 310 320 330
pF1KB7 QKSLAQPLPADPLCVHNSSLFALQNLQPWSDDSTKITSVTSVASACE
:::: . . ::::.:::::::::::::: .::.:. .:::.
CCDS34 QKSLNDSIQPDPLCLHNSSLFALQNLQPWEEDSSKVPAVTSLV
250 260 270 280 290
330 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 21:25:15 2016 done: Fri Nov 4 21:25:16 2016
Total Scan time: 2.770 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]