FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9719, 402 aa
1>>>pF1KB9719 402 - 402 aa - 402 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.8119+/-0.00101; mu= 5.5047+/- 0.062
mean_var=349.7526+/-72.264, 0's: 0 Z-trim(115.5): 113 B-trim: 0 in 0/52
Lambda= 0.068579
statistics sampled from 15905 (16020) to 15905 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.802), E-opt: 0.2 (0.492), width: 16
Scan time: 2.920
The best scores are:                               opt  bits E(32554)
CCDS6995.1 LHX3 gene_id:8022|Hs108|chr9    ( 402) 2794 289.8   3e-78
CCDS6994.1 LHX3 gene_id:8022|Hs108|chr9    ( 397) 2606 271.2 1.2e-72
CCDS1338.1 LHX4 gene_id:89884|Hs108|chr1   ( 390) 1718 183.3 3.3e-46
CCDS9171.1 LHX5 gene_id:64211|Hs108|chr12  ( 402)  556  68.4 1.4e-11
CCDS11316.1 LHX1 gene_id:3975|Hs108|chr17  ( 406)  535  66.3 5.8e-11
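
The rows of the best-scores table above can be read into structured records. The following is a minimal sketch in Python, not part of the FASTA distribution. The names Hit, HIT_RE and parse_hit are hypothetical, and the pattern is inferred only from the five rows shown in this report, so other FASTA output options may need a different pattern.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the 'The best scores are:' table (hypothetical record type)."""
    accession: str
    description: str
    length: int      # library sequence length, the number in parentheses
    opt: int         # optimized score
    bits: float      # bit score
    evalue: float    # E() for the library size given in the table header


# Assumed layout: accession, free-text description, "( length)", opt, bits, E-value.
# Whitespace between fields may be one or many spaces.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<e>[\d.eE+-]+)\s*$"
)


def parse_hit(line: str) -> Hit:
    """Parse a single best-scores row; raise ValueError if it does not match."""
    m = HIT_RE.match(line)
    if m is None:
        raise ValueError(f"not a best-scores row: {line!r}")
    return Hit(
        accession=m["acc"],
        description=m["desc"],
        length=int(m["len"]),
        opt=int(m["opt"]),
        bits=float(m["bits"]),
        evalue=float(m["e"]),
    )


if __name__ == "__main__":
    row = "CCDS6995.1 LHX3 gene_id:8022|Hs108|chr9 ( 402) 2794 289.8 3e-78"
    print(parse_hit(row))
```

Because the pattern tolerates any run of spaces between fields, it should accept rows with either aligned or collapsed column spacing.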
>>CCDS6995.1 LHX3 gene_id:8022|Hs108|chr9 (402 aa)
initn: 2794 init1: 2794 opt: 2794 Z-score: 1518.3 bits: 289.8 E(32554): 3e-78
Smith-Waterman score: 2794; 100.0% identity (100.0% similar) in 402 aa overlap (1-402:1-402)
10 20 30 40 50 60
pF1KB9 MEARGELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 MEARGELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKC
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 LKCSDCHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 LKCSDCHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHC
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 FACVVCKRQLATGDEFYLMEDSRLVCKADYETAKQREAEATAKRPRTTITAKQLETLKSA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 FACVVCKRQLATGDEFYLMEDSRLVCKADYETAKQREAEATAKRPRTTITAKQLETLKSA
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 YNTSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRQRWGQYFRNMKRSR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 YNTSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRQRWGQYFRNMKRSR
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 GGSKSDKDSVQEGQDSDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGALGNFSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 GGSKSDKDSVQEGQDSDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGALGNFSL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB9 EHGGLAGPEQYRELRPGSPYGVPPSPAAPQSLPGPQPLLSSLVYPDTSLGLVPSGAPGGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 EHGGLAGPEQYRELRPGSPYGVPPSPAAPQSLPGPQPLLSSLVYPDTSLGLVPSGAPGGP
310 320 330 340 350 360
370 380 390 400
pF1KB9 PPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF
::::::::::::::::::::::::::::::::::::::::::
CCDS69 PPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF
370 380 390 400
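
The "Smith-Waterman score" summary line and the aligned rows of a record such as the one above can be cross-checked with a short script. This is a sketch under stated assumptions: parse_sw_line and percent_identity are hypothetical helper names, the pattern is inferred from the summary lines in this report, and identity is counted here as identical residues divided by the number of aligned columns, which may differ from how FASTA itself counts gapped positions.

```python
import re

# Pattern inferred from the summary lines in this report (an assumption,
# not taken from a FASTA format specification).
SW_RE = re.compile(
    r"Smith-Waterman score:\s*(?P<score>\d+);\s*"
    r"(?P<identity>[\d.]+)% identity\s*\((?P<similar>[\d.]+)% similar\)\s*"
    r"in\s+(?P<overlap>\d+) aa overlap\s*"
    r"\((?P<q_start>\d+)-(?P<q_end>\d+):(?P<l_start>\d+)-(?P<l_end>\d+)\)"
)


def parse_sw_line(line: str) -> dict:
    """Extract score, percentages, overlap length and coordinate ranges."""
    m = SW_RE.search(line)
    if m is None:
        raise ValueError(f"not a Smith-Waterman summary line: {line!r}")
    fields = m.groupdict()
    result = {k: int(v) for k, v in fields.items()
              if k not in ("identity", "similar")}
    result["identity"] = float(fields["identity"])
    result["similar"] = float(fields["similar"])
    return result


def percent_identity(query_row: str, library_row: str) -> float:
    """Percent of aligned columns holding the same residue in both rows.

    Both rows must be equal-length alignment strings; '-' marks a gap
    and is never counted as a match.
    """
    if len(query_row) != len(library_row) or not query_row:
        raise ValueError("rows must be non-empty and of equal length")
    same = sum(1 for q, s in zip(query_row, library_row) if q == s and q != "-")
    return 100.0 * same / len(query_row)


if __name__ == "__main__":
    summary = ("Smith-Waterman score: 2794; 100.0% identity (100.0% similar) "
               "in 402 aa overlap (1-402:1-402)")
    print(parse_sw_line(summary))
    block = "MEARGELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKC"
    print(percent_identity(block, block))
```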
>>CCDS6994.1 LHX3 gene_id:8022|Hs108|chr9 (397 aa)
initn: 2606 init1: 2606 opt: 2606 Z-score: 1417.8 bits: 271.2 E(32554): 1.2e-72
Smith-Waterman score: 2606; 100.0% identity (100.0% similar) in 372 aa overlap (31-402:26-397)
10 20 30 40 50 60
pF1KB9 MEARGELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKC
::::::::::::::::::::::::::::::
CCDS69 MLLETGLERDRARPGAAAVCTLGGTREIPLCAGCDQHILDRFILKALDRHWHSKC
10 20 30 40 50
70 80 90 100 110 120
pF1KB9 LKCSDCHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 LKCSDCHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHC
60 70 80 90 100 110
130 140 150 160 170 180
pF1KB9 FACVVCKRQLATGDEFYLMEDSRLVCKADYETAKQREAEATAKRPRTTITAKQLETLKSA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 FACVVCKRQLATGDEFYLMEDSRLVCKADYETAKQREAEATAKRPRTTITAKQLETLKSA
120 130 140 150 160 170
190 200 210 220 230 240
pF1KB9 YNTSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRQRWGQYFRNMKRSR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 YNTSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRQRWGQYFRNMKRSR
180 190 200 210 220 230
250 260 270 280 290 300
pF1KB9 GGSKSDKDSVQEGQDSDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGALGNFSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 GGSKSDKDSVQEGQDSDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGALGNFSL
240 250 260 270 280 290
310 320 330 340 350 360
pF1KB9 EHGGLAGPEQYRELRPGSPYGVPPSPAAPQSLPGPQPLLSSLVYPDTSLGLVPSGAPGGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 EHGGLAGPEQYRELRPGSPYGVPPSPAAPQSLPGPQPLLSSLVYPDTSLGLVPSGAPGGP
300 310 320 330 340 350
370 380 390 400
pF1KB9 PPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF
::::::::::::::::::::::::::::::::::::::::::
CCDS69 PPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF
360 370 380 390
>>CCDS1338.1 LHX4 gene_id:89884|Hs108|chr1 (390 aa)
initn: 1166 init1: 790 opt: 1718 Z-score: 943.1 bits: 183.3 E(32554): 3.3e-46
Smith-Waterman score: 1718; 66.3% identity (85.4% similar) in 377 aa overlap (31-402:25-390)
10 20 30 40 50 60
pF1KB9 MEARGELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKC
..:: ::::.:::::.::::.:::::::.:
CCDS13 MMQSATVPAEGAVKGLPEMLGVPMQQIPQCAGCNQHILDKFILKVLDRHWHSSC
10 20 30 40 50
70 80 90 100 110 120
pF1KB9 LKCSDCHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHC
:::.::. ::.:::::. :::::.::::::::::.::: :::::::::.::::::::::
CCDS13 LKCADCQMQLADRCFSRAGSVYCKEDFFKRFGTKCTACQQGIPPTQVVRKAQDFVYHLHC
60 70 80 90 100 110
130 140 150 160 170
pF1KB9 FACVVCKRQLATGDEFYLMEDSRLVCKADYETAKQRE-AEATAKRPRTTITAKQLETLKS
:::..:.::::::::::::::.::::: ::::::: . .:: :::::::::::::::::.
CCDS13 FACIICNRQLATGDEFYLMEDGRLVCKEDYETAKQNDDSEAGAKRPRTTITAKQLETLKN
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB9 AYNTSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRQRWGQYFRNMKRS
::..:::::::::::::::::::::::::::::::::::::::::::.::::.....:::
CCDS13 AYKNSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRHRWGQFYKSVKRS
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB9 RGGSKSDKDSVQEGQD-SDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGAL--G
::.::..:.: : ::.:.:: .. :.:.: .: .::..:. : .: : :
CCDS13 RGSSKQEKESSAEDCGVSDSELSFREDQILSELGHTNRIYGNVGDVT------GGQLMNG
240 250 260 270 280
300 310 320 330 340 350
pF1KB9 NFSLEHGGLAGPEQYRELRPGSPYGVPPSPAAPQSLPGPQPLLSSLVYP-DTSLGLVPSG
.::.. : ..:..:: :::::.: ::.. .:::. :::..: : :..::.. .
CCDS13 SFSMDGTG----QSYQDLRDGSPYGIPQSPSSISSLPSHAPLLNGLDYTVDSNLGIIAHA
290 300 310 320 330 340
360 370 380 390 400
pF1KB9 APGGPPPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF
. : .:..:: ::.::.::::: ::::::.::.:::::.:: :
CCDS13 GQGVSQTLRAMAG-GPTSDISTGSSVGYPDFPTSPGSWLDEMDHPPF
350 360 370 380 390
>>CCDS9171.1 LHX5 gene_id:64211|Hs108|chr12 (402 aa)
initn: 934 init1: 549 opt: 556 Z-score: 321.6 bits: 68.4 E(32554): 1.4e-11
Smith-Waterman score: 862; 38.7% identity (61.3% similar) in 419 aa overlap (36-396:5-398)
10 20 30 40 50 60
pF1KB9 ELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKCLKCSD
::::.. :::::.:..::: :: ::..: .
CCDS91 MMVHCAGCERPILDRFLLNVLDRAWHIKCVQCCE
10 20 30
70 80 90 100 110 120
pF1KB9 CHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHCFACVV
:.: :.:.:::: ..:::.:::.:::::::.: :: :...::.:.. :.::.::.:.:
CCDS91 CKTNLSEKCFSREGKLYCKNDFFRRFGTKCAGCAQGISPSDLVRKARSKVFHLNCFTCMV
40 50 60 70 80 90
130 140 150
pF1KB9 CKRQLATGDEFYLMEDSRLVCKADY-----------------------------------
:..::.::.:.:.......::: ::
CCDS91 CNKQLSTGEELYVIDENKFVCKDDYLSSSSLKEGSLNSVSSCTDRSLSPDLQDALQDDPK
100 110 120 130 140 150
160 170 180 190
pF1KB9 ----------ETAKQREAE---ATAKR-PRTTITAKQLETLKSAYNTSPKPARHVREQLS
:::.... : .: .: ::::: ::::::::.:. ..:::.::.::::.
CCDS91 ETDNSTSSDKETANNENEEQNSGTKRRGPRTTIKAKQLETLKAAFAATPKPTRHIREQLA
160 170 180 190 200 210
200 210 220 230 240 250
pF1KB9 SETGLDMRVVQVWFQNRRAKEKRLKK-DAGRQRWGQYFRNMKRSRG-GSKSDKDSVQEGQ
.::::.:::.::::::::.::.:.:. .: : .::. .: : :.. :.. :
CCDS91 QETGLNMRVIQVWFQNRRSKERRMKQLSALGARRHAFFRSPRRMRPLGGRLDES---EML
220 230 240 250 260 270
260 270 280 290 300 310
pF1KB9 DSDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGALGNFSLEHGGLAGPEQYREL
: . . : . .: . . : :.:: : : ..: :
CCDS91 GSTPYTYYGDYQGDYYAPGSNYDFFAHGPPSQA---QSPADSSFLAASG-----------
280 290 300 310
320 330 340 350 360
pF1KB9 RPGS-PYG-VPPSPAAPQSLPGPQPLLSSLVYPDTSLGLVPSGAPGGP----P-PMRVLA
::: : : . : :.:.. .:. . . . .::: :: :: : : : .:..
CCDS91 -PGSTPLGALEPPLAGPHAADNPR-FTDMISHPDT-----PSPEPGLPGTLHPMPGEVFS
320 330 340 350 360 370
370 380 390 400
pF1KB9 GNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF
: ::: . ....:: . : :.:
CCDS91 G-GPSPPFPMSGTSGYSGPLSHPNPELNEAAVW
380 390 400
>>CCDS11316.1 LHX1 gene_id:3975|Hs108|chr17 (406 aa)
initn: 925 init1: 531 opt: 535 Z-score: 310.4 bits: 66.3 E(32554): 5.8e-11
Smith-Waterman score: 815; 39.5% identity (59.8% similar) in 403 aa overlap (36-363:4-400)
10 20 30 40 50 60
pF1KB9 ELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKCLKCSD
:::: . :::::.:..::: :: ::..: .
CCDS11 MVHCAGCKRPILDRFLLNVLDRAWHVKCVQCCE
10 20 30
70 80 90 100 110 120
pF1KB9 CHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHCFACVV
:. :.:.:::: ..:::.:::. ::::::.: :: :...::::.. :.::.::.:..
CCDS11 CKCNLTEKCFSREGKLYCKNDFFRCFGTKCAGCAQGISPSDLVRRARSKVFHLNCFTCMM
40 50 60 70 80 90
130 140 150
pF1KB9 CKRQLATGDEFYLMEDSRLVCKADY-----------------------------------
:..::.::.:.:.......::: ::
CCDS11 CNKQLSTGEELYIIDENKFVCKEDYLSNSSVAKENSLHSATTGSDPSLSPDSQDPSQDDA
100 110 120 130 140 150
160 170 180 190
pF1KB9 ---ETAKQREAEAT----------AKR--PRTTITAKQLETLKSAYNTSPKPARHVREQL
:.:. . :: ::: ::::: ::::::::.:. ..:::.::.::::
CCDS11 KDSESANVSDKEAGSNENDDQNLGAKRRGPRTTIKAKQLETLKAAFAATPKPTRHIREQL
160 170 180 190 200 210
200 210 220 230 240 250
pF1KB9 SSETGLDMRVVQVWFQNRRAKEKRLKK-DAGRQRWGQYFRNMKRSRGGSKSDKDSVQEGQ
..::::.:::.::::::::.::.:.:. .: : .::. .: : : .. :.
CCDS11 AQETGLNMRVIQVWFQNRRSKERRMKQLSALGARRHAFFRSPRRMR----PLVDRLEPGE
220 230 240 250 260
260 270 280 290 300
pF1KB9 D-SDAEVSFPDEPSLAEMGPA-NGLYGSLGEPTQALGRP--------SGALGNF--SLEH
.. :: . . .::. : . : :.. : :: :. .:::
CCDS11 LIPNGPFSFYGDYQSEYYGPGGNYDFFPQGPPSSQAQTPVDLPFVPSSGPSGTPLGGLEH
270 280 290 300 310 320
310 320 330 340 350
pF1KB9 GGLAGPEQYREL-RPGSPYGVPP--SPAAPQSLPGPQPLLSSLVY----PDTSLGLVPSG
: : . : : . . :: ::. ::::: .:. :. : .::. : .:
CCDS11 P-LPGHHPSSEAQRFTDILAHPPGDSPSPEPSLPGPLHSMSAEVFGPSPPFSSLS-VNGG
330 340 350 360 370 380
360 370 380 390 400
pF1KB9 APGG-----PPPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF
: : :: :
CCDS11 ASYGNHLSHPPEMNEAAVW
390 400
402 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 15:38:34 2016 done: Sun Nov 6 15:38:34 2016
Total Scan time: 2.920 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]