FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9719, 402 aa 1>>>pF1KB9719 402 - 402 aa - 402 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.8119+/-0.00101; mu= 5.5047+/- 0.062 mean_var=349.7526+/-72.264, 0's: 0 Z-trim(115.5): 113 B-trim: 0 in 0/52 Lambda= 0.068579 statistics sampled from 15905 (16020) to 15905 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.802), E-opt: 0.2 (0.492), width: 16 Scan time: 2.920 The best scores are: opt bits E(32554) CCDS6995.1 LHX3 gene_id:8022|Hs108|chr9 ( 402) 2794 289.8 3e-78 CCDS6994.1 LHX3 gene_id:8022|Hs108|chr9 ( 397) 2606 271.2 1.2e-72 CCDS1338.1 LHX4 gene_id:89884|Hs108|chr1 ( 390) 1718 183.3 3.3e-46 CCDS9171.1 LHX5 gene_id:64211|Hs108|chr12 ( 402) 556 68.4 1.4e-11 CCDS11316.1 LHX1 gene_id:3975|Hs108|chr17 ( 406) 535 66.3 5.8e-11 >>CCDS6995.1 LHX3 gene_id:8022|Hs108|chr9 (402 aa) initn: 2794 init1: 2794 opt: 2794 Z-score: 1518.3 bits: 289.8 E(32554): 3e-78 Smith-Waterman score: 2794; 100.0% identity (100.0% similar) in 402 aa overlap (1-402:1-402) 10 20 30 40 50 60 pF1KB9 MEARGELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 MEARGELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKC 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 LKCSDCHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 LKCSDCHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 FACVVCKRQLATGDEFYLMEDSRLVCKADYETAKQREAEATAKRPRTTITAKQLETLKSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 FACVVCKRQLATGDEFYLMEDSRLVCKADYETAKQREAEATAKRPRTTITAKQLETLKSA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 YNTSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRQRWGQYFRNMKRSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 YNTSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRQRWGQYFRNMKRSR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 GGSKSDKDSVQEGQDSDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGALGNFSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 GGSKSDKDSVQEGQDSDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGALGNFSL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 EHGGLAGPEQYRELRPGSPYGVPPSPAAPQSLPGPQPLLSSLVYPDTSLGLVPSGAPGGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 EHGGLAGPEQYRELRPGSPYGVPPSPAAPQSLPGPQPLLSSLVYPDTSLGLVPSGAPGGP 310 320 330 340 350 360 370 380 390 400 pF1KB9 PPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF :::::::::::::::::::::::::::::::::::::::::: CCDS69 PPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF 370 380 390 400 >>CCDS6994.1 LHX3 gene_id:8022|Hs108|chr9 (397 aa) initn: 2606 init1: 2606 opt: 2606 Z-score: 1417.8 bits: 271.2 E(32554): 1.2e-72 Smith-Waterman score: 2606; 100.0% identity (100.0% similar) in 372 aa overlap (31-402:26-397) 10 20 30 40 50 60 pF1KB9 MEARGELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKC :::::::::::::::::::::::::::::: CCDS69 MLLETGLERDRARPGAAAVCTLGGTREIPLCAGCDQHILDRFILKALDRHWHSKC 10 20 30 40 50 70 80 90 100 110 120 pF1KB9 LKCSDCHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 LKCSDCHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHC 60 70 80 90 100 110 130 140 150 160 170 180 pF1KB9 FACVVCKRQLATGDEFYLMEDSRLVCKADYETAKQREAEATAKRPRTTITAKQLETLKSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 FACVVCKRQLATGDEFYLMEDSRLVCKADYETAKQREAEATAKRPRTTITAKQLETLKSA 120 130 140 150 160 170 190 200 210 220 230 240 pF1KB9 YNTSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRQRWGQYFRNMKRSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 YNTSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRQRWGQYFRNMKRSR 180 190 200 210 220 230 250 260 270 280 290 300 pF1KB9 GGSKSDKDSVQEGQDSDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGALGNFSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 GGSKSDKDSVQEGQDSDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGALGNFSL 240 250 260 270 280 290 310 320 330 340 350 360 pF1KB9 EHGGLAGPEQYRELRPGSPYGVPPSPAAPQSLPGPQPLLSSLVYPDTSLGLVPSGAPGGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 EHGGLAGPEQYRELRPGSPYGVPPSPAAPQSLPGPQPLLSSLVYPDTSLGLVPSGAPGGP 300 310 320 330 340 350 370 380 390 400 pF1KB9 PPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF :::::::::::::::::::::::::::::::::::::::::: CCDS69 PPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF 360 370 380 390 >>CCDS1338.1 LHX4 gene_id:89884|Hs108|chr1 (390 aa) initn: 1166 init1: 790 opt: 1718 Z-score: 943.1 bits: 183.3 E(32554): 3.3e-46 Smith-Waterman score: 1718; 66.3% identity (85.4% similar) in 377 aa overlap (31-402:25-390) 10 20 30 40 50 60 pF1KB9 MEARGELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKC ..:: ::::.:::::.::::.:::::::.: CCDS13 MMQSATVPAEGAVKGLPEMLGVPMQQIPQCAGCNQHILDKFILKVLDRHWHSSC 10 20 30 40 50 70 80 90 100 110 120 pF1KB9 LKCSDCHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHC :::.::. ::.:::::. :::::.::::::::::.::: :::::::::.:::::::::: CCDS13 LKCADCQMQLADRCFSRAGSVYCKEDFFKRFGTKCTACQQGIPPTQVVRKAQDFVYHLHC 60 70 80 90 100 110 130 140 150 160 170 pF1KB9 FACVVCKRQLATGDEFYLMEDSRLVCKADYETAKQRE-AEATAKRPRTTITAKQLETLKS :::..:.::::::::::::::.::::: ::::::: . .:: :::::::::::::::::. CCDS13 FACIICNRQLATGDEFYLMEDGRLVCKEDYETAKQNDDSEAGAKRPRTTITAKQLETLKN 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB9 AYNTSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRQRWGQYFRNMKRS ::..:::::::::::::::::::::::::::::::::::::::::::.::::.....::: CCDS13 AYKNSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRHRWGQFYKSVKRS 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB9 RGGSKSDKDSVQEGQD-SDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGAL--G ::.::..:.: : ::.:.:: .. :.:.: .: .::..:. : .: : : CCDS13 RGSSKQEKESSAEDCGVSDSELSFREDQILSELGHTNRIYGNVGDVT------GGQLMNG 240 250 260 270 280 300 310 320 330 340 350 pF1KB9 NFSLEHGGLAGPEQYRELRPGSPYGVPPSPAAPQSLPGPQPLLSSLVYP-DTSLGLVPSG .::.. : ..:..:: :::::.: ::.. .:::. :::..: : :..::.. . CCDS13 SFSMDGTG----QSYQDLRDGSPYGIPQSPSSISSLPSHAPLLNGLDYTVDSNLGIIAHA 290 300 310 320 330 340 360 370 380 390 400 pF1KB9 APGGPPPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF . : .:..:: ::.::.::::: ::::::.::.:::::.:: : CCDS13 GQGVSQTLRAMAG-GPTSDISTGSSVGYPDFPTSPGSWLDEMDHPPF 350 360 370 380 390 >>CCDS9171.1 LHX5 gene_id:64211|Hs108|chr12 (402 aa) initn: 934 init1: 549 opt: 556 Z-score: 321.6 bits: 68.4 E(32554): 1.4e-11 Smith-Waterman score: 862; 38.7% identity (61.3% similar) in 419 aa overlap (36-396:5-398) 10 20 30 40 50 60 pF1KB9 ELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKCLKCSD ::::.. :::::.:..::: :: ::..: . CCDS91 MMVHCAGCERPILDRFLLNVLDRAWHIKCVQCCE 10 20 30 70 80 90 100 110 120 pF1KB9 CHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHCFACVV :.: :.:.:::: ..:::.:::.:::::::.: :: :...::.:.. :.::.::.:.: CCDS91 CKTNLSEKCFSREGKLYCKNDFFRRFGTKCAGCAQGISPSDLVRKARSKVFHLNCFTCMV 40 50 60 70 80 90 130 140 150 pF1KB9 CKRQLATGDEFYLMEDSRLVCKADY----------------------------------- :..::.::.:.:.......::: :: CCDS91 CNKQLSTGEELYVIDENKFVCKDDYLSSSSLKEGSLNSVSSCTDRSLSPDLQDALQDDPK 100 110 120 130 140 150 160 170 180 190 pF1KB9 ----------ETAKQREAE---ATAKR-PRTTITAKQLETLKSAYNTSPKPARHVREQLS :::.... : .: .: ::::: ::::::::.:. ..:::.::.::::. CCDS91 ETDNSTSSDKETANNENEEQNSGTKRRGPRTTIKAKQLETLKAAFAATPKPTRHIREQLA 160 170 180 190 200 210 200 210 220 230 240 250 pF1KB9 SETGLDMRVVQVWFQNRRAKEKRLKK-DAGRQRWGQYFRNMKRSRG-GSKSDKDSVQEGQ .::::.:::.::::::::.::.:.:. .: : .::. .: : :.. :.. : CCDS91 QETGLNMRVIQVWFQNRRSKERRMKQLSALGARRHAFFRSPRRMRPLGGRLDES---EML 220 230 240 250 260 270 260 270 280 290 300 310 pF1KB9 DSDAEVSFPDEPSLAEMGPANGLYGSLGEPTQALGRPSGALGNFSLEHGGLAGPEQYREL : . . : . .: . . : :.:: : : ..: : CCDS91 GSTPYTYYGDYQGDYYAPGSNYDFFAHGPPSQA---QSPADSSFLAASG----------- 280 290 300 310 320 330 340 350 360 pF1KB9 RPGS-PYG-VPPSPAAPQSLPGPQPLLSSLVYPDTSLGLVPSGAPGGP----P-PMRVLA ::: : : . : :.:.. .:. . . . .::: :: :: : : : .:.. CCDS91 -PGSTPLGALEPPLAGPHAADNPR-FTDMISHPDT-----PSPEPGLPGTLHPMPGEVFS 320 330 340 350 360 370 370 380 390 400 pF1KB9 GNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF : ::: . ....:: . : :.: CCDS91 G-GPSPPFPMSGTSGYSGPLSHPNPELNEAAVW 380 390 400 >>CCDS11316.1 LHX1 gene_id:3975|Hs108|chr17 (406 aa) initn: 925 init1: 531 opt: 535 Z-score: 310.4 bits: 66.3 E(32554): 5.8e-11 Smith-Waterman score: 815; 39.5% identity (59.8% similar) in 403 aa overlap (36-363:4-400) 10 20 30 40 50 60 pF1KB9 ELGPARESAGGDLLLALLARRADLRREIPLCAGCDQHILDRFILKALDRHWHSKCLKCSD :::: . :::::.:..::: :: ::..: . CCDS11 MVHCAGCKRPILDRFLLNVLDRAWHVKCVQCCE 10 20 30 70 80 90 100 110 120 pF1KB9 CHTPLAERCFSRGESVYCKDDFFKRFGTKCAACQLGIPPTQVVRRAQDFVYHLHCFACVV :. :.:.:::: ..:::.:::. ::::::.: :: :...::::.. :.::.::.:.. CCDS11 CKCNLTEKCFSREGKLYCKNDFFRCFGTKCAGCAQGISPSDLVRRARSKVFHLNCFTCMM 40 50 60 70 80 90 130 140 150 pF1KB9 CKRQLATGDEFYLMEDSRLVCKADY----------------------------------- :..::.::.:.:.......::: :: CCDS11 CNKQLSTGEELYIIDENKFVCKEDYLSNSSVAKENSLHSATTGSDPSLSPDSQDPSQDDA 100 110 120 130 140 150 160 170 180 190 pF1KB9 ---ETAKQREAEAT----------AKR--PRTTITAKQLETLKSAYNTSPKPARHVREQL :.:. . :: ::: ::::: ::::::::.:. ..:::.::.:::: CCDS11 KDSESANVSDKEAGSNENDDQNLGAKRRGPRTTIKAKQLETLKAAFAATPKPTRHIREQL 160 170 180 190 200 210 200 210 220 230 240 250 pF1KB9 SSETGLDMRVVQVWFQNRRAKEKRLKK-DAGRQRWGQYFRNMKRSRGGSKSDKDSVQEGQ ..::::.:::.::::::::.::.:.:. .: : .::. .: : : .. :. CCDS11 AQETGLNMRVIQVWFQNRRSKERRMKQLSALGARRHAFFRSPRRMR----PLVDRLEPGE 220 230 240 250 260 260 270 280 290 300 pF1KB9 D-SDAEVSFPDEPSLAEMGPA-NGLYGSLGEPTQALGRP--------SGALGNF--SLEH .. :: . . .::. : . : :.. : :: :. .::: CCDS11 LIPNGPFSFYGDYQSEYYGPGGNYDFFPQGPPSSQAQTPVDLPFVPSSGPSGTPLGGLEH 270 280 290 300 310 320 310 320 330 340 350 pF1KB9 GGLAGPEQYREL-RPGSPYGVPP--SPAAPQSLPGPQPLLSSLVY----PDTSLGLVPSG : : . : : . . :: ::. ::::: .:. :. : .::. : .: CCDS11 P-LPGHHPSSEAQRFTDILAHPPGDSPSPEPSLPGPLHSMSAEVFGPSPPFSSLS-VNGG 330 340 350 360 370 380 360 370 380 390 400 pF1KB9 APGG-----PPPMRVLAGNGPSSDLSTGSSGGYPDFPASPASWLDEVDHAQF : : :: : CCDS11 ASYGNHLSHPPEMNEAAVW 390 400 402 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 15:38:34 2016 done: Sun Nov 6 15:38:34 2016 Total Scan time: 2.920 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]