FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3147, 360 aa 1>>>pF1KE3147 360 - 360 aa - 360 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4706+/-0.00087; mu= 14.9507+/- 0.052 mean_var=64.1092+/-13.026, 0's: 0 Z-trim(105.8): 24 B-trim: 0 in 0/49 Lambda= 0.160182 statistics sampled from 8588 (8597) to 8588 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.648), E-opt: 0.2 (0.264), width: 16 Scan time: 2.680 The best scores are: opt bits E(32554) CCDS8806.1 LETMD1 gene_id:25875|Hs108|chr12 ( 360) 2447 574.3 5.4e-164 CCDS58231.1 LETMD1 gene_id:25875|Hs108|chr12 ( 373) 1550 367.0 1.4e-101 CCDS73469.1 LETMD1 gene_id:25875|Hs108|chr12 ( 236) 908 218.6 4.3e-57 CCDS69466.1 LETM2 gene_id:137994|Hs108|chr8 ( 491) 278 73.1 5.5e-13 CCDS56534.1 LETM2 gene_id:137994|Hs108|chr8 ( 444) 273 72.0 1.1e-12 CCDS3355.1 LETM1 gene_id:3954|Hs108|chr4 ( 739) 263 69.7 8.7e-12 >>CCDS8806.1 LETMD1 gene_id:25875|Hs108|chr12 (360 aa) initn: 2447 init1: 2447 opt: 2447 Z-score: 3057.6 bits: 574.3 E(32554): 5.4e-164 Smith-Waterman score: 2447; 100.0% identity (100.0% similar) in 360 aa overlap (1-360:1-360) 10 20 30 40 50 60 pF1KE3 MALSRVCWARSAVWGSAVTPGHFVTRRLQLGRSGLAWGAPRSSKLHLSPKADVKNLMSYV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS88 MALSRVCWARSAVWGSAVTPGHFVTRRLQLGRSGLAWGAPRSSKLHLSPKADVKNLMSYV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 VTKTKAINGKYHRFLGRHFPRFYVLYTIFMKGLQMLWADAKKARRIKTNMWKHNIKFHQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS88 VTKTKAINGKYHRFLGRHFPRFYVLYTIFMKGLQMLWADAKKARRIKTNMWKHNIKFHQL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 PYREMEHLRQFRQDVTKCLFLGIISIPPFANYLVFLLMYLFPRQLLIRHFWTPKQQTDFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS88 PYREMEHLRQFRQDVTKCLFLGIISIPPFANYLVFLLMYLFPRQLLIRHFWTPKQQTDFL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 DIYHAFRKQSHPEIISYLEKVIPLISDAGLRWRLTDLCTKIQRGTHPAIHDILALRECFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS88 DIYHAFRKQSHPEIISYLEKVIPLISDAGLRWRLTDLCTKIQRGTHPAIHDILALRECFS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 NHPLGMNQLQALHVKALSRAMLLTSYLPPPLLRHRLKTHTTVIHQLDKALAKLGIGQLTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS88 NHPLGMNQLQALHVKALSRAMLLTSYLPPPLLRHRLKTHTTVIHQLDKALAKLGIGQLTA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 QEVKSACYLRGLNSTHIGEDRCRTWLGEWLQISCSLKEAELSLLLHNVVLLSTNYLGTRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS88 QEVKSACYLRGLNSTHIGEDRCRTWLGEWLQISCSLKEAELSLLLHNVVLLSTNYLGTRR 310 320 330 340 350 360 >>CCDS58231.1 LETMD1 gene_id:25875|Hs108|chr12 (373 aa) initn: 1550 init1: 1550 opt: 1550 Z-score: 1937.1 bits: 367.0 E(32554): 1.4e-101 Smith-Waterman score: 2411; 96.5% identity (96.5% similar) in 373 aa overlap (1-360:1-373) 10 20 30 40 50 60 pF1KE3 MALSRVCWARSAVWGSAVTPGHFVTRRLQLGRSGLAWGAPRSSKLHLSPKADVKNLMSYV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MALSRVCWARSAVWGSAVTPGHFVTRRLQLGRSGLAWGAPRSSKLHLSPKADVKNLMSYV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 VTKTKAINGKYHRFLGRHFPRFYVLYTIFMKGLQMLWADAKKARRIKTNMWKHNIKFHQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 VTKTKAINGKYHRFLGRHFPRFYVLYTIFMKGLQMLWADAKKARRIKTNMWKHNIKFHQL 70 80 90 100 110 120 130 140 150 160 pF1KE3 PYREMEHLRQ-------------FRQDVTKCLFLGIISIPPFANYLVFLLMYLFPRQLLI :::::::::: ::::::::::::::::::::::::::::::::::::: CCDS58 PYREMEHLRQVWARGRYPEVHGEFRQDVTKCLFLGIISIPPFANYLVFLLMYLFPRQLLI 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE3 RHFWTPKQQTDFLDIYHAFRKQSHPEIISYLEKVIPLISDAGLRWRLTDLCTKIQRGTHP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 RHFWTPKQQTDFLDIYHAFRKQSHPEIISYLEKVIPLISDAGLRWRLTDLCTKIQRGTHP 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE3 AIHDILALRECFSNHPLGMNQLQALHVKALSRAMLLTSYLPPPLLRHRLKTHTTVIHQLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 AIHDILALRECFSNHPLGMNQLQALHVKALSRAMLLTSYLPPPLLRHRLKTHTTVIHQLD 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE3 KALAKLGIGQLTAQEVKSACYLRGLNSTHIGEDRCRTWLGEWLQISCSLKEAELSLLLHN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 KALAKLGIGQLTAQEVKSACYLRGLNSTHIGEDRCRTWLGEWLQISCSLKEAELSLLLHN 310 320 330 340 350 360 350 360 pF1KE3 VVLLSTNYLGTRR ::::::::::::: CCDS58 VVLLSTNYLGTRR 370 >>CCDS73469.1 LETMD1 gene_id:25875|Hs108|chr12 (236 aa) initn: 926 init1: 899 opt: 908 Z-score: 1138.4 bits: 218.6 E(32554): 4.3e-57 Smith-Waterman score: 1331; 65.6% identity (65.6% similar) in 360 aa overlap (1-360:1-236) 10 20 30 40 50 60 pF1KE3 MALSRVCWARSAVWGSAVTPGHFVTRRLQLGRSGLAWGAPRSSKLHLSPKADVKNLMSYV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 MALSRVCWARSAVWGSAVTPGHFVTRRLQLGRSGLAWGAPRSSKLHLSPKADVKNLMSYV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 VTKTKAINGKYHRFLGRHFPRFYVLYTIFMKGLQMLWADAKKARRIKTNMWKHNIKFHQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 VTKTKAINGKYHRFLGRHFPRFYVLYTIFMKGLQMLWADAKKARRIKTNMWKHNIKFHQL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 PYREMEHLRQFRQDVTKCLFLGIISIPPFANYLVFLLMYLFPRQLLIRHFWTPKQQTDFL :::::::::: CCDS73 PYREMEHLRQ-------------------------------------------------- 130 190 200 210 220 230 240 pF1KE3 DIYHAFRKQSHPEIISYLEKVIPLISDAGLRWRLTDLCTKIQRGTHPAIHDILALRECFS CCDS73 ------------------------------------------------------------ 250 260 270 280 290 300 pF1KE3 NHPLGMNQLQALHVKALSRAMLLTSYLPPPLLRHRLKTHTTVIHQLDKALAKLGIGQLTA :::::::::::::::::::::::::::::::::::::::::::::: CCDS73 --------------KALSRAMLLTSYLPPPLLRHRLKTHTTVIHQLDKALAKLGIGQLTA 140 150 160 170 310 320 330 340 350 360 pF1KE3 QEVKSACYLRGLNSTHIGEDRCRTWLGEWLQISCSLKEAELSLLLHNVVLLSTNYLGTRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 QEVKSACYLRGLNSTHIGEDRCRTWLGEWLQISCSLKEAELSLLLHNVVLLSTNYLGTRR 180 190 200 210 220 230 >>CCDS69466.1 LETM2 gene_id:137994|Hs108|chr8 (491 aa) initn: 236 init1: 139 opt: 278 Z-score: 346.6 bits: 73.1 E(32554): 5.5e-13 Smith-Waterman score: 278; 23.6% identity (54.8% similar) in 356 aa overlap (3-345:43-384) 10 20 pF1KE3 MALSRVCWA--RSAVWGSAVTPGHFV----TR :...: .: ... ::. : :: CCDS69 ARTRFPSHFVHPTCSSYSPSCAFLHLPDSHLNKTCMKNYESKKYSDPSQPGNTVLHPGTR 20 30 40 50 60 70 30 40 50 60 70 80 pF1KE3 RLQLGRSGLAWGAPRSSKLHLSPKADVKNLMSYVVTKTKAINGKYHRFLGRHFPRFYVLY .: ... : .: .: . .. : .:: ... : :.. : .. CCDS69 LIQKLHTSTCWLQEVPGKPQLEQATKHPQVTSPQATKETGMEIKE----GKQSYRQKIMD 80 90 100 110 120 90 100 110 120 130 140 pF1KE3 TI--FMKGLQMLWADAKKARRIKTNMWKHNIKFHQLPYREMEHLRQFRQDVTKCLFLGII . ...:. .:: ::: : :. .:. .. . : :: ..: . : . . . .. CCDS69 ELKYYYNGFYLLWIDAKVAARM---VWRL-LHGQVLTRRERRRLLRTCVDFFRLVPFMVF 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE3 SIPPFANYLVFLLMYLFPRQLLIRHFWTP----KQQTDFLDIYHAFRKQSHPEIISYLEK : :: ..:. ... ::: ..: : . ..: . . . : . . . .. CCDS69 LIVPFMEFLLPVFLKLFP-EMLPSTFESESKKEEKQKKKMAVKLELAKFLQETMTEMARR 190 200 210 220 230 240 210 220 230 240 250 260 pF1KE3 VIPLISDAGLRWRLTDLCTKIQRGTHPAIHDILALRECFSNHPLGMNQLQALHVKALSRA ..::. . :.. ..: : .:. ..:. . . : .. :....:. .. :: . CCDS69 NRAKMGDASTQ--LSSYVKQVQTGHKPSTKEIVRFSKLFEDQ-LALEHLDRPQLVALCKL 250 260 270 280 290 300 270 280 290 300 310 320 pF1KE3 MLLTSYLPPPLLRHRLKTHTTVIHQLDKALAKLGIGQLTAQEVKSACYLRGLNSTHIGED . : .. ::: .: . :. :. .:: :. :...:...:: ::. : . :. CCDS69 LELQTFGTNNLLRFQLLMKLKSIKADDEIIAKEGVTALSVSELQAACRARGMRSLGLTEE 310 320 330 340 350 360 330 340 350 360 pF1KE3 RCRTWLGEWLQISCSLKE-AELSLLLHNVVLLSTNYLGTRR . : : :: .. ::: . :::: CCDS69 QLRQQLTEWQDL--HLKENVPPSLLLLSRTFYLIDVKPKPIEIPLSGEAPKTDILVELPT 370 380 390 400 410 >>CCDS56534.1 LETM2 gene_id:137994|Hs108|chr8 (444 aa) initn: 236 init1: 139 opt: 273 Z-score: 341.0 bits: 72.0 E(32554): 1.1e-12 Smith-Waterman score: 273; 24.0% identity (54.9% similar) in 337 aa overlap (20-345:15-337) 10 20 30 40 50 pF1KE3 MALSRVCWARSAVWGSAVTPGHFV----TRRLQLGRSGLAWGAPRSSKLHLSPKADVKNL ::. : :: .: ... : .: .: . .. CCDS56 MKNYESKKYSDPSQPGNTVLHPGTRLIQKLHTSTCWLQEVPGKPQLEQATKHPQV 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 MSYVVTKTKAINGKYHRFLGRHFPRFYVLYTI--FMKGLQMLWADAKKARRIKTNMWKHN : .:: ... : :.. : .. . ...:. .:: ::: : :. .:. CCDS56 TSPQATKETGMEIKE----GKQSYRQKIMDELKYYYNGFYLLWIDAKVAARM---VWRL- 60 70 80 90 100 120 130 140 150 160 170 pF1KE3 IKFHQLPYREMEHLRQFRQDVTKCLFLGIISIPPFANYLVFLLMYLFPRQLLIRHFWTP- .. . : :: ..: . : . . . .. : :: ..:. ... ::: ..: : . CCDS56 LHGQVLTRRERRRLLRTCVDFFRLVPFMVFLIVPFMEFLLPVFLKLFP-EMLPSTFESES 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE3 ---KQQTDFLDIYHAFRKQSHPEIISYLEKVIPLISDAGLRWRLTDLCTKIQRGTHPAIH ..: . . . : . . . .. ..::. .:.. ..: : .:. . CCDS56 KKEEKQKKKMAVKLELAKFLQETMTEMARRNRAKMGDAST--QLSSYVKQVQTGHKPSTK 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE3 DILALRECFSNHPLGMNQLQALHVKALSRAMLLTSYLPPPLLRHRLKTHTTVIHQLDKAL .:. . . : .. :....:. .. :: . . : .. ::: .: . :. :. . CCDS56 EIVRFSKLFEDQ-LALEHLDRPQLVALCKLLELQTFGTNNLLRFQLLMKLKSIKADDEII 230 240 250 260 270 280 300 310 320 330 340 pF1KE3 AKLGIGQLTAQEVKSACYLRGLNSTHIGEDRCRTWLGEWLQISCSLKE-AELSLLLHNVV :: :. :...:...:: ::. : . :.. : : :: .. ::: . :::: CCDS56 AKEGVTALSVSELQAACRARGMRSLGLTEEQLRQQLTEWQDL--HLKENVPPSLLLLSRT 290 300 310 320 330 340 350 360 pF1KE3 LLSTNYLGTRR CCDS56 FYLIDVKPKPIEIPLSGEAPKTDILVELPTFTESKENMVDLAPQLKGTKDEDFIQPPPVT 350 360 370 380 390 400 >>CCDS3355.1 LETM1 gene_id:3954|Hs108|chr4 (739 aa) initn: 193 init1: 148 opt: 263 Z-score: 325.0 bits: 69.7 E(32554): 8.7e-12 Smith-Waterman score: 263; 25.0% identity (58.0% similar) in 264 aa overlap (89-345:164-416) 60 70 80 90 100 110 pF1KE3 YVVTKTKAINGKYHRFLGRHFPRFYVLYTIFMKGLQMLWADAKKARRIKTNMWKHNIKFH ...:...:: :.: : :. .:. .. : CCDS33 LEEGGPVYSPPAEVVVKKSLGQRVLDELKHYYHGFRLLWIDTKIAARM---LWR-ILNGH 140 150 160 170 180 120 130 140 150 160 170 pF1KE3 QLPYREMEHLRQFRQDVTKCLFLGIISIPPFANYLVFLLMYLFPRQLLIRHFWTPKQQTD .: :: ... .. :. . . . .. . :: ..:. . . ::: ..: : : . . . CCDS33 SLTRRERRQFLRICADLFRLVPFLVFVVVPFMEFLLPVAVKLFP-NMLPSTFETQSLKEE 190 200 210 220 230 240 180 190 200 210 220 230 pF1KE3 FLDIYHAFRKQSHPEIISYLEKVIP---LISDAGLRWRLTDLCTKIQR----GTHPAIHD : . .: . :. ..:. .: : . :. :. . .:. : .:. .. CCDS33 RLK--KELRVKL--ELAKFLQDTIEEMALKNKAAKGSATKDFSVFFQKIRETGERPSNEE 250 260 270 280 290 300 240 250 260 270 280 290 pF1KE3 ILALRECFSNHPLGMNQLQALHVKALSRAMLLTSYLPPPLLRHRLKTHTTVIHQLDKALA :. . . : .. : ...: .. :: . . : : .:: .: . :. :: .: CCDS33 IMRFSKLFEDE-LTLDNLTRPQLVALCKLLELQSIGTNNFLRFQLTMRLRSIKADDKLIA 310 320 330 340 350 360 300 310 320 330 340 350 pF1KE3 KLGIGQLTAQEVKSACYLRGLNSTHIGEDRCRTWLGEWLQISCSLKEAELSLLLHNVVLL . :. .:...:...:: ::. . . ::: : : .::.. .: :::. CCDS33 EEGVDSLNVKELQAACRARGMRALGVTEDRLRGQLKQWLDLHLH-QEIPTSLLILSRAMY 370 380 390 400 410 420 360 pF1KE3 STNYLGTRR CCDS33 LPDTLSPADQLKSTLQTLPEIVAKEAQVKVAEVEGEQVDNKAKLEATLQEEAAIQQEHRE 430 440 450 460 470 480 360 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:50:07 2016 done: Sun Nov 6 14:50:08 2016 Total Scan time: 2.680 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]