FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4035, 246 aa
  1>>>pF1KE4035 246 - 246 aa - 246 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.1974+/-0.000637; mu= 15.8152+/- 0.038
 mean_var=80.2784+/-15.909, 0's: 0 Z-trim(112.7): 24 B-trim: 4 in 1/50
 Lambda= 0.143145
 statistics sampled from 13378 (13400) to 13378 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.776), E-opt: 0.2 (0.412), width: 16
 Scan time: 2.500

The best scores are:                                  opt  bits E(32554)
CCDS12202.1 MARCH2 gene_id:51257|Hs108|chr19  ( 246) 1726 365.3 2.2e-101
CCDS4141.1 MARCH3 gene_id:115123|Hs108|chr5   ( 253) 1085 232.9 1.6e-61
CCDS32894.1 MARCH2 gene_id:51257|Hs108|chr19  ( 176)  886 191.7 2.8e-49
CCDS7213.1 MARCH8 gene_id:220972|Hs108|chr10  ( 291)  327  76.4 2.3e-14
CCDS54814.1 MARCH1 gene_id:55016|Hs108|chr4   ( 289)  306  72.1 4.6e-13
CCDS60519.1 MARCH8 gene_id:220972|Hs108|chr10 ( 573)  309  72.9 5e-13
CCDS3806.1 MARCH1 gene_id:55016|Hs108|chr4    ( 272)  300  70.8 1e-12

>>CCDS12202.1 MARCH2 gene_id:51257|Hs108|chr19 (246 aa)
 initn: 1726 init1: 1726 opt: 1726 Z-score: 1933.7 bits: 365.3 E(32554): 2.2e-101
Smith-Waterman score: 1726; 99.6% identity (100.0% similar) in 246 aa overlap (1-246:1-246)

               10        20        30        40        50        60
pF1KE4 MTTGDCCHLPGSLCDCSGSPAFSKVVEATGLGPPQYVAQVTSRDGRLLSTVIRTLDTPSD
       :::::::::::::::::::::::::::::::::::::::::::::::::::::.::::::
CCDS12 MTTGDCCHLPGSLCDCSGSPAFSKVVEATGLGPPQYVAQVTSRDGRLLSTVIRALDTPSD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 GPFCRICHEGANGECLLSPCGCTGTLGAVHKSCLEKWLSSSNTSYCELCHTEFAVEKRPR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 GPFCRICHEGANGECLLSPCGCTGTLGAVHKSCLEKWLSSSNTSYCELCHTEFAVEKRPR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 PLTEWLKDPGPRTEKRTLCCDMVCFLFITPLAAISGWLCLRGAQDHLRLHSQLEAVGLIA
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 PLTEWLKDPGPRTEKRTLCCDMVCFLFITPLAAISGWLCLRGAQDHLRLHSQLEAVGLIA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 LTIALFTIYVLWTLVSFRYHCQLYSEWRKTNQKVRLKIREADSPEGPQHSPLAAGLLKKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LTIALFTIYVLWTLVSFRYHCQLYSEWRKTNQKVRLKIREADSPEGPQHSPLAAGLLKKV 190 200 210 220 230 240 pF1KE4 AEETPV :::::: CCDS12 AEETPV >>CCDS4141.1 MARCH3 gene_id:115123|Hs108|chr5 (253 aa) initn: 1123 init1: 893 opt: 1085 Z-score: 1218.1 bits: 232.9 E(32554): 1.6e-61 Smith-Waterman score: 1085; 65.1% identity (82.0% similar) in 255 aa overlap (1-246:1-253) 10 20 30 40 50 pF1KE4 MTTGDCCHLPGSLCDCSGSPA-FSKVVEATGL---GPPQYVAQVTSRDGRLLSTVIRTLD :::. : ::: : ::..: : :.:: : : :::: ::...::.:::::.::: CCDS41 MTTSRCSHLPEVLPDCTSSAAPVVKTVEDCGSLVNGQPQYVMQVSAKDGQLLSTVVRTLA 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 TPS---DGPFCRICHEGANGECLLSPCGCTGTLGAVHKSCLEKWLSSSNTSYCELCHTEF : : : :.:::::::.. : ::::: ::::::..:.::::.:::::::::::::: .: CCDS41 TQSPFNDRPMCRICHEGSSQEDLLSPCECTGTLGTIHRSCLEHWLSSSNTSYCELCHFRF 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 AVEKRPRPLTEWLKDPGPRTEKRTLCCDMVCFLFITPLAAISGWLCLRGAQDHLRLHSQL :::..::::.:::..:::. ::::: ::::::::::::.:::::::::: :::.. :.: CCDS41 AVERKPRPLVEWLRNPGPQHEKRTLFGDMVCFLFITPLATISGWLCLRGAVDHLHFSSRL 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 EAVGLIALTIALFTIYVLWTLVSFRYHCQLYSEWRKTNQKVRLKIREADSPEGPQHSPLA :::::::::.::::::..::::::::::.::.:::.:::.: : : . : . :...: CCDS41 EAVGLIALTVALFTIYLFWTLVSFRYHCRLYNEWRRTNQRVILLIPK--SVNVPSNQPSL 190 200 210 220 230 240 pF1KE4 AGL--LKKVAEETPV :: .:. 
..:: : CCDS41 LGLHSVKRNSKETVV 240 250 >>CCDS32894.1 MARCH2 gene_id:51257|Hs108|chr19 (176 aa) initn: 886 init1: 886 opt: 886 Z-score: 998.1 bits: 191.7 E(32554): 2.8e-49 Smith-Waterman score: 1088; 71.1% identity (71.5% similar) in 246 aa overlap (1-246:1-176) 10 20 30 40 50 60 pF1KE4 MTTGDCCHLPGSLCDCSGSPAFSKVVEATGLGPPQYVAQVTSRDGRLLSTVIRTLDTPSD :::::::::::::::::::::::::::::::::::::::::::::::::::::.:::::: CCDS32 MTTGDCCHLPGSLCDCSGSPAFSKVVEATGLGPPQYVAQVTSRDGRLLSTVIRALDTPSD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 GPFCRICHEGANGECLLSPCGCTGTLGAVHKSCLEKWLSSSNTSYCELCHTEFAVEKRPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 GPFCRICHEGANGECLLSPCGCTGTLGAVHKSCLEKWLSSSNTSYCELCHTEFAVEKRPR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 PLTEWLKDPGPRTEKRTLCCDMVCFLFITPLAAISGWLCLRGAQDHLRLHSQLEAVGLIA :::: CCDS32 PLTE-------------------------------------------------------- 190 200 210 220 230 240 pF1KE4 LTIALFTIYVLWTLVSFRYHCQLYSEWRKTNQKVRLKIREADSPEGPQHSPLAAGLLKKV :::::::::::::::::::::::::::::::::::::::::::::: CCDS32 --------------VSFRYHCQLYSEWRKTNQKVRLKIREADSPEGPQHSPLAAGLLKKV 130 140 150 160 170 pF1KE4 AEETPV :::::: CCDS32 AEETPV >>CCDS7213.1 MARCH8 gene_id:220972|Hs108|chr10 (291 aa) initn: 283 init1: 223 opt: 327 Z-score: 371.3 bits: 76.4 E(32554): 2.3e-14 Smith-Waterman score: 327; 30.4% identity (62.0% similar) in 184 aa overlap (48-225:64-243) 20 30 40 50 60 70 pF1KE4 GSPAFSKVVEATGLGPPQYVAQVTSRDGRLLSTVIRTLDTPSDGPFCRICH-EGANGECL .:. :: :::. .::::: :: . : CCDS72 QNEKTLGHFMSHSSNISKAGSPPSASAPAPVSSFSRTSITPSSQDICRICHCEGDDESPL 40 50 60 70 80 90 80 90 100 110 120 130 pF1KE4 LSPCGCTGTLGAVHKSCLEKWLSSSNTSYCELCHTEFAVEKRPRPLTEWLKDPGPRTEKR ..:: :::.: ::..::..:..::.: ::::. :: .: . .:: .: : .:.: CCDS72 ITPCHCTGSLHFVHQACLQQWIKSSDTRCCELCKYEFIMETKLKPLRKWEKLQMTSSERR 100 110 120 130 140 150 140 150 160 170 180 190 pF1KE4 TLCCDMVCFLFITPLAAISGWLCLRGAQDHLRLHSQLEAVGLIALTI--ALFTIYVLWT- . :... .. .. : .. . . .... : .:.:.. . : .. . 
.: CCDS72 KIMCSVTFHVIAITCVVWSLYVLIDRTAEEIK---QGQATGILEWPFWTKLVVVAIGFTG 160 170 180 190 200 210 200 210 220 230 240 pF1KE4 -LVSFRYHCQLYSE-WRKTNQKVRLKIREADSPEGPQHSPLAAGLLKKVAEETPV :. . .:..: . :.. . :. : . :: CCDS72 GLLFMYVQCKVYVQLWKRLKAYNRV-IYVQNCPETSKKNIFEKSPLTEPNFENKHGYGIC 220 230 240 250 260 CCDS72 HSDTNSSCCTEPEDTGAEIIHV 270 280 290 >>CCDS54814.1 MARCH1 gene_id:55016|Hs108|chr4 (289 aa) initn: 265 init1: 220 opt: 306 Z-score: 347.9 bits: 72.1 E(32554): 4.6e-13 Smith-Waterman score: 306; 26.8% identity (59.2% similar) in 213 aa overlap (9-216:24-235) 10 20 30 40 pF1KE4 MTTGDCCHLPGSLCDCSGSPAFSKVVEATGLGPPQYVAQVTS-RD . :.: : : . .... . . . . .....: CCDS54 MLGWCEAIARNPHRIPNNTRTPEISGDLADASQTSTLNEKSPGRSASRSSNISKASSPTT 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE4 GRLLSTVIRTLDTPSDGPFCRICH-EGANGECLLSPCGCTGTLGAVHKSCLEKWLSSSNT : . : :: .::::: :: . :..:: ::::: ::.:::..:..::.: CCDS54 GTAPRSQSRLSVCPSTQDICRICHCEGDEESPLITPCRCTGTLRFVHQSCLHQWIKSSDT 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE4 SYCELCHTEFAVEKRPRPLTEWLKDPGPRTEKRTLCCDMVCFLFITPLAAISGWLCLRGA ::::. .: .: . .:: .: : .:.: . :... .. .. : .. . . CCDS54 RCCELCKYDFIMETKLKPLRKWEKLQMTTSERRKIFCSVTFHVIAITCVVWSLYVLIDRT 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE4 QDHLRLHSQLEAVGLIALTIALFTIYVLWT--LVSFRYHCQLYSE-WRKTNQKVRLKIRE .... ... ..: . : .. . .: :: . .:..: . ::. . :. CCDS54 AEEIK-QGNDNGVLEWPFWTKLVVVAIGFTGGLVFMYVQCKVYVQLWRRLKAYNRVIFVQ 190 200 210 220 230 230 240 pF1KE4 ADSPEGPQHSPLAAGLLKKVAEETPV CCDS54 NCPDTAKKLEKNFSCNVNTDIKDAVVVPVPQTGANSLPSAEGGPPEVVSV 240 250 260 270 280 >>CCDS60519.1 MARCH8 gene_id:220972|Hs108|chr10 (573 aa) initn: 289 init1: 223 opt: 309 Z-score: 347.3 bits: 72.9 E(32554): 5e-13 Smith-Waterman score: 309; 29.5% identity (61.8% similar) in 173 aa overlap (59-225:357-525) 30 40 50 60 70 80 pF1KE4 TGLGPPQYVAQVTSRDGRLLSTVIRTLDTPSDGPFCRICH-EGANGECLLSPCGCTGTLG ..: ::::: :: . 
:..:: :::.: CCDS60 RAPLCSTEKDSDLDCPSPFSEKLPPISPVSTSGDVCRICHCEGDDESPLITPCHCTGSLH 330 340 350 360 370 380 90 100 110 120 130 140 pF1KE4 AVHKSCLEKWLSSSNTSYCELCHTEFAVEKRPRPLTEWLKDPGPRTEKRTLCCDMVCFLF ::..::..:..::.: ::::. :: .: . .:: .: : .:.: . :... .. CCDS60 FVHQACLQQWIKSSDTRCCELCKYEFIMETKLKPLRKWEKLQMTSSERRKIMCSVTFHVI 390 400 410 420 430 440 150 160 170 180 190 200 pF1KE4 ITPLAAISGWLCLRGAQDHLRLHSQLEAVGLIALTI--ALFTIYVLWT--LVSFRYHCQL .. : .. . . .... : .:.:.. . : .. . .: :. . .:.. CCDS60 AITCVVWSLYVLIDRTAEEIK---QGQATGILEWPFWTKLVVVAIGFTGGLLFMYVQCKV 450 460 470 480 490 500 210 220 230 240 pF1KE4 YSE-WRKTNQKVRLKIREADSPEGPQHSPLAAGLLKKVAEETPV : . :.. . :. : . :: CCDS60 YVQLWKRLKAYNRV-IYVQNCPETSKKNIFEKSPLTEPNFENKHGYGICHSDTNSSCCTE 510 520 530 540 550 560 >>CCDS3806.1 MARCH1 gene_id:55016|Hs108|chr4 (272 aa) initn: 265 init1: 220 opt: 300 Z-score: 341.6 bits: 70.8 E(32554): 1e-12 Smith-Waterman score: 300; 30.7% identity (62.6% similar) in 163 aa overlap (58-216:57-218) 30 40 50 60 70 80 pF1KE4 ATGLGPPQYVAQVTSRDGRLLSTVIRTLDTPSDGPFCRICH-EGANGECLLSPCGCTGTL :: .::::: :: . :..:: ::::: CCDS38 QDAKLSNLFLQASSPTTGTAPRSQSRLSVCPSTQDICRICHCEGDEESPLITPCRCTGTL 30 40 50 60 70 80 90 100 110 120 130 140 pF1KE4 GAVHKSCLEKWLSSSNTSYCELCHTEFAVEKRPRPLTEWLKDPGPRTEKRTLCCDMVCFL ::.:::..:..::.: ::::. .: .: . .:: .: : .:.: . :... . CCDS38 RFVHQSCLHQWIKSSDTRCCELCKYDFIMETKLKPLRKWEKLQMTTSERRKIFCSVTFHV 90 100 110 120 130 140 150 160 170 180 190 200 pF1KE4 FITPLAAISGWLCLRGAQDHLRLHSQLEAVGLIALTIALFTIYVLWT--LVSFRYHCQLY . .. : .. . . .... ... ..: . : .. . .: :: . .:..: CCDS38 IAITCVVWSLYVLIDRTAEEIK-QGNDNGVLEWPFWTKLVVVAIGFTGGLVFMYVQCKVY 150 160 170 180 190 200 210 220 230 240 pF1KE4 SE-WRKTNQKVRLKIREADSPEGPQHSPLAAGLLKKVAEETPV . ::. . :. 
CCDS38 VQLWRRLKAYNRVIFVQNCPDTAKKLEKNFSCNVNTDIKDAVVVPVPQTGANSLPSAEGG
         210       220       230       240       250       260


246 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 01:45:53 2016 done: Sun Nov 6 01:45:53 2016
 Total Scan time: 2.500 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]