FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1743, 245 aa
  1>>>pF1KE1743 245 - 245 aa - 245 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.6551+/-0.000926; mu= 6.1639+/- 0.056
 mean_var=209.0342+/-41.909, 0's: 0 Z-trim(114.2): 131 B-trim: 0 in 0/53
 Lambda= 0.088709
 statistics sampled from 14617 (14754) to 14617 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.779), E-opt: 0.2 (0.453), width: 16
 Scan time: 2.040

The best scores are:                                  opt  bits E(32554)
CCDS226.1 C1QA gene_id:712|Hs108|chr1         ( 245) 1718 231.5 3.9e-61
CCDS228.1 C1QB gene_id:713|Hs108|chr1         ( 253)  601  88.6 4.3e-18
CCDS227.1 C1QC gene_id:714|Hs108|chr1         ( 245)  539  80.7 1e-15
CCDS3414.1 C1QTNF7 gene_id:114905|Hs108|chr4  ( 289)  450  69.3 3.1e-12
CCDS47025.1 C1QTNF7 gene_id:114905|Hs108|chr4 ( 296)  450  69.3 3.1e-12
CCDS31793.1 C1QL4 gene_id:338761|Hs108|chr12  ( 238)  420  65.4 3.9e-11
CCDS8420.1 C1QTNF5 gene_id:114902|Hs108|chr11 ( 243)  416  64.9 5.6e-11
CCDS3284.1 ADIPOQ gene_id:9370|Hs108|chr3     ( 244)  415  64.8 6.1e-11

>>CCDS226.1 C1QA gene_id:712|Hs108|chr1 (245 aa)
 initn: 1718 init1: 1718 opt: 1718 Z-score: 1211.0 bits: 231.5 E(32554): 3.9e-61
Smith-Waterman score: 1718; 100.0% identity (100.0% similar) in 245 aa overlap (1-245:1-245)

               10        20        30        40        50        60
pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 TGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 TGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAI
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 
RRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVSSSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS22 RRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVSSSR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 GQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGSEADSVFSGFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS22 GQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGSEADSVFSGFL 190 200 210 220 230 240 pF1KE1 IFPSA ::::: CCDS22 IFPSA >>CCDS228.1 C1QB gene_id:713|Hs108|chr1 (253 aa) initn: 342 init1: 202 opt: 601 Z-score: 438.3 bits: 88.6 E(32554): 4.3e-18 Smith-Waterman score: 601; 41.3% identity (67.1% similar) in 252 aa overlap (1-243:3-249) 10 20 30 40 50 pF1KE1 MEGPRGWL--VLCVLAISLASMVTEDL-CRAPD---GKKGEAGRPGRRGRPGLKGEQG :. : : . .. .: ..: .. .: : .: : : : :: :.:: : .: CCDS22 MMMKIPWGSIPVLMLLLLLGLIDISQAQLSCTGPPAIPGIPGIPGTPGPDGQPGTPGIKG 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 EPGAPGIRTGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQ : : ::. .: .: :..:.:: ::::::: :: :: :. : :: : :: :. : CCDS22 EKGLPGL-AGDHGEFGEKGDPGIPGNPGKVGPKGPMGPKGGPGAPGAPGPKGESGDYKAT 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 PRPAFSAIRR-NPPMGGNVVI-FDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWE . :::: : : :. . .: :: ::::... :. .::.:.: ::: ::::... :. . CCDS22 QKIAFSATRTINVPLRRDQTIRFDHVITNMNNNYEPRSGKFTCKVPGLYYFTYHASSRGN 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 ICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGS .:.... . : .... . ::: . . :::..:::::.:.::..:... :.. . 
: CCDS22 LCVNLMRG-RERAQKVVTFCDYAYN-TFQVTTGGMVLKLEQGENVFLQATDKNSLL--GM 180 190 200 210 220 230 240 pF1KE1 E-ADSVFSGFLIFPSA : :.:.:::::.:: CCDS22 EGANSIFSGFLLFPDMEA 240 250 >>CCDS227.1 C1QC gene_id:714|Hs108|chr1 (245 aa) initn: 318 init1: 169 opt: 539 Z-score: 395.6 bits: 80.7 E(32554): 1e-15 Smith-Waterman score: 539; 38.8% identity (63.2% similar) in 242 aa overlap (8-243:15-244) 10 20 30 40 50 pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGE :.: .: . : .... : . : : : ::. : :: : .:: CCDS22 MDVGPSSLPHLGLKLLLLLLLLPLRGQANTG-CYGIPGMPGLPGAPGKDGYDGLPGPKGE 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 PGAPGIRTGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGN---IK :: :.: ::.: ::..:::: :.::: :: :: : :.:: : : ::. : CCDS22 PGIPAI-PGIRGPKGQKGEPGLPGHPGK---NGPMGPPGMPGVPGPMGIPGEPGEEGRYK 60 70 80 90 100 110 120 130 140 150 160 pF1KE1 DQPRPAFSAIRRN--PPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQ .. . .:.. :.. :: .... :..:.:: . :.. .:.:.: ::: :::.... CCDS22 QKFQSVFTVTRQTHQPPAPNSLIRFNAVLTNPQGDYDTSTGKFTCKVPGLYYFVYHASHT 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 WEICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHI-Y ..:. . :. . . :: :.: :: :::..:.:: :..::. . . CCDS22 ANLCVLLYRSGV----KVVTFCGHTSK-TNQVNSGGVLLRLQVGEEVWLAVNDYYDMVGI 180 190 200 210 220 230 230 240 pF1KE1 QGSEADSVFSGFLIFPSA ::: ::::::::.:: CCDS22 QGS--DSVFSGFLLFPD 240 >>CCDS3414.1 C1QTNF7 gene_id:114905|Hs108|chr4 (289 aa) initn: 318 init1: 150 opt: 450 Z-score: 333.2 bits: 69.3 E(32554): 3.1e-12 Smith-Waterman score: 450; 38.3% identity (60.4% similar) in 227 aa overlap (31-242:53-274) 10 20 30 40 50 60 pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIR : .:. : ::: :: : :::.:: :. :.: CCDS34 LKGENYSPRYICSIPGLPGPPGPPGANGSPGPHGRIGLPGRDGRDGRKGEKGEKGTAGLR 30 40 50 60 70 80 70 80 90 100 110 pF1KE1 -----TGIQGLKGDQGEPG---PSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQ :. : :::::: : : : :. : :: :: : .: : .: : :: . 
CCDS34 GKTGPLGLAGEKGDQGETGKKGPIGPEGEKGEVGPIGPPGPKGDRGEQGDPGLPGVCRCG 90 100 110 120 130 140 120 130 140 150 160 pF1KE1 P---RPAFSA-IRRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQV-LS . :::. : . : .::. :. :. : :. .:.:.:. :: :::.... :. CCDS34 SIVLKSAFSVGITTSYPEERLPIIFNKVLFNEGEHYNPATGKFICAFPGIYYFSYDITLA 150 160 170 180 190 200 170 180 190 200 210 220 pF1KE1 QWEICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEK--DPKKGH . .. ...: . :: : . :. : : .:.::. :. :: :.::.: ..: CCDS34 NKHLAIGLVHN--GQYR--IKTFDA-NTGNHDVASGSTVIYLQPEDEVWLEIFFTDQNGL 210 220 230 240 250 230 240 pF1KE1 IYQGSEADSVFSGFLIFPSA . . . :::.:::::.. CCDS34 FSDPGWADSLFSGFLLYVDTDYLDSISEDDEL 260 270 280 >>CCDS47025.1 C1QTNF7 gene_id:114905|Hs108|chr4 (296 aa) initn: 318 init1: 150 opt: 450 Z-score: 333.0 bits: 69.3 E(32554): 3.1e-12 Smith-Waterman score: 450; 38.3% identity (60.4% similar) in 227 aa overlap (31-242:60-281) 10 20 30 40 50 60 pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIR : .:. : ::: :: : :::.:: :. :.: CCDS47 LKGENYSPRYICSIPGLPGPPGPPGANGSPGPHGRIGLPGRDGRDGRKGEKGEKGTAGLR 30 40 50 60 70 80 70 80 90 100 110 pF1KE1 -----TGIQGLKGDQGEPG---PSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQ :. : :::::: : : : :. : :: :: : .: : .: : :: . CCDS47 GKTGPLGLAGEKGDQGETGKKGPIGPEGEKGEVGPIGPPGPKGDRGEQGDPGLPGVCRCG 90 100 110 120 130 140 120 130 140 150 160 pF1KE1 P---RPAFSA-IRRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQV-LS . :::. : . : .::. :. :. : :. .:.:.:. :: :::.... :. CCDS47 SIVLKSAFSVGITTSYPEERLPIIFNKVLFNEGEHYNPATGKFICAFPGIYYFSYDITLA 150 160 170 180 190 200 170 180 190 200 210 220 pF1KE1 QWEICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEK--DPKKGH . .. ...: . :: : . :. : : .:.::. :. :: :.::.: ..: CCDS47 NKHLAIGLVHN--GQYR--IKTFDA-NTGNHDVASGSTVIYLQPEDEVWLEIFFTDQNGL 210 220 230 240 250 260 230 240 pF1KE1 IYQGSEADSVFSGFLIFPSA . . . :::.:::::.. 
CCDS47 FSDPGWADSLFSGFLLYVDTDYLDSISEDDEL 270 280 290 >>CCDS31793.1 C1QL4 gene_id:338761|Hs108|chr12 (238 aa) initn: 297 init1: 167 opt: 420 Z-score: 313.4 bits: 65.4 E(32554): 3.9e-11 Smith-Waterman score: 421; 37.2% identity (58.4% similar) in 226 aa overlap (26-243:28-237) 10 20 30 40 50 pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRA---PDGKKGEAGRPGRRGRPGLKGEQGEPG :: : : .: :: : :. . :: CCDS31 MVLLLLVAIPLLVHSSRGPAHYEMLGRCRMVCDPHGPRG----PGPDGAPA-SVPPFPPG 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 APGI--RTGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQP : : : : ::.: : ::: : ::. : ::: :: : :: :. . : . : CCDS31 AKGEVGRRGKAGLRGPPGPPGPRGPPGEPGRPGPPGPPG----PGPGGVAPAAGYV---P 60 70 80 90 100 120 130 140 150 160 170 pF1KE1 RPAFSAIRRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICL : :: : : : : .:. :: :.:: . :. ::.:.: .:: :.:...:: . CCDS31 RIAFYAGLRRPHEGYEVLRFDDVVTNVGNAYEAASGKFTCPMPGVYFFAYHVLMRGGDGT 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE1 SIVSS--SRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGSE :. .. . :::: : :. .. .. .:....:.:. ::.:... : :... :. CCDS31 SMWADLMKNGQVRASAIAQDADQN--YDYASNSVILHLDVGDEVFIKLD--GGKVHGGNT 170 180 190 200 210 220 240 pF1KE1 AD-SVFSGFLIFPSA :.::::.:.: CCDS31 NKYSTFSGFIIYPD 230 >>CCDS8420.1 C1QTNF5 gene_id:114902|Hs108|chr11 (243 aa) initn: 279 init1: 210 opt: 416 Z-score: 310.6 bits: 64.9 E(32554): 5.6e-11 Smith-Waterman score: 416; 35.4% identity (56.0% similar) in 243 aa overlap (5-242:2-233) 10 20 30 40 50 60 pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIR : ::: .:... .: .: . :. :. : :: :: .: :: :: : CCDS84 MRPLLVLLLLGLAAGSPPLDD-NKIPSLCPGHPGLPGT---PGHHGSQGLPG----R 10 20 30 40 70 80 90 100 110 120 pF1KE1 TGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAI : .: : : :: .:. :. : ::: : : :: : : : :. . :: :::: CCDS84 DGRDGRDGAPGAPGEKGEGGRPGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAK 50 60 70 80 90 100 130 140 150 160 170 pF1KE1 R---RNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVS : : :: . . :: :..:.. :. 
.:.:.: ::: :::. .. . .. :.. CCDS84 RSESRVPPPSDAPLPFDRVLVNEQGHYDAVTGKFTCQVPGVYYFAVHA-TVYRASLQFDL 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE1 SSRGQ-VRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKG-HIYQGSEADSV . :. . . : : .::: ...:. :::::. :: . ..::. CCDS84 VKNGESIASFFQFFGGWPKP--ASLSGGAMVRLEPEDQVWVQVGVGDYIGIYASIKTDST 170 180 190 200 210 220 240 pF1KE1 FSGFLIFPSA :::::.. CCDS84 FSGFLVYSDWHSSPVFA 230 240 >>CCDS3284.1 ADIPOQ gene_id:9370|Hs108|chr3 (244 aa) initn: 404 init1: 180 opt: 415 Z-score: 309.8 bits: 64.8 E(32554): 6.1e-11 Smith-Waterman score: 416; 34.3% identity (62.9% similar) in 213 aa overlap (34-242:42-240) 10 20 30 40 50 60 pF1KE1 PRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIRTGI : :.::. : :: :..: :: CCDS32 ALPGHDQETTTQGPGVLLPLPKGACTGWMAGIPGHPGHNGAPGRDGRDGTPGE------- 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 QGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAIRRN .: ::: : ::.:. :..: :: :: ::.:::.: :: ::. : :::. .. CCDS32 KGEKGDPGLIGPKGDIGETGVPGAEGP---RGFPGIQGRKGEPGEGAYVYRSAFSVGLET 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 PPMGGNVVI-FDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVSSSRGQ :. : : .. ::.. :.. .:.: :..:: :::.... . .. :. : . CCDS32 YVTIPNMPIRFTKIFYNQQNHYDGSTGKFHCNIPGLYYFAYHI----TVYMKDVKVSLFK 130 140 150 160 170 190 200 210 220 230 pF1KE1 VRRSLGFC-DTTNKGLFQVVSGGMVLQLQQGDQVWVE--KDPKKGHIYQGSEADSVFSGF ... : : ... . .::...:.:. :::::.. . ... .: .. ::.:.:: CCDS32 KDKAMLFTYDQYQENNVDQASGSVLLHLEVGDQVWLQVYGEGERNGLYADNDNDSTFTGF 180 190 200 210 220 230 240 pF1KE1 LIFPSA :.. CCDS32 LLYHDTN 240 245 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 19:27:53 2016 done: Sun Nov 6 19:27:53 2016 Total Scan time: 2.040 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]