FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3898, 309 aa 1>>>pF1KE3898 309 - 309 aa - 309 aa Library: human.CCDS.faa 18921897 residues in 33420 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2978+/-0.000719; mu= 16.1482+/- 0.043 mean_var=69.3837+/-13.817, 0's: 0 Z-trim(110.1): 10 B-trim: 349 in 1/49 Lambda= 0.153973 statistics sampled from 11531 (11539) to 11531 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.716), E-opt: 0.2 (0.345), width: 16 Scan time: 1.230 The best scores are: opt bits E(33420) CCDS61839.1 NUBP1 gene_id:4682|Hs109|chr16 ( 309) 2095 474.0 6.8e-134 CCDS10543.1 NUBP1 gene_id:4682|Hs109|chr16 ( 320) 1356 309.8 1.8e-84 CCDS66898.1 NUBP2 gene_id:10101|Hs109|chr16 ( 211) 690 161.7 4.4e-40 CCDS10445.1 NUBP2 gene_id:10101|Hs109|chr16 ( 271) 690 161.8 5.4e-40 CCDS41940.1 NUBPL gene_id:80224|Hs109|chr14 ( 319) 636 149.9 2.5e-36 >>CCDS61839.1 NUBP1 gene_id:4682|Hs109|chr16 (309 aa) initn: 2095 init1: 2095 opt: 2095 Z-score: 2517.5 bits: 474.0 E(33420): 6.8e-134 Smith-Waterman score: 2095; 100.0% identity (100.0% similar) in 309 aa overlap (1-309:1-309) 10 20 30 40 50 60 pF1KE3 MEEVPHDCPGADSAQAGRGASCQGCPNQRLCASGAGATPDTAIEEIKEKMKTVKHKILVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 MEEVPHDCPGADSAQAGRGASCQGCPNQRLCASGAGATPDTAIEEIKEKMKTVKHKILVL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 SGKGGVGKSTFSAHLAHGLAEDENTQIALLDIDICGPSIPKIMGLEGEQYVEDNLGVMSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 SGKGGVGKSTFSAHLAHGLAEDENTQIALLDIDICGPSIPKIMGLEGEQYVEDNLGVMSV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 GFLLSSPDDAVIWRGPKKNGMIKQFLRDVDWGEVDYLIVDTPPGTSDEHLSVVRYLATAH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 GFLLSSPDDAVIWRGPKKNGMIKQFLRDVDWGEVDYLIVDTPPGTSDEHLSVVRYLATAH 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 IDGAVIITTPQEVSLQDVRKEINFCRKVKLPIIGVVENMSGFICPKCKKESQIFPPTTGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 IDGAVIITTPQEVSLQDVRKEINFCRKVKLPIIGVVENMSGFICPKCKKESQIFPPTTGG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 AELMCQDLEVPLLGRVPLDPLIGKNCDKGQSFFIDAPDSPATLAYRSIIQRIQEFCNLHQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS61 AELMCQDLEVPLLGRVPLDPLIGKNCDKGQSFFIDAPDSPATLAYRSIIQRIQEFCNLHQ 250 260 270 280 290 300 pF1KE3 SKEENLISS ::::::::: CCDS61 SKEENLISS >>CCDS10543.1 NUBP1 gene_id:4682|Hs109|chr16 (320 aa) initn: 2081 init1: 1356 opt: 1356 Z-score: 1630.1 bits: 309.8 E(33420): 1.8e-84 Smith-Waterman score: 2063; 96.6% identity (96.6% similar) in 320 aa overlap (1-309:1-320) 10 20 30 40 50 60 pF1KE3 MEEVPHDCPGADSAQAGRGASCQGCPNQRLCASGAGATPDTAIEEIKEKMKTVKHKILVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MEEVPHDCPGADSAQAGRGASCQGCPNQRLCASGAGATPDTAIEEIKEKMKTVKHKILVL 10 20 30 40 50 60 70 80 90 100 pF1KE3 SGKGGVGKSTFSAHLAHGLAEDENTQIALLDIDICGPSIPKIMGLEGEQ----------- ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 SGKGGVGKSTFSAHLAHGLAEDENTQIALLDIDICGPSIPKIMGLEGEQVHQSGSGWSPV 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE3 YVEDNLGVMSVGFLLSSPDDAVIWRGPKKNGMIKQFLRDVDWGEVDYLIVDTPPGTSDEH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 YVEDNLGVMSVGFLLSSPDDAVIWRGPKKNGMIKQFLRDVDWGEVDYLIVDTPPGTSDEH 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE3 LSVVRYLATAHIDGAVIITTPQEVSLQDVRKEINFCRKVKLPIIGVVENMSGFICPKCKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LSVVRYLATAHIDGAVIITTPQEVSLQDVRKEINFCRKVKLPIIGVVENMSGFICPKCKK 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE3 ESQIFPPTTGGAELMCQDLEVPLLGRVPLDPLIGKNCDKGQSFFIDAPDSPATLAYRSII :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 ESQIFPPTTGGAELMCQDLEVPLLGRVPLDPLIGKNCDKGQSFFIDAPDSPATLAYRSII 250 260 270 280 290 300 290 300 pF1KE3 QRIQEFCNLHQSKEENLISS :::::::::::::::::::: CCDS10 QRIQEFCNLHQSKEENLISS 310 320 >>CCDS66898.1 NUBP2 gene_id:10101|Hs109|chr16 (211 aa) initn: 679 init1: 542 opt: 690 Z-score: 833.2 bits: 161.7 E(33420): 4.4e-40 Smith-Waterman score: 690; 53.0% identity (80.1% similar) in 181 aa overlap (112-292:24-202) 90 100 110 120 130 140 pF1KE3 DENTQIALLDIDICGPSIPKIMGLEGEQYVEDNLGVMSVGFLLSSPDDAVIWRGPKKNGM :.....::::::: .::.::.:::::::.. CCDS66 MLGAQGRAVHQCDRGWAPVFLDREQSISLMSVGFLLEKPDEAVVWRGPKKNAL 10 20 30 40 50 150 160 170 180 190 200 pF1KE3 IKQFLRDVDWGEVDYLIVDTPPGTSDEHLSVVRYLATAHIDGAVIITTPQEVSLQDVRKE ::::. :: :::.:::.:::::::::::..... : . ::...:::: ::. :::.: CCDS66 IKQFVSDVAWGELDYLVVDTPPGTSDEHMATIEALRPYQPLGALVVTTPQAVSVGDVRRE 60 70 80 90 100 110 210 220 230 240 250 260 pF1KE3 INFCRKVKLPIIGVVENMSGFICPKCKKESQIFPPTTGGAELMCQDLEVPLLGRVPLDPL ..::::. : ..:.::::::: ::.: . ...: . ::.: . : ::.:: ::::: CCDS66 LTFCRKTGLRVMGIVENMSGFTCPHCTECTSVF--SRGGGEELAQLAGVPFLGSVPLDPA 120 130 140 150 160 170 270 280 290 300 pF1KE3 IGKNCDKGQSFFIDAPDSPATLAYRSIIQRIQEFCNLHQSKEENLISS . .. ..:..:. . : ::: : :: :.: CCDS66 LMRTLEEGHDFIQEFPGSPAFAALTSIAQKILDATPACLP 180 190 200 210 >>CCDS10445.1 NUBP2 gene_id:10101|Hs109|chr16 (271 aa) initn: 854 init1: 542 opt: 690 Z-score: 831.6 bits: 161.8 E(33420): 5.4e-40 Smith-Waterman score: 860; 50.2% identity (76.7% similar) in 253 aa overlap (53-292:13-262) 30 40 50 60 70 80 pF1KE3 QGCPNQRLCASGAGATPDTAIEEIKEKMKTVKHKILVLSGKGGVGKSTFSAHLAHGLAED :.: ::::::::::::::.:..:: .: . CCDS10 MEAAAEPGNLAGVRHIILVLSGKGGVGKSTISTELALAL-RH 10 20 30 40 90 100 110 120 pF1KE3 ENTQIALLDIDICGPSIPKIMGLEGEQY-------------VEDNLGVMSVGFLLSSPDD . ....::.:.::::::...: .:. :.....::::::: .::. CCDS10 AGKKVGILDVDLCGPSIPRMLGAQGRAVHQCDRGWAPVFLDREQSISLMSVGFLLEKPDE 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE3 AVIWRGPKKNGMIKQFLRDVDWGEVDYLIVDTPPGTSDEHLSVVRYLATAHIDGAVIITT ::.:::::::..::::. :: :::.:::.:::::::::::..... : . ::...:: CCDS10 AVVWRGPKKNALIKQFVSDVAWGELDYLVVDTPPGTSDEHMATIEALRPYQPLGALVVTT 110 120 130 140 150 160 190 200 210 220 230 240 pF1KE3 PQEVSLQDVRKEINFCRKVKLPIIGVVENMSGFICPKCKKESQIFPPTTGGAELMCQDLE :: ::. :::.:..::::. : ..:.::::::: ::.: . ...: . ::.: . : CCDS10 PQAVSVGDVRRELTFCRKTGLRVMGIVENMSGFTCPHCTECTSVF--SRGGGEELAQLAG 170 180 190 200 210 250 260 270 280 290 300 pF1KE3 VPLLGRVPLDPLIGKNCDKGQSFFIDAPDSPATLAYRSIIQRIQEFCNLHQSKEENLISS ::.:: ::::: . .. ..:..:. . : ::: : :: :.: CCDS10 VPFLGSVPLDPALMRTLEEGHDFIQEFPGSPAFAALTSIAQKILDATPACLP 220 230 240 250 260 270 >>CCDS41940.1 NUBPL gene_id:80224|Hs109|chr14 (319 aa) initn: 548 init1: 208 opt: 636 Z-score: 765.8 bits: 149.9 E(33420): 2.5e-36 Smith-Waterman score: 645; 40.4% identity (67.5% similar) in 302 aa overlap (10-288:11-306) 10 20 30 40 pF1KE3 MEEVPHDCPGADSAQAGRGASCQ-GCPNQRLCA---SGAGA-TPDTAIEEI------KE :. : .:: ::. : .:. ::::. : .: :. CCDS41 MGIWQRLLLFGGVSLRAGGGATAPLGGSRAMVCGRQLSGAGSETLKQRRTQIMSRGLPKQ 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 K-MKTVKHKILVLSGKGGVGKSTFSAHLAHGLAEDENTQ-IALLDIDICGPSIPKIMGLE : .. ::. :.: :::::::::: ...:: .:: ..... :.:::.:. :::.::.:.:. CCDS41 KPIEGVKQVIVVASGKGGVGKSTTAVNLALALAANDSSKAIGLLDVDVYGPSVPKMMNLK 70 80 90 100 110 120 110 120 130 140 150 pF1KE3 GEQYVED--------NLGV--MSVGFLLSSPDDAVIWRGPKKNGMIKQFLRDVDWGEVDY :. . . : :. ::.:::. .. :.::: . :...::.::::..:: CCDS41 GNPELSQSNLMRPLLNYGIACMSMGFLVEE-SEPVVWRGLMVMSAIEKLLRQVDWGQLDY 130 140 150 160 170 160 170 180 190 200 210 pF1KE3 LIVDTPPGTSDEHLSVVRYLATAHIDGAVIITTPQEVSLQDVRKEINFCRKVKLPIIGVV :.:: ::::.: .::: . . : ::::..:::...:.:..: .. :.:..:..:.: CCDS41 LVVDMPPGTGDVQLSVSQNIP---ITGAVIVSTPQDIALMDAHKGAEMFRRVHVPVLGLV 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE3 ENMSGFICPKCKKESQIFPPTTGGAELMCQDLEVPLLGRVPLDPLIGKNCDKGQSFFIDA .::: : :::::....:: . ::. . : : . .:: .:: : . : :: . .. CCDS41 QNMSVFQCPKCKHKTHIFG--ADGARKLAQTLGLEVLGDIPLHLNIREASDTGQPIVFSQ 240 250 260 270 280 290 280 290 300 pF1KE3 PDSPATLAYRSIIQRIQEFCNLHQSKEENLISS :.: . :: : CCDS41 PESDEAKAYLRIAVEVVRRLPSPSE 300 310 309 residues in 1 query sequences 18921897 residues in 33420 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Wed Aug 4 20:32:51 2021 done: Wed Aug 4 20:32:52 2021 Total Scan time: 1.230 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]