FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3898, 309 aa
1>>>pF1KE3898 309 - 309 aa - 309 aa
Library: human.CCDS.faa
18921897 residues in 33420 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2978+/-0.000719; mu= 16.1482+/- 0.043
mean_var=69.3837+/-13.817, 0's: 0 Z-trim(110.1): 10 B-trim: 349 in 1/49
Lambda= 0.153973
statistics sampled from 11531 (11539) to 11531 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.716), E-opt: 0.2 (0.345), width: 16
Scan time: 1.230
The best scores are: opt bits E(33420)
CCDS61839.1 NUBP1 gene_id:4682|Hs109|chr16 ( 309) 2095 474.0 6.8e-134
CCDS10543.1 NUBP1 gene_id:4682|Hs109|chr16 ( 320) 1356 309.8 1.8e-84
CCDS66898.1 NUBP2 gene_id:10101|Hs109|chr16 ( 211) 690 161.7 4.4e-40
CCDS10445.1 NUBP2 gene_id:10101|Hs109|chr16 ( 271) 690 161.8 5.4e-40
CCDS41940.1 NUBPL gene_id:80224|Hs109|chr14 ( 319) 636 149.9 2.5e-36
>>CCDS61839.1 NUBP1 gene_id:4682|Hs109|chr16 (309 aa)
initn: 2095 init1: 2095 opt: 2095 Z-score: 2517.5 bits: 474.0 E(33420): 6.8e-134
Smith-Waterman score: 2095; 100.0% identity (100.0% similar) in 309 aa overlap (1-309:1-309)
10 20 30 40 50 60
pF1KE3 MEEVPHDCPGADSAQAGRGASCQGCPNQRLCASGAGATPDTAIEEIKEKMKTVKHKILVL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 MEEVPHDCPGADSAQAGRGASCQGCPNQRLCASGAGATPDTAIEEIKEKMKTVKHKILVL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 SGKGGVGKSTFSAHLAHGLAEDENTQIALLDIDICGPSIPKIMGLEGEQYVEDNLGVMSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 SGKGGVGKSTFSAHLAHGLAEDENTQIALLDIDICGPSIPKIMGLEGEQYVEDNLGVMSV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE3 GFLLSSPDDAVIWRGPKKNGMIKQFLRDVDWGEVDYLIVDTPPGTSDEHLSVVRYLATAH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 GFLLSSPDDAVIWRGPKKNGMIKQFLRDVDWGEVDYLIVDTPPGTSDEHLSVVRYLATAH
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE3 IDGAVIITTPQEVSLQDVRKEINFCRKVKLPIIGVVENMSGFICPKCKKESQIFPPTTGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 IDGAVIITTPQEVSLQDVRKEINFCRKVKLPIIGVVENMSGFICPKCKKESQIFPPTTGG
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE3 AELMCQDLEVPLLGRVPLDPLIGKNCDKGQSFFIDAPDSPATLAYRSIIQRIQEFCNLHQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 AELMCQDLEVPLLGRVPLDPLIGKNCDKGQSFFIDAPDSPATLAYRSIIQRIQEFCNLHQ
250 260 270 280 290 300
pF1KE3 SKEENLISS
:::::::::
CCDS61 SKEENLISS
>>CCDS10543.1 NUBP1 gene_id:4682|Hs109|chr16 (320 aa)
initn: 2081 init1: 1356 opt: 1356 Z-score: 1630.1 bits: 309.8 E(33420): 1.8e-84
Smith-Waterman score: 2063; 96.6% identity (96.6% similar) in 320 aa overlap (1-309:1-320)
10 20 30 40 50 60
pF1KE3 MEEVPHDCPGADSAQAGRGASCQGCPNQRLCASGAGATPDTAIEEIKEKMKTVKHKILVL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MEEVPHDCPGADSAQAGRGASCQGCPNQRLCASGAGATPDTAIEEIKEKMKTVKHKILVL
10 20 30 40 50 60
70 80 90 100
pF1KE3 SGKGGVGKSTFSAHLAHGLAEDENTQIALLDIDICGPSIPKIMGLEGEQ-----------
:::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 SGKGGVGKSTFSAHLAHGLAEDENTQIALLDIDICGPSIPKIMGLEGEQVHQSGSGWSPV
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE3 YVEDNLGVMSVGFLLSSPDDAVIWRGPKKNGMIKQFLRDVDWGEVDYLIVDTPPGTSDEH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YVEDNLGVMSVGFLLSSPDDAVIWRGPKKNGMIKQFLRDVDWGEVDYLIVDTPPGTSDEH
130 140 150 160 170 180
170 180 190 200 210 220
pF1KE3 LSVVRYLATAHIDGAVIITTPQEVSLQDVRKEINFCRKVKLPIIGVVENMSGFICPKCKK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LSVVRYLATAHIDGAVIITTPQEVSLQDVRKEINFCRKVKLPIIGVVENMSGFICPKCKK
190 200 210 220 230 240
230 240 250 260 270 280
pF1KE3 ESQIFPPTTGGAELMCQDLEVPLLGRVPLDPLIGKNCDKGQSFFIDAPDSPATLAYRSII
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 ESQIFPPTTGGAELMCQDLEVPLLGRVPLDPLIGKNCDKGQSFFIDAPDSPATLAYRSII
250 260 270 280 290 300
290 300
pF1KE3 QRIQEFCNLHQSKEENLISS
::::::::::::::::::::
CCDS10 QRIQEFCNLHQSKEENLISS
310 320
>>CCDS66898.1 NUBP2 gene_id:10101|Hs109|chr16 (211 aa)
initn: 679 init1: 542 opt: 690 Z-score: 833.2 bits: 161.7 E(33420): 4.4e-40
Smith-Waterman score: 690; 53.0% identity (80.1% similar) in 181 aa overlap (112-292:24-202)
90 100 110 120 130 140
pF1KE3 DENTQIALLDIDICGPSIPKIMGLEGEQYVEDNLGVMSVGFLLSSPDDAVIWRGPKKNGM
:.....::::::: .::.::.:::::::..
CCDS66 MLGAQGRAVHQCDRGWAPVFLDREQSISLMSVGFLLEKPDEAVVWRGPKKNAL
10 20 30 40 50
150 160 170 180 190 200
pF1KE3 IKQFLRDVDWGEVDYLIVDTPPGTSDEHLSVVRYLATAHIDGAVIITTPQEVSLQDVRKE
::::. :: :::.:::.:::::::::::..... : . ::...:::: ::. :::.:
CCDS66 IKQFVSDVAWGELDYLVVDTPPGTSDEHMATIEALRPYQPLGALVVTTPQAVSVGDVRRE
60 70 80 90 100 110
210 220 230 240 250 260
pF1KE3 INFCRKVKLPIIGVVENMSGFICPKCKKESQIFPPTTGGAELMCQDLEVPLLGRVPLDPL
..::::. : ..:.::::::: ::.: . ...: . ::.: . : ::.:: :::::
CCDS66 LTFCRKTGLRVMGIVENMSGFTCPHCTECTSVF--SRGGGEELAQLAGVPFLGSVPLDPA
120 130 140 150 160 170
270 280 290 300
pF1KE3 IGKNCDKGQSFFIDAPDSPATLAYRSIIQRIQEFCNLHQSKEENLISS
. .. ..:..:. . : ::: : :: :.:
CCDS66 LMRTLEEGHDFIQEFPGSPAFAALTSIAQKILDATPACLP
180 190 200 210
>>CCDS10445.1 NUBP2 gene_id:10101|Hs109|chr16 (271 aa)
initn: 854 init1: 542 opt: 690 Z-score: 831.6 bits: 161.8 E(33420): 5.4e-40
Smith-Waterman score: 860; 50.2% identity (76.7% similar) in 253 aa overlap (53-292:13-262)
30 40 50 60 70 80
pF1KE3 QGCPNQRLCASGAGATPDTAIEEIKEKMKTVKHKILVLSGKGGVGKSTFSAHLAHGLAED
:.: ::::::::::::::.:..:: .: .
CCDS10 MEAAAEPGNLAGVRHIILVLSGKGGVGKSTISTELALAL-RH
10 20 30 40
90 100 110 120
pF1KE3 ENTQIALLDIDICGPSIPKIMGLEGEQY-------------VEDNLGVMSVGFLLSSPDD
. ....::.:.::::::...: .:. :.....::::::: .::.
CCDS10 AGKKVGILDVDLCGPSIPRMLGAQGRAVHQCDRGWAPVFLDREQSISLMSVGFLLEKPDE
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE3 AVIWRGPKKNGMIKQFLRDVDWGEVDYLIVDTPPGTSDEHLSVVRYLATAHIDGAVIITT
::.:::::::..::::. :: :::.:::.:::::::::::..... : . ::...::
CCDS10 AVVWRGPKKNALIKQFVSDVAWGELDYLVVDTPPGTSDEHMATIEALRPYQPLGALVVTT
110 120 130 140 150 160
190 200 210 220 230 240
pF1KE3 PQEVSLQDVRKEINFCRKVKLPIIGVVENMSGFICPKCKKESQIFPPTTGGAELMCQDLE
:: ::. :::.:..::::. : ..:.::::::: ::.: . ...: . ::.: . :
CCDS10 PQAVSVGDVRRELTFCRKTGLRVMGIVENMSGFTCPHCTECTSVF--SRGGGEELAQLAG
170 180 190 200 210
250 260 270 280 290 300
pF1KE3 VPLLGRVPLDPLIGKNCDKGQSFFIDAPDSPATLAYRSIIQRIQEFCNLHQSKEENLISS
::.:: ::::: . .. ..:..:. . : ::: : :: :.:
CCDS10 VPFLGSVPLDPALMRTLEEGHDFIQEFPGSPAFAALTSIAQKILDATPACLP
220 230 240 250 260 270
>>CCDS41940.1 NUBPL gene_id:80224|Hs109|chr14 (319 aa)
initn: 548 init1: 208 opt: 636 Z-score: 765.8 bits: 149.9 E(33420): 2.5e-36
Smith-Waterman score: 645; 40.4% identity (67.5% similar) in 302 aa overlap (10-288:11-306)
10 20 30 40
pF1KE3 MEEVPHDCPGADSAQAGRGASCQ-GCPNQRLCA---SGAGA-TPDTAIEEI------KE
:. : .:: ::. : .:. ::::. : .: :.
CCDS41 MGIWQRLLLFGGVSLRAGGGATAPLGGSRAMVCGRQLSGAGSETLKQRRTQIMSRGLPKQ
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE3 K-MKTVKHKILVLSGKGGVGKSTFSAHLAHGLAEDENTQ-IALLDIDICGPSIPKIMGLE
: .. ::. :.: :::::::::: ...:: .:: ..... :.:::.:. :::.::.:.:.
CCDS41 KPIEGVKQVIVVASGKGGVGKSTTAVNLALALAANDSSKAIGLLDVDVYGPSVPKMMNLK
70 80 90 100 110 120
110 120 130 140 150
pF1KE3 GEQYVED--------NLGV--MSVGFLLSSPDDAVIWRGPKKNGMIKQFLRDVDWGEVDY
:. . . : :. ::.:::. .. :.::: . :...::.::::..::
CCDS41 GNPELSQSNLMRPLLNYGIACMSMGFLVEE-SEPVVWRGLMVMSAIEKLLRQVDWGQLDY
130 140 150 160 170
160 170 180 190 200 210
pF1KE3 LIVDTPPGTSDEHLSVVRYLATAHIDGAVIITTPQEVSLQDVRKEINFCRKVKLPIIGVV
:.:: ::::.: .::: . . : ::::..:::...:.:..: .. :.:..:..:.:
CCDS41 LVVDMPPGTGDVQLSVSQNIP---ITGAVIVSTPQDIALMDAHKGAEMFRRVHVPVLGLV
180 190 200 210 220 230
220 230 240 250 260 270
pF1KE3 ENMSGFICPKCKKESQIFPPTTGGAELMCQDLEVPLLGRVPLDPLIGKNCDKGQSFFIDA
.::: : :::::....:: . ::. . : : . .:: .:: : . : :: . ..
CCDS41 QNMSVFQCPKCKHKTHIFG--ADGARKLAQTLGLEVLGDIPLHLNIREASDTGQPIVFSQ
240 250 260 270 280 290
280 290 300
pF1KE3 PDSPATLAYRSIIQRIQEFCNLHQSKEENLISS
:.: . :: :
CCDS41 PESDEAKAYLRIAVEVVRRLPSPSE
300 310
309 residues in 1 query sequences
18921897 residues in 33420 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Wed Aug 4 20:32:51 2021 done: Wed Aug 4 20:32:52 2021
Total Scan time: 1.230 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]