FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5971, 518 aa
1>>>pF1KB5971 518 - 518 aa - 518 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.8652+/-0.00035; mu= 11.0970+/- 0.022
mean_var=115.4235+/-23.042, 0's: 0 Z-trim(117.4): 29 B-trim: 197 in 1/56
Lambda= 0.119379
statistics sampled from 29340 (29369) to 29340 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.712), E-opt: 0.2 (0.344), width: 16
Scan time: 10.680
The best scores are: opt bits E(85289)
NP_055296 (OMIM: 300773) DNA-(apurinic or apyrimid ( 518) 3553 623.0 6.5e-178
NP_001258677 (OMIM: 300773) DNA-(apurinic or apyri ( 347) 2411 426.2 7.5e-119
NP_542379 (OMIM: 107748) DNA-(apurinic or apyrimid ( 318) 187 43.2 0.0014
NP_001632 (OMIM: 107748) DNA-(apurinic or apyrimid ( 318) 187 43.2 0.0014
NP_001231178 (OMIM: 107748) DNA-(apurinic or apyri ( 318) 187 43.2 0.0014
NP_542380 (OMIM: 107748) DNA-(apurinic or apyrimid ( 318) 187 43.2 0.0014
>>NP_055296 (OMIM: 300773) DNA-(apurinic or apyrimidinic (518 aa)
initn: 3553 init1: 3553 opt: 3553 Z-score: 3315.2 bits: 623.0 E(85289): 6.5e-178
Smith-Waterman score: 3553; 100.0% identity (100.0% similar) in 518 aa overlap (1-518:1-518)
10 20 30 40 50 60
pF1KB5 MLRVVSWNINGIRRPLQGVANQEPSNCAAVAVGRILDELDADIVCLQETKVTRDALTEPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 MLRVVSWNINGIRRPLQGVANQEPSNCAAVAVGRILDELDADIVCLQETKVTRDALTEPL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 AIVEGYNSYFSFSRNRSGYSGVATFCKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 AIVEGYNSYFSFSRNRSGYSGVATFCKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 QEELRALDSEGRALLTQHKIRTWEGKEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 QEELRALDSEGRALLTQHKIRTWEGKEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQI
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 RAEALLAAGSHVIILGDLNTAHRPIDHWDAVNLECFEEDPGRKWMDSLLSNLGCQSASHV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 RAEALLAAGSHVIILGDLNTAHRPIDHWDAVNLECFEEDPGRKWMDSLLSNLGCQSASHV
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB5 GPFIDSYRCFQPKQEGAFTCWSAVTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 GPFIDSYRCFQPKQEGAFTCWSAVTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVM
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB5 GSDHCPVGAVLSVSSVPAKQCPPLCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 GSDHCPVGAVLSVSSVPAKQCPPLCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHN
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB5 NQTRVQTCQNKAQVRSTRPQPSQVGSSRGQKNLKSYFQPSPSCPQASPDIELPSLPLMSA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 NQTRVQTCQNKAQVRSTRPQPSQVGSSRGQKNLKSYFQPSPSCPQASPDIELPSLPLMSA
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB5 LMTPKTPEEKAVAKVVKGQAKTSEAKDEKELRTSFWKSVLAGPLRTPLCGGHREPCVMRT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 LMTPKTPEEKAVAKVVKGQAKTSEAKDEKELRTSFWKSVLAGPLRTPLCGGHREPCVMRT
430 440 450 460 470 480
490 500 510
pF1KB5 VKKPGPNLGRRFYMCARPRGPPTDPSSRCNFFLWSRPS
::::::::::::::::::::::::::::::::::::::
NP_055 VKKPGPNLGRRFYMCARPRGPPTDPSSRCNFFLWSRPS
490 500 510
>>NP_001258677 (OMIM: 300773) DNA-(apurinic or apyrimidi (347 aa)
initn: 2411 init1: 2411 opt: 2411 Z-score: 2254.7 bits: 426.2 E(85289): 7.5e-119
Smith-Waterman score: 2411; 100.0% identity (100.0% similar) in 347 aa overlap (172-518:1-347)
150 160 170 180 190 200
pF1KB5 TWEGKEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAGSHVIILGDLNTA
::::::::::::::::::::::::::::::
NP_001 MRFYRLLQIRAEALLAAGSHVIILGDLNTA
10 20 30
210 220 230 240 250 260
pF1KB5 HRPIDHWDAVNLECFEEDPGRKWMDSLLSNLGCQSASHVGPFIDSYRCFQPKQEGAFTCW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 HRPIDHWDAVNLECFEEDPGRKWMDSLLSNLGCQSASHVGPFIDSYRCFQPKQEGAFTCW
40 50 60 70 80 90
270 280 290 300 310 320
pF1KB5 SAVTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SAVTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQC
100 110 120 130 140 150
330 340 350 360 370 380
pF1KB5 PPLCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PPLCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQP
160 170 180 190 200 210
390 400 410 420 430 440
pF1KB5 SQVGSSRGQKNLKSYFQPSPSCPQASPDIELPSLPLMSALMTPKTPEEKAVAKVVKGQAK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SQVGSSRGQKNLKSYFQPSPSCPQASPDIELPSLPLMSALMTPKTPEEKAVAKVVKGQAK
220 230 240 250 260 270
450 460 470 480 490 500
pF1KB5 TSEAKDEKELRTSFWKSVLAGPLRTPLCGGHREPCVMRTVKKPGPNLGRRFYMCARPRGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 TSEAKDEKELRTSFWKSVLAGPLRTPLCGGHREPCVMRTVKKPGPNLGRRFYMCARPRGP
280 290 300 310 320 330
510
pF1KB5 PTDPSSRCNFFLWSRPS
:::::::::::::::::
NP_001 PTDPSSRCNFFLWSRPS
340
>>NP_542379 (OMIM: 107748) DNA-(apurinic or apyrimidinic (318 aa)
initn: 276 init1: 107 opt: 187 Z-score: 185.2 bits: 43.2 E(85289): 0.0014
Smith-Waterman score: 285; 26.9% identity (51.9% similar) in 320 aa overlap (2-313:62-318)
10 20
pF1KB5 MLRVVSWNINGIR-----RPLQGVANQEPSN
:.. :::..:.: . :. : .. :
NP_542 KNDKEAAGEGPALYEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAP--
40 50 60 70 80
30 40 50 60 70 80
pF1KB5 CAAVAVGRILDELDADIVCLQETKVTRDALTEPLAIVEGYN-SYFSFSRNRSGYSGVATF
::.:::::: ... : : . : . .:.: .. :::::
NP_542 ---------------DILCLQETKCSENKLPAELQELPGLSHQYWSAPSDKEGYSGV---
90 100 110 120 130
90 100 110 120 130 140
pF1KB5 CKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFTQEELRALDSEGRALLTQHKIRTWEG
::.. : :: :: :. :.:::.....
NP_542 ---------------GLLSRQCPLKVSYGIGDE---EH----DQEGRVIVAEF-------
140 150 160
150 160 170 180 190 200
pF1KB5 KEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPI
...:...: :.: : :: ...:. . .. ..: :. . ... ::::.::. :
NP_542 --DSFVLVTAYVPNAGRGLV-RLEYRQRWDEAFRKFLKGL-ASRKPLVLCGDLNVAHEEI
170 180 190 200 210
210 220 230 240 250 260
pF1KB5 DHWDAVNLECFEEDPGRKWMDSLLSN--LGCQSASHVGPFIDSYRCFQPKQEGAFTCWSA
: ... : : .. . : .. :. ::.: . :. :.: :.
NP_542 D---------LRNPKGNKKNAGFTPQERQGFGELLQAVPLADSFRHLYPNTPYAYTFWTY
220 230 240 250 260
270 280 290 300 310 320
pF1KB5 VTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQCPP
. .:: : : :::: : ...: . .. : . ...::::::. :..
NP_542 MMNARSKNVGWRLDYFLLSHSL-LPALCDSKIRSKALGSDHCPITLYLAL
270 280 290 300 310
330 340 350 360 370 380
pF1KB5 LCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQPSQ
>>NP_001632 (OMIM: 107748) DNA-(apurinic or apyrimidinic (318 aa)
initn: 276 init1: 107 opt: 187 Z-score: 185.2 bits: 43.2 E(85289): 0.0014
Smith-Waterman score: 285; 26.9% identity (51.9% similar) in 320 aa overlap (2-313:62-318)
10 20
pF1KB5 MLRVVSWNINGIR-----RPLQGVANQEPSN
:.. :::..:.: . :. : .. :
NP_001 KNDKEAAGEGPALYEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAP--
40 50 60 70 80
30 40 50 60 70 80
pF1KB5 CAAVAVGRILDELDADIVCLQETKVTRDALTEPLAIVEGYN-SYFSFSRNRSGYSGVATF
::.:::::: ... : : . : . .:.: .. :::::
NP_001 ---------------DILCLQETKCSENKLPAELQELPGLSHQYWSAPSDKEGYSGV---
90 100 110 120 130
90 100 110 120 130 140
pF1KB5 CKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFTQEELRALDSEGRALLTQHKIRTWEG
::.. : :: :: :. :.:::.....
NP_001 ---------------GLLSRQCPLKVSYGIGDE---EH----DQEGRVIVAEF-------
140 150 160
150 160 170 180 190 200
pF1KB5 KEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPI
...:...: :.: : :: ...:. . .. ..: :. . ... ::::.::. :
NP_001 --DSFVLVTAYVPNAGRGLV-RLEYRQRWDEAFRKFLKGL-ASRKPLVLCGDLNVAHEEI
170 180 190 200 210
210 220 230 240 250 260
pF1KB5 DHWDAVNLECFEEDPGRKWMDSLLSN--LGCQSASHVGPFIDSYRCFQPKQEGAFTCWSA
: ... : : .. . : .. :. ::.: . :. :.: :.
NP_001 D---------LRNPKGNKKNAGFTPQERQGFGELLQAVPLADSFRHLYPNTPYAYTFWTY
220 230 240 250 260
270 280 290 300 310 320
pF1KB5 VTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQCPP
. .:: : : :::: : ...: . .. : . ...::::::. :..
NP_001 MMNARSKNVGWRLDYFLLSHSL-LPALCDSKIRSKALGSDHCPITLYLAL
270 280 290 300 310
330 340 350 360 370 380
pF1KB5 LCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQPSQ
>>NP_001231178 (OMIM: 107748) DNA-(apurinic or apyrimidi (318 aa)
initn: 276 init1: 107 opt: 187 Z-score: 185.2 bits: 43.2 E(85289): 0.0014
Smith-Waterman score: 285; 26.9% identity (51.9% similar) in 320 aa overlap (2-313:62-318)
10 20
pF1KB5 MLRVVSWNINGIR-----RPLQGVANQEPSN
:.. :::..:.: . :. : .. :
NP_001 KNDKEAAGEGPALYEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAP--
40 50 60 70 80
30 40 50 60 70 80
pF1KB5 CAAVAVGRILDELDADIVCLQETKVTRDALTEPLAIVEGYN-SYFSFSRNRSGYSGVATF
::.:::::: ... : : . : . .:.: .. :::::
NP_001 ---------------DILCLQETKCSENKLPAELQELPGLSHQYWSAPSDKEGYSGV---
90 100 110 120 130
90 100 110 120 130 140
pF1KB5 CKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFTQEELRALDSEGRALLTQHKIRTWEG
::.. : :: :: :. :.:::.....
NP_001 ---------------GLLSRQCPLKVSYGIGDE---EH----DQEGRVIVAEF-------
140 150 160
150 160 170 180 190 200
pF1KB5 KEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPI
...:...: :.: : :: ...:. . .. ..: :. . ... ::::.::. :
NP_001 --DSFVLVTAYVPNAGRGLV-RLEYRQRWDEAFRKFLKGL-ASRKPLVLCGDLNVAHEEI
170 180 190 200 210
210 220 230 240 250 260
pF1KB5 DHWDAVNLECFEEDPGRKWMDSLLSN--LGCQSASHVGPFIDSYRCFQPKQEGAFTCWSA
: ... : : .. . : .. :. ::.: . :. :.: :.
NP_001 D---------LRNPKGNKKNAGFTPQERQGFGELLQAVPLADSFRHLYPNTPYAYTFWTY
220 230 240 250 260
270 280 290 300 310 320
pF1KB5 VTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQCPP
. .:: : : :::: : ...: . .. : . ...::::::. :..
NP_001 MMNARSKNVGWRLDYFLLSHSL-LPALCDSKIRSKALGSDHCPITLYLAL
270 280 290 300 310
330 340 350 360 370 380
pF1KB5 LCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQPSQ
>>NP_542380 (OMIM: 107748) DNA-(apurinic or apyrimidinic (318 aa)
initn: 276 init1: 107 opt: 187 Z-score: 185.2 bits: 43.2 E(85289): 0.0014
Smith-Waterman score: 285; 26.9% identity (51.9% similar) in 320 aa overlap (2-313:62-318)
10 20
pF1KB5 MLRVVSWNINGIR-----RPLQGVANQEPSN
:.. :::..:.: . :. : .. :
NP_542 KNDKEAAGEGPALYEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAP--
40 50 60 70 80
30 40 50 60 70 80
pF1KB5 CAAVAVGRILDELDADIVCLQETKVTRDALTEPLAIVEGYN-SYFSFSRNRSGYSGVATF
::.:::::: ... : : . : . .:.: .. :::::
NP_542 ---------------DILCLQETKCSENKLPAELQELPGLSHQYWSAPSDKEGYSGV---
90 100 110 120 130
90 100 110 120 130 140
pF1KB5 CKDNATPVAAEEGLSGLFATQNGDVGCYGNMDEFTQEELRALDSEGRALLTQHKIRTWEG
::.. : :: :: :. :.:::.....
NP_542 ---------------GLLSRQCPLKVSYGIGDE---EH----DQEGRVIVAEF-------
140 150 160
150 160 170 180 190 200
pF1KB5 KEKTLTLINVYCPHADPGRPERLVFKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPI
...:...: :.: : :: ...:. . .. ..: :. . ... ::::.::. :
NP_542 --DSFVLVTAYVPNAGRGLV-RLEYRQRWDEAFRKFLKGL-ASRKPLVLCGDLNVAHEEI
170 180 190 200 210
210 220 230 240 250 260
pF1KB5 DHWDAVNLECFEEDPGRKWMDSLLSN--LGCQSASHVGPFIDSYRCFQPKQEGAFTCWSA
: ... : : .. . : .. :. ::.: . :. :.: :.
NP_542 D---------LRNPKGNKKNAGFTPQERQGFGELLQAVPLADSFRHLYPNTPYAYTFWTY
220 230 240 250 260
270 280 290 300 310 320
pF1KB5 VTGARHLNYGSRLDYVLGDRTLVIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQCPP
. .:: : : :::: : ...: . .. : . ...::::::. :..
NP_542 MMNARSKNVGWRLDYFLLSHSL-LPALCDSKIRSKALGSDHCPITLYLAL
270 280 290 300 310
330 340 350 360 370 380
pF1KB5 LCTRFLPEFAGTQLKILRFLVPLEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQPSQ
518 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 21:41:24 2016 done: Fri Nov 4 21:41:25 2016
Total Scan time: 10.680 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]