FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5825, 125 aa
1>>>pF1KB5825 125 - 125 aa - 125 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.3194+/-0.000338; mu= 16.9086+/- 0.021
mean_var=98.1454+/-24.186, 0's: 0 Z-trim(115.3): 130 B-trim: 1550 in 1/51
Lambda= 0.129461
statistics sampled from 25478 (25641) to 25478 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.694), E-opt: 0.2 (0.301), width: 16
Scan time: 4.150
The best scores are:                                       opt bits E(85289)
NP_057131 (OMIM: 607835) splicing factor 3B subuni ( 125)  833 165.0 2.8e-41
NP_001177938 (OMIM: 605221) serine/arginine-rich s ( 165)  166  40.6   0.001
NP_001287865 (OMIM: 605221) serine/arginine-rich s ( 172)  166  40.7  0.0011
NP_001177935 (OMIM: 605221) serine/arginine-rich s ( 173)  166  40.7  0.0011
NP_001177936 (OMIM: 605221) serine/arginine-rich s ( 182)  166  40.7  0.0011
NP_006616 (OMIM: 605221) serine/arginine-rich spli ( 183)  166  40.7  0.0011
NP_001287866 (OMIM: 605221) serine/arginine-rich s ( 217)  166  40.8  0.0012
NP_001177934 (OMIM: 605221) serine/arginine-rich s ( 261)  166  40.9  0.0013
NP_473357 (OMIM: 605221) serine/arginine-rich spli ( 262)  166  40.9  0.0013
XP_005250918 (OMIM: 604679) PREDICTED: polyadenyla ( 636)  156  39.6   0.008
NP_002559 (OMIM: 604679) polyadenylate-binding pro ( 636)  156  39.6   0.008
>>NP_057131 (OMIM: 607835) splicing factor 3B subunit 6 (125 aa)
initn: 833 init1: 833 opt: 833 Z-score: 862.1 bits: 165.0 E(85289): 2.8e-41
Smith-Waterman score: 833; 100.0% identity (100.0% similar) in 125 aa overlap (1-125:1-125)
               10        20        30        40        50        60
pF1KB5 MAMQAAKRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVGNTPETRGTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_057 MAMQAAKRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVGNTPETRGTA
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB5 YVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEEQLKLLKEKYGIN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_057 YVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEEQLKLLKEKYGIN
               70        80        90       100       110       120
pF1KB5 TDPPK
       :::::
NP_057 TDPPK
>>NP_001177938 (OMIM: 605221) serine/arginine-rich splic (165 aa)
initn: 141 init1: 84 opt: 166 Z-score: 187.6 bits: 40.6 E(85289): 0.001
Smith-Waterman score: 166; 33.7% identity (61.6% similar) in 86 aa overlap (14-96:7-90)
10 20 30 40 50
pF1KB5 MAMQAAKRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVG---NTPETR
:: : :..::. .:.. ::.:::: .. : : . :
NP_001 MSRYLRPP--NTSLFVRNVADDTRSEDLRREFGRYGPIVDVYVPLDFYTRRPR
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 GTAYVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEEQLKLLKEKY
: ::: .::. ::..: .:. .:.: . . . ...:
NP_001 GFAYVQFEDVRDAEDALHNLDRKWICGRQIEIQFAQGDRKTPNQMKAKEGRNVYSSSRYD
60 70 80 90 100 110
120
pF1KB5 GINTDPPK
NP_001 DYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRKPNCSWNTQYSSAYYTSRKI
120 130 140 150 160
>>NP_001287865 (OMIM: 605221) serine/arginine-rich splic (172 aa)
initn: 141 init1: 84 opt: 166 Z-score: 187.4 bits: 40.7 E(85289): 0.0011
Smith-Waterman score: 166; 33.7% identity (61.6% similar) in 86 aa overlap (14-96:7-90)
10 20 30 40 50
pF1KB5 MAMQAAKRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVG---NTPETR
:: : :..::. .:.. ::.:::: .. : : . :
NP_001 MSRYLRPP--NTSLFVRNVADDTRSEDLRREFGRYGPIVDVYVPLDFYTRRPR
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 GTAYVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEEQLKLLKEKY
: ::: .::. ::..: .:. .:.: . . . ...:
NP_001 GFAYVQFEDVRDAEDALHNLDRKWICGRQIEIQFAQGDRKTPNQMKAKEGRNVYSSSRYD
60 70 80 90 100 110
120
pF1KB5 GINTDPPK
NP_001 DYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRNRPTGRPRRSRSHSDNDSQVSKKKNE
120 130 140 150 160 170
>>NP_001177935 (OMIM: 605221) serine/arginine-rich splic (173 aa)
initn: 141 init1: 84 opt: 166 Z-score: 187.4 bits: 40.7 E(85289): 0.0011
Smith-Waterman score: 166; 33.7% identity (61.6% similar) in 86 aa overlap (14-96:7-90)
10 20 30 40 50
pF1KB5 MAMQAAKRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVG---NTPETR
:: : :..::. .:.. ::.:::: .. : : . :
NP_001 MSRYLRPP--NTSLFVRNVADDTRSEDLRREFGRYGPIVDVYVPLDFYTRRPR
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 GTAYVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEEQLKLLKEKY
: ::: .::. ::..: .:. .:.: . . . ...:
NP_001 GFAYVQFEDVRDAEDALHNLDRKWICGRQIEIQFAQGDRKTPNQMKAKEGRNVYSSSRYD
60 70 80 90 100 110
120
pF1KB5 GINTDPPK
NP_001 DYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRNSRPTGRPRRSRSHSDNDSQVSKKKN
120 130 140 150 160 170
>>NP_001177936 (OMIM: 605221) serine/arginine-rich splic (182 aa)
initn: 141 init1: 84 opt: 166 Z-score: 187.2 bits: 40.7 E(85289): 0.0011
Smith-Waterman score: 166; 33.7% identity (61.6% similar) in 86 aa overlap (14-96:7-90)
10 20 30 40 50
pF1KB5 MAMQAAKRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVG---NTPETR
:: : :..::. .:.. ::.:::: .. : : . :
NP_001 MSRYLRPP--NTSLFVRNVADDTRSEDLRREFGRYGPIVDVYVPLDFYTRRPR
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 GTAYVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEEQLKLLKEKY
: ::: .::. ::..: .:. .:.: . . . ...:
NP_001 GFAYVQFEDVRDAEDALHNLDRKWICGRQIEIQFAQGDRKTPNQMKAKEGRNVYSSSRYD
60 70 80 90 100 110
120
pF1KB5 GINTDPPK
NP_001 DYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRNRPTGRPRRSRSHSDNDRPNCSWNTQ
120 130 140 150 160 170
>>NP_006616 (OMIM: 605221) serine/arginine-rich splicing (183 aa)
initn: 141 init1: 84 opt: 166 Z-score: 187.2 bits: 40.7 E(85289): 0.0011
Smith-Waterman score: 166; 33.7% identity (61.6% similar) in 86 aa overlap (14-96:7-90)
10 20 30 40 50
pF1KB5 MAMQAAKRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVG---NTPETR
:: : :..::. .:.. ::.:::: .. : : . :
NP_006 MSRYLRPP--NTSLFVRNVADDTRSEDLRREFGRYGPIVDVYVPLDFYTRRPR
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 GTAYVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEEQLKLLKEKY
: ::: .::. ::..: .:. .:.: . . . ...:
NP_006 GFAYVQFEDVRDAEDALHNLDRKWICGRQIEIQFAQGDRKTPNQMKAKEGRNVYSSSRYD
60 70 80 90 100 110
120
pF1KB5 GINTDPPK
NP_006 DYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRNSRPTGRPRRSRSHSDNDRPNCSWNT
120 130 140 150 160 170
>>NP_001287866 (OMIM: 605221) serine/arginine-rich splic (217 aa)
initn: 141 init1: 84 opt: 166 Z-score: 186.4 bits: 40.8 E(85289): 0.0012
Smith-Waterman score: 166; 33.7% identity (61.6% similar) in 86 aa overlap (14-96:7-90)
10 20 30 40 50
pF1KB5 MAMQAAKRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVG---NTPETR
:: : :..::. .:.. ::.:::: .. : : . :
NP_001 MSRYLRPP--NTSLFVRNVADDTRSEDLRREFGRYGPIVDVYVPLDFYTRRPR
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 GTAYVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEEQLKLLKEKY
: ::: .::. ::..: .:. .:.: . . . ...:
NP_001 GFAYVQFEDVRDAEDALHNLDRKWICGRQIEIQFAQGDRKTPNQMKAKEGRNVYSSSRYD
60 70 80 90 100 110
120
pF1KB5 GINTDPPK
NP_001 DYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRNSRPTGRPRRSRSHSDNDRFKHRNRS
120 130 140 150 160 170
>>NP_001177934 (OMIM: 605221) serine/arginine-rich splic (261 aa)
initn: 157 init1: 84 opt: 166 Z-score: 185.6 bits: 40.9 E(85289): 0.0013
Smith-Waterman score: 166; 33.7% identity (61.6% similar) in 86 aa overlap (14-96:7-90)
10 20 30 40 50
pF1KB5 MAMQAAKRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVG---NTPETR
:: : :..::. .:.. ::.:::: .. : : . :
NP_001 MSRYLRPP--NTSLFVRNVADDTRSEDLRREFGRYGPIVDVYVPLDFYTRRPR
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 GTAYVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEEQLKLLKEKY
: ::: .::. ::..: .:. .:.: . . . ...:
NP_001 GFAYVQFEDVRDAEDALHNLDRKWICGRQIEIQFAQGDRKTPNQMKAKEGRNVYSSSRYD
60 70 80 90 100 110
120
pF1KB5 GINTDPPK
NP_001 DYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRNRPTGRPRRSRSHSDNDRFKHRNRSF
120 130 140 150 160 170
>>NP_473357 (OMIM: 605221) serine/arginine-rich splicing (262 aa)
initn: 157 init1: 84 opt: 166 Z-score: 185.6 bits: 40.9 E(85289): 0.0013
Smith-Waterman score: 166; 33.7% identity (61.6% similar) in 86 aa overlap (14-96:7-90)
10 20 30 40 50
pF1KB5 MAMQAAKRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVG---NTPETR
:: : :..::. .:.. ::.:::: .. : : . :
NP_473 MSRYLRPP--NTSLFVRNVADDTRSEDLRREFGRYGPIVDVYVPLDFYTRRPR
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 GTAYVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEEQLKLLKEKY
: ::: .::. ::..: .:. .:.: . . . ...:
NP_473 GFAYVQFEDVRDAEDALHNLDRKWICGRQIEIQFAQGDRKTPNQMKAKEGRNVYSSSRYD
60 70 80 90 100 110
120
pF1KB5 GINTDPPK
NP_473 DYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRNSRPTGRPRRSRSHSDNDRFKHRNRS
120 130 140 150 160 170
>>XP_005250918 (OMIM: 604679) PREDICTED: polyadenylate-b (636 aa)
initn: 110 init1: 87 opt: 156 Z-score: 171.6 bits: 39.6 E(85289): 0.008
Smith-Waterman score: 156; 26.9% identity (65.4% similar) in 104 aa overlap (21-120:193-295)
10 20 30 40 50
pF1KB5 MAMQAAKRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRV
.::.:. . :.. :.:::.:: ...:
XP_005 LNDRKVFVGRFKSRKEREAELGARAKEFTNVYIKNFGEDMDDERLKDLFGKFGPALSVKV
170 180 190 200 210 220
60 70 80 90 100
pF1KB5 --GNTPETRGTAYVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEE
.. ...: ..: .: ::..: :...: .. .. . : ... .. . :.: :
XP_005 MTDESGKSKGFGFVSFERHEDAQKAVDEMNGKELNGKQIYV-GRAQKKVERQTELKRKFE
230 240 250 260 270 280
110 120
pF1KB5 QLKLLK-EKY-GINTDPPK
:.: . .: :.:
XP_005 QMKQDRITRYQGVNLYVKNLDDGIDDERLRKEFSPFGTITSAKVMMEGGRSKGFGFVCFS
290 300 310 320 330 340
125 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 15:01:46 2016 done: Sat Nov 5 15:01:46 2016
Total Scan time: 4.150 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]