FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7626, 372 aa
1>>>pF1KB7626 372 - 372 aa - 372 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.6778+/-0.000829; mu= 5.1595+/- 0.050
mean_var=249.6288+/-49.626, 0's: 0 Z-trim(116.1): 7 B-trim: 0 in 0/54
Lambda= 0.081176
statistics sampled from 16668 (16675) to 16668 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.785), E-opt: 0.2 (0.512), width: 16
Scan time: 2.890
The best scores are: opt bits E(32554)
CCDS8630.1 YBX3 gene_id:8531|Hs108|chr12 ( 372) 2538 309.5 3.1e-84
CCDS44831.1 YBX3 gene_id:8531|Hs108|chr12 ( 303) 1313 165.9 4.1e-41
CCDS470.1 YBX1 gene_id:4904|Hs108|chr1 ( 324) 780 103.5 2.7e-22
CCDS11098.1 YBX2 gene_id:51087|Hs108|chr17 ( 364) 746 99.6 4.5e-21
>>CCDS8630.1 YBX3 gene_id:8531|Hs108|chr12 (372 aa)
initn: 2538 init1: 2538 opt: 2538 Z-score: 1625.8 bits: 309.5 E(32554): 3.1e-84
Smith-Waterman score: 2538; 99.7% identity (100.0% similar) in 372 aa overlap (1-372:1-372)
10 20 30 40 50 60
pF1KB7 MSEAGEATTTTTTTLPQAPTEAAAAAPQDPAPKSPVGSGAPQAAAPAPAAHVAGNPGGDA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 MSEAGEATTTTTTTLPQAPTEAAAAAPQDPAPKSPVGSGAPQAAAPAPAAHVAGNPGGDA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 APAATGTAAAASLAAAAGSEDAEKKVLATKVLGTVKWFNVRNGYGFINRNDTKEDVFVHQ
::::::::::::::.:::::::::::::::::::::::::::::::::::::::::::::
CCDS86 APAATGTAAAASLATAAGSEDAEKKVLATKVLGTVKWFNVRNGYGFINRNDTKEDVFVHQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 TAIKKNNPRKYLRSVGDGETVEFDVVEGEKGAEAANVTGPDGVPVEGSRYAADRRRYRRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 TAIKKNNPRKYLRSVGDGETVEFDVVEGEKGAEAANVTGPDGVPVEGSRYAADRRRYRRG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 YYGRRRGPPRNYAGEEEEEGSGSSEGFDPPATDRQFSGARNQLRRPQYRPQYRQRRFPPY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 YYGRRRGPPRNYAGEEEEEGSGSSEGFDPPATDRQFSGARNQLRRPQYRPQYRQRRFPPY
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 HVGQTFDRRSRVLPHPNRIQAGEIGEMKDGVPEGAQLQGPVHRNPTYRPRYRSRGPPRPR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 HVGQTFDRRSRVLPHPNRIQAGEIGEMKDGVPEGAQLQGPVHRNPTYRPRYRSRGPPRPR
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB7 PAPAVGEAEDKENQQATSGPNQPSVRRGYRRPYNYRRRPRPPNAPSQDGKEAKAGEAPTE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS86 PAPAVGEAEDKENQQATSGPNQPSVRRGYRRPYNYRRRPRPPNAPSQDGKEAKAGEAPTE
310 320 330 340 350 360
370
pF1KB7 NPAPPTQQSSAE
::::::::::::
CCDS86 NPAPPTQQSSAE
370
>>CCDS44831.1 YBX3 gene_id:8531|Hs108|chr12 (303 aa)
initn: 2030 init1: 1248 opt: 1313 Z-score: 851.6 bits: 165.9 E(32554): 4.1e-41
Smith-Waterman score: 1896; 81.2% identity (81.5% similar) in 372 aa overlap (1-372:1-303)
10 20 30 40 50 60
pF1KB7 MSEAGEATTTTTTTLPQAPTEAAAAAPQDPAPKSPVGSGAPQAAAPAPAAHVAGNPGGDA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MSEAGEATTTTTTTLPQAPTEAAAAAPQDPAPKSPVGSGAPQAAAPAPAAHVAGNPGGDA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 APAATGTAAAASLAAAAGSEDAEKKVLATKVLGTVKWFNVRNGYGFINRNDTKEDVFVHQ
::::::::::::::.:::::::::::::::::::::::::::::::::::::::::::::
CCDS44 APAATGTAAAASLATAAGSEDAEKKVLATKVLGTVKWFNVRNGYGFINRNDTKEDVFVHQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 TAIKKNNPRKYLRSVGDGETVEFDVVEGEKGAEAANVTGPDGVPVEGSRYAADRRRYRRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 TAIKKNNPRKYLRSVGDGETVEFDVVEGEKGAEAANVTGPDGVPVEGSRYAADRRRYRRG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 YYGRRRGPPRNYAGEEEEEGSGSSEGFDPPATDRQFSGARNQLRRPQYRPQYRQRRFPPY
:::::::::::
CCDS44 YYGRRRGPPRN-------------------------------------------------
190
250 260 270 280 290 300
pF1KB7 HVGQTFDRRSRVLPHPNRIQAGEIGEMKDGVPEGAQLQGPVHRNPTYRPRYRSRGPPRPR
::::::::::::::::::::::::::::::::::::::::
CCDS44 --------------------AGEIGEMKDGVPEGAQLQGPVHRNPTYRPRYRSRGPPRPR
200 210 220 230
310 320 330 340 350 360
pF1KB7 PAPAVGEAEDKENQQATSGPNQPSVRRGYRRPYNYRRRPRPPNAPSQDGKEAKAGEAPTE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PAPAVGEAEDKENQQATSGPNQPSVRRGYRRPYNYRRRPRPPNAPSQDGKEAKAGEAPTE
240 250 260 270 280 290
370
pF1KB7 NPAPPTQQSSAE
::::::::::::
CCDS44 NPAPPTQQSSAE
300
>>CCDS470.1 YBX1 gene_id:4904|Hs108|chr1 (324 aa)
initn: 1013 init1: 609 opt: 780 Z-score: 513.9 bits: 103.5 E(32554): 2.7e-22
Smith-Waterman score: 1093; 57.7% identity (71.6% similar) in 338 aa overlap (41-372:10-324)
20 30 40 50 60
pF1KB7 TTTTLPQAPTEAAAAAPQDPAPKSPVGSGAPQAAAPA-PAAHVAGN-PGGDAAPAATGTA
: :: :: :: .: . :: .. :..:
CCDS47 MSSEAETQQPPAAPPAAPALSAADTKPGTTGSGAGSGGP
10 20 30
70 80 90 100 110 120
pF1KB7 AAASLAAAAGSEDAEKKVLATKVLGTVKWFNVRNGYGFINRNDTKEDVFVHQTAIKKNNP
.. . :: :: ..:::.:::::::::::::::::::::::::::::::::::::::::
CCDS47 GGLTSAAPAG---GDKKVIATKVLGTVKWFNVRNGYGFINRNDTKEDVFVHQTAIKKNNP
40 50 60 70 80 90
130 140 150 160 170 180
pF1KB7 RKYLRSVGDGETVEFDVVEGEKGAEAANVTGPDGVPVEGSRYAADRRRYRRGYYGRRRGP
:::::::::::::::::::::::::::::::: ::::.::.::::: .::: : :::::
CCDS47 RKYLRSVGDGETVEFDVVEGEKGAEAANVTGPGGVPVQGSKYAADRNHYRR--YPRRRGP
100 110 120 130 140 150
190 200 210 220 230 240
pF1KB7 PRNYAGEEEEEGSGSSEGFDPPATDRQFSGARNQLRRPQYRPQYRQRRFPPYHVGQTFDR
:::: . .. :: .. . : . : : ::: ::.::::::.. . . :
CCDS47 PRNYQQNYQNSESGEKNEGSESAPEGQ-----AQQRRP-----YRRRRFPPYYMRRPYGR
160 170 180 190 200
250 260 270 280 290 300
pF1KB7 RSRVLPHPNRIQAGEIGEMKDGVPEGAQLQG-PVHRNPT--YRPRYRSRGPPRPRPAPAV
: . : .: ::. : :. .:: :: ::..: ::::.: ::::: :
CCDS47 RPQYSNPP--VQ-GEVMEGADN--QGAGEQGRPVRQNMYRGYRPRFR-RGPPRQRQPRED
210 220 230 240 250
310 320 330 340 350 360
pF1KB7 GEAEDKENQQATSGPNQPSVRRGYRRPYNYRRRPRPPNAPSQDGKEAKAGEAPTENP-AP
:. :::::: . .:: :: ::: .::::: :: : :::::.::.. :.:: ::
CCDS47 GNEEDKENQGDETQGQQPPQRR-YRRNFNYRRR-RPENPKPQDGKETKAADPPAENSSAP
260 270 280 290 300 310
370
pF1KB7 PTQQSSAE
..:..::
CCDS47 EAEQGGAE
320
>>CCDS11098.1 YBX2 gene_id:51087|Hs108|chr17 (364 aa)
initn: 659 init1: 590 opt: 746 Z-score: 491.7 bits: 99.6 E(32554): 4.5e-21
Smith-Waterman score: 843; 46.2% identity (64.3% similar) in 381 aa overlap (1-366:1-360)
10 20 30 40 50
pF1KB7 MSE---AGEATTTTTTTLPQAPTEAAAAAPQDPA--PKSPVGSGAPQAAAPAPAAHVAGN
::: :. ::.. ..:.: . . ..:.. :: :.. :.:. .:: .::: . .
CCDS11 MSEVEAAAGATAVPAATVPATAAGVVAVVVPVPAGEPQKGGGAGGGGGAASGPAAGTPSA
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 PGGDAAPAATGTAAAASLAAAAGSEDAEKKVLATKVLGTVKWFNVRNGYGFINRNDTKED
::. . :. .::.... : : :. :.: ::: .:::::::::::::::::::::::::
CCDS11 PGSRT-PGNPATAVSGTPAPPARSQ-ADKPVLAIQVLGTVKWFNVRNGYGFINRNDTKED
70 80 90 100 110
120 130 140 150 160 170
pF1KB7 VFVHQTAIKKNNPRKYLRSVGDGETVEFDVVEGEKGAEAANVTGPDGVPVEGSRYAADRR
:::::::::.:::::.:::::::::::::::::::::::.::::: ::::.::::: .::
CCDS11 VFVHQTAIKRNNPRKFLRSVGDGETVEFDVVEGEKGAEATNVTGPGGVPVKGSRYAPNRR
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB7 RYRRGYYGR--RRGPPRNYAGEEEEEGSG-SSEGFDPPATDRQFSGARNQLRRPQYRPQY
. :: . : .:: : : :.: .:.: : : :: : :: . :
CCDS11 KSRR-FIPRPPSVAPPPMVA-EIPSAGTGPGSKG--ERAED---SGQRP--RR--WCP--
180 190 200 210 220
240 250 260 270 280 290
pF1KB7 RQRRFPPYHVGQTFDRRSRVLPHPNRIQAGEIGEMKDGVP-EGAQLQGPVHRNPT-YRPR
::. . : : : . . :.. . : :. .: :: : :: . : .:::
CCDS11 -----PPFFYRRRFVRGPRPPNQQQPIEGTDRVEPKETAPLEGHQQQGDERVPPPRFRPR
230 240 250 260 270 280
300 310 320 330 340
pF1KB7 YRSRGPPRPRPAPAV--GEAEDKENQQATSGPNQPSVRRGYRRPYNYRRR---PRPPNAP
:: :::: :.. :..: : .: ..: ..: .: ::: ::: : : .::
CCDS11 YRRPFRPRPRQQPTTEGGDGETKPSQGPADG-SRPEPQRPRNRPYFQRRRQQAPGPQQAP
290 300 310 320 330
350 360 370
pF1KB7 SQDGKEAKAGEAPTENPAPPTQQSSAE
. : ::... : :
CCDS11 GPRQPAAPETSAPVNSGDPTTTILE
340 350 360
372 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 04:08:54 2016 done: Mon Nov 7 04:08:54 2016
Total Scan time: 2.890 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]