FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KSDA0669, 780 aa
1>>>pF1KSDA0669 780 - 780 aa - 780 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 17.1311+/-0.000517; mu= -34.1922+/- 0.032
mean_var=814.9057+/-174.061, 0's: 0 Z-trim(124.6): 42 B-trim: 878 in 1/60
Lambda= 0.044928
statistics sampled from 46619 (46678) to 46619 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.547), width: 16
Scan time: 12.110
The best scores are: opt bits E(85289)
NP_904358 (OMIM: 607715) TSC22 domain family prote (1073) 613 55.6 1.3e-06
NP_006013 (OMIM: 607715) TSC22 domain family prote ( 144) 402 41.2 0.0037
>>NP_904358 (OMIM: 607715) TSC22 domain family protein 1 (1073 aa)
initn: 794 init1: 337 opt: 613 Z-score: 239.5 bits: 55.6 E(85289): 1.3e-06
Smith-Waterman score: 967; 32.5% identity (58.5% similar) in 805 aa overlap (71-775:312-1071)
50 60 70 80 90
pF1KSD TEDVSSEIFDVSRATDYGPEEVCERSSSEETLNNVGDAETPGTVSPNLL--LDGQLAAA-
:.::: . . :. .::. . :.. .
NP_904 SSGSPASVMTNMRAPSTTGGIGINSVTGTSTVNNV-NITAVGSFNPNVTSSMLGNVNIST
290 300 310 320 330 340
100 110 120 130 140
pF1KSD -----AAAPANGGGVVSARSV---SGALASTLAAAATSAPAPGAPGGPQLAGSSAGPVTA
::. . : ::.:. .: :: .:....:. . .:.: :: ..: :..
NP_904 SNIPSAAGVSVGPGVTSGVNVNILSGMGNGTISSSAAVSSVPNAA-----AGMTGGSVSS
350 360 370 380 390
150 160 170 180 190 200
pF1KSD APSQPPTTCSSRFRVIKLDHGSGEPYRRGRWTCMEYYERD-----SDSSVLTRSGDCIRH
.: ::. .:::::.::: .:.::...::::: :.::.. ... .... . ...
NP_904 Q-QQQPTVNTSRFRVVKLD-SSSEPFKKGRWTCTEFYEKENAVPATEGVLINKVVETVKQ
400 410 420 430 440 450
210 220 230 240 250 260
pF1KSD SSTFDQTAERDSGLGATGGSVVVVVASMQGAHGP-ESGTDSSLTAVSQLPPSEKMSQPT-
. .. :.::.: :.. .: : ... . . : : :. . .. .: .....::.
NP_904 NP-IEVTSERESTSGSSVSSSVSTLSHYTESVGSGEMGAPTVVVQQQQQQQQQQQQQPAL
460 470 480 490 500 510
270 280 290 300
pF1KSD ---PAQPQSFSVGQPQPPPP-PVGGAVAQS--------SAPLPPFPGAATGPQPM---MA
: ..:. :: : . ...:: : : . : :. :.
NP_904 QGVTLQQMDFGSTGPQSIPAVSIPQSISQSQISQVQLQSQELSYQQKQGLQPVPLQATMS
520 530 540 550 560 570
310 320 330 340 350
pF1KSD AAQPSQPQGAGPGGQT----LPPTNVTLAQPAM----SLPPQPGPAVGAPAAQQPQQFAY
:: ::. .. : : :. .:::: . . :: : ::: :: :..
NP_904 AATGIQPSPVNVVGVTSALGQQPSISSLAQPQLPYSQAAPPVQTPLPGAPPPQQ-LQYGQ
580 590 600 610 620 630
360 370 380 390 400 410
pF1KSD PQP----QIPPGHLLPVQPSGQSEYLQQHVAGLQPPSPAQPSSTGAAASPATAATLPVGT
:: :. :::. : . :::.::. : .::::.:..:. ....::.
NP_904 QQPMVSTQMAPGHVKSVTQNPASEYVQQQPILQTAMSSGQPSSAGVGAG---TTVIPVAQ
640 650 660 670 680
420 430 440 450
pF1KSD GQNA------SSVGAQLMGASSQP-SEAMAPRTGPAQGGQVA--------PC---QP-TG
:. ..: :: ::: :: ..: : .. :.:.: : :: :
NP_904 PQGIQLPVQPTAVPAQPAGASVQPVGQAPAAVSAVPTGSQIANIGQQANIPTAVQQPSTQ
690 700 710 720 730 740
460 470 480 490 500
pF1KSD VPPATV--GG-----VVQPC-LGPAGAGQPQSVP--PPQM--GGSGPLSAVPGGPHAVVP
:::... :. :: : : : :.: : :. .... : .:: :..: :
NP_904 VPPSVIQQGAPPSSQVVPPAQTGIIHQGVQTSAPSLPQQLVIASQSSLLTVPPQPQGVEP
750 760 770 780 790 800
510 520 530 540 550
pF1KSD ---GV--PNVPAAVPAPSVPSVSTTSVT-------MPNVPAPLAQSQQLSSHTPVSRSSS
:. ..::. ::. :.:.:: . ::..:. :. :... .::......
NP_904 VAQGIVSQQLPAVSSLPSASSISVTSQVSSTGPSGMPSAPTNLVPPQNIA-QTPATQNGN
810 820 830 840 850 860
560 570 580 590 600
pF1KSD IIQHVGLPLAPGTHSA-PTS--LPQSDLSQFQTQT--QPLVGQVDDTRRKSEP----LPQ
..: :. : .:.. : . .: :. .::..:. : . .:..:.:: .:: :::
NP_904 LVQSVSQPPLIATNTNLPLAQQIPLSS-TQFSAQSLAQAIGSQIEDARRAAEPSLVGLPQ
870 880 890 900 910 920
610 620 630 640 650 660
pF1KSD PPLSLIAENKPVVKPPVADSLANPLQLTPMNSLATSVFSIAIP-VDGDEDRNPSTAFYQA
.: . . .:. ..::: .: :.. :. .. : :::...
NP_904 T-ISGDSGGMSAVSDGSSSSLAASASLFPLK-----VLPLTTPLVDGEDE----------
930 940 950 960 970
670 680 690 700 710 720
pF1KSD FHLNTLKESKSLWDSASGGGVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELVER
:.::..:::::::::::::::::::::::::::::::::::::.:.
NP_904 --------------SSSGASVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELIEK
980 990 1000 1010
730 740 750 760 770 780
pF1KSD NSLLERENALLKSLSSNDQLSQLPTQ--QANPGSTSQQQAVIAQPPQPTQPPQQPNVSSA
:: ::.:: :::.:.: .::.:. .: ..: .:.: :.. : ::.. . :
NP_904 NSQLEQENNLLKTLASPEQLAQFQAQLQTGSPPATTQPQGTTQPPAQPASQGSGPTA
1020 1030 1040 1050 1060 1070
>>NP_006013 (OMIM: 607715) TSC22 domain family protein 1 (144 aa)
initn: 384 init1: 337 opt: 402 Z-score: 177.7 bits: 41.2 E(85289): 0.0037
Smith-Waterman score: 402; 53.3% identity (77.8% similar) in 135 aa overlap (643-775:9-142)
620 630 640 650 660 670
pF1KSD KPVVKPPVADSLANPLQLTPMNSLATSVFSIAIPVDGDEDRNPSTAFYQAFHLNTLKESK
.:. . . :. : .: ... :.: . :
NP_006 MKSQWCRPVAMDLGVYQLRHFSISFLSSL-LGTENASV
10 20 30
680 690 700 710 720 730
pF1KSD SLWDSASGGGVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELVERNSLLERENAL
: .:.::..:::::::::::::::::::::::::::::::::::::.:.:: ::.:: :
NP_006 RLDNSSSGASVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELIEKNSQLEQENNL
40 50 60 70 80 90
740 750 760 770 780
pF1KSD LKSLSSNDQLSQLPTQ--QANPGSTSQQQAVIAQPPQPTQPPQQPNVSSA
::.:.: .::.:. .: ..: .:.: :.. : ::.. . :
NP_006 LKTLASPEQLAQFQAQLQTGSPPATTQPQGTTQPPAQPASQGSGPTA
100 110 120 130 140
780 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 02:38:38 2016 done: Thu Nov 3 02:38:40 2016
Total Scan time: 12.110 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]
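
The hit summary lines under "The best scores are" in the report above can be read into structured records with a short script. The following is a minimal sketch, assuming the single-space layout shown in this listing (accession, description, library sequence length in parentheses, then opt, bits and E-value). The names `Hit`, `HIT_RE` and `parse_hit` are hypothetical helpers introduced for illustration; they are not part of the FASTA package.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'The best scores are' table (hypothetical record type)."""
    accession: str
    description: str
    length: int      # library sequence length, e.g. 1073
    opt: int         # optimized score
    bits: float      # bit score
    evalue: float    # expectation value, E(85289) in this report


# Assumed layout: ACCESSION  DESCRIPTION  (LENGTH)  OPT  BITS  EVALUE
# The description is matched lazily so that "(OMIM: 607715)" stays inside it
# and only the last purely numeric parenthesised field is taken as the length.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<e>[\d.eE+-]+)\s*$"
)


def parse_hit(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None if the line does not match."""
    m = HIT_RE.match(line.strip())
    if m is None:
        return None
    return Hit(
        accession=m["acc"],
        description=m["desc"],
        length=int(m["len"]),
        opt=int(m["opt"]),
        bits=float(m["bits"]),
        evalue=float(m["e"]),
    )


if __name__ == "__main__":
    rows = [
        "NP_904358 (OMIM: 607715) TSC22 domain family prote (1073) 613 55.6 1.3e-06",
        "NP_006013 (OMIM: 607715) TSC22 domain family prote ( 144) 402 41.2 0.0037",
    ]
    for row in rows:
        print(parse_hit(row))
```

Lines that do not follow this layout, such as the table header itself, return `None`, so the function can be applied to every line of a report and the results filtered afterwards.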