FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5460, 334 aa
1>>>pF1KE5460 334 - 334 aa - 334 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4287+/-0.00109; mu= 14.9684+/- 0.065
mean_var=58.9560+/-12.018, 0's: 0 Z-trim(102.2): 27 B-trim: 274 in 1/50
Lambda= 0.167036
statistics sampled from 6840 (6862) to 6840 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.571), E-opt: 0.2 (0.211), width: 16
Scan time: 2.290
The best scores are:                                       opt  bits E(32554)
CCDS1874.1 MDH1 gene_id:4190|Hs108|chr2            ( 334) 2155 528.0 4.3e-150
CCDS56121.1 MDH1 gene_id:4190|Hs108|chr2           ( 352) 2148 526.3 1.4e-149
CCDS56122.1 MDH1 gene_id:4190|Hs108|chr2           ( 245) 1594 392.7 1.6e-109
CCDS82561.1 MDH1B gene_id:130752|Hs108|chr2        ( 420)  453 117.8 1.5e-26
CCDS63102.1 MDH1B gene_id:130752|Hs108|chr2        ( 517)  453 117.9 1.9e-26
CCDS33365.1 MDH1B gene_id:130752|Hs108|chr2        ( 518)  453 117.9 1.9e-26
>>CCDS1874.1 MDH1 gene_id:4190|Hs108|chr2 (334 aa)
initn: 2155 init1: 2155 opt: 2155 Z-score: 2808.2 bits: 528.0 E(32554): 4.3e-150
Smith-Waterman score: 2155; 100.0% identity (100.0% similar) in 334 aa overlap (1-334:1-334)
               10        20        30        40        50        60
pF1KE5 MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKDQPIILVLLDITPMMGVLDGVLMELQDC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKDQPIILVLLDITPMMGVLDGVLMELQDC
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE5 ALPLLKDVIATDKEDVAFKDLDVAILVGSMPRREGMERKDLLKANVKIFKSQGAALDKYA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 ALPLLKDVIATDKEDVAFKDLDVAILVGSMPRREGMERKDLLKANVKIFKSQGAALDKYA
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE5 KKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNRAKAQIALKLGVTANDVKN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 KKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNRAKAQIALKLGVTANDVKN
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE5 VIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGEFVTTVQQRGAAVIKARKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 VIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGEFVTTVQQRGAAVIKARKL
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE5 SSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKFV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 SSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKFV
              250       260       270       280       290       300
              310       320       330
pF1KE5 EGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
       ::::::::::::::::::::::::::::::::::
CCDS18 EGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
              310       320       330
>>CCDS56121.1 MDH1 gene_id:4190|Hs108|chr2 (352 aa)
initn: 2148 init1: 2148 opt: 2148 Z-score: 2798.7 bits: 526.3 E(32554): 1.4e-149
Smith-Waterman score: 2148; 100.0% identity (100.0% similar) in 333 aa overlap (2-334:20-352)
                                 10        20        30        40
pF1KE5                   MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKDQPIILVLLD
                          :::::::::::::::::::::::::::::::::::::::::
CCDS56 MRRCSYFPKDVTVFDKDDKSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKDQPIILVLLD
               10        20        30        40        50        60
             50        60        70        80        90       100
pF1KE5 ITPMMGVLDGVLMELQDCALPLLKDVIATDKEDVAFKDLDVAILVGSMPRREGMERKDLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 ITPMMGVLDGVLMELQDCALPLLKDVIATDKEDVAFKDLDVAILVGSMPRREGMERKDLL
               70        80        90       100       110       120
            110       120       130       140       150       160
pF1KE5 KANVKIFKSQGAALDKYAKKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 KANVKIFKSQGAALDKYAKKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNR
              130       140       150       160       170       180
            170       180       190       200       210       220
pF1KE5 AKAQIALKLGVTANDVKNVIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AKAQIALKLGVTANDVKNVIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGE
              190       200       210       220       230       240
            230       240       250       260       270       280
pF1KE5 FVTTVQQRGAAVIKARKLSSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 FVTTVQQRGAAVIKARKLSSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPD
              250       260       270       280       290       300
            290       300       310       320       330
pF1KE5 DLLYSFPVVIKNKTWKFVEGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 DLLYSFPVVIKNKTWKFVEGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
              310       320       330       340       350
>>CCDS56122.1 MDH1 gene_id:4190|Hs108|chr2 (245 aa)
initn: 1594 init1: 1594 opt: 1594 Z-score: 2079.7 bits: 392.7 E(32554): 1.6e-109
Smith-Waterman score: 1594; 100.0% identity (100.0% similar) in 245 aa overlap (90-334:1-245)
      60        70        80        90       100       110
pF1KE5 CALPLLKDVIATDKEDVAFKDLDVAILVGSMPRREGMERKDLLKANVKIFKSQGAALDKY
                                     ::::::::::::::::::::::::::::::
CCDS56                               MPRREGMERKDLLKANVKIFKSQGAALDKY
                                             10        20        30
     120       130       140       150       160       170
pF1KE5 AKKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNRAKAQIALKLGVTANDVK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AKKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNRAKAQIALKLGVTANDVK
               40        50        60        70        80        90
     180       190       200       210       220       230
pF1KE5 NVIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGEFVTTVQQRGAAVIKARK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 NVIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGEFVTTVQQRGAAVIKARK
              100       110       120       130       140       150
     240       250       260       270       280       290
pF1KE5 LSSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LSSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKF
              160       170       180       190       200       210
     300       310       320       330
pF1KE5 VEGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
       :::::::::::::::::::::::::::::::::::
CCDS56 VEGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
              220       230       240
>>CCDS82561.1 MDH1B gene_id:130752|Hs108|chr2 (420 aa)
initn: 364 init1: 212 opt: 453 Z-score: 589.9 bits: 117.8 E(32554): 1.5e-26
Smith-Waterman score: 453; 28.4% identity (62.0% similar) in 334 aa overlap (4-328:33-360)
10 20 30
pF1KE5 MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKD
:..: .:.:.. :.:. . .: :::
CCDS82 TELMMVIAQENLGAHIEKEQEEEALKTCINPLQVWITSASAPACYNLIPILTSGEVFGMH
10 20 30 40 50 60
40 50 60 70 80 90
pF1KE5 QPIILVLLDITPMMGVLDGVLMELQDCALPLLKDVIATDKEDVAFKDLDVAILVGSMPRR
: ..:.: : ....: :: : :.:..: : . ::.. : ... . .
CCDS82 TEISITLFDNKQAEEHLKSLVVETQDLASPVLRSVSICTKVEEAFRQAHVIVVLDDSTNK
70 80 90 100 110 120
100 110 120 130 140 150
pF1KE5 EGMERKDLLKANVKIFKSQGAALDKYAKKSVKVIVVGNP-ANTNCLTASKSAPSIPKENF
: . .: :.. : . . : ..: :..::.::: : .: . . . :: : .. .
CCDS82 EVFTLEDCLRSRVPLCRLYGYLIEKNAHESVRVIVGGRTFVNLKTVLLMRYAPRIAHNII
130 140 150 160 170 180
160 170 180 190 200
pF1KE5 SCLTRLDHNRAKAQIALKLGVTANDVKNVIIWGNHSSTQYPDVNHAKV-KLQGKEVG---
. .. ..::: .: :: .. . .:.:::::: :...: :. ...: . .. :
CCDS82 AVALGVE-GEAKAILARKLKTAPSYIKDVIIWGNISGNNYVDLRKTRVYRYESAIWGPLH
190 200 210 220 230 240
210 220 230 240 250 260
pF1KE5 ----VYEALKDDSWLKGEFVTTVQQRGAAVIKARKLSSAMSAAKAICDHVRDIWFGTPEG
: . . :. :.: :::. ... . .:.... . ::..: .. . :.: :
CCDS82 YSRPVLNLIFDSEWVKREFVAILKN---LTTTGRQFGGIL-AAHSIATTLKYWYHGSPPG
250 260 270 280 290
270 280 290 300 310 320
pF1KE5 EFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKFVEGLPINDFSREKMDLTAKELTEEK
:.::.:..:.:. .:.: ...:.:: ..: :: . : ..:.. : ...: .::
CCDS82 EIVSLGILSEGQ-FGIPKGIVFSMPVKFENGTWVVLTDLKDVEISEQIMTRMTSDLIQEK
300 310 320 330 340 350
330
pF1KE5 ESAFEFLSSA
:.
CCDS82 LVALGDKIHFQPYQSGHKDLVPDEEKNLAMSDAAEFPNQIPQTTFEKPQSLEFLNEFEGK
360 370 380 390 400 410
>>CCDS63102.1 MDH1B gene_id:130752|Hs108|chr2 (517 aa)
initn: 364 init1: 212 opt: 453 Z-score: 588.4 bits: 117.9 E(32554): 1.9e-26
Smith-Waterman score: 453; 28.4% identity (62.0% similar) in 334 aa overlap (4-328:131-458)
10 20 30
pF1KE5 MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKD
:..: .:.:.. :.:. . .: :::
CCDS63 TELMMVIAQENLGAHIEKEQEEEALKTCINPLQVWITSASAPACYNLIPILTSGEVFGMH
110 120 130 140 150 160
40 50 60 70 80 90
pF1KE5 QPIILVLLDITPMMGVLDGVLMELQDCALPLLKDVIATDKEDVAFKDLDVAILVGSMPRR
: ..:.: : ....: :: : :.:..: : . ::.. : ... . .
CCDS63 TEISITLFDNKQAEEHLKSLVVETQDLASPVLRSVSICTKVEEAFRQAHVIVVLDDSTNK
170 180 190 200 210 220
100 110 120 130 140 150
pF1KE5 EGMERKDLLKANVKIFKSQGAALDKYAKKSVKVIVVGNP-ANTNCLTASKSAPSIPKENF
: . .: :.. : . . : ..: :..::.::: : .: . . . :: : .. .
CCDS63 EVFTLEDCLRSRVPLCRLYGYLIEKNAHESVRVIVGGRTFVNLKTVLLMRYAPRIAHNII
230 240 250 260 270 280
160 170 180 190 200
pF1KE5 SCLTRLDHNRAKAQIALKLGVTANDVKNVIIWGNHSSTQYPDVNHAKV-KLQGKEVG---
. .. ..::: .: :: .. . .:.:::::: :...: :. ...: . .. :
CCDS63 AVALGVE-GEAKAILARKLKTAPSYIKDVIIWGNISGNNYVDLRKTRVYRYESAIWGPLH
290 300 310 320 330
210 220 230 240 250 260
pF1KE5 ----VYEALKDDSWLKGEFVTTVQQRGAAVIKARKLSSAMSAAKAICDHVRDIWFGTPEG
: . . :. :.: :::. ... . .:.... . ::..: .. . :.: :
CCDS63 YSRPVLNLIFDSEWVKREFVAILKN---LTTTGRQFGGIL-AAHSIATTLKYWYHGSPPG
340 350 360 370 380 390
270 280 290 300 310 320
pF1KE5 EFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKFVEGLPINDFSREKMDLTAKELTEEK
:.::.:..:.:. .:.: ...:.:: ..: :: . : ..:.. : ...: .::
CCDS63 EIVSLGILSEGQ-FGIPKGIVFSMPVKFENGTWVVLTDLKDVEISEQIMTRMTSDLIQEK
400 410 420 430 440 450
330
pF1KE5 ESAFEFLSSA
:.
CCDS63 LVALGDKIHFQPYQSGHKDLVPDEEKNLAMSDAEFPNQIPQTTFEKPQSLEFLNEFEGKT
460 470 480 490 500 510
>>CCDS33365.1 MDH1B gene_id:130752|Hs108|chr2 (518 aa)
initn: 292 init1: 212 opt: 453 Z-score: 588.4 bits: 117.9 E(32554): 1.9e-26
Smith-Waterman score: 453; 28.4% identity (62.0% similar) in 334 aa overlap (4-328:131-458)
10 20 30
pF1KE5 MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKD
:..: .:.:.. :.:. . .: :::
CCDS33 TELMMVIAQENLGAHIEKEQEEEALKTCINPLQVWITSASAPACYNLIPILTSGEVFGMH
110 120 130 140 150 160
40 50 60 70 80 90
pF1KE5 QPIILVLLDITPMMGVLDGVLMELQDCALPLLKDVIATDKEDVAFKDLDVAILVGSMPRR
: ..:.: : ....: :: : :.:..: : . ::.. : ... . .
CCDS33 TEISITLFDNKQAEEHLKSLVVETQDLASPVLRSVSICTKVEEAFRQAHVIVVLDDSTNK
170 180 190 200 210 220
100 110 120 130 140 150
pF1KE5 EGMERKDLLKANVKIFKSQGAALDKYAKKSVKVIVVGNP-ANTNCLTASKSAPSIPKENF
: . .: :.. : . . : ..: :..::.::: : .: . . . :: : .. .
CCDS33 EVFTLEDCLRSRVPLCRLYGYLIEKNAHESVRVIVGGRTFVNLKTVLLMRYAPRIAHNII
230 240 250 260 270 280
160 170 180 190 200
pF1KE5 SCLTRLDHNRAKAQIALKLGVTANDVKNVIIWGNHSSTQYPDVNHAKV-KLQGKEVG---
. .. ..::: .: :: .. . .:.:::::: :...: :. ...: . .. :
CCDS33 AVALGVE-GEAKAILARKLKTAPSYIKDVIIWGNISGNNYVDLRKTRVYRYESAIWGPLH
290 300 310 320 330
210 220 230 240 250 260
pF1KE5 ----VYEALKDDSWLKGEFVTTVQQRGAAVIKARKLSSAMSAAKAICDHVRDIWFGTPEG
: . . :. :.: :::. ... . .:.... . ::..: .. . :.: :
CCDS33 YSRPVLNLIFDSEWVKREFVAILKN---LTTTGRQFGGIL-AAHSIATTLKYWYHGSPPG
340 350 360 370 380 390
270 280 290 300 310 320
pF1KE5 EFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKFVEGLPINDFSREKMDLTAKELTEEK
:.::.:..:.:. .:.: ...:.:: ..: :: . : ..:.. : ...: .::
CCDS33 EIVSLGILSEGQ-FGIPKGIVFSMPVKFENGTWVVLTDLKDVEISEQIMTRMTSDLIQEK
400 410 420 430 440 450
330
pF1KE5 ESAFEFLSSA
:.
CCDS33 LVALGDKIHFQPYQSGHKDLVPDEEKNLAMSDAAEFPNQIPQTTFEKPQSLEFLNEFEGK
460 470 480 490 500 510
334 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 00:59:30 2016 done: Tue Nov 8 00:59:30 2016
Total Scan time: 2.290 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]