FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KSDA0723, 847 aa
1>>>pF1KSDA0723 847 - 847 aa - 847 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.1842+/- 0.001; mu= 8.9759+/- 0.061
mean_var=181.6986+/-36.209, 0's: 0 Z-trim(111.8): 36 B-trim: 141 in 1/51
Lambda= 0.095148
statistics sampled from 12622 (12657) to 12622 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.722), E-opt: 0.2 (0.389), width: 16
Scan time: 4.610
The best scores are: opt bits E(32554)
CCDS4210.1 MATR3 gene_id:9782|Hs108|chr5 ( 847) 5688 793.6 0
CCDS54908.1 MATR3 gene_id:9782|Hs108|chr5 ( 559) 3630 511.0 2.3e-144
CCDS75316.1 MATR3 gene_id:9782|Hs108|chr5 ( 509) 3366 474.7 1.7e-133
>>CCDS4210.1 MATR3 gene_id:9782|Hs108|chr5 (847 aa)
initn: 5688 init1: 5688 opt: 5688 Z-score: 4229.5 bits: 793.6 E(32554): 0
Smith-Waterman score: 5688; 100.0% identity (100.0% similar) in 847 aa overlap (1-847:1-847)
10 20 30 40 50 60
pF1KSD MSKSFQQSSLSRDSQGHGRDLSAAGIGLLAAATQSLSMPASLGRMNQGTARLASLMNLGM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MSKSFQQSSLSRDSQGHGRDLSAAGIGLLAAATQSLSMPASLGRMNQGTARLASLMNLGM
10 20 30 40 50 60
70 80 90 100 110 120
pF1KSD SSSLNQQGAHSALSSASTSSHNLQSIFNIGSRGPLPLSSQHRGDADQASNILASFGLSAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 SSSLNQQGAHSALSSASTSSHNLQSIFNIGSRGPLPLSSQHRGDADQASNILASFGLSAR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KSD DLDELSRYPEDKITPENLPQILLQLKRRRTEEGPTLSYGRDGRSATREPPYRVPRDDWEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 DLDELSRYPEDKITPENLPQILLQLKRRRTEEGPTLSYGRDGRSATREPPYRVPRDDWEE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KSD KRHFRRDSFDDRGPSLNPVLDYDHGSRSQESGYYDRMDYEDDRLRDGERCRDDSFFGETS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 KRHFRRDSFDDRGPSLNPVLDYDHGSRSQESGYYDRMDYEDDRLRDGERCRDDSFFGETS
190 200 210 220 230 240
250 260 270 280 290 300
pF1KSD HNYHKFDSEYERMGRGPGPLQERSLFEKKRGAPPSSNIEDFHGLLPKGYPHLCSICDLPV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 HNYHKFDSEYERMGRGPGPLQERSLFEKKRGAPPSSNIEDFHGLLPKGYPHLCSICDLPV
250 260 270 280 290 300
310 320 330 340 350 360
pF1KSD HSNKEWSQHINGASHSRRCQLLLEIYPEWNPDNDTGHTMGDPFMLQQSTNPAPGILGPPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 HSNKEWSQHINGASHSRRCQLLLEIYPEWNPDNDTGHTMGDPFMLQQSTNPAPGILGPPP
310 320 330 340 350 360
370 380 390 400 410 420
pF1KSD PSFHLGGPAVGPRGNLGAGNGNLQGPRHMQKGRVETSRVVHIMDFQRGKNLRYQLLQLVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PSFHLGGPAVGPRGNLGAGNGNLQGPRHMQKGRVETSRVVHIMDFQRGKNLRYQLLQLVE
370 380 390 400 410 420
430 440 450 460 470 480
pF1KSD PFGVISNHLILNKINEAFIEMATTEDAQAAVDYYTTTPALVFGKPVRVHLSQKYKRIKKP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PFGVISNHLILNKINEAFIEMATTEDAQAAVDYYTTTPALVFGKPVRVHLSQKYKRIKKP
430 440 450 460 470 480
490 500 510 520 530 540
pF1KSD EGKPDQKFDQKQELGRVIHLSNLPHSGYSDSAVLKLAEPYGKIKNYILMRMKSQAFIEME
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 EGKPDQKFDQKQELGRVIHLSNLPHSGYSDSAVLKLAEPYGKIKNYILMRMKSQAFIEME
490 500 510 520 530 540
550 560 570 580 590 600
pF1KSD TREDAMAMVDHCLKKALWFQGRCVKVDLSEKYKKLVLRIPNRGIDLLKKDKSRKRSYSPD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 TREDAMAMVDHCLKKALWFQGRCVKVDLSEKYKKLVLRIPNRGIDLLKKDKSRKRSYSPD
550 560 570 580 590 600
610 620 630 640 650 660
pF1KSD GKESPSDKKSKTDGSQKTESSTEGKEQEEKSGEDGEKDTKDDQTEQEPNMLLESEDELLV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 GKESPSDKKSKTDGSQKTESSTEGKEQEEKSGEDGEKDTKDDQTEQEPNMLLESEDELLV
610 620 630 640 650 660
670 680 690 700 710 720
pF1KSD DEEEAAALLESGSSVGDETDLANLGDVASDGKKEPSDKAVKKDGSASAAAKKKLKKVDKI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 DEEEAAALLESGSSVGDETDLANLGDVASDGKKEPSDKAVKKDGSASAAAKKKLKKVDKI
670 680 690 700 710 720
730 740 750 760 770 780
pF1KSD EELDQENEAALENGIKNEENTEPGAESSENADDPNKDTSENADGQSDENKDDYTIPDEYR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 EELDQENEAALENGIKNEENTEPGAESSENADDPNKDTSENADGQSDENKDDYTIPDEYR
730 740 750 760 770 780
790 800 810 820 830 840
pF1KSD IGPYQPNVPVGIDYVIPKTGFYCKLCSLFYTNEEVAKNTHCSSLPHYQKLKKFLNKLAEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 IGPYQPNVPVGIDYVIPKTGFYCKLCSLFYTNEEVAKNTHCSSLPHYQKLKKFLNKLAEE
790 800 810 820 830 840
pF1KSD RRQKKET
:::::::
CCDS42 RRQKKET
>>CCDS54908.1 MATR3 gene_id:9782|Hs108|chr5 (559 aa)
initn: 3630 init1: 3630 opt: 3630 Z-score: 2705.3 bits: 511.0 E(32554): 2.3e-144
Smith-Waterman score: 3630; 99.1% identity (99.6% similar) in 549 aa overlap (299-847:11-559)
270 280 290 300 310 320
pF1KSD KRGAPPSSNIEDFHGLLPKGYPHLCSICDLPVHSNKEWSQHINGASHSRRCQLLLEIYPE
: .. .::::::::::::::::::::::::
CCDS54 MLGAQWRRNQPSRAAEEWSQHINGASHSRRCQLLLEIYPE
10 20 30 40
330 340 350 360 370 380
pF1KSD WNPDNDTGHTMGDPFMLQQSTNPAPGILGPPPPSFHLGGPAVGPRGNLGAGNGNLQGPRH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 WNPDNDTGHTMGDPFMLQQSTNPAPGILGPPPPSFHLGGPAVGPRGNLGAGNGNLQGPRH
50 60 70 80 90 100
390 400 410 420 430 440
pF1KSD MQKGRVETSRVVHIMDFQRGKNLRYQLLQLVEPFGVISNHLILNKINEAFIEMATTEDAQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MQKGRVETSRVVHIMDFQRGKNLRYQLLQLVEPFGVISNHLILNKINEAFIEMATTEDAQ
110 120 130 140 150 160
450 460 470 480 490 500
pF1KSD AAVDYYTTTPALVFGKPVRVHLSQKYKRIKKPEGKPDQKFDQKQELGRVIHLSNLPHSGY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 AAVDYYTTTPALVFGKPVRVHLSQKYKRIKKPEGKPDQKFDQKQELGRVIHLSNLPHSGY
170 180 190 200 210 220
510 520 530 540 550 560
pF1KSD SDSAVLKLAEPYGKIKNYILMRMKSQAFIEMETREDAMAMVDHCLKKALWFQGRCVKVDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SDSAVLKLAEPYGKIKNYILMRMKSQAFIEMETREDAMAMVDHCLKKALWFQGRCVKVDL
230 240 250 260 270 280
570 580 590 600 610 620
pF1KSD SEKYKKLVLRIPNRGIDLLKKDKSRKRSYSPDGKESPSDKKSKTDGSQKTESSTEGKEQE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SEKYKKLVLRIPNRGIDLLKKDKSRKRSYSPDGKESPSDKKSKTDGSQKTESSTEGKEQE
290 300 310 320 330 340
630 640 650 660 670 680
pF1KSD EKSGEDGEKDTKDDQTEQEPNMLLESEDELLVDEEEAAALLESGSSVGDETDLANLGDVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 EKSGEDGEKDTKDDQTEQEPNMLLESEDELLVDEEEAAALLESGSSVGDETDLANLGDVA
350 360 370 380 390 400
690 700 710 720 730 740
pF1KSD SDGKKEPSDKAVKKDGSASAAAKKKLKKVDKIEELDQENEAALENGIKNEENTEPGAESS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SDGKKEPSDKAVKKDGSASAAAKKKLKKVDKIEELDQENEAALENGIKNEENTEPGAESS
410 420 430 440 450 460
750 760 770 780 790 800
pF1KSD ENADDPNKDTSENADGQSDENKDDYTIPDEYRIGPYQPNVPVGIDYVIPKTGFYCKLCSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ENADDPNKDTSENADGQSDENKDDYTIPDEYRIGPYQPNVPVGIDYVIPKTGFYCKLCSL
470 480 490 500 510 520
810 820 830 840
pF1KSD FYTNEEVAKNTHCSSLPHYQKLKKFLNKLAEERRQKKET
:::::::::::::::::::::::::::::::::::::::
CCDS54 FYTNEEVAKNTHCSSLPHYQKLKKFLNKLAEERRQKKET
530 540 550
>>CCDS75316.1 MATR3 gene_id:9782|Hs108|chr5 (509 aa)
initn: 3366 init1: 3366 opt: 3366 Z-score: 2510.0 bits: 474.7 E(32554): 1.7e-133
Smith-Waterman score: 3366; 100.0% identity (100.0% similar) in 509 aa overlap (339-847:1-509)
310 320 330 340 350 360
pF1KSD HINGASHSRRCQLLLEIYPEWNPDNDTGHTMGDPFMLQQSTNPAPGILGPPPPSFHLGGP
::::::::::::::::::::::::::::::
CCDS75 MGDPFMLQQSTNPAPGILGPPPPSFHLGGP
10 20 30
370 380 390 400 410 420
pF1KSD AVGPRGNLGAGNGNLQGPRHMQKGRVETSRVVHIMDFQRGKNLRYQLLQLVEPFGVISNH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 AVGPRGNLGAGNGNLQGPRHMQKGRVETSRVVHIMDFQRGKNLRYQLLQLVEPFGVISNH
40 50 60 70 80 90
430 440 450 460 470 480
pF1KSD LILNKINEAFIEMATTEDAQAAVDYYTTTPALVFGKPVRVHLSQKYKRIKKPEGKPDQKF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 LILNKINEAFIEMATTEDAQAAVDYYTTTPALVFGKPVRVHLSQKYKRIKKPEGKPDQKF
100 110 120 130 140 150
490 500 510 520 530 540
pF1KSD DQKQELGRVIHLSNLPHSGYSDSAVLKLAEPYGKIKNYILMRMKSQAFIEMETREDAMAM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 DQKQELGRVIHLSNLPHSGYSDSAVLKLAEPYGKIKNYILMRMKSQAFIEMETREDAMAM
160 170 180 190 200 210
550 560 570 580 590 600
pF1KSD VDHCLKKALWFQGRCVKVDLSEKYKKLVLRIPNRGIDLLKKDKSRKRSYSPDGKESPSDK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 VDHCLKKALWFQGRCVKVDLSEKYKKLVLRIPNRGIDLLKKDKSRKRSYSPDGKESPSDK
220 230 240 250 260 270
610 620 630 640 650 660
pF1KSD KSKTDGSQKTESSTEGKEQEEKSGEDGEKDTKDDQTEQEPNMLLESEDELLVDEEEAAAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 KSKTDGSQKTESSTEGKEQEEKSGEDGEKDTKDDQTEQEPNMLLESEDELLVDEEEAAAL
280 290 300 310 320 330
670 680 690 700 710 720
pF1KSD LESGSSVGDETDLANLGDVASDGKKEPSDKAVKKDGSASAAAKKKLKKVDKIEELDQENE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 LESGSSVGDETDLANLGDVASDGKKEPSDKAVKKDGSASAAAKKKLKKVDKIEELDQENE
340 350 360 370 380 390
730 740 750 760 770 780
pF1KSD AALENGIKNEENTEPGAESSENADDPNKDTSENADGQSDENKDDYTIPDEYRIGPYQPNV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 AALENGIKNEENTEPGAESSENADDPNKDTSENADGQSDENKDDYTIPDEYRIGPYQPNV
400 410 420 430 440 450
790 800 810 820 830 840
pF1KSD PVGIDYVIPKTGFYCKLCSLFYTNEEVAKNTHCSSLPHYQKLKKFLNKLAEERRQKKET
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 PVGIDYVIPKTGFYCKLCSLFYTNEEVAKNTHCSSLPHYQKLKKFLNKLAEERRQKKET
460 470 480 490 500
847 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 02:48:26 2016 done: Thu Nov 3 02:48:27 2016
Total Scan time: 4.610 Total Display time: 0.040
Function used was FASTA [36.3.4 Apr, 2011]