FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6332, 415 aa
1>>>pF1KE6332 415 - 415 aa - 415 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2181+/-0.000771; mu= 17.5503+/- 0.046
mean_var=63.2381+/-12.557, 0's: 0 Z-trim(107.2): 12 B-trim: 0 in 0/53
Lambda= 0.161282
statistics sampled from 9438 (9445) to 9438 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.672), E-opt: 0.2 (0.29), width: 16
Scan time: 2.170
The best scores are: opt bits E(32554)
CCDS8136.1 B4GAT1 gene_id:11041|Hs108|chr11 ( 415) 2820 664.8 4.3e-191
CCDS13912.1 LARGE1 gene_id:9215|Hs108|chr22 ( 756) 359 92.3 1.7e-18
CCDS76399.1 LARGE2 gene_id:120071|Hs108|chr11 ( 690) 316 82.3 1.6e-15
CCDS31473.1 LARGE2 gene_id:120071|Hs108|chr11 ( 721) 316 82.3 1.7e-15
>>CCDS8136.1 B4GAT1 gene_id:11041|Hs108|chr11 (415 aa)
initn: 2820 init1: 2820 opt: 2820 Z-score: 3544.2 bits: 664.8 E(32554): 4.3e-191
Smith-Waterman score: 2820; 100.0% identity (100.0% similar) in 415 aa overlap (1-415:1-415)
10 20 30 40 50 60
pF1KE6 MQMSYAIRCAFYQLLLAALMLVAMLQLLYLSLLSGLHGQEEQDQYFEFFPPSPRSVDQVK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MQMSYAIRCAFYQLLLAALMLVAMLQLLYLSLLSGLHGQEEQDQYFEFFPPSPRSVDQVK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 AQLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 AQLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 SVSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 SVSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLR
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 SCQEVFDKLARVAQPGINYALGTNVSYPNNLLRNLAREGANYALVIDVDMVPSEGLWRGL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 SCQEVFDKLARVAQPGINYALGTNVSYPNNLLRNLAREGANYALVIDVDMVPSEGLWRGL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE6 REMLDQSNQWGGTALVVPAFEIRRARRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 REMLDQSNQWGGTALVVPAFEIRRARRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTN
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE6 YSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQACELHVAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 YSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQACELHVAG
310 320 330 340 350 360
370 380 390 400 410
pF1KE6 FDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKILYRQFKQELKAKYPNSPRRC
:::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 FDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKILYRQFKQELKAKYPNSPRRC
370 380 390 400 410
>>CCDS13912.1 LARGE1 gene_id:9215|Hs108|chr22 (756 aa)
initn: 339 init1: 123 opt: 359 Z-score: 445.5 bits: 92.3 E(32554): 1.7e-18
Smith-Waterman score: 395; 28.7% identity (54.6% similar) in 328 aa overlap (91-408:470-742)
70 80 90 100 110 120
pF1KE6 AQLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPL
: .:: :... :.: : : .. ..::::.
CCDS13 DDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDRLQMLEAICKHWEGPI
440 450 460 470 480 490
130 140 150 160 170 180
pF1KE6 SVSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLR
:.... . : :. : :: .:. : :..:.: . :.:
CCDS13 SLALYLSDAEAQQF---LRYAQGSEVLMSRHNVGYHIV------------YKEGQF----
500 510 520 530 540
190 200 210 220 230
pF1KE6 SCQEVFDKLARVAQPGINYALGTNVSYPNNLLRNLAREGAN--YALVIDVDMVPSEGLWR
:: :::::.: . . : .. :.:..: ::..
CCDS13 --------------------------YPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYE
550 560 570
240 250 260 270 280 290
pF1KE6 GLRE---MLDQSNQWGGTALVVPAFEIRRAR-RMPMNKNELVQLYQVGEVRPFYYGLCTP
::. .:: .: :..::::: : : .: .: ::... ..: . : : . :
CCDS13 YLRKSVIQLDLANT--KKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTK
580 590 600 610 620 630
300 310 320 330 340 350
pF1KE6 CQAPTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQAC
.::::...: .. : : : :. .::. :. : .:.:: .:.:....
CCDS13 GHAPTNFAKW-----RTATTP-YRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNKVAHIM
640 650 660 670 680
360 370 380 390 400 410
pF1KE6 ELHVAGFDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKI----LYRQFKQELKAKYPN
:: : ..: :: .....: . .: .: : :.. .: : ..:.:... .:
CCDS13 ELDVQEYEFIVLPNAYMIH--MPHAPSFDITKFRSNKQYRICLKTLKEEFQQDMSRRYGF
690 700 710 720 730 740
pF1KE6 SPRRC
CCDS13 AALKYLTAENNS
750
>>CCDS76399.1 LARGE2 gene_id:120071|Hs108|chr11 (690 aa)
initn: 292 init1: 119 opt: 316 Z-score: 392.1 bits: 82.3 E(32554): 1.6e-15
Smith-Waterman score: 339; 28.9% identity (51.6% similar) in 322 aa overlap (92-408:398-662)
70 80 90 100 110 120
pF1KE6 QLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPLS
:.:: :... :.: : : .: ..: ::.:
CCDS76 EDPCFEFRQQQLTVHRVHVTFLPHEPPPPRPHDVTLVAQLSMDRLQMLEALCRHWPGPMS
370 380 390 400 410 420
130 140 150 160 170 180
pF1KE6 VSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLRS
.... . : :. : .. .: : ::.:.: : :: : .
CCDS76 LALYLTDAEAQQF---LHFVEASPVLAARQDVAYHVV----Y-------RE-GPL-----
430 440 450 460
190 200 210 220 230
pF1KE6 CQEVFDKLARVAQPGINYALGTNVSYPNNLLRN--LAREGANYALVIDVDMVPSEGLWRG
:: : ::: ::. . :... :.:..:. .:.
CCDS76 -------------------------YPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDY
470 480 490 500
240 250 260 270 280 290
pF1KE6 LREMLDQSNQWG--GTALVVPAFEIRRAR-RMPMNKNELVQLYQVGEVRPFYYGLCTPCQ
:: ..: . . .:::::::: : : .: .: ::. : ..: . : : .
CCDS76 LRASIEQLGLGSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH
510 520 530 540 550 560
300 310 320 330 340 350
pF1KE6 APTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQACEL
:::.:.:: .:. . : : : .::. :. : .: :: .:.:.... ::
CCDS76 APTDYARW----REA--QAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNKVAHIVEL
570 580 590 600 610
360 370 380 390 400 410
pF1KE6 HVAGFDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKILYRQFKQELKAKYPNSPRRC
. ... :: :.: .: : :. . ... ::. : :: ..
CCDS76 DAQEYELLVLPEAFTIH------LPHAPSLDISRFRSSPTYRDCLQALKDEFHQDLSRHH
620 630 640 650 660 670
CCDS76 GAAALKYLPALQQPQSPARG
680 690
>>CCDS31473.1 LARGE2 gene_id:120071|Hs108|chr11 (721 aa)
initn: 292 init1: 119 opt: 316 Z-score: 391.8 bits: 82.3 E(32554): 1.7e-15
Smith-Waterman score: 339; 28.9% identity (51.6% similar) in 322 aa overlap (92-408:429-693)
70 80 90 100 110 120
pF1KE6 QLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHASVDNLLHLSGLLERWEGPLS
:.:: :... :.: : : .: ..: ::.:
CCDS31 EDPCFEFRQQQLTVHRVHVTFLPHEPPPPRPHDVTLVAQLSMDRLQMLEALCRHWPGPMS
400 410 420 430 440 450
130 140 150 160 170 180
pF1KE6 VSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCPSRYEAAVPDPREPGEFALLRS
.... . : :. : .. .: : ::.:.: : :: : .
CCDS31 LALYLTDAEAQQF---LHFVEASPVLAARQDVAYHVV----Y-------RE-GPL-----
460 470 480 490
190 200 210 220 230
pF1KE6 CQEVFDKLARVAQPGINYALGTNVSYPNNLLRN--LAREGANYALVIDVDMVPSEGLWRG
:: : ::: ::. . :... :.:..:. .:.
CCDS31 -------------------------YPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDY
500 510 520 530
240 250 260 270 280 290
pF1KE6 LREMLDQSNQWG--GTALVVPAFEIRRAR-RMPMNKNELVQLYQVGEVRPFYYGLCTPCQ
:: ..: . . .:::::::: : : .: .: ::. : ..: . : : .
CCDS31 LRASIEQLGLGSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH
540 550 560 570 580 590
300 310 320 330 340 350
pF1KE6 APTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNRISQACEL
:::.:.:: .:. . : : : .::. :. : .: :: .:.:.... ::
CCDS31 APTDYARW----REA--QAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNKVAHIVEL
600 610 620 630 640
360 370 380 390 400 410
pF1KE6 HVAGFDFEVLNEGFLVHKGFKEALKFHPQKEAENQHNKILYRQFKQELKAKYPNSPRRC
. ... :: :.: .: : :. . ... ::. : :: ..
CCDS31 DAQEYELLVLPEAFTIH------LPHAPSLDISRFRSSPTYRDCLQALKDEFHQDLSRHH
650 660 670 680 690 700
CCDS31 GAAALKYLPALQQPQSPARG
710 720
415 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 12:11:32 2016 done: Tue Nov 8 12:11:32 2016
Total Scan time: 2.170 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]