FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3649, 437 aa
1>>>pF1KE3649 437 - 437 aa - 437 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1252+/-0.00072; mu= 18.6174+/- 0.044
mean_var=66.1326+/-13.144, 0's: 0 Z-trim(109.8): 12 B-trim: 4 in 1/50
Lambda= 0.157713
statistics sampled from 11166 (11171) to 11166 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.715), E-opt: 0.2 (0.343), width: 16
Scan time: 3.130
The best scores are: opt bits E(32554)
CCDS9769.1 FNTB gene_id:2342|Hs108|chr14 ( 437) 3048 702.1 2.7e-202
CCDS669.1 RABGGTB gene_id:5876|Hs108|chr1 ( 331) 421 104.3 1.9e-22
CCDS4116.1 PGGT1B gene_id:5229|Hs108|chr5 ( 377) 337 85.2 1.2e-16
>>CCDS9769.1 FNTB gene_id:2342|Hs108|chr14 (437 aa)
initn: 3048 init1: 3048 opt: 3048 Z-score: 3745.2 bits: 702.1 E(32554): 2.7e-202
Smith-Waterman score: 3048; 100.0% identity (100.0% similar) in 437 aa overlap (1-437:1-437)
10 20 30 40 50 60
pF1KE3 MASPSSFTYYCPPSSSPVWSEPLYSLRPEHARERLQDDSVETVTSIEQAKVEEKIQEVFS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 MASPSSFTYYCPPSSSPVWSEPLYSLRPEHARERLQDDSVETVTSIEQAKVEEKIQEVFS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 SYKFNHLVPRLVLQREKHFHYLKRGLRQLTDAYECLDASRPWLCYWILHSLELLDEPIPQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 SYKFNHLVPRLVLQREKHFHYLKRGLRQLTDAYECLDASRPWLCYWILHSLELLDEPIPQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE3 IVATDVCQFLELCQSPEGGFGGGPGQYPHLAPTYAAVNALCIIGTEEAYDIINREKLLQY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 IVATDVCQFLELCQSPEGGFGGGPGQYPHLAPTYAAVNALCIIGTEEAYDIINREKLLQY
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE3 LYSLKQPDGSFLMHVGGEVDVRSAYCAASVASLTNIITPDLFEGTAEWIARCQNWEGGIG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 LYSLKQPDGSFLMHVGGEVDVRSAYCAASVASLTNIITPDLFEGTAEWIARCQNWEGGIG
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE3 GVPGMEAHGGYTFCGLAALVILKRERSLNLKSLLQWVTSRQMRFEGGFQGRCNKLVDGCY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 GVPGMEAHGGYTFCGLAALVILKRERSLNLKSLLQWVTSRQMRFEGGFQGRCNKLVDGCY
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE3 SFWQAGLLPLLHRALHAQGDPALSMSHWMFHQQALQEYILMCCQCPAGGLLDKPGKSRDF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 SFWQAGLLPLLHRALHAQGDPALSMSHWMFHQQALQEYILMCCQCPAGGLLDKPGKSRDF
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE3 YHTCYCLSGLSIAQHFGSGAMLHDVVLGVPENALQPTHPVYNIGPDKVIQATTYFLQKPV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 YHTCYCLSGLSIAQHFGSGAMLHDVVLGVPENALQPTHPVYNIGPDKVIQATTYFLQKPV
370 380 390 400 410 420
430
pF1KE3 PGFEELKDETSAEPATD
:::::::::::::::::
CCDS97 PGFEELKDETSAEPATD
430
>>CCDS669.1 RABGGTB gene_id:5876|Hs108|chr1 (331 aa)
initn: 490 init1: 209 opt: 421 Z-score: 516.6 bits: 104.3 E(32554): 1.9e-22
Smith-Waterman score: 521; 29.9% identity (59.5% similar) in 338 aa overlap (76-410:21-321)
50 60 70 80 90 100
pF1KE3 IEQAKVEEKIQEVFSSYKFNHLVPRLVLQREKHFHYL-KRGLRQLTDAYE-CLDAS-RPW
::: :. . : .. : :: :.. :
CCDS66 MGTPQKDVIIKSDAPDTLLLEKHADYIASYGSKK--DDYEYCMSEYLRMS
10 20 30 40
110 120 130 140 150 160
pF1KE3 LCYWILHSLELLDEPIPQIVATDVCQFLELCQSPEGGFGGGPGQYPHLAPTYAAVNALCI
:: : ..:. . . .. .. :.. :: ::.... :. ::: : .::. : .
CCDS66 GIYWGLTVMDLMGQ-LHRMNREEILAFIKSCQHECGGISASIGHDPHLLYTLSAVQILTL
50 60 70 80 90 100
170 180 190 200 210 220
pF1KE3 IGTEEAYDIINREKLLQYLYSLKQPDGSFLMHVGGEVDVRSAYCAASVASLTNIITPDLF
.. ..:. .:...:. .:.. :::: . ::.:.: ..::... .: . .
CCDS66 Y---DSINVIDVNKVVEYVKGLQKEDGSFAGDIWGEIDTRFSFCAVATLALLGKLDAINV
110 120 130 140 150 160
230 240 250 260 270 280
pF1KE3 EGTAEWIARCQNWEGGIGGVPGMEAHGGYTFCGLAALVILKRERSLNLKSLLQWVTSRQM
: . :.. :.:..::.: :: :.:.: .: . :.: .. ...: : :. ::.
CCDS66 EKAIEFVLSCMNFDGGFGCRPGSESHAGQIYCCTGFLAITSQLHQVNSDLLGWWLCERQL
170 180 190 200 210 220
290 300 310 320 330 340
pF1KE3 RFEGGFQGRCNKLVDGCYSFWQAGLLPLLHRALHAQGDPALSMSHWMFHQQALQEYILMC
::..:: .:: : :::.: . : .. : ::. .. :...:: :
CCDS66 P-SGGLNGRPEKLPDVCYSWWVLASLKIIGRL------------HWI-DREKLRNFILAC
230 240 250 260 270
350 360 370 380 390 400
pF1KE3 CQCPAGGLLDKPGKSRDFYHTCYCLSGLSIAQHFGSGAMLHDVVLGVPENALQPTHPVYN
. .::. :.:: : .:: . ..:::. :: :. ..:..::.
CCDS66 QDEETGGFADRPGDMVDPFHTLFGIAGLSL--------------LG--EEQIKPVNPVFC
280 290 300 310
410 420 430
pF1KE3 IGPDKVIQATTYFLQKPVPGFEELKDETSAEPATD
. :..:.:
CCDS66 M-PEEVLQRVNVQPELVS
320 330
>>CCDS4116.1 PGGT1B gene_id:5229|Hs108|chr5 (377 aa)
initn: 324 init1: 132 opt: 337 Z-score: 412.5 bits: 85.2 E(32554): 1.2e-16
Smith-Waterman score: 392; 28.4% identity (55.6% similar) in 338 aa overlap (70-377:17-336)
40 50 60 70 80 90
pF1KE3 VETVTSIEQAKVEEKIQEVFSSYKFNHLVPRLVLQREKHFHYLKRGLRQLTDAYECLDAS
:: . :..: ....: :. : . : :..:
CCDS41 MAATEDERLAGSGEGERLDFLRDRHVRFFQRCLQVLPERYSSLETS
10 20 30 40
100 110 120 130 140
pF1KE3 RPWLCYWILHSLELLD--------EPIPQIVATDVCQFLELCQSPEGGFGGG--------
: . .. : .:..:: . : : . .: . . . :: :.
CCDS41 RLTIAFFALSGLDMLDSLDVVNKDDIIEWIYSLQVLPTEDRSNLNRCGFRGSSYLGIPFN
50 60 70 80 90 100
150 160 170 180 190
pF1KE3 PGQYP---------HLAPTYAAVNALCIIGTEEAYDIINREKLLQYLYSLKQPDGSFL-M
:.. : :.: ::.... : :.: . . .:.: : : .:. :::: .
CCDS41 PSKAPGTAHPYDSGHIAMTYTGLSCLVILGDDLSR--VNKEACLAGLRALQLEDGSFCAV
110 120 130 140 150 160
200 210 220 230 240 250
pF1KE3 HVGGEVDVRSAYCAASVASLTNIITPDLFEGTAEWIARCQNWEGGIGGVPGMEAHGGYTF
:.: :.: .:::. . . : . .. . .: : .....:.. :.:.::: ::
CCDS41 PEGSENDMRFVYCASCICYMLNNWSGMDMKKAITYIRRSMSYDNGLAQGAGLESHGGSTF
170 180 190 200 210 220
260 270 280 290 300
pF1KE3 CGLAALVIL-KRERSLNLKSL---LQWVTSRQMRFEGGFQGRCNKLVDGCYSFWQAGLLP
::.:.: .. : :. .. : : .: :: ..:..:: :: :: ::::: .. :
CCDS41 CGIASLCLMGKLEEVFSEKELNRIKRWCIMRQ---QNGYHGRPNKPVDTCYSFWVGATLK
230 240 250 260 270 280
310 320 330 340 350 360
pF1KE3 LLHRALHAQGDPALSMSHWMFHQQALQEYILMCCQCPAGGLLDKPGKSRDFYHTCYCLSG
::. ... :... ..::: . .::. : . : :. . . :
CCDS41 LLK-----------IFQYTNFEKN--RNYILSTQDRLVGGFAKWPDSHPDALHAYFGICG
290 300 310 320
370 380 390 400 410 420
pF1KE3 LSIAQHFGSGAMLHDVVLGVPENALQPTHPVYNIGPDKVIQATTYFLQKPVPGFEELKDE
::. .. :
CCDS41 LSLMEESGICKVHPALNVSTRTSERLLDLHQSWKTKDSKQCSENVHIST
330 340 350 360 370
437 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 00:46:11 2016 done: Mon Nov 7 00:46:11 2016
Total Scan time: 3.130 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]