FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0909, 225 aa
1>>>pF1KE0909 225 - 225 aa - 225 aa
Library: human.CCDS.faa
18921897 residues in 33420 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.8931+/-0.000608; mu= 11.3796+/- 0.036
mean_var=72.5087+/-14.776, 0's: 0 Z-trim(112.3): 9 B-trim: 451 in 1/52
Lambda= 0.150619
statistics sampled from 13228 (13234) to 13228 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.771), E-opt: 0.2 (0.396), width: 16
Scan time: 1.140
The best scores are:                                  opt  bits E(33420)
CCDS10404.1 ARHGDIG gene_id:398|Hs109|chr16   ( 225) 1518 338.2 2.5e-93
CCDS11788.1 ARHGDIA gene_id:396|Hs109|chr17   ( 204)  793 180.7 6.2e-46
CCDS8671.1 ARHGDIB gene_id:397|Hs109|chr12    ( 201)  763 174.2 5.6e-44
CCDS77133.1 ARHGDIA gene_id:396|Hs109|chr17   ( 235)  625 144.2 6.9e-35
CCDS58609.1 ARHGDIA gene_id:396|Hs109|chr17   ( 160)  456 107.4 5.5e-24
>>CCDS10404.1 ARHGDIG gene_id:398|Hs109|chr16 (225 aa)
initn: 1518 init1: 1518 opt: 1518 Z-score: 1789.1 bits: 338.2 E(33420): 2.5e-93
Smith-Waterman score: 1518; 100.0% identity (100.0% similar) in 225 aa overlap (1-225:1-225)
               10        20        30        40        50        60
pF1KE0 MLGLDACELGAQLLELLRLALCARVLLADKEGGPPAVDEVLDEAVPEYRAPGRKSLLEIR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MLGLDACELGAQLLELLRLALCARVLLADKEGGPPAVDEVLDEAVPEYRAPGRKSLLEIR
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE0 QLDPDDRSLAKYKRVLLGPLPPAVDPSLPNVQVTRLTLLSEQAPGPVVMDLTGDLAVLKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 QLDPDDRSLAKYKRVLLGPLPPAVDPSLPNVQVTRLTLLSEQAPGPVVMDLTGDLAVLKD
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE0 QVFVLKEGVDYRVKISFKVHREIVSGLKCLHHTYRRGLRVDKTVYMVGSYGPSAQEYEFV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 QVFVLKEGVDYRVKISFKVHREIVSGLKCLHHTYRRGLRVDKTVYMVGSYGPSAQEYEFV
              130       140       150       160       170       180
              190       200       210       220
pF1KE0 TPVEEAPRGALVRGPYLVVSLFTDDDRTHHLSWEWGLCICQDWKD
       :::::::::::::::::::::::::::::::::::::::::::::
CCDS10 TPVEEAPRGALVRGPYLVVSLFTDDDRTHHLSWEWGLCICQDWKD
              190       200       210       220
>>CCDS11788.1 ARHGDIA gene_id:396|Hs109|chr17 (204 aa)
initn: 792 init1: 792 opt: 793 Z-score: 938.3 bits: 180.7 E(33420): 6.2e-46
Smith-Waterman score: 793; 61.1% identity (83.2% similar) in 190 aa overlap (36-225:15-204)
10 20 30 40 50 60
pF1KE0 ACELGAQLLELLRLALCARVLLADKEGGPPAVDEVLDEAVPEYRAPGRKSLLEIRQLDPD
:... :: .:. :..::. ::..:: :
CCDS11 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKD
10 20 30 40
70 80 90 100 110 120
pF1KE0 DRSLAKYKRVLLGPLPPAVDPSLPNVQVTRLTLLSEQAPGPVVMDLTGDLAVLKDQVFVL
:.:: :::..::: . ..::..::: :: :::. .::::. .:::::: .: : :::
CCDS11 DESLRKYKEALLGRVAVSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVL
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE0 KEGVDYRVKISFKVHREIVSGLKCLHHTYRRGLRVDKTVYMVGSYGPSAQEYEFVTPVEE
::::.::.::::.:.::::::.: ..::::.:...::: :::::::: :.::::.:::::
CCDS11 KEGVEYRIKISFRVNREIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEE
110 120 130 140 150 160
190 200 210 220
pF1KE0 APRGALVRGPYLVVSLFTDDDRTHHLSWEWGLCICQDWKD
::.: :.:: : . : :::::.: ::::::.: : .::::
CCDS11 APKGMLARGSYSIKSRFTDDDKTDHLSWEWNLTIKKDWKD
170 180 190 200
>>CCDS8671.1 ARHGDIB gene_id:397|Hs109|chr12 (201 aa)
initn: 757 init1: 736 opt: 763 Z-score: 903.2 bits: 174.2 E(33420): 5.6e-44
Smith-Waterman score: 763; 57.8% identity (80.4% similar) in 199 aa overlap (31-225:3-201)
10 20 30 40 50
pF1KE0 MLGLDACELGAQLLELLRLALCARVLLADKEGGP-PAVDEVLDEAVP---EYRAPGRKSL
: .: : :.: :. . .:. : .:::
CCDS86 MTEKAPEPHVEEDDDDELDSKLNYKPPPQKSL
10 20 30
60 70 80 90 100 110
pF1KE0 LEIRQLDPDDRSLAKYKRVLLGPLPPAVDPSLPNVQVTRLTLLSEQAPGPVVMDLTGDLA
:....: ::.:: :::..::: : ..::. ::: ::::::. :.::::..:::::::
CCDS86 KELQEMDKDDESLIKYKKTLLGDGPVVTDPKAPNVVVTRLTLVCESAPGPITMDLTGDLE
40 50 60 70 80 90
120 130 140 150 160 170
pF1KE0 VLKDQVFVLKEGVDYRVKISFKVHREIVSGLKCLHHTYRRGLRVDKTVYMVGSYGPSAQE
.:: ...::::: .::::: :::.:.:::::: ..:::: :..:::...::::::: .:
CCDS86 ALKKETIVLKEGSEYRVKIHFKVNRDIVSGLKYVQHTYRTGVKVDKATFMVGSYGPRPEE
100 110 120 130 140 150
180 190 200 210 220
pF1KE0 YEFVTPVEEAPRGALVRGPYLVVSLFTDDDRTHHLSWEWGLCICQDWKD
:::.:::::::.: :.:: : :.:::::. ::::::.: : ..: .
CCDS86 YEFLTPVEEAPKGMLARGTYHNKSFFTDDDKQDHLSWEWNLSIKKEWTE
160 170 180 190 200
>>CCDS77133.1 ARHGDIA gene_id:396|Hs109|chr17 (235 aa)
initn: 624 init1: 624 opt: 625 Z-score: 740.1 bits: 144.2 E(33420): 6.9e-35
Smith-Waterman score: 625; 59.6% identity (84.6% similar) in 156 aa overlap (36-191:15-170)
10 20 30 40 50 60
pF1KE0 ACELGAQLLELLRLALCARVLLADKEGGPPAVDEVLDEAVPEYRAPGRKSLLEIRQLDPD
:... :: .:. :..::. ::..:: :
CCDS77 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKD
10 20 30 40
70 80 90 100 110 120
pF1KE0 DRSLAKYKRVLLGPLPPAVDPSLPNVQVTRLTLLSEQAPGPVVMDLTGDLAVLKDQVFVL
:.:: :::..::: . ..::..::: :: :::. .::::. .:::::: .: : :::
CCDS77 DESLRKYKEALLGRVAVSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVL
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE0 KEGVDYRVKISFKVHREIVSGLKCLHHTYRRGLRVDKTVYMVGSYGPSAQEYEFVTPVEE
::::.::.::::.:.::::::.: ..::::.:...::: :::::::: :.::::.:::::
CCDS77 KEGVEYRIKISFRVNREIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPVEE
110 120 130 140 150 160
190 200 210 220
pF1KE0 APRGALVRGPYLVVSLFTDDDRTHHLSWEWGLCICQDWKD
::.:..
CCDS77 APKGSISPSHPRPGFRRERSSHSPGPVVAPGRVRLLLRGGAGVWDARPRGGRAVLQPRCS
170 180 190 200 210 220
>>CCDS58609.1 ARHGDIA gene_id:396|Hs109|chr17 (160 aa)
initn: 455 init1: 455 opt: 456 Z-score: 544.2 bits: 107.4 E(33420): 5.5e-24
Smith-Waterman score: 479; 44.2% identity (63.7% similar) in 190 aa overlap (36-225:15-160)
10 20 30 40 50 60
pF1KE0 ACELGAQLLELLRLALCARVLLADKEGGPPAVDEVLDEAVPEYRAPGRKSLLEIRQLDPD
:... :: .:. :..::. ::..:: :
CCDS58 MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKD
10 20 30 40
70 80 90 100 110 120
pF1KE0 DRSLAKYKRVLLGPLPPAVDPSLPNVQVTRLTLLSEQAPGPVVMDLTGDLAVLKDQVFVL
:.:: :::..::: . ..::..::: :: :::. .::::. .:::::: .: : :::
CCDS58 DESLRKYKEALLGRVAVSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVL
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE0 KEGVDYRVKISFKVHREIVSGLKCLHHTYRRGLRVDKTVYMVGSYGPSAQEYEFVTPVEE
::::.::.::::.:.::::::.: ..::::.:..
CCDS58 KEGVEYRIKISFRVNREIVSGMKYIQHTYRKGVK--------------------------
110 120 130
190 200 210 220
pF1KE0 APRGALVRGPYLVVSLFTDDDRTHHLSWEWGLCICQDWKD
.::.: ::::::.: : .::::
CCDS58 ------------------NDDKTDHLSWEWNLTIKKDWKD
140 150 160
225 residues in 1 query sequences
18921897 residues in 33420 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Oct 24 21:45:55 2019 done: Thu Oct 24 21:45:55 2019
Total Scan time: 1.140 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]