FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB3219, 669 aa
1>>>pF1KB3219 669 - 669 aa - 669 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.0001+/-0.00108; mu= 12.1106+/- 0.064
mean_var=148.1580+/-30.100, 0's: 0 Z-trim(108.7): 180 B-trim: 58 in 1/50
Lambda= 0.105369
statistics sampled from 10172 (10371) to 10172 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.674), E-opt: 0.2 (0.319), width: 16
Scan time: 4.130
The best scores are: opt bits E(32554)
CCDS12135.1 FEM1A gene_id:55527|Hs108|chr19 ( 669) 4470 692.0 7.1e-199
CCDS4118.1 FEM1C gene_id:56929|Hs108|chr5 ( 617) 1702 271.2 3.1e-72
CCDS10228.1 FEM1B gene_id:10116|Hs108|chr15 ( 627) 672 114.6 4.3e-25
>>CCDS12135.1 FEM1A gene_id:55527|Hs108|chr19 (669 aa)
initn: 4470 init1: 4470 opt: 4470 Z-score: 3683.9 bits: 692.0 E(32554): 7.1e-199
Smith-Waterman score: 4470; 100.0% identity (100.0% similar) in 669 aa overlap (1-669:1-669)
10 20 30 40 50 60
pF1KB3 MDLRTAVYNAARDGKLQLLQKLLSGRSREELDELTGEVAGGGTPLLIAARYGHLDVVEYL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MDLRTAVYNAARDGKLQLLQKLLSGRSREELDELTGEVAGGGTPLLIAARYGHLDVVEYL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB3 VDRCGASVEAGGSVHFDGETIEGAPPLWAASAAGHLDVVRSLLRRGASVNRTTRTNSTPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 VDRCGASVEAGGSVHFDGETIEGAPPLWAASAAGHLDVVRSLLRRGASVNRTTRTNSTPL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB3 RAACFDGHLEVVRYLVGEHQADLEVANRHGHTCLMISCYKGHREIARYLLEQGAQVNRRS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 RAACFDGHLEVVRYLVGEHQADLEVANRHGHTCLMISCYKGHREIARYLLEQGAQVNRRS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB3 AKGNTALHDCAESGSLEILQLLLGCKARMERDGYGMTPLLAASVTGHTNIVEYLIQEQPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 AKGNTALHDCAESGSLEILQLLLGCKARMERDGYGMTPLLAASVTGHTNIVEYLIQEQPG
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB3 QEQVAGGEAQPGLPQEDPSTSQGCAQPQGAPCCSSSPEEPLNGESYESCCPTSREAAVEA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 QEQVAGGEAQPGLPQEDPSTSQGCAQPQGAPCCSSSPEEPLNGESYESCCPTSREAAVEA
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB3 LELLGATYVDKKRDLLGALKHWRRAMELRHQGGEYLPKPEPPQLVLAYDYSREVNTTEEL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 LELLGATYVDKKRDLLGALKHWRRAMELRHQGGEYLPKPEPPQLVLAYDYSREVNTTEEL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB3 EALITDPDEMRMQALLIRERILGPSHPDTSYYIRYRGAVYADSGNFERCIRLWKYALDMQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 EALITDPDEMRMQALLIRERILGPSHPDTSYYIRYRGAVYADSGNFERCIRLWKYALDMQ
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB3 QSNLEPLSPMTASSFLSFAELFSYVLQDRAAKGSLGTQIGFADLMGVLTKGVREVERALQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 QSNLEPLSPMTASSFLSFAELFSYVLQDRAAKGSLGTQIGFADLMGVLTKGVREVERALQ
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB3 LPREPGDSAQFTKALAIILHLLYLLEKVECTPSQEHLKHQTVYRLLKCAPRGKNGFTPLH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 LPREPGDSAQFTKALAIILHLLYLLEKVECTPSQEHLKHQTVYRLLKCAPRGKNGFTPLH
490 500 510 520 530 540
550 560 570 580 590 600
pF1KB3 MAVDKDTTNVGRYPVGRFPSLHVVKVLLDCGADPDSRDFDNNTPLHIAAQNNCPAIMNAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MAVDKDTTNVGRYPVGRFPSLHVVKVLLDCGADPDSRDFDNNTPLHIAAQNNCPAIMNAL
550 560 570 580 590 600
610 620 630 640 650 660
pF1KB3 IEAGAHMDATNAFKKTAYELLDEKLLARGTMQPFNYVTLQCLAARALDKNKIPYKGFIPE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 IEAGAHMDATNAFKKTAYELLDEKLLARGTMQPFNYVTLQCLAARALDKNKIPYKGFIPE
610 620 630 640 650 660
pF1KB3 DLEAFIELH
:::::::::
CCDS12 DLEAFIELH
>>CCDS4118.1 FEM1C gene_id:56929|Hs108|chr5 (617 aa)
initn: 2238 init1: 969 opt: 1702 Z-score: 1410.3 bits: 271.2 E(32554): 3.1e-72
Smith-Waterman score: 2791; 64.8% identity (82.8% similar) in 670 aa overlap (1-669:1-616)
10 20 30 40 50 60
pF1KB3 MDLRTAVYNAARDGKLQLLQKLLSGRSREELDELTGEVAGGGTPLLIAARYGHLDVVEYL
:::.:::.::::::::.:: :::...:.::.. : .: ..:.::::.::::::::.::.:
CCDS41 MDLKTAVFNAARDGKLRLLTKLLASKSKEEVSSLISEKTNGATPLLMAARYGHLDMVEFL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB3 VDRCGASVEAGGSVHFDGETIEGAPPLWAASAAGHLDVVRSLLRRGASVNRTTRTNSTPL
...:.::.:.::::.::::::::::::::::::::: ::.::: .::::: :: ::::::
CCDS41 LEQCSASIEVGGSVNFDGETIEGAPPLWAASAAGHLKVVQSLLNHGASVNNTTLTNSTPL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB3 RAACFDGHLEVVRYLVGEHQADLEVANRHGHTCLMISCYKGHREIARYLLEQGAQVNRRS
::::::::::.:.::: ::.:::::.::::::::::::::::.:::.::::.::.:::.:
CCDS41 RAACFDGHLEIVKYLV-EHKADLEVSNRHGHTCLMISCYKGHKEIAQYLLEKGADVNRKS
130 140 150 160 170
190 200 210 220 230 240
pF1KB3 AKGNTALHDCAESGSLEILQLLLGCKARMERDGYGMTPLLAASVTGHTNIVEYLIQEQPG
.:::::::::::::::.:...:: :.::.:::::::::.::::::::::..: ..
CCDS41 VKGNTALHDCAESGSLDIMKMLLMYCAKMEKDGYGMTPLLSASVTGHTNIVDFLTHH---
180 190 200 210 220 230
250 260 270 280 290 300
pF1KB3 QEQVAGGEAQPGLPQEDPSTSQGCAQPQGAPCCSSSPEEPLNGESYESCCPTSREAAVEA
:: ::. ..:
CCDS41 --------AQ-----------------------------------------TSKTERINA
240
310 320 330 340 350
pF1KB3 LELLGATYVDKKRDLLGALKHWRRAMELRHQG-GEYLPKPEPPQLVLAYDYSREVNTTEE
:::::::.::::::::::::.:..::..:.. . . :: : :..::::..:::..::
CCDS41 LELLGATFVDKKRDLLGALKYWKKAMNMRYSDRTNIISKPVPQTLIMAYDYAKEVNSAEE
250 260 270 280 290 300
360 370 380 390 400 410
pF1KB3 LEALITDPDEMRMQALLIRERILGPSHPDTSYYIRYRGAVYADSGNFERCIRLWKYALDM
::.::.:::::::::::::::::::::::::::::::::::::::::.::: ::::::::
CCDS41 LEGLIADPDEMRMQALLIRERILGPSHPDTSYYIRYRGAVYADSGNFKRCINLWKYALDM
310 320 330 340 350 360
420 430 440 450 460 470
pF1KB3 QQSNLEPLSPMTASSFLSFAELFSYVLQDRAAKGSLGTQIGFADLMGVLTKGVREVERAL
:::::.:::::::::.::::::::..::::: :: ::: . : ::::.: :.: :.:::.
CCDS41 QQSNLDPLSPMTASSLLSFAELFSFMLQDRA-KGLLGTTVTFDDLMGILCKSVLEIERAI
370 380 390 400 410 420
480 490 500 510 520 530
pF1KB3 QLPREPGDSAQFTKALAIILHLLYLLEKVECTPSQEHLKHQTVYRLLKCAPRGKNGFTPL
. . :.: :..:::.:::::. ::::: :: :.:.:.::.::.:: :::::.:.::
CCDS41 KQTQCPADPLQLNKALSIILHLICLLEKVPCTLEQDHFKKQTIYRFLKLHPRGKNNFSPL
430 440 450 460 470 480
540 550 560 570 580 590
pF1KB3 HMAVDKDTTNVGRYPVGRFPSLHVVKVLLDCGADPDSRDFDNNTPLHIAAQNNCPAIMNA
:.::::.:: :::::: .::::.:. .:..:::: . :: :.:.:::::: :: : :::
CCDS41 HLAVDKNTTCVGRYPVCKFPSLQVTAILIECGADVNVRDSDDNSPLHIAALNNHPDIMNL
490 500 510 520 530 540
600 610 620 630 640 650
pF1KB3 LIEAGAHMDATNAFKKTAYELLDEKLLARGTMQPFNYVTLQCLAARALDKNKIPYKGFIP
::..:::.:::: :.:: .::::: .:.. .::.:..::::::::.. ...: ::: ::
CCDS41 LIKSGAHFDATNLHKQTASDLLDEKEIAKNLIQPINHTTLQCLAARVIVNHRIYYKGHIP
550 560 570 580 590 600
660
pF1KB3 EDLEAFIELH
: ::.:. ::
CCDS41 EKLETFVSLHR
610
>>CCDS10228.1 FEM1B gene_id:10116|Hs108|chr15 (627 aa)
initn: 1114 init1: 349 opt: 672 Z-score: 564.0 bits: 114.6 E(32554): 4.3e-25
Smith-Waterman score: 1232; 35.8% identity (63.0% similar) in 689 aa overlap (7-669:8-627)
10 20 30 40 50
pF1KB3 MDLRTAVYNAARDGKLQLLQKLLSGRSREELDELTGEVA--GG--GTPLLIAARYGHLD
::.:: .::. : :: .::. .. : : :. :: .:::.:::: ::
CCDS10 MEGLAGYVYKAASEGKVLTLAALLLNRSESDIRYLLGYVSQQGGQRSTPLIIAARNGHAK
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB3 VVEYLVDRCGASVEAGGSVHFDGETIEGAPPLWAASAAGHLDVVRSLLRRGASVNRTTRT
::. :... .... :.:.::: .:.:: :: :..:::..::. :. .::.::.:: :
CCDS10 VVRLLLEHYRVQTQQTGTVRFDGYVIDGATALWCAAGAGHFEVVKLLVSHGANVNHTTVT
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB3 NSTPLRAACFDGHLEVVRYLVGEHQADLEVANRHGHTCLMISCYKGHREIARYLLEQGAQ
::::::::::::.:..:.::: :..:.. .::.. .:::::. :::: ...:::::: :.
CCDS10 NSTPLRAACFDGRLDIVKYLV-ENNANISIANKYDNTCLMIAAYKGHTDVVRYLLEQRAD
130 140 150 160 170
180 190 200 210 220 230
pF1KB3 VNRRSAKGNTALHDCAESGSLEILQLLLGCKARMERDGYGMTPLLAASVTGHTNIVEYLI
: .. : :::: ::.: ..:.. :. .: . .:.::::: .:. . ....:: :.
CCDS10 PNAKAHCGATALHFAAEAGHIDIVKELIKWRAAIVVNGHGMTPLKVAAESCKADVVELLL
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB3 QEQPGQEQVAGGEAQPGLPQEDPSTSQGCAQPQGAPCCSSSPEEPLNGESYESCCPTSRE
:. .: .:.
CCDS10 -------------------------------------------------SHADC---DRR
240
300 310 320 330 340 350
pF1KB3 AAVEALELLGATYVDKKR--DLLGALKHWRRAMELRHQGGEYLPKPE--PPQLVLAYDYS
. .::::::::.... .. :.. . .. :: : : :. . . : :: . ::
CCDS10 SRIEALELLGASFANDRENYDIIKTYHYLYLAMLERFQDGDNILEKEVLPP--IHAYGNR
250 260 270 280 290 300
360 370 380 390 400 410
pF1KB3 REVNTTEELEALITDPDEMRMQALLIRERILGPSHPDTSYYIRYRGAVYADSGNFERCIR
: . .:::.. : : ..:..:..:::::: .. :.:. : ::::::::. .::.::.
CCDS10 TECRNPQELESIRQDRDALHMEGLIVRERILGADNIDVSHPIIYRGAVYADNMEFEQCIK
310 320 330 340 350 360
420 430 440 450 460 470
pF1KB3 LWKYALDMQQSNLEPLSPMTASSFLSFAELFSYVLQDRAAKGSLGTQIGFADLMGVLTKG
:: .:: ..:.. . : ...: ::..:: ... :. . :. :: .
CCDS10 LWLHALHLRQKG----NRNTHKDLLRFAQVFSQMIH-------LNETVKAPDIECVLRCS
370 380 390 400 410
480 490 500 510 520
pF1KB3 VREVERALQLPREPGDSA------QFTKALAIILHLLYLLEKVECTPSQEHLKHQTVYRL
: :.:.... .. .:. .. : .:.:. . :..:. .. .. .: :
CCDS10 VLEIEQSMNRVKNISDADVHNAMDNYECNLYTFLYLVCISTKTQCSEEDQCKINKQIYNL
420 430 440 450 460 470
530 540 550 560 570 580
pF1KB3 LKCAPRGKNGFTPLHMAVDKDTT--NVGRYPVGRFPSLHVVKVLLDCGADPDSRDFDNNT
.. :: ..::: ::.::...: . : ::. :.:.::::::. .. : ..:.
CCDS10 IHLDPRTREGFTLLHLAVNSNTPVDDFHTNDVCSFPNALVTKLLLDCGAEVNAVDNEGNS
480 490 500 510 520 530
590 600 610 620 630
pF1KB3 PLHIAAQNNCP--------AIMNALIEAGAHMDATNAFKKTAYELLDEKL--LARGTMQP
::: .: : : .:. .:.::::: : :: .:: ::.. ... ..
CCDS10 ALHIIVQYNRPISDFLTLHSIIISLVEAGAHTDMTNKQNKTP---LDKSTTGVSEILLKT
540 550 560 570 580 590
640 650 660
pF1KB3 FNYVTLQCLAARALDKNKIPYKGFIPEDLEAFIELH
..:.::::::. : : :. ::. :: :. .:
CCDS10 QMKMSLKCLAARAVRANDINYQDQIPRTLEEFVGFH
600 610 620
669 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 12:31:38 2016 done: Thu Nov 3 12:31:39 2016
Total Scan time: 4.130 Total Display time: 0.040
Function used was FASTA [36.3.4 Apr, 2011]