FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6369, 348 aa 1>>>pF1KE6369 348 - 348 aa - 348 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.4168+/-0.000419; mu= 21.3473+/- 0.026 mean_var=65.7405+/-14.660, 0's: 0 Z-trim(109.9): 33 B-trim: 1967 in 2/48 Lambda= 0.158182 statistics sampled from 18105 (18137) to 18105 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.571), E-opt: 0.2 (0.213), width: 16 Scan time: 4.440 The best scores are: opt bits E(85289) NP_000443 (OMIM: 601295,613291) ileal sodium/bile ( 348) 2328 540.5 2.1e-153 NP_932069 (OMIM: 613366) solute carrier family 10 ( 377) 1115 263.7 4.7e-70 NP_003040 (OMIM: 182396) sodium/bile acid cotransp ( 349) 709 171.0 3.5e-42 NP_001135863 (OMIM: 312090) P3 protein isoform 2 p ( 448) 434 108.3 3.3e-23 XP_011529503 (OMIM: 312090) PREDICTED: P3 protein ( 477) 434 108.4 3.4e-23 NP_062822 (OMIM: 312090) P3 protein isoform 1 prec ( 477) 434 108.4 3.4e-23 NP_001135864 (OMIM: 312090) P3 protein isoform 1 p ( 477) 434 108.4 3.4e-23 XP_011529502 (OMIM: 312090) PREDICTED: P3 protein ( 532) 434 108.4 3.7e-23 XP_005277970 (OMIM: 312090) PREDICTED: P3 protein ( 532) 434 108.4 3.7e-23 XP_006724911 (OMIM: 312090) PREDICTED: P3 protein ( 541) 434 108.4 3.7e-23 XP_006724910 (OMIM: 312090) PREDICTED: P3 protein ( 570) 434 108.4 3.8e-23 >>NP_000443 (OMIM: 601295,613291) ileal sodium/bile acid (348 aa) initn: 2328 init1: 2328 opt: 2328 Z-score: 2875.1 bits: 540.5 E(85289): 2.1e-153 Smith-Waterman score: 2328; 100.0% identity (100.0% similar) in 348 aa overlap (1-348:1-348) 10 20 30 40 50 60 pF1KE6 MNDPNSCVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMFSMGCNVEIKKFLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 MNDPNSCVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMFSMGCNVEIKKFLG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 HIKRPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCCPGGTASNILAYWVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 HIKRPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCCPGGTASNILAYWVD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 GDMDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVALVVPVSIGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 GDMDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVALVVPVSIGM 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 FVNHKWPQKAKIILKIGSIAGAILIVLIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 FVNHKWPQKAKIILKIGSIAGAILIVLIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYSL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 GFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLAF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 GFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLAF 250 260 270 280 290 300 310 320 330 340 pF1KE6 AAIFLGFYVAYKKCHGKNKAEIPESKENGTEPESSFYKANGGFQPDEK :::::::::::::::::::::::::::::::::::::::::::::::: NP_000 AAIFLGFYVAYKKCHGKNKAEIPESKENGTEPESSFYKANGGFQPDEK 310 320 330 340 >>NP_932069 (OMIM: 613366) solute carrier family 10 memb (377 aa) initn: 1198 init1: 1070 opt: 1115 Z-score: 1378.6 bits: 263.7 E(85289): 4.7e-70 Smith-Waterman score: 1118; 47.9% identity (81.0% similar) in 326 aa overlap (30-345:30-355) 10 20 30 40 50 60 pF1KE6 MNDPNSCVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMFSMGCNVEIKKFLG : .:...: :....:.:::.::.:::.:. . NP_932 MRANCSSSSACPANSSEEELPVGLEVHGNLELVFTVVSTVMMGLLMFSLGCSVEIRKLWS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 HIKRPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCCPGGTASNILAYWVD ::.::::: ::.:::::.::.:...:...:.. :.::..:::.::::::: :::...::: NP_932 HIRRPWGIAVGLLCQFGLMPFTAYLLAISFSLKPVQAIAVLIMGCCPGGTISNIFTFWVD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 GDMDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVALVVPVSIGM ::::::.:::::::. ::::::::. .:: : . ...:::.::: .:: :..::..:. NP_932 GDMDLSISMTTCSTVAALGMMPLCIYLYTWSWSLQQNLTIPYQNIGITLVCLTIPVAFGV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 FVNHKWPQKAKIILKIGSIAGAILIVLIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYSL .::..::...:::::::...:..:....::.: .: ...: : :. :::. :. NP_932 YVNYRWPKQSKIILKIGAVVGGVLLLVVAVAGVVLAKGSWNSDITLLTISFIFPLIGHVT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 GFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLAF ::::: .. : ::::...::: :: :.: :..::::: :.: ...::: :..::: NP_932 GFLLALFTHQSWQRCRTISLETGAQNIQMCITMLQLSFTAEHLVQMLSFPLAYGLFQLID 250 260 270 280 290 300 310 320 330 340 pF1KE6 AAIFLGFYVAYKK----CHGKNKA---EIPESKENGTEPESS-FYKAN--GGFQPDEK . .... : .::. :::... :. ..... . :.. : ..: :.. : NP_932 GFLIVAAYQTYKRRLKNKHGKKNSGCTEVCHTRKSTSSRETNAFLEVNEEGAITPGPPGP 310 320 330 340 350 360 NP_932 MDCHRALEPVGHITSCE 370 >>NP_003040 (OMIM: 182396) sodium/bile acid cotransporte (349 aa) initn: 691 init1: 401 opt: 709 Z-score: 878.3 bits: 171.0 E(85289): 3.5e-42 Smith-Waterman score: 709; 38.4% identity (73.4% similar) in 297 aa overlap (32-322:25-319) 10 20 30 40 50 60 pF1KE6 NDPNSCVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMFSMGCNVEIKKFLGH ..::..:...: ..:.:.::..:..:. .: NP_003 MEAHNASAPFNFTLPPNFGKRPTDLALSVILVFMLFFIMLSLGCTMEFSKIKAH 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 IKRPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCCPGGTASNILAYWVDG . .: :. .... :.::::::.:.:. .: . ..:...:. :: :::. ::... . : NP_003 LWKPKGLAIALVAQYGIMPLTAFVLGKVFRLKNIEALAILVCGCSPGGNLSNVFSLAMKG 60 70 80 90 100 110 130 140 150 160 170 pF1KE6 DMDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIV--IPYDNIGTSLVALVVPVSIG ::.::. ::::::. ::::::: : ::.. : :.. .:: .: ::: ...: .:: NP_003 DMNLSIVMTTCSTFCALGMMPLLLYIYSRGIYD-GDLKDKVPYKGIVISLVLVLIPCTIG 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 MFVNHKWPQKAKIILKIGSIAGAILIVLIAVVGGILYQSAWIIA--PKLWIIGTIFPVAG . .. : :: . ..: : : . : ..:...: .. ..: : : ....: : NP_003 IVLKSKRPQYMRYVIKGGMIIILLCSVAVTVLSAINVGKSIMFAMTPLLIATSSLMPFIG 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 YSLGFLLARIAGLPWYRCR-TVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIF . ::..:. . : ::: ::..::: ::.::::::....: :: .. .: :::.: :: NP_003 FLLGYVLSALFCLNG-RCRRTVSMETGCQNVQLCSTILNVAFPPEVIGPLFFFPLLYMIF 240 250 260 270 280 290 300 310 320 330 340 pF1KE6 QLAFAAIFLGFYVAYKKCHG-KNKAEIPESKENGTEPESSFYKANGGFQPDEK ::. . ...... :.: . :.:... NP_003 QLGEGLLLIAIFWCYEKFKTPKDKTKMIYTAATTEETIPGALGNGTYKGEDCSPCTA 300 310 320 330 340 >>NP_001135863 (OMIM: 312090) P3 protein isoform 2 precu (448 aa) initn: 336 init1: 225 opt: 434 Z-score: 537.8 bits: 108.3 E(85289): 3.3e-23 Smith-Waterman score: 434; 28.8% identity (64.1% similar) in 281 aa overlap (37-309:163-440) 10 20 30 40 50 60 pF1KE6 CVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMF---SMGCNVEIKKFLGHIK .: .:: :.. :.::.::.. . : .. NP_001 ERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIFVNKCSFGCKVELEVLKGLMQ 140 150 160 170 180 190 70 80 90 100 110 120 pF1KE6 RPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCC-PGGTASNILAYWVDGD : . .:.: :: .::: .:... .: .:: .. ::: : ::: .: ... . :: NP_001 SPQPMLLGLLGQFLVMPLYAFLMAKVF-MLPKALALGLIITCSSPGGGGSYLFSLLLGGD 200 210 220 230 240 250 130 140 150 160 170 180 pF1KE6 MDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVALVVPVSIGMFV . :..::: ::. : :..:: ::... .. .: ..: .:. ...:...:... NP_001 VTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVPISKILGTLLFIAIPIAVGVLI 260 270 280 290 300 310 190 200 210 220 230 pF1KE6 NHKWPQKAKIILKIGSIAGAILIV---LIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYS . : :. ....:.. . . .:.. ..: :. . : : : . ..: :..: NP_001 KSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGV-FILAGIRLP-IVLVGITVPLVGLL 320 330 340 350 360 240 250 260 270 280 290 pF1KE6 LGFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLA .:. :: :: . :::..:.:.::. : ...:::. . . . :.: .. . NP_001 VGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLSLRRLQADYASQAPFIVALSGTS 370 380 390 400 410 420 300 310 320 330 340 pF1KE6 -FAAIFLGFYVAYKKCHGKNKAEIPESKENGTEPESSFYKANGGFQPDEK . :. .: .. NP_001 EMLALVIGHFIYSSLFPVP 430 440 >>XP_011529503 (OMIM: 312090) PREDICTED: P3 protein isof (477 aa) initn: 290 init1: 225 opt: 434 Z-score: 537.4 bits: 108.4 E(85289): 3.4e-23 Smith-Waterman score: 434; 28.8% identity (64.1% similar) in 281 aa overlap (37-309:192-469) 10 20 30 40 50 60 pF1KE6 CVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMF---SMGCNVEIKKFLGHIK .: .:: :.. :.::.::.. . : .. XP_011 ERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIFVNKCSFGCKVELEVLKGLMQ 170 180 190 200 210 220 70 80 90 100 110 120 pF1KE6 RPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCC-PGGTASNILAYWVDGD : . .:.: :: .::: .:... .: .:: .. ::: : ::: .: ... . :: XP_011 SPQPMLLGLLGQFLVMPLYAFLMAKVF-MLPKALALGLIITCSSPGGGGSYLFSLLLGGD 230 240 250 260 270 280 130 140 150 160 170 180 pF1KE6 MDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVALVVPVSIGMFV . :..::: ::. : :..:: ::... .. .: ..: .:. ...:...:... XP_011 VTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVPISKILGTLLFIAIPIAVGVLI 290 300 310 320 330 340 190 200 210 220 230 pF1KE6 NHKWPQKAKIILKIGSIAGAILIV---LIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYS . : :. ....:.. . . .:.. ..: :. . : : : . ..: :..: XP_011 KSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGV-FILAGIRLP-IVLVGITVPLVGLL 350 360 370 380 390 240 250 260 270 280 290 pF1KE6 LGFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLA .:. :: :: . :::..:.:.::. : ...:::. . . . :.: .. . XP_011 VGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLSLRRLQADYASQAPFIVALSGTS 400 410 420 430 440 450 300 310 320 330 340 pF1KE6 -FAAIFLGFYVAYKKCHGKNKAEIPESKENGTEPESSFYKANGGFQPDEK . :. .: .. XP_011 EMLALVIGHFIYSSLFPVP 460 470 >>NP_062822 (OMIM: 312090) P3 protein isoform 1 precurso (477 aa) initn: 290 init1: 225 opt: 434 Z-score: 537.4 bits: 108.4 E(85289): 3.4e-23 Smith-Waterman score: 434; 28.8% identity (64.1% similar) in 281 aa overlap (37-309:192-469) 10 20 30 40 50 60 pF1KE6 CVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMF---SMGCNVEIKKFLGHIK .: .:: :.. :.::.::.. . : .. NP_062 ERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIFVNKCSFGCKVELEVLKGLMQ 170 180 190 200 210 220 70 80 90 100 110 120 pF1KE6 RPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCC-PGGTASNILAYWVDGD : . .:.: :: .::: .:... .: .:: .. ::: : ::: .: ... . :: NP_062 SPQPMLLGLLGQFLVMPLYAFLMAKVF-MLPKALALGLIITCSSPGGGGSYLFSLLLGGD 230 240 250 260 270 280 130 140 150 160 170 180 pF1KE6 MDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVALVVPVSIGMFV . :..::: ::. : :..:: ::... .. .: ..: .:. ...:...:... NP_062 VTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVPISKILGTLLFIAIPIAVGVLI 290 300 310 320 330 340 190 200 210 220 230 pF1KE6 NHKWPQKAKIILKIGSIAGAILIV---LIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYS . : :. ....:.. . . .:.. ..: :. . : : : . ..: :..: NP_062 KSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGV-FILAGIRLP-IVLVGITVPLVGLL 350 360 370 380 390 240 250 260 270 280 290 pF1KE6 LGFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLA .:. :: :: . :::..:.:.::. : ...:::. . . . :.: .. . NP_062 VGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLSLRRLQADYASQAPFIVALSGTS 400 410 420 430 440 450 300 310 320 330 340 pF1KE6 -FAAIFLGFYVAYKKCHGKNKAEIPESKENGTEPESSFYKANGGFQPDEK . :. .: .. NP_062 EMLALVIGHFIYSSLFPVP 460 470 >>NP_001135864 (OMIM: 312090) P3 protein isoform 1 precu (477 aa) initn: 290 init1: 225 opt: 434 Z-score: 537.4 bits: 108.4 E(85289): 3.4e-23 Smith-Waterman score: 434; 28.8% identity (64.1% similar) in 281 aa overlap (37-309:192-469) 10 20 30 40 50 60 pF1KE6 CVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMF---SMGCNVEIKKFLGHIK .: .:: :.. :.::.::.. . : .. NP_001 ERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIFVNKCSFGCKVELEVLKGLMQ 170 180 190 200 210 220 70 80 90 100 110 120 pF1KE6 RPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCC-PGGTASNILAYWVDGD : . .:.: :: .::: .:... .: .:: .. ::: : ::: .: ... . :: NP_001 SPQPMLLGLLGQFLVMPLYAFLMAKVF-MLPKALALGLIITCSSPGGGGSYLFSLLLGGD 230 240 250 260 270 280 130 140 150 160 170 180 pF1KE6 MDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVALVVPVSIGMFV . :..::: ::. : :..:: ::... .. .: ..: .:. ...:...:... NP_001 VTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVPISKILGTLLFIAIPIAVGVLI 290 300 310 320 330 340 190 200 210 220 230 pF1KE6 NHKWPQKAKIILKIGSIAGAILIV---LIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYS . : :. ....:.. . . .:.. ..: :. . : : : . ..: :..: NP_001 KSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGV-FILAGIRLP-IVLVGITVPLVGLL 350 360 370 380 390 240 250 260 270 280 290 pF1KE6 LGFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLA .:. :: :: . :::..:.:.::. : ...:::. . . . :.: .. . NP_001 VGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLSLRRLQADYASQAPFIVALSGTS 400 410 420 430 440 450 300 310 320 330 340 pF1KE6 -FAAIFLGFYVAYKKCHGKNKAEIPESKENGTEPESSFYKANGGFQPDEK . :. .: .. NP_001 EMLALVIGHFIYSSLFPVP 460 470 >>XP_011529502 (OMIM: 312090) PREDICTED: P3 protein isof (532 aa) initn: 290 init1: 225 opt: 434 Z-score: 536.8 bits: 108.4 E(85289): 3.7e-23 Smith-Waterman score: 434; 28.8% identity (64.1% similar) in 281 aa overlap (37-309:247-524) 10 20 30 40 50 60 pF1KE6 CVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMF---SMGCNVEIKKFLGHIK .: .:: :.. :.::.::.. . : .. XP_011 ERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIFVNKCSFGCKVELEVLKGLMQ 220 230 240 250 260 270 70 80 90 100 110 120 pF1KE6 RPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCC-PGGTASNILAYWVDGD : . .:.: :: .::: .:... .: .:: .. ::: : ::: .: ... . :: XP_011 SPQPMLLGLLGQFLVMPLYAFLMAKVF-MLPKALALGLIITCSSPGGGGSYLFSLLLGGD 280 290 300 310 320 330 130 140 150 160 170 180 pF1KE6 MDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVALVVPVSIGMFV . :..::: ::. : :..:: ::... .. .: ..: .:. ...:...:... XP_011 VTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVPISKILGTLLFIAIPIAVGVLI 340 350 360 370 380 390 190 200 210 220 230 pF1KE6 NHKWPQKAKIILKIGSIAGAILIV---LIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYS . : :. ....:.. . . .:.. ..: :. . : : : . ..: :..: XP_011 KSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGV-FILAGIRLP-IVLVGITVPLVGLL 400 410 420 430 440 450 240 250 260 270 280 290 pF1KE6 LGFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLA .:. :: :: . :::..:.:.::. : ...:::. . . . :.: .. . XP_011 VGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLSLRRLQADYASQAPFIVALSGTS 460 470 480 490 500 510 300 310 320 330 340 pF1KE6 -FAAIFLGFYVAYKKCHGKNKAEIPESKENGTEPESSFYKANGGFQPDEK . :. .: .. XP_011 EMLALVIGHFIYSSLFPVP 520 530 >>XP_005277970 (OMIM: 312090) PREDICTED: P3 protein isof (532 aa) initn: 290 init1: 225 opt: 434 Z-score: 536.8 bits: 108.4 E(85289): 3.7e-23 Smith-Waterman score: 434; 28.8% identity (64.1% similar) in 281 aa overlap (37-309:247-524) 10 20 30 40 50 60 pF1KE6 CVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMF---SMGCNVEIKKFLGHIK .: .:: :.. :.::.::.. . : .. XP_005 ERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIFVNKCSFGCKVELEVLKGLMQ 220 230 240 250 260 270 70 80 90 100 110 120 pF1KE6 RPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCC-PGGTASNILAYWVDGD : . .:.: :: .::: .:... .: .:: .. ::: : ::: .: ... . :: XP_005 SPQPMLLGLLGQFLVMPLYAFLMAKVF-MLPKALALGLIITCSSPGGGGSYLFSLLLGGD 280 290 300 310 320 330 130 140 150 160 170 180 pF1KE6 MDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVALVVPVSIGMFV . :..::: ::. : :..:: ::... .. .: ..: .:. ...:...:... XP_005 VTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVPISKILGTLLFIAIPIAVGVLI 340 350 360 370 380 390 190 200 210 220 230 pF1KE6 NHKWPQKAKIILKIGSIAGAILIV---LIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYS . : :. ....:.. . . .:.. ..: :. . : : : . ..: :..: XP_005 KSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGV-FILAGIRLP-IVLVGITVPLVGLL 400 410 420 430 440 450 240 250 260 270 280 290 pF1KE6 LGFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLA .:. :: :: . :::..:.:.::. : ...:::. . . . :.: .. . XP_005 VGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLSLRRLQADYASQAPFIVALSGTS 460 470 480 490 500 510 300 310 320 330 340 pF1KE6 -FAAIFLGFYVAYKKCHGKNKAEIPESKENGTEPESSFYKANGGFQPDEK . :. .: .. XP_005 EMLALVIGHFIYSSLFPVP 520 530 >>XP_006724911 (OMIM: 312090) PREDICTED: P3 protein isof (541 aa) initn: 290 init1: 225 opt: 434 Z-score: 536.7 bits: 108.4 E(85289): 3.7e-23 Smith-Waterman score: 434; 28.8% identity (64.1% similar) in 281 aa overlap (37-309:256-533) 10 20 30 40 50 60 pF1KE6 CVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLALVMF---SMGCNVEIKKFLGHIK .: .:: :.. :.::.::.. . : .. XP_006 ERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIFVNKCSFGCKVELEVLKGLMQ 230 240 250 260 270 280 70 80 90 100 110 120 pF1KE6 RPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLIIGCC-PGGTASNILAYWVDGD : . .:.: :: .::: .:... .: .:: .. ::: : ::: .: ... . :: XP_006 SPQPMLLGLLGQFLVMPLYAFLMAKVF-MLPKALALGLIITCSSPGGGGSYLFSLLLGGD 290 300 310 320 330 340 130 140 150 160 170 180 pF1KE6 MDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVIPYDNIGTSLVALVVPVSIGMFV . :..::: ::. : :..:: ::... .. .: ..: .:. ...:...:... XP_006 VTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVPISKILGTLLFIAIPIAVGVLI 350 360 370 380 390 400 190 200 210 220 230 pF1KE6 NHKWPQKAKIILKIGSIAGAILIV---LIAVVGGILYQSAWIIAPKLWIIGTIFPVAGYS . : :. ....:.. . . .:.. ..: :. . : : : . ..: :..: XP_006 KSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGV-FILAGIRLP-IVLVGITVPLVGLL 410 420 430 440 450 460 240 250 260 270 280 290 pF1KE6 LGFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQLSFTPEELNVVFTFPLIYSIFQLA .:. :: :: . :::..:.:.::. : ...:::. . . . :.: .. . XP_006 VGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLSLRRLQADYASQAPFIVALSGTS 470 480 490 500 510 520 300 310 320 330 340 pF1KE6 -FAAIFLGFYVAYKKCHGKNKAEIPESKENGTEPESSFYKANGGFQPDEK . :. .: .. XP_006 EMLALVIGHFIYSSLFPVP 530 540 348 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:32:38 2016 done: Tue Nov 8 12:32:38 2016 Total Scan time: 4.440 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]