FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6310, 417 aa 1>>>pF1KE6310 417 - 417 aa - 417 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5948+/-0.00036; mu= 16.7395+/- 0.023 mean_var=76.1490+/-15.731, 0's: 0 Z-trim(114.2): 24 B-trim: 1210 in 2/52 Lambda= 0.146975 statistics sampled from 23875 (23896) to 23875 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.645), E-opt: 0.2 (0.28), width: 16 Scan time: 8.500 The best scores are: opt bits E(85289) NP_005133 (OMIM: 261000,609342) gastric intrinsic ( 417) 2707 583.4 3.6e-166 XP_011543241 (OMIM: 261000,609342) PREDICTED: gast ( 403) 1780 386.8 5.3e-107 NP_001053 (OMIM: 189905) transcobalamin-1 precurso ( 433) 613 139.4 1.7e-32 NP_001171655 (OMIM: 275350,613441) transcobalamin- ( 400) 283 69.4 1.9e-11 NP_000346 (OMIM: 275350,613441) transcobalamin-2 i ( 427) 249 62.2 2.9e-09 >>NP_005133 (OMIM: 261000,609342) gastric intrinsic fact (417 aa) initn: 2707 init1: 2707 opt: 2707 Z-score: 3104.2 bits: 583.4 E(85289): 3.6e-166 Smith-Waterman score: 2707; 100.0% identity (100.0% similar) in 417 aa overlap (1-417:1-417) 10 20 30 40 50 60 pF1KE6 MAWFALYLLSLLWATAGTSTQTQSSCSVPSAQEPLVNGIQVLMENSVTSSAYPNPSILIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 MAWFALYLLSLLWATAGTSTQTQSSCSVPSAQEPLVNGIQVLMENSVTSSAYPNPSILIA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 MNLAGAYNLKAQKLLTYQLMSSDNNDLTIGQLGLTIMALTSSCRDPGDKVSILQRQMENW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 MNLAGAYNLKAQKLLTYQLMSSDNNDLTIGQLGLTIMALTSSCRDPGDKVSILQRQMENW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 APSSPNAEASAFYGPSLAILALCQKNSEATLPIAVRFAKTLLANSSPFNVDTGAMATLAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 APSSPNAEASAFYGPSLAILALCQKNSEATLPIAVRFAKTLLANSSPFNVDTGAMATLAL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 TCMYNKIPVGSEEGYRSLFGQVLKDIVEKISMKIKDNGIIGDIYSTGLAMQALSVTPEPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 TCMYNKIPVGSEEGYRSLFGQVLKDIVEKISMKIKDNGIIGDIYSTGLAMQALSVTPEPS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 KKEWNCKKTTDMILNEIKQGKFHNPMSIAQILPSLKGKTYLDVPQVTCSPDHEVQPTLPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 KKEWNCKKTTDMILNEIKQGKFHNPMSIAQILPSLKGKTYLDVPQVTCSPDHEVQPTLPS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 NPGPGPTSASNITVIYTINNQLRGVELLFNETINVSVKSGSVLLVVLEEAQRKNPMFKFE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 NPGPGPTSASNITVIYTINNQLRGVELLFNETINVSVKSGSVLLVVLEEAQRKNPMFKFE 310 320 330 340 350 360 370 380 390 400 410 pF1KE6 TTMTSWGLVVSSINNIAENVNHKTYWQFLSGVTPLNEGVADYIPFNHEHITANFTQY ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 TTMTSWGLVVSSINNIAENVNHKTYWQFLSGVTPLNEGVADYIPFNHEHITANFTQY 370 380 390 400 410 >>XP_011543241 (OMIM: 261000,609342) PREDICTED: gastric (403 aa) initn: 2593 init1: 1776 opt: 1780 Z-score: 2042.1 bits: 386.8 E(85289): 5.3e-107 Smith-Waterman score: 2569; 96.6% identity (96.6% similar) in 417 aa overlap (1-417:1-403) 10 20 30 40 50 60 pF1KE6 MAWFALYLLSLLWATAGTSTQTQSSCSVPSAQEPLVNGIQVLMENSVTSSAYPNPSILIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MAWFALYLLSLLWATAGTSTQTQSSCSVPSAQEPLVNGIQVLMENSVTSSAYPNPSILIA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 MNLAGAYNLKAQKLLTYQLMSSDNNDLTIGQLGLTIMALTSSCRDPGDKVSILQRQMENW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MNLAGAYNLKAQKLLTYQLMSSDNNDLTIGQLGLTIMALTSSCRDPGDKVSILQRQMENW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 APSSPNAEASAFYGPSLAILALCQKNSEATLPIAVRFAKTLLANSSPFNVDTGAMATLAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 APSSPNAEASAFYGPSLAILALCQKNSEATLPIAVRFAKTLLANSSPFNVDTGAMATLAL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 TCMYNKIPVGSEEGYRSLFGQVLKDIVEKISMKIKDNGIIGDIYSTGLAMQALSVTPEPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 TCMYNKIPVGSEEGYRSLFGQVLKDIVEKISMKIKDNGIIGDIYSTGLAMQALSVTPEPS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 KKEWNCKKTTDMILNEIKQGKFHNPMSIAQILPSLKGKTYLDVPQVTCSPDHEVQPTLPS :::::::::::::::::::::::::::::::::::: :::::::::: XP_011 KKEWNCKKTTDMILNEIKQGKFHNPMSIAQILPSLK--------------DHEVQPTLPS 250 260 270 280 310 320 330 340 350 360 pF1KE6 NPGPGPTSASNITVIYTINNQLRGVELLFNETINVSVKSGSVLLVVLEEAQRKNPMFKFE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 NPGPGPTSASNITVIYTINNQLRGVELLFNETINVSVKSGSVLLVVLEEAQRKNPMFKFE 290 300 310 320 330 340 370 380 390 400 410 pF1KE6 TTMTSWGLVVSSINNIAENVNHKTYWQFLSGVTPLNEGVADYIPFNHEHITANFTQY ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 TTMTSWGLVVSSINNIAENVNHKTYWQFLSGVTPLNEGVADYIPFNHEHITANFTQY 350 360 370 380 390 400 >>NP_001053 (OMIM: 189905) transcobalamin-1 precursor [H (433 aa) initn: 505 init1: 179 opt: 613 Z-score: 704.3 bits: 139.4 E(85289): 1.7e-32 Smith-Waterman score: 649; 33.9% identity (61.7% similar) in 410 aa overlap (33-417:37-433) 10 20 30 40 50 60 pF1KE6 WFALYLLSLLWATAGTSTQTQSSCSVPSAQEPLVNGIQVLMENSVTSSAYPNPSILIAMN .::.: . : ::.. ....... NP_001 LPLVGLLLFSFIPSQLCEICEVSEENYIRLKPLLNTMIQSNYNRGTSAV----NVVLSLK 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 LAGAYNLKAQKLLTYQL---MSSDNNDLTIGQLGLTIMALTSSCRDPGDKVSI---LQRQ :.: .. . :. ..: .:.. :.:.: :.:: . ::. ... : . NP_001 LVGIQIQTLMQKMIQQIKYNVKSRLSDVSSGELALIILAL-GVCRNAEENLIYDYHLIDK 70 80 90 100 110 120 120 130 140 150 160 pF1KE6 MEN-WAPSSPNAEA------SAFYGPSLAILALCQKNSEATLPIAVRFAKTLLAN---SS .:: . : :: . .: :: .:::: :.. . .: : .: NP_001 LENKFQAEIENMEAHNGTPLTNYYQLSLDVLALCLFNGNYSTAEVVNHFTPENKNYYFGS 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE6 PFNVDTGAMATLALTCMYNKIPVG---SEEGYRSLFGQVLKDIVEKISMKIKDNGIIGDI :.:::::::.:::::. ... : ..:: . .. :..:::: . :.::.::. NP_001 QFSVDTGAMAVLALTCVKKSLINGQIKADEGSLKNISIYTKSLVEKILSEKKENGLIGNT 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE6 YSTGLAMQALSVTPEP-SKKEWNCKKTTDMILNEIKQGKFHNPMSIAQILPSLKGKTYLD .::: ::::: :. . ....:::..: . .:.::.:: : :: . ::.::.: :::.:: NP_001 FSTGEAMQALFVSSDYYNENDWNCQQTLNTVLTEISQGAFSNPNAAAQVLPALMGKTFLD 250 260 270 280 290 300 290 300 310 320 330 pF1KE6 VPQ----VTCSPDHEVQPTLPSNPGPGPTSASNITVIYTINNQLRGVELLFNETINVSVK . . :. : . ... : . : : : : :.: :.. : : :. ::.: NP_001 INKDSSCVSASGNFNISADEPITVTP-PDSQSYISVNYSV----RINETYFT---NVTVL 310 320 330 340 350 340 350 360 370 380 390 pF1KE6 SGSVLLVVLEEAQRKNP-MFKFETTMTSWGLVVSSINNIAENVNHKTYWQFLSGVTPLNE .:::.: :.:.::. : .: : ::: .. :... : : .:::..::: ::.. NP_001 NGSVFLSVMEKAQKMNDTIFGFTMEERSWGPYITCIQGLCANNNDRTYWELLSGGEPLSQ 360 370 380 390 400 410 400 410 pF1KE6 GVADYIPFNHEHITANFTQY :...:. : :.. . ...: NP_001 GAGSYVVRNGENLEVRWSKY 420 430 >>NP_001171655 (OMIM: 275350,613441) transcobalamin-2 is (400 aa) initn: 193 init1: 65 opt: 283 Z-score: 326.6 bits: 69.4 E(85289): 1.9e-11 Smith-Waterman score: 366; 27.8% identity (55.7% similar) in 431 aa overlap (6-410:8-393) 10 20 30 40 50 pF1KE6 MAWFALYLLSLLWATAGTSTQTQSSCSVPSAQEPLVN--GIQVLMENSVTSSAYPNPS :.::..: :. :. : .: . ::. : ..: . : . ::: NP_001 MRHLGAFLFLLGVL----GALTEM---CEIPEMDSHLVEKLGQHLLPWMDRLSLEHLNPS 10 20 30 40 50 60 70 80 90 100 pF1KE6 ILIAMNL----AGA------YNLKA--QKLLTYQLMSSDNNDL----TIGQLGLTIMALT : ... : ::. ..:: :. : . .: :..: ..:::.: ..:: NP_001 IYVGLRLSSLQAGTKEDLYLHSLKLGYQQCLLGSAFSEDDGDCQGKPSMGQLALYLLALR 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE6 SSCRDPGDKVSILQRQMENWAPSSPNAEASAFYGPSLAILALCQKNSEATLPIAVRFAKT . :: . . ..: .: .:.::::: ..... .. . NP_001 A-----------------NWHDHKGHPHTS-YYQYGLGILALCLHQKRVHDSVVDK---- 120 130 140 150 170 180 190 200 210 pF1KE6 LLANSSPFN-----VDTGAMATLALTCMYNKIPVGSEEGYRSLFGQVLKDIVEKISMKIK :: ::. :::.::: ::.::. . . . : :. . .... . :.: NP_001 LLYAVEPFHQGHHSVDTAAMAGLAFTCLKRS---NFNPGRRQRITMAIRTVREEILKAQT 160 170 180 190 200 220 230 240 250 260 270 pF1KE6 DNGIIGDIYSTGLAMQALSVTP-EPSKKEWNCKKTTDMILNEIKQGKFHNPMSIAQILPS .: .:..::: ::.: : ..: . .. : :. .: ...: :.: . :.:.:: NP_001 PEGHFGNVYSTPLALQFLMTSPMRGAELGTACLKARVALLASLQDGAFQNALMISQLLPV 210 220 230 240 250 260 280 290 300 310 320 330 pF1KE6 LKGKTYLDVPQVTC-SPDHEVQPTLPSNPGPGPTSASNITVIYTINNQLRGVELLFNETI :. :::.:. : .: ..:. . : ....: :. :. . :: NP_001 LNHKTYIDLIFPDCLAPRVMLEPAAETIP-----QTQEI-----ISVTLQVLSLLPPYRQ 270 280 290 300 310 340 350 360 370 380 390 pF1KE6 NVSVKSGSVLLVVLEEAQRKNPMFKFETTMTSWGLVVSSINNIAENVNHKTYWQFLSGV- ..:: .::.. ::..:.. . : .:: . : ..:. ... .... .::.: NP_001 SISVLAGSTVEDVLKKAHELGG-FTYETQASLSGPYLTSV--MGKAAGEREFWQLLRDPN 320 330 340 350 360 370 400 410 pF1KE6 TPLNEGVADYIPFNHEHITANFTQY ::: .:.::: : . : : NP_001 TPLLQGIADYRPKDGETIELRLVSW 380 390 400 >>NP_000346 (OMIM: 275350,613441) transcobalamin-2 isofo (427 aa) initn: 193 init1: 65 opt: 249 Z-score: 287.2 bits: 62.2 E(85289): 2.9e-09 Smith-Waterman score: 393; 28.2% identity (57.0% similar) in 440 aa overlap (6-410:8-420) 10 20 30 40 50 pF1KE6 MAWFALYLLSLLWATAGTSTQTQSSCSVPSAQEPLVN--GIQVLMENSVTSSAYPNPS :.::..: :. :. : .: . ::. : ..: . : . ::: NP_000 MRHLGAFLFLLGVL----GALTEM---CEIPEMDSHLVEKLGQHLLPWMDRLSLEHLNPS 10 20 30 40 50 60 70 80 90 100 pF1KE6 ILIAMNL----AGA------YNLKA--QKLLTYQLMSSDNNDL----TIGQLGLTIMALT : ... : ::. ..:: :. : . .: :..: ..:::.: ..:: NP_000 IYVGLRLSSLQAGTKEDLYLHSLKLGYQQCLLGSAFSEDDGDCQGKPSMGQLALYLLALR 60 70 80 90 100 110 110 120 130 140 150 pF1KE6 SSCR----DPGDK-VSILQRQMENW----APSSPNAEASAFYGPSLAILALCQKNSEATL ..:. ::. :: :. .:. . . . ...: .:.::::: ..... NP_000 ANCEFVRGHKGDRLVSQLKWFLEDEKRAIGHDHKGHPHTSYYQYGLGILALCLHQKRVHD 120 130 140 150 160 170 160 170 180 190 200 pF1KE6 PIAVRFAKTLLANSSPFN-----VDTGAMATLALTCMYNKIPVGSEEGYRSLFGQVLKDI .. . :: ::. :::.::: ::.::. . . . : :. . .... . NP_000 SVVDK----LLYAVEPFHQGHHSVDTAAMAGLAFTCLKRS---NFNPGRRQRITMAIRTV 180 190 200 210 220 210 220 230 240 250 260 pF1KE6 VEKISMKIKDNGIIGDIYSTGLAMQALSVTP-EPSKKEWNCKKTTDMILNEIKQGKFHNP :.: .: .:..::: ::.: : ..: . .. : :. .: ...: :.: NP_000 REEILKAQTPEGHFGNVYSTPLALQFLMTSPMRGAELGTACLKARVALLASLQDGAFQNA 230 240 250 260 270 280 270 280 290 300 310 320 pF1KE6 MSIAQILPSLKGKTYLDVPQVTC-SPDHEVQPTLPSNPGPGPTSASNITVIYTINNQLRG . :.:.:: :. :::.:. : .: ..:. . : ....: :. :. NP_000 LMISQLLPVLNHKTYIDLIFPDCLAPRVMLEPAAETIP-----QTQEI-----ISVTLQV 290 300 310 320 330 330 340 350 360 370 380 pF1KE6 VELLFNETINVSVKSGSVLLVVLEEAQRKNPMFKFETTMTSWGLVVSSINNIAENVNHKT . :: ..:: .::.. ::..:.. . : .:: . : ..:. ... .... NP_000 LSLLPPYRQSISVLAGSTVEDVLKKAHELGG-FTYETQASLSGPYLTSV--MGKAAGERE 340 350 360 370 380 390 390 400 410 pF1KE6 YWQFLSGV-TPLNEGVADYIPFNHEHITANFTQY .::.: ::: .:.::: : . : : NP_000 FWQLLRDPNTPLLQGIADYRPKDGETIELRLVSW 400 410 420 417 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:58:34 2016 done: Tue Nov 8 11:58:35 2016 Total Scan time: 8.500 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]