FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7670, 395 aa
1>>>pF1KB7670 395 - 395 aa - 395 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.8093+/-0.000398; mu= 13.6302+/- 0.025
mean_var=254.6455+/-52.076, 0's: 0 Z-trim(121.2): 386 B-trim: 0 in 0/55
Lambda= 0.080372
statistics sampled from 37014 (37535) to 37014 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.777), E-opt: 0.2 (0.44), width: 16
Scan time: 9.960
The best scores are:                                       opt  bits E(85289)
NP_002307 (OMIM: 161200,602575) LIM homeobox trans ( 395) 2745 331.3  2.5e-90
NP_001167618 (OMIM: 161200,602575) LIM homeobox tr ( 402) 2721 328.5  1.8e-89
NP_001167617 (OMIM: 161200,602575) LIM homeobox tr ( 406) 2075 253.6  6.3e-67
NP_001167540 (OMIM: 600298) LIM homeobox transcrip ( 382) 1722 212.6  1.3e-54
NP_796372 (OMIM: 600298) LIM homeobox transcriptio ( 382) 1722 212.6  1.3e-54
XP_011507840 (OMIM: 600298) PREDICTED: LIM homeobo ( 302) 1369 171.5  2.4e-42
XP_011507842 (OMIM: 600298) PREDICTED: LIM homeobo ( 279) 1341 168.2  2.1e-41
NP_665804 (OMIM: 609481) insulin gene enhancer pro ( 359)  621  84.9  3.3e-16
XP_016877994 (OMIM: 609481) PREDICTED: insulin gen ( 534)  618  84.8  5.3e-16
NP_002193 (OMIM: 600366) insulin gene enhancer pro ( 349)  562  78.1  3.8e-14
NP_071758 (OMIM: 605992) LIM/homeobox protein Lhx5 ( 402)  481  68.8  2.7e-11
NP_005559 (OMIM: 601999) LIM/homeobox protein Lhx1 ( 406)  472  67.7  5.6e-11
NP_203129 (OMIM: 262700,602146) LIM/homeobox prote ( 390)  470  67.5  6.5e-11
NP_835258 (OMIM: 221750,600577) LIM/homeobox prote ( 397)  455  65.7  2.2e-10
XP_005263467 (OMIM: 221750,600577) PREDICTED: LIM/ ( 386)  442  64.2  6.1e-10
XP_016870657 (OMIM: 221750,600577) PREDICTED: LIM/ ( 373)  429  62.7  1.7e-09
NP_055379 (OMIM: 221750,600577) LIM/homeobox prote ( 402)  429  62.7  1.8e-09
NP_001243043 (OMIM: 604425) LIM/homeobox protein L ( 346)  425  62.2  2.3e-09
NP_001001933 (OMIM: 604425) LIM/homeobox protein L ( 356)  425  62.2  2.3e-09
XP_016856805 (OMIM: 604425) PREDICTED: LIM/homeobo ( 363)  425  62.2  2.3e-09
XP_016856806 (OMIM: 604425) PREDICTED: LIM/homeobo ( 363)  425  62.2  2.3e-09
XP_005245407 (OMIM: 606066) PREDICTED: LIM/homeobo ( 336)  410  60.4  7.4e-09
NP_001229263 (OMIM: 608215) LIM/homeobox protein L ( 363)  391  58.2  3.6e-08
NP_001229262 (OMIM: 608215) LIM/homeobox protein L ( 366)  391  58.3  3.6e-08
NP_954629 (OMIM: 608215) LIM/homeobox protein Lhx6 ( 377)  391  58.3  3.6e-08
XP_011516824 (OMIM: 608215) PREDICTED: LIM/homeobo ( 378)  391  58.3  3.6e-08
NP_055183 (OMIM: 608215) LIM/homeobox protein Lhx6 ( 392)  391  58.3  3.7e-08
XP_011516823 (OMIM: 608215) PREDICTED: LIM/homeobo ( 407)  391  58.3  3.8e-08
NP_001230540 (OMIM: 180386) LIM domain only protei ( 156)  379  56.3  5.9e-08
XP_005251973 (OMIM: 608215) PREDICTED: LIM/homeobo ( 230)  378  56.4  7.9e-08
XP_006717386 (OMIM: 603759) PREDICTED: LIM/homeobo ( 314)  380  56.9    8e-08
XP_011519065 (OMIM: 180386) PREDICTED: LIM domain  ( 145)  373  55.6  9.2e-08
XP_006719174 (OMIM: 180386) PREDICTED: LIM domain  ( 145)  373  55.6  9.2e-08
NP_061110 (OMIM: 180386) LIM domain only protein 3 ( 145)  373  55.6  9.2e-08
NP_001230538 (OMIM: 180386) LIM domain only protei ( 145)  373  55.6  9.2e-08
XP_011519064 (OMIM: 180386) PREDICTED: LIM domain  ( 145)  373  55.6  9.2e-08
XP_006719173 (OMIM: 180386) PREDICTED: LIM domain  ( 145)  373  55.6  9.2e-08
NP_001001395 (OMIM: 180386) LIM domain only protei ( 145)  373  55.6  9.2e-08
NP_001230539 (OMIM: 180386) LIM domain only protei ( 145)  373  55.6  9.2e-08
NP_001230541 (OMIM: 180386) LIM domain only protei ( 163)  373  55.6  9.8e-08
XP_006718291 (OMIM: 186921) PREDICTED: rhombotin-1 ( 193)  372  55.6  1.2e-07
XP_005271348 (OMIM: 603129) PREDICTED: LIM domain  ( 165)  369  55.2  1.4e-07
NP_006760 (OMIM: 603129) LIM domain transcription  ( 165)  369  55.2  1.4e-07
XP_011518400 (OMIM: 186921) PREDICTED: rhombotin-1 ( 145)  367  54.9  1.5e-07
XP_011518401 (OMIM: 186921) PREDICTED: rhombotin-1 ( 145)  367  54.9  1.5e-07
NP_001257357 (OMIM: 186921) rhombotin-1 isoform b  ( 155)  367  54.9  1.5e-07
NP_002306 (OMIM: 186921) rhombotin-1 isoform a [Ho ( 156)  367  54.9  1.5e-07
XP_011541682 (OMIM: 600366) PREDICTED: insulin gen ( 285)  365  55.1  2.5e-07
XP_016873219 (OMIM: 180385) PREDICTED: rhombotin-2 ( 158)  319  49.4  7.4e-06
NP_001135787 (OMIM: 180385) rhombotin-2 isoform 2  ( 158)  319  49.4  7.4e-06
>>NP_002307 (OMIM: 161200,602575) LIM homeobox transcrip (395 aa)
initn: 2745 init1: 2745 opt: 2745 Z-score: 1742.5 bits: 331.3 E(85289): 2.5e-90
Smith-Waterman score: 2745; 100.0% identity (100.0% similar) in 395 aa overlap (1-395:1-395)
               10        20        30        40        50        60
pF1KB7 MDIATGPESLERCFPRGQTDCAKMLDGIKMEEHALRPGPATLGVLLGSDCPHPAVCEGCQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MDIATGPESLERCFPRGQTDCAKMLDGIKMEEHALRPGPATLGVLLGSDCPHPAVCEGCQ
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB7 RPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGCME
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 RPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGCME
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB7 KIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 KIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVS
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB7 PDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 PDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEV
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB7 SSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQEQQNSQRLGQEVLSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 SSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQEQQNSQRLGQEVLSS
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB7 RMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGNDSIFHDIDSDTSLTS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 RMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGNDSIFHDIDSDTSLTS
              310       320       330       340       350       360
              370       380       390
pF1KB7 LSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS
       :::::::::::::::::::::::::::::::::::
NP_002 LSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS
              370       380       390
>>NP_001167618 (OMIM: 161200,602575) LIM homeobox transc (402 aa)
initn: 2416 init1: 2416 opt: 2721 Z-score: 1727.4 bits: 328.5 E(85289): 1.8e-89
Smith-Waterman score: 2721; 98.3% identity (98.3% similar) in 402 aa overlap (1-395:1-402)
               10        20        30        40        50        60
pF1KB7 MDIATGPESLERCFPRGQTDCAKMLDGIKMEEHALRPGPATLGVLLGSDCPHPAVCEGCQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MDIATGPESLERCFPRGQTDCAKMLDGIKMEEHALRPGPATLGVLLGSDCPHPAVCEGCQ
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB7 RPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGCME
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGCME
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB7 KIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 KIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVS
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB7 PDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEV
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB7 SSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQEQQNSQRLGQEVLSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQEQQNSQRLGQEVLSS
              250       260       270       280       290       300
              310       320       330       340              350
pF1KB7 RMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPG-------NDSIFHDID
       ::::::::::::::::::::::::::::::::::::::::::::       :::::::::
NP_001 RMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGDHMNPYGNDSIFHDID
              310       320       330       340       350       360
           360       370       380       390
pF1KB7 SDTSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS
       ::::::::::::::::::::::::::::::::::::::::::
NP_001 SDTSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS
              370       380       390       400
>>NP_001167617 (OMIM: 161200,602575) LIM homeobox transc (406 aa)
initn: 2075 init1: 2075 opt: 2075 Z-score: 1322.5 bits: 253.6 E(85289): 6.3e-67
Smith-Waterman score: 2713; 97.3% identity (97.3% similar) in 406 aa overlap (1-395:1-406)
               10        20        30        40        50        60
pF1KB7 MDIATGPESLERCFPRGQTDCAKMLDGIKMEEHALRPGPATLGVLLGSDCPHPAVCEGCQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MDIATGPESLERCFPRGQTDCAKMLDGIKMEEHALRPGPATLGVLLGSDCPHPAVCEGCQ
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB7 RPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGCME
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGCME
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB7 KIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 KIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVS
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB7 PDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEV
              190       200       210       220       230       240
              250       260       270       280       290
pF1KB7 SSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQEQQNSQRLGQ-----
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQEQQNSQRLGQGEPGP
              250       260       270       280       290       300
               300       310       320       330       340
pF1KB7 ------EVLSSRMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGNDSIF
             ::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GQGLGQEVLSSRMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGNDSIF
              310       320       330       340       350       360
     350       360       370       380       390
pF1KB7 HDIDSDTSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS
       ::::::::::::::::::::::::::::::::::::::::::::::
NP_001 HDIDSDTSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS
              370       380       390       400
>>NP_001167540 (OMIM: 600298) LIM homeobox transcription (382 aa)
initn: 1748 init1: 935 opt: 1722 Z-score: 1101.6 bits: 212.6 E(85289): 1.3e-54
Smith-Waterman score: 1722; 67.2% identity (84.0% similar) in 387 aa overlap (24-395:1-382)
10 20 30 40 50
pF1KB7 MDIATGPESLERCFPRGQTDCAKMLDGIKMEEH--ALRPGPATLGVLLGSDCPHPAVCEG
::::.::::. . :... ::: .::::
NP_001 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEG
10 20 30
60 70 80 90 100 110
pF1KB7 CQRPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGC
::: : ::::.:.:.: :::.:.:::.:.. : :.:..::.::::: ::..:::.::.::
NP_001 CQRVILDRFLLRLNDSFWHEQCVQCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGC
40 50 60 70 80 90
120 130 140 150 160 170
pF1KB7 MEKIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSS
.: :::.:::::: . ::::.:::::::::::.:::::::::::::::::::::..:::
NP_001 FEAIAPNEFVMRAQKSVYHLSCFCCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSL
100 110 120 130 140 150
180 190 200 210 220 230
pF1KB7 VSPDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASF
::: ::: ::.::.. : :.: : ::....::: .:::::::::::::::::::::
NP_001 VSPAASDSGKSDDEESLCKSAHGAG---KGTAEEGKDHKRPKRPRTILTTQQRRAFKASF
160 170 180 190 200 210
240 250 260 270 280 290
pF1KB7 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQE-QQNSQRLGQEV
::::::::::::::::::::::::::::::::::::::::::.::::. :::.:::..
NP_001 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQ
220 230 240 250 260 270
300 310 320 330 340
pF1KB7 L----SSRMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGN-------D
:. :::.: :: : : ::..:.::: : :::::.::::::::::. .
NP_001 TNGGGSAGMEGIMNPYTAL-PTPQQLLAIEQSVY-SSDPFRQGLTPPQMPGDHMHPYGAE
280 290 300 310 320 330
350 360 370 380 390
pF1KB7 SIFHDIDSD-TSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS
.:::.::: :::..:.::::..:..: ::.:::::::.:::::.:::.:
NP_001 PLFHDLDSDDTSLSNLGDCFLATSEAGPLQSRVGNPIDHLYSMQNSYFTS
340 350 360 370 380
>>NP_796372 (OMIM: 600298) LIM homeobox transcription fa (382 aa)
initn: 1748 init1: 935 opt: 1722 Z-score: 1101.6 bits: 212.6 E(85289): 1.3e-54
Smith-Waterman score: 1722; 67.2% identity (84.0% similar) in 387 aa overlap (24-395:1-382)
10 20 30 40 50
pF1KB7 MDIATGPESLERCFPRGQTDCAKMLDGIKMEEH--ALRPGPATLGVLLGSDCPHPAVCEG
::::.::::. . :... ::: .::::
NP_796 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEG
10 20 30
60 70 80 90 100 110
pF1KB7 CQRPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGC
::: : ::::.:.:.: :::.:.:::.:.. : :.:..::.::::: ::..:::.::.::
NP_796 CQRVILDRFLLRLNDSFWHEQCVQCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGC
40 50 60 70 80 90
120 130 140 150 160 170
pF1KB7 MEKIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSS
.: :::.:::::: . ::::.:::::::::::.:::::::::::::::::::::..:::
NP_796 FEAIAPNEFVMRAQKSVYHLSCFCCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSL
100 110 120 130 140 150
180 190 200 210 220 230
pF1KB7 VSPDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASF
::: ::: ::.::.. : :.: : ::....::: .:::::::::::::::::::::
NP_796 VSPAASDSGKSDDEESLCKSAHGAG---KGTAEEGKDHKRPKRPRTILTTQQRRAFKASF
160 170 180 190 200 210
240 250 260 270 280 290
pF1KB7 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQE-QQNSQRLGQEV
::::::::::::::::::::::::::::::::::::::::::.::::. :::.:::..
NP_796 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQ
220 230 240 250 260 270
300 310 320 330 340
pF1KB7 L----SSRMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGN-------D
:. :::.: :: : : ::..:.::: : :::::.::::::::::. .
NP_796 TNGGGSAGMEGIMNPYTAL-PTPQQLLAIEQSVY-SSDPFRQGLTPPQMPGDHMHPYGAE
280 290 300 310 320 330
350 360 370 380 390
pF1KB7 SIFHDIDSD-TSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS
.:::.::: :::..:.::::..:..: ::.:::::::.:::::.:::.:
NP_796 PLFHDLDSDDTSLSNLGDCFLATSEAGPLQSRVGNPIDHLYSMQNSYFTS
340 350 360 370 380
>>XP_011507840 (OMIM: 600298) PREDICTED: LIM homeobox tr (302 aa)
initn: 1413 init1: 620 opt: 1369 Z-score: 881.4 bits: 171.5 E(85289): 2.4e-42
Smith-Waterman score: 1369; 71.0% identity (86.0% similar) in 300 aa overlap (109-395:8-302)
80 90 100 110 120 130
pF1KB7 ECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGCMEKIAPTEFVMRALECVYHL
.:::.::.::.: :::.:::::: . ::::
XP_011 MMFVLSNRLFAVKCGGCFEAIAPNEFVMRAQKSVYHL
10 20 30
140 150 160 170 180 190
pF1KB7 GCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVSPDESDSVKSEDEDGDMKP
.:::::::::::.:::::::::::::::::::::..::: ::: ::: ::.::.. :
XP_011 SCFCCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSLVSPAASDSGKSDDEESLCKS
40 50 60 70 80 90
200 210 220 230 240 250
pF1KB7 AKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGL
:.: : ::....::: .:::::::::::::::::::::::::::::::::::::::::
XP_011 AHGAG---KGTAEEGKDHKRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGL
100 110 120 130 140 150
260 270 280 290 300 310
pF1KB7 SVRVVQVWFQNQRAKMKKLARRHQQQQE-QQNSQRLGQEVL----SSRMEGMMASYTPLA
::::::::::::::::::::::.::::. :::.:::.. :. :::.: :: :
XP_011 SVRVVQVWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQTNGGGSAGMEGIMNPYTAL-
160 170 180 190 200 210
320 330 340 350 360
pF1KB7 PPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGN-------DSIFHDIDSD-TSLTSLSDCF
: ::..:.::: : :::::.::::::::::. . .:::.::: :::..:.:::
XP_011 PTPQQLLAIEQSVY-SSDPFRQGLTPPQMPGDHMHPYGAEPLFHDLDSDDTSLSNLGDCF
220 230 240 250 260 270
370 380 390
pF1KB7 LGSSDVGSLQARVGNPIDRLYSMQSSYFAS
:..:..: ::.:::::::.:::::.:::.:
XP_011 LATSEAGPLQSRVGNPIDHLYSMQNSYFTS
280 290 300
>>XP_011507842 (OMIM: 600298) PREDICTED: LIM homeobox tr (279 aa)
initn: 1465 init1: 857 opt: 1341 Z-score: 864.2 bits: 168.2 E(85289): 2.1e-41
Smith-Waterman score: 1341; 70.7% identity (87.0% similar) in 276 aa overlap (24-296:1-273)
10 20 30 40 50
pF1KB7 MDIATGPESLERCFPRGQTDCAKMLDGIKMEEH--ALRPGPATLGVLLGSDCPHPAVCEG
::::.::::. . :... ::: .::::
XP_011 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEG
10 20 30
60 70 80 90 100 110
pF1KB7 CQRPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGC
::: : ::::.:.:.: :::.:.:::.:.. : :.:..::.::::: ::..:::.::.::
XP_011 CQRVILDRFLLRLNDSFWHEQCVQCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGC
40 50 60 70 80 90
120 130 140 150 160 170
pF1KB7 MEKIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSS
.: :::.:::::: . ::::.:::::::::::.:::::::::::::::::::::..:::
XP_011 FEAIAPNEFVMRAQKSVYHLSCFCCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSL
100 110 120 130 140 150
180 190 200 210 220 230
pF1KB7 VSPDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASF
::: ::: ::.::.. : :.: : ::....::: .:::::::::::::::::::::
XP_011 VSPAASDSGKSDDEESLCKSAHGAG---KGTAEEGKDHKRPKRPRTILTTQQRRAFKASF
160 170 180 190 200 210
240 250 260 270 280 290
pF1KB7 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQE-QQNSQRLGQEV
::::::::::::::::::::::::::::::::::::::::::.::::. :::.:::...
XP_011 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSDW
220 230 240 250 260 270
300 310 320 330 340 350
pF1KB7 LSSRMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGNDSIFHDIDSDTS
XP_011 CSHTC
>>NP_665804 (OMIM: 609481) insulin gene enhancer protein (359 aa)
initn: 436 init1: 307 opt: 621 Z-score: 411.9 bits: 84.9 E(85289): 3.3e-16
Smith-Waterman score: 636; 34.4% identity (60.5% similar) in 337 aa overlap (51-378:22-345)
30 40 50 60 70
pF1KB7 CAKMLDGIKMEEHALRPGPATLGVLLGSDCPHPAVCEGCQRPISDRFLMRVNES-SWHEE
: :.: :: : :.:..::. . ::
NP_665 MVDIIFHYPFLGAMGDHSKKKPGTAMCVGCGSQIHDQFILRVSPDLEWHAA
10 20 30 40 50
80 90 100 110 120 130
pF1KB7 CLQCAACQQAL--TTSCYFRDRKLYCKQDYQQLFAAKCSGCMEKIAPTEFVMRALECVYH
::.:: :.: : : .:. :: : :::.:: .::. ::. :. .. ...:::: . :::
NP_665 CLKCAECSQYLDETCTCFVRDGKTYCKRDYVRLFGIKCAKCQVGFSSSDLVMRARDSVYH
60 70 80 90 100 110
140 150 160 170 180 190
pF1KB7 LGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVSPDESDSVKSEDEDGDMK
. :: : :: ::: :::: :.: .:::..:. . .. :: . . :
NP_665 IECFRCSVCSRQLLPGDEFSLREHELLCRADHGLLLERAAAGSPRSPGPLPGAR--GLHL
120 130 140 150 160
200 210 220 230 240 250
pF1KB7 PAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETG
: :.: : :. .. : ::.:. .: ..... . .. .: ..: :. ::
NP_665 PDAGSGRQPALRPHVHKQTEKTTRVRTVLNEKQLHTLRTCYAANPRPDALMKEQLVEMTG
170 180 190 200 210 220
260 270 280 290 300 310
pF1KB7 LSVRVVQVWFQNQRAKMKKLARRHQQQQEQQNSQRLGQEVLSSRMEGMMASYTPLAPPQQ
:: ::..:::::.: : :: . .: :.::.:.. . ..:. . :::. .
NP_665 LSPRVIRVWFQNKRCKDKKKSILMKQLQQQQHSDK-------TSLQGLTG--TPLVAGSP
230 240 250 260 270 280
320 330 340 350 360 370
pF1KB7 QIVAMEQSPYGSSDPFQQGLTPPQMPGNDSIFHDIDSDT--SLTSLSDC-FLGSS---DV
. :.. ::. : : . .. .. :.:. . .:.:.:. ::.: ::
NP_665 --IRHENAVQGSAVEVQTYQPPWKALSEFALQSDLDQPAFQQLVSFSESGSLGNSSGSDV
290 300 310 320 330
380 390
pF1KB7 GSLQARVGNPIDRLYSMQSSYFAS
::....
NP_665 TSLSSQLPDTPNSMVPSPVET
340 350
>>XP_016877994 (OMIM: 609481) PREDICTED: insulin gene en (534 aa)
initn: 423 init1: 307 opt: 618 Z-score: 408.3 bits: 84.8 E(85289): 5.3e-16
Smith-Waterman score: 623; 34.1% identity (58.5% similar) in 337 aa overlap (51-381:22-345)
30 40 50 60 70
pF1KB7 CAKMLDGIKMEEHALRPGPATLGVLLGSDCPHPAVCEGCQRPISDRFLMRVNES-SWHEE
: :.: :: : :.:..::. . ::
XP_016 MVDIIFHYPFLGAMGDHSKKKPGTAMCVGCGSQIHDQFILRVSPDLEWHAA
10 20 30 40 50
80 90 100 110 120 130
pF1KB7 CLQCAACQQAL--TTSCYFRDRKLYCKQDYQQLFAAKCSGCMEKIAPTEFVMRALECVYH
::.:: :.: : : .:. :: : :::.:: .::. ::. :. .. ...:::: . :::
XP_016 CLKCAECSQYLDETCTCFVRDGKTYCKRDYVRLFGIKCAKCQVGFSSSDLVMRARDSVYH
60 70 80 90 100 110
140 150 160 170 180 190
pF1KB7 LGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVSPDESDSVKSEDEDGDMK
. :: : :: ::: :::: :.: .:::..:. . .. :: . . :
XP_016 IECFRCSVCSRQLLPGDEFSLREHELLCRADHGLLLERAAAGSPRSPGPLPGAR--GLHL
120 130 140 150 160
200 210 220 230 240 250
pF1KB7 PAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETG
: :.: : :. .. : ::.:. .: ..... . .. .: ..: :. ::
XP_016 PDAGSGRQPALRPHVHKQTEKTTRVRTVLNEKQLHTLRTCYAANPRPDALMKEQLVEMTG
170 180 190 200 210 220
260 270 280 290 300 310
pF1KB7 LSVRVVQVWFQNQRAKMKKLARRHQQQQEQQNSQRLGQEVLSSRMEGMMASYTPLAPPQQ
:: ::..:::::.: : :: . .: :.::.:.. . ..:. . :::. .
XP_016 LSPRVIRVWFQNKRCKDKKKSILMKQLQQQQHSDK-------TSLQGLTG--TPLVAGSP
230 240 250 260 270 280
320 330 340 350 360 370
pF1KB7 QIVAMEQSPYGSSDPFQQGLTPPQMPGNDSIFHDIDSDT--SLTSLSDCFLGSSDVGSLQ
. :.. ::. : : . .. .. :.:. . .: :: . :. :
XP_016 --IRHENAVQGSAVEVQTYQPPWKALSEFALQSDLDQPAFQQLGLLSAGGRSPRDTFRLG
290 300 310 320 330
380 390
pF1KB7 AR-VGNPIDRLYSMQSSYFAS
:: : :
XP_016 ARGPGAPARVPLHLPLPGVSAAPGLLLRVRLPRQLLRQRRDLPVLAAPGHPQQYGAESRG
340 350 360 370 380 390
>>NP_002193 (OMIM: 600366) insulin gene enhancer protein (349 aa)
initn: 466 init1: 288 opt: 562 Z-score: 375.1 bits: 78.1 E(85289): 3.8e-14
Smith-Waterman score: 600; 32.3% identity (59.4% similar) in 347 aa overlap (54-391:15-345)
30 40 50 60 70 80
pF1KB7 MLDGIKMEEHALRPGPATLGVLLGSDCPHPAVCEGCQRPISDRFLMRVNES-SWHEECLQ
..: :: : :....::. . :: ::.
NP_002 MGDMGDPPKKKRLISLCVGCGNQIHDQYILRVSPDLEWHAACLK
10 20 30 40
90 100 110 120 130 140
pF1KB7 CAACQQALTTSC--YFRDRKLYCKQDYQQLFAAKCSGCMEKIAPTEFVMRALECVYHLGC
:: :.: : :: . :: : :::.:: .:.. ::. : .. ..::::: :::. :
NP_002 CAECNQYLDESCTCFVRDGKTYCKRDYIRLYGIKCAKCSIGFSKNDFVMRARSKVYHIEC
50 60 70 80 90 100
150 160 170 180 190 200
pF1KB7 FCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSSVSPDESDSVKSEDEDGDMKPAK
: : .: ::: ::::.:.: :.:..:. :.. .: .: .. .. :
NP_002 FRCVACSRQLIPGDEFALREDGLFCRADH----DVVERASLGAGDPLSPLHPARPLQMAA
110 120 130 140 150 160
210 220 230 240 250
pF1KB7 GQGS--QSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGL
: : :.:.. : ::.:. .: ..... . .. .: ..: :. :::
NP_002 EPISARQPALRPHVHKQPEKTTRVRTVLNEKQLHTLRTCYAANPRPDALMKEQLVEMTGL
170 180 190 200 210 220
260 270 280 290 300 310
pF1KB7 SVRVVQVWFQNQRAKMKKLARRHQQQQEQQNSQRLGQEVLSSRMEGMMASYTPLAPPQQQ
: ::..:::::.: : :: . .: :.:: ... . ..:: .. : :...
NP_002 SPRVIRVWFQNKRCKDKKRSIMMKQLQQQQPNDK-------TNIQGMTGTPMVAASPERH
230 240 250 260 270
320 330 340 350 360 370
pF1KB7 IVAMEQSPYGSSDPFQQGLTPPQMPGNDSIFH-DIDSDT--SLTSLSDCFLGSSDVGSLQ
... .: :. :: .: .. :::. . .:...:. ::...::
NP_002 DGGLQANPVEV-----QSYQPPWKVLSDFALQSDIDQPAFQQLVNFSEGGPGSNSTGSEV
280 290 300 310 320
380 390
pF1KB7 ARVGNPI-DRLYSMQSSYFAS
: ... . : :: .:
NP_002 ASMSSQLPDTPNSMVASPIEA
330 340
395 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 22:19:08 2016 done: Sat Nov 5 22:19:10 2016
Total Scan time: 9.960 Total Display time: 0.040
Function used was FASTA [36.3.4 Apr, 2011]