GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:10:01 Sequence gi568815584f:20791687_20992166 : 200480 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 1125 1120 6 1.05 1.01 Sngl - 10382 9912 471 2 0 121 45 630 0.999 56.12 1.00 Prom - 28720 28681 40 -0.66 2.08 PlyA - 29665 29660 6 1.05 2.07 Term - 41030 40797 234 2 0 81 41 93 0.471 0.22 2.06 Intr - 42701 42580 122 0 2 46 115 57 0.730 4.41 2.05 Intr - 43721 43650 72 0 0 106 47 54 0.518 2.48 2.04 Intr - 44691 44463 229 0 1 38 105 71 0.357 1.24 2.03 Intr - 44979 44840 140 1 2 66 19 100 0.307 1.08 2.02 Intr - 50661 50509 153 1 0 96 59 60 0.360 3.94 2.01 Init - 58958 58931 28 2 1 76 75 19 0.228 -0.84 2.00 Prom - 60015 59976 40 -7.46 3.00 Prom + 61781 61820 40 -5.26 3.01 Sngl + 69455 70474 1020 1 0 88 43 486 0.998 41.16 3.02 PlyA + 70492 70497 6 1.05 4.00 Prom + 70881 70920 40 -4.96 4.01 Init + 72800 73624 825 1 0 60 44 325 0.204 20.02 4.02 Term + 82098 82226 129 2 0 -14 47 206 0.112 4.48 4.03 PlyA + 83715 83720 6 1.05 5.00 Prom + 89975 90014 40 -4.96 5.01 Init + 91003 91125 123 0 0 62 51 97 0.595 3.57 5.02 Term + 96337 96504 168 0 0 123 51 58 0.873 3.48 5.03 PlyA + 97061 97066 6 1.05 6.00 Prom + 97795 97834 40 -4.76 6.01 Sngl + 100001 100483 483 1 0 92 43 172 0.900 7.21 6.02 PlyA + 100641 100646 6 1.05 7.00 Prom + 101231 101270 40 -7.26 7.01 Init + 101405 101476 72 1 0 75 113 84 0.905 10.67 7.02 Term + 101862 101921 60 2 0 128 32 4 0.621 -3.40 7.03 PlyA + 102468 102473 6 1.05 8.00 Prom + 104932 104971 40 -1.06 8.01 Init + 109671 109914 244 2 1 49 49 129 0.170 3.00 8.02 Intr + 122226 122473 248 1 2 43 47 205 0.020 9.08 8.03 Intr + 123170 124221 1052 1 2 13 11 441 0.038 18.11 8.04 Term + 128228 128435 208 2 1 66 43 117 0.579 1.81 8.05 PlyA + 128593 128598 6 1.05 9.00 Prom + 133456 133495 40 -0.86 9.01 Sngl + 164086 164571 486 0 0 92 43 152 0.909 5.18 9.02 PlyA + 164729 164734 6 1.05 10.03 PlyA - 165070 165065 6 1.05 10.02 Term - 176527 176440 88 0 1 69 49 78 0.407 -0.87 10.01 Init - 177851 177784 68 2 2 69 100 51 0.301 4.96 10.00 Prom - 183879 183840 40 -3.26 11.00 Prom + 188278 188317 40 -1.86 11.01 Init + 198317 198391 75 1 0 113 100 78 0.996 10.60 11.02 Intr + 198544 198697 154 0 1 106 97 137 0.950 16.05 11.03 Term + 199765 199823 59 2 2 92 35 61 0.873 -0.95 11.04 PlyA + 200213 200218 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:20791687_20992166|GENSCAN_predicted_peptide_1|156_aa MALEKSLVRLLLLVLILLVLGWVQPSLGKESRAKKFQRQHMDSDSSPSSSSTYCNQMMRR RNMTQGRCKPVNTFVHEPLVDVQNVCFQEKVTCKNGQGNCYKSNSSMHITDCRLTNGSRY PNCAYRTSPKERHIIVACEGSPYVPVHFDASVEDST >gi568815584f:20791687_20992166|GENSCAN_predicted_CDS_1|471_bp atggctctggagaagtctcttgtccggctccttctgcttgtcctgatactgctggtgctg ggctgggtccagccttccctgggcaaggaatcccgggccaagaaattccagcggcagcat atggactcagacagttcccccagcagcagctccacctactgtaaccaaatgatgaggcgc cggaatatgacacaggggcggtgcaaaccagtgaacacctttgtgcacgagcccctggta gatgtccagaatgtctgtttccaggaaaaggtcacctgcaagaacgggcagggcaactgc tacaagagcaactccagcatgcacatcacagactgccgcctgacaaacggctccaggtac cccaactgtgcataccggaccagcccgaaggagagacacatcattgtggcctgtgaaggg agcccatatgtgccagtccactttgatgcttctgtggaggactctacctaa >gi568815584f:20791687_20992166|GENSCAN_predicted_peptide_2|325_aa MVLETKILVAPVDFYFSHASNPGLSGPTEPGPPRLQAIDVLLDLSGKFFPSTLHNSYSSL GPVPRHKAICVSSVHQQGSRSRKTASWKTPTLVGRHVPLKIEKEAVWHVAPHRVVLSVGS LSGFKRIQEPYILVPKPGRGSGLHPPWTYPSTPESRPQQPDKGSSSASSQLSVRAHRSLI SPTAAIFPDGITSKRTHFPSYTPPPFSGALQAHGAAAPHGGLLAIHLHLVPISSVAMKAT GPDNAQTQKALTSAPVLTCPDTDLTKPFSLYTDEWHGVALGVLTQPKGPTLQVVAISLNS LKPQFLDGLPVSKHWWQLLSSPLKA >gi568815584f:20791687_20992166|GENSCAN_predicted_CDS_2|978_bp atggtgttagagaccaagatcttggtagcccctgtagacttctacttcagccacgcaagc aatccagggctttcagggcccactgagccaggtcctcctcgtttacaggccattgacgtg ctactggatttgtcaggaaagttctttccctcaacattacataactcctactcatctttg ggcccagtcccaaggcacaaggctatttgtgtcagcagcgtgcaccagcaaggtagccga agcaggaagacagccagctggaagacccctaccctggtgggaagacacgtacccctgaag attgaaaaagaggccgtctggcatgttgctccacaccgcgtcgtgttgtctgttggctcc ctctcggggttcaaacggatacaagaaccttacattttggtgccaaaacccgggagaggc tcaggtctgcatcccccgtggacctacccctccaccccggagagcaggccacagcagccg gacaaaggaagctcctcagcctccagtcagctctctgtgcgtgcacaccggtcactgatc tcgcctactgctgccatcttcccagatggcatcacctccaaacgtacccacttcccttcc tacaccccgccccccttctcaggggccctgcaggcccacggggctgcagctccacatgga ggcctcctagcaatccacctccacctcgtgcctatttcaagtgtggcaatgaaagccacc ggtccagacaatgcccaaacccagaaggctctcacctcagcccccgtcctcacttgccca gacacagacctcaccaaacctttttccctctacaccgatgaatggcatggagttgcactg ggtgttctaacccagcctaaaggacccaccctccaggttgttgccatctctctaaacagc ttgaagccacagttcttggatggcctgcctgtctccaagcattggtggcagctgctgtcc tcacccttaaaagcctaa >gi568815584f:20791687_20992166|GENSCAN_predicted_peptide_3|339_aa MGRNQSRKAENSKNQSASSPPEEHSSSPATEQSWTENHFDKLREEGFGQSVITNFSEPKE DVRTHRKEAKNLEKRLDKWLIRINSIEKTLNDLMELKTMARELRDTCKSFSSQFDQLEER VSVMEDQMNEMKREEKFREKRVKRNEQSLQKIWDYVKRPNLHLISVPESDGENATKLENT LQDIIQENFPNLARQAKIQIQEIQRTPQRYSLRRATLRHTIVGFTKVEMKEKMLRAAREK GRVTHKGKPIRLTADLSAETLQAKESGGQYSTFLKNFQPRISYPAKPSFISEGEKKSFTD KQMLRDFVTTRPALQELLKEALNMERNNRYQPLQKHAKL >gi568815584f:20791687_20992166|GENSCAN_predicted_CDS_3|1020_bp atggggagaaaccagagcagaaaagctgaaaattctaaaaatcagagtgcctcttctcct ccagaggagcacagctcctcgccagcaacagaacaaagctggacggagaatcactttgac aagttgagagaagaaggcttcggacaatcggtaataacaaacttctccgagccaaaggag gatgttcgaacccatcgcaaagaagctaaaaaccttgaaaaaagattagacaaatggcta attagaataaacagcatagagaagaccttaaatgacctgatggagctgaaaaccatggca cgagaactacgtgacacatgcaaaagcttcagtagccaattcgatcaactggaagaaagg gtatcagtgatggaagatcaaatgaatgaaatgaagcgagaagagaagtttagagaaaaa agagtaaaaagaaatgaacaaagcctccaaaaaatatgggactatgtgaaaagaccaaat ctacatctgattagtgtacctgaaagtgacggggagaatgcaaccaagttggaaaacact ctgcaggatattatccaggagaacttccccaacctagcaaggcaggccaagattcaaatt caggaaatacagagaacgccacaaagatactccttgagaagagcaactctaagacacaca attgtcggattcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaa ggtcgggttacccacaaagggaagcccatcagactaacagctgatctctcggcagaaact ctacaagccaaagagagtgggggccaatattcaacattcttaaagaattttcaacccaga atttcatacccagccaaaccaagcttcataagtgaaggagaaaaaaaatcctttacagac aagcaaatgctgagagattttgtcaccaccaggcctgctttacaagagctcctgaaggaa gcactaaacatggaaaggaacaaccggtaccagccactgcaaaaacatgccaaattgtaa >gi568815584f:20791687_20992166|GENSCAN_predicted_peptide_4|317_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIV KMAILPKVIYRFNAIHIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQRNKAGGITLP DLKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIIPHIYSHLIFDKPDKNKQWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKD FMSKTPKAMATKAKIDKWDLIKLKSSCTAKETTIREACGAAPPADQYDGHGNWETTKALG LRSCPSSSGFSGNGFFR >gi568815584f:20791687_20992166|GENSCAN_predicted_CDS_4|954_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaattacaaaccactgctcaatgaaataaaa gaggacacaaacaaatggaagaacattccatgctcatggataggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttatagattcaatgccatccatatcaagctgcca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcatcgccaagtcaatcctaagccaaaggaacaaagctggaggcatcacgctacct gacttgaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagaccaatggaacagaacagagccctcagaaataataccacacatctacagc catctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagttgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttaaatgtcagacctaaaacc ataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactcaagagctcctgcacagcaaaagaaactaccatcagagaagcttgtggcgcg gcgccccctgccgaccaatacgacggccacgggaactgggagaccacgaaggctctgggc ctgcggagttgcccaagctcctccggcttctccggaaacggctttttccggtga >gi568815584f:20791687_20992166|GENSCAN_predicted_peptide_5|96_aa MSYSKCKASLMTTVDGVEESGEFIEEISTLLDLEKENLVLEIGKELLIVEEKEERLSPFL FHARNLIKGVRSTYSMVFRGDMQEDPHKKEGLGPPF >gi568815584f:20791687_20992166|GENSCAN_predicted_CDS_5|291_bp atgtcttactccaagtgcaaagcttccctcatgaccactgttgatggggtggaagagtct ggggaattcattgaagagatatccacattactggatctagagaaagaaaatcttgtcctg gagatagggaaagaacttcttatagttgaggaaaaagaagaaaggcttagtcctttcctc tttcatgccagaaacttgatcaaaggtgtcagatctacttattcaatggttttcagagga gacatgcaggaggacccacacaagaaggaaggcctagggccacctttctga >gi568815584f:20791687_20992166|GENSCAN_predicted_peptide_6|160_aa MVPKLFTSQICLLLLLGLMGVEGSLHARPPQFTRAQWFAIQHISLNPPRCTIAMRAINNY RWRCKNQNTFLRTTFANVVNVCGNQSIRCPHNRTLNNCHRSRFRVPLLHCDLINPGAQNI SNCTYADRPGRRFYVVACDNRDPRDSPRYPVVPVHLDTTI >gi568815584f:20791687_20992166|GENSCAN_predicted_CDS_6|483_bp atggttccaaaactgttcacttcccaaatttgtctgcttcttctgttggggcttatgggt gtggagggctcactccatgccagacccccacagtttacgagggctcagtggtttgccatc cagcacatcagtctgaacccccctcgatgcaccattgcaatgcgggcaattaacaattat cgatggcgttgcaaaaaccaaaatacttttcttcgtacaacttttgctaatgtagttaat gtttgtggtaaccaaagtatacgctgccctcataacagaactctcaacaattgtcatcgg agtagattccgggtgcctttactccactgtgacctcataaatccaggtgcacagaatatt tcaaactgcacgtatgcagacagaccaggaaggaggttctatgtagttgcatgtgacaac agagatccacgggattctccacggtatcctgtggttccagttcacctggataccaccatc taa >gi568815584f:20791687_20992166|GENSCAN_predicted_peptide_7|43_aa MGSRWSRGKEKGEEEEEEVGALMKTLRNSNFHSSDDSMSVMVL >gi568815584f:20791687_20992166|GENSCAN_predicted_CDS_7|132_bp atggggagccgctggtcacgtgggaaggagaagggggaggaggaggaggaggaagtggga gcactcatgaagaccctgagaaacagcaatttccattcctcagatgactccatgtcagta atggtgctttaa >gi568815584f:20791687_20992166|GENSCAN_predicted_peptide_8|583_aa MYHNLNNQISRELTHYQENSTKGMALIHEKSSPKIQSLPTKPASNTGNYIPISHMDGANI QTVPVAHNDLELLASSDLPASEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQE KVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELLISNFSKVSGYKINVQ KSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKLLLNETKEDT KKWKNIPCSWVGRINIVKMAILPKVIYTFNAILIKLPMTFFTELEKNTLKFIWNQKRARI AKSILSQRNKAGGIMLLDFKLYYKATVTKTAWYWYQKRDIDQWNRTEPSEITPHIYNYLI FDKPEKNKQRGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKT LEENLGITIQDIGMGKVFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTKW EKIFATYSSDKGLISRIYNELRQICKRETNNPIKKTLNNCHHSGVQVPLMYCNLTTPSPQ NISNCRYAQTPANMFYIVACDNRDQRRDPPQYPVVPVHLDTII >gi568815584f:20791687_20992166|GENSCAN_predicted_CDS_8|1752_bp atgtaccacaatttgaacaaccagatctcacgagaactcactcactatcaggagaacagc accaagggaatggcgctgattcacgagaaatccagccccaagatccaatcacttcccacc aagcctgcctccaacactgggaattatattccaatatcacatatggatggggcaaatatc caaactgtcccagttgcccacaatgatcttgagctcttggcctcaagtgatcttcctgcc tcagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaactagaa aatctagaagaaatggataaattcctcgacacatacactctcccaagactaaaccaggaa aaagttgaatctctgaatagaccaataacaggttctgaaattgtggcaataatcaatagc ttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccagaggtac aaggaggaactgctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaa aaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaa ctcccattcacaattgcttcaaagagaataaaatacttaggaatccaacttacaagggac gtgaaggacctcttcaaggagaactacaaactgctgctcaatgaaacaaaagaggataca aagaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggcc atactgcccaaggtaatttatacattcaatgccatcctcatcaagctgccaatgactttc ttcacagaattggaaaaaaatactttaaagttcatatggaaccaaaaaagagcccgcatc gccaagtcaatcctaagccaaaggaacaaagctggaggcatcatgctacttgacttcaaa ctatactacaaggctacagtaacaaaaacagcatggtactggtaccaaaagagagatata gatcaatggaacagaacagagccctcagaaataacaccgcatatctacaactatctgatc tttgacaaacctgagaaaaacaagcaacggggaaaggattccctatttaataaatggtgt tgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacaccttat acaaaaattaattcaagatggattaaagacttaaacgttagacctaaaaccataaaaacc ctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggtcttcatgtct aaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactc aagagcttctgcacagcaaaagaaactaccatcagagtcaacaggcaacctacaaaatgg gagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaa ctcagacaaatttgcaagagagaaacaaacaaccccatcaaaaaaactctcaacaattgt catcatagtggagtccaggtgcctttaatgtactgtaacctcacaactccaagtccacag aatatttcaaactgcaggtatgcgcagacaccagcaaacatgttctatatagttgcatgt gacaacagggatcaacgacgggaccctccacagtatccagtggttccagttcacctggat accatcatctaa >gi568815584f:20791687_20992166|GENSCAN_predicted_peptide_9|161_aa MVPKLFTSQICLLLLLGLLAVEGSLHVKPPQFTWAQWFETQHINMTSQQCTNAMQVINNY QRRCKNQNTFLLTTFANVVNVCGNPNMTCPSNKTRKNCHHSGSQVPLIHCNLTTPSPQNI SNCRYAQTPANMFYIVACDNRDQRRDPPQYPVVPVHLDRII >gi568815584f:20791687_20992166|GENSCAN_predicted_CDS_9|486_bp atggttccaaaactgttcacttcccaaatttgtctgcttcttctgttggggcttctggct gtggagggctcactccatgtcaaacctccacagtttacctgggctcaatggtttgaaacc cagcacatcaatatgacctcccagcaatgcaccaatgcaatgcaggtcattaacaattat caacggcgatgcaaaaaccaaaatactttccttcttacaacttttgctaacgtagttaat gtttgtggtaacccaaatatgacctgtcctagtaacaaaactcgcaaaaattgtcaccac agtggaagccaggtgcctttaatccactgtaacctcacaactccaagtccacagaatatt tcaaactgcaggtatgcgcagacaccagcaaacatgttctatatagttgcatgtgacaac agagatcaacgacgagaccctccacagtatccggtggttccagttcacctggatagaatc atctaa >gi568815584f:20791687_20992166|GENSCAN_predicted_peptide_10|51_aa MIHKPGAMTIWKGQAETGEAQRRLFTSQVETLIISPNPLIYQFLPRNSYLS >gi568815584f:20791687_20992166|GENSCAN_predicted_CDS_10|156_bp atgattcacaagcctggggccatgaccatctggaaaggccaagcagagactggtgaggcc cagcgaagactttttacatcccaagtggaaacactgatcatctccccaaatcccctcatc taccagttccttcctcggaattcctatctcagttga >gi568815584f:20791687_20992166|GENSCAN_predicted_peptide_11|95_aa MAAALKCLLTLGRWCPGLGVAPQARALAALVPGVTQVDNKSGFLQKRPHRQHPGILKLPH VRLPQALANGAQLLLLGIVEIIKTLNFGEVKKSKK >gi568815584f:20791687_20992166|GENSCAN_predicted_CDS_11|288_bp atggcggcggcactgaagtgtctactgacattaggaagatggtgccccggccttggagtg gctccccaggcccgggcgctcgccgccttagtacccggagtgacccaggtagataacaag tccggtttcctgcagaagaggcctcatcgccagcaccctggcatcctaaagctgccgcac gtgcggctgccacaggcactggctaacggtgcccagttattgctacttggaatagtggag atcatcaaaaccttaaattttggagaagtgaagaaatccaaaaaataa