GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:47:29 Sequence gi568815578r:13614926_13883140 : 268215 bp : 40.48% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1713 1708 6 1.05 1.05 Term - 7978 7874 105 1 0 71 33 65 0.669 -3.27 1.04 Intr - 8589 8521 69 0 0 80 53 128 0.936 6.96 1.03 Intr - 10327 10260 68 2 2 113 86 42 0.906 4.21 1.02 Intr - 13662 13552 111 1 0 37 78 85 0.683 1.83 1.01 Init - 15147 15009 139 0 1 93 115 136 0.794 17.16 1.00 Prom - 16189 16150 40 -5.55 2.03 PlyA - 16266 16261 6 1.05 2.02 Term - 19442 19406 37 1 1 108 33 20 0.016 -5.67 2.01 Init - 23545 23340 206 1 2 41 93 230 0.043 17.17 2.00 Prom - 25598 25559 40 -7.75 3.03 PlyA - 25982 25977 6 1.05 3.02 Term - 26934 26764 171 0 0 38 50 179 0.963 5.94 3.01 Init - 35801 35679 123 2 0 82 63 109 0.936 8.02 3.00 Prom - 38071 38032 40 -4.35 4.00 Prom + 42036 42075 40 -6.75 4.01 Init + 64334 64495 162 1 0 77 59 105 0.790 6.28 4.02 Term + 65637 65858 222 2 0 56 49 150 0.764 3.83 4.03 PlyA + 69515 69520 6 1.05 5.03 PlyA - 70278 70273 6 1.05 5.02 Term - 74797 74616 182 2 2 74 43 103 0.820 1.19 5.01 Init - 78517 78433 85 1 1 111 65 79 0.744 8.94 5.00 Prom - 92476 92437 40 -4.55 6.14 PlyA - 92990 92985 6 1.05 6.13 Term - 100242 99949 294 0 0 98 36 227 0.980 12.82 6.12 Intr - 102589 102443 147 1 0 98 76 243 0.997 23.61 6.11 Intr - 104059 103983 77 2 2 77 93 75 0.990 5.22 6.10 Intr - 113540 113405 136 2 1 41 100 135 0.544 9.22 6.09 Intr - 118917 118796 122 1 2 88 64 79 0.371 4.79 6.08 Intr - 144928 144767 162 2 0 66 110 224 0.989 21.43 6.07 Intr - 151999 151852 148 1 1 38 67 155 0.948 7.29 6.06 Intr - 155096 154982 115 1 1 109 20 110 0.994 5.83 6.05 Intr - 156558 156406 153 2 0 68 84 127 0.994 8.57 6.04 Intr - 157690 157590 101 1 2 79 27 134 0.994 4.39 6.03 Intr - 160345 160232 114 1 0 63 100 56 0.940 4.02 6.02 Intr - 161345 160948 398 0 2 77 76 729 0.512 63.97 6.01 Init - 168194 167579 616 2 1 54 96 382 0.657 31.14 6.00 Prom - 169256 169217 40 -11.74 7.00 Prom + 169258 169297 40 -10.45 7.01 Init + 170144 170365 222 1 0 60 83 289 0.984 22.20 7.02 Intr + 172387 172427 41 0 2 111 111 58 0.979 6.50 7.03 Intr + 173664 173727 64 0 1 95 71 29 0.820 -0.30 7.04 Intr + 186561 186758 198 2 0 68 38 173 0.484 8.93 7.05 Intr + 193917 193977 61 2 1 96 116 81 0.516 9.09 7.06 Intr + 201950 202032 83 0 2 67 97 61 0.641 3.34 7.07 Term + 202920 203015 96 2 0 76 45 98 0.890 1.29 7.08 PlyA + 205028 205033 6 1.05 8.04 PlyA - 205049 205044 6 1.05 8.03 Term - 206445 206341 105 0 0 78 34 88 0.670 -0.17 8.02 Intr - 207478 207375 104 2 2 70 119 55 0.830 5.77 8.01 Init - 212425 212086 340 1 1 89 97 117 0.834 10.23 8.00 Prom - 217176 217137 40 -2.45 9.00 Prom + 217372 217411 40 -8.25 9.01 Init + 225833 225851 19 1 1 99 91 12 0.248 2.85 9.02 Term + 233319 233722 404 1 2 -241 52 841 0.823 41.93 9.03 PlyA + 233858 233863 6 -4.33 10.12 PlyA - 234291 234286 6 1.05 10.11 Term - 234575 234495 81 2 0 52 41 60 0.271 -5.59 10.10 Intr - 235394 235266 129 2 0 93 101 113 0.869 13.07 10.09 Intr - 239522 239376 147 2 0 11 81 89 0.234 0.01 10.08 Intr - 244509 244337 173 1 2 61 86 144 0.917 10.24 10.07 Intr - 250316 250242 75 0 0 89 86 97 0.799 8.17 10.06 Intr - 250589 250424 166 2 1 69 57 101 0.840 3.71 10.05 Intr - 251925 251777 149 1 2 41 84 68 0.479 0.73 10.04 Intr - 254665 254578 88 1 1 61 93 92 0.736 5.62 10.03 Intr - 255278 255216 63 2 0 51 119 32 0.460 0.60 10.02 Intr - 261190 261113 78 1 0 94 95 74 0.765 7.53 10.01 Intr - 262663 262595 69 1 0 114 83 -2 0.255 0.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 19451 19457 7 1 1 68 97 0 0.881 0.12 S.002 Intr + 23362 23515 154 2 1 133 86 70 0.846 9.51 S.003 Term + 23598 23745 148 0 1 49 54 193 0.899 8.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:13614926_13883140|GENSCAN_predicted_peptide_1|163_aa MEKGMSSGEGLPSRSSQVSAGKITAKELETKQSYKEKRGGFVLVHADWTTREALCESNPE YLNKSKKPETRFNPIFSQVLLQGGAGYHSESKAKEYKHVCKRACQKAIEKLQAGALATDA VTAALVELESEVLSVLLIWNLDEFVQGILRSSEDRLAAQTWRR >gi568815578r:13614926_13883140|GENSCAN_predicted_CDS_1|492_bp atggagaaggggatgagttctggagaagggctgccttccagatcatctcaggtttcggct ggtaaaataacagccaaagagttggaaacaaagcagtcctataaagagaaacgaggaggc tttgtgttggtgcatgcagactggacaacccgagaagccttatgtgaatctaatccagag tatctaaacaaatccaaaaaaccagaaaccaggtttaatcccattttctcacaggtgctt ttacaaggaggtgcaggttatcattctgaatccaaagccaaggagtataaacatgtatgc aaacgagcttgtcagaaggcaattgaaaagctgcaggccggtgctcttgcaactgacgca gtcactgcagcactggtggaacttgagagtgaggttctctcagttctcttgatttggaac cttgatgaatttgtgcaggggatattgaggtcttcagaagatcgacttgctgctcaaact tggagaagatag >gi568815578r:13614926_13883140|GENSCAN_predicted_peptide_2|80_aa MRVLRVVSTYYSGGGGGGRRGGGKDREDGAAGEEGGKAGARDVLEIEGIVFPEGESSTFL ASVGEGKESSKQSQIYRLSL >gi568815578r:13614926_13883140|GENSCAN_predicted_CDS_2|243_bp atgagagtcctgcgggtcgttagcacttactactcagggggaggaggaggaggaagaaga ggaggaggaaaggaccgagaagatggagcagctggagaggagggggggaaagcaggagct agagatgtcctggaaatcgagggtatcgtgtttccggaaggagagagttcaactttcctg gcttctgtaggggaggggaaggaaagctctaagcaatcacagatctaccgtctgtctctt tag >gi568815578r:13614926_13883140|GENSCAN_predicted_peptide_3|97_aa MVQVTVVIGNKREELLERDKETDALWPRLSKESIVNVLQVPVEKSERTAPPDLEEDKSGQ LYSCKDLNSANHLNELGESLWASDEMAALTDTLISAF >gi568815578r:13614926_13883140|GENSCAN_predicted_CDS_3|294_bp atggtccaagtgactgttgttattgggaacaaaagggaagagctgctggagagagacaag gaaacagatgccttatggcctaggctctctaaggaatctattgttaatgttttacaagtg ccagtggaaaagtcagagaggactgctcctcctgaccttgaagaagacaaaagtggccag ctctacagctgcaaggatttgaattctgccaatcacctgaatgagcttggagaatcgctc tgggcatccgatgagatggcagctctgactgacaccttgatttcagccttctga >gi568815578r:13614926_13883140|GENSCAN_predicted_peptide_4|127_aa MKACSGFPGIENSIAPLRRLKGMTENQEYSCFTSFKKEEGNGGSACFAEGWWWEQRDWPS WQLYADLKLRRDTRVIHVLRITAATGSYFIQGHPFLKQPISNDLDAGVQKPNTSAGLGTT LKAHSGF >gi568815578r:13614926_13883140|GENSCAN_predicted_CDS_4|384_bp atgaaagcctgttcaggttttccagggatagaaaacagcatagctccactgaggagactt aaagggatgacagagaaccaggaatacagctgctttacaagttttaaaaaagaagaaggt aatggtggtagtgcgtgttttgcagagggctggtggtgggagcaaagagattggccaagt tggcagctgtatgcagatctgaagctcagaagagataccagagtcattcatgtattgagg atcactgcagctacagggagctatttcattcagggtcacccatttctgaagcagcccata tccaatgacttagatgctggagtacaaaagcccaacacctcagcgggactggggacaacc ctgaaggcccattctggcttctga >gi568815578r:13614926_13883140|GENSCAN_predicted_peptide_5|88_aa MDQGLPRGIAGVWGRLAATIYGTQADCVAYTSLTSSSRFLFGNLCASGDADSSSNSRDET CDLDPSSHHNPSHPPPPLAIVIDAETGM >gi568815578r:13614926_13883140|GENSCAN_predicted_CDS_5|267_bp atggaccagggacttcctcgtggaattgctggtgtgtggggcaggcttgctgctaccatt tatggaacccaggcagactgtgtagcatacacttctcttaccagtagttctaggtttctt tttgggaatctgtgtgcatctggcgatgctgactccagctctaactctagggatgaaaca tgtgacctagatccatccagtcatcataatccatcccatcccccacccccactggccata gtgattgatgcagagacaggcatgtga >gi568815578r:13614926_13883140|GENSCAN_predicted_peptide_6|860_aa MSDQRFRRVAKDPRFWEMPEKDRKVKIDKRFRAMFHDKKFKLNYAVDKRGRPISHSTTED LKRFYDLSDSDSNLSGEDSKALSQKKIKKKKTQTKKEIDSKNLVEKKKETKKANHKGSEN KTDLDNSIGIKKMKTSCKFKIDSNISPKKDSKEFTQKNKKEKKNIVQHTTDSSLEEKQRT LDSGTSEIVKSPRIECSKTRREMQSVVQLIMTRDSDGYENSTDGEMCDKDALEEDSESVS EIGSDEESENEITSVGRASGDDDGSEDDEEEDEDEEEDEDEDSEDDDKSDSGPDLARGKG NIETSSEDEDDTADLFPEESGFEHAWRELDKDAPRADEITRRLAVCNMDWDRLKAKDLLA LFNSFKPKGGVIFSVKIYPSEFGKERMKEEQVQGPVELLSIPEDAPEKDWTSREKLRDYQ FKRLKYYYAVVDCDSPETASKIYEDCDGLEFESSCSFIDLRFIPDDITFDDEPKDVASEV NLTAYKPKYFTSAAMGTSTVEITWDETDHERITMLNRKFKKEELLDMDFQAYLASSSEDE EEIEEELQGDDGVNVEEDGKTKKSQKDDEEQIAKYRQLLQVIQEKEKKGKENDMEMEIKW VPGLKESAEEMVKNKLEGKDKLTPWEQFLEKKKEKKRLKRKQKALAEEASEEELPSDVDL NDPYFAEEVKQIGKPAHICSFQLESKEKGINKKSVKSAKDGTSPEEEIEIERQKAEMALL MMDEDEDSKKHFNYNKIVEHQNLSKKKKKQLMKKKELIEDDFEVNVNDARFQAMYTSHLF NLDPSDPNFKKTKAMEKILEEKARQRERKEQELTQAIKKKESEIEKESQRKSIDPALSML IKSIKTKTEQFQARKKQKVK >gi568815578r:13614926_13883140|GENSCAN_predicted_CDS_6|2583_bp atgagtgaccagcggtttagacgggttgcaaaggacccgagattttgggaaatgccagaa aaggatcgaaaagtcaaaattgacaagagatttcgagccatgtttcatgacaagaagttc aagttgaactatgccgtggataaaagagggcgccccattagccatagcactacagaggat ttgaagcgtttttacgacctttcagattctgattccaatctctctggtgaagatagcaaa gcattgagtcaaaagaaaataaagaagaaaaaaacccagactaaaaaagaaatcgattca aaaaatctagttgagaaaaagaaagaaaccaagaaggctaatcacaagggttctgaaaat aaaactgatttagataattctataggaattaaaaaaatgaaaacctcatgtaaatttaag atagattcaaacataagtccgaagaaggatagcaaagaatttacacaaaaaaataagaaa gagaaaaaaaacattgttcaacatactacagactcttctctcgaagaaaaacaaaggaca ttagactcaggcacctctgaaattgtgaaatctcccagaatcgagtgttctaagacaaga agagaaatgcaatcagtggttcaactcataatgacaagagacagtgatggttatgaaaac tcaacagatggtgaaatgtgtgacaaagatgctctggaggaagattcagaaagcgttagt gaaataggaagtgatgaggaatctgaaaatgaaattacaagtgttggtagagcttcaggt gatgacgatggaagtgaagatgatgaagaggaggatgaagatgaagaggaggatgaagat gaggatagtgaggatgatgataaaagtgacagtggccctgatcttgcaaggggtaaagga aatatagaaactagttctgaagatgaagatgatacggcagatttgtttccagaagaatct ggttttgagcatgcttggagagaattagataaagatgctcctcgtgctgatgagattaca cgtcgattagcagtttgtaacatggactgggatagattaaaggcaaaagatttgctggct ctgttcaattcatttaaacccaaaggaggtgtaatattttccgtcaagatatatccttca gaatttggaaaggagaggatgaaggaagagcaagttcaaggaccagtagagctattaagt attcctgaagatgccccagaaaaagactggacgtctagagaaaaattgagagattatcaa ttcaaacgactgaagtactattatgcagtagtagactgtgattctccggaaacagctagt aaaatttatgaggattgtgatggcctggaatttgaaagtagttgttctttcatagatcta aggtttataccagatgatattacttttgatgatgagcctaaggatgtagcctcagaagtg aatttaacagcatataaaccaaaatatttcacttctgctgcaatgggaacatcaacggtg gaaatcacttgggatgagactgatcatgaaagaattacaatgctcaacaggaagtttaaa aaggaagagcttttggacatggattttcaagcctacttagcttcctctagtgaagatgaa gaggagatagaagaggagctacaaggtgatgatggagtcaatgtagaagaagatgggaaa acaaagaaaagtcagaaggatgatgaagaacaaattgctaaatacaggcagctcttgcag gttattcaagaaaaagaaaagaaaggcaaagaaaatgatatggaaatggaaattaaatgg gttccaggtcttaaagaaagtgcagaagagatggtcaaaaacaaattggaaggaaaggat aaactgaccccttgggaacaatttttagagaagaagaaagagaaaaaaagactgaaaagg aaacagaaggctcttgctgaagaggccagtgaagaggaacttccctctgatgttgatttg aatgacccatactttgctgaagaagttaaacaaataggtaagccagctcatatctgtagt tttcaactggaatctaaagaaaaaggtataaataaaaaatcggtaaaatctgcaaaagat ggcacatctccagaagaagaaattgaaatagaaagacaaaaggctgaaatggctttgctt atgatggatgaggacgaggacagtaagaaacacttcaattacaacaagattgtggagcac cagaatctgagcaaaaagaagaaaaagcagctcatgaaaaagaaggaattaatagaggat gactttgaggtaaatgttaacgatgcacggtttcaggcaatgtacacttcccacttgttc aatttggacccctcagatcccaatttcaagaaaacaaaagctatggaaaaaatccttgag gagaaggcccggcaaagagaacggaaagaacaagaacttactcaggcaataaagaaaaaa gagagtgagattgaaaaggaatcacaaaggaagtccattgatcctgctttgtcaatgttg attaaatctataaaaaccaaaacagagcagtttcaagcaagaaaaaagcaaaaagtcaaa taa >gi568815578r:13614926_13883140|GENSCAN_predicted_peptide_7|254_aa MLRPAGLWRLCRRPWAARVPAENLGRREVTSGVSPRGSTSPRTLNIFDRDLKRKQKNWAA RQPEPTKFDYLKEEVGSRIADRVYDIPRNFPLALDLGCGRGYIAQYLNKIHYILKPDGVF IGAMFGGDTLYELRCSLQLAETEREGGFSPHISPFTAVNDLGHLLGRAGFNTLTVDTDEI QVNYPGMFELMEDLQEMYRNEDGSVPATYQIYYMIGWKYHESQGLSTIILIQVHSPVVVE EGSKSFGDNQAQTA >gi568815578r:13614926_13883140|GENSCAN_predicted_CDS_7|765_bp atgctgcggccggcagggctctggcgcttatgtcggcgaccttgggcggcgagggtccca gcggagaatcttggccgtagggaagtcacctctggtgtctctccccgcggtagcacctcg cccagaaccctgaatattttcgaccgggatttgaaaaggaaacagaagaactgggcagcc cggcagcccgagccgaccaaatttgactacctgaaggaggaggttggaagtcggatcgca gaccgtgtatatgacatacccagaaatttcccccttgctttggatcttggttgtggaaga ggttacattgcacaatatttgaataagattcattatattttaaaaccagatggagtgttt atcggtgcaatgtttggaggcgacacactctatgaacttcggtgttccttacagttagcg gaaacggaaagggaaggaggattttctccacacatttctcctttcactgctgtcaatgac ctgggacatctgcttgggagagctggctttaatactctgactgtggacactgatgaaatt caagttaactatcctggaatgtttgaattgatggaagatttacaagaaatgtacagaaat gaagatggttcagtacctgctacataccagatctattacatgataggatggaaatatcat gagtcacagggcttaagcactattatcctaatccaagttcacagtcctgttgttgtcgag gagggttcaaaatcatttggagataatcaagctcagacagcttag >gi568815578r:13614926_13883140|GENSCAN_predicted_peptide_8|182_aa MDEVFPDTTCSTTQTALTFLGTRKAHGQTSFLALLIFYFELWLTAHVTLYQCNEGRDPVL FIQVSALHRAFHIGALIKVFLVPRKRTMSASPWSLVGWAVSSVRAVIRFLSSLVFYGRLA SLSLEFSSSQSPTHNLSTNTLSTKTVHLVFSMELSGLTIWLLAFPRVKDPRESKEDAMPF IT >gi568815578r:13614926_13883140|GENSCAN_predicted_CDS_8|549_bp atggatgaagtctttcctgataccacctgttccaccacacagacagcactcactttcctt ggaactcgtaaggctcatggtcagacatctttcctggcacttcttatattctactttgaa ctatggttgactgcccatgtcaccttataccagtgtaatgaaggcagggaccctgtccta tttatccaagtatcagcattgcaccgtgccttccacatcggtgccttaatcaaagtattt ttggtcccaagaaaacgcacaatgtcagcttccccgtggtctctagttggctgggctgtg agctctgtgagggcagtgatcaggttcctctcatctctcgttttttatggcagactggca agtctttctttggaattctcctcttcccaatctcccactcacaatctgagcacaaacact ctgagcacaaaaacagtacacctggtcttctccatggagttgagtggccttacaatatgg ctgctggctttccctagagtgaaggatccaagagagagtaaggaggatgcaatgcctttt attacctag >gi568815578r:13614926_13883140|GENSCAN_predicted_peptide_9|140_aa MEEAEAERKEEEKVREKKRREGEGEGGGEGEGEEEEEEEEEEEEVKYLEKKKKRKKKKRK KKKKRKRKKEEEQEEEEVKYLEKKKKKRKKKKKRRRKKEKEKEKEKEKEKEKKKKKKKKK KKKKKKKKKKKKIQISGAGI >gi568815578r:13614926_13883140|GENSCAN_predicted_CDS_9|423_bp atggaggaggcagaggcagagagaaaagaagaagagaaggttagagaaaagaagagaaga gaaggagaaggagaaggaggaggagaaggagaaggagaagaagaagaagaagaagaagaa gaagaagaagaagtcaaatatctggagaagaagaagaagaggaagaagaagaaaaggaag aagaagaagaagaggaagaggaagaaagaagaagaacaagaagaagaagaagtcaaatat ctggagaagaagaagaagaagaggaagaagaagaagaagaggaggaggaagaaggagaag gagaaggagaaggagaaggagaaggagaaggagaagaagaagaagaagaagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaaaatacaaatatctggagccgggatt tga >gi568815578r:13614926_13883140|GENSCAN_predicted_peptide_10|405_aa KALHYFLKAAKAGSANAMAFIGKMYLEGNAAVPQNNATAFKYFSMAASKGNAIGLHGLGL LYFHGKGVPLNYAEALKYFQKAAEKGWPDAQFQLGFMYYSGSGIWKDYKLAFKYFYLASQ SGQPLAIYYLAKMYATGTGVVRSCRTAVELYKGVCELGHWAEKFLTAYFAYKDGDIDSSL VQYALLAEMGYEVAQSNSAFILESKKANILEKEKMYPMALLLWNRAAIQGNAFARVKIGD YHYYGYGTKKDYQTAATHYSIAANKYHNAQAMFNLAYMYEHGLGITKVGESALPGPVQKY FCTMEYEKHMTTFLHGRNQACLKTMDRKAVLKYGRKDIHLARRLYDMAAQTSPDAHIPVL FAVMKLETTHLLRDILFFNKSPWVGCGSQENLLTGISSLQGIPTK >gi568815578r:13614926_13883140|GENSCAN_predicted_CDS_10|1218_bp aaagcattacactacttcttaaaggcagcaaaggccgggagtgcaaatgccatggcattt ataggaaagatgtatttagaggggaatgctgccgtgccgcaaaataacgctactgccttc aagtacttttccatggcagccagtaagggcaatgcaatcggccttcatgggcttggtctt ctttactttcatggaaaaggagttcccctgaattatgccgaagcacttaaatactttcag aaagctgcggaaaaagggtggcccgacgcacagttccagttaggcttcatgtactactct ggctctggaatatggaaggattataaacttgccttcaaatatttttacctggcatctcag agtgggcagccccttgccatttattatctggccaagatgtatgcaacaggaacaggagta gtaagatcatgcagaactgctgtggagctttataaaggtgtctgtgaactaggccactgg gctgagaaattcctgacagcttactttgcctataaggatggtgatatagattcttctctt gttcagtatgcactgcttgcagaaatggggtatgaagtagctcaaagcaattcagcattc attttggaatctaaaaaggctaacattcttgaaaaagagaagatgtatccaatggcgctt ctcctatggaatcgagctgccattcaaggcaatgcatttgctagagtaaaaattggagat taccattactatggctatgggactaagaaagactatcaaacagcagccacacactacagc attgcagccaacaaataccacaacgcgcaagccatgttcaatctggcttatatgtatgaa cacggcttaggcatcacaaaggtaggggaatcagcactgccaggtcctgttcaaaaatat ttttgtaccatggagtacgagaaacatatgacaacctttttacatggaagaaatcaggcc tgtttaaagacaatggataggaaagctgttctaaagtatggcagaaaggacattcacttg gccagaagattgtacgacatggctgctcaaacgagtccagatgcccacatacctgtgctc tttgccgtcatgaaactggaaactacgcatttgctccgggatatcctgttttttaataaa tcaccatgggtaggatgcggatcacaggaaaacctgctcacgggaatcagttcactccaa ggtatccccactaaataa