GENSCAN 1.0 Date run: 6-Nov-116 Time: 20:18:54 Sequence gi568815586f:110181790_110446467 : 264678 bp : 44.13% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9131 9259 129 1 0 69 19 181 0.653 10.19 1.02 Intr + 10828 10917 90 0 0 50 108 23 0.593 0.69 1.03 Intr + 22075 22161 87 0 0 88 110 51 0.978 7.47 1.04 Intr + 36255 36398 144 2 0 67 91 62 0.634 4.88 1.05 Term + 47124 47186 63 2 0 72 49 40 0.113 -3.61 1.06 PlyA + 47667 47672 6 1.05 2.00 Prom + 50373 50412 40 -3.46 2.01 Init + 62694 62852 159 2 0 56 73 163 0.921 11.64 2.02 Term + 88245 88415 171 2 0 62 50 112 0.451 2.63 2.03 PlyA + 89646 89651 6 1.05 3.00 Prom + 91185 91224 40 -5.16 3.01 Init + 100001 100118 118 1 1 98 90 210 0.988 22.56 3.02 Intr + 100924 101006 83 2 2 110 78 53 0.934 5.86 3.03 Intr + 110231 110335 105 1 0 103 115 42 0.996 8.81 3.04 Intr + 114810 114948 139 2 1 94 105 241 0.999 26.44 3.05 Intr + 141203 141283 81 0 0 66 99 31 0.490 1.61 3.06 Intr + 144601 144686 86 2 2 46 84 90 0.949 3.94 3.07 Intr + 145764 146228 465 2 0 89 90 356 0.990 29.22 3.08 Intr + 150808 150896 89 0 2 113 65 40 0.993 2.97 3.09 Intr + 151392 151494 103 0 1 79 107 48 0.991 5.98 3.10 Intr + 152223 152354 132 2 0 59 70 69 0.888 3.04 3.11 Intr + 157714 157932 219 0 0 37 55 163 0.845 6.40 3.12 Intr + 158870 159205 336 1 0 43 89 480 0.910 39.32 3.13 Intr + 160439 160659 221 1 2 120 82 363 0.999 36.00 3.14 Intr + 161443 161645 203 1 2 25 68 166 0.995 7.23 3.15 Intr + 163097 163182 86 0 2 69 38 68 0.907 -0.56 3.16 Intr + 163460 163593 134 1 2 42 92 99 0.599 5.14 3.17 Intr + 164212 164329 118 1 1 99 100 115 0.990 14.27 3.18 Intr + 164412 164627 216 2 0 71 18 288 0.226 18.80 3.19 Intr + 167580 167834 255 2 0 77 64 62 0.196 0.34 3.20 Intr + 168503 168543 41 1 2 124 73 38 0.189 2.92 3.21 Term + 173045 173252 208 2 1 27 39 154 0.147 1.21 3.22 PlyA + 173296 173301 6 1.05 4.25 PlyA - 178064 178059 6 1.05 4.24 Term - 188634 188435 200 1 2 -22 37 168 0.508 -1.74 4.23 Intr - 189367 189143 225 2 0 112 71 80 0.601 6.66 4.22 Intr - 190408 190290 119 0 2 76 70 25 0.166 -0.39 4.21 Intr - 192544 192359 186 0 0 110 45 316 0.298 28.30 4.20 Intr - 194427 194277 151 1 1 107 102 103 0.999 12.82 4.19 Intr - 195828 195604 225 1 0 105 83 101 0.959 9.26 4.18 Intr - 200159 199963 197 1 2 108 116 41 0.989 7.96 4.17 Intr - 201171 201054 118 1 1 124 51 94 0.566 8.82 4.16 Intr - 204680 204538 143 1 2 -2 115 69 0.917 0.60 4.15 Intr - 206103 205950 154 1 1 87 85 56 0.659 4.33 4.14 Intr - 206834 206723 112 2 1 66 48 70 0.729 0.75 4.13 Intr - 213431 213312 120 2 0 46 106 52 0.913 3.49 4.12 Intr - 214641 214477 165 0 0 73 91 84 0.547 7.36 4.11 Intr - 219669 219564 106 2 1 57 81 44 0.512 0.82 4.10 Intr - 222031 221738 294 0 0 29 86 234 0.587 13.42 4.09 Intr - 223819 223687 133 2 1 84 74 40 0.625 1.90 4.08 Intr - 224182 224072 111 2 0 79 91 34 0.677 3.15 4.07 Intr - 226746 226665 82 0 1 75 115 66 0.944 7.21 4.06 Intr - 254415 254321 95 1 2 60 116 30 0.906 2.68 4.05 Intr - 254894 254768 127 2 1 77 95 41 0.980 3.95 4.04 Intr - 255363 255295 69 0 0 120 86 -20 0.562 0.38 4.03 Intr - 258599 258523 77 0 2 77 94 140 0.995 12.73 4.02 Intr - 263762 263663 100 2 1 77 89 21 0.548 0.78 4.01 Init - 264401 264399 3 2 0 108 81 0 0.399 1.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 247052 247208 157 1 1 80 53 130 0.805 8.85 S.002 Term - 253428 253366 63 0 0 72 49 42 0.834 -3.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:110181790_110446467|GENSCAN_predicted_peptide_1|170_aa QTMEEKKGISGYSYTQEELERVSALKSEVDEMKGRTLDDMSEMVKKLYSLVSEKKSALAS VIKELRQLRQKYQELTQECDEKKSQYDSCAAGLESNRSKLEQKLREKQKVIRESHGPNMK QAKMWRDLEQLMECKKQCFLKQQSQTSIGQRRKKNFLQNNCQVGKESLSP >gi568815586f:110181790_110446467|GENSCAN_predicted_CDS_1|513_bp caaactatggaggagaaaaagggtatatctggatatagttacacccaagaagagctagaa agagtatctgcactgaagagtgaagttgatgaaatgaaaggacgaacattggatgatatg tctgaaatggtgaaaaaactgtattcattggtatctgaaaagaagtcagctcttgcctca gttataaaagagctacgacagttgcgtcaaaaatatcaagaactgacccaggagtgtgat gaaaagaaatcccagtatgatagctgtgcagcaggcctcgaaagcaatcggtccaaatta gaacagaaacttcgggaaaaacaaaaagttatacgagaaagtcatggtccaaatatgaaa caagcaaaaatgtggcgtgatttggaacaattaatggaatgtaagaaacagtgctttctg aaacaacaaagccaaacttccattggtcagaggaggaaaaagaactttttacagaataat tgccaagtcggcaaggagagcctcagcccttga >gi568815586f:110181790_110446467|GENSCAN_predicted_peptide_2|109_aa MRVALQKSLCVKGQWRSPTVCGAAGVHAEPWALIELVLKVQQLFGISHFNPLADILWVVQ EDQVKELDFHRGPAICRGPPINTASAEGWEIREATVGEEAEAIAEKARA >gi568815586f:110181790_110446467|GENSCAN_predicted_CDS_2|330_bp atgagggtggctctgcagaagtctctctgcgtgaaagggcagtggagatcgcccacagtg tgtggagctgctggcgttcatgccgagccgtgggccctcatagagctggtcctgaaggtt cagcagctgttcggcataagccacttcaatccgctggcagacatactctgggtggtgcag gaggatcaggtgaaagaactggattttcacaggggccctgccatctgcagagggcctcca atcaacacggcatctgcagagggctgggagatcagagaagccacagttggagaagaagca gaggcgattgcagaaaaggccagggcatag >gi568815586f:110181790_110446467|GENSCAN_predicted_peptide_3|1145_aa MENAHTKTVEEVLGHFGVNESTGLSLEQVKKLKERWGSNGKTLLELVIEQFEDLLVRILL LAACISFVLAWFEEGEETITAFVEPFVILLILVANAIVGVWQERNAENAIEALKEYEPEM GKVYRQDRKSVQRIKAKDIVPGDIVEIAVGDKVPADIRLTSIKSTTLRVDQSILTGESVS VIKHTDPVPDPRAVNQDKKNMLFSGTNIAAGKAMGVVVATGVNTEIGKIRDEMVATEQER TPLQQKLDEFGEQLSKVISLICIAVWIINIGHFNDPVHGGSWIRGAIYYFKIAVALAVAA IPEGLPAVITTCLALGTRRMAKKNAIVRSLPSVETLGCTSVICSDKTGTLTTNQMSVCRM FILDRVEGDTCSLNEFTITGSTYAPIGEVHKDDKPVNCHQYDGLVELATICALCNDSALD YNEAKGVYEKVGEATETALTCLVEKMNVFDTELKGLSKIERANACNSGAPEGVIDRCTHI RVGSTKVPMTSGVKQKIMSVIREWGSGSDTLRCLALATHDNPLRREEMHLEDSANFIKYE TNLTFVGCVGMLDPPRIEVASSVKLCRQAGIRVIMITGDNKGTAVAICRRIGIFGQDEDV TSKAFTGREFDELNPSAQRDACLNARCFARVEPSHKSKIVEFLQSFDEITAMTGDGVNDA PALKKAEIGIAMGSGTAVAKTASEMVLADDNFSTIVAAVEEGRAIYNNMKQFIRYLISSN VGEVVCIFLTAALGFPEALIPVQLLWVNLVTDGLPATALGFNPPDLDIMNKPPRNPKEPL ISGWLFFRYLAIGCYVGAATVGAAAWWFIAADGGPRVSFYQLSHFLQCKEDNPDFEGVDC AIFESPYPMTMALSVLVTIEMCNALNSLSENQSLLRMPPWENIWLVGSICLSMSLHFLIL YVEPLPLIFQITPLNVTQWLMVLKISLPVILMDETLKFVARNYLEPGKECVQPATKSCSF SACTDGISWPFVLLIMPLVAEGPAISVACCHPVPPLASLSFAQKTNNHTYPNWDTTLQNA DDPFWRKLSLELSELPGKQGIWPTSLTTAAPTSPRTGASALTEQYWSNRFLNHFAEIKKG LLGEMADSRDGTENVQYVPEEYQSTRKEGLKNSNQRCGYVKGTQEPTEKATTAEECEQQN KAVLD >gi568815586f:110181790_110446467|GENSCAN_predicted_CDS_3|3438_bp atggagaacgcgcacaccaagacggtggaggaggtgctgggccacttcggcgtcaacgag agtacggggctgagcctggaacaggtcaagaagcttaaggagagatggggctccaacgga aaaaccttgctggaacttgtgattgagcagtttgaagacttgctagttaggattttatta ctggcagcatgtatatcttttgttttggcttggtttgaagaaggtgaagaaacaattaca gcctttgtagaaccttttgtaattttactcatattagtagccaatgcaattgtgggtgta tggcaggaaagaaatgctgaaaatgccatcgaagcccttaaggaatatgagcctgaaatg ggcaaagtgtatcgacaggacagaaagagtgtgcagcggattaaagctaaagacatagtt cctggtgatattgtagaaattgctgttggtgacaaagttcctgctgatataaggttaact tccatcaaatctaccacactaagagttgaccagtcaattctcacaggtgaatctgtctct gtcatcaagcacactgatcccgtccctgacccacgagctgtcaaccaagataaaaagaac atgctgttttctggtacaaacattgctgctgggaaagctatgggagtggtggtagcaact ggagttaacaccgaaattggcaagatccgggatgaaatggtggcaacagaacaggagaga acaccccttcagcaaaaactagatgaatttggggaacagctttccaaagtcatctccctt atttgcattgcagtctggatcataaatattgggcacttcaatgacccggttcatggaggg tcctggatcagaggtgctatttactactttaaaattgcagtggccctggctgtagcagcc attcctgaaggtctgcctgcagtcatcaccacctgcctggctcttggaactcgcagaatg gcaaagaaaaatgccattgttcgaagcctcccgtctgtggaaacccttggttgtacttct gttatctgctcagacaagactggtacacttacaacaaaccagatgtcagtctgcaggatg ttcattctggacagagtggaaggtgatacttgttcccttaatgagtttaccataactgga tcaacttatgcacctattggagaagtgcataaagatgataaaccagtgaattgtcaccag tatgatggtctggtagaattagcaacaatttgtgctctttgtaatgactctgctttggat tacaatgaggcaaagggtgtgtatgaaaaagttggagaagctacagagactgctctcact tgcctagtagagaagatgaatgtatttgataccgaattgaagggtctttctaaaatagaa cgtgcaaatgcctgcaactcaggtgctcctgaaggtgtcattgacaggtgcacccacatt cgagttggaagtactaaggttcctatgacctctggagtcaaacagaagatcatgtctgtc attcgagagtggggtagtggcagcgacacactgcgatgcctggccctggccactcatgac aacccactgagaagagaagaaatgcaccttgaggactctgccaactttattaaatatgag accaatctgaccttcgttggctgcgtgggcatgctggatcctccgagaatcgaggtggcc tcctccgtgaagctgtgccggcaagcaggcatccgggtcatcatgatcactggggacaac aagggcactgctgtggccatctgtcgccgcatcggcatcttcgggcaggatgaggacgtg acgtcaaaagctttcacaggccgggagtttgatgaactcaacccctccgcccagcgagac gcctgcctgaacgcccgctgttttgctcgagttgaaccctcccacaagtctaaaatcgta gaatttcttcagtcttttgatgagattacagctatgactggcgatggcgtgaacgatgct cctgctctgaagaaagccgagattggcattgctatgggctctggcactgcggtggctaaa accgcctctgagatggtcctggcggatgacaacttctccaccattgtggctgccgttgag gaggggcgggcaatctacaacaacatgaaacagttcatccgctacctcatctcgtccaac gtcggggaagttgtctgtattttcctgacagcagcccttggatttcccgaggctttgatt cctgttcagctgctctgggtcaatctggtgacagatggcctgcctgccactgcactgggg ttcaaccctcctgatctggacatcatgaataaacctccccggaacccaaaggaaccattg atcagcgggtggctctttttccgttacttggctattggctgttacgtcggcgctgctacc gtgggtgctgctgcatggtggttcattgctgctgacggtggtccaagagtgtccttctac cagctgagtcatttcctacagtgtaaagaggacaacccggactttgaaggcgtggattgt gcaatctttgaatccccatacccgatgacaatggcgctctctgttctagtaactatagaa atgtgtaacgccctcaacagcttgtccgaaaaccagtccttgctgaggatgcccccctgg gagaacatctggctcgtgggctccatctgcctgtccatgtcactccacttcctgatcctc tatgtcgaacccttgccactcatcttccagatcacaccgctgaacgtgacccagtggctg atggtgctgaaaatctccttgcccgtgattctcatggatgagacgctcaagtttgtggcc cgcaactacctggaacctggtaaagagtgtgtgcagcctgccaccaaatcctgctcgttc tcggcatgcaccgatgggatttcctggccgtttgtgctgctcataatgcccctggtggct gaaggcccagccatcagtgtcgcttgttgccaccccgtgcctcccttggcctctctgagc tttgcccagaagaccaacaatcatacataccctaactgggacaccactctgcagaatgca gatgatccattctggaggaagctgtcccttgagctcagtgagctcccaggcaagcagggc atctggccgacttccctcacaacagctgctcccacatcccctcggactggagcttcagcc ctgactgagcaatactggagtaaccgcttcctaaaccattttgcagaaattaaaaaaggg ctccttggagaaatggctgattctagagatgggacagaaaacgtacagtacgtgcctgaa gaataccaaagcaccagaaaggaagggctcaaaaacagcaatcaaagatgtggttatgtg aaaggaacacaggagccaactgaaaaagccacaacagccgaagaatgtgagcaacaaaat aaagcagtattggactag >gi568815586f:110181790_110446467|GENSCAN_predicted_peptide_4|1103_aa MAYHSSLMDPDTKLIGNMALLPIRSQFKGPAPRETKDTDIVDEAIYYFKANVFFKNYEIK NEADRTLIYITLYISECLKKLQKCNSKSQGEKEMYTLGITNFPIPGEPGFPLNAIYAKPA NKQEDEVMRAYLQQLRQETGLRLCEKVFDPQNDKPSKWLKTDTARSPRKPTGPSQTLWVT LTVEGKVEWTNGLLKTHLTKLSLQLKKDSVKDRAPKLTNQAGSHTPPPIPLEAALKNIAH YLSIPPPKIFAAATLYDYFVLFFLLTTTTGAGTRVSFYSGTSVVTSCLGVVPRVDPMDPG DAAILESSLRILYRLFESVLPPLPAALQSRMNVIDHVRDMAAAGLHSNVRLLSSLLLTMS NNNPTLGYSHFSTLRITHQLKAQWLCAISTHMLIECVEEKYQLLVYHADSLFHDKEYRNA VSKYTMALQQKKALSKTSKVRPSTGNSASTPQSQCLPSEIEVKYKMAECYTMLKQDKDAI AILDGIPSRQRTPKINMMLANLYKKAGQERPSVTSYKEVLRQCPLALDAILGLLSLSVKG AEVASMTMNVIQTVPNLDWLSVWIKAYAFVHTGDNSRAISTICSLEKKSLLRDNVDLLGS LADLYFRAGDNKNSVLKFEQAQMLDPYLIKGMDVYGYLLAREGRLEDVENLGCRLFNISD QHAEPWVVSGCHSFYSKRYSRALYLGAKAIQLNSNSVQALLLKGAALRNMGRVQEAIIHF REAIRLAPCRLDCYEGLIECYLASNSIREAMVMANNVYKTLGANAQTLTLLATVCLEDPV TQEKAKTLLDKALTQRPDYIKAVVKKAELLSREQKYEDGIALLRNALANQSDCVLHRILG DFLVAVNEYQEAMDQYSIALSLDPNDQKSLEGMQKMEKEESPTDATQEEDVDDMEGSGEE GDLEGSDSEAAQWADQEQWFGMHPAESPCALQAPAAQHLLGRILILCSSSPRVTARLEEA AEGVPWKSIEKPARGELEVPNSCRPDPRGPLGVNSAAVRGWEEPASKGLTCGFLRSQPYT GLPVAGQGLRGAASQSLGGTTTVENSLAIQKFKQYYYPRIQCSTSGCTPEEVKAGTPKDC TSMSTQHCPQQQKVETTHVSLDE >gi568815586f:110181790_110446467|GENSCAN_predicted_CDS_4|3312_bp atggcttaccactcttctctcatggatcctgataccaaactcatcggaaacatggcactg ttgcctatcagaagtcaattcaaaggacctgcccccagagagacaaaagatacagatatt gtggatgaagccatctattacttcaaggccaatgtcttcttcaaaaactatgaaattaag aatgaagctgataggaccttgatatatataactctctacatttctgaatgtctgaagaaa ctgcaaaagtgcaattccaaaagccaaggtgagaaagaaatgtatacgctgggaatcact aattttcccattcctggagagcctggttttccacttaacgcaatttatgccaaacctgca aacaaacaggaagatgaagtgatgagagcctatttacaacagctaaggcaagagactgga ctgagactttgtgagaaagttttcgaccctcagaatgataaacccagcaagtggctgaag actgacactgcccgatcacctcggaagcctacaggaccatcacagacgctttgggtaact cttacagtggaaggaaaggtagaatggactaatggtcttttaaaaacacacctcaccaag ctcagcctccaacttaaaaaggactctgtcaaggatagagccccaaaactcaccaaccaa gcaggttcccacacaccacccccaatcccacttgaagcagccctgaaaaacatcgcccat tatctctccataccacccccaaaaattttcgctgccgcaacactttacgactatttcgtt ttattcttcttgttaacgaccacgacaggagcagggactcgagtttctttttattcgggc acttcagtagttacctcttgcttgggggtggtgccgcgtgtggacccgatggaccccggc gacgccgccattttggagtcttccctaaggatcctctaccggcttttcgagtcagtgctg ccgccgctgcccgcggctttgcagagcaggatgaatgtgatagaccacgtgcgggacatg gcggccgcggggctgcactccaacgtgcggctcctcagcagcttgttacttacaatgagt aataacaacccgactctcgggtactcccacttttctacccttcgcatcacccaccaattg aaagcgcagtggctgtgcgcaataagcactcacatgcttattgagtgtgttgaggaaaag taccagcttttggtgtatcatgcagattctctctttcatgataaggaatatcggaatgct gtgagtaagtataccatggctttacagcagaagaaagcgctaagtaaaacttcaaaagtg agaccttcaactggaaattctgcatctactccacaaagtcagtgtcttccatctgaaatt gaagtgaaatacaaaatggctgaatgttatacaatgctaaaacaagataaagatgccatt gctatacttgatgggatcccttcaagacaaagaactcccaaaataaacatgatgctggca aacctgtacaagaaggctggtcaggagcgcccttcagtcaccagctataaggaggtgctg aggcagtgcccattagcccttgatgccattctaggcttgttgtccctttctgtaaaaggg gcagaggtggcatccatgacaatgaatgtgatccaaaccgtgcctaacttggactggctc tctgtgtggatcaaagcgtatgcttttgtgcacactggtgacaactcaagagcaatcagt accatctgttcactagagaaaaaatccttattgcgagataacgtggacctattgggaagc ttggcagatctgtacttcagagctggagacaataaaaactctgtcctcaagtttgaacag gcacagatgttggatccttatctgataaaaggaatggatgtatatggctacctactggca cgagaagggcggctagaggatgttgagaaccttggatgccgccttttcaatatctctgat cagcatgcagaaccgtgggtggtttctggctgtcacagcttctatagcaaacgctactcc cgggccctctatttaggagccaaggccattcagctgaacagtaatagtgttcaagctctg ctacttaagggagcagcacttaggaacatgggcagagtccaagaagcaataatccacttt cgggaggccatacggctcgcaccttgtcgcttagattgttatgaaggtcttatcgaatgt tacttagcctccaacagtattcgagaagcaatggtaatggctaacaacgtttacaaaact ctgggagcaaatgcacagacccttacccttttagccaccgtttgtcttgaagacccagtg acacaggagaaagccaaaacattattagataaagccctgacccaaaggccagattacatt aaggctgtggtgaaaaaagcagaactacttagcagagaacagaaatatgaagatggaatt gctttgctgaggaacgcactggctaatcagagtgactgtgtcctgcatcggatcctagga gatttccttgtagctgtcaatgagtatcaggaggcaatggaccagtatagtatagcacta agtttggaccccaatgaccagaagtctctagaggggatgcagaagatggagaaggaggag agtcccacggatgccactcaggaggaggatgtggacgacatggaagggagtggggaagaa ggggacctggagggcagcgacagtgaggcggcccagtgggctgaccaggagcagtggttc ggcatgcaccctgctgagagcccctgtgctctccaggctcctgcagcccagcacctgcta ggacggatcctcattctctgcagctccagccccagggtgacagccaggttggaagaggcg gctgaaggtgttccctggaaaagtattgagaaacctgcgagaggggagctcgaagttccc aacagctgcaggcctgaccccagagggcccttgggagttaactctgcagctgtccgtggg tgggaggaaccggcctccaaaggcctcacctgtggcttcctgcgttcccagccctacact ggcttacctgtggcaggccagggcctgcgcggagctgcttcccagagcttaggtggcaca accactgtggagaacagtctggcgattcagaaattcaaacagtactactaccctaggatc cagtgttccacctccgggtgtacacccgaagaagtgaaagctggaacgccaaaagattgc acgtccatgtccacgcagcactgtccacagcagcaaaaggtggaaacaacccacgtgtcc ttagatgagtag