GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:49:54 Sequence gi568815584r:54378091_54583768 : 205678 bp : 41.14% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6277 6310 34 0 1 89 106 51 0.924 7.08 1.02 Term + 7788 8089 302 1 2 40 38 253 0.549 9.90 1.03 PlyA + 9812 9817 6 1.05 2.08 PlyA - 10940 10935 6 1.05 2.07 Term - 15036 14896 141 0 0 63 53 177 0.982 8.65 2.06 Intr - 15817 15543 275 2 2 47 68 245 0.971 14.73 2.05 Intr - 16286 16158 129 0 0 23 42 115 0.244 0.15 2.04 Intr - 16811 16485 327 0 0 47 73 136 0.233 2.75 2.03 Intr - 19111 18919 193 1 1 59 6 252 0.099 12.24 2.02 Intr - 20419 20319 101 2 2 76 76 108 0.011 7.31 2.01 Init - 24536 24362 175 2 1 71 20 122 0.022 3.26 2.00 Prom - 24828 24789 40 -5.95 3.22 PlyA - 25209 25204 6 1.05 3.21 Term - 29171 28539 633 2 0 10 41 458 0.928 26.60 3.20 Intr - 29382 29244 139 2 1 89 39 115 0.175 6.25 3.19 Intr - 51341 51119 223 0 1 115 75 124 0.136 10.06 3.18 Intr - 52314 52235 80 2 2 89 -24 74 0.161 -5.42 3.17 Intr - 54130 54018 113 1 2 108 75 -2 0.252 -1.24 3.16 Intr - 58347 58279 69 0 0 48 105 84 0.529 4.56 3.15 Intr - 63379 63157 223 0 1 9 94 323 0.079 22.21 3.14 Intr - 65378 65177 202 0 1 42 97 57 0.004 -0.58 3.13 Intr - 66423 66301 123 1 0 89 89 7 0.007 0.54 3.12 Intr - 68190 68125 66 1 0 84 54 68 0.118 0.96 3.11 Intr - 69327 69202 126 1 0 76 47 129 0.125 7.33 3.10 Intr - 72117 72024 94 0 1 98 11 95 0.121 1.42 3.09 Intr - 93005 92879 127 1 1 32 72 173 0.079 9.86 3.08 Intr - 95074 94927 148 2 1 109 5 80 0.068 0.17 3.07 Intr - 100069 100024 46 1 1 85 46 69 0.088 -0.54 3.06 Intr - 103368 103276 93 0 0 68 29 117 0.004 3.04 3.05 Intr - 104112 104063 50 1 2 60 68 101 0.992 2.78 3.04 Intr - 105677 105581 97 2 1 99 71 156 0.997 13.66 3.03 Intr - 110318 110233 86 0 2 98 25 70 0.053 0.32 3.02 Intr - 127237 127102 136 1 1 111 80 96 0.104 10.32 3.01 Init - 128794 128735 60 1 0 70 34 53 0.036 -0.60 3.00 Prom - 130601 130562 40 -6.25 4.06 PlyA - 131166 131161 6 1.05 4.05 Term - 132120 131810 311 1 2 68 52 350 0.632 23.64 4.04 Intr - 132239 132150 90 0 0 60 61 168 0.932 10.35 4.03 Intr - 134174 134061 114 0 0 81 22 81 0.296 0.30 4.02 Intr - 139846 139767 80 0 2 54 89 53 0.255 0.28 4.01 Init - 142495 142374 122 1 2 69 27 136 0.308 5.31 4.00 Prom - 144806 144767 40 -6.35 5.00 Prom + 150574 150613 40 -6.65 5.01 Init + 151038 151133 96 2 0 87 2 91 0.353 0.76 5.02 Intr + 152784 152960 177 2 0 46 87 102 0.515 5.09 5.03 Term + 159973 160293 321 0 0 83 49 113 0.350 0.74 5.04 PlyA + 161173 161178 6 1.05 6.07 PlyA - 161212 161207 6 1.05 6.06 Term - 188583 188264 320 1 2 42 46 202 0.744 5.46 6.05 Intr - 188934 188855 80 2 2 132 52 100 0.196 9.08 6.04 Intr - 189499 189192 308 1 2 76 80 156 0.280 7.92 6.03 Intr - 189759 189686 74 1 2 53 84 162 0.658 10.51 6.02 Intr - 190114 189826 289 1 1 21 25 321 0.369 15.00 6.01 Intr - 190314 190211 104 1 2 97 107 113 0.840 13.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 25244 24273 972 2 0 60 38 216 0.811 10.28 S.002 Init - 29360 29244 117 2 0 72 39 95 0.814 3.25 S.003 Term + 42456 42626 171 2 0 91 49 192 0.995 12.44 S.004 Intr - 103368 103263 106 0 1 68 33 136 0.985 4.97 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:54378091_54583768|GENSCAN_predicted_peptide_1|111_aa MGSDSSEESEENIRKHQRRWLDHGCYENITDAGWTIWVTYRYSQKVYSLMGPPEIELLSS VFLIHLRTVFLTCNYCQKLHPGGSTSRQLEWKRSVTPPPPRQTRLSSPSGS >gi568815584r:54378091_54583768|GENSCAN_predicted_CDS_1|336_bp atgggtagtgattcctctgaagaatctgaggaaaacatacgaaaacatcaaagacgctgg ctggaccacggatgctatgaaaacatcacggatgctggctggaccatctgggtcacatac agatactcacagaaggtttactctctgatgggaccacctgaaattgagctactcagttct gtgtttctaatacacctgcggactgtgttcctaacttgcaattactgccaaaagttacac cctgggggaagcacatcacggcaactggaatggaagagatcagtgacccctccaccgccc cgacagaccagactatcaagcccctctgggtcttaa >gi568815584r:54378091_54583768|GENSCAN_predicted_peptide_2|446_aa MGKDFMTKTPKAIATKAKIDKWDLIKLKSFCTAKETIIRVNRQPTEWEKIFAVYASDKEL DKCIHPVTGGHGQSGAPVCQRVDQKAPSAGRQNGYGASRTATRLGLPSGPGSGPRDRLRP CRLAPPNPSTSDSPASSLAAPRRDQCRLPLQRRRLTGDNVLAALTRSQCLLGLGVHSSHA SGALRPAAALWEPLCVLAETRAGSLCLRGSVEREGGAGGNRGCAQPLRASASSGWARAGW APHSYGLNWWVFRLTDFKNEATDPRMKLQTFMVSVTVLQLVKAVRTQRVSSNNIYGKEQK TKFSPVWKDSEAQLASPSGSRTGAADGAACQSAAVHPHSSAFGWSMGLGALEQEAALIGE TRAAQEPTEAVGGSGMAGCRSRALPAGRQLMPGEKSSAAPGRARDLQLAMPEPPPPTLRG LLGGPSLPDGRRSLLQGARSHPPPKG >gi568815584r:54378091_54583768|GENSCAN_predicted_CDS_2|1341_bp atgggcaaagacttcatgactaaaacaccaaaggcgattgcaacaaaagccaaaattgac aaatgggatttaattaaactaaagagcttctgcacagcaaaagaaactatcatcagagtg aataggcaacctacagaatgggagaaaatttttgccgtctatgcatctgacaaagaatta gacaaatgtatccacccagtcacaggtggccatggacagtctggggctcctgtgtgccag agagtggaccagaaagcaccttctgcaggcaggcagaacggttacggcgccagtcgcact gctaccaggctagggctgccctccggcccaggatcgggtccccgggaccgcttgcgtccc tgccgcctcgctcctccaaaccccagcacgtccgactcaccggcttcatcgctggccgcc ccacgtcgagaccagtgccgcctccctctgcagcgccggcgactcaccggtgacaacgtg ctagcagccctcactcgctctcagtgcctcctcggccttggcgtccactctagccacgct tcgggagcccttcggcccgccgctgcactgtgggagcccctctgtgtgctggcagagacc agagctggctccctctgcttgcggggcagtgtggagagagagggaggcgcgggcgggaac cgaggctgcgcgcagcccttgagggccagcgcgagttctgggtgggcgcgagctgggtgg gctccgcactcgtatggcctgaattggtgggtttttcgtctcactgacttcaagaatgaa gccacggaccctcgcatgaagctgcagaccttcatggtgagtgttacagtgttacagctc gtaaaagcagtgcggacccaaagagtgagcagtaacaatatttacggcaaagagcaaaag accaaattttccccagtgtggaaggactcagaagcccagctggcttcacccagcggatcc cgcaccggggctgcagatggagctgcctgccagtccgccgccgtgcacccgcactcctca gcttttgggtggtcgatgggactgggcgccctggagcaggaggcggcgctcatcggggag actcgggccgcacaggagcccacggaggctgtgggaggctcaggcatggcgggctgcagg tcccgagctctgcccgcaggaaggcagctaatgcccggcgagaaatcgagtgcagcgccg ggcagggctcgggacctgcagcttgccatgcccgagcccccgccccctactctccgtggg ctcctgggcggcccgagcctccccgatgggcgccgctccctgctccagggagcccggtcc catccaccgcccaagggctga >gi568815584r:54378091_54583768|GENSCAN_predicted_peptide_3|977_aa MPNQATSATECHCSSTMVKVLFMAFSRNKAEARTGGYDAAPALALSPLHTPADHKQKTFL PSYLAASYRGFLLIFAFLWQLETFRPKALALGLKSESLVVCDVAEDLVEKLRKFRFRKET NNAAIIMKIDKDKRLVVLDEELEGISPDELKDELPERQPRYPFLSALKDLFSLVVFEIRN TEDLTEEWLRGITELVEPDYQLITGSTLWCFRTQGLQNISSTDLMFYNSDFIPRNNLGRR HLSGVVEEDPPSVWVSTIQSAGALERTKTEKANMPVYPVELVRTQCFSPSGDAAIRRHLG RRQQPSPDTNPAGTPSAKGRMDDTATNIEKLHPAGQPEALYLHAAVTQGFGKVHRLQAAD LCLAVQIGNMASGGSDESNFTKNVIGFYYFKKEIEMRLCLELVGSWSDFKNEATDPRGGA ACQSRAVHPHSSALGWSMGLGAMEQGVALAGEAWAAQEPMEGVGGSGMAGCRSRALPRGK AAKARRAERAGAAAVAVAVAAVAGGSEWFYPEGPAPPFSAGNGAAPRSSSPAMAFTFAAF CYMLALLLTAALIFFAIWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLC AAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTIMNADILAYCQGVSASALLTLLG CVGISVGSCPVYHGTFSSIPGLYPLVAMSTLLPPHDNQKCLWTLPDVLRGKIALGENHCC RREHLAEGMAVGTASADLNISACWLRREQQISQHSARALLRDRLPPQENSSWHPAGAPLG RNFQRKEQAAIFAVLQPPLVIPRKTGSGVDLEQTPADLQKRGLTVTRKTNKQKAIASTST ERTPTQKPPFKSHQHQRPNVDKSAKMRKNQSKKAENSKNQNTSSPPKDHNSSPAREQNWT ENEFDELTEVGFRRWVITNSSKLKEHVLTQCKETKKLDKRLEELLTIITSLEKNINDLME LKNTAGELRDAYTSINS >gi568815584r:54378091_54583768|GENSCAN_predicted_CDS_3|2934_bp atgcccaatcaggctacctctgccactgaatgccattgctctagcaccatggtgaaggtg ctttttatggccttttcacggaacaaggcggaagccagaacgggtggatatgatgctgca cctgctctagccctttctcctctccacactccagcagatcacaagcagaagactttccta ccctcctacttggcagcttcctacagaggatttctccttatctttgctttcctatggcag ttggaaacctttcgacccaaagctttggcactgggccttaaaagtgagtctttggttgtt tgtgatgttgccgaagatttagtggaaaagctgagaaagtttcgttttcgcaaagaaacg aacaacgctgctattataatgaagattgacaaggataaacgcctggtggtactggatgag gagcttgagggcatttcaccagatgaacttaaagatgaactacctgaacgacaacctcga tatccttttcttagtgctttaaaagatttattttctttagtggtatttgaaataagaaat accgaagacctaactgaagaatggttacgtgggatcacagaactggttgagccagattac cagctcatcacaggctcgaccctgtggtgcttcagaacacagggtctgcagaatatctca agcactgatcttatgttttacaacagtgattttatccccaggaacaatttggggagacgg catctgagtggagtagtggaggaagatccaccctcggtgtgggtaagcaccatccaatct gctggggccctggaaagaacaaaaacagaaaaggcaaatatgccagtctacccggtggag ctggtgagaacacagtgtttctccccttcaggggatgcagcaataaggcgccatcttgga aggagacagcagccctcaccagacaccaatcctgctggcactcccagtgcaaagggcaga atggatgacacagccaccaatattgagaaactacatcctgctgggcagcctgaagctctc tacttgcatgctgccgtaacccaaggttttggaaaggtacataggctccaagcagctgac ttgtgtctagctgtccagattgggaacatggctagtgggggcagtgatgaaagtaacttt acaaaaaatgttattggattttattattttaaaaaagaaatagagatgagattgtgcctg gaattggtgggttcttggtctgacttcaagaatgaagccacggaccctcgtggtggagct gcctgccagtcccgcgccgtgcacccgcactcctcagcccttgggtggtcgatgggactg ggcgccatggagcagggggtggcgctcgctggggaggcttgggctgcacaggagcccatg gagggggtgggaggctcaggcatggcgggctgcaggtcccgagccctgccccgcgggaag gcagctaaggcccggcgcgctgagagggctggcgccgcggcggtagcggtggcggtcgcg gctgtggccgggggaagtgaatggttttacccagagggccctgcgccgcctttctccgct ggcaacggcgccgctccccgctcctcctccccagccatggcgttcacgttcgcggccttc tgctacatgctggcgctgctgctcactgccgcgctcatcttcttcgccatttggcacatt atagcatttgatgagctgaagactgattacaagaatcctatagaccagtgtaataccctg aatccccttgtactcccagagtacctcatccacgctttcttctgtgtcatgtttctttgt gcagcagagtggcttacactgggtctcaatatgcccctcttggcatatcatatttggagg tatatgagtagaccagtgatgagtggcccaggactctatgaccctacaaccatcatgaat gcagatattctagcatattgtcagggtgtctcagcttcggcactgttgacgcttctgggc tgtgtgggaatttctgtgggaagctgtcctgtatatcatgggacgttcagcagcatccct ggcctttacccactagttgccatgagcaccctcctgccccctcatgacaaccaaaaatgt ctctggacattgccagatgtcctgaggggcaaaattgccctgggtgagaaccactgctgt aggagagagcacctggcggaagggatggctgtgggcacagcttcagcagacttaaacatt tctgcctgctggctccgaagagagcaacagatctcccagcacagtgctcgagctttgcta agggacagactgcctcctcaagagaactccagctggcatccggcaggtgcccctctggga cgaaacttccagaggaaggagcaggcagcaatctttgctgttctgcagcctccactggtg atacccaggaaaacagggtctggagtggacctcgagcaaactccagcagacctgcagaag aggggcctgactgttacaaggaaaactaacaaacagaaagcaatagcatcaacatcaacg gaaaggacgcccacgcagaaacccccattcaaaagtcaccaacatcaaagaccaaacgta gataaatccgcgaagatgaggaaaaaccagagcaaaaaggctgaaaattccaaaaaccag aatacctcttctcctccaaaggatcacaactcctcgccagcaagggaacaaaactggaca gagaatgagtttgatgaactgacagaagtaggcttcagaaggtgggtaataacaaactcc tccaagttaaaggagcatgttctaacccaatgcaaggaaactaagaaacttgataaaagg ttagaggaactgctaactataataaccagtttagagaagaacataaatgacctgatggag ctgaaaaacacagcaggagaacttcgtgatgcatacacaagtatcaatagctga >gi568815584r:54378091_54583768|GENSCAN_predicted_peptide_4|238_aa MVASGSTGVGLTGKEHKKIDGNVQYLDRGLSCIDDVFDNTWKMRQAGSHYPQQTNARTEN QTLHILTLQADMANSGKKQIVKNSEKYQGKNHDRKHMNSRDGIMGETKLSKTPQCGTTPR CPSDTDQTVMQGSDAEKVGVLGTRGHRIPDVLRPAALRLRCPLLLVATPLWRYPPLRLSP LGHLPSQYQAGGHDEAGKDHRDVEKRRVFIKRYQKHRSHQGLALGRARLQPRSPAQPG >gi568815584r:54378091_54583768|GENSCAN_predicted_CDS_4|717_bp atggttgcttccggaagtacaggagtaggactgactgggaaagagcataagaaaattgac ggtaacgttcagtatcttgacagaggcttaagctgtatagatgatgtgtttgataacact tggaaaatgcgtcaagctggaagccattatcctcagcaaactaatgcaagaacagaaaat caaacattgcatattctcactttgcaggcagacatggcaaacagtggaaagaagcaaata gtgaaaaactcagagaagtaccaagggaaaaaccatgacagaaaacacatgaacagtaga gatggtattatgggagaaacaaagctgagtaaaacaccccaatgcggtaccaccccaaga tgccccagcgacacggaccaaacggtgatgcagggcagtgacgcagagaaagtcggagtt ctaggcacccggggacaccgaatccccgacgtcctccggcccgcggccctccggctccgg tgccccctgctgctcgtcgcgaccccactctggcggtatccgccccttcgtctctcaccc ctgggacacttaccatcccaataccaggccggtggtcacgatgaagcaggtaaagaccac cgcgatgtagaaaagcggcgagtattcataaagcgttaccagaaacaccgcagccatcag ggtcttgctctgggtagagcccggctccagccgcggagcccagcccagcccggctga >gi568815584r:54378091_54583768|GENSCAN_predicted_peptide_5|197_aa MTKTRPFLSSQCISAKEYYTLQFALLQFAPPAWQFTFTFSKSIKKDSKEEIYCQLPRDTK IEDFGTVPRSRYPLVALLTLADEDDREIYDIQLFMSANNNFTPSNNSSSEEKNTDRSLLE KVGLSESEVEPSEENSKDCVVCQNGTVNWVLLPCRHTCLCDGCVKYFQQCPMCRQFVQES FALCSQKEQDKDKPKTL >gi568815584r:54378091_54583768|GENSCAN_predicted_CDS_5|594_bp atgaccaaaaccaggccctttttatctagtcagtgtatcagtgccaaggagtactacacg ttacagtttgcgctgctacagtttgcacccccagcatggcaatttacctttacattttca aaaagtattaaaaaggatagcaaagaagaaatatattgccagttaccaagagatactaaa attgaagactttggtacagtacccagatctcgctatccattggtagcgctattgacctta gctgatgaggatgaccgggaaatttatgatattcaacttttcatgtctgcaaataataat ttcactccctccaacaattcctcttcagaagaaaaaaacacagacagaagtttgttggaa aaggtgggactctctgaaagtgaagttgagccatcggaagagaacagcaaggactgtgtt gtttgccagaatgggactgtgaactgggtactcttaccatgcagacacacatgcctgtgt gatggctgtgtgaagtattttcagcagtgcccaatgtgcaggcagtttgttcaggaatct tttgcactttgcagtcaaaaagagcaagataaagacaaaccgaagactctttga >gi568815584r:54378091_54583768|GENSCAN_predicted_peptide_6|391_aa XNCMREKSIIHATVATTRKGLTEFTVGKIPKRPAEGLASAWPEAEKRREERGLGRRDGGG GRRTLTGAVGLAFEDVQLGAVGQRVLQAELEEAGLGLAHALEQRQQRNSLLALVPALKPA RQHPDLVAKHHVWRSRSPAAGDAFLFQGGSGARTLQLWDGDFRRPGVSSPLGGNSDPSRP HSRAAITTGRRERAAVRPRLAEERALGGRGHQRVLGPGEGASRGRLGSPKSSRPLRPLRT RCGRWAIPHRVTARSLLPREAAERLQLHLQQQPASPPPGAPLDLGRRGRHSAGHTPREGT AGVRSAAPFGQRGGPAGNADRRRRHTPPNDSKESSAAGVPPVPSRFSWPPPAGRRTQRLS LQGAASFQQHAASGSQNFSGGGGSPRGGDPQ >gi568815584r:54378091_54583768|GENSCAN_predicted_CDS_6|1176_bp ngaaactgcatgagagaaaaatcaatcattcatgctaccgtagctaccacacggaaaggg ctgacggagttcacagtgggcaaaatcccaaagcggccggcggagggactggcctcggcc tggcccgaggcagagaagcggagggaggagcgggggttgggcaggcgggacggcggcggc ggccgccgcacacttaccggggctgttggcctcgcgttcgaggacgtgcagctcggcgca gtcggccagcgagtgctccaggcagagctggaggaagcgggcctgggtctggctcacgcg cttgagcagcgacagcagcgcaacagtctgctcgcactcgttccagcccttaaaccagcc cgccagcaccccgacctggtcgcgaaacatcatgtttggcggagccgcagccccgcggcc ggagacgcgttcctgttccaaggtggctctggtgcccggacgctgcagctttgggatggt gatttccggcgtcctggggtctcctccccgctgggcggcaactcggacccttcacgcccg cattcccgagcggcgatcaccactggaaggagagaaagagctgcagtgagacctcggcta gcggaggagcgcgccctggggggacgagggcatcagagggtgctggggccaggggagggg gcgtcccgcggaaggttgggttcgccgaaatccagccgccccctccgccccctccgcacc cgatgtgggagatgggctattccccaccgggtaacagcgagaagcctgcttcccagagaa gcagccgagcggctgcagctccatctacagcaacagccagcatctccaccgccaggcgcc cctctggatttaggccgccgagggcggcacagcgctgggcacacgccccgcgaggggacg gccggggtccgcagcgctgctcccttcggccagcggggcggccccgcggggaatgcggat cggcgccgcaggcacacgccccccaacgacagcaaagaaagttcggctgcgggggttccc ccagtcccatcccggttctcgtggccgccgccggcggggaggagaacccagcgactcagc cttcaaggagccgctagcttccaacaacacgctgcttccggctcccagaacttctcgggc ggagggggaagcccccggggaggggacccccagtga